LOCUS       CCD63165.2              1897 aa    PRT              CON 06-FEB-2024
DEFINITION  Caenorhabditis elegans VWFA domain-containing protein.
ACCESSION   BX284606-254
PROTEIN_ID  CCD63165.2
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
            Caenorhabditis.
REFERENCE   1  (bases 1 to 17718942)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2  (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project:
            Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome
            Institute at Washington University, St. Louis, MO 63110, USA.
REFERENCE   3  (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of integration
            and manual review of the following types of data: ab initio
            predictions by Genefinder (P. Green and L. Hillier, pers. comm.);
            alignments to published proteins and cDNAs; genome sequence
            conservation with other nematodes (e.g. to C. briggsae using WABA:
            Genome Res. 2000. 10:1115-1125); sequence features (such as
            trans-splice and polyA sites).
            Sources of data: large-scale EST projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome
            cloning project (http://worfdb.dfci.harvard.edu); RST large-scale
            sequencing project (Genome Res. 2009. 19:2334-2342); IST library
            (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010
            Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
            Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep
            sequencing data (454 read clusters - Makedonka Mitreva,
            unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66);
            Numerous data sets from the modENCODE project (Science. 2010.
            330:1775-87); Individual C. elegans Nucleotide Database
            submissions; Personal communications with C. elegans researchers;
            Non-Coding gene structures below are derived using the following
            methods and data: ab initio prediction of tRNAs by tRNAscan-SE
            (Nucl. Acids. Res., 25, 955-964); integration and appraisal of
            miRNAs from miRBase (http://www.mirbase.org); integration and
            appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell.
            2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87);
            manual curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="X"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="R193.2"
                     /locus_tag="CELE_R193.2"
                     /standard_name="R193.2"
                     /note="Confirmed by transcript evidence"
                     /db_xref="EnsemblGenomes-Gn:WBGene00020128"
                     /db_xref="EnsemblGenomes-Tr:R193.2"
                     /db_xref="GOA:Q9N5F6"
                     /db_xref="InterPro:IPR002035"
                     /db_xref="InterPro:IPR036465"
                     /db_xref="UniProtKB/TrEMBL:Q9N5F6"
                     /db_xref="WormBase:WBGene00020128"
     intron_pos      5:1 (1/36)
     intron_pos      46:1 (2/36)
     intron_pos      100:0 (3/36)
     intron_pos      124:2 (4/36)
     intron_pos      163:1 (5/36)
     intron_pos      270:2 (6/36)
     intron_pos      306:2 (7/36)
     intron_pos      332:2 (8/36)
     intron_pos      469:0 (9/36)
     intron_pos      504:0 (10/36)
     intron_pos      530:2 (11/36)
     intron_pos      560:2 (12/36)
     intron_pos      589:1 (13/36)
     intron_pos      669:0 (14/36)
     intron_pos      742:2 (15/36)
     intron_pos      829:0 (16/36)
     intron_pos      864:1 (17/36)
     intron_pos      934:0 (18/36)
     intron_pos      966:2 (19/36)
     intron_pos      994:2 (20/36)
     intron_pos      1040:0 (21/36)
     intron_pos      1067:1 (22/36)
     intron_pos      1156:0 (23/36)
     intron_pos      1217:2 (24/36)
     intron_pos      1333:2 (25/36)
     intron_pos      1357:0 (26/36)
     intron_pos      1407:2 (27/36)
     intron_pos      1448:2 (28/36)
     intron_pos      1480:1 (29/36)
     intron_pos      1548:0 (30/36)
     intron_pos      1634:0 (31/36)
     intron_pos      1682:1 (32/36)
     intron_pos      1785:2 (33/36)
     intron_pos      1815:2 (34/36)
     intron_pos      1852:0 (35/36)
     intron_pos      1885:1 (36/36)
BEGIN
        1 MGFVDFIKSL KLREWVNIGF FFVSLAMLIA GSVMLGVAIA SIPPMPPMPE DIPTTTPVPT
       61 IQPVVIFTTS LVAGISWNSD YANSNSDAFK TLATGISTNI SNAYTSPTNS FGSAPLTTQI
      121 NAFTQNPNGV NFYAELIFPD TTATPASVIN ALSSQGYLTN AFVSSTSQCS GLIPPTQPVN
      181 PSVSPSVPPS NMPPTTTAPM KLGAVCDSSS TPFRNIFLVD VSVPIIGTLD NKLAMISNYL
      241 SNAASLVNLD QNTNWNNQEF RLVVYGANEP TPMGRARNQP AWASIVANLN SSIVNPTGAD
      301 GHQLTDALRF VYNNYRPIPG VAGNIIVVGD GFDFDEAQSA SPIASALKSQ YFFSLGFILM
      361 STSQAQQGVV MQLATDYSHF YPIDTVDYLM NPSVLQTQAQ WICAAFYPTA APPTPPPTRL
      421 PLITTIGTTT NPAVRTTVNP LFPPIASCKQ NVLFLIDQSQ TLLLSGYNSA IQFAKNTATS
      481 LSTYNSQTTF AFIIYNQVVI AESNGYTDLQ TFTGAISNAG QQLGTSDVTV GFNEALNFIK
      541 TKSQYNDDSV TSLLYYITDG TDYAGLVPKV INTTLTIRSQ LQTEVIGVDL IETVQSKQNV
      601 QNATQFGVYD GYSQSIYVGV SQPNGILDAN ILNSTNHQLR CKDYSNCYTG LTFVIETSEA
      661 EGANYIDVET RAVLNVMSHY QSMISPFKMS VSLVYFSSPD NLAPNQSGQS AVLLDHATDG
      721 NSAINTLNNT NLPIGAASDL QLGFSLTANN IRNGYLQNNN LVIFFARGNY EKLSNCCPDP
      781 TADAAAVRQL ATVQGVVIGP YSSKNQLDAL TGSNSIDANA IMGSSPLNTT DSRASVAQKI
      841 SDAILPVVDN FMNNQYCPGI PQFVNPPCED PIDTLILLHA NNQNNWNNIL NFTAYQLIPD
      901 LLGATGAVSG RALTSSTPIN FAIASYYYLD VLIHADFTYL LSPADYQSLV SSITFRPTQG
      961 TATLSTAYKK AIEIFQDGRN YASKNVILIT DRMDISDMEN AFSEHDAMVQ LVGGYTSALT
     1021 INTNTIPGAD YQVNINAYQL SGYNKYSVRQ LANSLTQHTC IYLPLQPPTQ QPASPSPTLP
     1081 GPIPVVKARS VWPDITLLVD TSSSPDNAMT DVYFEKIRTF LDILLIKYSV GEKGSRFTLA
     1141 TFDGTTVEYS CIFVQTNNYY DLSSCRNEKL SLFSRSHQNY RDVASVLANV RQNVYENTTS
     1201 GYRALNENFL VLFTLGSSSS SVSQELSNLQ RKGIRTISIG LSSNMTPSAL SSFAQNAFMV
     1261 SDWNSSFTGI DTNYNLADRI YQMTTKKRTP STSNFFGNLI YVVDQSGNSY ADHSNIVQFV
     1321 SDSVTPFLVG NIKTQISIVP FSDNVIAPLQ LSSSQLAVDQ YLTSWKSSMS SSSYTANVGN
     1381 AIQYVSNMIG NQPDRPTYII YVVGSTNLTG TSYSKQLLSG QQLYVANYNQ SASTNFQELV
     1441 YSSNNVFSVS SSIHLLNRVV SMQVSTPNPV LALSNRIYSD QQSEDSITFP TSSIAADIIF
     1501 LLDETGLTDS DFSIMKSFLQ DFTSKFSVGP SSTQFALQTY NGRTIPHDGF HLFESTSNDV
     1561 VKQRIQQLTL AKATENSTDA DLAGAIEQEI FFFLTEANGW RDDVTTYTII LSHADSFYTK
     1621 DTGTAMQIKN LTSVFALGLN NQRFDYVRNF TNTGFYETVR NVSSLSINSP AVNDLLVTLN
     1681 NDYKNTIYPT PISGKDAVKA DYIFLVDSAL GEGYTQNVQN FLNSFITNVG NFTSTGNDTK
     1741 MAVVTYGRSV NTVWSLTDLQ DVTSLRIQIS NFQITTSSGI SNLRRGIDSI ILNEPEFGVD
     1801 PNRPNYIIVI TGSETISPNI DGPTSRHLNT RYSTYVIQTN FDNSTFSYTP QVLGTQLTSD
     1861 RVAHSPDLYL NGDRLGGFVD WISNEYTAWQ LTFPNKL
//