LOCUS CCD63165.2 1897 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans VWFA domain-containing protein protein. ACCESSION BX284606-254 PROTEIN_ID CCD63165.2 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="R193.2" /locus_tag="CELE_R193.2" /standard_name="R193.2" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00020128" /db_xref="EnsemblGenomes-Tr:R193.2" /db_xref="GOA:Q9N5F6" /db_xref="InterPro:IPR002035" /db_xref="InterPro:IPR036465" /db_xref="UniProtKB/TrEMBL:Q9N5F6" /db_xref="WormBase:WBGene00020128" intron_pos 5:1 (1/36) intron_pos 46:1 (2/36) intron_pos 100:0 (3/36) intron_pos 124:2 (4/36) intron_pos 163:1 (5/36) intron_pos 270:2 (6/36) intron_pos 306:2 (7/36) intron_pos 332:2 (8/36) intron_pos 469:0 (9/36) intron_pos 504:0 (10/36) intron_pos 530:2 (11/36) intron_pos 560:2 (12/36) intron_pos 589:1 (13/36) intron_pos 669:0 (14/36) intron_pos 742:2 (15/36) intron_pos 829:0 (16/36) intron_pos 864:1 (17/36) intron_pos 934:0 (18/36) intron_pos 966:2 (19/36) intron_pos 994:2 (20/36) intron_pos 1040:0 (21/36) intron_pos 1067:1 (22/36) intron_pos 1156:0 (23/36) intron_pos 1217:2 (24/36) intron_pos 1333:2 (25/36) intron_pos 1357:0 (26/36) intron_pos 1407:2 (27/36) intron_pos 1448:2 (28/36) intron_pos 1480:1 (29/36) intron_pos 1548:0 (30/36) intron_pos 1634:0 (31/36) intron_pos 1682:1 (32/36) intron_pos 1785:2 (33/36) intron_pos 1815:2 (34/36) intron_pos 1852:0 (35/36) intron_pos 1885:1 (36/36) BEGIN 1 MGFVDFIKSL KLREWVNIGF FFVSLAMLIA GSVMLGVAIA SIPPMPPMPE DIPTTTPVPT 61 IQPVVIFTTS LVAGISWNSD YANSNSDAFK TLATGISTNI SNAYTSPTNS FGSAPLTTQI 121 NAFTQNPNGV NFYAELIFPD TTATPASVIN ALSSQGYLTN AFVSSTSQCS GLIPPTQPVN 181 PSVSPSVPPS NMPPTTTAPM KLGAVCDSSS TPFRNIFLVD VSVPIIGTLD NKLAMISNYL 241 SNAASLVNLD QNTNWNNQEF RLVVYGANEP TPMGRARNQP AWASIVANLN SSIVNPTGAD 301 GHQLTDALRF VYNNYRPIPG VAGNIIVVGD GFDFDEAQSA SPIASALKSQ YFFSLGFILM 361 STSQAQQGVV MQLATDYSHF YPIDTVDYLM NPSVLQTQAQ WICAAFYPTA APPTPPPTRL 421 PLITTIGTTT NPAVRTTVNP LFPPIASCKQ NVLFLIDQSQ TLLLSGYNSA IQFAKNTATS 481 LSTYNSQTTF AFIIYNQVVI AESNGYTDLQ TFTGAISNAG QQLGTSDVTV GFNEALNFIK 541 TKSQYNDDSV TSLLYYITDG TDYAGLVPKV INTTLTIRSQ LQTEVIGVDL IETVQSKQNV 601 QNATQFGVYD GYSQSIYVGV SQPNGILDAN ILNSTNHQLR CKDYSNCYTG LTFVIETSEA 661 EGANYIDVET RAVLNVMSHY QSMISPFKMS VSLVYFSSPD NLAPNQSGQS AVLLDHATDG 721 NSAINTLNNT NLPIGAASDL QLGFSLTANN IRNGYLQNNN LVIFFARGNY EKLSNCCPDP 781 TADAAAVRQL ATVQGVVIGP YSSKNQLDAL TGSNSIDANA IMGSSPLNTT DSRASVAQKI 841 SDAILPVVDN FMNNQYCPGI PQFVNPPCED PIDTLILLHA NNQNNWNNIL NFTAYQLIPD 901 LLGATGAVSG RALTSSTPIN FAIASYYYLD VLIHADFTYL LSPADYQSLV SSITFRPTQG 961 TATLSTAYKK AIEIFQDGRN YASKNVILIT DRMDISDMEN AFSEHDAMVQ LVGGYTSALT 1021 INTNTIPGAD YQVNINAYQL SGYNKYSVRQ LANSLTQHTC IYLPLQPPTQ QPASPSPTLP 1081 GPIPVVKARS VWPDITLLVD TSSSPDNAMT DVYFEKIRTF LDILLIKYSV GEKGSRFTLA 1141 TFDGTTVEYS CIFVQTNNYY DLSSCRNEKL SLFSRSHQNY RDVASVLANV RQNVYENTTS 1201 GYRALNENFL VLFTLGSSSS SVSQELSNLQ RKGIRTISIG LSSNMTPSAL SSFAQNAFMV 1261 SDWNSSFTGI DTNYNLADRI YQMTTKKRTP STSNFFGNLI YVVDQSGNSY ADHSNIVQFV 1321 SDSVTPFLVG NIKTQISIVP FSDNVIAPLQ LSSSQLAVDQ YLTSWKSSMS SSSYTANVGN 1381 AIQYVSNMIG NQPDRPTYII YVVGSTNLTG TSYSKQLLSG QQLYVANYNQ SASTNFQELV 1441 YSSNNVFSVS SSIHLLNRVV SMQVSTPNPV LALSNRIYSD QQSEDSITFP TSSIAADIIF 1501 LLDETGLTDS DFSIMKSFLQ DFTSKFSVGP SSTQFALQTY NGRTIPHDGF HLFESTSNDV 1561 VKQRIQQLTL AKATENSTDA DLAGAIEQEI FFFLTEANGW RDDVTTYTII LSHADSFYTK 1621 DTGTAMQIKN LTSVFALGLN NQRFDYVRNF TNTGFYETVR NVSSLSINSP AVNDLLVTLN 1681 NDYKNTIYPT PISGKDAVKA DYIFLVDSAL GEGYTQNVQN FLNSFITNVG NFTSTGNDTK 1741 MAVVTYGRSV NTVWSLTDLQ DVTSLRIQIS NFQITTSSGI SNLRRGIDSI ILNEPEFGVD 1801 PNRPNYIIVI TGSETISPNI DGPTSRHLNT RYSTYVIQTN FDNSTFSYTP QVLGTQLTSD 1861 RVAHSPDLYL NGDRLGGFVD WISNEYTAWQ LTFPNKL //