LOCUS CCD70484.2 2311 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Titin-like protein. ACCESSION BX284606-841 PROTEIN_ID CCD70484.2 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="F35A5.1" /locus_tag="CELE_F35A5.1" /standard_name="F35A5.1" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00018024" /db_xref="EnsemblGenomes-Tr:F35A5.1" /db_xref="GOA:Q20007" /db_xref="InterPro:IPR033183" /db_xref="UniProtKB/TrEMBL:Q20007" /db_xref="WormBase:WBGene00018024" intron_pos 2076:2 (1/6) intron_pos 2097:1 (2/6) intron_pos 2168:0 (3/6) intron_pos 2197:0 (4/6) intron_pos 2242:0 (5/6) intron_pos 2305:0 (6/6) BEGIN 1 MSRAPPTPIK NPAKKWKPPW ESVDEEEEME VDEETPAPSK LEKKPSLKRK DAPTKPVPSP 61 GAPSPVPIKN PVKKWKAPWE DDEPMEEAPA APVPAKKVRD PSPKKVPAKP RDASPKKIMA 121 AKKEPETLPA VPPTPVKNPV KKFKAPWEDD EVDVEDVKDA PTVPAKKTPV LKKKEPAAAA 181 KPRDPSPKKA APSKEHDPIV PPTPIKNPAK KWKPPWEDDE VPTEEIKEPE PATRKVPALK 241 KKEPSTSVKP VSDPSPTKKV PVKKEPEVPP TPIKNPTKKW KPPWEDETPV EEVKEPPVPE 301 KKAPVLKKKD PAPAAKARDP SPSKAAPKKV EPSSPVVPPT PVKNPVKKYK PPWEVDDEPA 361 EEVKKPSAPE KKTPVLKRKE PEPSSTTPSS DPSPKKAAPA VKPRDSSPKK ATPLQADPKA 421 QEVPPTPVKN PVKKYKPPWE VDDEDPVEEV KQPEAPAKKT PVLKRKEPAA KDTAKPATSK 481 TPETPEKKDP VKPRDSSPKK VAAKPDSAQA PATPVKNPVK KWRPPWEDDE TPADDVSKPT 541 DAKKTPSLAK KDPAPAKESL KPKADTKAPA KPRDPSPKKV APTAPEKKTP VLAKKEPAGP 601 ADSKTKEPEK SKPRDPSPKK AVPAKPVPKT EVAPAAVKKP EPISKPKDTA PKKAEPNSPV 661 VPPTPVKNPV KKWKPPWEDD DAPAKPVSLP EPEKKTPVLA KKAPTKPDSE AAADPVSGPS 721 SKDPKLAKKA PVKPRDPSPM KAVPIKPAPK TEVPPAVVKK PEPVAKSRDP SPKKAKAEPN 781 SPVVPPTPVK NPVKKWKPPW EDDDAPAEPV NVPEPEKKTP VLAKKTPVKP RDPSPKKAVP 841 AKPSTKTDAP PVSVKKPEPV SKPKEPSPKK AEPNSPVVPP TPVKNPVKKW KPPWEDDDEP 901 TEEVKKPSEP EKKTPVLAKK EPEKPKDAPK VAAKPRDPSP KKAVPEKEPA KVAAKPRDLS 961 PKKAIPIPAN TQEAPPTPVK NPVKKWKPPW EDDDEPAEPV SAPEPEKKTP VLAKKAPAKP 1021 RDPSPKKAAP VAAKPDPKIP EVPPTPVKNP VKKWKPPWED DDEPSEPVSA PEPEKKTPVL 1081 AKKAPTKPAT KPDSEAAADP VSGPTSKDPK LSKKAPVEKP KPTTDPKDDK LKPSPAKKPE 1141 KAPEPAAPKK WKPVWDDDPD EPEADFTVPA PSKKPDTEDP ADPLGGPKTK DPKLNKKAPA 1201 EKPTEKPKPK EVSKEPPKPT EPPKPAAPKK WKPPWEDDPD EPEADFTMPA PKKPDTEDPA 1261 DSLGGPKPKD PKLAKKAPAK KPTETPKPKE VPKEPPKAAE PPKPAAPKKW KPPWEDDPDE 1321 PEADFTMPAP KKPDSEDPAD SLGGPKPKDP KLASKAPAKK PSETPKPKEV PKELPKPAEP 1381 SKPAAPKKWR PPWEDDPDEP EEPEADFTMP APKKPDTEDP ADSLGGPKPK DPKLAKKAPA 1441 KKPTETLKPK DAPKEPPKPA DPPKPAALKK PPWEDDPDEP EADFTMPAPK KPDTEDLADP 1501 PGGPKPKDPK LAKKAPAKKP SETPKPEDAP KEPPKPAEPP KPAAPKKWKP PWEEDPDEPE 1561 EPEADFTMPT PKKPDTEDPA DPLGRPKPKD PKLAKKAPAK KPSETPKPKD APKEPPKPAE 1621 PPKPAAPKKW KPPWEEDPDE PEEPEADYTM PAPKKPDTED PADPLGGPKP KDPKLAKKAP 1681 AKKPTDKPKS KDVPKEAPKP AEPPKPAAPK KWKPPWEEDP DEPEEPEADF TMPAPKKPDT 1741 EDPADPLGGP KKKDPKLAKK APSKKPTDKP KPKDLPKEEP KPAEPPKPAA PKKWKPPWEE 1801 DPDEPEEPEA DFTMPAPKKP DTEDPADPLG GPKKKDPKLA KKAPAKKPTD KPKPKDAPKD 1861 AKPTPEEPAK PVAPKKWKPP WEEDPDEPEE PEADFTMPAK KKPDTEDPAD PLGGPNKKDP 1921 KLAKKAPTKK PADKPKPSEE PEKPVAPKKW KPPWEEDPDD EPEADFTVPI KPGEDEDEPE 1981 DADDEEEPED EPAEDEPKKK KPKKHRKRPK KKKPVVEPEK EPTPEPVVPK APKWIAPIKK 2041 PEEPIPMPPK EKTIAERNKE ERIPPALRYA KKPRELEVYI PFVIPWEQTA ALITQEGMGA 2101 FGKSRAANVE VNFGDKPIVQ GAVDSKTVIP LWNDESKCAN RSGMTAFGAP REIDQNVVDH 2161 HVFNLMDKGK SQGIIPLLAK GTTYHPHGEY GTIRRQTADV KYKDGWKPGM DSESHGFISR 2221 QFIANSKEKA GSNLLDKRRT IISDALPQSK ECEAMIPLMF DGRAVETREG SEFGSFRPLV 2281 TNATGGYLMS YADEMKCKNI IPFQTAPSLV R //