LOCUS       CCD70484.2              2311 aa    PRT              CON 06-FEB-2024
DEFINITION  Caenorhabditis elegans Titin-like protein.
ACCESSION   BX284606-841
PROTEIN_ID  CCD70484.2
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
            Caenorhabditis.
REFERENCE   1  (bases 1 to 17718942)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2  (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project:
            Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome
            Institute at Washington University, St. Louis, MO 63110, USA.
REFERENCE   3  (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of integration
            and manual review of the following types of data: ab initio
            predictions by Genefinder (P. Green and L. Hillier, pers. comm.);
            alignments to published proteins and cDNAs; genome sequence
            conservation with other nematodes (e.g. to C. briggsae using WABA:
            Genome Res. 2000. 10:1115-1125); sequence features (such as
            trans-splice and polyA sites).
            Sources of data: large-scale EST projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome
            cloning project (http://worfdb.dfci.harvard.edu); RST large-scale
            sequencing project (Genome Res. 2009. 19:2334-2342); IST library
            (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010
            Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
            Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep
            sequencing data (454 read clusters - Makedonka Mitreva,
            unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66);
            Numerous data sets from the modENCODE project (Science. 2010.
            330:1775-87); Individual C. elegans Nucleotide Database
            submissions; Personal communications with C. elegans researchers.
            Non-Coding gene structures below are derived using the following
            methods and data: ab initio prediction of tRNAs by tRNAscan-SE
            (Nucl. Acids. Res., 25, 955-964); integration and appraisal of
            miRNAs from miRBase (http://www.mirbase.org); integration and
            appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell.
            2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87);
            manual curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="X"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="F35A5.1"
                     /locus_tag="CELE_F35A5.1"
                     /standard_name="F35A5.1"
                     /note="Confirmed by transcript evidence"
                     /db_xref="EnsemblGenomes-Gn:WBGene00018024"
                     /db_xref="EnsemblGenomes-Tr:F35A5.1"
                     /db_xref="GOA:Q20007"
                     /db_xref="InterPro:IPR033183"
                     /db_xref="UniProtKB/TrEMBL:Q20007"
                     /db_xref="WormBase:WBGene00018024"
     intron_pos      2076:2 (1/6)
     intron_pos      2097:1 (2/6)
     intron_pos      2168:0 (3/6)
     intron_pos      2197:0 (4/6)
     intron_pos      2242:0 (5/6)
     intron_pos      2305:0 (6/6)
BEGIN
        1 MSRAPPTPIK NPAKKWKPPW ESVDEEEEME VDEETPAPSK LEKKPSLKRK DAPTKPVPSP
       61 GAPSPVPIKN PVKKWKAPWE DDEPMEEAPA APVPAKKVRD PSPKKVPAKP RDASPKKIMA
      121 AKKEPETLPA VPPTPVKNPV KKFKAPWEDD EVDVEDVKDA PTVPAKKTPV LKKKEPAAAA
      181 KPRDPSPKKA APSKEHDPIV PPTPIKNPAK KWKPPWEDDE VPTEEIKEPE PATRKVPALK
      241 KKEPSTSVKP VSDPSPTKKV PVKKEPEVPP TPIKNPTKKW KPPWEDETPV EEVKEPPVPE
      301 KKAPVLKKKD PAPAAKARDP SPSKAAPKKV EPSSPVVPPT PVKNPVKKYK PPWEVDDEPA
      361 EEVKKPSAPE KKTPVLKRKE PEPSSTTPSS DPSPKKAAPA VKPRDSSPKK ATPLQADPKA
      421 QEVPPTPVKN PVKKYKPPWE VDDEDPVEEV KQPEAPAKKT PVLKRKEPAA KDTAKPATSK
      481 TPETPEKKDP VKPRDSSPKK VAAKPDSAQA PATPVKNPVK KWRPPWEDDE TPADDVSKPT
      541 DAKKTPSLAK KDPAPAKESL KPKADTKAPA KPRDPSPKKV APTAPEKKTP VLAKKEPAGP
      601 ADSKTKEPEK SKPRDPSPKK AVPAKPVPKT EVAPAAVKKP EPISKPKDTA PKKAEPNSPV
      661 VPPTPVKNPV KKWKPPWEDD DAPAKPVSLP EPEKKTPVLA KKAPTKPDSE AAADPVSGPS
      721 SKDPKLAKKA PVKPRDPSPM KAVPIKPAPK TEVPPAVVKK PEPVAKSRDP SPKKAKAEPN
      781 SPVVPPTPVK NPVKKWKPPW EDDDAPAEPV NVPEPEKKTP VLAKKTPVKP RDPSPKKAVP
      841 AKPSTKTDAP PVSVKKPEPV SKPKEPSPKK AEPNSPVVPP TPVKNPVKKW KPPWEDDDEP
      901 TEEVKKPSEP EKKTPVLAKK EPEKPKDAPK VAAKPRDPSP KKAVPEKEPA KVAAKPRDLS
      961 PKKAIPIPAN TQEAPPTPVK NPVKKWKPPW EDDDEPAEPV SAPEPEKKTP VLAKKAPAKP
     1021 RDPSPKKAAP VAAKPDPKIP EVPPTPVKNP VKKWKPPWED DDEPSEPVSA PEPEKKTPVL
     1081 AKKAPTKPAT KPDSEAAADP VSGPTSKDPK LSKKAPVEKP KPTTDPKDDK LKPSPAKKPE
     1141 KAPEPAAPKK WKPVWDDDPD EPEADFTVPA PSKKPDTEDP ADPLGGPKTK DPKLNKKAPA
     1201 EKPTEKPKPK EVSKEPPKPT EPPKPAAPKK WKPPWEDDPD EPEADFTMPA PKKPDTEDPA
     1261 DSLGGPKPKD PKLAKKAPAK KPTETPKPKE VPKEPPKAAE PPKPAAPKKW KPPWEDDPDE
     1321 PEADFTMPAP KKPDSEDPAD SLGGPKPKDP KLASKAPAKK PSETPKPKEV PKELPKPAEP
     1381 SKPAAPKKWR PPWEDDPDEP EEPEADFTMP APKKPDTEDP ADSLGGPKPK DPKLAKKAPA
     1441 KKPTETLKPK DAPKEPPKPA DPPKPAALKK PPWEDDPDEP EADFTMPAPK KPDTEDLADP
     1501 PGGPKPKDPK LAKKAPAKKP SETPKPEDAP KEPPKPAEPP KPAAPKKWKP PWEEDPDEPE
     1561 EPEADFTMPT PKKPDTEDPA DPLGRPKPKD PKLAKKAPAK KPSETPKPKD APKEPPKPAE
     1621 PPKPAAPKKW KPPWEEDPDE PEEPEADYTM PAPKKPDTED PADPLGGPKP KDPKLAKKAP
     1681 AKKPTDKPKS KDVPKEAPKP AEPPKPAAPK KWKPPWEEDP DEPEEPEADF TMPAPKKPDT
     1741 EDPADPLGGP KKKDPKLAKK APSKKPTDKP KPKDLPKEEP KPAEPPKPAA PKKWKPPWEE
     1801 DPDEPEEPEA DFTMPAPKKP DTEDPADPLG GPKKKDPKLA KKAPAKKPTD KPKPKDAPKD
     1861 AKPTPEEPAK PVAPKKWKPP WEEDPDEPEE PEADFTMPAK KKPDTEDPAD PLGGPNKKDP
     1921 KLAKKAPTKK PADKPKPSEE PEKPVAPKKW KPPWEEDPDD EPEADFTVPI KPGEDEDEPE
     1981 DADDEEEPED EPAEDEPKKK KPKKHRKRPK KKKPVVEPEK EPTPEPVVPK APKWIAPIKK
     2041 PEEPIPMPPK EKTIAERNKE ERIPPALRYA KKPRELEVYI PFVIPWEQTA ALITQEGMGA
     2101 FGKSRAANVE VNFGDKPIVQ GAVDSKTVIP LWNDESKCAN RSGMTAFGAP REIDQNVVDH
     2161 HVFNLMDKGK SQGIIPLLAK GTTYHPHGEY GTIRRQTADV KYKDGWKPGM DSESHGFISR
     2221 QFIANSKEKA GSNLLDKRRT IISDALPQSK ECEAMIPLMF DGRAVETREG SEFGSFRPLV
     2281 TNATGGYLMS YADEMKCKNI IPFQTAPSLV R
//