LOCUS       CAA88964.4              3532 aa    PRT              CON 06-FEB-2024
DEFINITION  Caenorhabditis elegans Apple domain-containing protein protein.
ACCESSION   BX284602-3969
PROTEIN_ID  CAA88964.4
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
            Caenorhabditis.
REFERENCE   1  (bases 1 to 15279421)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2  (bases 1 to 15279421)
  AUTHORS   Sulson J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project:
            Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome
            Institute at Washington University, St. Louis, MO 63110, USA.
REFERENCE   3  (bases 1 to 15279421)
  AUTHORS   Sulson J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of integration
            and manual review of the following types of data: ab initio
            predictions by Genefinder (P. Green and L. Hillier, pers. comm.);
            alignments to published proteins and cDNAs; genome sequence
            conservation with other nematodes (e.g. to C. briggsae using WABA:
            Genome Res. 2000. 10:1115-1125); sequence features (such as
            trans-splice and polyA sites).
            Sources of data: large-scale EST projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome
            cloning project (http://worfdb.dfci.harvard.edu); RST large-scale
            sequencing project (Genome Res. 2009. 19:2334-2342); IST library
            (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010
            Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
            Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep
            sequencing data (454 read clusters - Makedonka Mitreva,
            unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66);
            Numerous data sets from the modENCODE project (Science. 2010.
            330:1775-87); Individual C. elegans Nucleotide Database
            submissions; Personal communications with C. elegans researchers;
            Non-Coding gene structures below are derived using the following
            methods and data: ab initio prediction of tRNAs by tRNAscan-SE
            (Nucl. Acids. Res., 25, 955-964); integration and appraisal of
            miRNAs from miRBase (http://www.mirbase.org); integration and
            appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell.
            2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87);
            manual curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="II"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="srap-1"
                     /locus_tag="CELE_T06D8.1"
                     /standard_name="T06D8.1a"
                     /note="Confirmed by transcript evidence"
                     /db_xref="EnsemblGenomes-Gn:WBGene00011522"
                     /db_xref="EnsemblGenomes-Tr:T06D8.1a"
                     /db_xref="InterPro:IPR003609"
                     /db_xref="InterPro:IPR009557"
                     /db_xref="UniProtKB/TrEMBL:G5ECB4"
                     /db_xref="WormBase:WBGene00011522"
     intron_pos      43:1 (1/23)
     intron_pos      110:1 (2/23)
     intron_pos      140:1 (3/23)
     intron_pos      181:1 (4/23)
     intron_pos      205:1 (5/23)
     intron_pos      239:1 (6/23)
     intron_pos      265:2 (7/23)
     intron_pos      323:1 (8/23)
     intron_pos      402:1 (9/23)
     intron_pos      442:0 (10/23)
     intron_pos      489:1 (11/23)
     intron_pos      664:1 (12/23)
     intron_pos      827:1 (13/23)
     intron_pos      1181:1 (14/23)
     intron_pos      3095:0 (15/23)
     intron_pos      3136:0 (16/23)
     intron_pos      3174:0 (17/23)
     intron_pos      3208:1 (18/23)
     intron_pos      3223:1 (19/23)
     intron_pos      3319:1 (20/23)
     intron_pos      3347:1 (21/23)
     intron_pos      3391:0 (22/23)
     intron_pos      3445:1 (23/23)
BEGIN
        1 MKPRWWHNSS SQTSQFLIFI PLLTLLASSS VNAKPLNLPQ TITCADNIYV YVNSTEADRS
       61 PYIFVEIKTA TIHDCINSCF GNQFCYSLKY DQSKADSCSL FYFAAYNCTG QALVPAKSVV
      121 YNGGAVTIDC LRCPSNGDFV TAPPFTSFTE QTITAVGDRG ETLIEKPLVE DITHNIDSKL
      181 ESTTPSGHST AIVDLHVEDT TTAVGAETTS TSAPEPIAST KIAPVQTAQH NRKGNYYPAC
      241 YINFQVEEIS TQPNFDHYTI KPAKSANACA RFCFVGLCTV AVYSPSTGEC RLGRDRREKC
      301 TDSENKFSYT GADDVVLQCF RCSSKKVPPT ADVTKTTVSF QEEEHVTTQA KADETTTAEP
      361 STTTATESST SLAESNDQEP AVATKVEMAK DKEGVKTTQR KHCVIKFQAR PLSDRPENLK
      421 AKFELNVPVD SIELCATRCY QDGCSGARFD PADRSCTLSY DDPQFCARGN VFIHYEAKEA
      481 TWLHCVNCYT VKPSDFDEVR TGTTAATPKG IESTTVASSE TTSAEAVTTT QKQADITSTT
      541 AAPELTSSAS QESTTTTVAT STVKQSESDG SDDFQRGCLI KFQARPLTER PKELSAKFET
      601 EIKVDSVEMC ATRCYQDGCS GARFDPVWST CSLSYDEKHF CARGDVFLQY MAKEVTWIHC
      661 VNCYAIKSTA AADLPKVPSK TNDDNEITTT SAPAAVTNPW GETETPLSEG EKTTTSSPKE
      721 EAATLKTIGQ EVDDDSLLKG CIVHFQSQPI EERNKEFTAP FELNLKVPTA EICAHRCYQD
      781 GCTAAKYDPS TQQCSLAYED KPFCGNGRLV NIDRSDKTVW IHCLSCVPLK NSKPAENTDG
      841 EVTESNLPDG SGEHGADTAS GEEPTSTPIT APTDFSNDDQ VTEASGEETT TAAATEASSE
      901 ETTTSAVTEG SGEETTVPTT VESSGEEPAL SSTSVPTELS KDDQVTEASG EETTTAAATE
      961 ASSEETTTPA VTEASGEEST TSAVTEGSGE EITVPTTVES SGEEPALSST SVPTELSKDD
     1021 QVTEASGEET TTAAATEASS EETTTPAVTE ASGEESTTSA VTEGSGEEIT VHTTVESSGE
     1081 EPAISSTSIP TELSKDDQVT EASGEETTTA AATEASLEAT TTPAVTEASG EEITTSAVTE
     1141 ESGEETTVVA VVESSGEEQA SSSTSIPTEL SKDDQVTEAS GEETTTAAAT EASSEATTTP
     1201 AVTEASGEET TVVAVVESSG EEPASSSTSI PTELSKDNQV TEASGEETNT AAVTEGSGEE
     1261 TTTAAATETS SEETTISAVT EASGEETTVV AVVQSSGEEP ASSSTSIPTE LSKDDQVTEA
     1321 SGEETTTAAA TEASSEETTT LAVTEGSGEE TTVVAVVESS GEEPASSSTS IPTELSKDDQ
     1381 VTEASGEETT TAAVTEGSGE ETVTPAATEA SSEATTTPAG TEASGEETTT SAVTEGSGEE
     1441 NTVVAVVESS GEEPASSSTS IPTELSNDDQ VTEGSGEETT TAAATETSSE ETTTSVVTEG
     1501 SGEETTTSAV TEASGEETTT SAVTEGSGEE NTVVAVVQSS GEEPASSSTS IPTELSKDDQ
     1561 VTEASGEETT TAAATEASSE ETTTSAVTEG SGEETTTSAV SEGSGDETTT AAATEASSEE
     1621 TITSAVTEGS GEETTTSAVT EGSGEETTTA AATEASSEET ITSAVTEGSG EETTTSAVTE
     1681 GSGEEITVPT TVESSGEEPA LSSMSIPTEL SKDDQVTEAS GEETTTAAAT ETSSEETTTS
     1741 VVTEGSGEQT TVVAVVESSG EEPASSSTSI PTELSKDDQV TEASGEETTT AAATEASSEE
     1801 TITSAVTEGS GEETTVVAVV ESSGEEPASS STSVPTELSK DDQVTEASGE ETTTAAATEA
     1861 SSEETTTSAV TEGSGEETTT SAVTEASSEA TTTPAGTEAS GEETTTSAVT EGSGEETTVV
     1921 AVVESSGEEP ASSSTSIPTE LSKNDQVTEA SGEETITAAA TEASEETTTS AVTEGSGEDT
     1981 TVVAVVELSG EQPASSSTSI PTELSKDDQV TEASGEETTT AAATEASEET TTSAVTEGSG
     2041 EETTVVAVVE SSGEEPASSS TSIPTELSKD DQVTEASGEE TTTAAATEAS EETTTSAVTE
     2101 GSGEDTTVVA VVESSGEQPA SSSTSIPTEL SKDDQVTEAS GEETTTAAAT EASEETTTSA
     2161 VTEGSGEDTT VVAVVESSGE QPASSSTSIP TELSKDDQVT EASGEETTTA AATEASEETT
     2221 TSAVTEGSGE ETTVVAVVES SGEEPASSST SIPTELSKDD KVTEASGEET TTAAATDASS
     2281 EETTTSAVTE GSGEETTVVA VVESSDEEPA SSSTSIPTEL SKDDQVTEAS GEETTTAAAT
     2341 EASEETTTSA VTEGSGEETT VVAVVESSGE EPASSSTSIP TELSKDDKVT EASGEETTTA
     2401 AATDASSEET TTSAVTEGSG EETTVVAVVE SSDEEPASSS TSIPTELSKD DQVTEASGEE
     2461 TTTAAATEAS EETTTSAVTE GSGEETTVVA VVESSGEEPA SSSTSIPTEL SKDDQVTEAS
     2521 GEETTTAAAT EASEETTTSA VTEGSGEDTT VVAVVESSGE QPASSSTSIP TELSKDDQVT
     2581 EASGEETTTA AATEASEETT TSAVTEGSGE ETTVVAVVES SGEEPASSST SIPTELSKDD
     2641 QVTEASGEET TTAAATEASS EETTTSAVTE GSGEETTTSA VTEGSGEETT TSAVPEGENS
     2701 TTEAPAFVTG SEIEIPSSEE SSSTTTHDPS IPVITPKPSV SSTIENVMSK TSSEEAAEKK
     2761 IIGEHQTGKD DDAGKEDEDN MPAFVTANPA GTSTTESAEN VTSTGEEDEN IKMAKELGKQ
     2821 FAADLAKLAA KDGVNLTETA DAKDSGETAH VEDEQVSSTE SSIGSEETTT TVNKETTEEH
     2881 HEASGEEDDA PAFVTGAPTD STTEASVSTT SAITDETTSV AADESTSTSA GEVQSSSAII
     2941 DSATVASEEQ TSSEATSVIE SSGEEVTTTD ENLVTSTVAQ LEEGSGITAA ESKDEDSVTT
     3001 EATSQSTTVS ESSDGSGEST VAPNDSETST TESSQSTTDE GSGVTAAESK DEESSTTEAP
     3061 AFVTSKTSGS EEDEEDSPDT HEFLTGIDET MFNKSLVPDT HREDLPNNVG FVPSSEPKPK
     3121 NPDEEEEEEE DDGTKSDDYE DNVSKKISST SAPTTTTTEA AGATTEPPVQ LVKDLIDALA
     3181 AGGLDFVLGR PRKPTSQAAQ DMINRKLGPI QRLLPQAIEN KHECTTGRVR FVASEMVDLS
     3241 QHFERDAVAF SLEHCARMCF ETSCVRAAFT RFPRPVCLMH YADQKTAHLD TNCTDVTPTT
     3301 SWTFTKINQV VAIDCVTCAD EKKTHDITSF SVDEPSSSSD NIPLHQGLSS KCDGRVEFQV
     3361 IPVASLPKLN ITNDVPASSP ADCARKCFEM KNCKTAGFIP SPSGTIAQGV CLLTSDDVVC
     3421 GNLADFVPQH AALHPFVVSC IRCTSCTYNI RPVTPTRTMP TMKVHEQADN VQECAKLCAD
     3481 MKCTMAKYEN NTKICSMTRE PVTEETCPQE VATQIHDSLL PVSIECVKCS GN
//