LOCUS       CAA88964.4   3532 aa   PRT   CON   06-FEB-2024
DEFINITION  Caenorhabditis elegans Apple domain-containing protein.
ACCESSION   BX284602-3969
PROTEIN_ID  CAA88964.4
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea;
            Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea;
            Rhabditidae; Peloderinae; Caenorhabditis.
REFERENCE   1 (bases 1 to 15279421)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2 (bases 1 to 15279421)
  AUTHORS   Sulston J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing
            Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and
            The Genome Institute at Washington University, St. Louis, MO
            63110, USA.
REFERENCE   3 (bases 1 to 15279421)
  AUTHORS   Sulston J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of
            integration and manual review of the following types of data:
            ab initio predictions by Genefinder (P. Green and L. Hillier,
            pers. comm.); alignments to published proteins and cDNAs;
            genome sequence conservation with other nematodes (e.g. to
            C. briggsae using WABA: Genome Res. 2000. 10:1115-1125);
            sequence features (such as trans-splice and polyA sites).
            Sources of data: large-scale EST projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html);
            ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST
            large-scale sequencing project (Genome Res. 2009.
            19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR
            EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data
            submission (UTRome v1 Mangone M. Piano F.
            2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep
            sequencing data (454 read clusters - Makedonka Mitreva,
            unpublished; Illumina sequence data, Genome Res. 2009.
            19:657-66); Numerous data sets from the modENCODE project
            (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide
            Database submissions; Personal communications with C. elegans
            researchers; Non-Coding gene structures below are derived
            using the following methods and data: ab initio prediction of
            tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964);
            integration and appraisal of miRNAs from miRBase
            (http://www.mirbase.org); integration and appraisal of RFAM
            predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006.
            127:1193-1207); modENCODE data (Science. 2010. 330:1775-87);
            manual curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="II"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="srap-1"
                     /locus_tag="CELE_T06D8.1"
                     /standard_name="T06D8.1a"
                     /note="Confirmed by transcript evidence"
                     /db_xref="EnsemblGenomes-Gn:WBGene00011522"
                     /db_xref="EnsemblGenomes-Tr:T06D8.1a"
                     /db_xref="InterPro:IPR003609"
                     /db_xref="InterPro:IPR009557"
                     /db_xref="UniProtKB/TrEMBL:G5ECB4"
                     /db_xref="WormBase:WBGene00011522"
     intron_pos      43:1 (1/23)
     intron_pos      110:1 (2/23)
     intron_pos      140:1 (3/23)
     intron_pos      181:1 (4/23)
     intron_pos      205:1 (5/23)
     intron_pos      239:1 (6/23)
     intron_pos      265:2 (7/23)
     intron_pos      323:1 (8/23)
     intron_pos      402:1 (9/23)
     intron_pos      442:0 (10/23)
     intron_pos      489:1 (11/23)
     intron_pos      664:1 (12/23)
     intron_pos      827:1 (13/23)
     intron_pos      1181:1 (14/23)
     intron_pos      3095:0 (15/23)
     intron_pos      3136:0 (16/23)
     intron_pos      3174:0 (17/23)
     intron_pos      3208:1 (18/23)
     intron_pos      3223:1 (19/23)
     intron_pos      3319:1 (20/23)
     intron_pos      3347:1 (21/23)
     intron_pos      3391:0 (22/23)
     intron_pos      3445:1 (23/23)
BEGIN
        1 MKPRWWHNSS SQTSQFLIFI PLLTLLASSS VNAKPLNLPQ TITCADNIYV YVNSTEADRS
       61 PYIFVEIKTA TIHDCINSCF GNQFCYSLKY DQSKADSCSL
FYFAAYNCTG QALVPAKSVV 121 YNGGAVTIDC LRCPSNGDFV TAPPFTSFTE QTITAVGDRG ETLIEKPLVE DITHNIDSKL 181 ESTTPSGHST AIVDLHVEDT TTAVGAETTS TSAPEPIAST KIAPVQTAQH NRKGNYYPAC 241 YINFQVEEIS TQPNFDHYTI KPAKSANACA RFCFVGLCTV AVYSPSTGEC RLGRDRREKC 301 TDSENKFSYT GADDVVLQCF RCSSKKVPPT ADVTKTTVSF QEEEHVTTQA KADETTTAEP 361 STTTATESST SLAESNDQEP AVATKVEMAK DKEGVKTTQR KHCVIKFQAR PLSDRPENLK 421 AKFELNVPVD SIELCATRCY QDGCSGARFD PADRSCTLSY DDPQFCARGN VFIHYEAKEA 481 TWLHCVNCYT VKPSDFDEVR TGTTAATPKG IESTTVASSE TTSAEAVTTT QKQADITSTT 541 AAPELTSSAS QESTTTTVAT STVKQSESDG SDDFQRGCLI KFQARPLTER PKELSAKFET 601 EIKVDSVEMC ATRCYQDGCS GARFDPVWST CSLSYDEKHF CARGDVFLQY MAKEVTWIHC 661 VNCYAIKSTA AADLPKVPSK TNDDNEITTT SAPAAVTNPW GETETPLSEG EKTTTSSPKE 721 EAATLKTIGQ EVDDDSLLKG CIVHFQSQPI EERNKEFTAP FELNLKVPTA EICAHRCYQD 781 GCTAAKYDPS TQQCSLAYED KPFCGNGRLV NIDRSDKTVW IHCLSCVPLK NSKPAENTDG 841 EVTESNLPDG SGEHGADTAS GEEPTSTPIT APTDFSNDDQ VTEASGEETT TAAATEASSE 901 ETTTSAVTEG SGEETTVPTT VESSGEEPAL SSTSVPTELS KDDQVTEASG EETTTAAATE 961 ASSEETTTPA VTEASGEEST TSAVTEGSGE EITVPTTVES SGEEPALSST SVPTELSKDD 1021 QVTEASGEET TTAAATEASS EETTTPAVTE ASGEESTTSA VTEGSGEEIT VHTTVESSGE 1081 EPAISSTSIP TELSKDDQVT EASGEETTTA AATEASLEAT TTPAVTEASG EEITTSAVTE 1141 ESGEETTVVA VVESSGEEQA SSSTSIPTEL SKDDQVTEAS GEETTTAAAT EASSEATTTP 1201 AVTEASGEET TVVAVVESSG EEPASSSTSI PTELSKDNQV TEASGEETNT AAVTEGSGEE 1261 TTTAAATETS SEETTISAVT EASGEETTVV AVVQSSGEEP ASSSTSIPTE LSKDDQVTEA 1321 SGEETTTAAA TEASSEETTT LAVTEGSGEE TTVVAVVESS GEEPASSSTS IPTELSKDDQ 1381 VTEASGEETT TAAVTEGSGE ETVTPAATEA SSEATTTPAG TEASGEETTT SAVTEGSGEE 1441 NTVVAVVESS GEEPASSSTS IPTELSNDDQ VTEGSGEETT TAAATETSSE ETTTSVVTEG 1501 SGEETTTSAV TEASGEETTT SAVTEGSGEE NTVVAVVQSS GEEPASSSTS IPTELSKDDQ 1561 VTEASGEETT TAAATEASSE ETTTSAVTEG SGEETTTSAV SEGSGDETTT AAATEASSEE 1621 TITSAVTEGS GEETTTSAVT EGSGEETTTA AATEASSEET ITSAVTEGSG EETTTSAVTE 1681 GSGEEITVPT TVESSGEEPA LSSMSIPTEL SKDDQVTEAS GEETTTAAAT ETSSEETTTS 1741 VVTEGSGEQT TVVAVVESSG EEPASSSTSI PTELSKDDQV TEASGEETTT AAATEASSEE 1801 
TITSAVTEGS GEETTVVAVV ESSGEEPASS STSVPTELSK DDQVTEASGE ETTTAAATEA 1861 SSEETTTSAV TEGSGEETTT SAVTEASSEA TTTPAGTEAS GEETTTSAVT EGSGEETTVV 1921 AVVESSGEEP ASSSTSIPTE LSKNDQVTEA SGEETITAAA TEASEETTTS AVTEGSGEDT 1981 TVVAVVELSG EQPASSSTSI PTELSKDDQV TEASGEETTT AAATEASEET TTSAVTEGSG 2041 EETTVVAVVE SSGEEPASSS TSIPTELSKD DQVTEASGEE TTTAAATEAS EETTTSAVTE 2101 GSGEDTTVVA VVESSGEQPA SSSTSIPTEL SKDDQVTEAS GEETTTAAAT EASEETTTSA 2161 VTEGSGEDTT VVAVVESSGE QPASSSTSIP TELSKDDQVT EASGEETTTA AATEASEETT 2221 TSAVTEGSGE ETTVVAVVES SGEEPASSST SIPTELSKDD KVTEASGEET TTAAATDASS 2281 EETTTSAVTE GSGEETTVVA VVESSDEEPA SSSTSIPTEL SKDDQVTEAS GEETTTAAAT 2341 EASEETTTSA VTEGSGEETT VVAVVESSGE EPASSSTSIP TELSKDDKVT EASGEETTTA 2401 AATDASSEET TTSAVTEGSG EETTVVAVVE SSDEEPASSS TSIPTELSKD DQVTEASGEE 2461 TTTAAATEAS EETTTSAVTE GSGEETTVVA VVESSGEEPA SSSTSIPTEL SKDDQVTEAS 2521 GEETTTAAAT EASEETTTSA VTEGSGEDTT VVAVVESSGE QPASSSTSIP TELSKDDQVT 2581 EASGEETTTA AATEASEETT TSAVTEGSGE ETTVVAVVES SGEEPASSST SIPTELSKDD 2641 QVTEASGEET TTAAATEASS EETTTSAVTE GSGEETTTSA VTEGSGEETT TSAVPEGENS 2701 TTEAPAFVTG SEIEIPSSEE SSSTTTHDPS IPVITPKPSV SSTIENVMSK TSSEEAAEKK 2761 IIGEHQTGKD DDAGKEDEDN MPAFVTANPA GTSTTESAEN VTSTGEEDEN IKMAKELGKQ 2821 FAADLAKLAA KDGVNLTETA DAKDSGETAH VEDEQVSSTE SSIGSEETTT TVNKETTEEH 2881 HEASGEEDDA PAFVTGAPTD STTEASVSTT SAITDETTSV AADESTSTSA GEVQSSSAII 2941 DSATVASEEQ TSSEATSVIE SSGEEVTTTD ENLVTSTVAQ LEEGSGITAA ESKDEDSVTT 3001 EATSQSTTVS ESSDGSGEST VAPNDSETST TESSQSTTDE GSGVTAAESK DEESSTTEAP 3061 AFVTSKTSGS EEDEEDSPDT HEFLTGIDET MFNKSLVPDT HREDLPNNVG FVPSSEPKPK 3121 NPDEEEEEEE DDGTKSDDYE DNVSKKISST SAPTTTTTEA AGATTEPPVQ LVKDLIDALA 3181 AGGLDFVLGR PRKPTSQAAQ DMINRKLGPI QRLLPQAIEN KHECTTGRVR FVASEMVDLS 3241 QHFERDAVAF SLEHCARMCF ETSCVRAAFT RFPRPVCLMH YADQKTAHLD TNCTDVTPTT 3301 SWTFTKINQV VAIDCVTCAD EKKTHDITSF SVDEPSSSSD NIPLHQGLSS KCDGRVEFQV 3361 IPVASLPKLN ITNDVPASSP ADCARKCFEM KNCKTAGFIP SPSGTIAQGV CLLTSDDVVC 3421 GNLADFVPQH AALHPFVVSC IRCTSCTYNI RPVTPTRTMP TMKVHEQADN VQECAKLCAD 3481 MKCTMAKYEN 
NTKICSMTRE PVTEETCPQE VATQIHDSLL PVSIECVKCS GN //