LOCUS CAA0059123.1 1899 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans protein-tyrosine-phosphatase protein. ACCESSION BX284602-3898 PROTEIN_ID CAA0059123.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 15279421) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 15279421) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 15279421) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="II" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="ptp-3" /locus_tag="CELE_C09D8.1" /standard_name="C09D8.1m" /note="Confirmed by transcript evidence" intron_pos 50:1 (1/25) intron_pos 95:1 (2/25) intron_pos 142:1 (3/25) intron_pos 183:2 (4/25) intron_pos 207:2 (5/25) intron_pos 236:0 (6/25) intron_pos 318:0 (7/25) intron_pos 365:1 (8/25) intron_pos 434:2 (9/25) intron_pos 458:1 (10/25) intron_pos 468:1 (11/25) intron_pos 485:0 (12/25) intron_pos 573:1 (13/25) intron_pos 644:1 (14/25) intron_pos 692:0 (15/25) intron_pos 729:0 (16/25) intron_pos 773:1 (17/25) intron_pos 802:0 (18/25) intron_pos 861:0 (19/25) intron_pos 1033:0 (20/25) intron_pos 1125:0 (21/25) intron_pos 1158:0 (22/25) intron_pos 1697:2 (23/25) intron_pos 1782:0 (24/25) intron_pos 1864:0 (25/25) BEGIN 1 MQSSSILIYL LTVVGCSIAF LDNTLPLYLF KNLDTKETST VQTGLVTDKV RRVPPYFSYK 61 LERQYVVGVG GNINLTCVAV GYPMPRVFWK KTDLMVLDDP STAPIGKNVL TLTHVESTEN 121 FTCVAVSALG NIEATTTVIA KELPPPPVNI VVSSVTSESV VITWKPPKYN EAINKYVVNY 181 RLKSENKDDA ALFAKDSRAE DESDSERYSE GRSSRGKTME TLENSLVIDG LVAFQTYEFT 241 VRSAGPVGVG LESLPVEAQT KPSKPATAPV SPQARSLNRD SILVKWGPCE QPNGLITGYK 301 VYYTNDLVTT PIREWKQHDA KSDEFMTTIN GLEPDSRYFV RVIAQNSEGD SPLSTLVTVA 361 TRQGIPGQPP MLTVKALDSR RMQLTWDKPL YSSPVVGYTV RYNTSDGEKE LTLTSPHEKH 421 VVTGLHPDKY YYFRVAAYSD RGQGEFTEPM ISKTIASRGS EEESTAPVPS APRNFNAELT 481 SATSVKLTWD APAAANGALL GYYVYLDRMV NGEPVVEKGS KKRIVMIRDS SKRYFELDSL 541 DPNTEYSFRL NAFNRNGDGE FSERKSIITQ GIPPEAPEIV SVSLDRDEPP VVARIEWKMP 601 KMKPNETPIE KYNLWLRAQG YPDSYVKAKT VDGTDLSTTI SGLWMGVVYD VLLAAENREG 661 RSQNATETIA TPVGSPDGEP IDVQYEVMKG KIVVSWRPPS EEKRNGNITS YKAILSAMDA 721 TADRYEQPVP APSTSSTFEV NVRRAYLFKV AAATMKGIGP YSPVLTINPD PAALVGPPTN 781 VRVEATSNST AVVQWDFESQ KADSFVVKYM HEPGNRMDTE KWKQLPVVSI DKENPKRFAV 841 VSDLNAHKPY AFCVLAVKNN RQGPCSDPPT VLESVTPTYM VQNLRVLWKT SNSVQLTWEY 901 NGPRNVGFYV NHTGRKDYVN HELQEKTMST PGFGQDVDEK HREYLWTNLR PHMMYTIHVG 961 VRTLPPGARK YWPQEVVTIT DPTGPPFVDV PKLVDSSGTQ PGQQMIRLTP ATEEYGPISH 1021 YWIILVPANY STEDVVNLDP IELEKATAEK RAQLARSLSV SPSKKLKRKA SEVGDDSQSA 1081 SYHPKEKRAR RATVPGAYVT ARLSADRVKQ QYRNNQPFIV GDSQLYDGFT NYPLEHNLHY 1141 RLMMRAFAKN DVRTKDSFEQ RAPMSEKLSR MYSDSVLTEP FTIKSALRGA SQKSSPWVGA 1201 CIAFLVLFSI VGMLICWWLR CNKKSAGRHP RHGSITKVAL TGNIMNGGGG IPGETSKLLS 1261 TSNEYGRQIM NPYEQMNGNH HMESSMDLYP LPTSHSRSNG YAPVPVAIPS LPNNGNNMTT 1321 VSHPAVPIAE LANHIERLRM NNNAGFQSEF ESIETGQHFT WEHSSADMNK HKNRYANVAA 1381 YDHSRVVLSN VEGYPGMDYI NANYVDGYDK PRSYIATQGP LPETFSDFWR MVWEEQSVTI 1441 VMLTNLEERS RVKCDQYWPS RGTATYGDIE VTLLESVHLA HYTMRTMRLK MVGEPEVREI 1501 KHLQYTAWPD HGVPDHPTPF LIFLKRVKTL NPNDAGPIIS HCSAGIGRTG AFIVIDCMLE 1561 RLRYDNTVDI YGCVTALRAQ RSYMVQTEEQ YIFIHDAVLD AVNSGSTEVP ASRLHQHLHI 1621 LSQPSADQLS GIDMEFRHLT TLKWTSNRCT VANLPVNRPK NRMLSAVPYD SNRVIMRLLP 1681 GADGSDYINA SWIDGYKERG AYIATQAPTN ETAADFWRAI WEHNSPIIAM LVRTNERGQE 1741 QCSDYWPLET GVQVGMLVVE PMAEYDMKHY HLREFRISDI NTREVRTVRQ FHFMEWPDVG 1801 KPHTADHFLD FVTQVHNTYA QFGCTGPITV HCCSGAGRTA VFIALSIILD RMRAEHVVDV 1861 FTTVKLLRTE RQNMIQEPEQ YHFLYLAAYE YLAAYDNFS //