LOCUS VTW47540.1 1098 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans IRS-type PTB domain-containing protein protein. ACCESSION BX284606-1644 PROTEIN_ID VTW47540.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="ist-1" /locus_tag="CELE_C54D1.3" /standard_name="C54D1.3d" /note="Confirmed by transcript evidence" /db_xref="UniProtKB/TrEMBL:A0A4V0INB3" /db_xref="WormBase:WBGene00002163" intron_pos 44:1 (1/17) intron_pos 122:0 (2/17) intron_pos 168:0 (3/17) intron_pos 226:1 (4/17) intron_pos 267:2 (5/17) intron_pos 307:0 (6/17) intron_pos 369:2 (7/17) intron_pos 373:2 (8/17) intron_pos 450:2 (9/17) intron_pos 468:2 (10/17) intron_pos 542:2 (11/17) intron_pos 589:0 (12/17) intron_pos 656:0 (13/17) intron_pos 737:2 (14/17) intron_pos 791:2 (15/17) intron_pos 952:1 (16/17) intron_pos 1000:2 (17/17) BEGIN 1 MLPAEPTSGE PKNDGEYVET MFDATEEKKQ ETIAPKQEET VKKLSMAARE ERIRESRQQG 61 QLKEKRLKNP DSKGDTSHDK PTRETWKPLA VEDLPKDEDP DEFGIPEVYK CGNCLVGFAP 121 VKKKKLMFVT LTERCLELHE SEKSYRAGKA AKHMVDLSMS FNVHSEHYDA KLKKCLCLMG 181 PDETICMRPE GGILTIEGWR RAIVKLIHES RRRKMDRVPR PEDIFDAAYD VRVCLFPKNL 241 EKYVESLKTD GFTNICTVAK ELLGKKRLCL YPNTLAIVDL CIEPTAYGLP PAGFPPFRAS 301 SMFILERNTV AYYGFRENYF YVRIGKGSPY RGFELLFQVD TNEVCKEIYS RLRALADRDL 361 ENRKQESIRR PESDMHGMES LSVPSPLLHR TKLSLDSPVI TPRDRRLMLG RECLSFASLE 421 KDDSAPSSPF ANYNRPRGSL GNFQLDHLSR FDQPRGSIHS LGNIPIDRPR VSTLQSVKPG 481 KEETGLREAL LESYGHLRKK NSTQSERGEI VVPEQERKMS DNRPRLRTQD IKTVTKPELK 541 SEEPIRPRLP PLKFNANKPS GEFLLSLQRE KELMEEARKN GYDGRTHNPD GTPREIKKDF 601 LDTVCPQAAA PEVVIKEKII NNDTYTLMGP ADWGKLEDIV KDDYSDSGDS CYSSRRGTGS 661 QPTRPAASHL ACQMQNRTQS FGAKQQTFQN RLPPTVNLPD SERKISAANQ GTSNQLDLPQ 721 EDPRKRAFSL GSKNFFNLIG LNDFRRLVSK RHRTSSPNHT STSGISLNSS NASPSASSNF 781 LASSEYLEHA RTDSFGSARS SPKSLHTQRT SSPKRRSDED LISIDFSRLG KNSASDTKRF 841 PFGGGGPGGS FDYDREKREK EDNNRRDREK AAMDLKRKEE WEARELAEKK RELERKIQAK 901 KDRKLEKGRG KDREKDKDHD HDKEFRPKAD SGIADCTPSS SFSGKKDHKN DGSSDSAYMD 961 KEGLKYLADI KKRKKEQGAI DTTNSSSSSL STIVSIEDNK VTRRITETVE TKVVTADLAK 1021 ALGAIDPNRR SSACIEINRK PSICTAIMEE REGEASETDS RGEPSDRKTT AAPVTRKPAS 1081 TGSSMSPFRR LKFLSFRK //