LOCUS CCD66907.1 1331 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Alpha,alpha-trehalose-phosphate synthase [UDP-forming] 1 protein. ACCESSION BX284606-4079 PROTEIN_ID CCD66907.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="tps-1" /locus_tag="CELE_ZK54.2" /standard_name="ZK54.2a" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00006602" /db_xref="WormBase:WBGene00006602" intron_pos 32:0 (1/20) intron_pos 57:0 (2/20) intron_pos 170:0 (3/20) intron_pos 201:0 (4/20) intron_pos 275:0 (5/20) intron_pos 324:0 (6/20) intron_pos 384:0 (7/20) intron_pos 426:2 (8/20) intron_pos 453:0 (9/20) intron_pos 487:0 (10/20) intron_pos 539:0 (11/20) intron_pos 613:0 (12/20) intron_pos 650:0 (13/20) intron_pos 877:0 (14/20) intron_pos 920:0 (15/20) intron_pos 983:0 (16/20) intron_pos 1019:2 (17/20) intron_pos 1116:0 (18/20) intron_pos 1194:0 (19/20) intron_pos 1262:0 (20/20) BEGIN 1 MTDTATGVHS NANGVEKVPT PVFSIEGEPT QETAPTRMDP FDRPKNDNDP FEDALKRCHK 61 ILEKLDCPFV TGKEKDLDES DDMTENEDHD EMANEDDGIP SNEKKVETRK MDCTSGQLLA 121 PKLPEKAESI SSASESSEDS ESVAILKYTV RTCYAIWKKR QKNSEIALKG LMIVLELCLS 181 QPSARDDAFS ALLETLGYNT VTFWKAVVPQ IYNSDLSYAT QYREALLFSL VLYDVNHSKN 241 RLRELYAAVP GVRQSMLGIR AKQFGERYRH LQMKIARSRA SSRMSSKMGS EENLPAMASM 301 MNDVVFDEEP HTQSPLVDMS HDKQRVINVS NAPPVSISRK TSGSWEIKQG SGGLVACVDP 361 VMSADKKNIW LSNLGVNMQE ELKEHSTSTN SLGLPLIKQA CAGEVFCVLE RNEKKEELTP 421 KQQAVESDMS LLSVLNTYNK HSYQLNPVVV NQDDYNTYYG GISNGLLWPA LHNLPQYISP 481 CFDDPELLRE QWCAYVRVNY LFAINAARNS RAQDFIWIHD YHLMLCGQIM RSLESSLDIG 541 FFIHIPFQPP ANFMTKYKTV ADPIMRALLR FTKVGFQTSR DRDTFVKLVA KHIKRTKIEY 601 DSRLDRYTIE HDGWTCSLGV FPVSIKIADF VNIAKNPQTI IEAEEIKKQI MGRSADGGQL 661 FFSVERFDYT KGISEKLRAW QRYFEKYPDR IGKDVLFQVA VTNRRSVDSY RQYQDDVLAV 721 ADLINQKFKS EDYPEWKPVI FETDGLPRTR LIAHYLAMDI GVVTPSKDGM NLVAKEMLVC 781 NPTASLVLST GAGTEVQLSN AQFYSEQEGK CYHRVEEISN TEAFADNFFA AATESKETRT 841 KHGEKINQFL CVHDIDEWSD QFLDPKWTHE VISQCEVKQL GQFYGLMSRT AQVRRQIVEC 901 VLKGLPIRPH FRYSLENAKN SLESSCKSGT KLSLEADEES GEEKGFEITY DIHDELSEME 961 KDLAFLAFIQ SDEYENAEEF IKTLGSFYEG GPVLFKNEVK QAAEMLQQGI HYNTFFTDRD 1021 GTLKSYACSY PTSVQPAYSA VIQAQFARRC ATFCAIVTTA PLLHTGILEV ATIPEGYYAY 1081 GASAGREWYL NPAQQFKDRS FSAIDLNLMN KVFDIIEELL ERPEFRIFKW IGSGIQKHCG 1141 HITIAKQDVN GTIPSRKVIR LYEQLVKIVN DFDPNGTTLT MRESDLDFKI YVKAKLKGRI 1201 FNKGHGIRLV RERLKPNMSK GNCLVCGDNE SDIPMLEECL KLAGSKVYTI WVTADTNLQE 1261 KVTQLCDRFS CSNIHFVSCP QVLLGAMAYA TAHTLVDERN RKLDYYYDSD TPMDQEESST 1321 LGASLGTSFG N //