LOCUS BAL40555.1 631 aa PRT BCT 04-JUN-2019 DEFINITION Escherichia coli str. K-12 substr. MDS42 thiamin (pyrimidine moiety) biosynthesis protein protein. ACCESSION AP012306-3235 PROTEIN_ID BAL40555.1 SOURCE Escherichia coli str. K-12 substr. MDS42 ORGANISM Escherichia coli str. K-12 substr. MDS42 Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 3976195) AUTHORS Ying,B.W., Seno,S., Suzuki,S. and Yomo,T. TITLE Direct Submission JOURNAL Submitted (30-NOV-2011) to the DDBJ/EMBL/GenBank databases. Contact:Shigeto Seno Osaka University, Graduate School of Information Science and Technology; Yamadaoka 1-5, Suita, Osaka 565-0871, Japan REFERENCE 2 AUTHORS Fredens,J., Wang,K., de la Torre,D., Funke,L.F.H., Robertson,W.E., Christova,Y., Chia,T., Schmied,W.H., Dunkelmann,D.L., Beranek,V., Uttamapinant,C., Llamazares,A.G., Elliott,T.S. and Chin,J.W. TITLE Total synthesis of Escherichia coli with a recoded genome JOURNAL Nature 569, 514-518 (2019) REMARK DOI:10.1038/s41586-019-1192-5 COMMENT MDS42 is a genome reduced strain constructed from MG1655 by Posfai et al (originally reported in Science 2006), and is commercially available from Scarab Genomics (the founder is Dr. Frederick R. Blattner). We re-confirmed the whole genome sequence of MDS42 by the next generation resequencing technology based on SOLiD3 (Applied Biosystems) ##Genome-Assembly-Data-START## Assembly Method :: Bioscope v. 1.3 Genome Coverage :: 300x Sequencing Technology :: SOLiD 3 ##Genome-Assembly-Data-END## FEATURES Qualifiers source /db_xref="taxon:1110693" /mol_type="genomic DNA" /organism="Escherichia coli str. K-12 substr. MDS42" /strain="K-12" /sub_strain="MDS42" protein /gene="thiC" /gene_synonym="ECK3986" /gene_synonym="JW3958" /gene_synonym="b3994" /locus_tag="ECMDS42_3432" /transl_table=11 BEGIN 1 MSATKLTRRE QRARAQHFID TLEGTAFPNS KRIYITGTHP GVRVPMREIQ LSPTLIGGSK 61 EQPQYEENEA IPVYDTSGPY GDPQIAINVQ QGLAKLRQPW IDARGDTEEL TVRSSDYTKA 121 RLADDGLDEL RFSGVLTPKR AKAGRRVTQL HYARQGIITP EMEFIAIREN MGRERIRSEV 181 LRHQHPGMSF GAHLPENITA EFVRDEVAAG RAIIPANINH PESEPMIIGR NFLVKVNANI 241 GNSAVTSSIE EEVEKLVWST RWGADTVMDL STGRYIHETR EWILRNSPVP IGTVPIYQAL 301 EKVNGIAEDL TWEAFRDTLL EQAEQGVDYF TIHAGVLLRY VPMTAKRLTG IVSRGGSIMA 361 KWCLSHHQEN FLYQHFREIC EICAAYDVSL SLGDGLRPGS IQDANDEAQF AELHTLGELT 421 KIAWEYDVQV MIEGPGHVPM QMIRRNMTEE LEHCHEAPFY TLGPLTTDIA PGYDHFTSGI 481 GAAMIGWFGC AMLCYVTPKE HLGLPNKEDV KQGLITYKIA AHAADLAKGH PGAQIRDNAM 541 SKARFEFRWE DQFNLALDPF TARAYHDETL PQESGKVAHF CSMCGPKFCS MKISQEVRDY 601 AATQTIEMGM ADMSENFRAR GGEIYLRKEE A //