LOCUS BAL38882.1 648 aa PRT BCT 04-JUN-2019 DEFINITION Escherichia coli str. K-12 substr. MDS42 conserved protein protein. ACCESSION AP012306-1562 PROTEIN_ID BAL38882.1 SOURCE Escherichia coli str. K-12 substr. MDS42 ORGANISM Escherichia coli str. K-12 substr. MDS42 Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 3976195) AUTHORS Ying,B.W., Seno,S., Suzuki,S. and Yomo,T. TITLE Direct Submission JOURNAL Submitted (30-NOV-2011) to the DDBJ/EMBL/GenBank databases. Contact:Shigeto Seno Osaka University, Graduate School of Information Science and Technology; Yamadaoka 1-5, Suita, Osaka 565-0871, Japan REFERENCE 2 AUTHORS Fredens,J., Wang,K., de la Torre,D., Funke,L.F.H., Robertson,W.E., Christova,Y., Chia,T., Schmied,W.H., Dunkelmann,D.L., Beranek,V., Uttamapinant,C., Llamazares,A.G., Elliott,T.S. and Chin,J.W. TITLE Total synthesis of Escherichia coli with a recoded genome JOURNAL Nature 569, 514-518 (2019) REMARK DOI:10.1038/s41586-019-1192-5 COMMENT MDS42 is a genome reduced strain constructed from MG1655 by Posfai et al (originally reported in Science 2006), and is commercially available from Scarab Genomics (the founder is Dr. Frederick R. Blattner). We re-confirmed the whole genome sequence of MDS42 by the next generation resequencing technology based on SOLiD3 (Applied Biosystems) ##Genome-Assembly-Data-START## Assembly Method :: Bioscope v. 1.3 Genome Coverage :: 300x Sequencing Technology :: SOLiD 3 ##Genome-Assembly-Data-END## FEATURES Qualifiers source /db_xref="taxon:1110693" /mol_type="genomic DNA" /organism="Escherichia coli str. K-12 substr. MDS42" /strain="K-12" /sub_strain="MDS42" protein /gene="yegI" /gene_synonym="ECK2064" /gene_synonym="JW2055" /gene_synonym="b2070" /locus_tag="ECMDS42_1651" /transl_table=11 BEGIN 1 MKTNIKVFTS TGELTTLGRE LGKGGEGAVY DIEEFVDSVA KIYHTPPPAL KQDKLAFMAA 61 TADAQLLNYV AWPQATLHGG RGGKVIGFMM PKVSGKEPIH MIYSPAHRRQ SYPHCAWDFL 121 LYVARNIASS FATVHEHGHV VGDVNQNSFM VGRDSKVVLI DSDSFQINAN GTLHLCEVGV 181 SHFTPPELQT LPSFVGFERT ENHDNFGLAL LIFHVLFGGR HPYSGVPLIS DAGNALETDI 241 THFRYAYASD NQRRGLKPPP RSIPLSMLPS DVEAMFQQAF TESGVATGRP TAKAWVAALD 301 SLRQQLKKCI VSAMHVYPAH LTDCPWCALD NQGVIYFIDL GEEVITTGGN FVLAKVWAMV 361 MASVAPPALQ LPLPDHFQPT GRPLPLGLLR REYIILLEIA LSALSLLLCG LQAEPRYIIL 421 VPVLAAIWII GSLTSKAYKA EVQQRREAFN RAKMDYDHLV RQIQQVGGLE GFIAKRTMLE 481 KMKDEILGLP EEEKRALAAL HDTARERQKQ KFLEGFFIDV ASIPGVGPAR KAALRSFGIE 541 TAADVTRRGV KQVKGFGDHL TQAVIDWKAS CERRFVFRPN EAITPADRQA VMAKMTAKRH 601 RLESALTVGA TELQRFRLHA PARTMPLMEP LRQAAEKLAQ AQADLSRC //