LOCUS BAL38876.1 617 aa PRT BCT 04-JUN-2019 DEFINITION Escherichia coli str. K-12 substr. MDS42 predicted assembly protein protein. ACCESSION AP012306-1556 PROTEIN_ID BAL38876.1 SOURCE Escherichia coli str. K-12 substr. MDS42 ORGANISM Escherichia coli str. K-12 substr. MDS42 Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 3976195) AUTHORS Ying,B.W., Seno,S., Suzuki,S. and Yomo,T. TITLE Direct Submission JOURNAL Submitted (30-NOV-2011) to the DDBJ/EMBL/GenBank databases. Contact:Shigeto Seno Osaka University, Graduate School of Information Science and Technology; Yamadaoka 1-5, Suita, Osaka 565-0871, Japan REFERENCE 2 AUTHORS Fredens,J., Wang,K., de la Torre,D., Funke,L.F.H., Robertson,W.E., Christova,Y., Chia,T., Schmied,W.H., Dunkelmann,D.L., Beranek,V., Uttamapinant,C., Llamazares,A.G., Elliott,T.S. and Chin,J.W. TITLE Total synthesis of Escherichia coli with a recoded genome JOURNAL Nature 569, 514-518 (2019) REMARK DOI:10.1038/s41586-019-1192-5 COMMENT MDS42 is a genome reduced strain constructed from MG1655 by Posfai et al (originally reported in Science 2006), and is commercially available from Scarab Genomics (the founder is Dr. Frederick R. Blattner). We re-confirmed the whole genome sequence of MDS42 by the next generation resequencing technology based on SOLiD3 (Applied Biosystems) ##Genome-Assembly-Data-START## Assembly Method :: Bioscope v. 1.3 Genome Coverage :: 300x Sequencing Technology :: SOLiD 3 ##Genome-Assembly-Data-END## FEATURES Qualifiers source /db_xref="taxon:1110693" /mol_type="genomic DNA" /organism="Escherichia coli str. K-12 substr. MDS42" /strain="K-12" /sub_strain="MDS42" protein /gene="asmA" /gene_synonym="ECK2058" /gene_synonym="JW2049" /gene_synonym="b2064" /locus_tag="ECMDS42_1645" /transl_table=11 BEGIN 1 MRRFLTTLMI LLVVLVAGLS ALVLLVNPND FRDYMVKQVA ARSGYQLQLD GPLRWHVWPQ 61 LSILSGRMSL TAQGASQPLV RADNMRLDVA LLPLLSHQLS VKQVMLKGAV IQLTPQTEAV 121 RSEDAPVAPR DNTLPDLSDD RGWSFDISSL KVADSVLVFQ HEDDEQVTIR NIRLQMEQDP 181 QHRGSFEFSG RVNRDQRDLT ISLNGTVDAS DYPHDLTAAI EQINWQLQGA DLPKQGIQGQ 241 GSFQAQWQES HKRLSFNQIS LTANDSTLSG QAQVTLTEKP EWQLRLQFPQ LNLDNLIPLN 301 ETANGENGAA QQGQSQSTLP RPVISSRIDE PAYQGLQGFT ADILLQASNV RWRGMNFTDV 361 ATQMTNKSGL LEITQLQGKL NGGQVSLPGT LDATSINPRI NFQPRLENVE IGTILKAFNY 421 PISLTGKMSL AGDFSGADID ADAFRHNWQG QAHVEMTDTR MEGMNFQQMI QQAVERNGGD 481 VKAAENFDNV TRLDRFTTDL TLKDGVVTLN DMQGQSPVLA LTGEGMLNLA DQTCDTQFDI 541 RVVGGWNGES KLIDFLKETP VPLRVYGNWQ QLNYSLQVDQ LLRKHLQDEA KRRLNDWAER 601 NKDSRNGKDV KKLLEKM //