LOCUS       BAL38581.1              1538 aa    PRT              BCT 04-JUN-2019
DEFINITION  Escherichia coli str. K-12 substr. MDS42 predicted ATP-
            dependent helicase protein.
ACCESSION   AP012306-1261
PROTEIN_ID  BAL38581.1
SOURCE      Escherichia coli str. K-12 substr. MDS42
  ORGANISM  Escherichia coli str. K-12 substr. MDS42
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 3976195)
  AUTHORS   Ying,B.W., Seno,S., Suzuki,S. and Yomo,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-NOV-2011) to the DDBJ/EMBL/GenBank databases.
            Contact:Shigeto Seno
            Osaka University, Graduate School of Information Science and
            Technology; Yamadaoka 1-5, Suita, Osaka 565-0871, Japan
REFERENCE   2
  AUTHORS   Fredens,J., Wang,K., de la Torre,D., Funke,L.F.H., Robertson,W.E.,
            Christova,Y., Chia,T., Schmied,W.H., Dunkelmann,D.L., Beranek,V.,
            Uttamapinant,C., Llamazares,A.G., Elliott,T.S. and Chin,J.W.
  TITLE     Total synthesis of Escherichia coli with a recoded genome
  JOURNAL   Nature 569, 514-518 (2019)
  REMARK    DOI:10.1038/s41586-019-1192-5
COMMENT     MDS42 is a genome reduced strain constructed from MG1655 by Posfai
            et al (originally reported in Science 2006), and is commercially
            available from Scarab Genomics (the founder is Dr. Frederick R.
            Blattner). We re-confirmed the whole genome sequence of MDS42 by
            the next generation resequencing technology based on SOLiD3
            (Applied Biosystems)
            
            ##Genome-Assembly-Data-START##
            Assembly Method       :: Bioscope v. 1.3
            Genome Coverage       :: 300x
            Sequencing Technology :: SOLiD 3
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /db_xref="taxon:1110693"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli str. K-12 substr. MDS42"
                     /strain="K-12"
                     /sub_strain="MDS42"
     protein         /gene="lhr"
                     /gene_synonym="ECK1649"
                     /gene_synonym="JW1645"
                     /gene_synonym="b1653"
                     /locus_tag="ECMDS42_1324"
                     /transl_table=11
BEGIN
        1 MADNPDPSSL LPDVFSPATR DWFLRAFKQP TAVQPQTWHV AARSEHALVI APTGSGKTLA
       61 AFLYALDRLF REGGEDTREA HKRKTSRILY ISPIKALGTD VQRNLQIPLK GIADERRRRG
      121 ETEVNLRVGI RTGDTPAQER SKLTRNPPDI LITTPESLYL MLTSRARETL RGVETVIIDE
      181 VHAVAGSKRG AHLALSLERL DALLHTSAQR IGLSATVRSA SDVAAFLGGD RPVTVVNPPA
      241 MRHPQIRIVV PVANMDDVSS VASGTGEDSH AGREGSIWPY IETGILDEVL RHRSTIVFTN
      301 SRGLAEKLTA RLNELYAARL QRSPSIAVDA AHFESTSGAT SNRVQSSDVF IARSHHGSVS
      361 KEQRAITEQA LKSGELRCVV ATSSLELGID MGAVDLVIQV ATPLSVASGL QRIGRAGHQV
      421 GGVSKGLFFP RTRRDLVDSA VIVECMFAGR LENLTPPHNP LDVLAQQTVA AAAMDALQVD
      481 EWYSRVRRAA PWKDLPRRVF DATLDMLSGR YPSGDFSAFR PKLVWNRETG ILTARPGAQL
      541 LAVTSGGTIP DRGMYSVLLP EGEEKAGSRR VGELDEEMVY ESRVNDIITL GATSWRIQQI
      601 TRDQVIVTPA PGRSARLPFW RGEGNGRPAE LGEMIGDFLH LLADGAFFSG TIPPWLAEEN
      661 TIANIQGLIE EQRNATGIVP GSRHLVLERC RDEIGDWRII LHSPYGRRVH EPWAVAIAGR
      721 IHALWGADAS VVASDDGIVA RIPDTDGKLP DAAIFLFEPE KLLQIVREAV GSSALFAARF
      781 RECAARALLM PGRTPGHRTP LWQQRLRASQ LLEIAQGYPD FPVILETLRE CLQDVYDLPA
      841 LERLMRRLNG GEIQISDVTT TTPSPFATSL LFGYVAEFMY QSDAPLAERR ASVLSLDSEL
      901 LRNLLGQVDP GELLDPQVIR QVEEELQRLA PGRRAKGEEG LFDLLRELGP MTVEDLAQRH
      961 TGSSEEVASY LENLLAVKRI FPAMISGQER LACMDDAARL RDALGVRLPE SLPEIYLHRV
     1021 SYPLRDLFLR YLRAHALVTA EQLAHEFSLG IAIVEEQLQQ LREQGLVMNL QQDIWVSDEV
     1081 FRRLRLRSLQ AAREATRPVA ATTYARLLLE RQGVLPATDG SPALFASTSP GVYEGVDGVM
     1141 RVIEQLAGVG LPASLWESQI LPARVRDYSS EMLDELLATG AVIWSGQKKL GEDDGLVALH
     1201 LQEYAAESFT PAEADQANRS ALQQAIVAVL ADGGAWFAQQ ISQRIRDKIG ESVDLSALQE
     1261 ALWALVWQGV ITSDIWAPLR ALTRSSSNAR TSTRRSHRAR RGRPVYAQPV SPRVSYNTPN
     1321 LAGRWSLLQV EPLNDTERML ALAENMLDRY GIISRQAVIA ENIPGGFPSM QTLCRSMEDS
     1381 GRIMRGRFVE GLGGAQFAER LTIDRLRDLA TQATQTRHYT PVALSANDPA NVWGNLLPWP
     1441 AHPATLVPTR RAGALVVVSG GKLLLYLAQG GKKMLVWQEK EELLAPEVFH ALTTALRREP
     1501 RLRFTLTEVN DLPVRQTPMF TLLREAGFSS SPQGLDWG
//