LOCUS BCA76624.1 500 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli proline/glycine betaine transporter protein. ACCESSION AP022815-4654 PROTEIN_ID BCA76624.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /gene="proP" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN83540.1" /locus_tag="JE86ST05C_46530" /note="DFAST-ECOLI:AAN83540.1 proline/glycine betaine transporter [pid:98.8%, q_cov:100.0%, s_cov:97.8%, Eval:8.3e-286]" /note="MGA_4667" /transl_table=11 BEGIN 1 MLKRKKVKPI TLRDVTIIDD GKLRKAITAA SLGNAMEWFD FGVYGFVAYA LGKVFFPGAD 61 PSVQMVAALA TFSVPFLIRP LGGLFFGMLG DKYGRQKILA ITIVIMSIST FCIGLIPSYD 121 TIGIWAPILL LICKMAQGFS VGGEYTGASI FVAEYSPDRK RGFMGSWLDF GSIAGFVLGA 181 GVVVLISTIV GEANFLDWGW RIPFFIALPL GIIGLYLRHA LEETPAFQQH VDKLEQGDRE 241 GLQDGPKVSF KEIATKHWRS LLTCIGLVIA TNVTYYMLLT YMPSYLSHNL HYSEDHGVLI 301 IIAIMIGMLF VQPVMGLLSD RFGRRPFVLL GSVALFVLAI PAFILINSNV IGLIFAGLLM 361 LAVILNCFTG VMASTLPAMF PTHIRYSALA AAFNISVLVA GLTPTLAAWL VESSQNLMMP 421 AYYLMVVAVI GLITGVTMKE TANRPLKGAT PAASDIQEAK EILVEHYDNI EQKIDNIDHE 481 IADLQAKRTR LVQQHPRIDE //