LOCUS       BCA68373.1 495 aa PRT BCT 06-NOV-2020
DEFINITION  Escherichia coli aldehyde dehydrogenase protein.
ACCESSION   AP022811-1697
PROTEIN_ID  BCA68373.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 5283470)
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Direct Submission
  JOURNAL   Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases.
            Contact:Ken-ichi Lee
            National Institute of Infectious Diseases, Department of
            Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S.
  TITLE     Global distribution of epidemic-related Shiga toxin 2 encoding
            phages among enteroaggregative Escherichia coli
  JOURNAL   Sci. Rep. 10, 11738 (2020)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-020-68462-9
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method :: Unicycler v. 0.4.7
            Genome Coverage :: 117.0X
            Sequencing Technology :: Illumina MiSeq, PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /collection_date="1999"
                     /db_xref="taxon:562"
                     /host="Homo sapiens"
                     /mol_type="genomic DNA"
                     /organism="Escherichia coli"
                     /strain="JE86-ST02"
     protein         /EC_number="1.2.1.3"
                     /gene="puuC"
                     /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator"
                     /inference="similar to AA sequence:INSD:AAC74382.1"
                     /locus_tag="JE86ST02C_16970"
                     /note="DFAST-ECOLI:AAC74382.1 aldehyde dehydrogenase [pid:98.0%, q_cov:100.0%, s_cov:100.0%, Eval:2.1e-281]"
                     /note="MGA_1698"
                     /transl_table=11
BEGIN
        1 MNFHHLAYWQ DKALSLAIEN RLFINGEYTA AAENETFETV DPVTQAPLAK IARGKSVDID
       61 RAVSAARGVF ERGDWSLSSP AKRKAVLNKL ADLMEAHAEE LALLETLDTG KPIRHSLRDD
      121 IPGAARAIRW YAEAIDKVYG EVATTSSHEL AMIVREPVGV IAAIVPWNFP LLLTCWKLGP
      181 ALAAGNSVIL KPSEKSPLSA IRLAGLAKEA GLPDGVLNVV TGFGHEAGQA LSRHNDIDAI
      241 AFTGSTRTGK QLLKDAGDSN MKRVWLEAGG KSANLVFADC PDLQKAASAT AAGIFYNQGQ
      301 VCIAGTRLLL EESIADEFLA LLKQQAQNWQ PGHPLDPATT MGTLIDCAHA DSVHSFIREG
      361 ESKGQLLLDG RNAELAAAIG PTIFVDVDPN ASLSREEIFG PVLVVTRFTS EDQALQLAND
      421 SQYGLGAAVW TRDLSRAHRM SRRLKAGSVF VNNYNDGDMT VPFGGYKQSG NGRDKSLHAL
      481 EKFTELKTIW ISLEA
//