LOCUS BCA71221.1 529 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli IMP cyclohydrolase and phosphoribosylaminoimidazole carboxamide formyltransferase protein. ACCESSION AP022811-4545 PROTEIN_ID BCA71221.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /EC_number="2.1.2.3" /gene="purH" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAC76980.1" /locus_tag="JE86ST02C_45440" /note="DFAST-ECOLI:AAC76980.1 IMP cyclohydrolase and phosphoribosylaminoimidazolecarboxamide formyltransferase [pid:99.4%, q_cov:100.0%, s_cov:100.0%, Eval:8.0e-303]" /note="MGA_4555" /transl_table=11 BEGIN 1 MQQRRPVRRA LLSVSDKAGI VEFAQALSAR GVELLSTGGT ARLLAEKGLP VTEVSDYTGF 61 PEMMDGRVKT LHPKVHGGIL GRRGQDDAIM EEHQIQPIDM VVVNLYPFAQ TVAREGCSLE 121 DAVENIDIGG PTMVRSAAKN HKDVAIVVKS SDYDAIIKEM DDNEGSLTLA TRFDLAIKAF 181 EHTAAYDSMI ANYFGSMVPA YHGESKEAAG RFPRTLNLNF IKKQDMRYGE NSHQQAAFYI 241 EENVKEASVA TATQVQGKAL SYNNIADTDA ALECVKEFAE PACVIVKHAN PCGVAIGNSI 301 LDAYDRAYKT DPTSAFGGII AFNRELDAET AQAIISRQFV EVIIAPSASE EALKITAAKQ 361 NVRVLTCGQW GERVPGLDFK RVNGGLLVQD RDLGMVGAEE LRVVTQRQPT EQELRDALFC 421 WKVAKFVKSN AIVYAKNNMT IGIGAGQMSR VYSAKIAGIK AADEGLEVKG SSMASDAFFP 481 FRDGIDAAAA AGVTCVIQPG GSIRDDEVIA AADEHGIAML FTDMRHFRH //