LOCUS BCA67668.1 758 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli ATP-binding component of serine protease protein. ACCESSION AP022811-992 PROTEIN_ID BCA67668.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /gene="clpA" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:AAN79491.1" /locus_tag="JE86ST02C_09920" /note="DFAST-ECOLI:AAN79491.1 ATP-binding component of serine protease [pid:99.9%, q_cov:100.0%, s_cov:99.3%, Eval:0.0e+00]" /note="MGA_993" /transl_table=11 BEGIN 1 MLNQELELSL NMAFARAREH RHEFMTVEHL LLALLSNPSA REALEACSVD LVALRQELEA 61 FIEQTTPVLP ASEEERDTQP TLSFQRVLQR AVFHVQSSGR NEVTGANVLV AIFSEQESQA 121 AYLLRKHEVS RLDVVNFISH GTRKDEPTQS SDPGSQPNSE EQAGGEERME NFTTNLNQLA 181 RVGGIDPLIG REKELERAIQ VLCRRRKNNP LLVGESGVGK TAIAEGLAWR IVQGDVPEVM 241 ADCTIYSLDI GSLLAGTKYR GDFEKRFKAL LKQLEQDTNS ILFIDEIHTI IGAGAASGGQ 301 VDAANLIKPM LSSGKIRVIG STTYQEFSNI FEKDRALARR FQKIDITEPS IEETVQIING 361 LKPKYEAHHD VRYTAKAVRA AVELAVKYIN DRHLPDKAID VIDEAGARAR LMPVSKRKKT 421 VNVADIESVV ARIARIPEKS VSQSDRDTLK NLGDRLKMLV FGQDKAIEAL TEAIKMARAG 481 LGHEHKPVGS FLFAGPTGVG KTEVTVQLSK ALGIELLRFD MSEYMERHTV SRLIGAPPGY 541 VGFDQGGLLT DAVIKHPHAV LLLDEIEKAH PDVFNILLQV MDNGTLTDNN GRKADFRNVV 601 LVMTTNAGVR ETERKSIGLI HQDNSTDAME EIKKIFTPEF RNRLDNIIWF DHLSTDVIHQ 661 VVDKFIVELQ VQLDQKGVSL EVSQEARNWL AEKGYDRAMG ARPMARVIQD NLKKPLANEL 721 LFGSLVDGGQ VTVALDKEKN ELTYGFQSAQ KHKAEAAH //