LOCUS BCA70749.1 1616 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli adhesin protein. ACCESSION AP022811-4073 PROTEIN_ID BCA70749.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:BAG79408.1" /locus_tag="JE86ST02C_40730" /note="DFAST-ECOLI:BAG79408.1 adhesin [pid:93.4%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_4080" /transl_table=11 BEGIN 1 MNKIFKVIWN PATGNYTVTS ETAKSRGKKS GRSKLLISAL VAGGMLSSFG ALANAGNDNG 61 QGVDYGSGSA GDGWVAIGKG AKANTFMNTS GSSTAVGYDA IAEGQYSSAI GSKTHAIGGA 121 SMAFGVSAIS EGDRSIALGA SSYSLGQYSM ALGRYSKALG KLSIAMGDSS KAEGANAIAL 181 GNATKATEIM SIALGDTANA SKAYSMALGA SSVASEENAI ALGRSSVASG TDSLAFGRQS 241 LASAANAIAI GAETEAAENA TAIGNNAKAK GTNSMAMGFG SLADKVNTIA LGNGSQALAD 301 NAIAIGQGNK ADGVDAIALG NGSQSRGLNT IALGTASNAT GDKSLALGSN SSANGINSVA 361 LGADSIADLD NTVSVGNSSL KRKIVNVKNG AIKSDSYDAI NGSQLYAISD SVAKRLGGGA 421 AVDVDDGTVT APTYNLKNGS KNNVGAALAV LDENTLQWDQ TKGKYSAAHG TSSPTASVIT 481 DVADGTISAS SKDAVNGSQL KATNDDVEAN TANIATNTSN IATNTANIAT NTTNITNLTD 541 SVGDLQADAL LWNETKKAFS AAHGQDTTSK ITNVKDADLT ADSTDAVNGS QLKTTNDAVA 601 TNTTNIANNT SNIATNTTNI SNLTETVTNL GEDALKWDKD NGVFTAAHGT ETTSKITNVK 661 DGDLTTGSTD AVNGSQLKTT NDAVATNTTN IATNTTNISN LTETVTNLGE DALKWDKDNG 721 VFTAAHGNNT ANKITNILDG TVTATSSDAI NGSQLYDLSS NIATYFGGNA SVNTDGVFTG 781 PTYKIGETNY YNVGDALAAI NSSFSTSLGD ALLWDATAGK FSAKHGTNGD ASVITDVADG 841 EISDSSSDAV NGSQLHGVSS YVVDALGGGA EVNADGTITA PTYTIANADY DNVGDALNAI 901 DTTLDDALLW DADAGENGAF SAAHGKDKTA SVITNVANGA ISAASSDAIN GSQLYTTNKY 961 IADALGGDAE VNADGTITAP TYTIANAEYN NVGDALDALD DNALLWDETA NGGAGAYNAS 1021 HDGKASIITN VANGSISEDS TDAVNGSQLN ATNMMIEQNT QIINQLAGNT DATYIQENGA 1081 GINYVRTNDD GLAFNDASAQ GVGATAIGYN SVAKGDSSVA IGQGSYSDVD TGIALGSSSV 1141 SSRVIAKGSR DTSITENGVV IGYDTTDGEL LGALSIGDDG KYRQIINVAD GSEAHDAVTV 1201 RQLQNAIGAV ATTPTKYFHA NSTEEDSLAV GTDSLAMGAK TIVNGDKGIG IGYGAYVDAN 1261 ALNGIAIGSN AQVIHVNSIA IGNGSTTTRG AQTNYTAYNM DAPQNSVGEF SVGSADGQRQ 1321 ITNVAAGSAD TDAVNVGQLK VTDAQVSQNT QSITNLDNRV TNLDSRVTNI ENGIGDIVTT 1381 GSTKYFKTNT DGVDASAQGK DSVAIGSGSI AAADNSVALG TGSVATEENT ISVGSSTNQR 1441 RITNVAAGKN DTDAVNVAQL KSSEAGGVRY DTKADGSIDY SNITLGGGNG GTTRISNVSA 1501 GVNNNDAVNY AQLKQSVQET KQYTDQRMVE MDNKLSKTES KLSGGIASAM AMTGLPQAYT 1561 PGASMASIGG GTYNGESAVA LGVSMVSANG RWVYKLQGST NSQGEYSAAL GAGIQW //