LOCUS BCA67019.1 1349 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli AidA-I family adhesin protein. ACCESSION AP022811-343 PROTEIN_ID BCA67019.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5283470) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 117.0X Sequencing Technology :: Illumina MiSeq, PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="1999" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST02" protein /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:ADD55052.1" /locus_tag="JE86ST02C_03430" /note="DFAST-ECOLI:ADD55052.1 AidA-I family adhesin [pid:94.9%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_344" /transl_table=11 BEGIN 1 MAFNALLFMQ SWFYLDVLLE IVMNKIYRLK WNRSRNCWSV CSELGSRVKG KKSRAVLISA 61 ISLYSSLVFA DDVIVNQDKT IDFGKENQSI DYRITVTDNA NLVINTTDTS RPRLTLASGG 121 GLDITGGKVT INGPLNFLLK GTGFLNVSNA GSELYADDLY ESNSGMRHDQ GYFNVSNGGK 181 IHVKGTSRLT YTQGNVSGEG SQVNAGTFFM GVYGGYGGNQ FLSVNNGGEV NAREHISLGY 241 YDQRSDTTLV VSDGGKISAP KISLSTNSEL ALGAQEGSAA KAAGIIDAEK IEFVWAKTSD 301 KKITLNHTDK NATISADIVS GSEGLGYINA LNGTTYLTGD NSAFSGKVKI EQNGALGITQ 361 NIGTAEINNR GKLHLKADDS MTFANKISGN GTISIDSGTV ALTGNNYAFS GYIDVASGAV 421 AVISEDKNIG RADLDVDGKL QINANKDWVF DNDLQGRGIV EINMGNHEFS FDEFAYTDWF 481 QGSLAFQNTT FNLEKNAEFL QRGGITAGQG SQVTVGKGAH SISTLGFSGG TVDFGALTAG 541 AQMTEGTVNV SKTLDLRGEG VIQVSDSDVV SSVSRDIDSA LSLTEVDDGN STIKLVDAQG 601 AEVLGDAGNL QLQDKNGQIL SSSAQRDIQQ NGQKAAVGTY DYRLTSGVNN DGLYIGYGLT 661 QLDLHATDSD ALVLSSNGKS ENAADLSAKI TGSGDLAFSS QKGQTVSLSN KDNDYTGVTD 721 LRSGTLLLNN DNVLGNTHEL RLAAETELDM NGHSQTVGTL NGSADSLLSL NGGSLTVTNG 781 GTSTGSLTGS GELNIQGGTL DIAGDNSNLT ANVNIANSAN VLVSHAQGLG SANVENNGTL 841 ALNNSAEKRA AASVNYTLGG NLTNNGTLMT GMSGQQAGNV LVVKGNYHGN NGQLVMNTVL 901 NGDDSVTDKL VVEGDTSGTT AVTVNNAGGT GAKTLNGIEL IHVDGKSEGE FVQAGRIVAG 961 AYDYTLARGQ GANSGNWYLT SGSDSPELQP EPDPMPNPEP NPNPEPNPNP TPTPGPDLNV 1021 DNDLRPEAGS YIANLAAANT MFTTRLHERL GNTYYTDMVT GEQKQTTMWM RHEGGHNKWR 1081 DGSGQLKTQS NRYVLQLGGD VAQWSQNGSD SWHVGVMAGY GNSDSKTISS RTGYRAKASV 1141 NGYSTGLYAT WYADDESRNG AYLDSWAQYS WFDNTVKGDD LQSESYKSKG FTASLEAGYK 1201 HKLAEFNGSQ GTRNEWYVQP QAQVTWIGVK ADKHRESNGT LVHSNGDGNV QTRLGVKTWL 1261 KSHHKMDDGK SREFQPFVEV NWLHNSKDFS TSMDGVSVTQ DGARNIAEIK TGVEGQLNAN 1321 LNVWGNVGVQ VADRGYNDTS AMVGIKWQF //