LOCUS BCA73522.1 955 aa PRT BCT 06-NOV-2020 DEFINITION Escherichia coli adhesin protein. ACCESSION AP022815-1552 PROTEIN_ID BCA73522.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5327513) AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Direct Submission JOURNAL Submitted (21-FEB-2020) to the DDBJ/EMBL/GenBank databases. Contact:Ken-ichi Lee National Institute of Infectious Diseases, Deaprtment of Bacteriology I; 1-23-1,Toyama, Shinjuku, Tokyo 162-8640, Japan REFERENCE 2 AUTHORS Kimata,K., Lee,K., Watahiki,M., Isobe,J., Ohnishi,M. and Iyoda,S. TITLE Global distribution of epidemic-related Shiga toxin 2 encoding phages among enteroaggregative Escherichia coli JOURNAL Sci. Rep. 10, 11738 (2020) REMARK Publication Status: Online-Only DOI:10.1038/s41598-020-68462-9 COMMENT ##Genome-Assembly-Data-START## Assembly Method :: Unicycler v. 0.4.7 Genome Coverage :: 135.0X Sequencing Technology :: Illumina MiSeq, Oxford Nanopore MinION ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2014" /db_xref="taxon:562" /host="Homo sapiens" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="JE86-ST05" protein /gene="ycgV" /inference="COORDINATES:ab initio prediction:MetaGeneAnnotator" /inference="similar to AA sequence:INSD:BAG76775.1" /locus_tag="JE86ST05C_15520" /note="DFAST-ECOLI:BAG76775.1 adhesin [pid:94.5%, q_cov:100.0%, s_cov:100.0%, Eval:0.0e+00]" /note="MGA_1553" /transl_table=11 BEGIN 1 MGIKQHNGNT KADRLAELNI RSPSIQLIKF GAIGLNAIIF SPLLIAADTG SQYGTNITIN 61 DGDRITGDTA DPSGNLYGVM TPAGNTPGNI NLGNDVTVNV NDASGYAKGI IIQGKNSSLT 121 ANRLTVGVVG QTSAIGINLI GDYTHADLGT GSTIKSNDDG IIIGHSSTLT ATQFTIENSN 181 GIGLTINDYG TSVDLGSGSK IKTDGSTGVY IGGLNGNNAN GAARFTATDL TIDVQGYSAM 241 GINVQKNSVV DLGTNSSIKT SGDNAHGLWS FGQVSANALT VDVTGAAANG VEVRGGTTTI 301 GADSHISSAQ GGGLVASGSD ATINFSGTAA QRNSIFSGGS YGASAQTATA VINMQNTDIT 361 VDRNGSLALG LWALSGGRIT GDSLAITGAA GARGIYAMTN SQIDLTSDLV IDMSTPDQMA 421 IATQHDDGYA ASRINASGRM LINGSVLSKG GLINLDMHPG SVWTGSSLSD NVNGGKLDVA 481 MNNSVWNVTS NSNLDTLALS HSTVDFASHG STAGTFATLN VENLSGNSTF IMRADVVGEG 541 NGVNNKGDLL NISGSSAGNH VLAIRNQGSE ATTGNEVLTV VKTTDGAASF SASSQVELGG 601 YLYDVRKNGT NWELYASGTV PEPTPNPEPT PAPAQPPIVN PDPTPEPAPT PKPTTTADAG 661 GNYLNVGYLL NYVENRTLMQ RMGDLRNQSK DGNIWLRSYG GSLDSFASGK LSGFDMGYSG 721 IQFGGDKRLS DVMPLYVGLY IGSTHASPDY SGGDGTARSD YMGMYASYMA QNGFYSDLVI 781 KASRQKNSFH VLDSQNNGVN ANGTANGMSI SLEAGQRFNL SPTGYGFYIE PQTQLTYSHQ 841 NEMAMKASNG LNIHLNHYES LLGRASMILG YDITAGNSQL NIYVKTGAIR EFSGDTEYLL 901 NNSREKYSFK GNGWNNGVGV SAQYNKQHTF YLEADYTQGN LFDQKQVNGG YRFSF //