LOCUS BBF47121.1 961 aa PRT BCT 27-JUL-2018 DEFINITION Escherichia coli adhesin protein. ACCESSION AP018796-1481 PROTEIN_ID BBF47121.1 SOURCE Escherichia coli ORGANISM Escherichia coli Bacteria; Pseudomonadota; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Escherichia. REFERENCE 1 (bases 1 to 5470440) AUTHORS Kusumoto,M. and Akiba,M. TITLE Direct Submission JOURNAL Submitted (20-JUL-2018) to the DDBJ/EMBL/GenBank databases. Contact:Masahiro Kusumoto National Institute of Animal Health; Chuzan 2702, Kagoshima, Kagoshima 891-0105, Japan REFERENCE 2 AUTHORS Kusumoto,M., Misumi,W., Ogura,Y., Hayashi,T. and Akiba,M. TITLE Genomic analysis of colistin resistant EHEC isolated from cattle in Japan. JOURNAL Unpublished (2018) COMMENT Annotated using prokka 1.11 from http://www.vicbioinformatics.com. Annotated at DFAST https://dfast.nig.ac.jp/ ##Genome-Assembly-Data-START## Assembly Method :: RS HGAP Assembly v. 3.0 Genome Coverage :: 143x Sequencing Technology :: PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /collection_date="2001" /country="Japan" /db_xref="taxon:562" /host="Bos taurus" /isolation_source="feces" /mol_type="genomic DNA" /organism="Escherichia coli" /strain="E2855" protein /gene="ycgV" /inference="ab initio prediction:Prodigal:2.6" /inference="similar to AA sequence:INSD:BAG76775.1" /locus_tag="E2855_01516" /transl_table=11 BEGIN 1 MGIKQHNGNT KADRLAELKI RSPSIQLIKF GAIGLNAIIF SPLLIAADTG SQYGTNITIN 61 EGDRITGDTA DPSGNLYGVM TPAGNTPGNI NLGNDVTVNV NDASGYAKGI IIQGKNSSLT 121 ANRLTVDVVG QTSAIGINLI GDYTHADLGT GSTIKSNDDG IIIGHSSTLT ATQFTIENSN 181 GIGLTINDYG TSVDLGSGSK ITTDGSTGVY IGGLNGNNAN GAARFTATDL TIDVQGYSAM 241 GINVQKNSVV DLGTNSSIKT NGDNAHGLWS FGQVSANALT VDVTGAAANG VEVRGGTTTI 301 GADSHISSAQ GGGLVTSGSD ATINFSGTAA QRNSIFSGGS YGASAQTATA VVNMQNTDIT 361 VDRNGSLALG LWALSGGRIT GDSLAITGAA GARGIYAMTN SQIDLTSDLV IDMSTPDQMA 421 IATQHDDGYA ASRINASGRM LINGSVLSKG GLINLDMHPG SVWTGSSLSD NVNGGKLDVA 481 MNNSVWNVTS NSNLDTLALS HSTVDFASHG STAGTFATLN VENLSGNSTF IMRADVVGEG 541 NGVNNKGDLL NISGSSAGNH VLAIRNQGSE ATTGNEVLTV VKTTDGAASF SASSQVELGG 601 YLYDVRKNGT NWELYASGTV PEPTPNPEPT PAPAQPPIVN PDPTPEPDPT PNPTPTPKPT 661 TTADAGGNYL NVGYLLNYVE NRTLMQRMGD LRNQSKDGNI WLRSYGGSLD SFASGKLSGF 721 DMGYSGIQFG GDKRLSDVMP LYVGLYIGST HASPDYSGGD GTARSDYMGM YASYMAHNGF 781 YSDLVVKASR QKNSFHVLDS QNNGVNANGT ANGLSISLEA GQRFNLTPTG YGFYIEPQTQ 841 LTYSHQNEMA MKASNGLNIH LNHYESLLGR ASMILGYDIT AGNSQLNMYV KTGAIREFSG 901 DTDYLLNNSR EKYSFKGNGW NNGVGVSAQY NKQHTFYLEA DYTQSNLFDQ KQVNGGYRFS 961 F //