LOCUS       BAM98214.1 1216 aa PRT BCT 07-OCT-2016
DEFINITION  Helicobacter pylori OK310 cag pathogenicity island protein.
ACCESSION   AP012601-786
PROTEIN_ID  BAM98214.1
SOURCE      Helicobacter pylori OK310
  ORGANISM  Helicobacter pylori OK310
            Bacteria; Campylobacterota; Epsilonproteobacteria; Campylobacterales; Helicobacteraceae; Helicobacter.
REFERENCE   1 (bases 1 to 1591278)
  AUTHORS   Kobayashi,I. and Yahara,K.
  TITLE     Direct Submission
  JOURNAL   Submitted (07-NOV-2012) to the DDBJ/EMBL/GenBank databases.
            Contact:Koji Yahara
            University of Tokyo, GRADUATE SCHOOL OF FRONTIER SCIENCE; 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
            URL :http://www.ims.u-tokyo.ac.jp/ikobaya
REFERENCE   2
  AUTHORS   Yahara,K., Furuta,Y., Oshima,K., Yoshida,M., Azuma,T., Hattori,M., Uchiyama,I. and Kobayashi,I.
  TITLE     Chromosome Painting In Silico in a Bacterial Species Reveals Fine Population Structure
  JOURNAL   Mol. Biol. Evol. 30, 1454-1464 (2013)
  REMARK    DOI:10.1093/molbev/mst055
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method :: phrap v. 1.080730
            Genome Coverage :: 7.7x
            Sequencing Technology :: Sanger
            ##Genome-Assembly-Data-END##
FEATURES    Qualifiers
  source    /db_xref="taxon:1248726"
            /mol_type="genomic DNA"
            /organism="Helicobacter pylori OK310"
            /strain="OK310"
  protein   /gene="cagA"
            /locus_tag="HPOK310_0786"
            /transl_table=11
BEGIN
        1 MTNETIDQTI TPDQTDFVPQ RFINNLQVAF IKVDSAVASF DPDQKPIVDK NDRDNRQAFE
       61 KISQLREEYA NKAIKNPTKK NQYFSDFINK SNDLINKDNL IAVDSSVESF RKFGDQRYQI
      121 FTSWVSLQKD PSKINTQKIR DFMENIIQPP ISDDKEKAEF LRSAKQSFAG IIIGNQIRSD
      181 QKFMGVFDES LKERQEAEKN AEPAGGDWLD IFLSFVFNKK QSSDLKETLD QEPVPHVQPD
      241 IATTTTDIQG LPPESRDLLD ERGNFSKFTL GDMEMLDVEG VADIDPNYKF NQLLIHNNAL
      301 SSVLMGSHNG VEPEKVSLLY GGNGGPEARH DWNATVGYKN QQGSNVATLI NVHMKNGSGL
      361 VIAGGEKGVN NPSFYLYKED QLTGLKQALS QKEIQNKVDF MEFLAKNNAR LDNLSEKEKE
      421 KFQTEIEDFQ KNPKAYLDAL GNDHIAFVSK KDKKHLALVT EFGNGELSYT LKDYGKKQDK
      481 ALDRETKTTL QGNLKHDGVM FVNYSNFKYT NASKSPDKGV GATNGVSHLE ANFSKVAVFN
      541 LPNLNNLAIT SYMRRDLEGK LSAKGLSLQE ANKLIKDFLN SNKELVEKAL NFNKTVAEAK
      601 NTGNYDEVKK AQKDLEKSIR KREHLEKEVT KKMENRNGNK NRMEAKAQAN SQKDKIFAII
      661 NEEAGKEARG AACVQNLKSI RMELSDKLEN INKNLKDLDK SFDEFKNGKN KDFSKTEETL
      721 KALKDSVKDL GINPEWISKV ENLNTALNEF KNGKNKDFSK VIQAKSDLEN SIKDVIINQK
      781 ITDKVDNLNQ AVSIAKATGD FSGVEQALAD LKNFSKGQLT QQAQKNEDFN TGKNSELYQS
      841 VKNGVNGTLV GNGLSKAEAT TLSKNFSDIK KELNAKLGNF NNNNNDGLKN SIEPIYAKVN
      901 KKKAGQAASP EESIYTQVAK KVNAKIDQLN QAASGFGNVG QAGFPLKRHT KVDDLSKVGL
      961 SANHEPIYAT IDDLGGPFPL KRHTKVDDLS KVGLSANHEP IYATIDDLGG PFPLTRHTKV
     1021 DDLSKVGLSR EQELTQKIDN LNQALSEAEA CHFGNLEQMI DKLKDSTKKN VMNLYVESAK
     1081 KVPTSLSAKL DNYATNSHTR INSNVKNGTI NEKETSMLMR KNPEWLKLVN DKIVAHNVGS
     1141 APLSAYDKIG FNQKNMKDYS DSFKFSTRLS NAVKDIKSDF VQFLTNIFSM GSYSLMKASV
     1201 EHGVKNTNTK GGFQKS
//