LOCUS BAX19586.1 715 aa PRT BCT 27-MAR-2019
DEFINITION Escherichia coli hypothetical protein protein.
ACCESSION AP017620-244
PROTEIN_ID BAX19586.1
SOURCE Escherichia coli
ORGANISM Escherichia coli
Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE 1 (bases 1 to 5042704)
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Direct Submission
JOURNAL Submitted (27-JUL-2016)
Contact:Tsuyoshi Sekizuka
National Institute of Infectious Diseases, Pathogen Genomics
Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE 2
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Elucidation of quantitative structural diversity of remarkable
rearrangement regions, shufflons, in IncI2 plasmids
JOURNAL Sci. Rep. 7, 928 (2017)
REMARK Publication Status: Online-Only
DOI:10.1038/s41598-017-01082-y
COMMENT ##Genome-Assembly-Data-START##
Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v.
8.2
Genome Coverage :: 200x
Sequencing Technology :: MiSeq; PacBio RSII
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /collection_date="2013"
/db_xref="taxon:562"
/geo_loc_name="Japan"
/host="Bos taurus"
/mol_type="genomic DNA"
/organism="Escherichia coli"
/strain="MRY15-131"
protein /locus_tag="MRY15131_c02440"
/transl_table=11
BEGIN
1 MSGNIIVSTL TNRVVWDVDQ TSYGKALKAV KSIKAAHEKP AKALEKAQKR TAQMEGKASL
61 ASAKAQTAKL RQAEQLSKQQ QKQAQIQAKM ERDAIAHANK MTSLQARQLS QQEQAARQNA
121 RLAAISAKVQ AAARSRAMTY NPNMGGQHYD PGLVSRQTDA MNRGHGAVAA DIAATKRAMA
181 LEEKRQREKE ASLKRQSTAY DTLRRASLTI SHIEGASLAD KMKAVQAIKE ATKGYSEQRY
241 TIAEMRHELA KATLETRRQA RINRQIVKDK AKAKRAGRIN AQRLTSERSE RGYGGLAATG
301 VIAGGSLIGS MGVARVGQTL QGSVERNQQM KALQSQGISR LEYEAVRDVV ANKTGFNISP
361 DKALDFSADY NDKVGQLTAN GTWKTNKNGQ STLSGGGELG DLINAIAGKS NVATAKRVVK
421 ELQGIDFNLF LVYLKQLQKT FKFTDKETAA FAEAINDASK ILPAFADGGE AVQKRMDDLA
481 RNGYALSEAQ QEQLNRLSAL SVEYSRINDS LSDIFSSGFA DGLGEVGKNT SSFSKGMAGL
541 VPIMDSLGKA VGQVTADMLL FAGRVGQRVN EGATLPDAVY NTVVDDSANG AADWIKEKTG
601 FDPRSIGQTL KQIYPWLDSS APSNIGSGTA YNDPALALNV PGLNYQTPQQ PTFTVPTIDS
661 IVQRQPIPVA VTGEAEIKLS PLDINVNDGA INGLIDTKLR SYEQAQNNMI LGIAQ
//