LOCUS BAX20240.1 759 aa PRT BCT 27-MAR-2019
DEFINITION Escherichia coli hypothetical protein protein.
ACCESSION AP017620-898
PROTEIN_ID BAX20240.1
SOURCE Escherichia coli
ORGANISM Escherichia coli
Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE 1 (bases 1 to 5042704)
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Direct Submission
JOURNAL Submitted (27-JUL-2016)
Contact:Tsuyoshi Sekizuka
National Institute of Infectious Diseases, Pathogen Genomics
Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE 2
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Elucidation of quantitative structural diversity of remarkable
rearrangement regions, shufflons, in IncI2 plasmids
JOURNAL Sci. Rep. 7, 928 (2017)
REMARK Publication Status: Online-Only
DOI:10.1038/s41598-017-01082-y
COMMENT ##Genome-Assembly-Data-START##
Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v.
8.2
Genome Coverage :: 200x
Sequencing Technology :: MiSeq; PacBio RSII
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /collection_date="2013"
/db_xref="taxon:562"
/geo_loc_name="Japan"
/host="Bos taurus"
/mol_type="genomic DNA"
/organism="Escherichia coli"
/strain="MRY15-131"
protein /locus_tag="MRY15131_c08980"
/transl_table=11
BEGIN
1 MSEPLIVGIR HHSPACARLV KSLIESQRPR YVLIEGPADF NDRVDELFLA HQLPVAIYSY
61 CQYQDGTAPG RGAWTPFAEF SPEWQALQAA RRIQAQTYFI DLPCWAQSEE DDDSPDTQEE
121 SQTLLLRATR MDNSDTLWDH LFEDESQQTA LPSALAHYFA QLRGDSPGDA LNRQREAFMA
181 RWIAWAMQQN NGDVLVVCGG WHAPALAKMW REYTQEINTP ELPSLADVVT GCYLTPYSEK
241 RLDVLAGYLS GMPAPVWQNW CWRWGLQQAG EQLLKTVLTR LRQHHLPAST ADMAAAHLHA
301 MALAQLRGHT LPLRTDWLDA IAGSLIKEAL NAPLPWSYRG VIHPDTDPIL LTLIDTLAGD
361 GFGKLAPSTP QPPLPKDVTC ELERTAISLP AELTLNRFNP NGLAQSQVLH RLAILEIPGI
421 VRQQGSTLTL AGNGEERWKL TRPLSQHAAL IEAACFGATL QEAARHKLEA DMLDAGGIGS
481 ITTCLSQAAL AGLASFSQQL LEQLTLLIAQ ENQFAEMGQA LEVLYALWQL DEISGMQGAQ
541 ILQTTLCAAI DRTLWLCESN GRPDEKEFHA HLHSWQALCH ILRDLHSGVN LPGVSLSAAV
601 ALLERRSQAI HAPALDRGAA LGALMRLEHP NASAEAALTM LAQLSPAQSG EALHGLLALA
661 RHQLACQLAF IAGFSSHLNQ LSDDDFINAL PDLRAAMAWL PPRERGTLAH QVLEHYQLAQ
721 LPISALQMPL HCPPQAIAHH QQLEQQALAS LQHWGVFHV
//