LOCUS BAX21813.1 672 aa PRT BCT 27-MAR-2019
DEFINITION Escherichia coli hydrogenase-4 component B protein.
ACCESSION AP017620-2471
PROTEIN_ID BAX21813.1
SOURCE Escherichia coli
ORGANISM Escherichia coli
Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE 1 (bases 1 to 5042704)
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Direct Submission
JOURNAL Submitted (27-JUL-2016)
Contact:Tsuyoshi Sekizuka
National Institute of Infectious Diseases, Pathogen Genomics
Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE 2
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Elucidation of quantitative structural diversity of remarkable
rearrangement regions, shufflons, in IncI2 plasmids
JOURNAL Sci. Rep. 7, 928 (2017)
REMARK Publication Status: Online-Only
DOI:10.1038/s41598-017-01082-y
COMMENT ##Genome-Assembly-Data-START##
Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v.
8.2
Genome Coverage :: 200x
Sequencing Technology :: MiSeq; PacBio RSII
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /collection_date="2013"
/db_xref="taxon:562"
/geo_loc_name="Japan"
/host="Bos taurus"
/mol_type="genomic DNA"
/organism="Escherichia coli"
/strain="MRY15-131"
protein /gene="hyfB_1"
/locus_tag="MRY15131_c24710"
/transl_table=11
BEGIN
1 MDALQLLTWS LILYLFASLA SLFLLGLDRL AIKLSGITSL VGGVIGIISG ITQLHAGVTL
61 AARFATPFEF ADLTLRMDSL SAFMVLVISL LVVVCSLYSL TYMREYEGKG AAAMGFFMNL
121 FIASMVALLV MDNAFWFIVL FEMMSLSSWF LVIARQDKTS INAGMLYFFI AHAGSVLIMI
181 AFLLMGRESG SLDFASFRTH SLSPGLASAV FLLAFFGFGA KAGMMPLHSW LPRAHPAAPS
241 HASALMSGVM VKIGIFGILK VAMDLLAQTG LPLWWGILVM AIGAISALLG VLYALAEQDI
301 KRLLAWSTVE NVGIILLAVG VSMVGLSLHD PLLTVVGLLG ALFHLLNHAL FKGLLFLGAG
361 AIISRLHTHD MEKMGALAKR MPWTAAACLI GCLAISAIPP LNGFISEWYT WQSLFSLSRV
421 EAVALQLAGP IAMVMLAVTG GLAVMCFVKM YGITFCGAPR STHAEEAQDV PNTMIVAMLL
481 LAALCVFIAL SASWLAPKIM HIAHAFTNTP PVTVASGIAL VPGTFHTRVT PSLLLLLLLA
541 MPLLPGLYWL WCRSRRAAFR RTGDAWACGY GWDNAMAPSG NGVMQPLRVV FCALFRLRQQ
601 LDPTLRLNKG LAHVTARAQS TEPFWDERVI RPIVSATQRL AKEIQHLQSG DFRLYCLYVV
661 AALVVLLIAI AV
//