LOCUS BAX20057.1 565 aa PRT BCT 27-MAR-2019
DEFINITION Escherichia coli urocanate hydratase protein.
ACCESSION AP017620-715
PROTEIN_ID BAX20057.1
SOURCE Escherichia coli
ORGANISM Escherichia coli
Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE 1 (bases 1 to 5042704)
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Direct Submission
JOURNAL Submitted (27-JUL-2016)
Contact:Tsuyoshi Sekizuka
National Institute of Infectious Diseases, Pathogen Genomics
Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE 2
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Elucidation of quantitative structural diversity of remarkable
rearrangement regions, shufflons, in IncI2 plasmids
JOURNAL Sci. Rep. 7, 928 (2017)
REMARK Publication Status: Online-Only
DOI:10.1038/s41598-017-01082-y
COMMENT ##Genome-Assembly-Data-START##
Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v. 8.2
Genome Coverage :: 200x
Sequencing Technology :: MiSeq; PacBio RSII
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /collection_date="2013"
/db_xref="taxon:562"
/geo_loc_name="Japan"
/host="Bos taurus"
/mol_type="genomic DNA"
/organism="Escherichia coli"
/strain="MRY15-131"
protein /gene="hutU"
/locus_tag="MRY15131_c07150"
/transl_table=11
BEGIN
1 MSKPSDPRID TSRVIHAPHG TQLHCKNWQI EAAYRMLQNN LDPDVAENPQ HLVVYGGIGR
61 AARNWECFDK ILESLKNLEA DESLLVQSGK PVGVFKTHTD APRVLIANSN LVPQWANWDH
121 FNELDRKGLF MYGQMTAGSW IYIGSQGIVQ GTFETFVEAG RQHYNNSLSG KWILTAGLGG
181 MGGAQPLAAT LAGACSLNIE CQQSSIDFRL RTRYVDKQAK DLDDALALIK YHTERKDAVS
241 IALLGNAAEI VPELVKRARQ GAIKPDLVTD QTSAHDLVYG YLPMGWSVEQ WRAAQQDPQQ
301 HARLQKEAAQ SCVAHVQGML DFQAMGIPVV DYGNNIRQVA FNEGLQNAFD FPGFVPAYIR
361 PMFCEGKGPF RWVALSGDPE DIYKTDAKIK ELFPNNVLTH RWLDMARERI AFQGLPSRIC
421 WLGLGERHLA GLAFNEMVRN GELKAPIVIG RDHLDTGSVA SPNRETESML DGSDAVSDWP
481 LLNGLLNTAG GATWVSLHHG GGVGMGYSQH AGMVIVADGS EEAEKRLARV LVNDCASGVM
541 RHADAGYEAA IECAKNYGLK LPMIR
//