LOCUS       BAX20057.1 565 aa PRT BCT 27-MAR-2019
DEFINITION  Escherichia coli urocanate hydratase protein.
ACCESSION   AP017620-715
PROTEIN_ID  BAX20057.1
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
            Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 5042704)
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (27-JUL-2016) Contact:Tsuyoshi Sekizuka National
            Institute of Infectious Diseases, Pathogen Genomics Center; 1-23-1
            Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE   2
  AUTHORS   Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
            Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
  TITLE     Elucidation of quantitative structural diversity of remarkable
            rearrangement regions, shufflons, in IncI2 plasmids
  JOURNAL   Sci. Rep. 7, 928 (2017)
  REMARK    Publication Status: Online-Only
            DOI:10.1038/s41598-017-01082-y
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v. 8.2
            Genome Coverage :: 200x
            Sequencing Technology :: MiSeq; PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES    Qualifiers
  source    /collection_date="2013"
            /db_xref="taxon:562"
            /geo_loc_name="Japan"
            /host="Bos taurus"
            /mol_type="genomic DNA"
            /organism="Escherichia coli"
            /strain="MRY15-131"
  protein   /gene="hutU"
            /locus_tag="MRY15131_c07150"
            /transl_table=11
BEGIN
        1 MSKPSDPRID TSRVIHAPHG TQLHCKNWQI EAAYRMLQNN LDPDVAENPQ HLVVYGGIGR
       61 AARNWECFDK ILESLKNLEA DESLLVQSGK PVGVFKTHTD APRVLIANSN LVPQWANWDH
      121 FNELDRKGLF MYGQMTAGSW IYIGSQGIVQ GTFETFVEAG RQHYNNSLSG KWILTAGLGG
      181 MGGAQPLAAT LAGACSLNIE CQQSSIDFRL RTRYVDKQAK DLDDALALIK YHTERKDAVS
      241 IALLGNAAEI VPELVKRARQ GAIKPDLVTD QTSAHDLVYG YLPMGWSVEQ WRAAQQDPQQ
      301 HARLQKEAAQ SCVAHVQGML DFQAMGIPVV DYGNNIRQVA FNEGLQNAFD FPGFVPAYIR
      361 PMFCEGKGPF RWVALSGDPE DIYKTDAKIK ELFPNNVLTH RWLDMARERI AFQGLPSRIC
      421 WLGLGERHLA GLAFNEMVRN GELKAPIVIG RDHLDTGSVA SPNRETESML DGSDAVSDWP
      481 LLNGLLNTAG GATWVSLHHG GGVGMGYSQH AGMVIVADGS EEAEKRLARV LVNDCASGVM
      541 RHADAGYEAA IECAKNYGLK LPMIR
//