LOCUS BAX20687.1 618 aa PRT BCT 27-MAR-2019
DEFINITION Escherichia coli protease 4 protein.
ACCESSION AP017620-1345
PROTEIN_ID BAX20687.1
SOURCE Escherichia coli
ORGANISM Escherichia coli
Bacteria; Pseudomonadati; Pseudomonadota; Gammaproteobacteria;
Enterobacterales; Enterobacteriaceae; Escherichia.
REFERENCE 1 (bases 1 to 5042704)
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Direct Submission
JOURNAL Submitted (27-JUL-2016)
Contact:Tsuyoshi Sekizuka
National Institute of Infectious Diseases, Pathogen Genomics
Center; 1-23-1 Toyama, Shinjuku-ku, Tokyo 162-8640, Japan
REFERENCE 2
AUTHORS Sekizuka,T., Kawanishi,M., Ohnishi,M., Shima,A., Kato,K.,
Yamashita,A., Matsui,M., Suzuki,S. and Kuroda,M.
TITLE Elucidation of quantitative structural diversity of remarkable
rearrangement regions, shufflons, in IncI2 plasmids
JOURNAL Sci. Rep. 7, 928 (2017)
REMARK Publication Status: Online-Only
DOI:10.1038/s41598-017-01082-y
COMMENT ##Genome-Assembly-Data-START##
Assembly Method :: HGAP Assembly v. 3.0; Celera Assembler v.
8.2
Genome Coverage :: 200x
Sequencing Technology :: MiSeq; PacBio RSII
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /collection_date="2013"
/db_xref="taxon:562"
/geo_loc_name="Japan"
/host="Bos taurus"
/mol_type="genomic DNA"
/organism="Escherichia coli"
/strain="MRY15-131"
protein /gene="sppA_2"
/locus_tag="MRY15131_c13450"
/transl_table=11
BEGIN
1 MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL
61 LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL
121 KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH
181 GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL
241 NTVAANRQIP AQQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG
301 WSKTDKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR
361 LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY
421 IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS
481 IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA
541 KVKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA
601 FNDPQNRYAF CLTCANVR
//