LOCUS AAL18966.1 820 aa PRT BCT 24-JUL-2018 DEFINITION Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 aspartokinase I protein. ACCESSION AE006468-2 PROTEIN_ID AAL18966.1 SOURCE Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 ORGANISM Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Salmonella. REFERENCE 1 (bases 1 to 4857450) AUTHORS McClelland,M., Sanderson,K.E., Spieth,J., Clifton,S.W., Latreille,P., Courtney,L., Porwollik,S., Ali,J., Dante,M., Du,F., Hou,S., Layman,D., Leonard,S., Nguyen,C., Scott,K., Holmes,A., Grewal,N., Mulvaney,E., Ryan,E., Sun,H., Florea,L., Miller,W., Stoneking,T., Nhan,M., Waterston,R. and Wilson,R.K. TITLE Complete genome sequence of Salmonella enterica serovar Typhimurium LT2 JOURNAL Nature 413 (6858), 852-856 (2001) PUBMED 11677609 REFERENCE 2 (bases 1 to 4857450) CONSRTM The Salmonella typhimurium Genome Sequencing Project TITLE Direct Submission JOURNAL Submitted (29-MAR-2001) Genome Sequencing Center, Department of Genetics, Washington University School of Medicine, 4444 Forest Park Boulevard, St. Louis, MO 63108, USA REFERENCE 3 (bases 1 to 4857450) AUTHORS McClelland,M., Jain,A., Saraogi,P., Mendelson,R., Westerman,R., SanMiguel,P. and Csonka,L. TITLE Direct Submission JOURNAL Submitted (13-JAN-2016) Department of Microbiology and Molecular Genetics, University of California, Irvine, CA 92697, USA REMARK Sequence update by submitter COMMENT On Jan 13, 2016 this sequence version replaced AE006468.1. 
Supported by NIH grant 5U 01 AI43283 Coding sequences below are predicted from manually evaluated computer analysis, using similarity information and the programs; GLIMMER; http://www.tigr.org/softlab/glimmer/glimmer.html and GeneMark; http://opal.biology.gatech.edu/GeneMark/ EC numbers were kindly provided by Junko Yabuzaki and the Kyoto Encyclopedia of Genes and Genomes; http://www.genome.ad.jp/kegg/, and Pedro Romero and Peter Karp at EcoCyc; http://ecocyc.PangeaSystems.com/ecocyc/ The analyses of ribosome binding sites and promoter binding sites were kindly provided by Heladia Salgado, Julio Collado-Vides and RegulonDB; http://kinich.cifn.unam.mx:8850/db/regulondb_intro.frameset This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistry or covered by high quality data (i.e., phred quality >= 30); an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. FEATURES Qualifiers source 2776826..2844430,2879238..4857450) /organism="Salmonella enterica subsp. enterica serovar Typhimurium str. LT2" /mol_type="genomic DNA" /strain="LT2" /serovar="Typhimurium" /sub_species="enterica" /culture_collection="ATCC:700720" /culture_collection="SGSC:1412" /type_material="type strain of Salmonella enterica" /db_xref="taxon:99287" /focus source /organism="Salmonella phage Fels-1" /mol_type="genomic DNA" /db_xref="taxon:128975" source /organism="Phage Gifsy-2" /mol_type="genomic DNA" /db_xref="taxon:129862" source /organism="Phage Gifsy-1" /mol_type="genomic DNA" /db_xref="taxon:129861" source /organism="Salmonella virus Fels2" /mol_type="genomic DNA" /db_xref="taxon:194701" protein /gene="thrA" /locus_tag="STM0002" /EC_number="1.1.1.3" /EC_number="2.7.2.4" /note="bifunctional; N-terminus is aspartokinase I and C-terminus is homoserine dehydrogenase I; similar to E.
coli aspartokinase I, homoserine dehydrogenase I (AAC73113.1); Blastp hit to AAC73113.1 (820 aa), 94% identity in aa 1 - 820" /transl_table=11 BEGIN 1 MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTIGGQDA 61 LPNISDAERI FSDLLAGLAS AQPGFPLARL KMVVEQEFAQ IKHVLHGISL LGQCPDSINA 121 ALICRGEKMS IAIMAGLLEA RGHRVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASQIP 181 ADHMILMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV 241 PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASSD 301 DDNLPVKGIS NLNNMAMFSV SGPGMKGMIG MAARVFAAMS RAGISVVLIT QSSSEYSISF 361 CVPQSDCARA RRAMQDEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL 421 ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL 481 LEQLKRQQTW LKNKHIDLRV CGVANSKALL TNVHGLNLDN WQAELAQANA PFNLGRLIRL 541 VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH QLRFAAAQSR 601 RKFLYDTNVG AGLPVIENLQ NLLNAGDELQ KFSGILSGSL SFIFGKLEEG MSLSQATALA 661 REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELSDIV IEPVLPNEFD ASGDVTAFMA 721 HLPQLDDAFA ARVAKARDEG KVLRYVGNIE EDGVCRVKIA EVDGNDPLFK VKNGENALAF 781 YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV //