LOCUS ABV12387.1 3163 aa PRT BCT 31-JAN-2014 DEFINITION Citrobacter koseri ATCC BAA-895 hypothetical protein protein. ACCESSION CP000822-1208 PROTEIN_ID ABV12387.1 SOURCE Citrobacter koseri ATCC BAA-895 ORGANISM Citrobacter koseri ATCC BAA-895 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Citrobacter. REFERENCE 1 (bases 1 to 4720462) AUTHORS McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J., Clifton,W.S., Latreille,P., Courtney,L., Wang,C., Pepin,K., Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R. CONSRTM The Citrobacter koseri Genome Sequencing Project TITLE Direct Submission JOURNAL Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444 Forest Park Parkway, St. Louis, MO 63108, USA COMMENT Citrobacter (diversus) koseri--Citrobacter cells are isolated from water, sewage, soils, and food, as well as from the feces of man and other animals, where they may be normal inhabitants. They can be found in urine, sputum, and other clinical specimens. They can sometimes be opportunistic pathogens particularly in immunocompromised patients in hospitals or in infants (Pepperell et al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and references therein). The strain of Citrobacter koseri being sequenced, strain CDC 4225-83, was isolated in 1983 in Maryland, where it caused neonatal meningitis. It was provided by Caroline Mohr and Melissa Campbell of CDC. The strain is available from the American Type Culture Collection as ATCC BAA-895 or from the Salmonella Genetic Stock Centre as SGSC4696. The genome was sequenced to 8X coverage, using plasmid and fosmid libraries and was finished to an error rate of less than 1 per 10,000 bases. Automated annotation was performed and manual annotation will continue in the labs of Michael McClelland and Kenneth Sanderson. The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH) has funded this project. Coding sequences below are predicted using GeneMark v3.3 and Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. RNA genes were determined usingtRNAscan-SE 1.23 or Rfam v8.0. This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistries or covered by high quality data (i.e., phred quality >=30); an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. FEATURES Qualifiers source /organism="Citrobacter koseri ATCC BAA-895" /mol_type="genomic DNA" /strain="ATCC BAA-895" /db_xref="ATCC:BAA-895" /db_xref="taxon:290338" protein /locus_tag="CKO_01247" /inference="protein motif:Gene3D:IPR001227" /inference="protein motif:Gene3D:IPR009081" /inference="protein motif:HMMPanther:IPR000794" /inference="protein motif:HMMPfam:IPR000794" /inference="protein motif:HMMPfam:IPR001031" /inference="protein motif:HMMPfam:IPR001227" /inference="protein motif:HMMPfam:IPR001242" /inference="protein motif:HMMPfam:IPR006163" /inference="protein motif:HMMPfam:IPR013217" /inference="protein motif:HMMPfam:IPR013624" /inference="protein motif:HMMPfam:IPR013968" /inference="protein motif:ScanRegExp:IPR000794" /inference="protein motif:ScanRegExp:IPR006162" /inference="protein motif:superfamily:IPR009081" /note="KEGG: eci:UTI89_C2184 0. irp1; HMWP1 nonribosomal peptide/polyketide synthase K04786; COG: COG3319 Thioesterase domains of type I polyketide synthases or non-ribosomal peptide synthetases" /transl_table=11 /db_xref="InterPro:IPR000794" /db_xref="InterPro:IPR001031" /db_xref="InterPro:IPR001227" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR006162" /db_xref="InterPro:IPR006163" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR013217" /db_xref="InterPro:IPR013624" /db_xref="InterPro:IPR013968" BEGIN 1 MDNLRFSSAP TADSIDASIA QHYPDCEPVA VIGYACHFPE SPDGETFWQN LLEGRECSRR 61 FTREELLAVG LDAAIIDDPH YVNIGTVLDN ADCFDATLFG YSRQEAESMD PQQRLFLQAV 121 WHALEHAGYA PGAVPHKTGV FASSRMSTYP GREALNVTEV AQVKGLQSLM GNDKDYIATR 181 AAYKLNLHGP ALSVQTACSS SLVAVHLACE SLRAGESDMA VAGGVALSFP QQAGYRYQPG 241 MIFSPDGHCR PFDASAEGTW AGNGLGCVVL RRLRDALLSG DPIISVILSS AVNNDGNRKV 301 GYTAPSVAGQ QAVIEEALML AAIDDRQVGY IETHGTGTPL GDAIEIEALR NVYAPRPQDQ 361 RCALGSVKSN MGHLDTAAGI AGLLKTVLAV NRGQIPPLLN FHTPNPALKL EESPFTIPVS 421 AQAWQDEMRY AGVSSFGIGG TNCHMIVASL PDALNARLPN TDSGRKSTAL LLSAASDSAL 481 RRLATDYAGA LRENADASSL AFTALHARRL DLPFRLAAPL NRETAEALSA WAGEKSGALV 541 YSGHGASGKQ VWLFTGQGSH WRTMGQTMYQ HSTAFADTLD RCFSACSEML TPSLREAMFN 601 PDSAQLDNMA WAQPAIVAFE IAMAAHWRAE GLKPDFAIGH SVGEFAAAVV CGHYTIEQVM 661 PLVCRRGALM QQCASGAMVA VFADEDTLMP LARQFELDLA ANNGTQHTVF SGPEARLAVF 721 CATLSQHDIN YRRLSVTGAA HSALLEPILD RFQDACAGLH AEPGQIPIIS TLTADVIDES 781 TLNQADYWRR HMRQPVRFIQ SIQVAHQLGA RVFLEMGPDA QLVACGQREY RDNAYWIASA 841 RRNKEASDVL NQALLQLYAA GVALPWADLL AGDGQRIAAP CYPFDTERYW KERVSPACEP 901 ADAALSAGLE VASRAATALD LPRLEALKQC ATRLHAIYVD QLVQRCTGDA IENGVDAMTI 961 MRRGRLLPRY QQLLQRLLNN CVVDGDYRCT DGRYVRARPI EHQQRESLLT ELAGYCEGFQ 1021 AIPDTIARAG DRLYEMMSGA EEPVAIIFPQ SASDGVEVLY QEFSFGRYFN QIAAGVLRGI 1081 VQTRQPRQPL RILEVGGGTG GTTAWLLPEL NGVPALEYHF TDISALFTRR AQQKFADYDF 1141 VKYSELDLEK EAQSQGFQAQ SYDLIVAANV IHATRHIGRT LDNLRPLLKP GGRLLMREIT 1201 QPMRLFDFVF GPLVLPLQDL DAREGELFLT TAQWQQQCRH AGFSKVAWLP QDGSPTAGMS 1261 EHIILATLPG QAVSAVTFTA PSEPVLGQAL TDNGDYLADW SDCAGQPERF NARWQEAWRL 1321 LSQRHGDALP VEPPPVAAPE WLEEVRLSWQ NEAFSRGQMR VEARHPDGEW LPLSPTAPLP 1381 APQTHYQWRW TPLNVASVDH PLTFSFSTGT LARSDELAQY GIIHDPHASS RLMIVEESED 1441 TLALAEKVIA ALTASAAGLI VVTRRAWRVE ENEALSASHH ALWALLRVAA NEQPERLIAA 1501 IDLAENTPWE TLHQGLSAVS LSQRWLAARG DTLWLPSLAP NTGCAAELPA NVFIGDNRWH 1561 LVTGAFGGLG RLAVNWLREK GARRIALLAP RVDESWLRDV EGGQTRVCRC DVGDAGQLAT 1621 VLDDLAANGG IAGAIHAAGV LADAPLQELD DHQLAAVFAV KAQAANQLLQ TLRNHDGRYL 1681 ILYSSAAATL GAPGQSAHAL ACGYLDGLAQ QFSTLDAPKT LSVAWGAWGE SGRAATPEML 1741 VTLASRGMGA LSDAEGCWHL EQAVMRGAPW RLAMRVFTDK MPPLQQALFN ISATEKAATP 1801 VIPPADDNAF NGSLSDETAV MAWLKKRIAV QLRLSDPASL RPNQDLLQLG MDSLLFLELS 1861 SDIQHYLGVR INAERAWQDL SPHGLTQLIC SKPETTPAAS QPEVLRHDAD ERYAPFPLTP 1921 IQHAYWLGRT HLIGYGGVAC HVLFEWDKRH DEFDLAILEK AWNQLIARHD MLRMVVDADG 1981 QQRVLATTPE YHIQRDDLRA LSPEEQRIAL EKRRHELSYR VLPADQWPLF ELVVSEIDDC 2041 HYRLHMNLDL LQFDVQSFKV MMDDLAQVWR GETLAPLAIT FRDYVMAEQA RRQTSAWHDA 2101 WDYWQEKLPQ LPLAPELPVV ETPPETPHFT TFKSTIGKTE WQAVKQRWQQ QGVTPSAALL 2161 TLFAATLERW SRTTAFTLNL TFFNRQPIHP QINQLIGDFT SVTLVDFNFS TPVTLQEQMQ 2221 QTQQRLWQNM AHSEMNGVEV IRELGRLRGS QRQPLMPVVF TSMLGMTLEG MTIDQAMSHL 2281 FGEPCYVFTQ TPQVWLDHQV MESDGELMFS WYCMDNVLEP GAAEAMFNDY CAILQAVIAA 2341 PESLKTLASG IAGHIPRRRW PLNAQTDYDL RDIEQATLEY PGIRQTRAEM TEQGALTLDI 2401 VMADDPSPSA ATPDEHELTQ LALPLPEQAQ LDELEATWRW LEARALQGIA ATLNRHGLFT 2461 TPEIAHRFSA IVQALSAQAS HQRLLRQWLQ CLTERAWLIR EGESWRCRVP LSEIPEPQEA 2521 CPPSQWSQAL AQYLETCIAR HDALFSGQCS PLELLFNEQH RVTDALYRDN PASACLNRYT 2581 AQIAALCSAE RILEVGAGTA ATTAPVLKAT RNTRQSYHFT DVSAQFLNDA RARFHDESRV 2641 SYALFDINQP LDFTAHPEAG YDLIVAVNVL HDASHVVQTL RRLKLLLKAG GRLLIVEATE 2701 RNSVFQLASV GFIEGLSGYR DFRRRDEKPM LTRSAWQEVL VQAGFANELA WPAQESSPLR 2761 QHLLVARSPG VNRPDKKAVS HYLQQRFGTG LPILQIRQRE ALFTPLHAPS DAPTEPAKPT 2821 PVAGGNPALE KQVAELWQSL LSRPVARHHD FFELGGDSLM ATRMVAQLNR RGIARANLQD 2881 LFSHSTLSDF CAHLQAATSG EDNPIPLCQG DGDETLFVFH ASDGDISAWL PLASALNRRV 2941 FGLQAKSPQR FATLDQMIDE YVGCIRRQQP HGPYVLAGWS YGAFLAAGTA QRLYAKGEQV 3001 RIALIDPVCR QDFCCENRAA LLRLLAEGQT PLALPEHFDQ QTPDSQLADF ISLAKTAGMV 3061 SQNLTLQAAE TWLDNIAHLL RLLTEHTPGE SVPVPCLMVY AAGRPARWTP AETEWQGWIN 3121 NADDAVIEAS HWQIMMEAPH VQACAQHITR WLCATSTQPE NTL //