LOCUS ABV12023.1 1455 aa PRT BCT 31-JAN-2014 DEFINITION Citrobacter koseri ATCC BAA-895 hypothetical protein. ACCESSION CP000822-844 PROTEIN_ID ABV12023.1 SOURCE Citrobacter koseri ATCC BAA-895 ORGANISM Citrobacter koseri ATCC BAA-895 Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales; Enterobacteriaceae; Citrobacter. REFERENCE 1 (bases 1 to 4720462) AUTHORS McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J., Clifton,W.S., Latreille,P., Courtney,L., Wang,C., Pepin,K., Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R. CONSRTM The Citrobacter koseri Genome Sequencing Project TITLE Direct Submission JOURNAL Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444 Forest Park Parkway, St. Louis, MO 63108, USA COMMENT Citrobacter (diversus) koseri--Citrobacter cells are isolated from water, sewage, soils, and food, as well as from the feces of man and other animals, where they may be normal inhabitants. They can be found in urine, sputum, and other clinical specimens. They can sometimes be opportunistic pathogens particularly in immunocompromised patients in hospitals or in infants (Pepperell et al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and references therein). The strain of Citrobacter koseri being sequenced, strain CDC 4225-83, was isolated in 1983 in Maryland, where it caused neonatal meningitis. It was provided by Caroline Mohr and Melissa Campbell of CDC. The strain is available from the American Type Culture Collection as ATCC BAA-895 or from the Salmonella Genetic Stock Centre as SGSC4696. The genome was sequenced to 8X coverage, using plasmid and fosmid libraries and was finished to an error rate of less than 1 per 10,000 bases. Automated annotation was performed and manual annotation will continue in the labs of Michael McClelland and Kenneth Sanderson. The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH) has funded this project.
Coding sequences below are predicted using GeneMark v3.3 and Glimmer2 v2.13. Intergenic regions not spanned by GeneMark and Glimmer2 were blasted against NCBI's non-redundant (NR) database and predictions generated based on protein alignments. RNA genes were determined using tRNAscan-SE 1.23 or Rfam v8.0. This sequence was finished as follows unless otherwise noted: all regions were double stranded, sequenced with an alternate chemistry or covered by high quality data (i.e., phred quality >=30); an attempt was made to resolve all sequencing problems, such as compressions and repeats; all regions were covered by sequence from more than one m13 subclone. FEATURES Qualifiers source /organism="Citrobacter koseri ATCC BAA-895" /mol_type="genomic DNA" /strain="ATCC BAA-895" /db_xref="ATCC:BAA-895" /db_xref="taxon:290338" protein /locus_tag="CKO_00873" /inference="protein motif:Gene3D:IPR009081" /inference="protein motif:HMMPfam:IPR000873" /inference="protein motif:HMMPfam:IPR001242" /inference="protein motif:HMMPfam:IPR006163" /inference="protein motif:HMMTigr:IPR010071" /inference="protein motif:ScanRegExp:IPR000873" /inference="protein motif:superfamily:IPR009081" /note="KEGG: eci:UTI89_C2210 0.
putative peptide synthetase K03367; COG: COG1020 Non-ribosomal peptide synthetase modules and related proteins; Psort location: Cytoplasmic, score:9.97" /transl_table=11 /db_xref="InterPro:IPR000873" /db_xref="InterPro:IPR001242" /db_xref="InterPro:IPR006163" /db_xref="InterPro:IPR009081" /db_xref="InterPro:IPR010071" BEGIN 1 MMSGNPLSWP QEQCHIIDQL YPYSAVNIIG GVVTIEGIVD LPRLHAAIQS AIRQFDALRM 61 WFVMGEESEV VSQVQPYHWR DIRHLTFSPD YDKENLRPAA IETFVDEWFR QPFTLLAHDL 121 FEFVTFTCGE QYSGYLFKAH HGIADGWSMA LLSNHVKRAY EQQDVPDDAS PAYSAFLAQQ 181 QSYQASTRFA VDRGWWRDYI DEYRDCFPDS SPIVTTEGIS CSTWLEPAMI NRLYRLCNRY 241 GCTLNTLFIA LFALYRARVW GEEKGVIGVP LANRHTREAR RCFGMFTNQL PLAYRLVRTE 301 RFCERVAFFQ RELKRGFKHS KYPITLFNQD LAEQGGGKLR AFDYCVNYYN FTYERHIAGA 361 AQRVESYYSG EQSYKLQIVL QTVNNHKESL RLSLEALRSA FTPHQLTAMK NGLLDLVTAL 421 DRQPDARLGD LEVYPAPHVA LACGSLKPSF TSRFAAQVVE HGDRTALIDN EQSLTYRQLD 481 DAVERVARYL RQQGIGRGQV VGIIAEHSAQ TVMVIYGILR CGAAFLPLNP ALPTTRLYAM 541 CRKAQVAHIL YDPAMHELTQ ALAFPASSLL QALATSALAR EPWPAIEPQD LAYVLFTSGS 601 TGEPKGVQVS HGNLANYLHF AAERYFTAQD RAALYSSLSF DLTITTLFAP LCVGASISVC 661 RHAESETLLR MAVVDQPNTV IKLTPAHLRL LCAAGISSEQ IRTLVVGGED FKRDLARKAA 721 ALFPQAVIYN EYGPTEATVG CMIYRYTGQE TLPSLPIGMA IDGCQVAICS PWGCPVPEGE 781 TGELVIYGAS VTQGYIDAPQ QTAAAYLKDT NGVMIGYRSG DIGYAIAPNT LVYQGRKDDQ 841 VKINGYRIEL CEIEQALLSA PQVESAAVAV IDDVQGQHSG LLACVTPSSV DVATVMQHLR 901 QQLPTYMQPK QCCAIAQLPL SHNGKVDVRQ MVATVRNTAP ASGSERLGDA AIRHSVRVCV 961 EGALEQTEFD DNENLYVLGL DSIKSIQIAA QLRHHGWTMS AVQVMECGTV NAICEFLASH 1021 TTVSQLAQYA HNTRIDLPAL RWFTQLALPV PNVYNHVIVL KVLPGCPLEQ LHNRLHTLIQ 1081 QQPALHSALD AEGRLLVCDP NVCYPNEVLT EYSTAQWTLA EVIAQCNSML DVTNGRVFTA 1141 ALLHAPQPAS STLVLCAHHL CVDMHSWYLI LSTLDAVSTV NGTSNSGLHR WNDYLASKTV 1201 DSATHESWRT VCQTLPLHFP PVSLPDDSLP RTRAWREDFR HPCVRRLFES SGNTAYSAET 1261 YVLTALALVL RYYSEEPWCR IEMEGMGRGC WPDEPDVADT VGWFTLFYPW AIPLHGDMAT 1321 LLSAIASDLA KRTHGGGDYG LLQMRHAPED SLAQGIRMNY IGVQAQPSLR YFHIDHFNSD 1381 IYTAPENALG CVLEFNIARS AADGLSFHCR FDPTRIALND VQLLLARYKN 
SLTDLDAWLC 1441 QHSATLTGAP TLWTL //