LOCUS       ABV15216.1               444 aa    PRT              BCT 31-JAN-2014
DEFINITION  Citrobacter koseri ATCC BAA-895 hypothetical protein protein.
ACCESSION   CP000822-4037
PROTEIN_ID  ABV15216.1
SOURCE      Citrobacter koseri ATCC BAA-895
  ORGANISM  Citrobacter koseri ATCC BAA-895
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Citrobacter.
REFERENCE   1  (bases 1 to 4720462)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Latreille,P., Courtney,L., Wang,C., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Citrobacter koseri Genome Sequencing Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     Citrobacter (diversus) koseri--Citrobacter cells are isolated from
            water, sewage, soils, and food, as well as from the feces of man
            and other animals, where they may be normal inhabitants. They can
            be found in urine, sputum, and other clinical specimens. They can
            sometimes be opportunistic pathogens particularly in
            immunocompromised patients in hospitals or in infants (Pepperell et
            al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and
            references therein).
            
            The strain of Citrobacter koseri being sequenced, strain CDC
            4225-83, was isolated in 1983 in Maryland, where it caused neonatal
            meningitis. It was provided by Caroline Mohr and Melissa Campbell
            of CDC. The strain is available from the American Type Culture
            Collection as ATCC BAA-895 or from the Salmonella Genetic Stock
            Centre as SGSC4696. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined usingtRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data (i.e., phred quality >=30); an attempt was
            made to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Citrobacter koseri ATCC BAA-895"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-895"
                     /db_xref="ATCC:BAA-895"
                     /db_xref="taxon:290338"
     protein         /locus_tag="CKO_04151"
                     /inference="protein motif:HMMPfam:IPR005269"
                     /inference="similar to AA sequence:INSD:AAL21848.1"
                     /note="KEGG: hch:HCH_00651 4.5e-129 predicted Rossmann
                     fold nucleotide-binding protein K00052; COG: COG1611
                     Predicted Rossmann fold nucleotide-binding protein; Psort
                     location: Cytoplasmic, score:8.96"
                     /transl_table=11
                     /db_xref="InterPro:IPR005269"
BEGIN
        1 MDMLSQLEVD MLKRTASSDL YQLFRNCSLA VLNSGSLTDN SKELLSRFES FDINVLRRER
       61 GVKLELINPP EDAFVDGRII RALQANLFAV LRDILFVNGQ IHNAGRFQHL DLESSVHITN
      121 LVFSILRNAR ALHVGEAPNM VVCWGGHSIN ENEYLYARRV GTQLGLRELN ICTGCGPGAM
      181 EAPMKGAAVG HAQQRYKDSR FIGMTEPSII AAEPPNPLVN ELIIMPDIEK RLEAFVRIAH
      241 GIIIFPGGVG TAEELLYLLG ILMNPANKDQ VLPLILTGPK ESADYFRVLD EFIVHTLGED
      301 ARRYYRVIID DATEVARLMK KAMPLVKENR RDTGDAYSFN WSIRIAPDLQ IPFEPSHENM
      361 ANLQLYPDQP VEILAADLRR AFSGIVAGNV KEVGIRAIEE FGPYKIHGDR EMMRRMDDLL
      421 QGFVAQHRMK LPGSAYIPCY EICA
//