LOCUS       ABV13931.1              1025 aa    PRT              BCT 31-JAN-2014
DEFINITION  Citrobacter koseri ATCC BAA-895 hypothetical protein.
ACCESSION   CP000822-2752
PROTEIN_ID  ABV13931.1
SOURCE      Citrobacter koseri ATCC BAA-895
  ORGANISM  Citrobacter koseri ATCC BAA-895
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Citrobacter.
REFERENCE   1  (bases 1 to 4720462)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Latreille,P., Courtney,L., Wang,C., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Citrobacter koseri Genome Sequencing Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     Citrobacter (diversus) koseri--Citrobacter cells are isolated from
            water, sewage, soils, and food, as well as from the feces of man
            and other animals, where they may be normal inhabitants. They can
            be found in urine, sputum, and other clinical specimens. They can
            sometimes be opportunistic pathogens particularly in
            immunocompromised patients in hospitals or in infants (Pepperell et
            al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and
            references therein).
            
            The strain of Citrobacter koseri being sequenced, strain CDC
            4225-83, was isolated in 1983 in Maryland, where it caused neonatal
            meningitis. It was provided by Caroline Mohr and Melissa Campbell
            of CDC. The strain is available from the American Type Culture
            Collection as ATCC BAA-895 or from the Salmonella Genetic Stock
            Centre as SGSC4696. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2 v2.13. Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined using tRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistry, or covered
            by high quality data (i.e., phred quality >=30); an attempt was
            made to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Citrobacter koseri ATCC BAA-895"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-895"
                     /db_xref="ATCC:BAA-895"
                     /db_xref="taxon:290338"
     protein         /locus_tag="CKO_02825"
                     /inference="protein motif:FPrintScan:IPR006101"
                     /inference="protein motif:Gene3D:IPR004199"
                     /inference="protein motif:Gene3D:IPR013781"
                     /inference="protein motif:Gene3D:IPR013812"
                     /inference="protein motif:HMMPfam:IPR004199"
                     /inference="protein motif:HMMPfam:IPR006102"
                     /inference="protein motif:HMMPfam:IPR006103"
                     /inference="protein motif:HMMPfam:IPR006104"
                     /inference="protein motif:ScanRegExp:IPR006101"
                     /inference="protein motif:superfamily:IPR006102"
                     /inference="protein motif:superfamily:IPR008979"
                     /inference="protein motif:superfamily:IPR011013"
                     /note="KEGG: eci:UTI89_C0371 0. lacZ; beta-galactosidase
                     K01190; COG: COG3250
                     Beta-galactosidase/beta-glucuronidase"
                     /transl_table=11
                     /db_xref="InterPro:IPR004199"
                     /db_xref="InterPro:IPR006101"
                     /db_xref="InterPro:IPR006102"
                     /db_xref="InterPro:IPR006103"
                     /db_xref="InterPro:IPR006104"
                     /db_xref="InterPro:IPR008979"
                     /db_xref="InterPro:IPR011013"
                     /db_xref="InterPro:IPR013781"
                     /db_xref="InterPro:IPR013812"
BEGIN
        1 MPQNADSLAV VLKCRDWENP GVTQLNRLEA HPPFCSWRNA DDARVNRDSA QKRSLNGEWT
       61 FAWFSAPEAV PESWRTSDLQ QADSVRVPSN WQMDGYDAPI YTNVTYPIPV NPPFVPADNP
      121 TGCYSLTFSI DADWLQAGQT RIIFDGVNSA FHLWCNSRWV GYGQDSRLPS EFDLTHFLLK
      181 GENRLAVMVL RWSDGSYLED QDMWRMSGIF RDVSLLHKPA TQIRDLRINT RFNDDFSRAV
      241 LEAEVRTTGE RRDDLRVTVQ LWDSETFVGE KTAPLGSEII DERGAYPDRT TLRLNVEHPA
      301 LWSAETPHLY RAVVQLHAAD GTLIEAEACD VGFRQVSIEN GLLLLNGKPL LIRGANRHEH
      361 HPENGQVMDK ETMIKDILLM KQNNFNAVRC SHYPNHPLWY TLCDRYGLYV VDEANIETHG
      421 MVPMNRLSDD PVWLPAMSQR VTRMVQRDRN HPSIIIWSLG NESGHGANHD ALYRWIKSED
      481 PSRPVQYEGG GANTAATDII CPMYARVDQD QPFPAVPKWS IKKWLSMPGE QRPLILCEYA
      541 HAMGNSLGGY AKYWQAFRQY PRLQGGFVWD WVDQSLIKYD DDGRPWSAYG GDFGDAPNDR
      601 QFCMNGLVFA DRTPHPSLYE AKHAQQFFQF ALLPGAECQI EVTSEYLFRH SDNEVLHWSL
      661 ALDGNPLTAG VVTLDIPPQG RQIIALPALP EAETAGQLWL TVRVEQPQAT AWSQAGHISA
      721 WQQWKLGEKL ATQWPQHAGN APQLTSSGTA FRIVAGEKRW EFSRQQGVLT QFWIGEEAQL
      781 LTPLVDQFTR APLDNDIGVS EATRIDPNAW VERWKAAGHY QAKAVLLQCD ADTLASAVLI
      841 TTAHAWQYQG ETLFISRKTY RIDGNGEMQI TVDVDVASGT PHPARIGLSC QLAQVAERVN
      901 WLGLGPHENY PDRLSAACFE RWDLPLEEMY TPYVFPSENG LRCGTRELLY GAHQWRGDFQ
      961 FNISRYGQKQ LMETSHRHLL QPEAGTWLNI DGFHMGVGGD DSWSPSVSAE YQLSAGRYHY
     1021 QISWR
//