LOCUS       ABV14942.1               660 aa    PRT              BCT 31-JAN-2014
DEFINITION  Citrobacter koseri ATCC BAA-895 hypothetical protein protein.
ACCESSION   CP000822-3763
PROTEIN_ID  ABV14942.1
SOURCE      Citrobacter koseri ATCC BAA-895
  ORGANISM  Citrobacter koseri ATCC BAA-895
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Citrobacter.
REFERENCE   1  (bases 1 to 4720462)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Latreille,P., Courtney,L., Wang,C., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Citrobacter koseri Genome Sequencing Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     Citrobacter (diversus) koseri--Citrobacter cells are isolated from
            water, sewage, soils, and food, as well as from the feces of man
            and other animals, where they may be normal inhabitants. They can
            be found in urine, sputum, and other clinical specimens. They can
            sometimes be opportunistic pathogens particularly in
            immunocompromised patients in hospitals or in infants (Pepperell et
            al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and
            references therein).
            
            The strain of Citrobacter koseri being sequenced, strain CDC
            4225-83, was isolated in 1983 in Maryland, where it caused neonatal
            meningitis. It was provided by Caroline Mohr and Melissa Campbell
            of CDC. The strain is available from the American Type Culture
            Collection as ATCC BAA-895 or from the Salmonella Genetic Stock
            Centre as SGSC4696. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined usingtRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data (i.e., phred quality >=30); an attempt was
            made to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Citrobacter koseri ATCC BAA-895"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-895"
                     /db_xref="ATCC:BAA-895"
                     /db_xref="taxon:290338"
     protein         /locus_tag="CKO_03866"
                     /inference="protein motif:HMMPfam:IPR008514"
                     /inference="similar to AA sequence:INSD:ABR77224.1"
                     /note="KEGG: eci:UTI89_C0121 6.4e-50 usp; uropathogenic
                     specific protein K01150; COG: NOG14030 non supervised
                     orthologous group"
                     /transl_table=11
                     /db_xref="InterPro:IPR008514"
BEGIN
        1 MGDIIYVTIE GEHQGDISSG CGTTTSVGNR WQQGHEDEIF VFSLTQGISN TGMGVKHQGL
       61 SFSKVIDRAS PLLTNAINNN ENLKMRFDIY RINRFGRWEK YYVIKLRGAR LNRLVSESRQ
      121 NSLDYEYISL DYDYIHCQHL LAGTEFDYLV TPERYNQLFP VAQVISPPPE PEKRKVTLVL
      181 GIFFDGTGNN AVNTRNMLAA CTAQHFDIDS PDAEIILQKS ASEKMGLSGT EATSYYGYYT
      241 NIHWLNELYL KRYPPDGHYI QYAVYIEGIG TQAGEADSMI GLGLGTSDYG VIAKTDDAVA
      301 QLAEAIKATI RMLKGKFIIE NLLFDIFGFS RGAAAARHFA NRVQSEDGAI INAINAGMVK
      361 QVYTGKPAGK TRFMGIFDTV TAVGTPFNGL NPHSADTGDV NIRLRPGVAQ KVFHITAQHE
      421 CRYNFALNSV APAWPEITLP GVHSDIGGGY LPKTREDLFL TRPQVDTLPS NQPDERSGAY
      481 RKTMAQLPVL EASPAIAPIM RTNEITPEVW SDDYAATDRY GQMQKRSFAA LTMRQRIVRN
      541 DWSKVILRVM VDAAKEAGVM FNENLEVDEL KIPADLSLFL ERAISMGKSA RLRRPTIGFT
      601 KQDIEIIAKT YIHCSANWNA VAVSKTGKPQ GGASASETIG FVNRPDAGWL RTIYNMDGKQ
//