ABV12023.1

LOCUS       ABV12023.1              1455 aa    PRT              BCT 31-JAN-2014
DEFINITION  Citrobacter koseri ATCC BAA-895 hypothetical protein protein.
ACCESSION   CP000822-844
PROTEIN_ID  ABV12023.1
SOURCE      Citrobacter koseri ATCC BAA-895
  ORGANISM  Citrobacter koseri ATCC BAA-895
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Citrobacter.
REFERENCE   1  (bases 1 to 4720462)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Latreille,P., Courtney,L., Wang,C., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Citrobacter koseri Genome Sequencing Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     Citrobacter (diversus) koseri--Citrobacter cells are isolated from
            water, sewage, soils, and food, as well as from the feces of man
            and other animals, where they may be normal inhabitants. They can
            be found in urine, sputum, and other clinical specimens. They can
            sometimes be opportunistic pathogens particularly in
            immunocompromised patients in hospitals or in infants (Pepperell et
            al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and
            references therein).
            
            The strain of Citrobacter koseri being sequenced, strain CDC
            4225-83, was isolated in 1983 in Maryland, where it caused neonatal
            meningitis. It was provided by Caroline Mohr and Melissa Campbell
            of CDC. The strain is available from the American Type Culture
            Collection as ATCC BAA-895 or from the Salmonella Genetic Stock
            Centre as SGSC4696. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined usingtRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data (i.e., phred quality >=30); an attempt was
            made to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Citrobacter koseri ATCC BAA-895"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-895"
                     /db_xref="ATCC:BAA-895"
                     /db_xref="taxon:290338"
     protein         /locus_tag="CKO_00873"
                     /inference="protein motif:Gene3D:IPR009081"
                     /inference="protein motif:HMMPfam:IPR000873"
                     /inference="protein motif:HMMPfam:IPR001242"
                     /inference="protein motif:HMMPfam:IPR006163"
                     /inference="protein motif:HMMTigr:IPR010071"
                     /inference="protein motif:ScanRegExp:IPR000873"
                     /inference="protein motif:superfamily:IPR009081"
                     /note="KEGG: eci:UTI89_C2210 0. putative peptide
                     synthetase K03367; COG: COG1020 Non-ribosomal peptide
                     synthetase modules and related proteins; Psort location:
                     Cytoplasmic, score:9.97"
                     /transl_table=11
                     /db_xref="InterPro:IPR000873"
                     /db_xref="InterPro:IPR001242"
                     /db_xref="InterPro:IPR006163"
                     /db_xref="InterPro:IPR009081"
                     /db_xref="InterPro:IPR010071"
BEGIN
        1 MMSGNPLSWP QEQCHIIDQL YPYSAVNIIG GVVTIEGIVD LPRLHAAIQS AIRQFDALRM
       61 WFVMGEESEV VSQVQPYHWR DIRHLTFSPD YDKENLRPAA IETFVDEWFR QPFTLLAHDL
      121 FEFVTFTCGE QYSGYLFKAH HGIADGWSMA LLSNHVKRAY EQQDVPDDAS PAYSAFLAQQ
      181 QSYQASTRFA VDRGWWRDYI DEYRDCFPDS SPIVTTEGIS CSTWLEPAMI NRLYRLCNRY
      241 GCTLNTLFIA LFALYRARVW GEEKGVIGVP LANRHTREAR RCFGMFTNQL PLAYRLVRTE
      301 RFCERVAFFQ RELKRGFKHS KYPITLFNQD LAEQGGGKLR AFDYCVNYYN FTYERHIAGA
      361 AQRVESYYSG EQSYKLQIVL QTVNNHKESL RLSLEALRSA FTPHQLTAMK NGLLDLVTAL
      421 DRQPDARLGD LEVYPAPHVA LACGSLKPSF TSRFAAQVVE HGDRTALIDN EQSLTYRQLD
      481 DAVERVARYL RQQGIGRGQV VGIIAEHSAQ TVMVIYGILR CGAAFLPLNP ALPTTRLYAM
      541 CRKAQVAHIL YDPAMHELTQ ALAFPASSLL QALATSALAR EPWPAIEPQD LAYVLFTSGS
      601 TGEPKGVQVS HGNLANYLHF AAERYFTAQD RAALYSSLSF DLTITTLFAP LCVGASISVC
      661 RHAESETLLR MAVVDQPNTV IKLTPAHLRL LCAAGISSEQ IRTLVVGGED FKRDLARKAA
      721 ALFPQAVIYN EYGPTEATVG CMIYRYTGQE TLPSLPIGMA IDGCQVAICS PWGCPVPEGE
      781 TGELVIYGAS VTQGYIDAPQ QTAAAYLKDT NGVMIGYRSG DIGYAIAPNT LVYQGRKDDQ
      841 VKINGYRIEL CEIEQALLSA PQVESAAVAV IDDVQGQHSG LLACVTPSSV DVATVMQHLR
      901 QQLPTYMQPK QCCAIAQLPL SHNGKVDVRQ MVATVRNTAP ASGSERLGDA AIRHSVRVCV
      961 EGALEQTEFD DNENLYVLGL DSIKSIQIAA QLRHHGWTMS AVQVMECGTV NAICEFLASH
     1021 TTVSQLAQYA HNTRIDLPAL RWFTQLALPV PNVYNHVIVL KVLPGCPLEQ LHNRLHTLIQ
     1081 QQPALHSALD AEGRLLVCDP NVCYPNEVLT EYSTAQWTLA EVIAQCNSML DVTNGRVFTA
     1141 ALLHAPQPAS STLVLCAHHL CVDMHSWYLI LSTLDAVSTV NGTSNSGLHR WNDYLASKTV
     1201 DSATHESWRT VCQTLPLHFP PVSLPDDSLP RTRAWREDFR HPCVRRLFES SGNTAYSAET
     1261 YVLTALALVL RYYSEEPWCR IEMEGMGRGC WPDEPDVADT VGWFTLFYPW AIPLHGDMAT
     1321 LLSAIASDLA KRTHGGGDYG LLQMRHAPED SLAQGIRMNY IGVQAQPSLR YFHIDHFNSD
     1381 IYTAPENALG CVLEFNIARS AADGLSFHCR FDPTRIALND VQLLLARYKN SLTDLDAWLC
     1441 QHSATLTGAP TLWTL
//