LOCUS       ABV12387.1              3163 aa    PRT              BCT 31-JAN-2014
DEFINITION  Citrobacter koseri ATCC BAA-895 hypothetical protein protein.
ACCESSION   CP000822-1208
PROTEIN_ID  ABV12387.1
SOURCE      Citrobacter koseri ATCC BAA-895
  ORGANISM  Citrobacter koseri ATCC BAA-895
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacterales;
            Enterobacteriaceae; Citrobacter.
REFERENCE   1  (bases 1 to 4720462)
  AUTHORS   McClelland,M., Sanderson,E.K., Porwollik,S., Spieth,J.,
            Clifton,W.S., Latreille,P., Courtney,L., Wang,C., Pepin,K.,
            Bhonagiri,V., Nash,W., Johnson,M., Thiruvilangam,P. and Wilson,R.
  CONSRTM   The Citrobacter koseri Genome Sequencing Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-AUG-2007) Genetics, Genome Sequencing Center, 4444
            Forest Park Parkway, St. Louis, MO 63108, USA
COMMENT     Citrobacter (diversus) koseri--Citrobacter cells are isolated from
            water, sewage, soils, and food, as well as from the feces of man
            and other animals, where they may be normal inhabitants. They can
            be found in urine, sputum, and other clinical specimens. They can
            sometimes be opportunistic pathogens particularly in
            immunocompromised patients in hospitals or in infants (Pepperell et
            al., Antimicrob Agents Chemother. 2002 Nov;46(11):3555-60. and
            references therein).
            
            The strain of Citrobacter koseri being sequenced, strain CDC
            4225-83, was isolated in 1983 in Maryland, where it caused neonatal
            meningitis. It was provided by Caroline Mohr and Melissa Campbell
            of CDC. The strain is available from the American Type Culture
            Collection as ATCC BAA-895 or from the Salmonella Genetic Stock
            Centre as SGSC4696. The genome was sequenced to 8X coverage, using
            plasmid and fosmid libraries and was finished to an error rate of
            less than 1 per 10,000 bases. Automated annotation was performed
            and manual annotation will continue in the labs of Michael
            McClelland and Kenneth Sanderson. The National Institute of Allergy
            and Infectious Diseases (NIAID), National Institutes of Health
            (NIH) has funded this project.
            
            Coding sequences below are predicted using GeneMark v3.3 and
            Glimmer2 v2.13.Intergenic regions not spanned by GeneMark and
            Glimmer2 were blasted against NCBI's non-redundant (NR) database
            and predictions generated based on protein alignments. RNA genes
            were determined usingtRNAscan-SE 1.23 or Rfam v8.0. This sequence
            was finished as follows unless otherwise noted: all regions were
            double stranded, sequenced with an alternate chemistries or covered
            by high quality data (i.e., phred quality >=30); an attempt was
            made to resolve all sequencing problems, such as compressions and
            repeats; all regions were covered by sequence from more than one
            m13 subclone.
FEATURES             Qualifiers
     source          /organism="Citrobacter koseri ATCC BAA-895"
                     /mol_type="genomic DNA"
                     /strain="ATCC BAA-895"
                     /db_xref="ATCC:BAA-895"
                     /db_xref="taxon:290338"
     protein         /locus_tag="CKO_01247"
                     /inference="protein motif:Gene3D:IPR001227"
                     /inference="protein motif:Gene3D:IPR009081"
                     /inference="protein motif:HMMPanther:IPR000794"
                     /inference="protein motif:HMMPfam:IPR000794"
                     /inference="protein motif:HMMPfam:IPR001031"
                     /inference="protein motif:HMMPfam:IPR001227"
                     /inference="protein motif:HMMPfam:IPR001242"
                     /inference="protein motif:HMMPfam:IPR006163"
                     /inference="protein motif:HMMPfam:IPR013217"
                     /inference="protein motif:HMMPfam:IPR013624"
                     /inference="protein motif:HMMPfam:IPR013968"
                     /inference="protein motif:ScanRegExp:IPR000794"
                     /inference="protein motif:ScanRegExp:IPR006162"
                     /inference="protein motif:superfamily:IPR009081"
                     /note="KEGG: eci:UTI89_C2184 0. irp1; HMWP1 nonribosomal
                     peptide/polyketide synthase K04786; COG: COG3319
                     Thioesterase domains of type I polyketide synthases or
                     non-ribosomal peptide synthetases"
                     /transl_table=11
                     /db_xref="InterPro:IPR000794"
                     /db_xref="InterPro:IPR001031"
                     /db_xref="InterPro:IPR001227"
                     /db_xref="InterPro:IPR001242"
                     /db_xref="InterPro:IPR006162"
                     /db_xref="InterPro:IPR006163"
                     /db_xref="InterPro:IPR009081"
                     /db_xref="InterPro:IPR013217"
                     /db_xref="InterPro:IPR013624"
                     /db_xref="InterPro:IPR013968"
BEGIN
        1 MDNLRFSSAP TADSIDASIA QHYPDCEPVA VIGYACHFPE SPDGETFWQN LLEGRECSRR
       61 FTREELLAVG LDAAIIDDPH YVNIGTVLDN ADCFDATLFG YSRQEAESMD PQQRLFLQAV
      121 WHALEHAGYA PGAVPHKTGV FASSRMSTYP GREALNVTEV AQVKGLQSLM GNDKDYIATR
      181 AAYKLNLHGP ALSVQTACSS SLVAVHLACE SLRAGESDMA VAGGVALSFP QQAGYRYQPG
      241 MIFSPDGHCR PFDASAEGTW AGNGLGCVVL RRLRDALLSG DPIISVILSS AVNNDGNRKV
      301 GYTAPSVAGQ QAVIEEALML AAIDDRQVGY IETHGTGTPL GDAIEIEALR NVYAPRPQDQ
      361 RCALGSVKSN MGHLDTAAGI AGLLKTVLAV NRGQIPPLLN FHTPNPALKL EESPFTIPVS
      421 AQAWQDEMRY AGVSSFGIGG TNCHMIVASL PDALNARLPN TDSGRKSTAL LLSAASDSAL
      481 RRLATDYAGA LRENADASSL AFTALHARRL DLPFRLAAPL NRETAEALSA WAGEKSGALV
      541 YSGHGASGKQ VWLFTGQGSH WRTMGQTMYQ HSTAFADTLD RCFSACSEML TPSLREAMFN
      601 PDSAQLDNMA WAQPAIVAFE IAMAAHWRAE GLKPDFAIGH SVGEFAAAVV CGHYTIEQVM
      661 PLVCRRGALM QQCASGAMVA VFADEDTLMP LARQFELDLA ANNGTQHTVF SGPEARLAVF
      721 CATLSQHDIN YRRLSVTGAA HSALLEPILD RFQDACAGLH AEPGQIPIIS TLTADVIDES
      781 TLNQADYWRR HMRQPVRFIQ SIQVAHQLGA RVFLEMGPDA QLVACGQREY RDNAYWIASA
      841 RRNKEASDVL NQALLQLYAA GVALPWADLL AGDGQRIAAP CYPFDTERYW KERVSPACEP
      901 ADAALSAGLE VASRAATALD LPRLEALKQC ATRLHAIYVD QLVQRCTGDA IENGVDAMTI
      961 MRRGRLLPRY QQLLQRLLNN CVVDGDYRCT DGRYVRARPI EHQQRESLLT ELAGYCEGFQ
     1021 AIPDTIARAG DRLYEMMSGA EEPVAIIFPQ SASDGVEVLY QEFSFGRYFN QIAAGVLRGI
     1081 VQTRQPRQPL RILEVGGGTG GTTAWLLPEL NGVPALEYHF TDISALFTRR AQQKFADYDF
     1141 VKYSELDLEK EAQSQGFQAQ SYDLIVAANV IHATRHIGRT LDNLRPLLKP GGRLLMREIT
     1201 QPMRLFDFVF GPLVLPLQDL DAREGELFLT TAQWQQQCRH AGFSKVAWLP QDGSPTAGMS
     1261 EHIILATLPG QAVSAVTFTA PSEPVLGQAL TDNGDYLADW SDCAGQPERF NARWQEAWRL
     1321 LSQRHGDALP VEPPPVAAPE WLEEVRLSWQ NEAFSRGQMR VEARHPDGEW LPLSPTAPLP
     1381 APQTHYQWRW TPLNVASVDH PLTFSFSTGT LARSDELAQY GIIHDPHASS RLMIVEESED
     1441 TLALAEKVIA ALTASAAGLI VVTRRAWRVE ENEALSASHH ALWALLRVAA NEQPERLIAA
     1501 IDLAENTPWE TLHQGLSAVS LSQRWLAARG DTLWLPSLAP NTGCAAELPA NVFIGDNRWH
     1561 LVTGAFGGLG RLAVNWLREK GARRIALLAP RVDESWLRDV EGGQTRVCRC DVGDAGQLAT
     1621 VLDDLAANGG IAGAIHAAGV LADAPLQELD DHQLAAVFAV KAQAANQLLQ TLRNHDGRYL
     1681 ILYSSAAATL GAPGQSAHAL ACGYLDGLAQ QFSTLDAPKT LSVAWGAWGE SGRAATPEML
     1741 VTLASRGMGA LSDAEGCWHL EQAVMRGAPW RLAMRVFTDK MPPLQQALFN ISATEKAATP
     1801 VIPPADDNAF NGSLSDETAV MAWLKKRIAV QLRLSDPASL RPNQDLLQLG MDSLLFLELS
     1861 SDIQHYLGVR INAERAWQDL SPHGLTQLIC SKPETTPAAS QPEVLRHDAD ERYAPFPLTP
     1921 IQHAYWLGRT HLIGYGGVAC HVLFEWDKRH DEFDLAILEK AWNQLIARHD MLRMVVDADG
     1981 QQRVLATTPE YHIQRDDLRA LSPEEQRIAL EKRRHELSYR VLPADQWPLF ELVVSEIDDC
     2041 HYRLHMNLDL LQFDVQSFKV MMDDLAQVWR GETLAPLAIT FRDYVMAEQA RRQTSAWHDA
     2101 WDYWQEKLPQ LPLAPELPVV ETPPETPHFT TFKSTIGKTE WQAVKQRWQQ QGVTPSAALL
     2161 TLFAATLERW SRTTAFTLNL TFFNRQPIHP QINQLIGDFT SVTLVDFNFS TPVTLQEQMQ
     2221 QTQQRLWQNM AHSEMNGVEV IRELGRLRGS QRQPLMPVVF TSMLGMTLEG MTIDQAMSHL
     2281 FGEPCYVFTQ TPQVWLDHQV MESDGELMFS WYCMDNVLEP GAAEAMFNDY CAILQAVIAA
     2341 PESLKTLASG IAGHIPRRRW PLNAQTDYDL RDIEQATLEY PGIRQTRAEM TEQGALTLDI
     2401 VMADDPSPSA ATPDEHELTQ LALPLPEQAQ LDELEATWRW LEARALQGIA ATLNRHGLFT
     2461 TPEIAHRFSA IVQALSAQAS HQRLLRQWLQ CLTERAWLIR EGESWRCRVP LSEIPEPQEA
     2521 CPPSQWSQAL AQYLETCIAR HDALFSGQCS PLELLFNEQH RVTDALYRDN PASACLNRYT
     2581 AQIAALCSAE RILEVGAGTA ATTAPVLKAT RNTRQSYHFT DVSAQFLNDA RARFHDESRV
     2641 SYALFDINQP LDFTAHPEAG YDLIVAVNVL HDASHVVQTL RRLKLLLKAG GRLLIVEATE
     2701 RNSVFQLASV GFIEGLSGYR DFRRRDEKPM LTRSAWQEVL VQAGFANELA WPAQESSPLR
     2761 QHLLVARSPG VNRPDKKAVS HYLQQRFGTG LPILQIRQRE ALFTPLHAPS DAPTEPAKPT
     2821 PVAGGNPALE KQVAELWQSL LSRPVARHHD FFELGGDSLM ATRMVAQLNR RGIARANLQD
     2881 LFSHSTLSDF CAHLQAATSG EDNPIPLCQG DGDETLFVFH ASDGDISAWL PLASALNRRV
     2941 FGLQAKSPQR FATLDQMIDE YVGCIRRQQP HGPYVLAGWS YGAFLAAGTA QRLYAKGEQV
     3001 RIALIDPVCR QDFCCENRAA LLRLLAEGQT PLALPEHFDQ QTPDSQLADF ISLAKTAGMV
     3061 SQNLTLQAAE TWLDNIAHLL RLLTEHTPGE SVPVPCLMVY AAGRPARWTP AETEWQGWIN
     3121 NADDAVIEAS HWQIMMEAPH VQACAQHITR WLCATSTQPE NTL
//