LOCUS       QJU57725.1              1057 aa    PRT              BCT 18-MAY-2020
DEFINITION  Sphingomonas sp. AP4-R1 CHAT domain-containing protein.
ACCESSION   CP053346-1558
PROTEIN_ID  QJU57725.1
SOURCE      Sphingomonas sp. AP4-R1
  ORGANISM  Sphingomonas sp. AP4-R1
            Bacteria; Proteobacteria; Alphaproteobacteria; Sphingomonadales;
            Sphingomonadaceae; Sphingomonas.
REFERENCE   1  (bases 1 to 5252057)
  AUTHORS   Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W.
  TITLE     Genome sequencing of strain KACC 21605
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 5252057)
  AUTHORS   Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-MAY-2020) Agricultural Microbiology Division,
            National Institute of Agricultural Sciences, 166
            Nongsaengmyeong-ro, Iseo-myeon, Wanju-gun, Jeollabuk-do 55365,
            South Korea
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            This genome has a base modification file available.
            
            ##Genome-Assembly-Data-START##
            Assembly Date          :: APR-2020
            Assembly Method        :: RS HGAP Assembly v. 3.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 99.0x
            Sequencing Technology  :: PacBio RSII
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 05/11/2020 21:04:38
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.11
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 4,788
            CDSs (total)                      :: 4,729
            Genes (coding)                    :: 4,613
            CDSs (with protein)               :: 4,613
            Genes (RNA)                       :: 59
            rRNAs                             :: 2, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 2, 2, 2 (5S, 16S, 23S)
            tRNAs                             :: 50
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 116
            CDSs (without protein)            :: 116
            Pseudo Genes (ambiguous residues) :: 0 of 116
            Pseudo Genes (frameshifted)       :: 56 of 116
            Pseudo Genes (incomplete)         :: 58 of 116
            Pseudo Genes (internal stop)      :: 26 of 116
            Pseudo Genes (multiple problems)  :: 22 of 116
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Sphingomonas sp. AP4-R1"
                     /mol_type="genomic DNA"
                     /strain="AP4-R1"
                     /host="Malus prunifolia (crab apple)"
                     /culture_collection="KACC:21605"
                     /db_xref="taxon:2735134"
                     /geo_loc_name="South Korea: Naju-si"
                     /collection_date="23-Oct-2019"
                     /collected_by="Jun Heo, Soon-Wo Kwon"
                     /identified_by="Jun Heo, Soon-Wo Kwon"
     protein         /locus_tag="HL653_07910"
                     /inference="COORDINATES: protein motif:HMM:NF024180.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MWAKRGALVA VAALLASGSG TAAPRVRGAA PDIFVLGRDA TGEPCSATRN WRDPRLGDAF
       61 DSAWGLTCRG VAASRQQGYA LRLAPTHAAG ADETDCGAPL QQTIAGIGPV ETRACYDETL
      121 ATAAVRVRFR HDGRLYVGAS ASTALGPLEA LLRTIARVAP APADRDVVVK PTLAADRLQQ
      181 PAKIAQAKDA RAGFDAAAAL QASIQLNHRG LYVEASRLLN DALSRLDGEA APLTRVELEL
      241 EAGLADSNIR QFDAADEHFT RANAVLSAHA GADRAAALEA KRAVYRGLDL LNRRQWAEAY
      301 AAMGATDARD NPLLDPATLS ALNQAPAGAG VSAALSAVDG AQLARLLLEA QRRWARSVAL
      361 LALGRTRDSN EELDRAAEGI GELQRSVDSD ALVAIKSRVQ RQYARVAVRE GKVDLALSLF
      421 DCSIATLQGQ PPSLAKPCPL DRPVRRAGTP GNAEGLMIAE TELERTSILA RRPGVAMKDL
      481 LAAYDAGVDA LIASTDASGI VPSSLENYLE TLATLYAQTP TDEIAERYFR AVQSVGEPAI
      541 ARQLAQLQSV VTGDGMLGAK VRDRGELERQ IVQLRYAIAS APADDAAGLA ALESQRRGAE
      601 AKLVDLNATI SSDSRYRAVD DRPATIAQMR AALRPGEIYV KVSRLRGRAW ATTIDGAHTW
      661 IYPLAGTSDD VDETATAVLN SIRDNSDTLP IFDVASAHTL FRLLAGPAEP AILGSKAIVF
      721 DLSGALQNLP ASVMVAEEAS VKAYAARAEQ EPNDYSHVDF LAGRAEISNA LSPRSFLIAR
      781 ALPESAAARP FIGFGQNAPP IEGGARDTAR PISFGVGCPM SYLNLANIMR AVRPVSAAEL
      841 GIAAQALGDP TAPEVTGAAF TDTAMMAASD AGDYTRYQVI HFATHGLPES HWGCSVVPPS
      901 LLTTLGAPAP IDQPQSDGLL TFSEIARLRL DANLVVLLAC ETAAGVSTRG GRLGGQDESA
      961 ATLDGLVRAF ITANARAVLA TYWKVPDGEA TLDFVRTLYT TGRTGTIGAA LRRAQTQVIE
     1021 RPDISHPYFW APFFLVGDAS KTMLSQPLPA TQHVATR
//