LOCUS       QEO14447.1              1667 aa    PRT              BCT 10-SEP-2019
DEFINITION  Agromyces intestinalis glycoside hydrolase family 3 protein.
ACCESSION   CP043505-1668
PROTEIN_ID  QEO14447.1
SOURCE      Agromyces intestinalis
  ORGANISM  Agromyces intestinalis
            Bacteria; Actinobacteria; Micrococcales; Microbacteriaceae;
            Agromyces.
REFERENCE   1  (bases 1 to 3800200)
  AUTHORS   Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W.
  TITLE     Genome sequencing of strain KACC 19306
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 3800200)
  AUTHORS   Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-SEP-2019) Agricultural Mircrobiology Division,
            National Institute of Agricultural Sciences, 166
            Nongsaengmyeong-ro, Iseo-myeon, Wanju-gun, Jeollabuk-do 55365,
            Korea, Republic of
COMMENT     The annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (PGAP). Information about PGAP can be found here:
            https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            This genome has a base modification file available.
            
            ##Genome-Assembly-Data-START##
            Assembly Date          :: AUG-2019
            Assembly Method        :: RS HGAP Assembly v. 3.0
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 174.0x
            Sequencing Technology  :: PacBio RSII; Illumina HiSeq
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 09/03/2019 16:08:18
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline (PGAP)
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS-2+
            Annotation Software revision      :: 4.9
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 3,466
            CDSs (total)                      :: 3,410
            Genes (coding)                    :: 3,349
            CDSs (with protein)               :: 3,349
            Genes (RNA)                       :: 56
            rRNAs                             :: 2, 2, 2 (5S, 16S, 23S)
            complete rRNAs                    :: 2, 2, 2 (5S, 16S, 23S)
            tRNAs                             :: 47
            ncRNAs                            :: 3
            Pseudo Genes (total)              :: 61
            CDSs (without protein)            :: 61
            Pseudo Genes (ambiguous residues) :: 0 of 61
            Pseudo Genes (frameshifted)       :: 7 of 61
            Pseudo Genes (incomplete)         :: 55 of 61
            Pseudo Genes (internal stop)      :: 0 of 61
            Pseudo Genes (multiple problems)  :: 1 of 61
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Agromyces intestinalis"
                     /mol_type="genomic DNA"
                     /strain="KACC 19306"
                     /isolation_source="gut of larva"
                     /host="Protaetia brevitarsis seulensis"
                     /culture_collection="KACC:19306"
                     /type_material="type strain of Agromyces intestinalis"
                     /db_xref="taxon:2592652"
                     /country="South Korea: Jeonju"
                     /collection_date="2017"
                     /collected_by="Soo-Jin Kim, Soon-Wo Kwon"
                     /identified_by="Soo-Jin Kim, Jun Heo, Soon-Wo Kwon"
     protein         /locus_tag="FLP10_08450"
                     /inference="COORDINATES: protein
                     motif:HMM:PF00754.23,HMM:PF01915.20,HMM:PF03160.12,
                     HMM:PF03425.11"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MPRDIRPARP HRRLAAALAA LGVVASGLIA SAPAAAADAD LAQLGTITAS AAQDDGDGSF
       61 PASNAIDGDP TTRWASGNGP DADTEFTAWL QSDLGAVATV TRIDLAWEAA FAAGYELRVA
      121 TADPSDPASW TTVHTETAGS GGDESIVLDE PVQARYVRLD MLQRTSFTWD PAQLHWYGYS
      181 LFSFGVFGSL PSPAVAFDRT SIGVDAGDTV EVAVNLTNAA TGAQTVRVTS GGGTAVAGDD
      241 YTALDETLTF AEGETTKTVS VATVGHGALA PSRTFVLTLS DPSDGLVLGS RSQVTVTLRP
      301 SGEAPNAGAV DVLADFEAGV PAGWFAWGST GPVTPQLGTA TDDTVPGSAP GNHVFTATVA
      361 GTPVASDWFG FTNDTPAADW SHADGFRFWF LGKGTGKALN FELKSEGKLF DRTVVDDTAG
      421 WRLVSVLFDD LRLKGNPGAP DRFLPSASNG YAVTLTGLGS GTWSFDDVAI FERAIMIDDF
      481 EGEVPVGSGP SGIFSWVAPG GEVTVGVGEQ ERADVTDNHV LTGSYRVPAG QWGGITDNLD
      541 SPQDWSGFRG IRLWWYASQP TNPASPTAGG DVQVELKDGG PDGEHAELWA ATFKDNWGSS
      601 TSRWKLVELP FSAFTLGGNQ PGSAETKNGT LDLTAAWGFA LHFPKGTDDT RYAIDDVQVY
      661 GTPASAANAT VTARPDVVLV DGGDTAEVTL AVTTASGDPL DLPVTVSYAN GDGTAEAGTH
      721 FAAFAGELTF PAGAESGSTQ SFTVQTSAVD GPDEARVIPV VLDGDGARLP DSVKIVINAH
      781 DLPYLDASLP VAERVEDLLG RMSVAEKAGQ MAQAERLGLQ SPAQIADLAL GSVLSGGGST
      841 PPGNTPEAWA DMVDGYQRQA RSTALQIPLL YGVDAVHGHS NVLDATIFPH NTGLGATHDA
      901 DLVEAIGRAT AIETKATGPN WTFAPCLCVT RDERWGRSYE SFGEDPALVR AYAEAVTVGL
      961 QGADPADKSA ADTLLATAKH WVGDGGTAYD PTKVGSGYPI DQGITSADSL ADFVRLHVDP
     1021 YLPSIAAGVG SIMPSYSAVD LGDGPVRMHE HELLNTEVLK GELGFTGFLI SDWEGIDKLP
     1081 GGSYADKAVR SVNAGLDMAM APYNFAAFIS AIEQGVASGA IEPTRVDDAV RRILTQKFEL
     1141 GLFEQPFTDR SRQGEFGGDA HRAIARQAAA ESQVLLKNDG VLPLPKSGSY YVAGSNADDL
     1201 GNQMGGWTIS WQGGSGDTTA GTSILEGIRQ VAPGASVTVS TDASAPTDGF DAGIVVVGET
     1261 PYAEGQGDVG NNGKGLDLKA ADRQAIDRVC SAMDCVVLVV AGRTQLVTDQ LGEMNALVSS
     1321 FLPGSEGAGV ADVLFGDVPF TGRLPITWPA SAAQVPINVG DDEYEPLFAF GWGERTDVPA
     1381 DRVAWVADAL DALGAPAAEA AGAVAALRDA DVWAVDGTVS PDADDVQHAL DLLAAAAATL
     1441 PGTERGTLAH ADLLVSVARD LAQHAIAGGT AADLPDLAAR TADAEVALLN GDPVEAVRLL
     1501 ASVLGIEIDS TDPTDPGKAQ VAGTLSASTA RVGDELTFSA TGFVADETLV GTLFSDPVDL
     1561 GTVDATEAGI GSITFTVPEG LDPGTHTVQL EGSAQIASAT FTLLSDDAPG GPGDGGTGGG
     1621 AGTGGSSAAG SGSVAMTGVE IWLTVLLGIA LVAAGAILMP AVARRRR
//