LOCUS QJU60401.1 587 aa PRT BCT 18-MAY-2020 DEFINITION Sphingomonas sp. AP4-R1 hypothetical protein protein. ACCESSION CP053346-416 PROTEIN_ID QJU60401.1 SOURCE Sphingomonas sp. AP4-R1 ORGANISM Sphingomonas sp. AP4-R1 Bacteria; Proteobacteria; Alphaproteobacteria; Sphingomonadales; Sphingomonadaceae; Sphingomonas. REFERENCE 1 (bases 1 to 5252057) AUTHORS Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W. TITLE Genome sequencing of strain KACC 21605 JOURNAL Unpublished REFERENCE 2 (bases 1 to 5252057) AUTHORS Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W. TITLE Direct Submission JOURNAL Submitted (08-MAY-2020) Agricultural Mircrobiology Division, National Institute of Agricultural Sciences, 166 Nongsaengmyeong-ro, Iseo-myeon, Wanju-gun, Jeollabuk-do 55365, South Korea COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ This genome has a base modification file available. ##Genome-Assembly-Data-START## Assembly Date :: APR-2020 Assembly Method :: RS HGAP Assembly v. 3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 99.0x Sequencing Technology :: PacBio RSII ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/11/2020 21:04:38 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,788 CDSs (total) :: 4,729 Genes (coding) :: 4,613 CDSs (with protein) :: 4,613 Genes (RNA) :: 59 rRNAs :: 2, 2, 2 (5S, 16S, 23S) complete rRNAs :: 2, 2, 2 (5S, 16S, 23S) tRNAs :: 50 ncRNAs :: 3 Pseudo Genes (total) :: 116 CDSs (without protein) :: 116 Pseudo Genes (ambiguous residues) :: 0 of 116 Pseudo Genes (frameshifted) :: 56 of 116 Pseudo Genes (incomplete) :: 58 of 116 Pseudo Genes (internal stop) :: 26 of 116 Pseudo Genes (multiple problems) :: 22 of 116 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Sphingomonas sp. AP4-R1" /mol_type="genomic DNA" /strain="AP4-R1" /host="Malus prunifolia (crab apple)" /culture_collection="KACC:21605" /db_xref="taxon:2735134" /geo_loc_name="South Korea: Naju-si" /collection_date="23-Oct-2019" /collected_by="Jun Heo, Soon-Wo Kwon" /identified_by="Jun Heo, Soon-Wo Kwon" protein /locus_tag="HL653_02115" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_019014315.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MAVLLLIAAI GIVTAYGQQR DAQASVGPTF AASLRFPGGN DVVTRVAPFS DEARARGLLP 61 GDRLIAIDGR AIPYFDLDAI QSLLAGPDGS TYRLTVQTGE ASPRTITLTR NSHYIDQAYA 121 GSGLTFAIRS WTVFGFRMLD AVLALATALL LFIRRPTDPV AAMLALCAVP LENGLLFLWP 181 DAGLALTGFS SVRHMLFLTT ILIYPNGRFV PRWTIWLTPL IPVIELVSFQ STTVEAAIGL 241 RQLLLFAFLI AAAGAFYLRY RSTPPGAERQ QIKLIMLGTV GWVLFGLVSL VLERISAASA 301 TVAMSAWFSL ASHLAGSFAA IALEAGILLS LLRLRLYDAD SLISRSAAYT FLTLLLGPTF 361 AATDKVVQVL SAQSFGENAR AMSAGFGAAI AATLIAPLNK RVRRWTELRF QGEIVALRDD 421 LPKHVGDLRE FATLDVLLAG VLDRIRQGIR TRHVAILLRG ENGGFHPLTS PGLTGETVLT 481 WAQDWQPEPG NRLEGDRDDP LFPVRIRLGD EEPGMIGWIL LGPRPDGSFF GKDERKALAE 541 IAAPIALAIR IVRERDARMA GHIARNRELE ERIATLESRL GAFRDSA //