LOCUS QJU56420.1 1259 aa PRT BCT 18-MAY-2020 DEFINITION Sphingomonas sp. AP4-R1 error-prone DNA polymerase protein. ACCESSION CP053346-51 PROTEIN_ID QJU56420.1 SOURCE Sphingomonas sp. AP4-R1 ORGANISM Sphingomonas sp. AP4-R1 Bacteria; Proteobacteria; Alphaproteobacteria; Sphingomonadales; Sphingomonadaceae; Sphingomonas. REFERENCE 1 (bases 1 to 5252057) AUTHORS Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W. TITLE Genome sequencing of strain KACC 21605 JOURNAL Unpublished REFERENCE 2 (bases 1 to 5252057) AUTHORS Heo,J., Kim,S.-J., Kim,J.-S., Hong,S.-B. and Kwon,S.-W. TITLE Direct Submission JOURNAL Submitted (08-MAY-2020) Agricultural Mircrobiology Division, National Institute of Agricultural Sciences, 166 Nongsaengmyeong-ro, Iseo-myeon, Wanju-gun, Jeollabuk-do 55365, South Korea COMMENT The annotation was added by the NCBI Prokaryotic Genome Annotation Pipeline (PGAP). Information about PGAP can be found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/ This genome has a base modification file available. ##Genome-Assembly-Data-START## Assembly Date :: APR-2020 Assembly Method :: RS HGAP Assembly v. 3.0 Genome Representation :: Full Expected Final Version :: Yes Genome Coverage :: 99.0x Sequencing Technology :: PacBio RSII ##Genome-Assembly-Data-END## ##Genome-Annotation-Data-START## Annotation Provider :: NCBI Annotation Date :: 05/11/2020 21:04:38 Annotation Pipeline :: NCBI Prokaryotic Genome Annotation Pipeline (PGAP) Annotation Method :: Best-placed reference protein set; GeneMarkS-2+ Annotation Software revision :: 4.11 Features Annotated :: Gene; CDS; rRNA; tRNA; ncRNA; repeat_region Genes (total) :: 4,788 CDSs (total) :: 4,729 Genes (coding) :: 4,613 CDSs (with protein) :: 4,613 Genes (RNA) :: 59 rRNAs :: 2, 2, 2 (5S, 16S, 23S) complete rRNAs :: 2, 2, 2 (5S, 16S, 23S) tRNAs :: 50 ncRNAs :: 3 Pseudo Genes (total) :: 116 CDSs (without protein) :: 116 Pseudo Genes (ambiguous residues) :: 0 of 116 Pseudo Genes (frameshifted) :: 56 of 116 Pseudo Genes (incomplete) :: 58 of 116 Pseudo Genes (internal stop) :: 26 of 116 Pseudo Genes (multiple problems) :: 22 of 116 ##Genome-Annotation-Data-END## FEATURES Qualifiers source /organism="Sphingomonas sp. AP4-R1" /mol_type="genomic DNA" /strain="AP4-R1" /host="Malus prunifolia (crab apple)" /culture_collection="KACC:21605" /db_xref="taxon:2735134" /geo_loc_name="South Korea: Naju-si" /collection_date="23-Oct-2019" /collected_by="Jun Heo, Soon-Wo Kwon" /identified_by="Jun Heo, Soon-Wo Kwon" protein /locus_tag="HL653_00255" /EC_number="2.7.7.7" /inference="COORDINATES: similar to AA sequence:RefSeq:WP_013845998.1" /note="Derived by automated computational analysis using gene prediction method: Protein Homology." /transl_table=11 BEGIN 1 MPLSPPPFVE LVAASHFSFL HGASHGSDLV ATALALKHPG LGLADRNTVA GVVRAWKALK 61 DAREDAAALG APLAPFKLAT GARLVFADGT PDIVAYPEDR RGWGRLTRLL SDGNLRSDKG 121 DCLLRLDDLL GYSESLLLIV LPESSRRDGE GIARKTPTIL PEDEDDSLPA QASIQPLEAG 181 RGSRRDAETQ REKEEKEAAP GDEAHASLSA PLRLRVNTPS PRPRGNDDSK HGNVVPFPSR 241 LREEIEGRTA PAEGRKTANV GSPPPAPPAS GSGALRRTLE TLRAACPDRV WLALPATASG 301 TDVRRRARLL ALAREAGVPP LATTDALYAT REERPLQDVL TCIRLGVTIA QAGRRLEANA 361 ERHLKSAAEM AHLYRDCPQA IAETTRLLAR IAFDLAQLKY EYPHEPVPEG WKPQKWLQHL 421 VVTAARTLWG AGKTPPKALR MLREEFRLIR KMGYAYYFLT VHDVVQFARE QDPPILCQGR 481 GSAANSMVCY LLGVTPIDPA ANNLLFSRFL SEERSEPPDI DVDFEHERRE EVMQYIYRRY 541 GRHRAGIVAT VIHYRSRSAV REVGKALGLT DDVTSRLVST VWGSYSSRME EERFHETGFR 601 LDNPEIARLN HFVGRLLEAP FPRHLSQHVG GFVLTEDRLD ETVPIHHAAM EDRTFVEWDK 661 DDIDALGLMK VDILALGMLT CIRKAFDLIR DHDGTDHSLR DIPREQPDVY AMLQKGDSIG 721 LFQVESRAQM NMLPRLKPKE LYDLVIQVAI VRPGPIQGNM VHPYLRRRQG IEKWSFPAPS 781 PPHPADELHD ILGKTLGVPL FQEQAMKLAI VAAEFSPVDA NKFRRAMATF RNVGTMPEFE 841 EKMVGGMTKR GYTEEFAQRC FSQIKGFGSY GFPESHAQSF AILVYASAYL KRRHPAAFCA 901 ALLNSQPMGF YAPAQIVRDA REHGVEVCPI DVTASGWDNR LEVGAKGPAV RLGFRQIDGF 961 KEEWAKAVEE ALFLPIADGE GDHLRQQMVE GKSPAPGYCP STAFGGPSPP APRGEDVVER 1021 LARLIPARAL RLLADADAFR SLELGRRDAL WEVRRTPHDA LPLFAAAKAR ELAEETDAQL 1081 PAMPLSEEVA ADYQMTRLSL KQHPMAFLRG LFRDEGILSA AELAALPDGR AARMAGVVLV 1141 RQRPGEGKAI FVTLEDETGV TNVLLWAADF EKQRAAVMAS RLMEVRGVVQ KSEEGVVHLM 1201 TTQVVDRTAE LDRLSADHQT RPLLSRADEF AHPQHPRKKG AERRHPRNVR VLPPSRDFH //