LOCUS       ARN72363.1               407 aa    PRT              BCT 02-MAY-2017
DEFINITION  Nonlabens tegetincola DNA polymerase IV protein.
ACCESSION   CP019342-2387
PROTEIN_ID  ARN72363.1
SOURCE      Nonlabens tegetincola
  ORGANISM  Nonlabens tegetincola
            Bacteria; Bacteroidota; Flavobacteriia; Flavobacteriales;
            Flavobacteriaceae; Nonlabens.
REFERENCE   1  (bases 1 to 2835711)
  AUTHORS   Kumagai,Y.
  TITLE     Trade-off between light-utilization and light-protection in marine
            flavobacteria
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 2835711)
  AUTHORS   Kumagai,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-JAN-2017) Marine Microbiology, Atmosphere and Ocean
            Research Institute, The University of Tokyo, 5-1-5, Kashiwanoha,
            Kashiwa-shi, Chiba 277-8564, Japan
COMMENT     Annotation was added by the NCBI Prokaryotic Genome Annotation
            Pipeline (released 2013). Information about the Pipeline can be
            found here: https://www.ncbi.nlm.nih.gov/genome/annotation_prok/
            
            ##Genome-Assembly-Data-START##
            Assembly Method        :: Sprai v. 0.9.5.1.3
            Genome Representation  :: Full
            Expected Final Version :: Yes
            Genome Coverage        :: 37.72x
            Sequencing Technology  :: PacBio
            ##Genome-Assembly-Data-END##
            
            ##Genome-Annotation-Data-START##
            Annotation Provider               :: NCBI
            Annotation Date                   :: 01/24/2017 14:22:44
            Annotation Pipeline               :: NCBI Prokaryotic Genome
                                                 Annotation Pipeline
            Annotation Method                 :: Best-placed reference protein
                                                 set; GeneMarkS+
            Annotation Software revision      :: 4.0
            Features Annotated                :: Gene; CDS; rRNA; tRNA; ncRNA;
                                                 repeat_region
            Genes (total)                     :: 2,581
            CDS (total)                       :: 2,532
            Genes (coding)                    :: 2,475
            CDS (coding)                      :: 2,475
            Genes (RNA)                       :: 49
            rRNAs                             :: 3, 3, 3 (5S, 16S, 23S)
            complete rRNAs                    :: 3, 3, 3 (5S, 16S, 23S)
            tRNAs                             :: 36
            ncRNAs                            :: 4
            Pseudo Genes (total)              :: 57
            Pseudo Genes (ambiguous residues) :: 0 of 57
            Pseudo Genes (frameshifted)       :: 54 of 57
            Pseudo Genes (incomplete)         :: 2 of 57
            Pseudo Genes (internal stop)      :: 2 of 57
            Pseudo Genes (multiple problems)  :: 1 of 57
            ##Genome-Annotation-Data-END##
FEATURES             Qualifiers
     source          /organism="Nonlabens tegetincola"
                     /mol_type="genomic DNA"
                     /strain="NBRC 100970"
                     /isolation_source="marine sediment"
                     /culture_collection="NBRC:100970"
                     /type_material="type strain of Sandarakinotalea sediminis"
                     /db_xref="taxon:323273"
                     /country="Japan: Katsuura"
     protein         /locus_tag="BST91_12175"
                     /inference="COORDINATES: similar to AA
                     sequence:RefSeq:WP_013551961.1"
                     /note="Derived by automated computational analysis using
                     gene prediction method: Protein Homology."
                     /transl_table=11
BEGIN
        1 MSVQKKHIMH LDLDTFYVSV ERKMDSRLKT KPILVGGTSD RGVVAACSYE TRGYGVHSGM
       61 SMKLARQLCP EAIVIRGNAG VYSKHSDEIT EIIKEEVPLF EKSSIDEFYA DLTGMDKFYG
      121 CYKFATEMRR RIIRETGLPI SFGLSINKVV SKVATGEAKP NNQLMIDYGL EKPFLAPLSI
      181 KKIPQVGDKT YQTLRNLGIK KIKTIQEMPS DVMQNVLGKN GLLIWRRAHG IDNTPVIAFN
      241 ERKSISTERT FDKDTIDVKR LHSILTAMTE NLAFQLRRGE KLTACVSIKI RYSDFNTYSK
      301 QVKIPYCSAD HILIPKVLEL FKQLYNRRLR VRLIGVRFSH LVTGSYQINL FDDTEQALNL
      361 YHALDHIREK YGDRKVIRAS GMGAKTIGRM MNPFNGLPPV VLAHRKQ
//