LOCUS       AUB34937.1               429 aa    PRT              BCT 28-SEP-2021
DEFINITION  Nostoc flagelliforme CCNUN1 wcaI, colanic acid biosynthesis
            glycosyl transferase WcaI protein.
ACCESSION   CP024785-747
PROTEIN_ID  AUB34937.1
SOURCE      Nostoc flagelliforme CCNUN1
  ORGANISM  Nostoc flagelliforme CCNUN1
            Bacteria; Cyanobacteria; Nostocales; Nostocaceae; Nostoc.
REFERENCE   1  (bases 1 to 8363872)
  AUTHORS   Shang,J.L., Chen,M., Hou,S., Li,T., Yang,Y.W., Li,Q., Jiang,H.B.,
            Dai,G.Z., Zhang,Z.C., Hess,W.R. and Qiu,B.S.
  TITLE     Genomic and transcriptomic insights into the survival of the
            subaerial cyanobacterium Nostoc flagelliforme in arid and exposed
            habitats
  JOURNAL   Environ Microbiol 21 (2), 845-863 (2019)
   PUBMED   30623567
REFERENCE   2  (bases 1 to 8363872)
  AUTHORS   Shang,J.
  TITLE     Direct Submission
  JOURNAL   Submitted (08-NOV-2017) College of Life Sciences, Central China
            Normal University, Wuhan, Hubei 430079, China
COMMENT     Bacteria and source DNA available from the submitter.
            
            ##Genome-Assembly-Data-START##
            Assembly Date          :: APR-2016
            Assembly Method        :: HGAP3 v. 2.2.0
            Expected Final Version :: yes
            Genome Coverage        :: 228.0x
            Sequencing Technology  :: PacBio
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /organism="Nostoc flagelliforme CCNUN1"
                     /mol_type="genomic DNA"
                     /strain="CCNUN1"
                     /isolation_source="desert soil"
                     /db_xref="taxon:2038116"
                     /country="China: Sunitezuoqi"
                     /collection_date="2005-09"
     protein         /locus_tag="COO91_00779"
                     /transl_table=11
BEGIN
        1 MQILIYSYNY HPEPIGIAPL MTELAEGLVK RGHQVRVITG MPNYPQRQIY DGYRGKLYVT
       61 EHKNGVKIQR SYLRIKSKPN LIDRLLLELS FVFTSLPQSL KGERPDVILL TVPPLLVCLP
      121 ATLIAWLYNC PVVLNVQDIL PEAAVRVGLI KNKLMIQALE ALEKFAYRTA HTISVIADGF
      181 VDNLIYKGVP ANKIACIPNW VNLNFIRPLP KENNSWRATH QLDGKFVVLY SGNIALTQGL
      241 ETVIEAASRL RHINDIVFAI AGEPQALERL QKHCLACGAD NVLLLPLQPR EKLPQMLAAA
      301 DVGLIVQKSN VISFNMPSKI PLLLASGRPI VASVPAAGTA AKAIKESGGG IIVEPESADA
      361 LATAVLDLYN QPELAAQLGR KGRKFAVENY SFEQALDRYE QLFSDAIAKK ATNLDILPKL
      421 SSKESLVDI
//