LOCUS AUB35992.1 1012 aa PRT BCT 28-SEP-2021 DEFINITION Nostoc flagelliforme CCNUN1 Large exoprotein involved in heme utilization or adhesion protein. ACCESSION CP024785-1802 PROTEIN_ID AUB35992.1 SOURCE Nostoc flagelliforme CCNUN1 ORGANISM Nostoc flagelliforme CCNUN1 Bacteria; Cyanobacteria; Nostocales; Nostocaceae; Nostoc. REFERENCE 1 (bases 1 to 8363872) AUTHORS Shang,J.L., Chen,M., Hou,S., Li,T., Yang,Y.W., Li,Q., Jiang,H.B., Dai,G.Z., Zhang,Z.C., Hess,W.R. and Qiu,B.S. TITLE Genomic and transcriptomic insights into the survival of the subaerial cyanobacterium Nostoc flagelliforme in arid and exposed habitats JOURNAL Environ Microbiol 21 (2), 845-863 (2019) PUBMED 30623567 REFERENCE 2 (bases 1 to 8363872) AUTHORS Shang,J. TITLE Direct Submission JOURNAL Submitted (08-NOV-2017) College of Life Sciences, Central China Normal University, Central China Normal University, Wuhan, Hubei 430079, China COMMENT Bacteria and source DNA available from the submitter. ##Genome-Assembly-Data-START## Assembly Date :: APR-2016 Assembly Method :: HGAP3 v. v2.2.0 Expected Final Version :: yes Genome Coverage :: 228.0x Sequencing Technology :: PacBio ##Genome-Assembly-Data-END## FEATURES Qualifiers source /organism="Nostoc flagelliforme CCNUN1" /mol_type="genomic DNA" /strain="CCNUN1" /isolation_source="desert soil" /db_xref="taxon:2038116" /country="China: Sunitezuoqi" /collection_date="2005-09" protein /locus_tag="COO91_01885" /transl_table=11 BEGIN 1 MTINPSALLF NQINQNAVIQ NSSVAFAGID PAGFIGFGLR VPDGKSLLLV GGNVSMDGGE 61 LNAFGGRVEL GGLTQAGSVA LGVDGDNFSL IFPENVTRSE VSLTKAAIYV EGTGGGDIAV 121 NARNLEILGE SVLSAGIGQG LGTPETIGGD ITLNATGEIK VADRSGVINE VGLGSLGNGG 181 NITIDSGSLS LRDRALLTAS TYGQGNAGNV TVRAKNTVTL ADANIFSTVG SGGIGNGGNI 241 DINAATLSLI DDAQLSTSTY GQGNAGNVTV RTRDTVSLAD ANILSTVESG GVGKGGNIDI 301 NAATLSLING AQLLTITREA SATQPAGRGD AGNVNVNATG IVEIAGEKNG FNSGILSRVN 361 TGTVGNGGNI TIDSGSLSLR DRAQLSTSTY GQGNAGNVTV RTRDAVSLDN ASIFSTVESG 421 GIGKGGNIDI LAATLSLTDG AALTTSTRRA SATQPPGQGD AGNVNVNVTG IVEIAGEKNG 481 LRSGIFSSVG TGTVGNGGNI TIDSGSLSLR DRAQLSTSTY GQGNAGNVTV RTRDAVSLDN 541 ASIFSTVESG GIGKGGNIDI LAATLSLTDG AALIASTYGQ GNAGNVTVQA KDAVTLADAR 601 IFSSVETGGV GKGGNIDINA ATLSLTDSAS LQTLTRGASA IQPAGRGNAG NVNVNVTGSV 661 NIAGEKNGFA SGIYSRVDTG TVGNGGNITI DSGSFSLRDG ARLNAQTLGQ GNAGTIKVNT 721 ADFLTISGNS SNLNSGFFVN SQSPTGTAGD IIVTSPRITL DNSGTLNAQS ASGNGGNINL 781 QTDLLLLRRG ASISTTAGTA EAGGNGGNIT INAPSGFIVA VPSENSDITA NAYTGSGGRV 841 DIRAIGIYGI QPRSNPTSLS DITASSEFGV NGTVELNTPD IDPNSGLVNL PTVPVDTQVA 901 QTCQAGGNLA KSSFTITGRG GLPPNPGDAL NADAVQVDLV ALNPSIREGK SPLVTIKPTT 961 ATPKPIVEAT GWVMNAKGEV QLIANAPAIT PHASWQNPVS CRDPLDVSFQ SF //