LOCUS AUB34447.1 1460 aa PRT BCT 28-SEP-2021 DEFINITION Nostoc flagelliforme CCNUN1 Large exoprotein involved in heme utilization or adhesion protein. ACCESSION CP024785-257 PROTEIN_ID AUB34447.1 SOURCE Nostoc flagelliforme CCNUN1 ORGANISM Nostoc flagelliforme CCNUN1 Bacteria; Cyanobacteria; Nostocales; Nostocaceae; Nostoc. REFERENCE 1 (bases 1 to 8363872) AUTHORS Shang,J.L., Chen,M., Hou,S., Li,T., Yang,Y.W., Li,Q., Jiang,H.B., Dai,G.Z., Zhang,Z.C., Hess,W.R. and Qiu,B.S. TITLE Genomic and transcriptomic insights into the survival of the subaerial cyanobacterium Nostoc flagelliforme in arid and exposed habitats JOURNAL Environ Microbiol 21 (2), 845-863 (2019) PUBMED 30623567 REFERENCE 2 (bases 1 to 8363872) AUTHORS Shang,J. TITLE Direct Submission JOURNAL Submitted (08-NOV-2017) College of Life Sciences, Central China Normal University, Central China Normal University, Wuhan, Hubei 430079, China COMMENT Bacteria and source DNA available from the submitter. ##Genome-Assembly-Data-START## Assembly Date :: APR-2016 Assembly Method :: HGAP3 v. v2.2.0 Expected Final Version :: yes Genome Coverage :: 228.0x Sequencing Technology :: PacBio ##Genome-Assembly-Data-END## FEATURES Qualifiers source /organism="Nostoc flagelliforme CCNUN1" /mol_type="genomic DNA" /strain="CCNUN1" /isolation_source="desert soil" /db_xref="taxon:2038116" /country="China: Sunitezuoqi" /collection_date="2005-09" protein /locus_tag="COO91_00267" /transl_table=11 BEGIN 1 MSFRAMRLDW LQGLGIAIAS AIALYANISV AQIIPDGTLP NNSNVTLENN TFKITGGSQA 61 RGNLFHSFKD FSVSTGSEAF FNNAADIQNI ISRVTGKSIS NIDGLIRANG TANLFLINPN 121 GIIFGQNARL NIGGSFFATS ANSMKFADGF EFSAKKPQST PLLTINVPIG LQFGSNTGGV 181 LVRGSSLQVN PGRSLTLVGG NVSMDGGKLV APSGRVELGG VTGENTVGLF PNRDFLGLNF 241 PQGVPQADVS LTNQAEVNVL ASSGGSIAIN AANLNISEGS QLITGISDVG LSKTPAGDIK 301 INATGIVAIA DSSNISNQVL ENAVGNSGNI NINADSLSLS NNSFLAASTN GQGNAGNVTI 361 NTNSLSVFES SFLAASANGE GDAGNITINA RDTVSFNKES HAYTDVTVKG NGGDIKITTG 421 SLFVTDDSFL ASSTKGDGNS GNIIINARDA ISLDRSWAYT DVGENGNGQA GNFNIASDSL 481 SLTNGSQLDA STKNDGNAGN ITIDTGLLSV SDNSFLAASA NGQQGNAGNI TIDARDRILF 541 DRAGQVYTNV TGTGKGGDLN ITTGSLSVVE GSYLAASANG QQGNAGNITI NARDTVSFAK 601 GSQAYTDVTV KGNGGDIKIT TGSLFVTDDS FLASSTRGEG NSGNITINAR GAISFDKGSA 661 YTDVKENGNG QAGNFNITSD SLSLTNSARL IASTKNEGNA GNITINTGVF SVTNESFLAA 721 SANGQQGNAG NITINARDRV LFDLGSQAYT NVTTKGKGGD ISISTGLLTV TNSSKLNAST 781 SGVGDAGNIT FSTGSFSVTN DSQVTASTSG VGNAGNITIN TNSLFATNES FLAASANGRG 841 NAGNITINAR DHISFDYGAK VYTDVNQLPF GDTTEAEGNG GDIKITSESL SLTNGAQLIS 901 NTQGEGNAGS IIIDTGILSV TSKSFLAASA NGQNGDAGNI TINAGARVLF DQGSAFSNVG 961 ICSLIAVGCT LTNNTIKGNG GNIRITAPSL QLINGSFLAT GVGDDNNLEK KVEGNAGGIS 1021 IYLRDGLSLD KSFIATTLFA NGKGEAGDID IQAGFISSYQ SLISASTQGQ GNAGGVSVQT 1081 KGAISLADSD ISTAVQKGAT GDAQGINITA RSLLLTDGAQ LNTVTSGEGK AGNILINTTD 1141 KVNVSGTNTT VAPTDLFQNN IPPRFNSPPQ FVDAVSSGIF SSTNSSGVGG NITVNTNTFS 1201 ISKSAVVDAR TTASGTGGAI NINTNTIDAI SGGQLTAITS GSGRAGSITL NATDSATITD 1261 SDLTYSARLA QFGADNIDIY GKPKVGNQGA ASRLTVSSVG SGSAGDLTVQ ANSIRLDNSA 1321 KISADTTGGG GDIFLNSPLL LLRRGSSITT NASGNEITGG NITIDNKNGF IIAVPNENSD 1381 IRADSANFRG GNVTIKNIAG IFGIQSRNKP SPNTNDITAT GATPNLSGNV QINTPDVNPS 1441 NGLVELPTNL VDASSACRQK //