LOCUS AUB36150.1 1568 aa PRT BCT 28-SEP-2021 DEFINITION Nostoc flagelliforme CCNUN1 Phage-related tail protein protein. ACCESSION CP024785-1960 PROTEIN_ID AUB36150.1 SOURCE Nostoc flagelliforme CCNUN1 ORGANISM Nostoc flagelliforme CCNUN1 Bacteria; Cyanobacteria; Nostocales; Nostocaceae; Nostoc. REFERENCE 1 (bases 1 to 8363872) AUTHORS Shang,J.L., Chen,M., Hou,S., Li,T., Yang,Y.W., Li,Q., Jiang,H.B., Dai,G.Z., Zhang,Z.C., Hess,W.R. and Qiu,B.S. TITLE Genomic and transcriptomic insights into the survival of the subaerial cyanobacterium Nostoc flagelliforme in arid and exposed habitats JOURNAL Environ Microbiol 21 (2), 845-863 (2019) PUBMED 30623567 REFERENCE 2 (bases 1 to 8363872) AUTHORS Shang,J. TITLE Direct Submission JOURNAL Submitted (08-NOV-2017) College of Life Sciences, Central China Normal University, Central China Normal University, Wuhan, Hubei 430079, China COMMENT Bacteria and source DNA available from the submitter. ##Genome-Assembly-Data-START## Assembly Date :: APR-2016 Assembly Method :: HGAP3 v. v2.2.0 Expected Final Version :: yes Genome Coverage :: 228.0x Sequencing Technology :: PacBio ##Genome-Assembly-Data-END## FEATURES Qualifiers source /organism="Nostoc flagelliforme CCNUN1" /mol_type="genomic DNA" /strain="CCNUN1" /isolation_source="desert soil" /db_xref="taxon:2038116" /country="China: Sunitezuoqi" /collection_date="2005-09" protein /locus_tag="COO91_02051" /transl_table=11 BEGIN 1 MSLVGSAEVR VALNRAQLER DVTQVETLIR NLGTRQAVIQ LNATSARQAL ADITTRAERL 61 AGQISRAQAV GVDTRAARER LEELARLSER IGNRIRQQES LNINSSRARQ ELTELRQQAE 121 RLSGRIREAN RVSVDTSGAR RALQLLGEEA QRTTRDLGQG LIKGLSGVES FESLGRGIGQ 181 NIGSGIRQGL TSAISSVVDT TRNIIGSSLN ASRQFSGSIR SFAALSGDDP NSAAIKGVRE 241 EVEKLAIATT KTPQQIAGAA IELTKLGFGA KETKKELAGL VQLSEATGSS IEKAASIVGA 301 TNNVFQRSAK DIADIVAATA NSTAADANDF LQAVSKAGGV AKSNNQDLET LATAFGLIRN 361 AGFEAEAAAT AVKTAINRLA APNPKGQEAL TQLGVQIRDE TTKEMRNLIQ LVPDFRAALG 421 KVDPGTRSKL TKTIFGDEGG PAFLALLATS QEKIDSTYQT IRNSSGRAAE TSEKLVKGLD 481 GELKRFEGSV GLLQVRLGDA FAPAAESVVA FGNKVTNNLL TTEGLFGSIT SAAQEFGDEL 541 ANNSELAEQV ESALGLAIEE LSKQGITIIK EFTETLRENP RLLADMVSGT AELVKFLAQA 601 AKFVTDIASG LSAGKRELDI LYSVGGTEGE GSRQAIRGMG GTQQEVSEFD KELEKRLTAA 661 NLPTQDFGII GPARGRYDKI VGDTAAVFQD RIIARNREQQ KQKATEDAAA QSDAARIPTL 721 AASAKAAATE AAKPPVVKAA QTTAKDALKQ EASAASKAKA GIDSREAAGI LSVKQSQLKG 781 DIDPEQAQEK ITKIQTKANQ EELVAQQEHL KRLQGLKAKG VIDSAKYGEE ELKSTKQIST 841 LKAQILEAEL KERDNNNRRI LENLDRANKQ ATAAIAASAS ARILIVKQQL ANQEITEKQA 901 AERILKIHQD ITASDIQLLN KQLADVDKLE KSKVITAKQA TDKRIELQSQ LAAKNQELVD 961 QEIEGQKRIR DEAIQTIDDR IAGDKRRSDA AITNLDAEQK RLVDFAQKAD EITKSLIESR 1021 AGLDKAVTDA ELNSGQIAID RTNRALDARK QADEKDIEPK VKGRLEGEIK RLGFKSGDTE 1081 LDILAKRQEQ EDKLAQTKLK ALEQEQKLKR ALEEQDIRRL QYAEKQADIE ARKGLLAAKQ 1141 AQNEAKGALS TAEALAPGKE RDNAIANAKE QIAISSEGTK LAEEAVGLAK DQTAAVKEIV 1201 ESKREALKID QQSAREQFNA AEEARKSAQR EESAQAISRN PDAYIPKYLI NNQGDGIKAS 1261 APTGGDSELY LPPKSKSKND PAYAVPQKTN SLRDQLEMQP DKSFLEKYLP PPNTKAGDIL 1321 EQSKGAIPQN QAGLFFDSAP IVAEIQKLNS NITALASRPT TLQVSSPTPV TTDALQEGTD 1381 ITGTSMPAGG TSGRGWLSAI WKLLSDRFPA SVNGKIPVTT DALQEGTDIN SAVMPAGGAS 1441 GRGWLSAIWK LLSDRLMPAG TLASYLGNSN GATLKVTPGT IYAFTCSNNN SVVRYFQIFD 1501 KSTPPTSGDI PIIQFPTGTN DALLIIGQDI LGGAGITLPT AVSWGFSSTR LVYTPATASD 1561 CSATVRWS //