LOCUS AUB41983.1 2560 aa PRT BCT 28-SEP-2021
DEFINITION Nostoc flagelliforme CCNUN1 hypothetical protein.
ACCESSION CP024785-7793
PROTEIN_ID AUB41983.1
SOURCE Nostoc flagelliforme CCNUN1
ORGANISM Nostoc flagelliforme CCNUN1
Bacteria; Cyanobacteriota; Cyanophyceae; Nostocales; Nostocaceae;
Nostoc.
REFERENCE 1 (bases 1 to 8363872)
AUTHORS Shang,J.L., Chen,M., Hou,S., Li,T., Yang,Y.W., Li,Q., Jiang,H.B.,
Dai,G.Z., Zhang,Z.C., Hess,W.R. and Qiu,B.S.
TITLE Genomic and transcriptomic insights into the survival of the
subaerial cyanobacterium Nostoc flagelliforme in arid and exposed
habitats
JOURNAL Environ Microbiol 21 (2), 845-863 (2019)
PUBMED 30623567
REFERENCE 2 (bases 1 to 8363872)
AUTHORS Shang,J.
TITLE Direct Submission
JOURNAL Submitted (08-NOV-2017) College of Life Sciences, Central China
Normal University, Wuhan, Hubei 430079, China
COMMENT Bacteria and source DNA available from the submitter.
##Genome-Assembly-Data-START##
Assembly Date :: APR-2016
Assembly Method :: HGAP3 v. 2.2.0
Expected Final Version :: yes
Genome Coverage :: 228.0x
Sequencing Technology :: PacBio
##Genome-Assembly-Data-END##
FEATURES Qualifiers
source /organism="Nostoc flagelliforme CCNUN1"
/mol_type="genomic DNA"
/strain="CCNUN1"
/isolation_source="desert soil"
/db_xref="taxon:2038116"
/geo_loc_name="China: Sunitezuoqi"
/collection_date="2005-09"
protein /locus_tag="COO91_08078"
/transl_table=11
BEGIN
1 MLKDSNRNSN NQVGQTDKSY FSPPPAISLP KGGGAIQGMG EKFAANPVTG TGSVSVPIST
61 TPGRSGFTLQ LSLSYDSGAG NGVFGLGWNL SLSEITRKTD KGLPRYWDNE ESDIFILSGA
121 EDLVPVLELD GTLDETLRDG YRIRKYRPRI EGLFARIEHW TNLEGESHWR SISKDNITTV
181 YGRTAESRIF DPNEPTHVFS WLICESYDDK GNVIRYEYKA ENSEGVDIAQ AHERNRTSES
241 RSANRYLKRI KYGNLVPRLM QPDLVETDWL FEVVFDYGDH HPTDPKPNDP GVWSVRNDPF
301 SSYRAGFEVR TYRLCQRVLM FHHFQNETDV GRDCLVRSLL FNYSYEQNPT NVRNPIYSLL
361 LSVKQTGYKR NPEGGYIHKS LPPLEFSYSE AKIDETIREV EPASLENLPQ GLDGTRYQWV
421 DLDGEGLSGI LTEQGNGWFY KRNLSPINTV KTNGSEHIEA RFAPVELVAS KPAIALSNGA
481 QFLDLAGDGQ LDVVALHSPT PGFYERTHDE NWESFIAFKS LPNLDWDNPN LKFIDLDGDG
541 HSDILITEDD CFVWYPSLAE DGFGAAKRVH QPWDEEQGPR VVFADSTESI HLADMSGDGL
601 TDIVRIRNGE VCYWPNLGYG RFGAKVTMDN APWFDAQDIF NQRRIVLADI DGSGTTDILY
661 LGGKGVQVYF NQSGNGWVAQ GTLRHFPAID NVASVTIIDL LGNGTACLVW SSPLLGNAQR
721 VMRYIDLMGG QKPHLLIKTV NNMGAETVVQ YAPSTKFYLQ DKFAGKPWIT KLPFPVHVVE
781 RVETYDRISR NRFVTRHAYH HGYFDGVERE FRGFGMVEQW DTEEIGTIQP GEASSDSTNL
841 DAASFVPPVH TKTWFHTGIY IDREHISSFF AQTEYYREPQ YRILPDTTEA QQKIIEARFQ
901 ASLLPDTILP AGLTAEEERE AARSLKGHIL RQEIYALDGS DKQPHPYSVS ERNSEIRLEQ
961 ALQTNRHAVF FTHDRETIDY HYERNPADPR VTHAMTLEVD EFGNGLKSVA IAYPRRQLAS
1021 PDNRYDEQKK LFITYSETQV TNKADETDWY LIGVPIESRT YEITGVTPQA GVGEASPQET
1081 RFSVSDFYDR TTNGEISGYI TAPEIPYEAT PTLNILQKRL IERVRTLYRP NTEADTTDPT
1141 SLLLGKVESL ALPYESYKLA FTPELLTLVY GDRVNPNLLT DEGKYVFLDG AWWIPSGRQA
1201 FKPDWFYLPY KSRDPFGNLS QITYDEYHLL MESTVDPLGN TVQAENNYRV LQAQKIIDQN
1261 ANRAEVAFDA LGMVVGTAVR GKENENKGDS LDGFEPDLDE ATILEHIQNP LVNPQAILGK
1321 ATTRLVYDLW AYYRTKQSTP NGEEHGQPDV VYTLARKQHD ADLVTGELTE IQHSFLYSDG
1381 FGREIQTKIQ AEPGLIDDND PTPINPRWVG TGWKIYNNKG KPIKQYEPFF SATHGFEYAK
1441 QLGVSSTLFY DPLERVIATL HPNHTYEKVV FDPWQQVTWD VNDTAAQKDV NGNLIIDPKH
1501 DSEVGNFFSA LDEGDYLPTW YVSRKDGQLG TAEKDAATKA AAHANTPSVA HLDSLGRTFL
1561 TIADNGLVNG LPQLYETHVE LDIEGNQLIV TDALNRQVMR SVLLIKDAQG NIVSQVNAFD
1621 MLSHQLYSHS MDAGERWTLN NVAEKPIRGW DSRNHEVRHT YDRLQRPTQL WVRQGTDPEV
1681 LAECIVYGDS ADSGLTLIET QAANLRGQVY QHYDGAGVIT SVGFDFKGNL LSSRRQLATE
1741 YKQQVNWVGL ADLTNVADIA NAAALLLETE IFTSSTEYDA LNRPTRLITP DHSKIRPTYN
1801 EANLLERVEV ELRGAGVWTT FVNDIDYDAK GQRQLIEYGS GVRTEYTYDD KTFRLKTTKT
1861 GDNARLQNLS YTYDPVGNIT TIQDNAQQTV FFDNAVVSPS TEYVYDALYR LIQADGREHA
1921 GQAVNPQSEY QPENKPHYDY NDFTRRNLPH PNDGQAMRNY RQLYEYDSVG NILNLIHQVN
1981 GTTLWKRRYD YATDSNRLLS TSLPSDLDTQ PLPVRYFYDE HGSMTQMPHL LQMQWDFKDQ
2041 LQRVDLGGGG TAYYVYDASG QRVRKVVEKS VGLTEERIYL GSYEIFRRRN GNGLKLERET
2101 LHIMDDLRRI ALVETKTVDV DTPVVSPTPL IRYQFDNHLG SASLELDKDG QVISYEEYYP
2161 YGNTSYQAGR SVAEVSLKRY RYTGKERDEE SGLYYHGARY YAAWLGRWTS CDPAGMVDGV
2221 NLYSYTQNNP IKFTDYTGTE TELIPGVPNS QLVKYAELVN ELKKNPVVRD QQALDLKIQK
2281 LFGEENRKVA AQVPCYKDPQ QQQREELNAR QKLDDEAKKN IYNGPTIRKP PPPSRFELGI
2341 QAIKIAQTTP LAGLGFTTAV VILDQHDPKT VLAATQFAGT IGNIAGGVAA THTAKSEFKN
2401 LGASAQPTKP EAAEVAPKAT GGTTPATPVA PPPALAQARS NLLQTLRQRL LSYDPKRQQF
2461 IPSEFKIAGI MESRFGPVSR DSSGATDWIT GGRRTIDAVG PVPSEHFNLE SFTTQITKHL
2521 SKTDIVPIDL SGLTSGQIND VQNFLKGLSA PQRKQIIIIP
//