LOCUS       VTR96500.1   1591 aa   PRT   BCT   03-FEB-2020
DEFINITION  Gemmata massiliana Uncharacterized protein OS=Singulisphaera acidiphila (strain ATCC BAA-1392 / DSM 18658 / VKM B-2454 / MOB10) GN=Sinac_6530 PE=4 SV=1 protein.
ACCESSION   LR593886-5801
PROTEIN_ID  VTR96500.1
SOURCE      Gemmata massiliana
  ORGANISM  Gemmata massiliana
            Bacteria; Planctomycetes; Planctomycetia; Gemmatales; Gemmataceae; Gemmata.
REFERENCE   1
  AUTHORS
  CONSRTM   Science for Life Laboratories
  JOURNAL   Submitted (14-MAY-2019) to the INSDC. DEPARTMENT OF CELL AND MOLECULAR BIOLOGY, Uppsala University, Molecular Evolution, Biomedicinskt centrum (BMC), Husargatan 3, 752 37 Uppsala, Sweden
FEATURES             Qualifiers
     source
                     /organism="Gemmata massiliana"
                     /chromosome="1"
                     /isolate="Soil9"
                     /mol_type="genomic DNA"
                     /isolation_source="soil"
                     /db_xref="taxon:1210884"
     protein
                     /locus_tag="SOIL9_12140"
                     /note="BLAST_uniprot:hit_1 ; ACCESSION=tr|L0DP61|L0DP61_SINAD ; ALN/Q_length_ratio=0.334 ; DESCRIPTION=Uncharacterized protein OS=Singulisphaera acidiphila (strain ATCC BAA-1392 / DSM 18658 / VKM B-2454 / MOB10) GN=Sinac_6530 PE=4 SV=1 ; EVALUE=3e-54 ; Q/S_length_ratio=0.825"
                     /note="BLAST_uniprot:hit_2 ; ACCESSION=tr|A6CCB4|A6CCB4_9PLAN ; ALN/Q_length_ratio=0.590 ; DESCRIPTION=Uncharacterized protein OS=Planctomyces maris DSM 8797 GN=PM8797T_31980 PE=4 SV=1 ; EVALUE=2e-40 ; Q/S_length_ratio=1.338"
                     /note="BLAST_uniprot:hit_3 ; ACCESSION=tr|D5SNM6|D5SNM6_PLAL2 ; ALN/Q_length_ratio=0.377 ; DESCRIPTION=Uncharacterized protein OS=Planctomyces limnophilus (strain ATCC 43296 / DSM 3776 / IFAM 1008 / 290) GN=Plim_2314 PE=4 SV=1 ; EVALUE=2e-31 ; Q/S_length_ratio=1.036"
                     /note="BLAST_uniprot:hit_4 ; ACCESSION=tr|E8R404|E8R404_ISOPI ; ALN/Q_length_ratio=0.333 ; DESCRIPTION=Uncharacterized protein OS=Isosphaera pallida (strain ATCC 43644 / DSM 9630 / IS1B) GN=Isop_3170 PE=4 SV=1 ; EVALUE=1e-25 ; Q/S_length_ratio=0.967"
                     /note="BLAST_uniprot:hit_5 ; ACCESSION=tr|F0SL52|F0SL52_PLABD ; ALN/Q_length_ratio=0.646 ; DESCRIPTION=Uncharacterized protein OS=Planctomyces brasiliensis (strain ATCC 49424 / DSM 5305 / JCM 21570 / NBRC 103401 / IFAM 1448) GN=Plabr_3338 PE=4 SV=1 ; EVALUE=2e-25 ; Q/S_length_ratio=1.268"
                     /note="BLAST_uniprot:hit_6 ; ACCESSION=tr|A3ZU70|A3ZU70_9PLAN ; ALN/Q_length_ratio=0.276 ; DESCRIPTION=Circumsporozoite protein-putative membrane associated protein-like OS=Blastopirellula marina DSM 3645 GN=DSM3645_21569 PE=4 SV=1 ; EVALUE=6e-23 ; Q/S_length_ratio=1.234"
BEGIN
        1 MAAVLEQAEA ATKNGTRVDE QIAQATSRIR THDITFGGLV LVAFVLVYAT AMILLDKYLG
       61 LAEWLRQVAF LGFLAACAGI AYATILSPLR KKINPLYAAK RVESTIDDAK NSVTGYVDAQ
      121 QQGTLNATVK AALAHRAAKS VAAADVNKAV DHRGLLYLGG TSVALFLTLI VLFFVFRPAQ
      181 FSSLVGRAFV PFGSGDIVTR THIDVVRPDP AEATVTTGQT IVVAVHVGGK IPSADGPERV
      241 RLMIRHNQAD PNYTELPMVQ GDTSRDFELR VPDHLVQYGF WYKVAGGDYT TPEYRVTVRT
      301 LPLFTEFQAT YEYPAYLRRK TEPTNDPQLR APRGTTVTLV GRTNRDVREG TMVVEPGGVR
      361 VTGTPVADKP DSLAFKLKLT ESGSYKLSFN ATTGEHSTDS FQSRILVEAD KAPEIAINKP
      421 EEDEITAPTN GQIAIDGKVG DDFGIDTITL KMKIVSPVER PLLDRPYLNG KDKSFHRKKD
      481 NTWPTDVDYK GSVDLAQLKA DPTGLPLTLT PDMVIEFWLE ATDNCTEPKP NLGRSVAKRV
      541 RLTPPKVEPQ DKQKLDQDKA NRKDEEQKHD AQQEQKLDQE NRDPKNGNNG GQNQNQPDQK
      601 TEPKNGTNGE GTKEGPPDPT KMPPPNKDKS EPKTDNPMGG MSETGNPMGS PTPKGGMNDP
      661 SSKPMGSTEP NGTNTQNGTN PDRPMPEAPP PKSPEEKSVQ DRADKLNNAI EKEKQEGGSG
      721 KSNPSANENE RTDSAQQKKQ PPAGDMGNAT EPKPEPKQGE PNNPMQDNAP ASGKSEGKLE
      781 KPSNPAEPKP EPKQGEPKPG NDPMNKAGQK NTAPSETRDE PVGAPPGVDK EPKQSQPNPK
      841 DPNPKDPKDP NPNPDQKQDP NSGSKAKSAT QKGDEQQGGM SDSTDKKDPA ADAGSGPKPM
      901 REPTRGEDKP NQPQNQPQPA GGTKPENKQP DAGDAKPNKA PAPSENKPKP EDMMSGGTGT
      961 PESKPEGDAN QSTKPNSTGA AETKSAGNKN APMNGGNSGA DEGRDKPPPQ EGAQPSGGRD
     1021 QEPPKQKELD DNQRKELEEA ARNLTSPDEK KKQDARNKLD KAIGEDKRKE MEKLANDLQS
     1081 PDENKRAEAQ RKLDELKKQA QQQQGKPNGE NGGKPDEKQM KELEQAAKDL NSPDKNKQQE
     1141 ARDKLDKAIG EDKRKELEQL AKDLQSGDKS KQQAAQKKIE DAVKNAKGGG GDQGDQKNAP
     1201 KLDEKQMKEL EQAAKDLTSP DEKKKQDARE KLDKAIGEDK RKEMEKLAND LQSPDEKTRA
     1261 EAQRKLDELK KQAQQQQGKP NGENGGKPDE KQMKEIADAM KDLQSGDEQK KQAAQQKLDK
     1321 MVGEKNRKEA EQLMKDLQSG NKETREAAEK KLDDLKKQLE KQQAGKNGKD GKGKEPSKEE
     1381 LADLMKKAQD LQSKDKDTRE KAEKDLDNKL GKENREKLQK ELEDKKPGGD PQQDEKLKEQ
     1441 LEQMAKEPSN QSHEPTQNGP GSSPPPKGAM EEDPQNRLKT AELRLEDFER KRYDEEFQKK
     1501 QGFSEAEYKK FLDDYTKHVE NLRKEANQPA GNKPPQPGSS EPGGPILGGG GNKVAPSAKL
     1561 DSSGTGGGST VAPPGFENSK NKFQKLIQEK K
//