LOCUS       VTR93982.1 1880 aa PRT BCT 03-FEB-2020
DEFINITION  Gemmata massiliana Uncharacterized protein OS=Blastopirellula marina DSM 3645 GN=DSM3645_15135 PE=4 SV=1 protein.
ACCESSION   LR593886-3283
PROTEIN_ID  VTR93982.1
SOURCE      Gemmata massiliana
  ORGANISM  Gemmata massiliana
            Bacteria; Planctomycetes; Planctomycetia; Gemmatales; Gemmataceae; Gemmata.
REFERENCE   1
  AUTHORS   CONSRTM Science for Life Laboratories
  JOURNAL   Submitted (14-MAY-2019) to the INSDC. DEPARTMENT OF CELL AND MOLECULAR BIOLOGY, Uppsala University, Molecular Evolution, Biomedicinskt centrum (BMC), Husargatan 3, 752 37 Uppsala, Sweden
FEATURES             Qualifiers
     source          /organism="Gemmata massiliana"
                     /chromosome="1"
                     /isolate="Soil9"
                     /mol_type="genomic DNA"
                     /isolation_source="soil"
                     /db_xref="taxon:1210884"
     protein         /locus_tag="SOIL9_37320"
                     /note="BLAST_uniprot:hit_1 ; ACCESSION=tr|A4A246|A4A246_9PLAN ; ALN/Q_length_ratio=0.228 ; DESCRIPTION=Uncharacterized protein OS=Blastopirellula marina DSM 3645 GN=DSM3645_15135 PE=4 SV=1 ; EVALUE=1e-119 ; Q/S_length_ratio=1.094"
BEGIN
        1 MARNAAGMLL CLGLGAAVMY GSTNYRTLLE RIASETAAPA TDTPNTAPAP LPPEVTAAPR
       61 AAEPVRRDPN VRPASLTDEM KPEPVKSVRL EDAQGIVWDL NKPPVDGVAT LGTAVKLKLD
      121 RTIANQLLLS TDDPAVREKP VDDVPGVKRS DENGTTFVAL DLTKLKANDG KINFELRVKS
      181 VGPLTSDPYK FTILVPSEPS DAISGQVTEY ATSPPLRPAP ISSVVGMVDV YGKPQYEPYL
      241 RLFGQVKGAA TLRFVLQPEG GVATEVDGDI GRNASGAWDA RVRLPVLNDL TKAKLFMRVE
      301 YTNARLYFTN PLPLRFNFVP ETLDPPALDV PVSAEGKTVT PASDFSTPGL PAYYSNGTKF
      361 DLKVTPPQNA QVLVAYVDNA AIFPGISVVP APGGDGKVTF QKIDVGNGRD HVIRVAAVRG
      421 SFVGTPAQIK LMVATTPPTV ESVNAQPGFG QSNGIGTEKI VIRFSSKNPI DSNFKADTFK
      481 VSHNENKAQR GNLPIGTPAY DKLTNSVTLN VGAILPGTYT VAILKNKITD VFGNALPEST
      541 EVQDGGNAFE ATVSTQAATD KAAPVDTPGT TLQTGPPVDF RIYLKPPVYD EGFNPSDRVE
      601 SRVARLYYYR DAHRVAQLIN RSVKSYNAAD VDVRRRAANR TRDDANKAED ERKRLEFLSV
      661 KAAQDARAAE AELNNLQNRV AGARGTAEQA RRSLIQKRIE IDEAKRMGRP PGEIAGLQAE
      721 AETIERALIS ADSIDRQGPA DIARAQGNVA AKREMEAKAV EAWMVKEFEE RRLRENQFRV
      781 EVSASTADPD TYAPGRPDSD DPVMRVSVSV IGEGLIQLRG PIKGLNVIRT MLNQIDAPVG
      841 QVRVSVHTLQ VNGERGDRMD KVVANIQRYL DHSRFLTAQS SQMLRNAVTS VAARKAAEAA
      901 ETLAPGWTQW DRDQKYLYAF FGKDFTDELT QLDSEFLKTG NKLLALNSMD STSLSSALFL
      961 MALAKNEVRM EIIQEFLATV QRDLPQAEAR YYMIGIATPK HCDACKDKKE YLLASNSPFE
     1021 SLVGFFNAHV TGTDTLNPLQ REFIRLAQIF KAQMITEMQL RQRVMERSLL EERVGTNYLD
     1081 ELRKAKQMED DAKKVLRALQ EQLNLASAQS QSSLNGFQVL LEKATTEIGV VSQVFAKEQN
     1141 QNVVIGQRQY TAPKSMGDKS KGEPRSWQFT SRNLRDLLDQ TGDLLNQFYY IDPDAYKRYS
     1201 DTIKLIKKAA TTGELCDEEY SDLKKGLDEL LNLLRTEAAA VQKNLDTIRG QLQGDNANPL
     1261 EAAARYRIFR EEVLNRLRQG GEFRTAAEAI FKEADPTFRG LEGTAGQYAA ALKSARDARR
     1321 PLDEKKLLDL LVDEMEDKYV EILEGMRART ANVDNYLKSV TTALDDDFNT QYYLPSFRRA
     1381 REASRFWDVT LAQIETTSVL TNNRALGKVS PAASYEFDLP KRDIAITEGF RSAKALFDEY
     1441 GALMNDPSFL ALAKLYSGNP VSMVQGGGGD LAAIRSVLPG LTTTPDERIM AQAGPGQKQF
     1501 GTALDGLIPD PAIYKFETGT GYEIRPVLSP DGQAVVFKFD YLYTTDVREP VRADEKHLGR
     1561 VKRHLIHTDV QLGNFELREV SKYMVALKVA RTGRGVQFLQ DLPGVGILFR PLPSASSSLQ
     1621 QNLIYSQATI FPTLFDLMGL RYAPAIADLD PLADRMAEFA ARYRRLDVEQ RLYDIAATRV
     1681 DDAVRTPYGE RRYDYYRSQM TLPWRHPNGS TGVGLRLHDG ILREGYDPTI KFPETQFAPG
     1741 FSPEGRPKPY NPNTPYGPPA FDPPYTIPQP PPGVPYPGAT GVTGPRVPTP VKMPPGSSGT
     1801 FVPGYGPVGV GKPVPITPTH PTTPSTRPTE LPGTSQPPII TTGPPVMSQP PASPQTPLAP
     1861 VRPIPQPTVP ALSPPPTGGK
//