LOCUS VTR91519.1 873 aa PRT BCT 03-FEB-2020
DEFINITION Gemmata massiliana Uncharacterized protein OS=Candidatus
Entotheonella sp. TSY2 GN=ETSY2_22110 PE=4 SV=1 protein.
ACCESSION LR593886-820
PROTEIN_ID VTR91519.1
SOURCE Gemmata massiliana
ORGANISM Gemmata massiliana
Bacteria; Planctomycetes; Planctomycetia; Gemmatales; Gemmataceae;
Gemmata.
REFERENCE 1
AUTHORS
CONSRTM Science for Life Laboratories
JOURNAL Submitted (14-MAY-2019) to the INSDC. DEPARTMENT OF CELL AND
MOLECULAR BIOLOGY, Uppsala University, Molecular Evolution,
Biomedicinskt centrum (BMC), Husargatan 3, 752 37 Uppsala, Sweden
FEATURES Qualifiers
source /organism="Gemmata massiliana"
/chromosome="1"
/isolate="Soil9"
/mol_type="genomic DNA"
/isolation_source="soil"
/db_xref="taxon:1210884"
protein /locus_tag="SOIL9_61950"
/note="BLAST_uniprot:hit_1 ;
ACCESSION=tr|W4M6L4|W4M6L4_9DELT ;
ALN/Q_length_ratio=0.537 ; DESCRIPTION=Uncharacterized
protein OS=Candidatus Entotheonella sp. TSY2
GN=ETSY2_22110 PE=4 SV=1 ; EVALUE=5e-50 ;
Q/S_length_ratio=1.267"
/note="BLAST_uniprot:hit_2 ;
ACCESSION=tr|W4LK30|W4LK30_9DELT ;
ALN/Q_length_ratio=0.549 ; DESCRIPTION=Uncharacterized
protein OS=Candidatus Entotheonella sp. TSY1
GN=ETSY1_21580 PE=4 SV=1 ; EVALUE=5e-48 ;
Q/S_length_ratio=1.267"
BEGIN
1 MNATALAPEP TLTGRTRDVL KRAGNPFRNY FARNPDDEVC ARFHVPELFA AERDLLHAII
61 DLYRYDPQTH SEVVPILGNK GAGKTHLLHS IKHGVGGQWQ LLVTPGVYQR DSDFLEYLLF
121 QIIDTLLGGG KQKGVRPLDF IGDQLVRRQL GVALRELSDE EKVELFPPPG LGRWARRLGL
181 GTQQARERAE WLAENLSGYS NFARMPTPIA QALTDAGLTP QKAFDLVCTH IQKNEAHNTA
241 GLMRRHIFQG FAKAALLRDE SELANFLTYG FAELEFHVRP TRQDLVLALF KVLTEVFRSL
301 KTPVVVAFDQ LEDLLLARRT DDAHRTAEAF FAGIVQVMHQ IDGLCFLIFA ERGLWNRFVP
361 SLDGYIQDRL NNPVHVPKHG TVKAIRLEAP PADLVRRVVE ARLRSCLGEL PAGESVSEIF
421 PFVDEQITRI ARTEPTLRDM LQQFRHLFDH VVYGPDDAQA PVARVSEPTP VKAVEVELIQ
481 PDAAPELPAG RFDVTAEIAA MPALPAPVIE PKLPELPVSY DPVSRIAALL GRDEDDVLPP
541 LPTLTIKHVE VIEAPAEEPP VAEVVSPAPL LMLPPAVTDD VPMAVAVDLE EIVEESITSS
601 ELVVEVQEPP ALPVVQAEAP TVVAAAPEVP AVAPFPSAVP ANVSPATVVK PAVSVAATAP
661 AVARDSHAAL VELWEQEQRA ARRKLEPEGA LTGATRELQA GLGAFLSVCH EHGVKVGPWR
721 LQHVVNEWSY GEHPTYGVVT IAHWACKDAQ PWRMGLGLFL ARGAGKPKDL EVKLAVLDTE
781 PAVVDLLVLL RPEDDIATTG KSKTLLQDAE RRGKHTRLEP VSLDGFAQMY AFPRWLAAVR
841 ESLPEGAPLP NLADIIQEKG EKLLEQVCMP VQG
//