LOCUS       BDA40738.1  2069 aa  PRT  PLN  05-OCT-2021
DEFINITION  Coccomyxa sp. Obi probable Bromodomain-containing protein bet-1 at
            N-terminal half protein.
ACCESSION   AP024988-391
PROTEIN_ID  BDA40738.1
SOURCE      Coccomyxa sp. Obi
  ORGANISM  Coccomyxa sp. Obi
            Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes;
            Trebouxiophyceae; Trebouxiophyceae incertae sedis;
            Elliptochloris clade; Coccomyxa.
REFERENCE   1  (bases 1 to 3766449)
  AUTHORS   Harayama,S. and Ide,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-AUG-2021) to the DDBJ/EMBL/GenBank databases.
            Contact:Shigeaki Harayama
            Chuo University, Research and Development Initiative; 1-13-27
            Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan
REFERENCE   2
  AUTHORS   Harayama,S.
  TITLE     The genome sequence of the unicellular green alga, Coccomyxa sp.
            strain Obi
  JOURNAL   Unpublished (2021)
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method       :: SMART Analysis v. 2.3.0.
            Assembly Name         :: COCOBI_1.0
            Genome Coverage       :: 170x
            Sequencing Technology :: PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /chromosome="1"
                     /db_xref="taxon:2315456"
                     /mol_type="genomic DNA"
                     /organism="Coccomyxa sp.
                     Obi"
     protein         /locus_tag="COCOBI_01-3920"
                     /transl_table=1
     intron_pos      159:1 (1/27)
     intron_pos      210:1 (2/27)
     intron_pos      244:2 (3/27)
     intron_pos      316:0 (4/27)
     intron_pos      392:0 (5/27)
     intron_pos      469:0 (6/27)
     intron_pos      529:0 (7/27)
     intron_pos      561:0 (8/27)
     intron_pos      640:0 (9/27)
     intron_pos      697:0 (10/27)
     intron_pos      797:1 (11/27)
     intron_pos      845:1 (12/27)
     intron_pos      910:0 (13/27)
     intron_pos      984:0 (14/27)
     intron_pos      1068:1 (15/27)
     intron_pos      1133:2 (16/27)
     intron_pos      1192:0 (17/27)
     intron_pos      1230:1 (18/27)
     intron_pos      1277:1 (19/27)
     intron_pos      1320:2 (20/27)
     intron_pos      1402:0 (21/27)
     intron_pos      1466:0 (22/27)
     intron_pos      1562:1 (23/27)
     intron_pos      1605:0 (24/27)
     intron_pos      1670:0 (25/27)
     intron_pos      1728:0 (26/27)
     intron_pos      1858:1 (27/27)
BEGIN
        1 MDVSDPMEGL AQAKKVLSQI MRLKTSKLLF NEPVDAEALG LDDYFDKVKQ PMDFGTIMGR
       61 LSQGVNSYKR PSQVLRDVNL VFQNCFTYND SEADTVTREL CSEVKNAFNK RWTEAGLSLD
      121 SSEEFRQETP PSATKTAKNG AIAWSSEAAV PPELSYEEEG TELPVRHLDN FQLCTSSNLS
      181 NLVPLEDVEG AAEEIIAQGE VIPADKVKPG SKRDKIFVTT GPVLDWTIDR SNPLSVWIVT
      241 GSSWYRLHTP SPAYASTYTS AQQKVEMCAV TLQMMDRNGK LTAGAAAMQA AAELGLDEDH
      301 ADIAFAEEQV STWTEGDAPS EAAGKGRKRK AEAGAPADKH AADEPPSTAE NGDEGAADDL
      361 DPDDEDSEQD DGAIVRKKKK RRRRRKSGSG LENGGTPLGR KGKLALQREA AAAEAAAEAR
      421 TRSKREREAD TARGPPPDPA RSFRLPQPLL PDLLTVWELL QMLAPVLQTP YLPFWRLEAA
      481 VCPGPMKAGV PIVPGYEEPG SGKGAEDVKA PVEATRKSKR TREDELEEEV ETPAATPIKL
      541 RIKMSAGKHK KTPGSAMPST GVAYIPPTDD AGLASGLVLR EVHSALLRGA DGKGLSSDAS
      601 DAPRTAKLAA ADAAAAAWTG RVSKAILDAP PFAVDDDARE AAVHLAYAEY DDLSVLERIA
      661 ILRGLSALAL SADAVRDYIS ARLEAMPAPA PPRLKKKEEG EGKEGAKEDE AAPAEPAKST
      721 AAAASAGPQK KAAAGTASVA GSAEEWEQWM EASRVGVRRP LGTDSHKRCY WALGGRASAW
      781 RVYVEDKDSS LWGWYEGDKL VELVEWLRAG AIEREAPLLD ALAQAPLPRK AGAPSHNQNI
      841 AAPAGEEGAA AVLTAAELEA RRWDGYRGLV APQLRGEGSW PRASLQAAFE LRVQAAVDAM
      901 LMRVPFWFTG KDRVEQILGA YEAMGLAKTA SEMAKALLKV EVLLTAAGAM GTAWYKHWQS
      961 NWQRAASYCV DWKHALLLTA TLQVHQCRDR GAFSRPAFMR IASEEHCQLY FPEHGDQVVL
     1021 LRTGLQHHLK KYMQTLALVP LDTPPPPSPG QQASPPPQPS SPAPDMQDDD EGAEDAGEED
     1081 EQPKAVQVPQ QLELPLPENI KEEVKEQWGS LAAAVESLRP IERFRVVGLA YRRTMLDGSA
     1141 AGPLLDDLSE EKLLAGGAAW PCTWILLQPS TKGPRHQSPT QPIAVPLRID NALPDYMVKA
     1201 ELFDKGMHRM WSPGDRFRMY FGGRAGSKQG GVYYRGVVSE VFQEQPELEA PEDQWAAYDP
     1261 WEAVEVAWDG KSGGHSGQLE CVNPWELETD PDFERTEGER QREQEAEARA LRAQAIALRR
     1321 EAEERAEAEK LALERAAQEK AAMEKAIAER EERARAAAAA PRRASAAAAA AASPAESGDE
     1381 SGPPLYMPVQ HRPGDHGAVP SDVLDMLKPL GRERFMTLLT NWHRGVHGKF KVPIFAHQEL
     1441 DLYRVFWSVM DKGGYEIVSA NKQWKEVCRC LGVDLRGQTS ASFNMRQNYE RCLFEFEDYL
     1501 STGAYVADIN AGRAPSADAI LPFQPRHPIA ALPVPGVAPT RHTPVPPRAT TPPERTVTTR
     1561 GGEGSSRRVS YRDMLVHDDE EDADMDVDDD DSPEATPPPV VRRPGRKPMG GRVATRRFGP
     1621 ALHAQGHNWV GKLVLRYWPD QGGWWEAKIG EYSTSRSKHR LIYDADTDKE SFEWVDFREL
     1681 TDEEIKPHSL HPLPLPKPAT MPVPLPMTPL TVAPVIVPPV PGTTPAQSNA MNAIVTALRA
     1741 GLERETLLRM VDAASTALQL SEEQNRQPAG PPAAQLPGAQ SAPQDAAAAP GGAMSRGLSG
     1801 AASAAELDGG DTEEAGRTGD GDSAAPGDAA AEERPRSGAG RISLRIKMKQ GGDASSSEAQ
     1861 GQERGARKRR LEVADGTPEP EEEEQRHPPR KRHRLANASQ STSPAPQAAE PETAPPSADG
     1921 EAAAGGGSHD VGLVHGNGAV EAMDEEDEVA SPRNACMQPE SDMAEAAVDG SDAGSGEDSA
     1981 RLTDAAEAAS EEAGPKQVAD QDGAEVEVEV GAAGSSDEDS AVADENHADE QHLSGDEGAE
     2041 NDANASLPAD DGLETEASGE AIGVAQEEG
//