LOCUS BDA40561.1 2301 aa PRT PLN 05-OCT-2021 DEFINITION Coccomyxa sp. Obi hypothetical protein protein. ACCESSION AP024988-214 PROTEIN_ID BDA40561.1 SOURCE Coccomyxa sp. Obi ORGANISM Coccomyxa sp. Obi Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae; Trebouxiophyceae incertae sedis; Elliptochloris clade; Coccomyxa. REFERENCE 1 (bases 1 to 3766449) AUTHORS Harayama,S. and Ide,Y. TITLE Direct Submission JOURNAL Submitted (11-AUG-2021) to the DDBJ/EMBL/GenBank databases. Contact:Shigeaki Harayama Chuo University, Research and Development Initiative; 1-13-27 Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan REFERENCE 2 AUTHORS Harayama,S. TITLE The genome sequence of the unicellular green alga, Coccomyxa sp. strain Obi JOURNAL Unpublished (2021) COMMENT ##Genome-Assembly-Data-START## Assembly Method :: SMART Analysis v. 2.3.0. Assembly Name :: COCOBI_1.0 Genome Coverage :: 170x Sequencing Technology :: PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /chromosome="1" /db_xref="taxon:2315456" /mol_type="genomic DNA" /organism="Coccomyxa sp. Obi" protein /locus_tag="COCOBI_01-2140" /transl_table=1 intron_pos 110:1 (1/19) intron_pos 572:2 (2/19) intron_pos 613:0 (3/19) intron_pos 689:0 (4/19) intron_pos 735:1 (5/19) intron_pos 1058:1 (6/19) intron_pos 1325:2 (7/19) intron_pos 1462:2 (8/19) intron_pos 1504:1 (9/19) intron_pos 1528:0 (10/19) intron_pos 1699:0 (11/19) intron_pos 1752:2 (12/19) intron_pos 1857:0 (13/19) intron_pos 1894:2 (14/19) intron_pos 1945:1 (15/19) intron_pos 2001:0 (16/19) intron_pos 2106:1 (17/19) intron_pos 2174:0 (18/19) intron_pos 2226:0 (19/19) BEGIN 1 MDQGMTSAEQ HGSPFKENEG DAAKVCSQDS HASQRPALQE NAFQPLLTPG GPLALSPPRT 61 LRSSQKWGSL SASRILGLSP NGRPRGRSLV LSPMCSRAAP ASQGHQPQAG VEGVLPGVHM 121 QDSSSLDFVP PRAARAVPPL GRMCHSQPCP LSGPWPDQSQ GIPDNQGTPN ETGEPSEPGR 181 GHDGDVMGIQ PNAAAAGAEA ASDGRDIYAH ASPSGQQATR PDAPNSAAAE QPDGNVGQPN 241 MVVDEAGQQP AETSVSVQNP GEMEKDEDTE ARMGLDEMGG TLGDGPSLQL NLSLGLGSQD 301 DHIMPTEVEP KGFEVTARQL VATERELTGG FCPETQVEVA LQVPAPITSF DGPAQQQPPV 361 CPDTQILDCP FAPPAQAHPS PSAAQPSAPG GQRAPHQSPD APLAPHAGAT AYLLTEVQPT 421 AVPAAEVEAR AEAAAQPSVA AAPAAIHMPA QESRKASPPQ LNAHTVDAAS TAAASALVQE 481 RRDSAEEEPP GGVPGPMGSP PRSAGVQPSE QVVALELLDM PEGSKLAVEA PSPLRTPPSG 541 RPSLSLTLLS EDFVSESEGT QRAVQAADII HRYSYKCLLG DAVGGVFSSV LAADMEALAA 601 GAGRKVSRTL SPETTRWERR LVEAWRDHEL LGWDVDGPGD PETSGDPGGL TQEVDAVMCA 661 EEEISSRREE LERTGELRAV RAVREHVQVR PIAAPEAPRP ACGAQPSAAS GPGRKSHCLA 721 VEAEGAAQIT ADSVDRACSA GAVVASGKVA TDAIDEVAAV MEALVAAASA CPSDQAAEAA 781 AGQDPTAAKR LSDQATGTPA SEEQLALPDA DCAPMEIDSG DGGSACAAMQ PEDATLEPTA 841 AAVDQLAANA PAPAVPVHGK VASIVADEDD EDSPSLQLNL QLETQYPEQA QQHFSELVPD 901 SEAANMGANG TSIESEPLAP ARPPVSSTGA APGPVPLGSS GAAQQGGTPA SAGTLEGAAD 961 GGSSAAGPSH HLVPDSEAAG HSPSAAAAAP SEAAPVSTSG GVQQAERPAL EGRVPAARSQ 1021 DDVWGAILSI STQPSCGAGP AQKAPAWQPR SGPGIVAGER QANQSQPSPG ARGWQREVGS 1081 SGLRLRPVWG SAPASREATP LTGDTLLTPP KRTLLGGPPS IGLTRPAASP GSPDTPSDVA 1141 ATADVLHLYS RPLKKDIQTP ESQRKRPRSL ASESPDDMAA RHEAACGEPE AGARVGRASP 1201 QSPDPSGSFV PDTYPGIGAG CGGLDEHAGH GNVGDEPVDI QEEEFRNVLP MNEERDVQPA 1261 KRRRLSLEGG RKDRQAEPVQ ERDSAKKGLH AMAVHAPCGR ATGAKPMTRA RADELASDAR 1321 PRMTRLQEQK QKRKEEQLKA GRLHRAGQVR PISARKRRTS GVALKKLPAH PSFKRKREAA 1381 GGTDAPNAGE PADDGHQGPA AEKHAAVLDR AQGPNAAAHG MTGQDDVSEA GRNVEDQSAS 1441 GAAAPQAAGG NMRSGRVRDP YSFPETQDVR LENPARGRVG LATANDQLHI SKRSHAQGQA 1501 HVEGLAGSNL NHGAGRDGAQ AIRNPSRAEP SHGPDAPQRS GNTAEANHGQ GRWAPQRKRT 1561 ALAAERATAG AAGAADRPQR TKRQPLRFED YQQGSIAAQD TAEVQVAAPL VTGGHKAAKE 1621 AAGKAAGPSR RRLVRLSDLD TQHADNAGAH APPEDKGRPR GMSGPADVEA DGCDEDAGDL 1681 ADSAGGQEGA LVEEQPAESA ANPGAQERED VPRPALKPPK LGCSKCRYSR MGCGTCRTKA 1741 GAAGPLSASP KRISGPKSTR RASLPSEAPE PQPRSTRRAS LPQEAPEAQQ DIAAPAGRRQ 1801 SVGRRKSEPA LVHEARAGVL AGKAFLVTGF EEGAVRRRVH KLICDHGGSV IDSIPKGWRD 1861 VDGVVAGKAC DRRAKYLFAH VTGAPALKAS FLERCVSAGK WLPLRAADAW LPAREPRLVF 1921 HGFRVCLHGE RGFVQQFGAL LPHAGCVVVP EMEAPLDSPD PKRRRLLPPS CVGQPNCDLV 1981 IFDKADPLQA HSDLHNLLRQ ARRLTIPAKS VDWMIAALVS GNRPPFIPVM PLPQKRRTAP 2041 EPQQAAGNSS RPGEEAAAAQ DAAPSAEAAE DGSGALVEDS ESFGEPPFEI RPSLASVSFG 2101 GPGAAGPRSP QRAAEGHNRD GALRQLDHAA AMPGRRPAGQ CALMWLGEPS KDPPAGLNMH 2161 ASHHRTFYSA VAKGDAEMAV GDDVMVEMLG EEQPRVARLE ALWSELPVDG CERLLARCRF 2221 YYRPSETSFM SSSKPNELFA SDHVEQRLPA STLLHKCTVL CAPPGDMAAV LERYASLGPH 2281 RYFCLYRYDH LGEALKPVPA G //