LOCUS       BDA40905.1 2136 aa PRT PLN 05-OCT-2021
DEFINITION  Coccomyxa sp. Obi probable HEAT repeat-containing protein 1 protein.
ACCESSION   AP024988-558
PROTEIN_ID  BDA40905.1
SOURCE      Coccomyxa sp. Obi
  ORGANISM  Coccomyxa sp. Obi
            Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes;
            Trebouxiophyceae; Trebouxiophyceae incertae sedis;
            Elliptochloris clade; Coccomyxa.
REFERENCE   1 (bases 1 to 3766449)
  AUTHORS   Harayama,S. and Ide,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (11-AUG-2021) to the DDBJ/EMBL/GenBank databases.
            Contact:Shigeaki Harayama
            Chuo University, Research and Development Initiative; 1-13-27 Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan
REFERENCE   2
  AUTHORS   Harayama,S.
  TITLE     The genome sequence of the unicellular green alga, Coccomyxa sp. strain Obi
  JOURNAL   Unpublished (2021)
COMMENT     ##Genome-Assembly-Data-START##
            Assembly Method :: SMART Analysis v. 2.3.0.
            Assembly Name :: COCOBI_1.0
            Genome Coverage :: 170x
            Sequencing Technology :: PacBio RSII
            ##Genome-Assembly-Data-END##
FEATURES             Qualifiers
     source          /chromosome="1"
                     /db_xref="taxon:2315456"
                     /mol_type="genomic DNA"
                     /organism="Coccomyxa sp.
Obi" protein /locus_tag="COCOBI_01-5590" /transl_table=1 intron_pos 50:1 (1/25) intron_pos 121:2 (2/25) intron_pos 218:0 (3/25) intron_pos 266:1 (4/25) intron_pos 307:0 (5/25) intron_pos 390:2 (6/25) intron_pos 475:0 (7/25) intron_pos 559:0 (8/25) intron_pos 613:1 (9/25) intron_pos 694:1 (10/25) intron_pos 832:0 (11/25) intron_pos 963:2 (12/25) intron_pos 1028:0 (13/25) intron_pos 1115:0 (14/25) intron_pos 1236:2 (15/25) intron_pos 1310:0 (16/25) intron_pos 1381:0 (17/25) intron_pos 1420:0 (18/25) intron_pos 1489:0 (19/25) intron_pos 1554:2 (20/25) intron_pos 1686:0 (21/25) intron_pos 1779:0 (22/25) intron_pos 1874:1 (23/25) intron_pos 1958:0 (24/25) intron_pos 2057:0 (25/25) BEGIN 1 MTSVLAQQLQ ALGKPGISLP RGKGRPSLLF EAHQAADIDL QTIYEIALQG LVELCQADDS 61 LKPFLNTLFR RDRVGLEREH QNKAINEKMD QSVTGFLTLL SRHFLKDAAT KVLEFLIRQY 121 RVNETNVSAL MSCALPYHST LQFVRLVQTM RLDGLWTFLA PMQQSGAPLP RSTLVQRCCN 181 DAALLQFVCQ AAQQAASPAW LSFYGVVVCE VIAAVPKVTE DAIAQLLPFM LEGLKGSAPK 241 EQRMATYMIV MQLIAKSTPA PQLFNVLCVE VAKGAKEGPT GPCLLVLTGL LKSQAGHASL 301 PPKTLKHIMR LPHLLDELSA KHKQMAPLLQ SLLSSLVGNR DNAEACHKLL QDLIAGVPMA 361 EHAEAAAIQL LRTLQTATQE ELPGMQQTMR TLDVCYPIQV DAAVNKALEP LAKNKKGDAA 421 KRLGGDVESV EKAKKVLEFV TGALDKRSAR MPLPDASTTL AHAIDAPSAE LRRTAVMQLE 481 KSADAENGEE DDWAMEVLLR RLADDDAGVV NAVLDSPLLM SVPAAALFDG LSSALQRATA 541 VLRSAGGDKA EARGIARKVL KLLAGPFLRE HAVYQQRIIV LLLDHVIILP TQRKLSLAAL 601 KALKKQDVPE LSGLHSVDAE LPEKSPKKAA EATSDNSSAD YVHNRKIIAA IAQGLHGHRD 661 AEGVVAFATA GHEPAQHVLL LALCHACSEA AEKGSSAAGV ILRILLNDPA ALSSNAAAEL 721 PDVPAHLTDD GLPTEEHFSS LATNPLDTHG ALLRHSLLVA LQHASPAVLE QCAQGPHALF 781 QLIASLSPAH VYDAHLCALV DTVAKQISPA AFLSDIFGHQ RADESDSSNV QVRALDLLRS 841 RYSVPRKAHK AADATAVFHT VLPRLLAAIS SPQRPVRAAA LAALQSAVAN IYGSHAKGTL 901 TQPHLAELVS AITQHRDMFE ADPGALAASL RYALQHAQNA GPASAKKGAK RGKASKAGAS 961 DKRSLFSLSN AAATALAEYF LAVLPEQHGE NGAPAALLLL HSIDTSANAS AALRVGSLLL 1021 SRYVAHKVDA SEQQLMRELL GLYTPDALAG LAEQQEAPKF SHNPVDILLS ILARQPTDAE 1081 AERSVNSMRH LALSRLSAAL YAAIPADRQV QLLLAVLHAS 
SSDPDEACRA AAHHALANAP 1141 VSAAALQNIL SMEPSAPAVT PAKKARAAKP AESEGRALAS LDNCIAALEL LQWKANVVDG 1201 AELVGTLSSL LPQLLAISNE AAPAPADDTS EDIPSRSVAA GYGLQLALTA LQLVVTRHAQ 1261 EGAADSVDLQ AVLECARQAP NSMARTAALT LLAQLASVLP RSTLEHVVEV VTAQASTLGQ 1321 DADAVSAEAS ARVLRAFACA WVAAGHVESE LIHLVVRASQ GVPARSRLPL LTALSAPLPS 1381 VTALQSICIA LLELAVQSPG GDEGDEEFWL STAVDMVGKV AWETRAQAFQ GMLEECVQEA 1441 KSADLQAAAL RFIADQVAAL TESSANALDA PTRQALTHLM QSATNKLQAS EGEVEGALTA 1501 LLGALEHAMP AADYLKGLVT LTQHDNQALR RRALRLFISR MDSLQAGPQE HQEGAAAEAA 1561 SSAAAEAGAA VAEQLPRLLA EDQSAGVRQA ALLSLSSVAR ACGREHPDLV LAALPSALAA 1621 TKDAQRPVRS SALATVAACA AALGTRSLPQ LVPLVTAVLS AVETAGNALT AAEGAQPEQA 1681 EAHDADEERR RSGAAAEALH VEAASALAAL DALAAGMGAF LAPYLPRVLA LVLQRSILAC 1741 RASQVAATAS HAWRVLASAI PPRLLLPPLF AQLNPALQGG AVPARALLQM VERAVAAMQP 1801 ATAAAHADAL FGFLQQALDV RQRDGISAAD ADAVERAATA TLVAATMKLS EKQFKPLFLR 1861 LLAWASTPPA GQPDVQPLGR QAVLYGVIYA LTERLRSVFV PYFRYVMDGA IALLAGQAGD 1921 ASQPKKKRKK SKTLEPISDV AAADAGAALN TWLLRFRVLQ SVRACFQYDS VQFVDEDLFR 1981 RLLPALVAQL AAFPPPDVLP MLAANEDASL APVGGADGAA MWRAQDSAYG RAVVTALVEL 2041 AVSCGSDTLW KPFNDKVWKR AMQSEVPMER SRALAVIHAF VDRVREDWSF LIGESVMPFS 2101 EIAEVTDSSL QAQTKQLRDL ILDITGEDMY ELSKIS //