LOCUS BDA40778.1 906 aa PRT PLN 05-OCT-2021 DEFINITION Coccomyxa sp. Obi probable DNA mismatch repair protein MSH6 at C-terminar half protein. ACCESSION AP024988-431 PROTEIN_ID BDA40778.1 SOURCE Coccomyxa sp. Obi ORGANISM Coccomyxa sp. Obi Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae; Trebouxiophyceae incertae sedis; Elliptochloris clade; Coccomyxa. REFERENCE 1 (bases 1 to 3766449) AUTHORS Harayama,S. and Ide,Y. TITLE Direct Submission JOURNAL Submitted (11-AUG-2021) to the DDBJ/EMBL/GenBank databases. Contact:Shigeaki Harayama Chuo University, Research and Development Initiative; 1-13-27 Kasuga, Bunkyo-ku, Tokyo 112-8551, Japan REFERENCE 2 AUTHORS Harayama,S. TITLE The genome sequence of the unicellular green alga, Coccomyxa sp. strain Obi JOURNAL Unpublished (2021) COMMENT ##Genome-Assembly-Data-START## Assembly Method :: SMART Analysis v. 2.3.0. Assembly Name :: COCOBI_1.0 Genome Coverage :: 170x Sequencing Technology :: PacBio RSII ##Genome-Assembly-Data-END## FEATURES Qualifiers source /chromosome="1" /db_xref="taxon:2315456" /mol_type="genomic DNA" /organism="Coccomyxa sp. Obi" protein /locus_tag="COCOBI_01-4320" /transl_table=1 intron_pos 103:2 (1/11) intron_pos 156:1 (2/11) intron_pos 190:0 (3/11) intron_pos 248:0 (4/11) intron_pos 297:0 (5/11) intron_pos 359:0 (6/11) intron_pos 397:1 (7/11) intron_pos 455:0 (8/11) intron_pos 518:2 (9/11) intron_pos 672:1 (10/11) intron_pos 836:0 (11/11) BEGIN 1 MRNRHLAQNA LKASDTIAEA VLPVELSPIT CSRGAVQSQQ HLEAAPQHSS EKSCQDGMSA 61 GQHADLLNVS GLQSGGSSRG SFGAVGQGSS QPQEMQRCRS VGSTDVSSAS LGHTSINTPS 121 EVEDADVSSA PAYGQAHGAL LGADDCRSSG AGNLPGGPEL PNVAGDTAQD KAENARAMAE 181 AAMKEAKAKA RCAAEMMAVA QQKVVLESRS LKRKRSLGLK SYVGPPSHML LGVRISVYWP 241 DDDAFYKGRI VEVLDEDDRV LVKYDDEMDE ELHLLCERFQ WLAPRAQSAG ASSQLHMEMA 301 RLGAEGVEET PQVPDTSDMG FKGAPEGAAA VGGRISIHFA GVGIWCRGEV LAYDASREKH 361 HILYEDGENE WVRLSREAYT WCPNCAESAY PAGLPPGTAA PSGQEAIGWR VSIYWPDNST 421 FYVGEVIGFD NVTGRHHVLY DNGDQEHLAL NAAKVCWVMP PSVSSQAARP EEALHQSAPK 481 GMALGTDLAI SGRSLRRSGS AGRAVRQAAL LQEQAPSRFE AKAWLKEELA GQGDGELLPF 541 DDDDDAMLDE GLCADDFPRG IAGEQLKGEM PTWWHEQLHS HSAAVLISGL EPRRVRSSQL 601 PSLAEEDAAA WADVRGPAAA RGGNYMLRTQ RSLPDCTTIG TARIGLPLTR AMTMPTAGDC 661 QLDDLICQLD AGAEGISMSS PVTVGLEESM DMALPGMDLP AMLLESSNPQ QGSSLVASSS 721 APLVGGSAHG SVEIESQLSG LAAVGMRDFL SAGAPDVADV AQFLCGDEDA AAATEGGARQ 781 LVDSSTSLAS LSSARGSALS MASDTSVVYI GAPSSQPLQP SMPGSPGLCG ANTPESARRM 841 LHWEMGRKSA LGEPLQSPSS SAALARVPLA PEPLDTSVDL NLNAGSPDGL AMATELQTLL 901 AEMDWG //