LOCUS EOA29011.1 623 aa PRT CON 21-MAR-2015 DEFINITION Capsella rubella hypothetical protein protein. ACCESSION KB870808-1191 PROTEIN_ID EOA29011.1 SOURCE Capsella rubella ORGANISM Capsella rubella Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella. REFERENCE 1 (bases 1 to 14994381) AUTHORS Schmutz,J., Prochnik,S., Nordborg,M., Weigel,D., Rokhsar,D. and Wright,S. TITLE Genome sequencing of Capsella rubella JOURNAL Nat. Genet. (2013) In press REFERENCE 2 (bases 1 to 14994381) AUTHORS Schmutz,J., Prochnik,S., Nordborg,M., Weigel,D., Rokhsar,D. and Wright,S. CONSRTM US DOE Joint Genome Institute (JGI-PGF) TITLE Direct Submission JOURNAL Submitted (01-APR-2013) US DOE Joint Genome Institute, Mitchell Drive B100, Walnut Creek, CA 94598-1698, USA COMMENT ##Genome-Assembly-Data-START## Assembly Method :: ARACHNE v. 20071016_modified Assembly Name :: Caprub1_0 Genome Coverage :: 22.35X Sequencing Technology :: ABI 3739; Roche 454FLX ##Genome-Assembly-Data-END## FEATURES Qualifiers source /organism="Capsella rubella" /mol_type="genomic DNA" /submitter_seqid="scaffold_4" /cultivar="Monte Gargano" /bio_material="ABRC:CS22697" /db_xref="taxon:81985" /chromosome="Unknown" protein /locus_tag="CARUB_v10025264mg" /note="encoded by transcript CARUB_v10025264m" /db_xref="Phytozome:Carubv10025264m.p" intron_pos 99:1 (1/12) intron_pos 236:2 (2/12) intron_pos 330:1 (3/12) intron_pos 360:0 (4/12) intron_pos 383:1 (5/12) intron_pos 411:1 (6/12) intron_pos 437:0 (7/12) intron_pos 464:1 (8/12) intron_pos 492:0 (9/12) intron_pos 514:1 (10/12) intron_pos 557:1 (11/12) intron_pos 593:0 (12/12) BEGIN 1 MKNVGLVVIV WIIVGWSSCM GRFVVEKNNL RVTSPESIRG VYECALGNFG IPQYGGSMSG 61 AVLYPKANQK ACKNFADFDI SFRSRVAGLP TFVLVDRGDC YFTLKAWNAQ RAGAATILVA 121 DNRPEQLITM DAPEDETTDA DYLQNITIPS ALVSRSLGSA IKTAIAHGEP VHISLDWREA 181 LPHPNDRVAY ELWTNSNDEC GPKCDAQIQF LKRFKGAAQI LEKGGYTRFT PHYITWYCPE 241 AFLASRQCKS QCINGGRYCA PDPEQDFSRG YNGKDVIVQN LRQACFFRVT NESGKPWLWW 301 DYVTDFAIRC PMKQEKYNKK CADQVIHSLG VDVKKIDKCI GDIEANTENP VLKEEQHAQV 361 GKGSRGDVTI LPTIVINNRQ YRGKLQRSAV LKALCSGFRE TTEPPICLTE DIETNECLQN 421 NGGCWEDKTT NITACRDTFR GRVCQCPIVQ GVKFLGDGYT HCEASGALRC GINNGGCWKH 481 TQMGKTYSAC RDDHSKGCKC PPGFKGDGLK DCQDVNECEE KTACQCRGCK CKNTWGSYEC 541 SCSGSLLYIR EHDICINKDA RGDVSWGVIW IIIMGLGAAA LGAYTIYKYR IRTYMDSEIR 601 AIMAQYMPLD NNPNSQPSSQ LEL //