LOCUS BAC83916.1 534 aa PRT PLN 16-FEB-2008 DEFINITION Oryza sativa Japonica Group methylmalonate semi-aldehyde dehydrogenase protein. ACCESSION AP005179-16 PROTEIN_ID BAC83916.1 SOURCE Oryza sativa Japonica Group (Japanese rice) ORGANISM Oryza sativa Japonica Group Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa. REFERENCE 1 (bases 1 to 127992) AUTHORS Sasaki,T., Matsumoto,T. and Katayose,Y. TITLE Direct Submission JOURNAL Submitted (15-MAY-2002) to the DDBJ/EMBL/GenBank databases. Contact:Takuji Sasaki National Institute of Agrobiological Sciences, Rice Genome Research Program; Kannondai 2-1-2, Tsukuba, Ibaraki 305-8602, Japan URL :http://rgp.dna.affrc.go.jp/ REFERENCE 2 AUTHORS Sasaki,T., Matsumoto,T. and Katayose,Y. TITLE Oryza sativa nipponbare(GA3) genomic DNA, chromosome 7, BAC clone:OSJNBb0084L07 JOURNAL Published Only in Database(2002) COMMENT Genes were predicted from the integrated results of the following: GENSCAN (http://CCR-081.mit.edu/GENSCAN.html), FGENESH (http://www.softberry.com/), GeneMark.hmm (http://opal.biology.gatech.edu/GeneMark/), GlimmerM (http://www.tigr.org/tdb/glimmerm/glmr_form.html), RiceHMM (http://rgp.dna.affrc.go.jp/RiceHMM/), SplicePredictor (http://bioinformatics.iastate.edu/cgi-bin/sp.cgi), sim4 (http://globin.cse.psu.edu/html/docs/sim4.html), gap2 (http://www.tigr.org/software/glimmerm/), BLASTN and BLASTX. The genomic sequence was searched against NCBI NonRedundant Protein database, nr (ftp://ncbi.nlm.nih.gov/blast/db) and the cDNA sequence database at RGP or DDBJ. Protein homologies of the coding regions were searched against NCBI NonRedundant Protein database with BLASTP. ESTs represent the identified cDNA sequences using BLASTN with the corresponding DDBJ accession no. and RGP clone ID. Full-length cDNAs represent the identified cDNA sequences using BLASTN with the corresponding DDBJ accession no. A gene with identity or significant homology to a protein is classified based on the protein name to indicate the homology level such as same name, 'putative-' and '-like protein'. A gene without significant homology to any protein but with full-length cDNA or EST homology (covering almost the entire length of partial sequence) is classified as an 'unknown' protein. A gene predicted by two or more gene prediction programs is classified as a 'hypothetical' protein according to IRGSP standard. A gene predicted by a single gene prediction program is also classified as a probable 'hypothetical' protein and is included as a miscellaneous feature of the sequence. The orientation of the sequence is from M13rev to -21M13 of the BAC clone. This sequence of OSJNBb0084L07 clone has an overlap with P0506C07 (DDBJ: AP004384) clone at 5' end and with OSJNBb0002L09 (DDBJ: AP005877) at 3' end. Detailed information on overlap and assembly quality together with annotation of this entry is available at http://rgp.dna.affrc.go.jp/GenomeSeq.html FEATURES Qualifiers source /chromosome="7" /clone="OSJNBb0084L07" /cultivar="Nipponbare" /db_xref="taxon:39947" /mol_type="genomic DNA" /organism="Oryza sativa Japonica Group" protein /gene="OSJNBb0084L07.21" /note="contains EST(s): AU070251(S20304),C72889(E2423),AU032148(R3630),AU032149 (R3630),AU165534(E2423)" /note="contains full-length cDNA(s): AK065917" intron_pos 10:1 (1/18) intron_pos 39:0 (2/18) intron_pos 64:0 (3/18) intron_pos 118:0 (4/18) intron_pos 144:1 (5/18) intron_pos 196:0 (6/18) intron_pos 217:1 (7/18) intron_pos 244:0 (8/18) intron_pos 266:0 (9/18) intron_pos 283:0 (10/18) intron_pos 333:2 (11/18) intron_pos 364:0 (12/18) intron_pos 391:0 (13/18) intron_pos 417:0 (14/18) intron_pos 429:0 (15/18) intron_pos 443:2 (16/18) intron_pos 470:0 (17/18) intron_pos 500:1 (18/18) BEGIN 1 MLRAALLRSG SGLRRPPMAA PLSTAAAASW LSDSASSPPR VRLLIGGEFV ESRADEHVDV 61 TNPATQEVVS RIPLTTADEF RAAVDAARTA FPGWRNTPVT TRQRIMLKYQ ELIRANMDKL 121 AENITTEQGK TLKDAWGDVF RGLEVVEHAC GMGTLQMGEY VSNVSNGIDT FSIREPLGVC 181 AGICPFNFPA MIPLWMFPIA VTCGNTFVLK PSEKDPGAAM MLAELAMEAG LPKGVLNIVH 241 GTHDVVNNIC DDEDIKAVSF VGSNIAGMHI YSRASAKGKR VQSNMGAKNH AIILPDADRD 301 ATLNALIAAG FGAAGQRCMA LSTAVFVGGS EPWEDELVKR ASSLVVNSGM ASDADLGPVI 361 SKQAKERICK LIQSGADNGA RVLLDGRDIV VPNFENGNFV GPTLLADVKS EMECYKEEIF 421 GPVLLLMKAE SLDDAIQIVN RNKYGNGASI FTTSGVSARK FQTDIEAGQV GINVPIPVPL 481 PFFSFTGSKA SFAGDLNFYG KAGVQFFTQI KTVTQQWKES PAQRVSLSMP TSQK //