LOCUS BC030589 3257 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens aldehyde dehydrogenase 1 family, member A2, mRNA (cDNA clone MGC:26444 IMAGE:4826743), complete cds. ACCESSION BC030589 VERSION BC030589.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3257) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3257) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (13-MAY-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Sep 16, 2003 this sequence version replaced BC030589.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 33 Row: o Column: 22 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 25777723. FEATURES Location/Qualifiers source 1..3257 /db_xref="H-InvDB:HIT000041057" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:26444 IMAGE:4826743" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..3257 /gene="ALDH1A2" /gene_synonym="MGC26444" /gene_synonym="RALDH(II)" /gene_synonym="RALDH2" /gene_synonym="RALDH2-T" /db_xref="GeneID:8854" /db_xref="HGNC:HGNC:15472" /db_xref="MIM:603687" CDS 32..1474 /gene="ALDH1A2" /gene_synonym="MGC26444" /gene_synonym="RALDH(II)" /gene_synonym="RALDH2" /gene_synonym="RALDH2-T" /codon_start=1 /product="aldehyde dehydrogenase 1 family, member A2" /protein_id="AAH30589.1" /db_xref="GeneID:8854" /db_xref="HGNC:HGNC:15472" /db_xref="MIM:603687" /translation="MTSSKIEMPGEVKADPAALMASLHLLPSPTPNLEIKYTKIFINN EWQNSESGRVFPVYNPATGEQVCEVQEADKADIDKAVQAARLAFSLGSVWRRMDASER GRLLDKLADLVERDRAVLATMESLNGGKPFLQAFYVDLQGVIKTFRYYAGWADKIHGM TIPVDGDYFTFTRHEPIGVCGQIIPWNFPLLMFAWKIAPALCCGNTVVIKPAEQTPLS ALYMGALIKEVGKLIQEAAGRSNLKRVTLELGGKSPNIIFADADLDYAVEQAHQGVFF NQGQCCTAGSRIFVEESIYEEFVRRSVERAKRRVVGSPFDPTTEQGPQIDKKQYNKIL ELIQSGVAEGAKLECGGKGLGRKGFFIEPTVFSNVTDDMRIAKEEIFGPVQEILRFKT MDEVIERANNSDFGLVAAVFTNDINKALTVSSAMQAGTVWINCYNALNAQSPFGGFKM SGNGREMGEFGLREYSEVKTVTVKIPQKNS" BASE COUNT 886 a 743 c 760 g 868 t ORIGIN 1 ggcgcgggct agggacaccc ggcccgccac catgacttcc agcaagatag agatgcccgg 61 cgaggtgaag gccgaccccg ccgccctcat ggcgtcgctg cacctcctgc cgtcgcccac 121 gcccaatctc gaaattaagt acaccaagat ctttataaac aacgagtggc agaactcaga 181 gagtgggaga gtgttccctg tctataatcc agccacagga gaacaggtgt gtgaagttca 241 agaagcagac aaggcagata tagacaaagc agtgcaggca gcccgcctgg ctttctctct 301 tggttcagtg tggagaagga tggatgcttc agaaagggga cgtctgttgg ataagcttgc 361 agacttggtg gaacgggaca gggcagttct tgcaaccatg gaatccctaa atggtggcaa 421 accattcctg caagcttttt atgtggattt gcagggcgtc atcaaaacct ttcgatatta 481 cgcaggctgg gctgataaaa ttcatgggat gaccattcct gtagatggag actattttac 541 ctttacaaga catgaaccca ttggagtgtg tggacagatc atcccatgga acttccccct 601 gctgatgttt gcctggaaaa tagctccagc tttgtgctgt ggcaatacag tagttattaa 661 gccagcagag caaacaccac tcagtgcact ctacatggga gccctcatca aggaggttgg 721 aaagcttatc caagaagcag ctggaagaag taatttgaag agagtaactc tggaacttgg 781 aggcaaaagt cctaatatta tttttgctga tgctgacttg gactatgctg tggagcaggc 841 ccaccagggt gtgttcttca atcaaggtca gtgctgcact gcaggctctc gcatcttcgt 901 ggaggagtcc atctatgagg agtttgtgag aagaagcgtg gagcgggcca agaggcgcgt 961 agtggggagt ccctttgacc ccaccactga gcagggtccc cagattgata agaaacagta 1021 caacaagatc ttggaactca tccagagtgg tgtggctgag ggcgccaagc tggaatgtgg 1081 aggcaaagga ctgggccgaa aggggttttt cattgagccc acagtgtttt ccaacgtcac 1141 tgatgatatg cggattgcca aggaggagat ctttggccct gttcaggaaa ttttgagatt 1201 taagacgatg gatgaagtta tcgaaagagc caataactca gactttggac tcgtagcagc 1261 tgtctttact aatgacatca acaaggccct cacagtgtct tctgcaatgc aagctgggac 1321 tgtttggatc aattgttaca atgccttaaa tgcccagagc ccctttgggg gattcaagat 1381 gtctggaaat gggagagaaa tgggagaatt tggcttgcgg gagtactcag aagttaagac 1441 ggtgacagta aagatccccc agaagaactc ctaagaaggc caagaaggag gatgaagccc 1501 agcctgcacg tctgtccctc tctgctttct ctgtagggcc cagctctcag gaatacaaag 1561 ttgagccacg gtccttactt aaagattgaa aagataacat gtaggccagg caggtcactg 1621 cacaactaaa gcaaaccagc tgggtacagt ttcttggcac tctgtaaggg gccaccttaa 1681 tcataccaaa tattggggaa agtgggataa agggaggagg aggagctagc agacacatcc 1741 agtatctcct tctggagcac aggatgaaat aagggagctg tattatttca tgtctttgtc 1801 acaaagaact ttcctctcaa ggaaaggtga cctttctcct gtcttcattt tcctccttcc 1861 aggccctcct cgctcaccca cccctccctc tcttccaagg agatgtcagc tgagctcatt 1921 ctggggcaga tgtttgggcc gggaacaatt tttcaaggtt gtaaagccaa attatcattt 1981 catgttatcc atttcttcaa agcaaaacat gaaatggttt tagctagagt cagaccagaa 2041 tgaaaatgcc aggagctggt acactacaga tgtagtaaga acctgggata ttcctgaccc 2101 aatctggttt tcttttaccc ataaataaca tgaatgaaaa aagattggga caatagagac 2161 tggaagtcat catgtgcagt tcaccgcttc tgagcttgct gcagttttgg ggtgtgtgtg 2221 tattagattc cttctcagtt attctggaat aaggcaagga gtgggttgtt tttcatagct 2281 agataagatc ttttccaaag tttttcttag aaccaaccaa aaaacaatcc gagtaggccc 2341 aagaatttga taatgctgga tgccttgcag acatcattca gtttctaata ttgggcaaca 2401 attattatta aatgaattat ttctgtagtt ggaatctgta ccttctgaac ctctacacca 2461 ataactgctg caggtgtgat tttggtctgt cacactgtac atctatcata atgtgccctg 2521 tatctattgg cagtgacctt ggaaaatctg gccaagccta ggggtttcct tttccatttg 2581 ccaagttcca ttgtgccagg actgccgtgc tccactgagc tcctctgtca caccccattc 2641 ttgcccctca ctgggcaggc catggcctac agcttgcagg gagtaaagca ggcccgcctc 2701 cctttcttcc catccacata ctcctcttct gctttccagt gactccacca gtttgatgtg 2761 ggaagtgtta gcttcctttc cttcttccat cccttcttcc atctttccag ctgtcaaatc 2821 caatccagtc tctaacctaa atgcagatca tttatttaaa agtaccaaac ataacccaga 2881 gtatgtggaa tatgggcaac atatatatag ccttctgtat ttaacgatct tctgcttctt 2941 aaccgtacca gttttctatt tataactctt atctatccat gatgttttaa agtctccact 3001 tgctgttatt tacaaacgac agtgcattca gcagcccagt gccgtgagcc ctgacagatg 3061 ccgtatttct gagtgcttcc atgtgaatgc tgccctcctg tagcatgtgt ccaagtggac 3121 atagccacta accaactagt tacctttgga ctgcaacaaa aaatgtgaaa atgaagattt 3181 atttctttta atttacttaa aaagaaacct ctgtgctagc aataaagcat ttatattgtg 3241 caaaaaaaaa aaaaaaa //