LOCUS       BC001505                2088 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens aldehyde dehydrogenase 1 family, member A1, mRNA (cDNA
            clone MGC:2318 IMAGE:2988388), complete cds.
ACCESSION   BC001505
VERSION     BC001505.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2088)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2088)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (21-DEC-2000) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Oct 8, 2003 this sequence version replaced BC001505.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 3 Row: n Column: 13
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 25777722.
FEATURES             Location/Qualifiers
     source          1..2088
                     /db_xref="H-InvDB:HIT000030461"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:2318 IMAGE:2988388"
                     /tissue_type="Colon, adenocarcinoma"
                     /clone_lib="NIH_MGC_15"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2088
                     /gene="ALDH1A1"
                     /gene_synonym="ALDC"
                     /gene_synonym="ALDH-E1"
                     /gene_synonym="ALDH11"
                     /gene_synonym="MGC2318"
                     /gene_synonym="RALDH1"
                     /db_xref="GeneID:216"
                     /db_xref="HGNC:HGNC:402"
                     /db_xref="MIM:100640"
     CDS             26..1531
                     /gene="ALDH1A1"
                     /gene_synonym="ALDC"
                     /gene_synonym="ALDH-E1"
                     /gene_synonym="ALDH11"
                     /gene_synonym="MGC2318"
                     /gene_synonym="RALDH1"
                     /codon_start=1
                     /product="aldehyde dehydrogenase 1 family, member A1"
                     /protein_id="AAH01505.1"
                     /db_xref="GeneID:216"
                     /db_xref="HGNC:HGNC:402"
                     /db_xref="MIM:100640"
                     /translation="MSSSGTPDLPVLLTDLKIQYTKIFINNEWHDSVSGKKFPVFNPA
                     TEEELCQVEEGDKEDVDKAVKAARQAFQIGSPWRTMDASERGRLLYKLADLIERDRLL
                     LATMESMNGGKLYSNAYLNDLAGCIKTLRYCAGWADKIQGRTIPIDGNFFTYTRHEPI
                     GVCGQIIPWNFPLVMLIWKIGPALSCGNTVVVKPAEQTPLTALHVASLIKEAGFPPGV
                     VNIVPGYGPTAGAAISSHMDIDKVAFTGSTEVGKLIKEAAGKSNLKRVTLELGGKSPC
                     IVLADADLDNAVEFAHHGVFYHQGQCCIAASRIFVEESIYDEFVRRSVERAKKYILGN
                     PLTPGVTQGPQIDKEQYDKILDLIESGKKEGAKLECGGGPWGNKGYFVQPTVFSNVTD
                     EMRIAKEEIFGPVQQIMKFKSLDDVIKRANNTFYGLSAGVFTKDIDKAITISSALQAG
                     TVWVNCYGVVSAQCPFGGFKMSGNGRELGEYGFHEYTEVKTVTVKISQKNS"
BASE COUNT          650 a          388 c          478 g          572 t
ORIGIN      
        1 ctgtgttcca ggagccgaat cagaaatgtc atcctcaggc acgccagact tacctgtcct
       61 actcaccgat ttgaagattc aatatactaa gatcttcata aacaatgaat ggcatgattc
      121 agtgagtggc aagaaatttc ctgtctttaa tcctgcaact gaggaggagc tctgccaggt
      181 agaagaagga gataaggagg atgttgacaa ggcagtgaag gccgcaagac aggcttttca
      241 gattggatcc ccgtggcgta ctatggatgc ttccgagagg gggcgactat tatacaagtt
      301 ggctgattta atcgaaagag atcgtctgct gctggcgaca atggagtcaa tgaatggtgg
      361 aaaactctat tccaatgcat atctgaatga tttagcaggc tgcatcaaaa cattgcgcta
      421 ctgtgcaggt tgggctgaca agatccaggg ccgtacaata ccaattgatg gaaatttttt
      481 tacatataca agacatgaac ctattggtgt atgtggccaa atcattcctt ggaatttccc
      541 gttggttatg ctcatttgga agatagggcc tgcactgagc tgtggaaaca cagtggttgt
      601 caaaccagca gagcaaactc ctctcactgc tctccacgtg gcatctttaa taaaagaggc
      661 agggtttcct cctggagtag tgaatattgt tcctggttat gggcctacag caggggcagc
      721 catttcttct cacatggata tagacaaagt agccttcaca ggatcaacag aggttggcaa
      781 gttgatcaaa gaagctgccg ggaaaagcaa tctgaagagg gtgaccctgg agcttggagg
      841 aaagagccct tgcattgtgt tagctgatgc cgacttggac aatgctgttg aatttgcaca
      901 ccatggggta ttctaccacc agggccagtg ttgtatagcc gcatccagga tttttgtgga
      961 agaatcaatt tatgatgagt ttgttcgaag gagtgttgag cgggctaaga agtatatcct
     1021 tggaaatcct ctgaccccag gagtcactca aggccctcag attgacaagg aacaatatga
     1081 taaaatactt gacctcattg agagtgggaa gaaagaaggg gccaaactgg aatgtggagg
     1141 aggcccgtgg gggaataaag gctactttgt ccagcccaca gtgttctcta atgttacaga
     1201 tgagatgcgc attgccaaag aggagatttt tggaccagtg cagcaaatca tgaagtttaa
     1261 atctttagat gacgtgatca aaagagcaaa caatactttc tatggcttat cagcaggagt
     1321 gtttaccaaa gacattgata aagccataac aatctcctct gctctgcagg caggaacagt
     1381 gtgggtgaat tgctatggcg tggtaagtgc ccagtgcccc tttggtggat tcaagatgtc
     1441 tggaaatgga agagaactgg gagagtacgg tttccatgaa tatacagagg tcaaaacagt
     1501 cacagtgaaa atctctcaga agaactcata aagaaaatac aagagtggag agaagctctt
     1561 caatagctaa gcatctcctt acagtcacta atatagtaga ttttaaagac aaaatttttc
     1621 ttttcttgat ttttttaaac ataagctaaa tcatattagt attaatacta cccatagaaa
     1681 acttgacatg tagcttcttc tgaaagaatt atttgccttc tgaaatgtga cccccaagtc
     1741 ctatcctaaa taaaaaaaga caaattcgga tgtatgatct ctctagcttt gtcatagtta
     1801 tgtgattttc ctttgtagct acttttgcag gataataatt ttatagaaaa ggaacagttg
     1861 catttagctt ctttccctta gtgactcttg aagtacttaa catacacgtt aactgtagag
     1921 taaattgctc tgttcccagt agttataaag tccttggact gttttgaaaa gtttcctagg
     1981 atgtcatgtc tgcttgtcaa aagaaataat ccctgtaata tttagctgta aactgaatat
     2041 aaagcttaat aaaaacaacc ttgcatgaaa aaaaaaaaaa aaaaaaaa
//