LOCUS       BC023600                3140 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens aldehyde dehydrogenase 4 family, member A1, mRNA (cDNA
            clone MGC:23086 IMAGE:4548787), complete cds.
ACCESSION   BC023600
VERSION     BC023600.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3140)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3140)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 19, 2003 this sequence version replaced BC023600.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 33 Row: b Column: 12
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 25777733.
FEATURES             Location/Qualifiers
     source          1..3140
                     /db_xref="H-InvDB:HIT000050845"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:23086 IMAGE:4548787"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_18"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3140
                     /gene="ALDH4A1"
                     /gene_synonym="P5CD"
                     /gene_synonym="P5CDh"
                     /gene_synonym="P5CDhL"
                     /gene_synonym="P5CDhS"
                     /db_xref="GeneID:8659"
                     /db_xref="HGNC:HGNC:406"
                     /db_xref="MIM:606811"
     CDS             18..1709
                     /gene="ALDH4A1"
                     /gene_synonym="P5CD"
                     /gene_synonym="P5CDh"
                     /gene_synonym="P5CDhL"
                     /gene_synonym="P5CDhS"
                     /codon_start=1
                     /product="aldehyde dehydrogenase 4 family, member A1"
                     /protein_id="AAH23600.1"
                     /db_xref="GeneID:8659"
                     /db_xref="HGNC:HGNC:406"
                     /db_xref="MIM:606811"
                     /translation="MLLPAPALRRALLSRPWTGAGLRWKHTSSLKVANEPVLAFTQGS
                     PERDALQKALKDLKGRMEAIPCVVGDEEVWTSDVQYQVSPFNHGHKVAKFCYADKSLL
                     NKAIEAALAARKEWDLKPIADRAQIFLKAADMLSGPRRAEILAKTMVGQGKTVIQAEI
                     DAAAELIDFFRFNAKYAVELEGQQPISVPPSTNSTVYRGLEGFVAAISPFNFTAIGGN
                     LAGAPALMGNVVLWKPSDTAMLASYAVYRILREAGLPPNIIQFVPADGPLFGDTVTSS
                     EHLCGINFTGSVPTFKHLWKQVAQNLDRFHTFPRLAGECGGKNFHFVHRSADVESVVS
                     GTLRSAFEYGGQKCSACSRLYVPHSLWPQIKGRLLEEHSRIKVGDPAEDFGTFFSAVI
                     DAKSFARIKKWLEHARSSPSLTILAGGKCDDSVGYFVEPCIVESKDPQEPIMKEEIFG
                     PVLSVYVYPDDKYKETLQLVDSTTSYGLTGAVFSQDKDVVQEATKVLRNAAGNFYIND
                     KSTGSIVGQQPFGGARASGTNDKPGGPHYILRWTSPQVIKETHKPLGDWSYAYMQ"
BASE COUNT      589 a    954 c    943 g    654 t
ORIGIN
        1 cccgcttcta acccgagatg ctgctgccgg cgcccgcgct ccgccgcgcc ctgctgtccc
       61 gcccctggac cggggccggc ctgcggtgga agcacacctc ctccctgaag gtggccaacg
      121 agcccgtctt agccttcacg cagggcagcc ctgagcgaga tgccctgcaa aaggccttga
      181 aggacctgaa gggccggatg gaagccatcc catgcgtggt gggggatgag gaggtgtgga
      241 cgtcggacgt gcagtaccaa gtgtcgcctt ttaaccatgg acataaggtg gccaagttct
      301 gttatgcaga caagagcctg ctcaacaaag ccattgaggc tgccctggct gcccggaaag
      361 agtgggacct gaagcctatt gcagaccggg cccagatctt cctgaaggcg gcagacatgc
      421 tgagtgggcc gcgcagggct gagatcctcg ccaagaccat ggtgggacag ggtaagaccg
      481 tgatccaagc ggagattgac gctgcagcgg aactcatcga cttcttccgg ttcaatgcca
      541 agtatgcggt ggagctggag gggcagcagc ccatcagcgt gcccccgagc accaacagca
      601 cggtgtaccg gggtctggag ggcttcgtgg cggccatctc gccctttaac ttcactgcaa
      661 tcggcggcaa cctggcgggg gcaccggccc tgatgggcaa cgtggtccta tggaagccca
      721 gtgacactgc catgctggcc agctatgctg tctaccgcat ccttcgggag gctggcctgc
      781 cccccaacat catccagttt gtgccagctg atgggcccct atttggggac actgtcacca
      841 gctcagagca cctctgtggc atcaacttca caggcagtgt gcccaccttc aaacacctgt
      901 ggaagcaggt ggcccagaac ctggaccggt tccacacctt cccacgcctg gctggagagt
      961 gcggcggaaa gaacttccac ttcgtgcacc gctcggccga cgtggagagc gtggtgagcg
     1021 ggaccctccg ctcagccttc gagtacggtg gccagaagtg ttccgcgtgc tcgcgtctct
     1081 acgtgccgca ctcgctgtgg ccgcagatca aagggcggct gctggaggag cacagtcgga
     1141 tcaaagtggg cgaccctgca gaggattttg ggaccttctt ctctgcagtg attgatgcca
     1201 agtcctttgc ccgtatcaag aagtggctgg agcacgcacg ctcctcaccc agcctcacca
     1261 tcctggccgg gggcaagtgt gatgactccg tgggctactt tgtggagccc tgcatcgtgg
     1321 agagcaagga ccctcaggag cccatcatga aggaggagat cttcgggcct gtactgtctg
     1381 tgtacgtcta cccggatgac aagtacaagg agacgctgca gctggttgac agcaccacca
     1441 gctatggcct cacgggggca gtgttctccc aggataagga cgtcgtgcag gaggccacaa
     1501 aggtgctgag gaatgctgcc ggcaacttct acatcaacga caagtccact ggctcgatag
     1561 tgggccagca gccctttggg ggggcccgag cctctggaac caatgacaag ccagggggcc
     1621 cacactacat cctgcgctgg acgtcgccgc aggtcatcaa ggagacacat aagcccctgg
     1681 gggactggag ctacgcgtac atgcagtgag cccctctcgg gctccaccgt ccagctgtct
     1741 gtccgtccag gtggccgacc tcactgcaca gaccccactc cagcccctcc accccttctt
     1801 catgcacagc tgcctttcta taatccgggc ttgactccct tcttaccact gtattctggc
     1861 ctctcccatg cctcaggctc tggtttgaga tcgtgctggg gaggaacatg gccactaccc
     1921 cttatcccat cggccatgtg ggaggtatga ccctggtgcc tggcaggttc tccctctgcc
     1981 ctccactggg cccagtggct cagggacctg gggaaaggag atggagcagc tcttgggatc
     2041 ctttggggaa aaggaggcca ttctgggccc cttggcaaac ctcaccactc acagaggctc
     2101 ctggccttga tccctgcccc tccaggtgtc cagggtaaag tgtaactcag actgacctgt
     2161 ggggcacagg gggcaccagc tggccttgcc ctctctggtc tgggctgtct accttcctca
     2221 ctgtatcttt gcccagaccc acctgggcca gtaggcccct gtccccagcc acacacctta
     2281 gatgctggca tgccttactc caggtgcctg tgtttggccg aggcctgtgt gattcccggt
     2341 ctgcaccaca tggcggggtt ggggggccgc tggaggccac ctgccaaggc gtgggatggg
     2401 atggtcctgc cggtttaggc cgtgattctg gaaaaccttg gatgggcctt cgtcctatgt
     2461 cagccttccc tttgatcctc aggccctacc tgtagagacc tccactccta gagccagtct
     2521 cagggtctgg gatttccctg caggagctca gccaccactg tgccatggtg acacaggcca
     2581 aggcagacat tggccctccc ttctcccagc ccccagaggc ctggccttgg gttcgtcagc
     2641 atgggccgag gacgttgcct gtagaatcct cctctgcctg ggagtggctc tgtgtggacc
     2701 agtccctcac tggcccattc tttttttgac gcagccaatc tgtgaccacg attcctccca
     2761 cagatgcctc ctgcttggat tctgagtggt cagagatctg taaagcatga ctttcaagga
     2821 tggttcttag gggactgtga aagtgttggg tcttcctcca ggatgcctgc atgggacccc
     2881 acccggagct ggtgtggcca ttccccaagt gccactggcc catggatggg ggtgggtgct
     2941 ggtgccagct gggctgggtg tgggttctgt gtccttccag gatatgtgtc atttcccatg
     3001 aggggccggg gcaggtggct gggtgggggc acaggctgga gtattcttag ttctactggt
     3061 tctacactgt gaggtggcaa tgggatttgc tcagatgcca cccaataaaa tgcctgttac
     3121 ttaaaaaaaa aaaaaaaaaa
//