LOCUS BC023600 3140 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens aldehyde dehydrogenase 4 family, member A1, mRNA (cDNA
clone MGC:23086 IMAGE:4548787), complete cds.
ACCESSION BC023600
VERSION BC023600.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3140)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3140)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-FEB-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Dec 19, 2003 this sequence version replaced BC023600.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP/Gazdar
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 33 Row: b Column: 12
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 25777733.
FEATURES Location/Qualifiers
source 1..3140
/db_xref="H-InvDB:HIT000050845"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:23086 IMAGE:4548787"
/tissue_type="Lung, large cell carcinoma"
/clone_lib="NIH_MGC_18"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3140
/gene="ALDH4A1"
/gene_synonym="P5CD"
/gene_synonym="P5CDh"
/gene_synonym="P5CDhL"
/gene_synonym="P5CDhS"
/db_xref="GeneID:8659"
/db_xref="HGNC:HGNC:406"
/db_xref="MIM:606811"
CDS 18..1709
/gene="ALDH4A1"
/gene_synonym="P5CD"
/gene_synonym="P5CDh"
/gene_synonym="P5CDhL"
/gene_synonym="P5CDhS"
/codon_start=1
/product="aldehyde dehydrogenase 4 family, member A1"
/protein_id="AAH23600.1"
/db_xref="GeneID:8659"
/db_xref="HGNC:HGNC:406"
/db_xref="MIM:606811"
/translation="MLLPAPALRRALLSRPWTGAGLRWKHTSSLKVANEPVLAFTQGS
PERDALQKALKDLKGRMEAIPCVVGDEEVWTSDVQYQVSPFNHGHKVAKFCYADKSLL
NKAIEAALAARKEWDLKPIADRAQIFLKAADMLSGPRRAEILAKTMVGQGKTVIQAEI
DAAAELIDFFRFNAKYAVELEGQQPISVPPSTNSTVYRGLEGFVAAISPFNFTAIGGN
LAGAPALMGNVVLWKPSDTAMLASYAVYRILREAGLPPNIIQFVPADGPLFGDTVTSS
EHLCGINFTGSVPTFKHLWKQVAQNLDRFHTFPRLAGECGGKNFHFVHRSADVESVVS
GTLRSAFEYGGQKCSACSRLYVPHSLWPQIKGRLLEEHSRIKVGDPAEDFGTFFSAVI
DAKSFARIKKWLEHARSSPSLTILAGGKCDDSVGYFVEPCIVESKDPQEPIMKEEIFG
PVLSVYVYPDDKYKETLQLVDSTTSYGLTGAVFSQDKDVVQEATKVLRNAAGNFYIND
KSTGSIVGQQPFGGARASGTNDKPGGPHYILRWTSPQVIKETHKPLGDWSYAYMQ"
BASE COUNT 589 a 954 c 943 g 654 t
ORIGIN
1 cccgcttcta acccgagatg ctgctgccgg cgcccgcgct ccgccgcgcc ctgctgtccc
61 gcccctggac cggggccggc ctgcggtgga agcacacctc ctccctgaag gtggccaacg
121 agcccgtctt agccttcacg cagggcagcc ctgagcgaga tgccctgcaa aaggccttga
181 aggacctgaa gggccggatg gaagccatcc catgcgtggt gggggatgag gaggtgtgga
241 cgtcggacgt gcagtaccaa gtgtcgcctt ttaaccatgg acataaggtg gccaagttct
301 gttatgcaga caagagcctg ctcaacaaag ccattgaggc tgccctggct gcccggaaag
361 agtgggacct gaagcctatt gcagaccggg cccagatctt cctgaaggcg gcagacatgc
421 tgagtgggcc gcgcagggct gagatcctcg ccaagaccat ggtgggacag ggtaagaccg
481 tgatccaagc ggagattgac gctgcagcgg aactcatcga cttcttccgg ttcaatgcca
541 agtatgcggt ggagctggag gggcagcagc ccatcagcgt gcccccgagc accaacagca
601 cggtgtaccg gggtctggag ggcttcgtgg cggccatctc gccctttaac ttcactgcaa
661 tcggcggcaa cctggcgggg gcaccggccc tgatgggcaa cgtggtccta tggaagccca
721 gtgacactgc catgctggcc agctatgctg tctaccgcat ccttcgggag gctggcctgc
781 cccccaacat catccagttt gtgccagctg atgggcccct atttggggac actgtcacca
841 gctcagagca cctctgtggc atcaacttca caggcagtgt gcccaccttc aaacacctgt
901 ggaagcaggt ggcccagaac ctggaccggt tccacacctt cccacgcctg gctggagagt
961 gcggcggaaa gaacttccac ttcgtgcacc gctcggccga cgtggagagc gtggtgagcg
1021 ggaccctccg ctcagccttc gagtacggtg gccagaagtg ttccgcgtgc tcgcgtctct
1081 acgtgccgca ctcgctgtgg ccgcagatca aagggcggct gctggaggag cacagtcgga
1141 tcaaagtggg cgaccctgca gaggattttg ggaccttctt ctctgcagtg attgatgcca
1201 agtcctttgc ccgtatcaag aagtggctgg agcacgcacg ctcctcaccc agcctcacca
1261 tcctggccgg gggcaagtgt gatgactccg tgggctactt tgtggagccc tgcatcgtgg
1321 agagcaagga ccctcaggag cccatcatga aggaggagat cttcgggcct gtactgtctg
1381 tgtacgtcta cccggatgac aagtacaagg agacgctgca gctggttgac agcaccacca
1441 gctatggcct cacgggggca gtgttctccc aggataagga cgtcgtgcag gaggccacaa
1501 aggtgctgag gaatgctgcc ggcaacttct acatcaacga caagtccact ggctcgatag
1561 tgggccagca gccctttggg ggggcccgag cctctggaac caatgacaag ccagggggcc
1621 cacactacat cctgcgctgg acgtcgccgc aggtcatcaa ggagacacat aagcccctgg
1681 gggactggag ctacgcgtac atgcagtgag cccctctcgg gctccaccgt ccagctgtct
1741 gtccgtccag gtggccgacc tcactgcaca gaccccactc cagcccctcc accccttctt
1801 catgcacagc tgcctttcta taatccgggc ttgactccct tcttaccact gtattctggc
1861 ctctcccatg cctcaggctc tggtttgaga tcgtgctggg gaggaacatg gccactaccc
1921 cttatcccat cggccatgtg ggaggtatga ccctggtgcc tggcaggttc tccctctgcc
1981 ctccactggg cccagtggct cagggacctg gggaaaggag atggagcagc tcttgggatc
2041 ctttggggaa aaggaggcca ttctgggccc cttggcaaac ctcaccactc acagaggctc
2101 ctggccttga tccctgcccc tccaggtgtc cagggtaaag tgtaactcag actgacctgt
2161 ggggcacagg gggcaccagc tggccttgcc ctctctggtc tgggctgtct accttcctca
2221 ctgtatcttt gcccagaccc acctgggcca gtaggcccct gtccccagcc acacacctta
2281 gatgctggca tgccttactc caggtgcctg tgtttggccg aggcctgtgt gattcccggt
2341 ctgcaccaca tggcggggtt ggggggccgc tggaggccac ctgccaaggc gtgggatggg
2401 atggtcctgc cggtttaggc cgtgattctg gaaaaccttg gatgggcctt cgtcctatgt
2461 cagccttccc tttgatcctc aggccctacc tgtagagacc tccactccta gagccagtct
2521 cagggtctgg gatttccctg caggagctca gccaccactg tgccatggtg acacaggcca
2581 aggcagacat tggccctccc ttctcccagc ccccagaggc ctggccttgg gttcgtcagc
2641 atgggccgag gacgttgcct gtagaatcct cctctgcctg ggagtggctc tgtgtggacc
2701 agtccctcac tggcccattc tttttttgac gcagccaatc tgtgaccacg attcctccca
2761 cagatgcctc ctgcttggat tctgagtggt cagagatctg taaagcatga ctttcaagga
2821 tggttcttag gggactgtga aagtgttggg tcttcctcca ggatgcctgc atgggacccc
2881 acccggagct ggtgtggcca ttccccaagt gccactggcc catggatggg ggtgggtgct
2941 ggtgccagct gggctgggtg tgggttctgt gtccttccag gatatgtgtc atttcccatg
3001 aggggccggg gcaggtggct gggtgggggc acaggctgga gtattcttag ttctactggt
3061 tctacactgt gaggtggcaa tgggatttgc tcagatgcca cccaataaaa tgcctgttac
3121 ttaaaaaaaa aaaaaaaaaa
//