LOCUS       HSU49897                2680 bp    mRNA    linear   HUM 02-OCT-1997
DEFINITION  Homo sapiens phenylalanine hydroxylase (PAH) mRNA, complete cds.
ACCESSION   U49897
VERSION     U49897.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2680)
  AUTHORS   Konecki,D.S., Wang,Y., Trefz,F.K., Lichter-Konecki,U. and Woo,S.L.
  TITLE     Structural characterization of the 5' regions of the human
            phenylalanine hydroxylase gene
  JOURNAL   Biochemistry 31 (35), 8363-8368 (1992)
   PUBMED   1326329
REFERENCE   2  (bases 1 to 2680)
  AUTHORS   Nowacki,P., Byck,S., Prevost,L. and Scriver,C.R.
  TITLE     The PAH mutation analysis consortium database: update 1996
  JOURNAL   Nucleic Acids Res. 25 (1), 139-142 (1997)
   PUBMED   9016524
REFERENCE   3  (bases 1 to 2680)
  AUTHORS   Scriver,C., Nowacki,P.M., Byck,S. and Prevost,L.
  TITLE     Direct Submission
  JOURNAL   Submitted (26-FEB-1996) Charles Scriver, Debelle Laboratory for
            Biochemical Genetics, McGill University-Montreal Children's
            Hospital Research Institute, 2300 Tupper St., Montreal, Quebec H3H
            1P3, Canada
REFERENCE   4  (bases 1 to 2680)
  AUTHORS   Scriver,C., Nowacki,P.M., Byck,S. and Prevost,L.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-SEP-1997) Charles Scriver, Debelle Laboratory for
            Biochemical Genetics, McGill University-Montreal Children's
            Hospital Research Institute, 2300 Tupper St., Montreal, Quebec H3H
            1P3, Canada
  REMARK    Sequence update by submitter
COMMENT     On Oct 2, 1997 this sequence version replaced gi:2447212.
FEATURES             Location/Qualifiers
     source          1..2680
                     /db_xref="H-InvDB:HIT000220124"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /map="12q22-q2401"
     gene            1..2680
                     /gene="PAH"
     5'UTR           1..472
                     /gene="PAH"
     CDS             473..1831
                     /gene="PAH"
                     /EC_number="1.14.16.1"
                     /codon_start=1
                     /product="phenylalanine hydroxylase"
                     /protein_id="AAC51772.1"
                     /translation="MSTAVLENPGLGRKLSDFGQETSYIEDNCNQNGAISLIFSLKEE
                     VGALAKVLRLFEENDVNLTHIESRPSRLKKDEYEFFTHLDKRSLPALTNIIKILRHDI
                     GATVHELSRDKKKDTVPWFPRTIQELDRFANQILSYGAELDADHPGFKDPVYRARRKQ
                     FADIAYNYRHGQPIPRVEYMEEEKKTWGTVFKTLKSLYKTHACYEYNHIFPLLEKYCG
                     FHEDNIPQLEDVSQFLQTCTGFRLRPVAGLLSSRDFLGGLAFRVFHCTQYIRHGSKPM
                     YTPEPDICHELLGHVPLFSDRSFAQFSQEIGLASLGAPDEYIEKLATIYWFTVEFGLC
                     KQGDSIKAYGAGLLSSFGELQYCLSEKPKLLPLELEKTAIQNYTVTEFQPLYYVAESF
                     NDAKEKVRNFAATIPRPFSVRYDPYTQRIEVLDNTQQLKILADSINSEIGILCSALQK
                     IK"
     misc_feature    473
                     /gene="PAH"
                     /note="for mutation nomenclature purposes the #1 'a' of
                     the atg codon is located at this position of the sequence"
     3'UTR           1832..2680
                     /gene="PAH"
     polyA_site      2680
                     /gene="PAH"
                     /note="18 A nucleotides"
BASE COUNT          762 a          616 c          566 g          736 t
ORIGIN      
        1 cagctggggg taaggggggc ggattattca tataattgtt ataccagacg gtcgcaggct
       61 tagtccaatt gcagagaact cgcttcccag gcttctgaga gtcccggaag tgcctaaacc
      121 tgtctaatcg acggggcttg ggtggcccgt cgctccctgg cttcttccct ttacccaggg
      181 cgggcagcga agtggtgcct cctgcgtccc ccacaccctc cctcagcccc tcccctccgg
      241 cccgtcctgg gcaggtgacc tggagcatcc ggcaggctgc cctggcctcc tgcgtcagga
      301 caagcccacg aggggcgtta ctgtgcggag atgcaccacg caagagacac cctttgtaac
      361 tctcttctcc tccctagtgc gaggttaaaa ccttcagccc cacgtgctgt ttgcaaacct
      421 gcctgtacct gaggccctaa aaagccagag acctcactcc cggggagcca gcatgtccac
      481 tgcggtcctg gaaaacccag gcttgggcag gaaactctct gactttggac aggaaacaag
      541 ctatattgaa gacaactgca atcaaaatgg tgccatatca ctgatcttct cactcaaaga
      601 agaagttggt gcattggcca aagtattgcg cttatttgag gagaatgatg taaacctgac
      661 ccacattgaa tctagacctt ctcgtttaaa gaaagatgag tatgaatttt tcacccattt
      721 ggataaacgt agcctgcctg ctctgacaaa catcatcaag atcttgaggc atgacattgg
      781 tgccactgtc catgagcttt cacgagataa gaagaaagac acagtgccct ggttcccaag
      841 aaccattcaa gagctggaca gatttgccaa tcagattctc agctatggag cggaactgga
      901 tgctgaccac cctggtttta aagatcctgt gtaccgtgca agacggaagc agtttgctga
      961 cattgcctac aactaccgcc atgggcagcc catccctcga gtggaataca tggaggaaga
     1021 aaagaaaaca tggggcacag tgttcaagac tctgaagtcc ttgtataaaa cccatgcttg
     1081 ctatgagtac aatcacattt ttccacttct tgaaaagtac tgtggcttcc atgaagataa
     1141 cattccccag ctggaagacg tttctcaatt cctgcagact tgcactggtt tccgcctccg
     1201 acctgtggct ggcctgcttt cctctcggga tttcttgggt ggcctggcct tccgagtctt
     1261 ccactgcaca cagtacatca gacatggatc caagcccatg tatacccccg aacctgacat
     1321 ctgccatgag ctgttgggac atgtgccctt gttttcagat cgcagctttg cccagttttc
     1381 ccaggaaatt ggccttgcct ctctgggtgc acctgatgaa tacattgaaa agctcgccac
     1441 aatttactgg tttactgtgg agtttgggct ctgcaaacaa ggagactcca taaaggcata
     1501 tggtgctggg ctcctgtcat cctttggtga attacagtac tgcttatcag agaagccaaa
     1561 gcttctcccc ctggagctgg agaagacagc catccaaaat tacactgtca cggagttcca
     1621 gcccctgtat tacgtggcag agagttttaa tgatgccaag gagaaagtaa ggaactttgc
     1681 tgccacaata cctcggccct tctcagttcg ctacgaccca tacacccaaa ggattgaggt
     1741 cttggacaat acccagcagc ttaagatttt ggctgattcc attaacagtg aaattggaat
     1801 cctttgcagt gccctccaga aaataaagta aagccatgga cagaatgtgg tctgtcagct
     1861 gtgaatctgt tgatggagat ccaactattt ctttcatcag aaaaagtccg aaaagcaaac
     1921 cttaatttga aataacagcc ttaaatcctt tacaagatgg agaaacaaca aataagtcaa
     1981 aataatctga aatgacagga tatgagtaca tactcaagag cataatggta aatcttttgg
     2041 ggtcatcttt gatttagaga tgataatccc atactctcaa ttgagttaaa tcagtaatct
     2101 gtcgcatttc atcaagatta attaaaattt gggacctgct tcattcaagc ttcatatatg
     2161 ctttgcagag aactcataaa ggagcatata aggctaaatg taaaacacaa gactgtcatt
     2221 agaattgaat tattgggctt aatataaatc gtaacctatg aagtttattt tctattttag
     2281 ttaactatga ttccaattac tactttgtta ttgtacctaa gtaaattttc tttaggtcag
     2341 aagcccatta aaatagttac aagcattgaa cttctttagt attatattaa tataaaaaca
     2401 tttttgtatg ttttattgta atcataaata ctgctgtata aggtaataaa actctgcacc
     2461 taatccccat aacttccagt atcattttcc aattaattat caagtctgtt ttgggaaaca
     2521 ctttgaggac atttatgatg cagcagatgt tgactaaagg cttggttggt agatattcag
     2581 gaaatgttca ctgaataaat aagtaaatac attattgaaa agcaaatctg tataaatgtg
     2641 aaatttttat ttgtattagt aataaaacat tagtagttta
//