LOCUS       HUMGCBPRC               2275 bp    mRNA    linear   HUM 27-APR-1993
DEFINITION  Human glucocerebrosidase mRNA, complete cds.
ACCESSION   M19285
VERSION     M19285.1
KEYWORDS    glucocerebrosidase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2275)
  AUTHORS   Horowitz,M.
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 140)
  AUTHORS   Reiner,O., Wigderson,M. and Horowitz,M.
  TITLE     Structural analysis of the human glucocerebrosidase genes
  JOURNAL   DNA 7 (2), 107-116 (1988)
   PUBMED   3359914
COMMENT     Original source text: Human erythroid cell line K562, cDNA to mRNA,
            clone p7-2-1.
            Draft entry and computer-readable sequence for [2],[1] kindly
            provided by M,Horowitz, 13-MAY-1988.
            An alternativa initiation codon is locate at positions 183-185 and
            alternative poly-adenylation signals at positions 2250-2255 (major)
            and 1774-1779 (minor).
FEATURES             Location/Qualifiers
     source          1..2275
                     /db_xref="H-InvDB:HIT000194682"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     mRNA            <1..2275
                     /product="GCB mRNA"
     CDS             123..1733
                     /note="glucocerebrosidase"
                     /codon_start=1
                     /protein_id="AAA35880.1"
                     /translation="MEFSSPSREECPKPLSRVSIMAGSLTGLLLLQAVSWASGARPCI
                     PKSFGYSSVVCVCNATYCDSFDPPTFPALGTFSRYESTRSGRRMELSMGPIQANHTGT
                     GLLLTLQPEQKFQKVKGFGGAMTDAAALNILALSPPAQNLLLKSYFSEEGIGYNIIRV
                     PMASCDFSIRTYTYADTPDDFQLHNFSLPEEDTKLKIPLIHRALQLAQRPVSLLASPW
                     TSPTWLKTNGAVNGKGSLKGQPGDIYHQTWARYFVKFLDAYAEHKLQFWAVTAENEPS
                     AGLLSGYPFQCLGFTPEHQRDFIARDLGPTLANSTHHNVRLLMLDDQRLLLPHWAKVV
                     LTDPEAAKYVHGIAVHWYLDFLAPAKATLGETHRLFPNTMLFASEACVGSKFWEQSVR
                     LGSWDRGMQYSHSIITNLLYHVVGWTDWNLALNPEGGPNWVRNFVDSPIIVDITKDTF
                     YKQPMFYHLGHFSKFIPEGSQRVGLVASQKNDLDAVALMHPDGSAVVVVLNRSSKDVP
                     LTIKDPAVGFLETISPGYSIHTYLWHRQ"
BASE COUNT          523 a          652 c          596 g          503 t
ORIGIN      32 bp upstream of KpnI site; chromosome 1q21-31.
        1 gctaacctag tgcctatagc taaggcaggt acctgcatcc ttgtttttgt ttagtggatc
       61 ctctatcctt cagagactct ggaacccctg tggtcttctc ttcatctaat gaccctgagg
      121 ggatggagtt ttcaagtcct tccagagagg aatgtcccaa gcctttgagt agggtaagca
      181 tcatggctgg cagcctcaca ggtttgcttc tacttcaggc agtgtcgtgg gcatcaggtg
      241 cccgcccctg catccctaaa agcttcggct acagctcggt ggtgtgtgtc tgcaatgcca
      301 catactgtga ctcctttgac cccccgacct ttcctgccct tggtaccttc agccgctatg
      361 agagtacacg cagtgggcga cggatggagc tgagtatggg gcccatccag gctaatcaca
      421 cgggcacagg cctgctactg accctgcagc cagaacagaa gttccagaaa gtgaagggat
      481 ttggaggggc catgacagat gctgctgctc tcaacatcct tgccctgtca ccccctgccc
      541 aaaatttgct acttaaatcg tacttctctg aagaaggaat cggatataac atcatccggg
      601 tacccatggc cagctgtgac ttctccatcc gcacctacac ctatgcagac acccctgatg
      661 atttccagtt gcacaacttc agcctcccag aggaagatac caagctcaag atacccctga
      721 ttcaccgagc cctgcagttg gcccagcgtc ccgtttcact ccttgccagc ccctggacat
      781 cacccacttg gctcaagacc aatggagcgg tgaatgggaa ggggtcactc aagggacagc
      841 ccggagacat ctaccaccag acctgggcca gatactttgt gaagttcctg gatgcctatg
      901 ctgagcacaa gttacagttc tgggcagtga cagctgaaaa tgagccttct gctgggctgt
      961 tgagtggata ccccttccag tgcctgggct tcacccctga acatcagcga gacttcattg
     1021 cccgtgacct aggtcctacc ctcgccaaca gtactcacca caatgtccgc ctactcatgc
     1081 tggatgacca acgcttgctg ctgccccact gggcaaaggt ggtactgaca gacccagaag
     1141 cagctaaata tgttcatggc attgctgtac attggtacct ggactttctg gctccagcca
     1201 aagccaccct aggggagaca caccgcctgt tccccaacac catgctcttt gcctcagagg
     1261 cctgtgtggg ctccaagttc tgggagcaga gtgtgcggct aggctcctgg gatcgaggga
     1321 tgcagtacag ccacagcatc atcacgaacc tcctgtacca tgtggtcggc tggaccgact
     1381 ggaaccttgc cctgaacccc gaaggaggac ccaattgggt gcgtaacttt gtcgacagtc
     1441 ccatcattgt agacatcacc aaggacacgt tttacaaaca gcccatgttc taccaccttg
     1501 gccacttcag caagttcatt cctgagggct cccagagagt ggggctggtt gccagtcaga
     1561 agaacgacct ggacgcagtg gcactgatgc atcccgatgg ctctgctgtt gtggtcgtgc
     1621 taaaccgctc ctctaaggat gtgcctctta ccatcaagga tcctgctgtg ggcttcctgg
     1681 agacaatctc acctggctac tccattcaca cctacctgtg gcatcgccag tgatggagca
     1741 gatactcaag gaggcactgg gctcagcctg ggcattaaag ggacagagtc agctcacacg
     1801 ctgtctgtga ctaaagaggg cacagcaggg ccagtgtgag cttacagcga cgtaagccca
     1861 ggggcaatgg tttgggtgac tcactttccc ctctaggtgg tgcccagggc tggaggcccc
     1921 tagaaaaaga tcagtaagcc ccagtgtccc cccagccccc atgcttatgt gaacatgcgc
     1981 tgtgtgctgc ttgctttgga aactngcctg ggtccaggcc tagggtgagc tcactgtccg
     2041 tacaaacaca agatcagggc tgagggtaag gaaaagaaga gactaggaaa gctgggccca
     2101 aaactggaga ctgtttgtct ttcctagaga tgcagaactg ggcccgtgga gcagcagtgt
     2161 cagcatcagg gcggaagcct taaagcagca gcgggtgtgc ccaggcaccc agatgattcc
     2221 tatggcacca gccaggaaaa atggcagctc ttaaaggaga aaatgtttga gccca
//