LOCUS       BC040431                3687 bp    mRNA    linear   HUM 07-AUG-2008
DEFINITION  Homo sapiens glucosidase, alpha; acid, mRNA (cDNA clone MGC:48746
            IMAGE:5739991), complete cds.
ACCESSION   BC040431
VERSION     BC040431.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3687)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3687)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (29-NOV-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 84 Row: j Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 11496988.
FEATURES             Location/Qualifiers
     source          1..3687
                     /db_xref="H-InvDB:HIT000052383"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:48746 IMAGE:5739991"
                     /tissue_type="Duodenum, adenocarcinoma"
                     /clone_lib="NIH_MGC_88"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3687
                     /gene="GAA"
                     /gene_synonym="LYAG"
                     /db_xref="GeneID:2548"
                     /db_xref="HGNC:HGNC:4065"
                     /db_xref="MIM:606800"
     CDS             145..3003
                     /gene="GAA"
                     /gene_synonym="LYAG"
                     /codon_start=1
                     /product="glucosidase, alpha; acid"
                     /protein_id="AAH40431.1"
                     /db_xref="GeneID:2548"
                     /db_xref="HGNC:HGNC:4065"
                     /db_xref="MIM:606800"
                     /translation="MGVRHPPCSHRLLAVCALVSLATAALLGHILLHDFLLVPRELSG
                     SSPVLEETHPAHQQGASRPGPRDAQAHPGRPRAVPTQCDVPPNSRFDCAPDKAITQEQ
                     CEARGCCYIPAKQGLQGAQMGQPWCFFPPSYPSYKLENLSSSEMGYTATLTRTTPTFF
                     PKDILTLRLDVMMETENRLHFTIKDPANRRYEVPLETPHVHSRAPSPLYSVEFSEEPF
                     GVIVRRQLDGRVLLNTTVAPLFFADQFLQLSTSLPSQYITGLAEHLSPLMLSTSWTRI
                     TLWNRDLAPTPGANLYGSHPFYLALEDGGSAHGVFLLNSNAMDVVLQPSPALSWRSTG
                     GILDVYIFLGPEPKSVVQQYLDVVGYPFMPPYWGLGFHLCRWGYSSTAITRQVVENMT
                     RAHFPLDVQWNDLDYMDSRRDFTFNKDGFRDFPAMVQELHQGGRRYMMIVDPAISSSG
                     PAGSYRPYDEGLRRGVFITNETGQPLIGKVWPGSTAFPDFTNPTALAWWEDMVAEFHD
                     QVPFDGMWIDMNEPSNFIRGSEDGCPNNELENPPYVPGVVGGTLQAATICASSHQFLS
                     THYNLHNLYGLTEAIASHRALVKARGTRPFVISRSTFAGHGRYAGHWTGDVWSSWEQL
                     ASSVPEILQFNLLGVPLVGADVCGFLGNTSEELCVRWTQLGAFYPFMRNHNSLLSLPQ
                     EPYSFSEPAQQAMRKALTLRYALLPHLYTLFHQAHVAGETVARPLFLEFPKDSSTWTV
                     DHQLLWGEALLITPVLQAGKAEVTGYFPLGTWYDLQTVPVEALGSLPPPPAAPREPAI
                     HSEGQWVTLPAPLDTINVHLRAGYIIPLQGPGLTTTESRQQPMALAVALTKGGEARGE
                     LFWDDGESLEVLERGAYTQVIFLARNNTIVNELVRVTSEGAGLQLQKVTVLGVATAPQ
                     QVLSNGVPVSNFTYSPDTKVLDICVSLLMGEQFLVSWC"
BASE COUNT          759 a         1203 c         1075 g          650 t
ORIGIN      
        1 tccgcgggcg gccagggcgc gcgtgcgcgg aggttctcct cgtccgcccg ttgttcagcg
       61 agggaggctc tgcgcgtgcc gcagctgacg gggaaactga ggcacggagc gggcctgtag
      121 gagctgtcca ggccatctcc aaccatggga gtgaggcacc cgccctgctc ccaccggctc
      181 ctggccgtct gcgccctcgt gtccttggca accgctgcac tcctggggca catcctactc
      241 catgatttcc tgctggttcc ccgagagctg agtggctcct ccccagtcct ggaggagact
      301 cacccagctc accagcaggg agccagtaga ccagggcccc gggatgccca ggcacacccc
      361 ggccgtccca gagcagtgcc cacacagtgc gacgtccccc ccaacagccg cttcgattgc
      421 gcccctgaca aggccatcac ccaggaacag tgcgaggccc gcggctgttg ctacatccct
      481 gcaaagcagg ggctgcaggg agcccagatg gggcagccct ggtgcttctt cccacccagc
      541 taccccagct acaagctgga gaacctgagc tcctctgaaa tgggctacac ggccaccctg
      601 acccgtacca cccccacctt cttccccaag gacatcctga ccctgcggct ggacgtgatg
      661 atggagactg agaaccgcct ccacttcacg atcaaagatc cagctaacag gcgctacgag
      721 gtgcccttgg agaccccgca tgtccacagc cgggcaccgt ccccactcta cagcgtggag
      781 ttctccgagg agcccttcgg ggtgatcgtg cgccggcagc tggacggccg cgtgctgctg
      841 aacacgacgg tggcgcccct gttctttgcg gaccagttcc ttcagctgtc cacctcgctg
      901 ccctcgcagt atatcacagg cctcgccgag cacctcagtc ccctgatgct cagcaccagc
      961 tggaccagga tcaccctgtg gaaccgggac cttgcgccca cgcccggtgc gaacctctac
     1021 gggtctcacc ctttctacct ggcgctggag gacggcgggt cggcacacgg ggtgttcctg
     1081 ctaaacagca atgccatgga tgtggtcctg cagccgagcc ctgcccttag ctggaggtcg
     1141 acaggtggga tcctggatgt ctacatcttc ctgggcccag agcccaagag cgtggtgcag
     1201 cagtacctgg acgttgtggg atacccgttc atgccgccat actggggcct gggcttccac
     1261 ctgtgccgct ggggctactc ctccaccgct atcacccgcc aggtggtgga gaacatgacc
     1321 agggcccact tccccctgga cgtccagtgg aacgacctgg actacatgga ctcccggagg
     1381 gacttcacgt tcaacaagga tggcttccgg gacttcccgg ccatggtgca ggagctgcac
     1441 cagggcggcc ggcgctacat gatgatcgtg gatcctgcca tcagcagctc gggccctgcc
     1501 gggagctaca ggccctacga cgagggtctg cggagggggg ttttcatcac caacgagacc
     1561 ggccagccgc tgattgggaa ggtatggccc gggtccactg ccttccccga cttcaccaac
     1621 cccacagccc tggcctggtg ggaggacatg gtggctgagt tccatgacca ggtgcccttc
     1681 gacggcatgt ggattgacat gaacgagcct tccaacttca tcaggggctc tgaggacggc
     1741 tgccccaaca atgagctgga gaacccaccc tacgtgcctg gggtggttgg ggggaccctc
     1801 caggcggcca ccatctgtgc ctccagccac cagtttctct ccacacacta caacctgcac
     1861 aacctctacg gcctgaccga agccatcgcc tcccacaggg cgctggtgaa ggctcggggg
     1921 acacgcccat ttgtgatctc ccgctcgacc tttgctggcc acggccgata cgccggccac
     1981 tggacggggg acgtgtggag ctcctgggag cagctcgcct cctccgtgcc agaaatcctg
     2041 cagtttaacc tgctgggggt gcctctggtc ggggccgacg tctgcggctt cctgggcaac
     2101 acctcagagg agctgtgtgt gcgctggacc cagctggggg ccttctaccc cttcatgcgg
     2161 aaccacaaca gcctgctcag tctgccccag gagccgtaca gcttcagcga gccggcccag
     2221 caggccatga ggaaggccct caccctgcgc tacgcactcc tcccccacct ctacacactg
     2281 ttccaccagg cccacgtcgc gggggagacc gtggcccggc ccctcttcct ggagttcccc
     2341 aaggactcta gcacctggac tgtggaccac cagctcctgt ggggggaggc cctgctcatc
     2401 accccagtgc tccaggccgg gaaggccgaa gtgactggct acttcccctt gggcacatgg
     2461 tacgacctgc agacggtgcc agtagaggcc cttggcagcc tcccaccccc acctgcagct
     2521 ccccgtgagc cagccatcca cagcgagggg cagtgggtga cgctgccggc ccccctggac
     2581 accatcaacg tccacctccg ggctgggtac atcatccccc tgcagggccc tggcctcaca
     2641 accacagagt cccgccagca gcccatggcc ctggctgtgg ccctgaccaa gggtggggag
     2701 gcccgagggg agcttttctg ggacgatgga gagagcctgg aagtgctgga gcgaggggcc
     2761 tacacacagg tcatcttcct ggccaggaat aacacgatcg tgaatgagct ggtacgtgtg
     2821 accagtgagg gagctggcct gcagctgcag aaggtgactg tcctgggcgt ggccacggcg
     2881 ccccagcagg tcctctccaa cggtgtccct gtctccaact tcacctacag ccccgacacc
     2941 aaggtcctgg acatctgtgt ctcgctgttg atgggagagc agtttctcgt cagctggtgt
     3001 tagccgggcg gagtgtgtta gtctctccag agggaggctg gttccccagg gaagcagagc
     3061 ctgtgtgcgg gcagcagctg tgtgcgggcc tgggggttgc atgtgtcacc tggagctggg
     3121 cactaaccat tccaagccgc cgcatcgctt gtttccacct cctgggccgg ggctctggcc
     3181 cccaacgtgt ctaggagagc tttctcccta gatcgcactg tgggccgggg ccctggaggg
     3241 ctgctctgtg ttaataagat tgtaaggttt gccctcctca cctgttgccg gcatgcgggt
     3301 agtattagcc acccccctcc atctgttccc agcaccggag aagggggtgc tcaggtggag
     3361 gtgtggggta tgcacctgag ctcctgcttc gcgcctgctg ctctgcccca acgcgaccgc
     3421 tgcccggctg cccagagggc tggatgcctg ccggtccccg agcaagcctg ggaactcagg
     3481 aaaattcaca ggacttggga gattctaaat cttaagtgca attattttta ataaaagggg
     3541 catttggaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3601 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3661 aaaaaaaaaa aaaaaaaaaa aaaaaaa
//