LOCUS BC040431 3687 bp mRNA linear HUM 07-AUG-2008 DEFINITION Homo sapiens glucosidase, alpha; acid, mRNA (cDNA clone MGC:48746 IMAGE:5739991), complete cds. ACCESSION BC040431 VERSION BC040431.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3687) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3687) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (29-NOV-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 84 Row: j Column: 8 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 11496988. FEATURES Location/Qualifiers source 1..3687 /db_xref="H-InvDB:HIT000052383" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:48746 IMAGE:5739991" /tissue_type="Duodenum, adenocarcinoma" /clone_lib="NIH_MGC_88" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..3687 /gene="GAA" /gene_synonym="LYAG" /db_xref="GeneID:2548" /db_xref="HGNC:HGNC:4065" /db_xref="MIM:606800" CDS 145..3003 /gene="GAA" /gene_synonym="LYAG" /codon_start=1 /product="glucosidase, alpha; acid" /protein_id="AAH40431.1" /db_xref="GeneID:2548" /db_xref="HGNC:HGNC:4065" /db_xref="MIM:606800" /translation="MGVRHPPCSHRLLAVCALVSLATAALLGHILLHDFLLVPRELSG SSPVLEETHPAHQQGASRPGPRDAQAHPGRPRAVPTQCDVPPNSRFDCAPDKAITQEQ CEARGCCYIPAKQGLQGAQMGQPWCFFPPSYPSYKLENLSSSEMGYTATLTRTTPTFF PKDILTLRLDVMMETENRLHFTIKDPANRRYEVPLETPHVHSRAPSPLYSVEFSEEPF GVIVRRQLDGRVLLNTTVAPLFFADQFLQLSTSLPSQYITGLAEHLSPLMLSTSWTRI TLWNRDLAPTPGANLYGSHPFYLALEDGGSAHGVFLLNSNAMDVVLQPSPALSWRSTG GILDVYIFLGPEPKSVVQQYLDVVGYPFMPPYWGLGFHLCRWGYSSTAITRQVVENMT RAHFPLDVQWNDLDYMDSRRDFTFNKDGFRDFPAMVQELHQGGRRYMMIVDPAISSSG PAGSYRPYDEGLRRGVFITNETGQPLIGKVWPGSTAFPDFTNPTALAWWEDMVAEFHD QVPFDGMWIDMNEPSNFIRGSEDGCPNNELENPPYVPGVVGGTLQAATICASSHQFLS THYNLHNLYGLTEAIASHRALVKARGTRPFVISRSTFAGHGRYAGHWTGDVWSSWEQL ASSVPEILQFNLLGVPLVGADVCGFLGNTSEELCVRWTQLGAFYPFMRNHNSLLSLPQ EPYSFSEPAQQAMRKALTLRYALLPHLYTLFHQAHVAGETVARPLFLEFPKDSSTWTV DHQLLWGEALLITPVLQAGKAEVTGYFPLGTWYDLQTVPVEALGSLPPPPAAPREPAI HSEGQWVTLPAPLDTINVHLRAGYIIPLQGPGLTTTESRQQPMALAVALTKGGEARGE LFWDDGESLEVLERGAYTQVIFLARNNTIVNELVRVTSEGAGLQLQKVTVLGVATAPQ QVLSNGVPVSNFTYSPDTKVLDICVSLLMGEQFLVSWC" BASE COUNT 759 a 1203 c 1075 g 650 t ORIGIN 1 tccgcgggcg gccagggcgc gcgtgcgcgg aggttctcct cgtccgcccg ttgttcagcg 61 agggaggctc tgcgcgtgcc gcagctgacg gggaaactga ggcacggagc gggcctgtag 121 gagctgtcca ggccatctcc aaccatggga gtgaggcacc cgccctgctc ccaccggctc 181 ctggccgtct gcgccctcgt gtccttggca accgctgcac tcctggggca catcctactc 241 catgatttcc tgctggttcc ccgagagctg agtggctcct ccccagtcct ggaggagact 301 cacccagctc accagcaggg agccagtaga ccagggcccc gggatgccca ggcacacccc 361 ggccgtccca gagcagtgcc cacacagtgc gacgtccccc ccaacagccg cttcgattgc 421 gcccctgaca aggccatcac ccaggaacag tgcgaggccc gcggctgttg ctacatccct 481 gcaaagcagg ggctgcaggg agcccagatg gggcagccct ggtgcttctt cccacccagc 541 taccccagct acaagctgga gaacctgagc tcctctgaaa tgggctacac ggccaccctg 601 acccgtacca cccccacctt cttccccaag gacatcctga ccctgcggct ggacgtgatg 661 atggagactg agaaccgcct ccacttcacg atcaaagatc cagctaacag gcgctacgag 721 gtgcccttgg agaccccgca tgtccacagc cgggcaccgt ccccactcta cagcgtggag 781 ttctccgagg agcccttcgg ggtgatcgtg cgccggcagc tggacggccg cgtgctgctg 841 aacacgacgg tggcgcccct gttctttgcg gaccagttcc ttcagctgtc cacctcgctg 901 ccctcgcagt atatcacagg cctcgccgag cacctcagtc ccctgatgct cagcaccagc 961 tggaccagga tcaccctgtg gaaccgggac cttgcgccca cgcccggtgc gaacctctac 1021 gggtctcacc ctttctacct ggcgctggag gacggcgggt cggcacacgg ggtgttcctg 1081 ctaaacagca atgccatgga tgtggtcctg cagccgagcc ctgcccttag ctggaggtcg 1141 acaggtggga tcctggatgt ctacatcttc ctgggcccag agcccaagag cgtggtgcag 1201 cagtacctgg acgttgtggg atacccgttc atgccgccat actggggcct gggcttccac 1261 ctgtgccgct ggggctactc ctccaccgct atcacccgcc aggtggtgga gaacatgacc 1321 agggcccact tccccctgga cgtccagtgg aacgacctgg actacatgga ctcccggagg 1381 gacttcacgt tcaacaagga tggcttccgg gacttcccgg ccatggtgca ggagctgcac 1441 cagggcggcc ggcgctacat gatgatcgtg gatcctgcca tcagcagctc gggccctgcc 1501 gggagctaca ggccctacga cgagggtctg cggagggggg ttttcatcac caacgagacc 1561 ggccagccgc tgattgggaa ggtatggccc gggtccactg ccttccccga cttcaccaac 1621 cccacagccc tggcctggtg ggaggacatg gtggctgagt tccatgacca ggtgcccttc 1681 gacggcatgt ggattgacat gaacgagcct tccaacttca tcaggggctc tgaggacggc 1741 tgccccaaca atgagctgga gaacccaccc tacgtgcctg gggtggttgg ggggaccctc 1801 caggcggcca ccatctgtgc ctccagccac cagtttctct ccacacacta caacctgcac 1861 aacctctacg gcctgaccga agccatcgcc tcccacaggg cgctggtgaa ggctcggggg 1921 acacgcccat ttgtgatctc ccgctcgacc tttgctggcc acggccgata cgccggccac 1981 tggacggggg acgtgtggag ctcctgggag cagctcgcct cctccgtgcc agaaatcctg 2041 cagtttaacc tgctgggggt gcctctggtc ggggccgacg tctgcggctt cctgggcaac 2101 acctcagagg agctgtgtgt gcgctggacc cagctggggg ccttctaccc cttcatgcgg 2161 aaccacaaca gcctgctcag tctgccccag gagccgtaca gcttcagcga gccggcccag 2221 caggccatga ggaaggccct caccctgcgc tacgcactcc tcccccacct ctacacactg 2281 ttccaccagg cccacgtcgc gggggagacc gtggcccggc ccctcttcct ggagttcccc 2341 aaggactcta gcacctggac tgtggaccac cagctcctgt ggggggaggc cctgctcatc 2401 accccagtgc tccaggccgg gaaggccgaa gtgactggct acttcccctt gggcacatgg 2461 tacgacctgc agacggtgcc agtagaggcc cttggcagcc tcccaccccc acctgcagct 2521 ccccgtgagc cagccatcca cagcgagggg cagtgggtga cgctgccggc ccccctggac 2581 accatcaacg tccacctccg ggctgggtac atcatccccc tgcagggccc tggcctcaca 2641 accacagagt cccgccagca gcccatggcc ctggctgtgg ccctgaccaa gggtggggag 2701 gcccgagggg agcttttctg ggacgatgga gagagcctgg aagtgctgga gcgaggggcc 2761 tacacacagg tcatcttcct ggccaggaat aacacgatcg tgaatgagct ggtacgtgtg 2821 accagtgagg gagctggcct gcagctgcag aaggtgactg tcctgggcgt ggccacggcg 2881 ccccagcagg tcctctccaa cggtgtccct gtctccaact tcacctacag ccccgacacc 2941 aaggtcctgg acatctgtgt ctcgctgttg atgggagagc agtttctcgt cagctggtgt 3001 tagccgggcg gagtgtgtta gtctctccag agggaggctg gttccccagg gaagcagagc 3061 ctgtgtgcgg gcagcagctg tgtgcgggcc tgggggttgc atgtgtcacc tggagctggg 3121 cactaaccat tccaagccgc cgcatcgctt gtttccacct cctgggccgg ggctctggcc 3181 cccaacgtgt ctaggagagc tttctcccta gatcgcactg tgggccgggg ccctggaggg 3241 ctgctctgtg ttaataagat tgtaaggttt gccctcctca cctgttgccg gcatgcgggt 3301 agtattagcc acccccctcc atctgttccc agcaccggag aagggggtgc tcaggtggag 3361 gtgtggggta tgcacctgag ctcctgcttc gcgcctgctg ctctgcccca acgcgaccgc 3421 tgcccggctg cccagagggc tggatgcctg ccggtccccg agcaagcctg ggaactcagg 3481 aaaattcaca ggacttggga gattctaaat cttaagtgca attattttta ataaaagggg 3541 catttggaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3601 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3661 aaaaaaaaaa aaaaaaaaaa aaaaaaa //