LOCUS BC040431 3687 bp mRNA linear HUM 07-AUG-2008
DEFINITION Homo sapiens glucosidase, alpha; acid, mRNA (cDNA clone MGC:48746
IMAGE:5739991), complete cds.
ACCESSION BC040431
VERSION BC040431.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3687)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3687)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (29-NOV-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 84 Row: j Column: 8
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 11496988.
FEATURES Location/Qualifiers
source 1..3687
/db_xref="H-InvDB:HIT000052383"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:48746 IMAGE:5739991"
/tissue_type="Duodenum, adenocarcinoma"
/clone_lib="NIH_MGC_88"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..3687
/gene="GAA"
/gene_synonym="LYAG"
/db_xref="GeneID:2548"
/db_xref="HGNC:HGNC:4065"
/db_xref="MIM:606800"
CDS 145..3003
/gene="GAA"
/gene_synonym="LYAG"
/codon_start=1
/product="glucosidase, alpha; acid"
/protein_id="AAH40431.1"
/db_xref="GeneID:2548"
/db_xref="HGNC:HGNC:4065"
/db_xref="MIM:606800"
/translation="MGVRHPPCSHRLLAVCALVSLATAALLGHILLHDFLLVPRELSG
SSPVLEETHPAHQQGASRPGPRDAQAHPGRPRAVPTQCDVPPNSRFDCAPDKAITQEQ
CEARGCCYIPAKQGLQGAQMGQPWCFFPPSYPSYKLENLSSSEMGYTATLTRTTPTFF
PKDILTLRLDVMMETENRLHFTIKDPANRRYEVPLETPHVHSRAPSPLYSVEFSEEPF
GVIVRRQLDGRVLLNTTVAPLFFADQFLQLSTSLPSQYITGLAEHLSPLMLSTSWTRI
TLWNRDLAPTPGANLYGSHPFYLALEDGGSAHGVFLLNSNAMDVVLQPSPALSWRSTG
GILDVYIFLGPEPKSVVQQYLDVVGYPFMPPYWGLGFHLCRWGYSSTAITRQVVENMT
RAHFPLDVQWNDLDYMDSRRDFTFNKDGFRDFPAMVQELHQGGRRYMMIVDPAISSSG
PAGSYRPYDEGLRRGVFITNETGQPLIGKVWPGSTAFPDFTNPTALAWWEDMVAEFHD
QVPFDGMWIDMNEPSNFIRGSEDGCPNNELENPPYVPGVVGGTLQAATICASSHQFLS
THYNLHNLYGLTEAIASHRALVKARGTRPFVISRSTFAGHGRYAGHWTGDVWSSWEQL
ASSVPEILQFNLLGVPLVGADVCGFLGNTSEELCVRWTQLGAFYPFMRNHNSLLSLPQ
EPYSFSEPAQQAMRKALTLRYALLPHLYTLFHQAHVAGETVARPLFLEFPKDSSTWTV
DHQLLWGEALLITPVLQAGKAEVTGYFPLGTWYDLQTVPVEALGSLPPPPAAPREPAI
HSEGQWVTLPAPLDTINVHLRAGYIIPLQGPGLTTTESRQQPMALAVALTKGGEARGE
LFWDDGESLEVLERGAYTQVIFLARNNTIVNELVRVTSEGAGLQLQKVTVLGVATAPQ
QVLSNGVPVSNFTYSPDTKVLDICVSLLMGEQFLVSWC"
BASE COUNT 759 a 1203 c 1075 g 650 t
ORIGIN
1 tccgcgggcg gccagggcgc gcgtgcgcgg aggttctcct cgtccgcccg ttgttcagcg
61 agggaggctc tgcgcgtgcc gcagctgacg gggaaactga ggcacggagc gggcctgtag
121 gagctgtcca ggccatctcc aaccatggga gtgaggcacc cgccctgctc ccaccggctc
181 ctggccgtct gcgccctcgt gtccttggca accgctgcac tcctggggca catcctactc
241 catgatttcc tgctggttcc ccgagagctg agtggctcct ccccagtcct ggaggagact
301 cacccagctc accagcaggg agccagtaga ccagggcccc gggatgccca ggcacacccc
361 ggccgtccca gagcagtgcc cacacagtgc gacgtccccc ccaacagccg cttcgattgc
421 gcccctgaca aggccatcac ccaggaacag tgcgaggccc gcggctgttg ctacatccct
481 gcaaagcagg ggctgcaggg agcccagatg gggcagccct ggtgcttctt cccacccagc
541 taccccagct acaagctgga gaacctgagc tcctctgaaa tgggctacac ggccaccctg
601 acccgtacca cccccacctt cttccccaag gacatcctga ccctgcggct ggacgtgatg
661 atggagactg agaaccgcct ccacttcacg atcaaagatc cagctaacag gcgctacgag
721 gtgcccttgg agaccccgca tgtccacagc cgggcaccgt ccccactcta cagcgtggag
781 ttctccgagg agcccttcgg ggtgatcgtg cgccggcagc tggacggccg cgtgctgctg
841 aacacgacgg tggcgcccct gttctttgcg gaccagttcc ttcagctgtc cacctcgctg
901 ccctcgcagt atatcacagg cctcgccgag cacctcagtc ccctgatgct cagcaccagc
961 tggaccagga tcaccctgtg gaaccgggac cttgcgccca cgcccggtgc gaacctctac
1021 gggtctcacc ctttctacct ggcgctggag gacggcgggt cggcacacgg ggtgttcctg
1081 ctaaacagca atgccatgga tgtggtcctg cagccgagcc ctgcccttag ctggaggtcg
1141 acaggtggga tcctggatgt ctacatcttc ctgggcccag agcccaagag cgtggtgcag
1201 cagtacctgg acgttgtggg atacccgttc atgccgccat actggggcct gggcttccac
1261 ctgtgccgct ggggctactc ctccaccgct atcacccgcc aggtggtgga gaacatgacc
1321 agggcccact tccccctgga cgtccagtgg aacgacctgg actacatgga ctcccggagg
1381 gacttcacgt tcaacaagga tggcttccgg gacttcccgg ccatggtgca ggagctgcac
1441 cagggcggcc ggcgctacat gatgatcgtg gatcctgcca tcagcagctc gggccctgcc
1501 gggagctaca ggccctacga cgagggtctg cggagggggg ttttcatcac caacgagacc
1561 ggccagccgc tgattgggaa ggtatggccc gggtccactg ccttccccga cttcaccaac
1621 cccacagccc tggcctggtg ggaggacatg gtggctgagt tccatgacca ggtgcccttc
1681 gacggcatgt ggattgacat gaacgagcct tccaacttca tcaggggctc tgaggacggc
1741 tgccccaaca atgagctgga gaacccaccc tacgtgcctg gggtggttgg ggggaccctc
1801 caggcggcca ccatctgtgc ctccagccac cagtttctct ccacacacta caacctgcac
1861 aacctctacg gcctgaccga agccatcgcc tcccacaggg cgctggtgaa ggctcggggg
1921 acacgcccat ttgtgatctc ccgctcgacc tttgctggcc acggccgata cgccggccac
1981 tggacggggg acgtgtggag ctcctgggag cagctcgcct cctccgtgcc agaaatcctg
2041 cagtttaacc tgctgggggt gcctctggtc ggggccgacg tctgcggctt cctgggcaac
2101 acctcagagg agctgtgtgt gcgctggacc cagctggggg ccttctaccc cttcatgcgg
2161 aaccacaaca gcctgctcag tctgccccag gagccgtaca gcttcagcga gccggcccag
2221 caggccatga ggaaggccct caccctgcgc tacgcactcc tcccccacct ctacacactg
2281 ttccaccagg cccacgtcgc gggggagacc gtggcccggc ccctcttcct ggagttcccc
2341 aaggactcta gcacctggac tgtggaccac cagctcctgt ggggggaggc cctgctcatc
2401 accccagtgc tccaggccgg gaaggccgaa gtgactggct acttcccctt gggcacatgg
2461 tacgacctgc agacggtgcc agtagaggcc cttggcagcc tcccaccccc acctgcagct
2521 ccccgtgagc cagccatcca cagcgagggg cagtgggtga cgctgccggc ccccctggac
2581 accatcaacg tccacctccg ggctgggtac atcatccccc tgcagggccc tggcctcaca
2641 accacagagt cccgccagca gcccatggcc ctggctgtgg ccctgaccaa gggtggggag
2701 gcccgagggg agcttttctg ggacgatgga gagagcctgg aagtgctgga gcgaggggcc
2761 tacacacagg tcatcttcct ggccaggaat aacacgatcg tgaatgagct ggtacgtgtg
2821 accagtgagg gagctggcct gcagctgcag aaggtgactg tcctgggcgt ggccacggcg
2881 ccccagcagg tcctctccaa cggtgtccct gtctccaact tcacctacag ccccgacacc
2941 aaggtcctgg acatctgtgt ctcgctgttg atgggagagc agtttctcgt cagctggtgt
3001 tagccgggcg gagtgtgtta gtctctccag agggaggctg gttccccagg gaagcagagc
3061 ctgtgtgcgg gcagcagctg tgtgcgggcc tgggggttgc atgtgtcacc tggagctggg
3121 cactaaccat tccaagccgc cgcatcgctt gtttccacct cctgggccgg ggctctggcc
3181 cccaacgtgt ctaggagagc tttctcccta gatcgcactg tgggccgggg ccctggaggg
3241 ctgctctgtg ttaataagat tgtaaggttt gccctcctca cctgttgccg gcatgcgggt
3301 agtattagcc acccccctcc atctgttccc agcaccggag aagggggtgc tcaggtggag
3361 gtgtggggta tgcacctgag ctcctgcttc gcgcctgctg ctctgcccca acgcgaccgc
3421 tgcccggctg cccagagggc tggatgcctg ccggtccccg agcaagcctg ggaactcagg
3481 aaaattcaca ggacttggga gattctaaat cttaagtgca attattttta ataaaagggg
3541 catttggaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
3601 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
3661 aaaaaaaaaa aaaaaaaaaa aaaaaaa
//