LOCUS BC053991 2502 bp mRNA linear HUM 04-AUG-2008
DEFINITION Homo sapiens N-acetylglucosaminidase, alpha-, mRNA (cDNA clone
MGC:59849 IMAGE:6213852), complete cds.
ACCESSION BC053991
VERSION BC053991.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2502)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2502)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (24-JUN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 47 Row: g Column: 9
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 66346697.
FEATURES Location/Qualifiers
source 1..2502
/db_xref="H-InvDB:HIT000054033"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:59849 IMAGE:6213852"
/tissue_type="Skin, melanoma, melanotic"
/clone_lib="NIH_MGC_112"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2502
/gene="NAGLU"
/gene_synonym="MPS-IIIB"
/gene_synonym="MPS3B"
/gene_synonym="NAG"
/gene_synonym="UFHSD"
/db_xref="GeneID:4669"
/db_xref="HGNC:HGNC:7632"
/db_xref="MIM:609701"
CDS 24..2255
/gene="NAGLU"
/gene_synonym="MPS-IIIB"
/gene_synonym="MPS3B"
/gene_synonym="NAG"
/gene_synonym="UFHSD"
/codon_start=1
/product="N-acetylglucosaminidase, alpha-"
/protein_id="AAH53991.1"
/db_xref="GeneID:4669"
/db_xref="HGNC:HGNC:7632"
/db_xref="MIM:609701"
/translation="MEAVAVAAAVGVLLLAGAGGAAGDEAREAAAVRALVARLLGPGP
AADFSVSVERALAAKPGLDTYSLGGGGAARVRVRGSTGVAAAAGLHRYLRDFCGCHVA
WSGSQLRLPRPLPAVPGELTEATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALN
GINLALAWSGQEAIWQRVYLALGLTQAEINEFFTGPAFLAWGRMGNLHTWDGPLPPSW
HIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCSYS
CSFLLAPEDPIFPIIGSLFLRELIKEFGTDHIYGADTFNEMQPPSSEPSYLAAATTAV
YEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYT
RTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQ
NEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEA
CRGHNRSPLVRRPSLQMNTSIWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ
AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQ
ARAAAVSEAEADFYEQNSRYQLTLWGPEGNILDYANKQLAGLVANYYTPRWRLFLEAL
VDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPGWVA
GSW"
BASE COUNT 454 a 781 c 798 g 469 t
ORIGIN
1 gcgggacccg caggactgag accatggagg cggtggcggt ggccgcggcg gtgggggtcc
61 ttctcctggc cggggccggg ggcgcggcag gcgacgaggc ccgggaggcg gcggccgtgc
121 gggcgctcgt ggcccggctg ctggggccag gccccgcggc cgacttctcc gtgtcggtgg
181 agcgcgctct ggctgccaag ccgggcttgg acacctacag cctgggcggc ggcggcgcgg
241 cgcgcgtgcg ggtgcgcggc tccacgggcg tggcggccgc cgcggggctg caccgctacc
301 tgcgcgactt ctgtggctgc cacgtggcct ggtccggctc tcagctgcgc ctgccgcggc
361 cactgccagc cgtgccgggg gagctgaccg aggccacgcc caacaggtac cgctattacc
421 agaatgtgtg cacgcaaagc tactccttcg tgtggtggga ctgggcccgc tgggagcgag
481 agatagactg gatggcgctg aatggcatca acctggcact ggcctggagc ggccaggagg
541 ccatctggca gcgggtgtac ctggccttgg gcctgaccca ggcagagatc aatgagttct
601 ttactggtcc tgccttcctg gcctgggggc gaatgggcaa cctgcacacc tgggatggcc
661 ccctgccccc ctcctggcac atcaagcagc tttacctgca gcaccgggtc ctggaccaga
721 tgcgctcctt cggcatgacc ccagtgctgc ctgcattcgc ggggcatgtt cccgaggctg
781 tcaccagggt gttccctcag gtcaatgtca cgaagatggg cagttggggc cactttaact
841 gttcctactc ctgctccttc cttctggctc cggaagaccc catattcccc atcatcggga
901 gcctcttcct gcgagagctg atcaaagagt ttggcacaga ccacatctat ggggccgaca
961 ctttcaatga gatgcagcca ccttcctcag agccctccta ccttgccgca gccaccactg
1021 ccgtctatga ggccatgact gcagtggata ctgaggctgt gtggctgctc caaggctggc
1081 tcttccagca ccagccgcag ttctgggggc ccgcccagat cagggctgtg ctgggagctg
1141 tgccccgtgg ccgcctcctg gttctggacc tgtttgctga gagccagcct gtgtataccc
1201 gcactgcctc cttccagggc cagcccttca tctggtgcat gctgcacaac tttgggggaa
1261 accatggtct ttttggagcc ctagaggctg tgaacggagg cccagaagct gcccgcctct
1321 tccccaactc caccatggta ggcacgggca tggcccccga gggcatcagc cagaacgaag
1381 tggtctattc cctcatggct gagctgggct ggcgaaagga cccagtgcca gatttggcag
1441 cctgggtgac cagctttgcc gcccggcggt atggggtctc ccacccggac gcaggggcag
1501 cgtggaggct actgctccgg agtgtgtaca actgctccgg ggaggcctgc aggggccaca
1561 atcgtagccc gctggtcagg cggccgtccc tacagatgaa taccagcatc tggtacaacc
1621 gatctgatgt gtttgaggcc tggcggctgc tgctcacatc tgctccctcc ctggccacca
1681 gccccgcctt ccgctacgac ctgctggacc tcactcggca ggcagtgcag gagctggtca
1741 gcttgtacta tgaggaggca agaagcgcct acctgagcaa ggagctggcc tccctgttga
1801 gggctggagg cgtcctggcc tatgagctgc tgccggcact ggacgaggtg ctggctagtg
1861 acagccgctt cttgctgggc agctggctag agcaggcccg agcagcggca gtcagtgagg
1921 ccgaggccga tttctacgag cagaacagcc gctaccagct gaccttgtgg gggccagaag
1981 gcaacatcct ggactatgcc aacaagcagc tggcggggtt ggtggccaac tactacaccc
2041 ctcgctggcg gcttttcctg gaggcgctgg ttgacagtgt ggcccagggc atccctttcc
2101 aacagcacca gtttgacaaa aatgtcttcc aactggagca ggccttcgtt ctcagcaagc
2161 agaggtaccc cagccagccg cgaggagaca ctgtggacct ggccaagaag atcttcctca
2221 aatattaccc cggctgggtg gccggctctt ggtgatagat tcgccaccac tgggccttgt
2281 tttccgctaa ttccagggca gattccaggg cccagagctg gacagacatc acaggataac
2341 ccaggcctgg gaggaggccc cacggcctgc tggtggggtc tgacctgggg ggattggagg
2401 gaaatgacct gccctccacc accacccaaa gtgtgggatt aaagtactgt tttctttcca
2461 cttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa
//