LOCUS BC053991 2502 bp mRNA linear HUM 04-AUG-2008 DEFINITION Homo sapiens N-acetylglucosaminidase, alpha-, mRNA (cDNA clone MGC:59849 IMAGE:6213852), complete cds. ACCESSION BC053991 VERSION BC053991.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2502) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2502) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (24-JUN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 47 Row: g Column: 9 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 66346697. FEATURES Location/Qualifiers source 1..2502 /db_xref="H-InvDB:HIT000054033" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:59849 IMAGE:6213852" /tissue_type="Skin, melanoma, melanotic" /clone_lib="NIH_MGC_112" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2502 /gene="NAGLU" /gene_synonym="MPS-IIIB" /gene_synonym="MPS3B" /gene_synonym="NAG" /gene_synonym="UFHSD" /db_xref="GeneID:4669" /db_xref="HGNC:HGNC:7632" /db_xref="MIM:609701" CDS 24..2255 /gene="NAGLU" /gene_synonym="MPS-IIIB" /gene_synonym="MPS3B" /gene_synonym="NAG" /gene_synonym="UFHSD" /codon_start=1 /product="N-acetylglucosaminidase, alpha-" /protein_id="AAH53991.1" /db_xref="GeneID:4669" /db_xref="HGNC:HGNC:7632" /db_xref="MIM:609701" /translation="MEAVAVAAAVGVLLLAGAGGAAGDEAREAAAVRALVARLLGPGP AADFSVSVERALAAKPGLDTYSLGGGGAARVRVRGSTGVAAAAGLHRYLRDFCGCHVA WSGSQLRLPRPLPAVPGELTEATPNRYRYYQNVCTQSYSFVWWDWARWEREIDWMALN GINLALAWSGQEAIWQRVYLALGLTQAEINEFFTGPAFLAWGRMGNLHTWDGPLPPSW HIKQLYLQHRVLDQMRSFGMTPVLPAFAGHVPEAVTRVFPQVNVTKMGSWGHFNCSYS CSFLLAPEDPIFPIIGSLFLRELIKEFGTDHIYGADTFNEMQPPSSEPSYLAAATTAV YEAMTAVDTEAVWLLQGWLFQHQPQFWGPAQIRAVLGAVPRGRLLVLDLFAESQPVYT RTASFQGQPFIWCMLHNFGGNHGLFGALEAVNGGPEAARLFPNSTMVGTGMAPEGISQ NEVVYSLMAELGWRKDPVPDLAAWVTSFAARRYGVSHPDAGAAWRLLLRSVYNCSGEA CRGHNRSPLVRRPSLQMNTSIWYNRSDVFEAWRLLLTSAPSLATSPAFRYDLLDLTRQ AVQELVSLYYEEARSAYLSKELASLLRAGGVLAYELLPALDEVLASDSRFLLGSWLEQ ARAAAVSEAEADFYEQNSRYQLTLWGPEGNILDYANKQLAGLVANYYTPRWRLFLEAL VDSVAQGIPFQQHQFDKNVFQLEQAFVLSKQRYPSQPRGDTVDLAKKIFLKYYPGWVA GSW" BASE COUNT 454 a 781 c 798 g 469 t ORIGIN 1 gcgggacccg caggactgag accatggagg cggtggcggt ggccgcggcg gtgggggtcc 61 ttctcctggc cggggccggg ggcgcggcag gcgacgaggc ccgggaggcg gcggccgtgc 121 gggcgctcgt ggcccggctg ctggggccag gccccgcggc cgacttctcc gtgtcggtgg 181 agcgcgctct ggctgccaag ccgggcttgg acacctacag cctgggcggc ggcggcgcgg 241 cgcgcgtgcg ggtgcgcggc tccacgggcg tggcggccgc cgcggggctg caccgctacc 301 tgcgcgactt ctgtggctgc cacgtggcct ggtccggctc tcagctgcgc ctgccgcggc 361 cactgccagc cgtgccgggg gagctgaccg aggccacgcc caacaggtac cgctattacc 421 agaatgtgtg cacgcaaagc tactccttcg tgtggtggga ctgggcccgc tgggagcgag 481 agatagactg gatggcgctg aatggcatca acctggcact ggcctggagc ggccaggagg 541 ccatctggca gcgggtgtac ctggccttgg gcctgaccca ggcagagatc aatgagttct 601 ttactggtcc tgccttcctg gcctgggggc gaatgggcaa cctgcacacc tgggatggcc 661 ccctgccccc ctcctggcac atcaagcagc tttacctgca gcaccgggtc ctggaccaga 721 tgcgctcctt cggcatgacc ccagtgctgc ctgcattcgc ggggcatgtt cccgaggctg 781 tcaccagggt gttccctcag gtcaatgtca cgaagatggg cagttggggc cactttaact 841 gttcctactc ctgctccttc cttctggctc cggaagaccc catattcccc atcatcggga 901 gcctcttcct gcgagagctg atcaaagagt ttggcacaga ccacatctat ggggccgaca 961 ctttcaatga gatgcagcca ccttcctcag agccctccta ccttgccgca gccaccactg 1021 ccgtctatga ggccatgact gcagtggata ctgaggctgt gtggctgctc caaggctggc 1081 tcttccagca ccagccgcag ttctgggggc ccgcccagat cagggctgtg ctgggagctg 1141 tgccccgtgg ccgcctcctg gttctggacc tgtttgctga gagccagcct gtgtataccc 1201 gcactgcctc cttccagggc cagcccttca tctggtgcat gctgcacaac tttgggggaa 1261 accatggtct ttttggagcc ctagaggctg tgaacggagg cccagaagct gcccgcctct 1321 tccccaactc caccatggta ggcacgggca tggcccccga gggcatcagc cagaacgaag 1381 tggtctattc cctcatggct gagctgggct ggcgaaagga cccagtgcca gatttggcag 1441 cctgggtgac cagctttgcc gcccggcggt atggggtctc ccacccggac gcaggggcag 1501 cgtggaggct actgctccgg agtgtgtaca actgctccgg ggaggcctgc aggggccaca 1561 atcgtagccc gctggtcagg cggccgtccc tacagatgaa taccagcatc tggtacaacc 1621 gatctgatgt gtttgaggcc tggcggctgc tgctcacatc tgctccctcc ctggccacca 1681 gccccgcctt ccgctacgac ctgctggacc tcactcggca ggcagtgcag gagctggtca 1741 gcttgtacta tgaggaggca agaagcgcct acctgagcaa ggagctggcc tccctgttga 1801 gggctggagg cgtcctggcc tatgagctgc tgccggcact ggacgaggtg ctggctagtg 1861 acagccgctt cttgctgggc agctggctag agcaggcccg agcagcggca gtcagtgagg 1921 ccgaggccga tttctacgag cagaacagcc gctaccagct gaccttgtgg gggccagaag 1981 gcaacatcct ggactatgcc aacaagcagc tggcggggtt ggtggccaac tactacaccc 2041 ctcgctggcg gcttttcctg gaggcgctgg ttgacagtgt ggcccagggc atccctttcc 2101 aacagcacca gtttgacaaa aatgtcttcc aactggagca ggccttcgtt ctcagcaagc 2161 agaggtaccc cagccagccg cgaggagaca ctgtggacct ggccaagaag atcttcctca 2221 aatattaccc cggctgggtg gccggctctt ggtgatagat tcgccaccac tgggccttgt 2281 tttccgctaa ttccagggca gattccaggg cccagagctg gacagacatc acaggataac 2341 ccaggcctgg gaggaggccc cacggcctgc tggtggggtc tgacctgggg ggattggagg 2401 gaaatgacct gccctccacc accacccaaa gtgtgggatt aaagtactgt tttctttcca 2461 cttaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa //