LOCUS BC047318 2740 bp mRNA linear HUM 04-AUG-2008 DEFINITION Homo sapiens N-sulfoglucosamine sulfohydrolase, mRNA (cDNA clone MGC:51931 IMAGE:5226903), complete cds. ACCESSION BC047318 VERSION BC047318.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2740) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2740) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (28-FEB-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 93 Row: k Column: 20 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31543619. FEATURES Location/Qualifiers source 1..2740 /db_xref="H-InvDB:HIT000053108" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:51931 IMAGE:5226903" /tissue_type="Pancreas, Spleen, adult pooled" /clone_lib="NIH_MGC_120" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2740 /gene="SGSH" /gene_synonym="HSS" /gene_synonym="MPS3A" /gene_synonym="SFMD" /db_xref="GeneID:6448" /db_xref="HGNC:HGNC:10818" /db_xref="MIM:605270" CDS 13..1521 /gene="SGSH" /gene_synonym="HSS" /gene_synonym="MPS3A" /gene_synonym="SFMD" /codon_start=1 /product="N-sulfoglucosamine sulfohydrolase" /protein_id="AAH47318.1" /db_xref="GeneID:6448" /db_xref="HGNC:HGNC:10818" /db_xref="MIM:605270" /translation="MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSA IATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTGLPQHQNGMYGLHQDVHHFNSFD KVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGRNITRIKLLVR KFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGMGRIPDWTPQAYDPL DVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLVIFTSDNGI PFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYA IFGSKTIHLTGRSLLPALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNF KMPFPIDQDFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHNEL" BASE COUNT 528 a 901 c 763 g 548 t ORIGIN 1 gccgccgccg ccatgagctg ccccgtgccc gcctgctgcg cgctgctgct agtcctgggg 61 ctctgccggg cgcgtccccg gaacgcactg ctgctcctcg cggatgacgg aggctttgag 121 agtggcgcgt acaacaacag cgccatcgcc accccgcacc tggacgcctt ggcccgccgc 181 agcctcctct ttcgcaatgc cttcacctcg gtcagcagct gctctcccag ccgcgccagc 241 ctcctcactg gcctgcccca gcatcagaat gggatgtacg ggctgcacca ggacgtgcac 301 cacttcaact ccttcgacaa ggtgcggagc ctgccgctgc tgctcagcca agctggtgtg 361 cgcacaggca tcatcgggaa gaagcacgtg gggccggaga ccgtgtaccc gtttgacttt 421 gcgtacacgg aggagaatgg ctccgtcctc caggtggggc ggaacatcac tagaattaag 481 ctgctcgtcc ggaaattcct gcagactcag gatgaccggc ctttcttcct ctacgtcgcc 541 ttccacgacc cccaccgctg tgggcactcc cagccccagt acggaacctt ctgtgagaag 601 tttggcaacg gagagagcgg catgggtcgt atcccagact ggacccccca ggcctacgac 661 ccactggacg tgctggtgcc ttacttcgtc cccaacaccc cggcagcccg agccgacctg 721 gccgctcagt acaccaccgt aggccgcatg gaccaaggag ttggactggt gctccaggag 781 ctgcgtgacg ccggtgtcct gaacgacaca ctggtgatct tcacgtccga caacgggatc 841 cccttcccca gcggcaggac caacctgtac tggccgggca ctgctgaacc cttactggtg 901 tcatccccgg agcacccaaa acgctggggc caagtcagcg aggcctacgt gagcctccta 961 gacctcacgc ccaccatctt ggattggttc tcgatcccgt accccagcta cgccatcttt 1021 ggctcgaaga ccatccacct cactggccgg tccctcctgc cggcgctgga ggccgagccc 1081 ctctgggcca ccgtctttgg cagccagagc caccacgagg tcaccatgtc ctaccccatg 1141 cgctccgtgc agcaccggca cttccgcctc gtgcacaacc tcaacttcaa gatgcccttt 1201 cccatcgacc aggacttcta cgtctcaccc accttccagg acctcctgaa ccgcactaca 1261 gctggtcagc ccacgggctg gtacaaggac ctccgtcatt actactaccg ggcgcgctgg 1321 gagctctacg accggagccg ggacccccac gagacccaga acctggccac cgacccgcgc 1381 tttgctcagc ttctggagat gcttcgggac cagctggcca agtggcagtg ggagacccac 1441 gacccctggg tgtgcgcccc cgacggcgtc ctggaggaga agctctctcc ccagtgccag 1501 cccctccaca atgagctgtg accatcccag gaggcctgtg cacacatccc aggcatgtcc 1561 cagacacatc ccacacgtgt ccgtgtggcc ggccagcctg gggagtagtg gcaacagccc 1621 ttccgtccac actcccatcc aaggagggtt cttccttcct gtggggtcac tcttgccatt 1681 gcctggaggg ggaccagagc atgtgaccag agcatgtgcc cagcccctcc accaccaggg 1741 gcactgccgt catggcaggg gacacagttg tccttgtgtc tgaaccatgt cccagcacgg 1801 gaattctaga catacgtggt ctgcggacag ggcagtgccc ccagcccagg acaagggagt 1861 cttgttttct ggcttggttt ggggacctgc aaatgggagg cctgaggccc tcttcaggct 1921 ttggcagcca cagatacttc tgaacccttc acagagagca ggcaggggct tcggtgccgc 1981 gtgggcagta cgcaggtccc accgacactc acctgggagc acggcgcctg gctcttacca 2041 gcgtctggcc tagaggaagc ctttgagcga cctttgggca ggtttctgct tcttctgttt 2101 tgccccatgg tcaagtccct gttccccagg caggtttcag ctgattggca gcaggctccc 2161 tgagtgatga gcttgaacct gtggtgtttc tgggcagaag cttatctttt ttgagagtgt 2221 ccgaagatga aggcatggcg atgcccgtcc tctggcttgg gttaattctt cggtgacact 2281 ggcattgctg ggtggtgatg cccgtcctct ggcttgggtt aattcttcgg tgacactggc 2341 gttgctgggt ggcaatgccc gtcctctggc ttgggctaat tcttcggtga cactggcgtt 2401 gctgggtggc gatgcccgtc ctctggcttg ggttaattct tggatgacgt cggcgttgct 2461 gggagaatgt gccgttcctg ccctgcctcc acccacctcg ggagcagaag cccggcctgg 2521 acacccctcg gcctggacac ccctcgaagg agagggcgct tccttgagta ggtgggctcc 2581 ccttgccctt ccctccctat cactccatac tggggtgggc tggaggaggc cacaggccag 2641 ctattgtaaa agctttttat tttagtaaaa tatacagaag ttaaaaaaaa aaaaaaaaaa 2701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa //