LOCUS       BC047318                2740 bp    mRNA    linear   HUM 04-AUG-2008
DEFINITION  Homo sapiens N-sulfoglucosamine sulfohydrolase, mRNA (cDNA clone
            MGC:51931 IMAGE:5226903), complete cds.
ACCESSION   BC047318
VERSION     BC047318.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2740)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2740)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 93 Row: k Column: 20
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 31543619.
FEATURES             Location/Qualifiers
     source          1..2740
                     /db_xref="H-InvDB:HIT000053108"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:51931 IMAGE:5226903"
                     /tissue_type="Pancreas, Spleen, adult pooled"
                     /clone_lib="NIH_MGC_120"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2740
                     /gene="SGSH"
                     /gene_synonym="HSS"
                     /gene_synonym="MPS3A"
                     /gene_synonym="SFMD"
                     /db_xref="GeneID:6448"
                     /db_xref="HGNC:HGNC:10818"
                     /db_xref="MIM:605270"
     CDS             13..1521
                     /gene="SGSH"
                     /gene_synonym="HSS"
                     /gene_synonym="MPS3A"
                     /gene_synonym="SFMD"
                     /codon_start=1
                     /product="N-sulfoglucosamine sulfohydrolase"
                     /protein_id="AAH47318.1"
                     /db_xref="GeneID:6448"
                     /db_xref="HGNC:HGNC:10818"
                     /db_xref="MIM:605270"
                     /translation="MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSA
                     IATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTGLPQHQNGMYGLHQDVHHFNSFD
                     KVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGRNITRIKLLVR
                     KFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGMGRIPDWTPQAYDPL
                     DVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLVIFTSDNGI
                     PFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYA
                     IFGSKTIHLTGRSLLPALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNF
                     KMPFPIDQDFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN
                     LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHNEL"
BASE COUNT          528 a          901 c          763 g          548 t
ORIGIN      
        1 gccgccgccg ccatgagctg ccccgtgccc gcctgctgcg cgctgctgct agtcctgggg
       61 ctctgccggg cgcgtccccg gaacgcactg ctgctcctcg cggatgacgg aggctttgag
      121 agtggcgcgt acaacaacag cgccatcgcc accccgcacc tggacgcctt ggcccgccgc
      181 agcctcctct ttcgcaatgc cttcacctcg gtcagcagct gctctcccag ccgcgccagc
      241 ctcctcactg gcctgcccca gcatcagaat gggatgtacg ggctgcacca ggacgtgcac
      301 cacttcaact ccttcgacaa ggtgcggagc ctgccgctgc tgctcagcca agctggtgtg
      361 cgcacaggca tcatcgggaa gaagcacgtg gggccggaga ccgtgtaccc gtttgacttt
      421 gcgtacacgg aggagaatgg ctccgtcctc caggtggggc ggaacatcac tagaattaag
      481 ctgctcgtcc ggaaattcct gcagactcag gatgaccggc ctttcttcct ctacgtcgcc
      541 ttccacgacc cccaccgctg tgggcactcc cagccccagt acggaacctt ctgtgagaag
      601 tttggcaacg gagagagcgg catgggtcgt atcccagact ggacccccca ggcctacgac
      661 ccactggacg tgctggtgcc ttacttcgtc cccaacaccc cggcagcccg agccgacctg
      721 gccgctcagt acaccaccgt aggccgcatg gaccaaggag ttggactggt gctccaggag
      781 ctgcgtgacg ccggtgtcct gaacgacaca ctggtgatct tcacgtccga caacgggatc
      841 cccttcccca gcggcaggac caacctgtac tggccgggca ctgctgaacc cttactggtg
      901 tcatccccgg agcacccaaa acgctggggc caagtcagcg aggcctacgt gagcctccta
      961 gacctcacgc ccaccatctt ggattggttc tcgatcccgt accccagcta cgccatcttt
     1021 ggctcgaaga ccatccacct cactggccgg tccctcctgc cggcgctgga ggccgagccc
     1081 ctctgggcca ccgtctttgg cagccagagc caccacgagg tcaccatgtc ctaccccatg
     1141 cgctccgtgc agcaccggca cttccgcctc gtgcacaacc tcaacttcaa gatgcccttt
     1201 cccatcgacc aggacttcta cgtctcaccc accttccagg acctcctgaa ccgcactaca
     1261 gctggtcagc ccacgggctg gtacaaggac ctccgtcatt actactaccg ggcgcgctgg
     1321 gagctctacg accggagccg ggacccccac gagacccaga acctggccac cgacccgcgc
     1381 tttgctcagc ttctggagat gcttcgggac cagctggcca agtggcagtg ggagacccac
     1441 gacccctggg tgtgcgcccc cgacggcgtc ctggaggaga agctctctcc ccagtgccag
     1501 cccctccaca atgagctgtg accatcccag gaggcctgtg cacacatccc aggcatgtcc
     1561 cagacacatc ccacacgtgt ccgtgtggcc ggccagcctg gggagtagtg gcaacagccc
     1621 ttccgtccac actcccatcc aaggagggtt cttccttcct gtggggtcac tcttgccatt
     1681 gcctggaggg ggaccagagc atgtgaccag agcatgtgcc cagcccctcc accaccaggg
     1741 gcactgccgt catggcaggg gacacagttg tccttgtgtc tgaaccatgt cccagcacgg
     1801 gaattctaga catacgtggt ctgcggacag ggcagtgccc ccagcccagg acaagggagt
     1861 cttgttttct ggcttggttt ggggacctgc aaatgggagg cctgaggccc tcttcaggct
     1921 ttggcagcca cagatacttc tgaacccttc acagagagca ggcaggggct tcggtgccgc
     1981 gtgggcagta cgcaggtccc accgacactc acctgggagc acggcgcctg gctcttacca
     2041 gcgtctggcc tagaggaagc ctttgagcga cctttgggca ggtttctgct tcttctgttt
     2101 tgccccatgg tcaagtccct gttccccagg caggtttcag ctgattggca gcaggctccc
     2161 tgagtgatga gcttgaacct gtggtgtttc tgggcagaag cttatctttt ttgagagtgt
     2221 ccgaagatga aggcatggcg atgcccgtcc tctggcttgg gttaattctt cggtgacact
     2281 ggcattgctg ggtggtgatg cccgtcctct ggcttgggtt aattcttcgg tgacactggc
     2341 gttgctgggt ggcaatgccc gtcctctggc ttgggctaat tcttcggtga cactggcgtt
     2401 gctgggtggc gatgcccgtc ctctggcttg ggttaattct tggatgacgt cggcgttgct
     2461 gggagaatgt gccgttcctg ccctgcctcc acccacctcg ggagcagaag cccggcctgg
     2521 acacccctcg gcctggacac ccctcgaagg agagggcgct tccttgagta ggtgggctcc
     2581 ccttgccctt ccctccctat cactccatac tggggtgggc tggaggaggc cacaggccag
     2641 ctattgtaaa agctttttat tttagtaaaa tatacagaag ttaaaaaaaa aaaaaaaaaa
     2701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//