LOCUS BC047318 2740 bp mRNA linear HUM 04-AUG-2008
DEFINITION Homo sapiens N-sulfoglucosamine sulfohydrolase, mRNA (cDNA clone
MGC:51931 IMAGE:5226903), complete cds.
ACCESSION BC047318
VERSION BC047318.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2740)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2740)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (28-FEB-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 93 Row: k Column: 20
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31543619.
FEATURES Location/Qualifiers
source 1..2740
/db_xref="H-InvDB:HIT000053108"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:51931 IMAGE:5226903"
/tissue_type="Pancreas, Spleen, adult pooled"
/clone_lib="NIH_MGC_120"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2740
/gene="SGSH"
/gene_synonym="HSS"
/gene_synonym="MPS3A"
/gene_synonym="SFMD"
/db_xref="GeneID:6448"
/db_xref="HGNC:HGNC:10818"
/db_xref="MIM:605270"
CDS 13..1521
/gene="SGSH"
/gene_synonym="HSS"
/gene_synonym="MPS3A"
/gene_synonym="SFMD"
/codon_start=1
/product="N-sulfoglucosamine sulfohydrolase"
/protein_id="AAH47318.1"
/db_xref="GeneID:6448"
/db_xref="HGNC:HGNC:10818"
/db_xref="MIM:605270"
/translation="MSCPVPACCALLLVLGLCRARPRNALLLLADDGGFESGAYNNSA
IATPHLDALARRSLLFRNAFTSVSSCSPSRASLLTGLPQHQNGMYGLHQDVHHFNSFD
KVRSLPLLLSQAGVRTGIIGKKHVGPETVYPFDFAYTEENGSVLQVGRNITRIKLLVR
KFLQTQDDRPFFLYVAFHDPHRCGHSQPQYGTFCEKFGNGESGMGRIPDWTPQAYDPL
DVLVPYFVPNTPAARADLAAQYTTVGRMDQGVGLVLQELRDAGVLNDTLVIFTSDNGI
PFPSGRTNLYWPGTAEPLLVSSPEHPKRWGQVSEAYVSLLDLTPTILDWFSIPYPSYA
IFGSKTIHLTGRSLLPALEAEPLWATVFGSQSHHEVTMSYPMRSVQHRHFRLVHNLNF
KMPFPIDQDFYVSPTFQDLLNRTTAGQPTGWYKDLRHYYYRARWELYDRSRDPHETQN
LATDPRFAQLLEMLRDQLAKWQWETHDPWVCAPDGVLEEKLSPQCQPLHNEL"
BASE COUNT 528 a 901 c 763 g 548 t
ORIGIN
1 gccgccgccg ccatgagctg ccccgtgccc gcctgctgcg cgctgctgct agtcctgggg
61 ctctgccggg cgcgtccccg gaacgcactg ctgctcctcg cggatgacgg aggctttgag
121 agtggcgcgt acaacaacag cgccatcgcc accccgcacc tggacgcctt ggcccgccgc
181 agcctcctct ttcgcaatgc cttcacctcg gtcagcagct gctctcccag ccgcgccagc
241 ctcctcactg gcctgcccca gcatcagaat gggatgtacg ggctgcacca ggacgtgcac
301 cacttcaact ccttcgacaa ggtgcggagc ctgccgctgc tgctcagcca agctggtgtg
361 cgcacaggca tcatcgggaa gaagcacgtg gggccggaga ccgtgtaccc gtttgacttt
421 gcgtacacgg aggagaatgg ctccgtcctc caggtggggc ggaacatcac tagaattaag
481 ctgctcgtcc ggaaattcct gcagactcag gatgaccggc ctttcttcct ctacgtcgcc
541 ttccacgacc cccaccgctg tgggcactcc cagccccagt acggaacctt ctgtgagaag
601 tttggcaacg gagagagcgg catgggtcgt atcccagact ggacccccca ggcctacgac
661 ccactggacg tgctggtgcc ttacttcgtc cccaacaccc cggcagcccg agccgacctg
721 gccgctcagt acaccaccgt aggccgcatg gaccaaggag ttggactggt gctccaggag
781 ctgcgtgacg ccggtgtcct gaacgacaca ctggtgatct tcacgtccga caacgggatc
841 cccttcccca gcggcaggac caacctgtac tggccgggca ctgctgaacc cttactggtg
901 tcatccccgg agcacccaaa acgctggggc caagtcagcg aggcctacgt gagcctccta
961 gacctcacgc ccaccatctt ggattggttc tcgatcccgt accccagcta cgccatcttt
1021 ggctcgaaga ccatccacct cactggccgg tccctcctgc cggcgctgga ggccgagccc
1081 ctctgggcca ccgtctttgg cagccagagc caccacgagg tcaccatgtc ctaccccatg
1141 cgctccgtgc agcaccggca cttccgcctc gtgcacaacc tcaacttcaa gatgcccttt
1201 cccatcgacc aggacttcta cgtctcaccc accttccagg acctcctgaa ccgcactaca
1261 gctggtcagc ccacgggctg gtacaaggac ctccgtcatt actactaccg ggcgcgctgg
1321 gagctctacg accggagccg ggacccccac gagacccaga acctggccac cgacccgcgc
1381 tttgctcagc ttctggagat gcttcgggac cagctggcca agtggcagtg ggagacccac
1441 gacccctggg tgtgcgcccc cgacggcgtc ctggaggaga agctctctcc ccagtgccag
1501 cccctccaca atgagctgtg accatcccag gaggcctgtg cacacatccc aggcatgtcc
1561 cagacacatc ccacacgtgt ccgtgtggcc ggccagcctg gggagtagtg gcaacagccc
1621 ttccgtccac actcccatcc aaggagggtt cttccttcct gtggggtcac tcttgccatt
1681 gcctggaggg ggaccagagc atgtgaccag agcatgtgcc cagcccctcc accaccaggg
1741 gcactgccgt catggcaggg gacacagttg tccttgtgtc tgaaccatgt cccagcacgg
1801 gaattctaga catacgtggt ctgcggacag ggcagtgccc ccagcccagg acaagggagt
1861 cttgttttct ggcttggttt ggggacctgc aaatgggagg cctgaggccc tcttcaggct
1921 ttggcagcca cagatacttc tgaacccttc acagagagca ggcaggggct tcggtgccgc
1981 gtgggcagta cgcaggtccc accgacactc acctgggagc acggcgcctg gctcttacca
2041 gcgtctggcc tagaggaagc ctttgagcga cctttgggca ggtttctgct tcttctgttt
2101 tgccccatgg tcaagtccct gttccccagg caggtttcag ctgattggca gcaggctccc
2161 tgagtgatga gcttgaacct gtggtgtttc tgggcagaag cttatctttt ttgagagtgt
2221 ccgaagatga aggcatggcg atgcccgtcc tctggcttgg gttaattctt cggtgacact
2281 ggcattgctg ggtggtgatg cccgtcctct ggcttgggtt aattcttcgg tgacactggc
2341 gttgctgggt ggcaatgccc gtcctctggc ttgggctaat tcttcggtga cactggcgtt
2401 gctgggtggc gatgcccgtc ctctggcttg ggttaattct tggatgacgt cggcgttgct
2461 gggagaatgt gccgttcctg ccctgcctcc acccacctcg ggagcagaag cccggcctgg
2521 acacccctcg gcctggacac ccctcgaagg agagggcgct tccttgagta ggtgggctcc
2581 ccttgccctt ccctccctat cactccatac tggggtgggc tggaggaggc cacaggccag
2641 ctattgtaaa agctttttat tttagtaaaa tatacagaag ttaaaaaaaa aaaaaaaaaa
2701 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//