LOCUS       BC029051                2013 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens arylsulfatase B, mRNA (cDNA clone MGC:34518
            IMAGE:5186657), complete cds.
ACCESSION   BC029051
VERSION     BC029051.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2013)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2013)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAY-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 50 Row: g Column: 11
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 38569406.
FEATURES             Location/Qualifiers
     source          1..2013
                     /db_xref="H-InvDB:HIT000040669"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:34518 IMAGE:5186657"
                     /tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
                     /clone_lib="NIH_MGC_116"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2013
                     /gene="ARSB"
                     /gene_synonym="ASB"
                     /gene_synonym="G4S"
                     /gene_synonym="MPS6"
                     /db_xref="GeneID:411"
                     /db_xref="HGNC:HGNC:714"
                     /db_xref="MIM:253200"
     CDS             356..1597
                     /gene="ARSB"
                     /gene_synonym="ASB"
                     /gene_synonym="G4S"
                     /gene_synonym="MPS6"
                     /codon_start=1
                     /product="arylsulfatase B"
                     /protein_id="AAH29051.1"
                     /db_xref="GeneID:411"
                     /db_xref="HGNC:HGNC:714"
                     /db_xref="MIM:253200"
                     /translation="MGPRGAASLPRGPGPRRLLLPVVLPLLLLLLLAPPGSGAGASRP
                     PHLVFLLADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGR
                     YQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRR
                     GFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKR
                     AIALITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEA
                     VGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGNNWPLRGRKWSLWEGGVRGVGFVAS
                     PLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELL
                     HNIDPNFVDSSPYWPECSLLL"
BASE COUNT          505 a          558 c          536 g          414 t
ORIGIN      
        1 cttgaaagta accgcacctt ccaaagggca ccgtgcaatc agactgaaac cacggtgcaa
       61 atttaattgc cggggaagat aacgggcctt ggtgccctcc aagcgtcagc tgagtttcca
      121 agaagccggg cagcgggcgc ccgcgggttc gtctctggct cctcctccgc cacagcagcc
      181 gggggcccgg gtcggaggcg gcgggggccg agcgcccggc ctcgcaagcc cacggcccgc
      241 tgggggtgcc gtcccgcgcc ggggcggagc aggccccggc agcccagttc ctcattctat
      301 cagcggtaca aggggctggt ggcgccacag gcgctgggac cgcgggcgga caaggatggg
      361 tccgcgcggc gcggcgagct tgccccgagg ccccggacct cggcggctgc tcctccccgt
      421 cgtcctcccg ctgctgctgc tgctgttgtt ggcgccgccg ggctcgggcg ccggggccag
      481 ccggccgccc cacctggtct tcttgctggc agacgaccta ggctggaacg acgtcggctt
      541 ccacggctcc cgcatccgca cgccgcacct ggacgcgctg gcggccggcg gggtgctcct
      601 ggacaactac tacacgcagc cgctgtgcac gccgtcgcgg agccagctgc tcactggccg
      661 ctaccagatc cgtacaggtt tacagcacca aataatctgg ccctgtcagc ccagctgtgt
      721 tcctctggat gaaaaactcc tgccccagct cctaaaagaa gcaggttata ctacccatat
      781 ggtcggaaaa tggcacctgg gaatgtaccg gaaagaatgc cttccaaccc gccgaggatt
      841 tgatacctac tttggatatc tcctgggtag tgaagattat tattcccatg aacgctgtac
      901 attaattgac gctctgaatg tcacacgatg tgctcttgat tttcgagatg gcgaagaagt
      961 tgcaacagga tataaaaata tgtattcaac aaacatattc accaaaaggg ctatagccct
     1021 cataactaac catccaccag agaagcctct gtttctctac cttgctctcc agtctgtgca
     1081 tgagcccctt caggtccctg aggaatactt gaagccatat gactttatcc aagacaagaa
     1141 caggcatcac tatgcaggaa tggtgtccct tatggatgaa gcagtaggaa atgtcactgc
     1201 agctttaaaa agcagtgggc tctggaacaa cacggtgttc atcttttcta cagataacgg
     1261 agggcagact ttggcagggg gtaataactg gccccttcga ggaagaaaat ggagcctgtg
     1321 ggaaggaggc gtccgagggg tgggctttgt ggcaagcccc ttgctgaagc agaagggcgt
     1381 gaagaaccgg gagctcatcc acatctctga ctggctgcca acactcgtga agctggccag
     1441 gggacacacc aatggcacaa agcctctgga tggcttcgac gtgtggaaaa ccatcagtga
     1501 aggaagccca tcccccagaa ttgagctgct gcataatatt gacccgaact tcgtggactc
     1561 ttcaccgtac tggcctgagt gctcgctgct gttgtagcta ccaccaactt tctactgaag
     1621 atgataaccc agggcataag aaatgacttc agacccaagg ttctgaaagg gcccctcaag
     1681 gcctcgggtg gctcctgcag aagtggcaga agaggcggga actaggaacc tggcatcata
     1741 ggaaaagtgc cttctccaag aaagaagggg ccccaagagg ctgtcttact tagatcaacc
     1801 ataaactacc acagatgggt catttcttat actatttcaa aatatctttg aagatgagaa
     1861 ttcatttgtg tccttcatag accaaagttc tttgtgttac cttttcccaa aagtaaattc
     1921 ctttcccttt attcattcct tgtggaaata aaatgcaagc cctttaaaaa aaaaaaaaaa
     1981 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa
//