LOCUS       BC012375                2028 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens arylsulfatase G, mRNA (cDNA clone MGC:8996
            IMAGE:3882163), complete cds.
ACCESSION   BC012375
VERSION     BC012375.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2028)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2028)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (15-AUG-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP/Gazdar
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 21 Row: o Column: 14
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 45430056.
FEATURES             Location/Qualifiers
     source          1..2028
                     /db_xref="H-InvDB:HIT000035788"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:8996 IMAGE:3882163"
                     /tissue_type="Lung, large cell carcinoma"
                     /clone_lib="NIH_MGC_68"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2028
                     /gene="ARSG"
                     /gene_synonym="KIAA1001"
                     /db_xref="GeneID:22901"
                     /db_xref="HGNC:HGNC:24102"
                     /db_xref="MIM:610008"
     CDS             40..1617
                     /gene="ARSG"
                     /gene_synonym="KIAA1001"
                     /codon_start=1
                     /product="arylsulfatase G"
                     /protein_id="AAH12375.1"
                     /db_xref="GeneID:22901"
                     /db_xref="HGNC:HGNC:24102"
                     /db_xref="MIM:610008"
                     /translation="MGWLFLKVLLAGVSFSGFLYPLVDFCISGKTRGQKPNFVIILAD
                     DMGWGDLGANWAETKDTANLDKMASEGMRFVDFHAAASTCSPSRASLLTGRLGLRNGV
                     TRNFAVTSVGGLPLNETTLAEVLQQAGYVTGIIGKWHLGHHGSYHPNFRGFDYYFGIP
                     YSHDMGCTDTPGYNHPPCPACPQGDGPSRNLQRDCYTDVALPLYENLNIVEQPVNLSS
                     LAQKYAEKATQFIQRASTSGRPFLLYVALAHMHVPLPVTQLPAAPRGRSLYGAGLWEM
                     DSLVGQIKDKVDHTVKENTFLWFTGDNGPWAQKCELAGSVGPFTGFWQTRQGGSPAKQ
                     TTWEGGHRVPALAYWPGRVPVNVTSTALLSVLDIFPTVVALAQASLPQGRRFDGVDVS
                     EVLFGRSQPGHRVLFHPNSGAAGEFGALQTVRLERYKAFYITGGARACDGSTGPELQH
                     KFPLIFNLEDDTAEAVPLERGGAEYQAVLPEVRKVLADVLQDIANDNISSADYTQDPS
                     VTPCCNPYQIACRCQAA"
BASE COUNT          472 a          532 c          542 g          482 t
ORIGIN      
        1 gccgtcgctc cagacaatcg gaatcctgcc ttcaccacca tgggctggct ttttctaaag
       61 gttttgttgg cgggagtgag tttctcagga tttctttatc ctcttgtgga tttttgcatc
      121 agtgggaaaa caagaggaca gaagccaaac tttgtgatta ttttggccga tgacatgggg
      181 tggggtgacc tgggagcaaa ctgggcagaa acaaaggaca ctgccaacct tgataagatg
      241 gcttcggagg gaatgaggtt tgtggatttc catgcagctg cctccacctg ctcaccctcc
      301 cgggcttcct tgctcaccgg ccggcttggc cttcgcaatg gagtcacacg caactttgca
      361 gtcacttctg tgggaggcct tccgctcaac gagaccacct tggcagaggt gctgcagcag
      421 gcgggttacg tcactgggat aataggcaaa tggcatcttg gacaccacgg ctcttatcac
      481 cccaacttcc gtggttttga ttactacttt ggaatcccat atagccatga tatgggctgt
      541 actgatactc caggctacaa ccaccctcct tgtccagcgt gtccacaggg tgatggacca
      601 tcaaggaacc ttcaaagaga ctgttacact gacgtggccc tccctcttta tgaaaacctc
      661 aacattgtgg agcagccggt gaacttgagc agccttgccc agaagtatgc tgagaaagca
      721 acccagttca tccagcgtgc aagcaccagc gggaggccct tcctgctcta tgtggctctg
      781 gcccacatgc acgtgccctt acctgtgact cagctaccag cagcgccacg gggcagaagc
      841 ctgtatggtg cagggctctg ggagatggac agtctggtgg gccagatcaa ggacaaagtt
      901 gaccacacag tgaaggaaaa cacattcctc tggtttacag gagacaatgg cccgtgggct
      961 cagaagtgtg agctagcggg cagtgtgggt cccttcactg gattttggca aactcgtcaa
     1021 gggggaagtc cagccaagca gacgacctgg gaaggagggc accgggtccc agcactggct
     1081 tactggcctg gcagagttcc agttaatgtc accagcactg ccttgttaag cgtgctggac
     1141 atttttccaa ctgtggtagc cctggcccag gccagcttac ctcaaggacg gcgctttgat
     1201 ggtgtggacg tctccgaggt gctctttggc cggtcacagc ctgggcacag ggtgctgttc
     1261 caccccaaca gcggggcagc tggagagttt ggagccctgc agactgtccg cctggagcgt
     1321 tacaaggcct tctacattac cggtggagcc agggcgtgtg atgggagcac ggggcctgag
     1381 ctgcagcata agtttcctct gattttcaac ctggaagacg ataccgcaga agctgtgccc
     1441 ctagaaagag gtggtgcgga gtaccaggct gtgctgcccg aggtcagaaa ggttcttgca
     1501 gacgtcctcc aagacattgc caacgacaac atctccagcg cagattacac tcaggaccct
     1561 tcagtaactc cctgctgtaa tccctaccaa attgcctgcc gctgtcaagc cgcataacag
     1621 accaattttt attccacgag gaggagtacc tggaaattag gcaagtttgc ttccaaattt
     1681 catttttacc ctctttacaa acacacgctt tagtttagtc ttggagttta gttttggagt
     1741 tagccttgca tatcccttct gtatcctgtc cctcctccac gccgacccga gagcagctga
     1801 gctgcgctgg ctctgggcag ggagtgtgcc ttaatgggaa gcacacgggc tttggagtca
     1861 ggcacaggtg ccagctccag cttttgaact tgggcaattg tttaacctaa cctgcaagtt
     1921 gattttgagg gttaaataaa ggcatacatg aaaatgcctg gcaaattacc tgacacagag
     1981 cagacattca atacatttta gtttccttgt ttcaaaaaaa aaaaaaaa
//