LOCUS       BC018723                2496 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens carbohydrate (N-acetylgalactosamine 4-0)
            sulfotransferase 8, mRNA (cDNA clone MGC:31814 IMAGE:4137487),
            complete cds.
ACCESSION   BC018723
VERSION     BC018723.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2496)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2496)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (07-DEC-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald,  Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 40 Row: c Column: 19
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 21361615.
FEATURES             Location/Qualifiers
     source          1..2496
                     /db_xref="H-InvDB:HIT000038422"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:31814 IMAGE:4137487"
                     /tissue_type="Muscle, rhabdomyosarcoma"
                     /clone_lib="NIH_MGC_17"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2496
                     /gene="CHST8"
                     /db_xref="GeneID:64377"
                     /db_xref="HGNC:HGNC:15993"
                     /db_xref="MIM:610190"
     CDS             759..2033
                     /gene="CHST8"
                     /codon_start=1
                     /product="carbohydrate (N-acetylgalactosamine 4-0)
                     sulfotransferase 8"
                     /protein_id="AAH18723.1"
                     /db_xref="GeneID:64377"
                     /db_xref="HGNC:HGNC:15993"
                     /db_xref="MIM:610190"
                     /translation="MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPG
                     IKFNIRPRQPHHDLPPGGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRG
                     TRLRLRQRRRRLLIKKMPAAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVM
                     QEACAKYRASSSRRAVTPRHVSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASS
                     TADIQHNTVHYGSALKRLDTFDRQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPN
                     SYYHPVFGKAILARYRANASREALRTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRL
                     CSPCLIDYDFVGKFESMEDDANFFLSLIRAPRNLTFPRFKDRHSQEARTTARIAHQYF
                     AQLSALQRQRTYDFYYMDYLMFNYSKPFADLY"
BASE COUNT          483 a          830 c          759 g          424 t
ORIGIN      
        1 gggactcgtc ccagagtttg ctgccgccgc cgccgctgcc attagagcgg ttttaattag
       61 ctgcaggatt tttaatgagc tgaaatcaag aagagcccca tctcagaagt aagcgagagc
      121 cgcgcgcagg gaagaactag tctgacaaaa cagaaggagg tggtgctctc gggggactgg
      181 gctgtaggct cccaccgatc gatcggcgac gagcagatga gttcccgagg ccaggaggct
      241 gaggagggaa gccagggctc agagaccagc gcccggtgca gagtccccgc tgctgccacc
      301 gcggcagaag gcacggtgcc ggattcaagg tgacggcagt gcctggctcc attctcctca
      361 agttgccata gcagcctctc tcccaacaag cagcttctcc gggagccctc agccgggagg
      421 gacgtcattg ttctctcctg cccgtgtggt gcaaggagac ccaccggaca gagaactttc
      481 ccgagcccgg ggtccgctga ccacgtgggt gcacccatgc tgcctggagc caggcacagg
      541 gacaggtccc cacggccaca gggaatcctc tgggcagctg aggggagcgt ccaggcccag
      601 aagcagctgc agccaagcgt gtccttggag ccgtccatgc gtctgtccgc ctgctgccgg
      661 tcgccactgg agatctcggc ctgatggacg cctggtgtgg acgatgaggg aagaacgtgc
      721 cccccacacc caagaggtga cccctgagcc agccccggat gaccctgcga cctggaacaa
      781 tgcggctggc ctgcatgttc tcttccatcc tgctgttcgg agctgcaggc ctcctcctct
      841 tcatcagcct gcaggaccct acggagctcg ccccccagca ggtgccagga ataaagttca
      901 acatcaggcc aaggcagccc caccacgacc tcccaccagg cggctcccag gatggtgact
      961 tgaaggaacc cacagagagg gtcactcggg acttatccag tggggccccg aggggccgca
     1021 acctgccagc gcctgaccag cctcaacccc cgctgcagag gggaacccgt ctgcggctcc
     1081 gccagcgccg tcgccgtctg ctcatcaaga aaatgccagc tgcggcgacc atcccggcca
     1141 acagctcgga cgcgcccttc atccggccgg gacccgggac gctggatggc cgctgggtca
     1201 gcctgcaccg gagccagcag gagcgcaagc gggtgatgca ggaggcctgc gccaagtacc
     1261 gggcgagcag cagccgccgg gccgtcacgc cccgccacgt gtcccgtatc ttcgtggagg
     1321 accgccaccg cgtgctctac tgcgaggtgc ccaaggccgg ctgctccaat tggaagcggg
     1381 tgctcatggt gctggccggc ctggcctcgt ccactgccga catccagcac aacaccgtcc
     1441 actatggcag cgctctcaag cgcctggaca ccttcgaccg ccagggtatc ttgcaccgtc
     1501 tcagcaccta caccaagatg ctctttgtcc gcgagccctt cgagaggctg gtgtccgcct
     1561 tccgcgacaa gtttgagcac cccaacagct actatcaccc ggtcttcggc aaggccatcc
     1621 tggcccggta ccgcgccaat gcctctcggg aggccctgcg gaccggctct ggggtgcgtt
     1681 ttcccgagtt cgtccagtac ctgctggacg tgcaccggcc cgtggggatg gacattcact
     1741 gggaccatgt cagccggctc tgcagcccct gcctcatcga ctacgatttc gtaggcaagt
     1801 tcgagagcat ggaggacgat gccaacttct tcctgagcct catccgcgcg ccgcggaacc
     1861 tgaccttccc ccggttcaag gaccggcact cgcaggaggc gcggaccaca gcgaggatcg
     1921 cccaccagta cttcgcccaa ctctcggccc tgcaaaggca gcgcacctac gacttctact
     1981 acatggatta cctgatgttc aactattcca agccctttgc agatctgtac tgaggggcgc
     2041 cgcagctggc cggggccgcc ctgccccggt cactcacctg tgctcccggg catcctcctg
     2101 tccctggctc ctcatcctgg gagcaacagg gctctgagga cgtgaggagc catcgctgtg
     2161 ggaggcagca ggccccgggt ggggggcaga ggcgcccagc cttggatggg gaccccagcc
     2221 cctggcctgt acctgtttcc tcattccttg gctgagggag aggctgagaa ctgggcagac
     2281 acccctggag ctcagccgac agttttgatg agcagggaag tctgaggccc agaggacggg
     2341 gggcccagcg gtaagggatg tcccgcactc ccttagccat tgccttggac caaaccacgt
     2401 ggtttgcagc ttttctacga gccagggggg aggttccctt ggattaaggt tccaaataaa
     2461 gcacatggtt tccagagcaa aaaaaaaaaa aaaaaa
//