LOCUS       BC028235                2453 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens carbohydrate (keratan sulfate Gal-6) sulfotransferase
            1, mRNA (cDNA clone MGC:40025 IMAGE:5245338), complete cds.
ACCESSION   BC028235
VERSION     BC028235.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2453)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2453)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (08-APR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 62 Row: n Column: 8
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 31542307.
FEATURES             Location/Qualifiers
     source          1..2453
                     /db_xref="H-InvDB:HIT000040504"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:40025 IMAGE:5245338"
                     /tissue_type="Brain, fetal, whole pooled"
                     /clone_lib="NIH_MGC_121"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2453
                     /gene="CHST1"
                     /gene_synonym="C6ST"
                     /gene_synonym="KS6ST"
                     /gene_synonym="KSGal6ST"
                     /db_xref="GeneID:8534"
                     /db_xref="HGNC:HGNC:1969"
                     /db_xref="MIM:603797"
     CDS             340..1575
                     /gene="CHST1"
                     /gene_synonym="C6ST"
                     /gene_synonym="KS6ST"
                     /gene_synonym="KSGal6ST"
                     /codon_start=1
                     /product="carbohydrate (keratan sulfate Gal-6)
                     sulfotransferase 1"
                     /protein_id="AAH28235.1"
                     /db_xref="GeneID:8534"
                     /db_xref="HGNC:HGNC:1969"
                     /db_xref="MIM:603797"
                     /translation="MQCSWKAVLLLALASIAIQYTAIRTFTAKSFHTCPGLAEAGLAE
                     RLCEESPTFAYNLSRKTHILILATTRSGSSFVGQLFNQHLDVFYLFEPLYHVQNTLIP
                     RFTQGKSPADRRVMLGASRDLLRSLYDCDLYFLENYIKPPPVNHTTDRIFRRGASRVL
                     CSRPVCDPPGPADLVLEEGDCVRKCGLLNLTVAAEACRERSHVAIKTVRVPEVNDLRA
                     LVEDPRLNLKVIQLVRDPRGILASRSETFRDTYRLWRLWYGTGRKPYNLDVTQLTTVC
                     EDFSNSVSTGLMRPPWLKGKYMLVRYEDLARNPMKKTEEIYGFLGIPLDSHVARWIQN
                     NTRGDPTLGKHKYGTVRNSAATAEKWRFRLSYDIVAFAQNACQQVLAQLGYKIAASEE
                     ELKNPSVSLVEERDFRPFS"
BASE COUNT          500 a          799 c          670 g          484 t
ORIGIN      
        1 gccgcggctg ctgccgccgc cgccacccgc gggtccccgg cgaccctgac tccagacccg
       61 aggatggagc cggcgctggg cgctgcagct gctcccggcg cgtccccgac caggtagctg
      121 gtgtcacttc ggtgtggttg gaagaagact ttctccccag ctgcattccc ggaggcgccc
      181 tttcgacctg gaggccgggt ctgctggcca cagggctgcc gcactggctg ggactgccag
      241 ctgggcctgg agacgctggt ggctgtggac tccccagctt ggagcagtcc ctctttgacc
      301 tcaccccttg gagaagcagc cccatgaagg tgcccagcca tgcaatgttc ctggaaggcc
      361 gtcctcctcc ttgccctggc ctccattgcc atccagtaca cggccatccg caccttcacc
      421 gccaagtcct ttcacacctg ccccgggctg gcagaggccg ggctggccga gcgactgtgc
      481 gaggagagcc ccaccttcgc ctacaacctc tcccgcaaga cccacatcct catcctggcc
      541 accacgcgca gcggctcctc cttcgtgggc cagctcttca accagcacct ggacgtcttc
      601 tacctgtttg agcccctcta ccacgtccag aacacgctca tcccccgctt cacccagggc
      661 aagagcccgg ccgaccggcg ggtcatgcta ggcgccagcc gcgacctcct gcggagcctc
      721 tacgactgcg acctctactt cctggagaac tacatcaagc cgccgccggt caaccacacc
      781 accgacagga tcttccgccg cggggccagc cgggtcctct gctcccggcc tgtgtgcgac
      841 cctccggggc cagccgacct ggtcctggag gagggggact gtgtgcgcaa gtgcgggcta
      901 ctcaacctga ccgtggcggc cgaggcgtgc cgcgagcgca gccacgtggc catcaagacg
      961 gtgcgcgtgc ccgaggtgaa cgacctgcgc gccctggtgg aagacccgcg attaaacctc
     1021 aaggtcatcc agctggtccg agacccccgc ggcattctgg cttcgcgcag cgagaccttc
     1081 cgcgacacgt accggctctg gcggctctgg tacggcaccg ggaggaaacc ctacaacctg
     1141 gacgtgacgc agctgaccac ggtgtgcgag gacttctcca actccgtgtc caccggcctc
     1201 atgcggcccc cgtggctcaa gggcaagtac atgttggtgc gctacgagga cctggctcgg
     1261 aaccctatga agaagaccga ggagatctac gggttcctgg gcatcccgct ggacagccac
     1321 gtggcccgct ggatccagaa caacacgcgg ggcgacccca ccctgggcaa gcacaaatac
     1381 ggcaccgtgc gaaactcggc ggccacggcc gagaagtggc gcttccgcct ctcctacgac
     1441 atcgtggcct ttgcccagaa cgcctgccag caggtgctgg cccagctggg ctacaagatc
     1501 gccgcctcgg aggaggagct gaagaacccc tcggtcagcc tggtggagga gcgggacttc
     1561 cgccccttct cgtgacccgg gcggtgcggg tgggggcggg aggcgcaagg tgtcggtttt
     1621 gataaaatgg accgttttta actgttgcct tattaacccc tccctctccc acctcatctt
     1681 cgtgtccttc ctgcccccag ctcaccccac tcccttctgc cccttttttg tctctgaaat
     1741 ttgcactacg tcttggacgg gaatcactgg ggcagagggc gcctgaagta gggtcccgcc
     1801 ccccccaccc cattcagaca catggatgtt gggtctctgt gcggacggtg acaatgttta
     1861 caagcaccac atttacacat ccacacacgc acacgggcac tcgcgaggcg acttctcaag
     1921 cttttgaatg ggtgagtggt cgggtatcta gtttttgcac tgtcttacta ttcaaggtaa
     1981 gaggatacaa acaagaggac cacttgtctc taatttatga atggtgtcca tcctttcccc
     2041 atccctgcct cctgcccctg acgcccattt ccccccttag agcagcgaaa ctgccccctc
     2101 ctgcccgccc ttgcctgtcg gtgaggcagg tttttactgt gaggtgaacg tggacctgtt
     2161 tctgtttcca gtctgtggtg atgctgtctg tctgtctgag tctcgtggcc gcccctggac
     2221 cagtgatgac tgatgaatct tatgagcctc tgattgatct cggggtccat ctgtgatatt
     2281 tctttgtgcc aaaaagaaaa aaaaagagtg gatcagtttg ctaaatgaac attgaaattg
     2341 aaatgcttta tctgtgtttt ctgtaaataa aagagtgcaa taatcaaaaa aaaaaaaaaa
     2401 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa
//