LOCUS       BC014649                1745 bp    mRNA    linear   HUM 30-MAR-2004
DEFINITION  Homo sapiens galactose-3-O-sulfotransferase 1, mRNA (cDNA clone
            MGC:20064 IMAGE:4564703), complete cds.
ACCESSION   BC014649
VERSION     BC014649.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1745)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1745)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (24-SEP-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steve Jones, Sarah Barber, Mabel Brown-John, Yaron Butterfield,
            Andy Chan, Steve S. Chand, William Chow, Alison Cloutier, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
            Kim MacDonald, Amara Masson, Mike R. Mayo, Josh Moran, Ryan Morin,
            Teika Olson, Diana Palmquist, Anca Petrescu, Anna Liisa Prahbu,
            Parvaneh Saeedi, JR Santos, Angelique Schnerch, Ursula Skalska,
            Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie
            Schein, Asim Siddiqui, Rob Holt, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 29 Row: h Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4758087.
FEATURES             Location/Qualifiers
     source          1..1745
                     /db_xref="H-InvDB:HIT000036830"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:20064 IMAGE:4564703"
                     /tissue_type="Kidney, renal cell adenocarcinoma"
                     /clone_lib="NIH_MGC_14"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..1745
                     /gene="GAL3ST1"
                     /gene_synonym="CST"
                     /db_xref="GeneID:9514"
                     /db_xref="MIM:602300"
     CDS             136..1407
                     /gene="GAL3ST1"
                     /gene_synonym="CST"
                     /codon_start=1
                     /product="GAL3ST1 protein"
                     /protein_id="AAH14649.1"
                     /db_xref="GeneID:9514"
                     /db_xref="MIM:602300"
                     /translation="MLPPQKKPWESMAKGLVLGALFTSFLLLMYSYAVPPLHAGLAST
                     TPEAAASCSPPALEPEAVIRANGSAGECQPRRNIVFLKTHKTASSTLLNILFRFGQKH
                     RLKFAFPNGRNDFDYPTFFARSLVQDYRPGACFNIICNHMRFHYDEVRGLVPTNAIFI
                     TVLRDPARLFESSFHYFGPVVPLTWKLSAGDKLTEFLQDPDRYYDPNGFNAHYLRNLL
                     FFDLGYDNSLDPSSPQVQEHILEVERRFHLVLLQEYFDESLVLLKDLLCWELEDVLYF
                     KLNARRDSPVPRLSGELYGRATAWNMLDSHLYRHFNASFWRKVEAFGRERMAREVAAL
                     RHANERMRTICIDGGHAVDAAAIQDEAMQPWQPLGTKSILGYNLKKSIGQRHAQLCRR
                     MLTPEIQYLMDLGANLWVTKLWKFIRDFLRW"
     misc_feature    136..1359
                     /gene="GAL3ST1"
                     /gene_synonym="CST"
                     /note="Gal-3-0_sulfotr; Region:
                     Galactose-3-O-sulfotransferase. This family consists of
                     several mammalian galactose-3-O-sulfotransferase proteins.
                     Gal-3-O-sulfotransferase is thought to play a critical
                     role in 3'-sulfation of N-acetyllactosamine in both O- and
                     N-glycans"
                     /db_xref="CDD:pfam06990"
BASE COUNT          344 a          595 c          491 g          315 t
ORIGIN      
        1 tccagaaggt accttgggtc tcactgtgtc atccagagct ggagtgcagc ggcacagtca
       61 tggctcactg gaactcaggc tcaagcaatc ctcccgcctc agccttccaa gtaactagga
      121 ctacaggtgt ctgagatgct gccaccgcag aagaagccct gggagtccat ggctaagggg
      181 ctggtgctgg gcgcgctctt cactagtttc ctgctgctga tgtactccta tgccgtgccc
      241 ccgctgcatg ccggcctggc ctccacgacc ccggaggccg cagcgtcctg ctctccacct
      301 gcactcgagc cagaggcagt gatccgggcc aacggctcgg cgggggagtg ccagccgcgg
      361 cgcaacatcg tgttcttgaa gacgcacaag acggccagca gcaccctgct caacatcctg
      421 ttccgcttcg gccagaagca ccggctcaag ttcgccttcc ctaacggccg caatgacttc
      481 gactacccga ccttcttcgc ccgcagcctg gtgcaggact atcggcccgg ggcctgcttc
      541 aacatcatct gcaaccacat gcgcttccac tacgacgagg tgcgcggcct ggtgccgacc
      601 aacgccatct tcatcacggt gctccgcgac cccgcccgct tgttcgagtc ctccttccac
      661 tacttcgggc cggtggtgcc cctcacgtgg aagctctcgg ccggcgacaa gctgaccgag
      721 ttcctgcaag acccggatcg ctactacgac cccaacggct tcaatgccca ctacctccga
      781 aacctgctct tcttcgacct gggctatgac aacagcctgg accccagcag cccgcaggtg
      841 caggagcaca tcctggaggt ggagcgtcgc ttccacctgg tgctccttca agagtacttc
      901 gacgagtcgc tggtgctgct gaaggacctg ctgtgctggg agctggagga cgtgctctac
      961 ttcaagctca acgcccgccg cgactcgccc gtgccgcggc tctcggggga gctgtatggg
     1021 cgcgccaccg cctggaacat gctggactcc cacctctacc gccacttcaa cgccagcttc
     1081 tggcgcaagg tggaggcctt cgggcgggag cgcatggccc gcgaggtggc cgccctgcgc
     1141 catgccaacg agcgcatgcg gaccatctgc atcgacgggg gccacgccgt ggacgccgcc
     1201 gccatccagg acgaggccat gcagccctgg cagccgctgg gcaccaagtc catcctgggc
     1261 tacaacctca agaagagcat cgggcagcgg cacgcgcagc tctgccggcg catgctcacg
     1321 cccgagatcc agtacctgat ggacctcggc gccaacctgt gggtcaccaa gctctggaag
     1381 ttcattcgcg atttcctgcg gtggtgacgt cccaccgccc agcggcttgc ctgcctgctc
     1441 gctccctgca gaggggctga gcaggacgcc gctggtgctg gccgccccca gccccctcct
     1501 ggtgccacct cagaccccgg ggtgaggggg ggctccctgg ggggaggcag ccagccaaga
     1561 ctgggcccat gaacacagag agggcctaac cgagatcagt atttaactaa ttataccagt
     1621 ttttattaaa cccctttccc tccccgataa agaatgttct atttctgcct ccccttaaag
     1681 gggagacctc agaagtaaag gaatttgatg ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1741 aaaaa
//