LOCUS BC014649 1745 bp mRNA linear HUM 30-MAR-2004
DEFINITION Homo sapiens galactose-3-O-sulfotransferase 1, mRNA (cDNA clone
MGC:20064 IMAGE:4564703), complete cds.
ACCESSION BC014649
VERSION BC014649.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1745)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1745)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (24-SEP-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: DCTD/DTP
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Steve Jones, Sarah Barber, Mabel Brown-John, Yaron Butterfield,
Andy Chan, Steve S. Chand, William Chow, Alison Cloutier, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Amara Masson, Mike R. Mayo, Josh Moran, Ryan Morin,
Teika Olson, Diana Palmquist, Anca Petrescu, Anna Liisa Prahbu,
Parvaneh Saeedi, JR Santos, Angelique Schnerch, Ursula Skalska,
Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie
Schein, Asim Siddiqui, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 29 Row: h Column: 22
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4758087.
FEATURES Location/Qualifiers
source 1..1745
/db_xref="H-InvDB:HIT000036830"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:20064 IMAGE:4564703"
/tissue_type="Kidney, renal cell adenocarcinoma"
/clone_lib="NIH_MGC_14"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..1745
/gene="GAL3ST1"
/gene_synonym="CST"
/db_xref="GeneID:9514"
/db_xref="MIM:602300"
CDS 136..1407
/gene="GAL3ST1"
/gene_synonym="CST"
/codon_start=1
/product="GAL3ST1 protein"
/protein_id="AAH14649.1"
/db_xref="GeneID:9514"
/db_xref="MIM:602300"
/translation="MLPPQKKPWESMAKGLVLGALFTSFLLLMYSYAVPPLHAGLAST
TPEAAASCSPPALEPEAVIRANGSAGECQPRRNIVFLKTHKTASSTLLNILFRFGQKH
RLKFAFPNGRNDFDYPTFFARSLVQDYRPGACFNIICNHMRFHYDEVRGLVPTNAIFI
TVLRDPARLFESSFHYFGPVVPLTWKLSAGDKLTEFLQDPDRYYDPNGFNAHYLRNLL
FFDLGYDNSLDPSSPQVQEHILEVERRFHLVLLQEYFDESLVLLKDLLCWELEDVLYF
KLNARRDSPVPRLSGELYGRATAWNMLDSHLYRHFNASFWRKVEAFGRERMAREVAAL
RHANERMRTICIDGGHAVDAAAIQDEAMQPWQPLGTKSILGYNLKKSIGQRHAQLCRR
MLTPEIQYLMDLGANLWVTKLWKFIRDFLRW"
misc_feature 136..1359
/gene="GAL3ST1"
/gene_synonym="CST"
/note="Gal-3-0_sulfotr; Region:
Galactose-3-O-sulfotransferase. This family consists of
several mammalian galactose-3-O-sulfotransferase proteins.
Gal-3-O-sulfotransferase is thought to play a critical
role in 3'-sulfation of N-acetyllactosamine in both O- and
N-glycans"
/db_xref="CDD:pfam06990"
BASE COUNT 344 a 595 c 491 g 315 t
ORIGIN
1 tccagaaggt accttgggtc tcactgtgtc atccagagct ggagtgcagc ggcacagtca
61 tggctcactg gaactcaggc tcaagcaatc ctcccgcctc agccttccaa gtaactagga
121 ctacaggtgt ctgagatgct gccaccgcag aagaagccct gggagtccat ggctaagggg
181 ctggtgctgg gcgcgctctt cactagtttc ctgctgctga tgtactccta tgccgtgccc
241 ccgctgcatg ccggcctggc ctccacgacc ccggaggccg cagcgtcctg ctctccacct
301 gcactcgagc cagaggcagt gatccgggcc aacggctcgg cgggggagtg ccagccgcgg
361 cgcaacatcg tgttcttgaa gacgcacaag acggccagca gcaccctgct caacatcctg
421 ttccgcttcg gccagaagca ccggctcaag ttcgccttcc ctaacggccg caatgacttc
481 gactacccga ccttcttcgc ccgcagcctg gtgcaggact atcggcccgg ggcctgcttc
541 aacatcatct gcaaccacat gcgcttccac tacgacgagg tgcgcggcct ggtgccgacc
601 aacgccatct tcatcacggt gctccgcgac cccgcccgct tgttcgagtc ctccttccac
661 tacttcgggc cggtggtgcc cctcacgtgg aagctctcgg ccggcgacaa gctgaccgag
721 ttcctgcaag acccggatcg ctactacgac cccaacggct tcaatgccca ctacctccga
781 aacctgctct tcttcgacct gggctatgac aacagcctgg accccagcag cccgcaggtg
841 caggagcaca tcctggaggt ggagcgtcgc ttccacctgg tgctccttca agagtacttc
901 gacgagtcgc tggtgctgct gaaggacctg ctgtgctggg agctggagga cgtgctctac
961 ttcaagctca acgcccgccg cgactcgccc gtgccgcggc tctcggggga gctgtatggg
1021 cgcgccaccg cctggaacat gctggactcc cacctctacc gccacttcaa cgccagcttc
1081 tggcgcaagg tggaggcctt cgggcgggag cgcatggccc gcgaggtggc cgccctgcgc
1141 catgccaacg agcgcatgcg gaccatctgc atcgacgggg gccacgccgt ggacgccgcc
1201 gccatccagg acgaggccat gcagccctgg cagccgctgg gcaccaagtc catcctgggc
1261 tacaacctca agaagagcat cgggcagcgg cacgcgcagc tctgccggcg catgctcacg
1321 cccgagatcc agtacctgat ggacctcggc gccaacctgt gggtcaccaa gctctggaag
1381 ttcattcgcg atttcctgcg gtggtgacgt cccaccgccc agcggcttgc ctgcctgctc
1441 gctccctgca gaggggctga gcaggacgcc gctggtgctg gccgccccca gccccctcct
1501 ggtgccacct cagaccccgg ggtgaggggg ggctccctgg ggggaggcag ccagccaaga
1561 ctgggcccat gaacacagag agggcctaac cgagatcagt atttaactaa ttataccagt
1621 ttttattaaa cccctttccc tccccgataa agaatgttct atttctgcct ccccttaaag
1681 gggagacctc agaagtaaag gaatttgatg ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa
1741 aaaaa
//