LOCUS BC014649 1745 bp mRNA linear HUM 30-MAR-2004 DEFINITION Homo sapiens galactose-3-O-sulfotransferase 1, mRNA (cDNA clone MGC:20064 IMAGE:4564703), complete cds. ACCESSION BC014649 VERSION BC014649.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1745) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1745) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (24-SEP-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Steve Jones, Sarah Barber, Mabel Brown-John, Yaron Butterfield, Andy Chan, Steve S. Chand, William Chow, Alison Cloutier, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Amara Masson, Mike R. Mayo, Josh Moran, Ryan Morin, Teika Olson, Diana Palmquist, Anca Petrescu, Anna Liisa Prahbu, Parvaneh Saeedi, JR Santos, Angelique Schnerch, Ursula Skalska, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 29 Row: h Column: 22 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 4758087. FEATURES Location/Qualifiers source 1..1745 /db_xref="H-InvDB:HIT000036830" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:20064 IMAGE:4564703" /tissue_type="Kidney, renal cell adenocarcinoma" /clone_lib="NIH_MGC_14" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1745 /gene="GAL3ST1" /gene_synonym="CST" /db_xref="GeneID:9514" /db_xref="MIM:602300" CDS 136..1407 /gene="GAL3ST1" /gene_synonym="CST" /codon_start=1 /product="GAL3ST1 protein" /protein_id="AAH14649.1" /db_xref="GeneID:9514" /db_xref="MIM:602300" /translation="MLPPQKKPWESMAKGLVLGALFTSFLLLMYSYAVPPLHAGLAST TPEAAASCSPPALEPEAVIRANGSAGECQPRRNIVFLKTHKTASSTLLNILFRFGQKH RLKFAFPNGRNDFDYPTFFARSLVQDYRPGACFNIICNHMRFHYDEVRGLVPTNAIFI TVLRDPARLFESSFHYFGPVVPLTWKLSAGDKLTEFLQDPDRYYDPNGFNAHYLRNLL FFDLGYDNSLDPSSPQVQEHILEVERRFHLVLLQEYFDESLVLLKDLLCWELEDVLYF KLNARRDSPVPRLSGELYGRATAWNMLDSHLYRHFNASFWRKVEAFGRERMAREVAAL RHANERMRTICIDGGHAVDAAAIQDEAMQPWQPLGTKSILGYNLKKSIGQRHAQLCRR MLTPEIQYLMDLGANLWVTKLWKFIRDFLRW" misc_feature 136..1359 /gene="GAL3ST1" /gene_synonym="CST" /note="Gal-3-0_sulfotr; Region: Galactose-3-O-sulfotransferase. This family consists of several mammalian galactose-3-O-sulfotransferase proteins. Gal-3-O-sulfotransferase is thought to play a critical role in 3'-sulfation of N-acetyllactosamine in both O- and N-glycans" /db_xref="CDD:pfam06990" BASE COUNT 344 a 595 c 491 g 315 t ORIGIN 1 tccagaaggt accttgggtc tcactgtgtc atccagagct ggagtgcagc ggcacagtca 61 tggctcactg gaactcaggc tcaagcaatc ctcccgcctc agccttccaa gtaactagga 121 ctacaggtgt ctgagatgct gccaccgcag aagaagccct gggagtccat ggctaagggg 181 ctggtgctgg gcgcgctctt cactagtttc ctgctgctga tgtactccta tgccgtgccc 241 ccgctgcatg ccggcctggc ctccacgacc ccggaggccg cagcgtcctg ctctccacct 301 gcactcgagc cagaggcagt gatccgggcc aacggctcgg cgggggagtg ccagccgcgg 361 cgcaacatcg tgttcttgaa gacgcacaag acggccagca gcaccctgct caacatcctg 421 ttccgcttcg gccagaagca ccggctcaag ttcgccttcc ctaacggccg caatgacttc 481 gactacccga ccttcttcgc ccgcagcctg gtgcaggact atcggcccgg ggcctgcttc 541 aacatcatct gcaaccacat gcgcttccac tacgacgagg tgcgcggcct ggtgccgacc 601 aacgccatct tcatcacggt gctccgcgac cccgcccgct tgttcgagtc ctccttccac 661 tacttcgggc cggtggtgcc cctcacgtgg aagctctcgg ccggcgacaa gctgaccgag 721 ttcctgcaag acccggatcg ctactacgac cccaacggct tcaatgccca ctacctccga 781 aacctgctct tcttcgacct gggctatgac aacagcctgg accccagcag cccgcaggtg 841 caggagcaca tcctggaggt ggagcgtcgc ttccacctgg tgctccttca agagtacttc 901 gacgagtcgc tggtgctgct gaaggacctg ctgtgctggg agctggagga cgtgctctac 961 ttcaagctca acgcccgccg cgactcgccc gtgccgcggc tctcggggga gctgtatggg 1021 cgcgccaccg cctggaacat gctggactcc cacctctacc gccacttcaa cgccagcttc 1081 tggcgcaagg tggaggcctt cgggcgggag cgcatggccc gcgaggtggc cgccctgcgc 1141 catgccaacg agcgcatgcg gaccatctgc atcgacgggg gccacgccgt ggacgccgcc 1201 gccatccagg acgaggccat gcagccctgg cagccgctgg gcaccaagtc catcctgggc 1261 tacaacctca agaagagcat cgggcagcgg cacgcgcagc tctgccggcg catgctcacg 1321 cccgagatcc agtacctgat ggacctcggc gccaacctgt gggtcaccaa gctctggaag 1381 ttcattcgcg atttcctgcg gtggtgacgt cccaccgccc agcggcttgc ctgcctgctc 1441 gctccctgca gaggggctga gcaggacgcc gctggtgctg gccgccccca gccccctcct 1501 ggtgccacct cagaccccgg ggtgaggggg ggctccctgg ggggaggcag ccagccaaga 1561 ctgggcccat gaacacagag agggcctaac cgagatcagt atttaactaa ttataccagt 1621 ttttattaaa cccctttccc tccccgataa agaatgttct atttctgcct ccccttaaag 1681 gggagacctc agaagtaaag gaatttgatg ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa 1741 aaaaa //