LOCUS BC019077 1637 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens galactose-3-O-sulfotransferase 1, mRNA (cDNA clone MGC:29648 IMAGE:4562507), complete cds. ACCESSION BC019077 VERSION BC019077.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1637) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1637) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (07-DEC-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Dec 19, 2003 this sequence version replaced BC019077.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 39 Row: d Column: 2 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 4758087. FEATURES Location/Qualifiers source 1..1637 /db_xref="H-InvDB:HIT000038536" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:29648 IMAGE:4562507" /tissue_type="Kidney, renal cell adenocarcinoma" /clone_lib="NIH_MGC_14" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..1637 /gene="GAL3ST1" /gene_synonym="CST" /db_xref="GeneID:9514" /db_xref="HGNC:HGNC:24240" /db_xref="MIM:602300" CDS 31..1302 /gene="GAL3ST1" /gene_synonym="CST" /codon_start=1 /product="galactose-3-O-sulfotransferase 1" /protein_id="AAH19077.1" /db_xref="GeneID:9514" /db_xref="HGNC:HGNC:24240" /db_xref="MIM:602300" /translation="MLPPQKKPWESMAKGLVLGALFTSFLLLMYSYAVPPLHAGLAST TPEAAASCSPPALEPEAVIRANGSAGECQPRRNIVFLKTHKTASSTLLNILFRFGQKH RLKFAFPNGRNDFDYPTFFARSLVQDYRPGACFNIICNHMRFHYDEVRGLVPTNAIFI TVLRDPARLFESSFHYFGPVVPLTWKLSAGDKLTEFLQDPDRYYDPNGFNAHYLRNLL FFDLGYDNSLDPSSPQVQEHILEVERRFHLVLLQEYFDESLVLLKDLLCWELEDVLYF KLNARRDSPVPRLSGELYGRATAWNMLDSHLYRHFNASFWRKVEAFGRERMAREVAAL RHANERMRTICIDGGHAVDAAAIQDEAMQPWQPLGTKSILGYNLKKSIGQRHAQLCRR MLTPEIQYLMDLGANLWVTKLWKFIRDFLRW" BASE COUNT 306 a 561 c 467 g 303 t ORIGIN 1 ctgattccag aaggtacctt ggtgtctgag atgctgccac cgcagaagaa gccctgggag 61 tccatggcta aggggctggt gctgggcgcg ctcttcacta gtttcctgct gctgatgtac 121 tcctatgccg tgcccccgct gcatgccggc ctggcctcca cgaccccgga ggccgcagcg 181 tcctgctctc cacctgcact cgagccagag gcagtgatcc gggccaacgg ctcggcgggg 241 gagtgccagc cgcggcgcaa catcgtgttc ttgaagacgc acaagacggc cagcagcacc 301 ctgctcaaca tcctgttccg cttcggccag aagcaccggc tcaagttcgc cttccctaac 361 ggccgcaatg acttcgacta cccgaccttc ttcgcccgca gcctggtgca ggactatcgg 421 cccggggcct gcttcaacat catctgcaac cacatgcgct tccactacga cgaggtgcgc 481 ggcctggtgc cgaccaacgc catcttcatc acggtgctcc gcgaccccgc ccgcttgttc 541 gagtcctcct tccactactt cgggccggtg gtgcccctca cgtggaagct ctcggccggc 601 gacaagctga ccgagttcct gcaagacccg gatcgctact acgaccccaa cggcttcaat 661 gcccactacc tccgaaacct gctcttcttc gacctgggct atgacaacag cctggacccc 721 agcagcccgc aggtgcagga gcacatcctg gaggtggagc gtcgcttcca cctggtgctc 781 cttcaagagt acttcgacga gtcgctggtg ctgctgaagg acctgctgtg ctgggagctg 841 gaggacgtgc tctacttcaa gctcaacgcc cgccgcgact cgcccgtgcc gcggctctcg 901 ggggagctgt atgggcgcgc caccgcctgg aacatgctgg actcccacct ctaccgccac 961 ttcaacgcca gcttctggcg caaggtggag gccttcgggc gggagcgcat ggcccgcgag 1021 gtggccgccc tgcgccatgc caacgagcgc atgcggacca tctgcatcga cgggggccac 1081 gccgtggacg ccgccgccat ccaggacgag gccatgcagc cctggcagcc gctgggcacc 1141 aagtccatcc tgggctacaa cctcaagaag agcatcgggc agcggcacgc gcagctctgc 1201 cggcgcatgc tcacgcccga gatccagtac ctgatggacc tcggcgccaa cctgtgggtc 1261 accaagctct ggaagttcat tcgcgatttc ctgcggtggt gacgtcccac cgcccagcgg 1321 cttgcctgcc tgctcgctcc ctgcagaggg gctgagcagg acgccgctgg tgctggccgc 1381 ccccagcccc ctcctggtgc cacctcagac cccggggtga gggggggctc cctgggggga 1441 ggcagccagc caagactggg cccatgaaca cagagagggc ctaaccgaga tcagtattta 1501 actaattata ccagttttta ttaaacccct ttccctcccc gataaagaat gttctatttc 1561 tgcctcccct taaaggggag acctcagaag taaaggaatt tgatgttgtg tttttgttaa 1621 aaaaaaaaaa aaaaaaa //