LOCUS BC012976 2382 bp mRNA linear HUM 08-FEB-2007 DEFINITION Homo sapiens galactose-3-O-sulfotransferase 4, mRNA (cDNA clone MGC:15045 IMAGE:3636329), complete cds. ACCESSION BC012976 VERSION BC012976.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2382) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2382) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (20-AUG-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC012976.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 24 Row: f Column: 17 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31542834. FEATURES Location/Qualifiers source 1..2382 /db_xref="H-InvDB:HIT000036056" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:15045 IMAGE:3636329" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2382 /gene="GAL3ST4" /gene_synonym="GAL3ST-4" /db_xref="GeneID:79690" /db_xref="HGNC:HGNC:24145" /db_xref="MIM:608235" CDS 213..1673 /gene="GAL3ST4" /gene_synonym="GAL3ST-4" /codon_start=1 /product="galactose-3-O-sulfotransferase 4" /protein_id="AAH12976.1" /db_xref="GeneID:79690" /db_xref="HGNC:HGNC:24145" /db_xref="MIM:608235" /translation="MGPLSPARTLRLWGPRSLGVALGVFMTIGFALQLLGGPFQRRLP GLQLRQPSAPSLRPALPSCPPRQRLVFLKTHKSGSSSVLSLLHRYGDQHGLRFALPAR YQFGYPKLFQASRVKGYRPQGGGTQLPFHILCHHMRFNLKEVLQVMPSDSFFFSIVRD PAALARSAFSYYKSTSSAFRKSPSLAAFLANPRGFYRPGARGDHYARNLLWFDFGLPF PPEKRAKRGNIHPPRDPNPPQLQVLPSGAGPRAQTLNPNALIHPVSTVTDHRSQISSP ASFDLGSSSFIQWGLAWLDSVFDLVMVAEYFDESLVLLADALCWGLDDVVGFMHNAQA GHKQGLSTVSNSGLTAEDRQLTARARAWNNLDWALYVHFNRSLWARIEKYGQGRLQTA VAELRARREALAKHCLVGGEASDPKYITDRRFRPFQFGSAKVLGYILRSGLSPQDQEE CERLATPELQYKDKLDVKQFPPTVSLPLKTSRPLSP" BASE COUNT 485 a 752 c 581 g 564 t ORIGIN 1 ccttgctgct ccccttgtgg cctgacgctg acagaggcaa aaatctgcta actcaggggg 61 cagactcaac caagactgtg agcaggcctg gggaatgacc ccccgatctc caaccagtgc 121 cttccgcagc tgcacggctg tctccagctg tctctgcccc tcttcctggc cctggctcca 181 tctctctgtc acctcaccct tccctgtgcc acatgggccc tctctctcct gccaggacgc 241 tgcggctctg gggacctcgg agcctggggg tggctctggg agtcttcatg accattggct 301 ttgcactcca gctcttggga gggcccttcc agaggaggct acctgggcta cagctccgac 361 agccctcggc cccatcccta cgaccagccc ttccgtcctg cccaccccgg cagcgactgg 421 tgttcctgaa gacacataaa tccgggagca gctctgtgct gagcctgctt caccgctatg 481 gggaccagca cgggctgcgc ttcgccctcc ctgcccgcta ccagtttggc tacccaaagc 541 tcttccaggc ctctagggta aaaggctacc gcccacaggg tggaggcacc cagctcccct 601 tccacatcct ctgtcaccac atgaggttca acctgaaaga ggtacttcag gtcatgcctt 661 ctgacagctt ctttttttcc attgtccgag acccagcggc tctggctcgc tctgccttct 721 cctactataa atccacctca tcagccttcc gcaagtcacc atctttggct gccttcctgg 781 ccaatcctcg aggcttctac aggcctgggg cccgtgggga ccactacgct cgcaacttac 841 tatggtttga ctttggcctg ccctttcccc cagagaagag ggccaagaga gggaatattc 901 atccccccag agaccccaac cccccacagc tgcaggtctt gccttctggt gctggccctc 961 gagcccaaac cctcaatccc aatgccctca tccatcctgt ttccactgtt actgatcatc 1021 gcagccagat atcaagccct gcctctttcg atttggggtc ttcatccttc atccagtggg 1081 gtctggcctg gctggactct gtctttgacc tggtcatggt ggctgagtac ttcgatgagt 1141 cattggttct gctggcagat gccctgtgct ggggtctaga tgacgtggtg ggcttcatgc 1201 acaatgccca ggctggacat aagcagggcc tcagcactgt cagcaacagt ggactgactg 1261 cggaggaccg gcagctgact gcacgggccc gagcctggaa caacctggac tgggctctct 1321 atgtccactt caaccgcagt ctctgggcac ggatagagaa atacggccag ggccggctgc 1381 agacagctgt ggccgagctc cgggctcgcc gagaggccct agcgaaacat tgtctggtag 1441 ggggtgaggc ttctgacccc aaatacatca ctgatcgccg gttccgcccc ttccagtttg 1501 ggtcagctaa ggttttgggc tatatacttc ggagtggatt gagcccccaa gaccaagagg 1561 aatgtgagcg cctagctacc cctgagctcc agtacaagga caagctggat gtcaagcagt 1621 tcccccctac cgtctcactg cccctcaaga cttcaaggcc actctcccca taaacatcag 1681 actacagatt taggtggaag agcagccatg tttgaagggc acatgtgatg agtggggggc 1741 agcaagatgc catttctgca tctcccagaa gggatgagtc tttgtcccga tgcaagcccc 1801 ctcttcgctg ggctcccagc agtgcttccc tcctccaccc tccactcatt ttgttctttc 1861 cccccaactt tttttttttt gaaacggagt cttgctctgt cccccaggct ggagtgcagt 1921 ggcatgatct cggctcactg caacctctgc ctcccaggtt caagcgattc tcctgcctca 1981 gcctccagag tagctaggat tacagatacg tgccaccata cccggctaat ttttatattt 2041 ttagagacag ggattcaaca tgttggttag gctggtcttg aactcctcac ctcaggtgat 2101 ccacatgact ctgcctccca aagtgctgcc attacaggcg tgagccacta ggcctgacct 2161 ccccttcccc tttcctgccc caaggcagat ccacatcacc gaagctccct agaggggcaa 2221 aagatggagt gagccacagg aagtttgggg cgtggtgagt tggaatgata cgtccatttc 2281 tctatgaaat atttgctact agactgttca tttctctctg acatgtttgt tgaatgaata 2341 aataatttga aacttcaaaa aaaaaaaaaa aaaaaaaaaa aa //