LOCUS       AF105378                3201 bp    mRNA    linear   HUM 16-OCT-2007
DEFINITION  Homo sapiens heparan sulfate 3-O-sulfotransferase-4 (HS3ST4) mRNA,
            complete cds.
ACCESSION   AF105378
VERSION     AF105378.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1008 to 3201)
  AUTHORS   Shworak,N.W., Liu,J., Petros,L.M., Zhang,L., Kobayashi,M.,
            Copeland,N.G., Jenkins,N.A. and Rosenberg,R.D.
  TITLE     Multiple isoforms of heparan sulfate D-glucosaminyl
            3-O-sulfotransferase. Isolation, characterization, and expression
            of human cdnas and identification of distinct genomic loci
  JOURNAL   J. Biol. Chem. 274 (8), 5170-5184 (1999)
   PUBMED   9988767
REFERENCE   2  (bases 1 to 3201)
  AUTHORS   Lawrence,R., Yabe,T., Hajmohammadi,S., Rhodes,J., McNeely,M.,
            Liu,J., Lamperti,E.D., Toselli,P.A., Lech,M., Spear,P.G.,
            Rosenberg,R.D. and Shworak,N.W.
  TITLE     The principal neuronal gD-type 3-O-sulfotransferases and their
            products in central and peripheral nervous system tissues
  JOURNAL   Matrix Biol. 26 (6), 442-455 (2007)
   PUBMED   17482450
REFERENCE   3  (bases 1008 to 3201)
  AUTHORS   Shworak,N.W., Liu,J. and Rosenberg,R.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-NOV-1998) Biology, Massachusetts Institute of
            Technology, 31 Ames St, Cambridge, MA 02139, USA
REFERENCE   4  (bases 1 to 3201)
  AUTHORS   Lawrence,R., Yabe,T., HajMohammadi,S., Rhodes,J., McNeely,M.,
            Liu,J., Lamperti,E.D., Toselli,P.A., Spear,P.G., Rosenberg,R.D. and
            Shworak,N.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-NOV-2003) Medicine, Dartmouth Medical School, One
            Medical Center Drive, Lebanon, NH 03756, USA
  REMARK    Nucleotide sequence updated by submitter
COMMENT     On Apr 16, 2004 this sequence version replaced AF105378.1.
FEATURES             Location/Qualifiers
     source          1..3201
                     /db_xref="H-InvDB:HIT000069486"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="16"
                     /map="16p11.2"
                     /tissue_type="brain"
                     /dev_stage="infant"
                     /note="isolated by hybridization screening using IMAGE
                     Consortium clone 23279 found in GenBank Accession Number
                     T75445"
     gene            1..3201
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     5'UTR           1..389
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     stem_loop       join(282..311,352..363)
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="paired inverted repeat"
     CDS             390..1760
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /EC_number="2.8.2.23"
                     /note="heparan sulfate D-glucosaminyl
                     3-O-sulfotransferase-4; similar to EC 2.8.2.30; creates
                     3-O-sulfated HS motifs that trigger cellular entry of
                     herpes simplex virus-1"
                     /codon_start=1
                     /product="heparan sulfate 3-O-sulfotransferase-4"
                     /protein_id="AAD30210.2"
                     /translation="MARWPAPPPPPPPPPPLAAPPPPGASAKGPPARKLLFMCTLSLS
                     VTYLCYSLLGGSGSLQFPLALQESPGAAAEPPPSPPPPSLLPTPVRLGAPSQPPAPPP
                     LDNASHGEPPEPPEQPAAPGTDGWGLPSGGGGARDAWLRTPLAPSEMITAQSALPERE
                     AQESSTTDEDLAGRRAANGSSERGGAVSTPDYGEKKLPQALIIGVKKGGTRALLEAIR
                     VHPDVRAVGVEPHFFDRNYEKGLEWYRNVMPKTLDGQITMEKTPSYFVTNEAPKRIHS
                     MAKDIKLIVVVRNPVTRAISDYTQTLSKKPEIPTFEVLAFKNRTLGLIDASWSAIRIG
                     IYALHLENWLQYFPLSQILFVSGERLIVDPAGEMAKVQDFLGLKRVVTKKHFYFNKTK
                     GFPCLKKPEDSSAPRCLGKSKGRTHPRIDPDVIHRLRKFYKPFNLMFYQMTGQDFQWE
                     QEEGDK"
     misc_feature    492..548
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative; transmembrane region"
     misc_feature    591..593
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    621..623
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    636..638
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    672..674
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    702..710
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     misc_feature    840..842
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    849..851
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    921..929
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     misc_feature    966..1757
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="Region: putative protein sulfotransferase domain"
     misc_feature    1341..1349
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     misc_feature    1353..1364
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     3'UTR           1761..3201
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     regulatory      3178..3182
                     /regulatory_class="polyA_signal_sequence"
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     polyA_site      3201
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
BASE COUNT      696 a    931 c    945 g    629 t
ORIGIN
        1 cagcgcgcgg cggcggcggc agaggctgaa gcagaagccg cggcggagcc ggggaagcgg
       61 gggcgctgca gacggagcag gtgccgccgg cgggtccgcg cgcccccctc ggtccccttg
      121 cctgaggctg aggggggggc ggtggtgggg gggccactcg gactcggcgg gcagcgtggg
      181 gcggggggcc atgcggccgg gctcccccct ggcgcagcgg gacagcggcc agggccgggg
      241 gcgcagcggc gtcgcttcat gcagccgggg cggctgggca gcggcggcgg cggcggcggc
      301 ggcggcggcg ggggcggcgg ctgaaaccat gtccgggcag cgccgggggc tgccgccgcc
      361 gccgccgccg ccgcgagccg ggagccgcga tggcccggtg gcccgcacct cctccgcctc
      421 cgcctccgcc tccacctctg gccgcgccgc cgccgcccgg cgcctctgct aaggggccgc
      481 cggcgcgcaa gctgcttttt atgtgcacct tgtccctgtc tgtcacctac ctgtgctaca
      541 gcctcctggg cggctcgggc tccctgcaat tccctctggc gctgcaggag tcgccgggcg
      601 ccgccgccga gcccccgccg agcccgccgc caccctctct gctgcctacc cccgtgcgcc
      661 tcggcgcccc ctcgcagccg cccgcgccgc cgccgctgga caacgcgagc cacggggagc
      721 cgcccgagcc cccagagcag ccagccgccc ccgggaccga cggctggggg ctgccgagcg
      781 gcggcggagg cgcccgggac gcctggctcc ggaccccgct ggcccccagc gagatgatca
      841 cggctcagag cgcgctgccg gagagggaag cgcaggagtc cagcaccacc gacgaggatc
      901 tcgcaggccg gagagcggcc aacgggagca gcgagagggg cggcgccgtc agcacccccg
      961 actatgggga gaagaagctg ccacaggcgc tcatcatcgg ggtcaagaaa ggagggaccc
     1021 gcgcgctgct ggaggcgatc cgcgtgcacc cggacgtgcg ggcggtgggc gtagagccgc
     1081 acttcttcga caggaactac gaaaaggggt tggagtggta cagaaatgtg atgcccaaga
     1141 ctttggatgg gcaaataacc atggagaaga ctccaagtta ctttgtgaca aatgaggctc
     1201 ccaagcgcat tcactccatg gccaaggaca tcaaactgat tgtggtggtg agaaaccccg
     1261 tgaccagggc catctctgac tacacgcaga cactgtcaaa gaaacccgag atccccacct
     1321 ttgaggtgct ggccttcaaa aaccggaccc tcgggctgat cgatgcttcc tggagtgcca
     1381 ttcgaatagg gatctatgcg ctgcatctgg aaaactggct ccagtatttc cccctctccc
     1441 agatcctctt tgtcagtggt gagcgactca ttgtggaccc cgccggggaa atggccaaag
     1501 tacaggattt tctaggcctc aaacgtgttg tgactaagaa gcatttctat ttcaacaaaa
     1561 ccaaggggtt cccttgccta aagaagccag aagacagcag tgccccgagg tgcttaggca
     1621 agagcaaagg tcggactcat cctcgcattg acccagatgt catccacaga ctgaggaaat
     1681 tctacaaacc cttcaacttg atgttttacc aaatgactgg tcaagatttt cagtgggaac
     1741 aggaagaggg tgataaatga ggctagagag gcagaggaag gctagtcaat aagctaagga
     1801 ggctccttgc ctgagtcctt gaatacccca gcttctgcag cttcacttgc tggagtgcca
     1861 agtagatctc ctcctccttc atgcagccag gattgcctcc agtgctgtta gcttaggcaa
     1921 acaggtggat cccatggcat ccccatggag gaaccaggcc catctgggca gcagcatctg
     1981 gttgaccaga tggccaccag aacccactgt tcattcttat cttctgctag ttaatatagc
     2041 ctgaagacag aggataaata gttgtcaatg tcagagacag tgctattaat gtatatgtga
     2101 gcgacaaaaa aggtctgctt tataggggtt ctcactctag cttggggagc ccagggttct
     2161 agccctgtat ctgtcatggg cacctgctgt ctaaacctct gcttgggctt ctccccagaa
     2221 tgcactttgt ggctgagtgc tccaggactc ctagggagca agctcctccc tctaaggtgt
     2281 ttctagtctt ctctttaaag gtctcatccc acaacccctg acttcctccc tccccacatc
     2341 atgaaggcag aggcatgcac attcctcact gaaaaagaaa acacacaccc acccacacac
     2401 acacacacag aagaaaatga aagctgacac acctcgaagc cttctttcca agagccctct
     2461 aaatggggtt gggtctcact cttcatgagt atcctgggtt gtgcagaagc ttagcatatg
     2521 cccttgtgtt cggatcaggc ccacagggct gctcaaagag tagagtaatt gtaaccgagg
     2581 tcagagctct ggggttggca gagatgagtg gccatatctg ggggtaaaag aagaaatcct
     2641 gtcctcttgg tgggaggtta ccttacctga agaccatctc tcccaagcac tgtagttctg
     2701 agcatgtttt tggggtggac tctgtcccct agggtcccta gaagggcaaa gaccagagag
     2761 ttgacaagtc tgttattagg aataatcctt agccatgtaa tggagaaagg agcagtcagc
     2821 attcttccaa tttgccccac caccacctcc tcgggcttca ttttctctat ttagagatgg
     2881 cagagagtga ggtagtggcg agaaagctga ctccattcat cagatccagt ttatgagggt
     2941 tgggggtgag caagggctgt ctgcagaaac ccccatcaag agctgctgaa tgaagtgtcc
     3001 cttcccatca gtttgattca attaaaatgc atcatttgac ataaagcact tgttcacaga
     3061 tctccaaaac caggaattgt tctagtaaaa ctggaaattt gtatgagtgg ggggagttaa
     3121 atctgttcag ctgttattaa actgtcattt ctcccgctaa atgaaaaccg tgttgttata
     3181 aagcttaatg caacctgatt a
//