LOCUS       AF105378                3201 bp    mRNA    linear   HUM 16-OCT-2007
DEFINITION  Homo sapiens heparan sulfate 3-O-sulfotransferase-4 (HS3ST4) mRNA,
            complete cds.
ACCESSION   AF105378
VERSION     AF105378.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1008 to 3201)
  AUTHORS   Shworak,N.W., Liu,J., Petros,L.M., Zhang,L., Kobayashi,M.,
            Copeland,N.G., Jenkins,N.A. and Rosenberg,R.D.
  TITLE     Multiple isoforms of heparan sulfate D-glucosaminyl
            3-O-sulfotransferase. Isolation, characterization, and expression
            of human cdnas and identification of distinct genomic loci
  JOURNAL   J. Biol. Chem. 274 (8), 5170-5184 (1999)
   PUBMED   9988767
REFERENCE   2  (bases 1 to 3201)
  AUTHORS   Lawrence,R., Yabe,T., Hajmohammadi,S., Rhodes,J., McNeely,M.,
            Liu,J., Lamperti,E.D., Toselli,P.A., Lech,M., Spear,P.G.,
            Rosenberg,R.D. and Shworak,N.W.
  TITLE     The principal neuronal gD-type 3-O-sulfotransferases and their
            products in central and peripheral nervous system tissues
  JOURNAL   Matrix Biol. 26 (6), 442-455 (2007)
   PUBMED   17482450
REFERENCE   3  (bases 1008 to 3201)
  AUTHORS   Shworak,N.W., Liu,J. and Rosenberg,R.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (12-NOV-1998) Biology, Massachusetts Institute of
            Technology, 31 Ames St, Cambridge, MA 02139, USA
REFERENCE   4  (bases 1 to 3201)
  AUTHORS   Lawrence,R., Yabe,T., HajMohammadi,S., Rhodes,J., McNeely,M.,
            Liu,J., Lamperti,E.D., Toselli,P.A., Spear,P.G., Rosenberg,R.D. and
            Shworak,N.W.
  TITLE     Direct Submission
  JOURNAL   Submitted (18-NOV-2003) Medicine, Dartmouth Medical School, One
            Medical Center Drive, Lebanon, NH 03756, USA
  REMARK    Nucleotide sequence updated by submitter
COMMENT     On Apr 16, 2004 this sequence version replaced AF105378.1.
FEATURES             Location/Qualifiers
     source          1..3201
                     /db_xref="H-InvDB:HIT000069486"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="16"
                     /map="16p11.2"
                     /tissue_type="brain"
                     /dev_stage="infant"
                     /note="isolated by hybridization screening using IMAGE
                     Consortium clone 23279 found in GenBank Accession Number
                     T75445"
     gene            1..3201
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     5'UTR           1..389
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     stem_loop       join(282..311,352..363)
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="paired inverted repeat"
     CDS             390..1760
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /EC_number="2.8.2.23"
                     /note="heparan sulfate D-glucosaminyl
                     3-O-sulfotransferase-4; similar to EC 2.8.2.30; creates
                     3-O-sulfated HS motifs that trigger cellular entry of
                     herpes simplex virus-1"
                     /codon_start=1
                     /product="heparan sulfate 3-O-sulfotransferase-4"
                     /protein_id="AAD30210.2"
                     /translation="MARWPAPPPPPPPPPPLAAPPPPGASAKGPPARKLLFMCTLSLS
                     VTYLCYSLLGGSGSLQFPLALQESPGAAAEPPPSPPPPSLLPTPVRLGAPSQPPAPPP
                     LDNASHGEPPEPPEQPAAPGTDGWGLPSGGGGARDAWLRTPLAPSEMITAQSALPERE
                     AQESSTTDEDLAGRRAANGSSERGGAVSTPDYGEKKLPQALIIGVKKGGTRALLEAIR
                     VHPDVRAVGVEPHFFDRNYEKGLEWYRNVMPKTLDGQITMEKTPSYFVTNEAPKRIHS
                     MAKDIKLIVVVRNPVTRAISDYTQTLSKKPEIPTFEVLAFKNRTLGLIDASWSAIRIG
                     IYALHLENWLQYFPLSQILFVSGERLIVDPAGEMAKVQDFLGLKRVVTKKHFYFNKTK
                     GFPCLKKPEDSSAPRCLGKSKGRTHPRIDPDVIHRLRKFYKPFNLMFYQMTGQDFQWE
                     QEEGDK"
     misc_feature    492..548
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative; transmembrane region"
     misc_feature    591..593
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    621..623
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    636..638
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    672..674
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    702..710
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     misc_feature    840..842
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    849..851
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative O-linked; glycosylation site"
     misc_feature    921..929
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     misc_feature    966..1757
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="Region: putative protein sulfotransferase domain"
     misc_feature    1341..1349
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     misc_feature    1353..1364
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
                     /note="putative N-linked; glycosylation site"
     3'UTR           1761..3201
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     regulatory      3178..3182
                     /regulatory_class="polyA_signal_sequence"
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
     polyA_site      3201
                     /gene="HS3ST4"
                     /gene_synonym="3OST4"
BASE COUNT          696 a          931 c          945 g          629 t
ORIGIN      
        1 cagcgcgcgg cggcggcggc agaggctgaa gcagaagccg cggcggagcc ggggaagcgg
       61 gggcgctgca gacggagcag gtgccgccgg cgggtccgcg cgcccccctc ggtccccttg
      121 cctgaggctg aggggggggc ggtggtgggg gggccactcg gactcggcgg gcagcgtggg
      181 gcggggggcc atgcggccgg gctcccccct ggcgcagcgg gacagcggcc agggccgggg
      241 gcgcagcggc gtcgcttcat gcagccgggg cggctgggca gcggcggcgg cggcggcggc
      301 ggcggcggcg ggggcggcgg ctgaaaccat gtccgggcag cgccgggggc tgccgccgcc
      361 gccgccgccg ccgcgagccg ggagccgcga tggcccggtg gcccgcacct cctccgcctc
      421 cgcctccgcc tccacctctg gccgcgccgc cgccgcccgg cgcctctgct aaggggccgc
      481 cggcgcgcaa gctgcttttt atgtgcacct tgtccctgtc tgtcacctac ctgtgctaca
      541 gcctcctggg cggctcgggc tccctgcaat tccctctggc gctgcaggag tcgccgggcg
      601 ccgccgccga gcccccgccg agcccgccgc caccctctct gctgcctacc cccgtgcgcc
      661 tcggcgcccc ctcgcagccg cccgcgccgc cgccgctgga caacgcgagc cacggggagc
      721 cgcccgagcc cccagagcag ccagccgccc ccgggaccga cggctggggg ctgccgagcg
      781 gcggcggagg cgcccgggac gcctggctcc ggaccccgct ggcccccagc gagatgatca
      841 cggctcagag cgcgctgccg gagagggaag cgcaggagtc cagcaccacc gacgaggatc
      901 tcgcaggccg gagagcggcc aacgggagca gcgagagggg cggcgccgtc agcacccccg
      961 actatgggga gaagaagctg ccacaggcgc tcatcatcgg ggtcaagaaa ggagggaccc
     1021 gcgcgctgct ggaggcgatc cgcgtgcacc cggacgtgcg ggcggtgggc gtagagccgc
     1081 acttcttcga caggaactac gaaaaggggt tggagtggta cagaaatgtg atgcccaaga
     1141 ctttggatgg gcaaataacc atggagaaga ctccaagtta ctttgtgaca aatgaggctc
     1201 ccaagcgcat tcactccatg gccaaggaca tcaaactgat tgtggtggtg agaaaccccg
     1261 tgaccagggc catctctgac tacacgcaga cactgtcaaa gaaacccgag atccccacct
     1321 ttgaggtgct ggccttcaaa aaccggaccc tcgggctgat cgatgcttcc tggagtgcca
     1381 ttcgaatagg gatctatgcg ctgcatctgg aaaactggct ccagtatttc cccctctccc
     1441 agatcctctt tgtcagtggt gagcgactca ttgtggaccc cgccggggaa atggccaaag
     1501 tacaggattt tctaggcctc aaacgtgttg tgactaagaa gcatttctat ttcaacaaaa
     1561 ccaaggggtt cccttgccta aagaagccag aagacagcag tgccccgagg tgcttaggca
     1621 agagcaaagg tcggactcat cctcgcattg acccagatgt catccacaga ctgaggaaat
     1681 tctacaaacc cttcaacttg atgttttacc aaatgactgg tcaagatttt cagtgggaac
     1741 aggaagaggg tgataaatga ggctagagag gcagaggaag gctagtcaat aagctaagga
     1801 ggctccttgc ctgagtcctt gaatacccca gcttctgcag cttcacttgc tggagtgcca
     1861 agtagatctc ctcctccttc atgcagccag gattgcctcc agtgctgtta gcttaggcaa
     1921 acaggtggat cccatggcat ccccatggag gaaccaggcc catctgggca gcagcatctg
     1981 gttgaccaga tggccaccag aacccactgt tcattcttat cttctgctag ttaatatagc
     2041 ctgaagacag aggataaata gttgtcaatg tcagagacag tgctattaat gtatatgtga
     2101 gcgacaaaaa aggtctgctt tataggggtt ctcactctag cttggggagc ccagggttct
     2161 agccctgtat ctgtcatggg cacctgctgt ctaaacctct gcttgggctt ctccccagaa
     2221 tgcactttgt ggctgagtgc tccaggactc ctagggagca agctcctccc tctaaggtgt
     2281 ttctagtctt ctctttaaag gtctcatccc acaacccctg acttcctccc tccccacatc
     2341 atgaaggcag aggcatgcac attcctcact gaaaaagaaa acacacaccc acccacacac
     2401 acacacacag aagaaaatga aagctgacac acctcgaagc cttctttcca agagccctct
     2461 aaatggggtt gggtctcact cttcatgagt atcctgggtt gtgcagaagc ttagcatatg
     2521 cccttgtgtt cggatcaggc ccacagggct gctcaaagag tagagtaatt gtaaccgagg
     2581 tcagagctct ggggttggca gagatgagtg gccatatctg ggggtaaaag aagaaatcct
     2641 gtcctcttgg tgggaggtta ccttacctga agaccatctc tcccaagcac tgtagttctg
     2701 agcatgtttt tggggtggac tctgtcccct agggtcccta gaagggcaaa gaccagagag
     2761 ttgacaagtc tgttattagg aataatcctt agccatgtaa tggagaaagg agcagtcagc
     2821 attcttccaa tttgccccac caccacctcc tcgggcttca ttttctctat ttagagatgg
     2881 cagagagtga ggtagtggcg agaaagctga ctccattcat cagatccagt ttatgagggt
     2941 tgggggtgag caagggctgt ctgcagaaac ccccatcaag agctgctgaa tgaagtgtcc
     3001 cttcccatca gtttgattca attaaaatgc atcatttgac ataaagcact tgttcacaga
     3061 tctccaaaac caggaattgt tctagtaaaa ctggaaattt gtatgagtgg ggggagttaa
     3121 atctgttcag ctgttattaa actgtcattt ctcccgctaa atgaaaaccg tgttgttata
     3181 aagcttaatg caacctgatt a
//