LOCUS       HUMVWFM                 6360 bp    mRNA    linear   HUM 07-AUG-1995
DEFINITION  Human von Willebrand factor mRNA, 3' end.
ACCESSION   M10321 M10312 M10320 M12782
VERSION     M10321.1
KEYWORDS    glycoprotein; von Willebrand factor.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 404; 1648 to 6360)
  AUTHORS   Sadler,J.E., Shelton-Inloes,B.B., Sorace,J.M., Harlan,J.M.,
            Titani,K. and Davie,E.W.
  TITLE     Cloning and characterization of two cDNAs coding for human von
            Willebrand factor
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 82 (19), 6394-6398 (1985)
   PUBMED   2864688
REFERENCE   2  (bases 123 to 2056)
  AUTHORS   Shelton-Inloes,B.B., Titani,K. and Sadler,J.E.
  TITLE     cDNA sequences for human von Willebrand factor reveal five types of
            repeated domains and five possible protein sequence polymorphisms
  JOURNAL   Biochemistry 25 (11), 3164-3171 (1986)
   PUBMED   3488076
COMMENT     Original source text: Human umbilical vein endothelial cell, cDNA
            to mRNA, clones lambda-HvWF[1,3] [1]; clones lambda-HvWF[2,4] [2].
            Draft entry and sequence in computer readable form for [2] kindly
            provided by J.E.Sadler, 11-AUG-1986.  The authors note numerous
            repeated protein regions and domains.  The DNA used for sequencing
            was obtained from a number of individuals.  The variations are
            probably alleles.  A number of combinations of the variations
            listed below are possible.
FEATURES             Location/Qualifiers
     source          1..6360
                     /db_xref="H-InvDB:HIT000194077"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="12pter-p12"
     gene            1..6360
                     /gene="F8VWF"
     mRNA            <1..6360
                     /gene="F8VWF"
                     /product="vWF mRNA"
     CDS             <1..6226
                     /gene="F8VWF"
                     /note="von Willebrand factor prepropeptide"
                     /codon_start=2
                     /protein_id="AAB59512.1"
                     /db_xref="GDB:G00-119-125"
                     /translation="WATCPGSLLPDAVLSSPLSHRSKRSLSCRPHMVKLVCPADNLRA
                     EGLECAKTCQNYDLECMSMGSVSGCLCPPGMVRHENRCVALERCPCFHQGKEYAPGET
                     VKIGCNTCVCRDRKWNCTDHVCDATCSTIGMAHYLTFDGLKYLFPGECQYVLVQDYCG
                     SNPGTFRILVGNKGCSHPSVKCKKRVTILVEGGEIELFDGEVNVKRPMKDETHFEVVE
                     SGRYIILLLGKALSVVWDRHLSISVVLKQTYQEKVCGLCGNFDGIQNNDLTSSNLQVE
                     EDPVDFGNSWKVSSQCADTRKVPLDSSPATCHNNIMKQTMVDSSCRILTSDVFQDCNK
                     LVDPEPYLDVCIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVTWRTATLCPQSCE
                     ERNLRENGYECEWRYNSCAPACQVTCQHPEPLACPVQCVEGCHAHCPPGKILDELLQT
                     CVDPEDCPVCEVAGRRFASGKKVTLNPSDPEHCQICHCDVVNLTCEACQEPGGLVVPP
                     TDAPVSPTTLYVEDISEPPLHDFYCSRLLDLVFLLDGSSRLSEAEFEVLKAFVVDMME
                     RLRISQKWVRVAVVEYHDGSHAYIGLKDRKRPSELRRIASQVKYAGSQVASTSEVLKY
                     TLFQIFSKIDRPEASRIALLLMASQEPQRMSRNFVRYVQGLKKKKVIVIPVGIGPHAN
                     LKQIRLIEKQAPENKAFVLSSVDELEQQRDEIVSYLCDLAPEAPPPTLPPDMAQVTVG
                     PGLLGVSTLGPKRNSMVLDVAFVLEGSDKIGEADFNRSKEFMEEVIQRMDVGQDSIHV
                     TVLQYSYMVTVEYPFSEAQSKGDILQRVREIRYQGGNRTNTGLALRYLSDHSFLVSQG
                     DREQAPNLVYMVTGNPASDEIKRLPGDIQVVPIGVGPNANVQELERIGWPNAPILIQD
                     FETLPREAPDLVLQRCCSGEGLQIPTLSPAPDCSQPLDVILLLDGSSSFPASYFDEMK
                     SFAKAFISKANIGPRLTQVSVLQYGSITTIDVPWNVVPEKAHLLSLVDVMQREGGPSQ
                     IGDALGFAVRYLTSEMHGARPGASKAVVILVTDVSVDSVDAAADAARSNRVTVFPIGI
                     GDRYDAAQLRILAGPAGDSNVVKLQRIEDLPTMVTLGNSFLHKLCSGFVRICMDEDGN
                     EKRPGDVWTLPDQCHTVTCQPDGQTLLKSHRVNCDRGLRPSCPNSQSPVKVEETCGCR
                     WTCPCVCTGSSTRHIVTFDGQNFKLTGSCSYVLFQNKEQDLEVILHNGACSPGARQGC
                     MKSIEVKHSALSVELHSDMEVTVNGRLVSVPYVGGNMEVNVYGAIMHEVRFNHLGHIF
                     TFTPQNNEFQLQLSPKTFASKTYGLCGICDENGANDFMLRDGTVTTDWKTLVQEWTVQ
                     RPGQTCQPILEEQCLVPDSSHCQVLLLPLFAECHKVLAPATFYAICQQDSSHQEQVCE
                     VIASYAHLCRTNGVCVDWRTPDFCAMSCPPSLVYNHCEHGCPRHCDGNVSSCGDHPSE
                     GCFCPPDKVMLEGSCVPEEACTQCIGEDGVQHQFLEAWVPDHQPCQICTCLSGRKVNC
                     TTQPCPTAKAPTCGLCEVARLRQNADQCCPEYECVCDPVSCDLPPVPHCERGLQPTLT
                     NPGECRPNFTCACRKEECKRVSPPSCPPHRLPTLRKTQCCDEYECACNCVNSTVSCPL
                     GYLASTATNDCGCTTTTCLPDKVCVHRSTIYPVGQFWEEGCDVCTCTDMEDAVMGLRV
                     AQCSQKPCEDSCRSGFTYVLHEGECCGRCLPSACEVVTGSPRGDSQSSWKSVGSQWAS
                     PENPCLINECVRVKEEVFIQQRNVSCPQLEVPVCPSGFQLSCKTSACCPSCRCERMEA
                     CMLNGTVIGPGKTVMIDVCTTCRCMVQVGVISGFKLECRKTTCNPCPLGYKEENNTGE
                     CCGRCLPTACTIQLRGGQIMTLKRDETLQDGCDTHFCKVNERGEYFWEKRVTGCPPFD
                     EHKCLAEGGKIMKIPGTCCDTCEEPECNDITARLQYVKVGSCKSEVEVDIHYCQGKCA
                     SKAMYSIDINDVQDQCSCCSPTRTEPMQVALHCTNGSVVYHEVLNAMECKCSPRKCSK
                     "
     mat_peptide     74..6223
                     /gene="F8VWF"
                     /product="von Willebrand factor mature peptide"
     variation       149
                     /gene="F8VWF"
                     /note="g in lambda-HvWF2; a in lambda-HvWF[1,4]"
                     /replace="a"
     variation       169
                     /gene="F8VWF"
                     /note="t in lambda-HvWF[1,4]; c in lambda-HvWF2"
                     /replace="c"
     variation       339
                     /gene="F8VWF"
                     /note="g in lambda-HvWF[1,2]; a in lambda-HvWF4"
                     /replace="a"
     variation       353
                     /gene="F8VWF"
                     /note="a in lambda-HvWF[1,2]; g in lambda-HvWF4"
                     /replace="g"
     variation       664
                     /gene="F8VWF"
                     /note="g in lambda-HvWF4; a in lambda-HvWF2"
                     /replace="a"
     variation       1648
                     /gene="F8VWF"
                     /note="g in lambda-HvWF4; t in lambda-HvWF3"
                     /replace="t"
     variation       1795
                     /gene="F8VWF"
                     /note="g in lambda-HvWF4; a in lambda-HvWF3"
                     /replace="a"
     variation       1925
                     /gene="F8VWF"
                     /note="g in lambda-HvWF3; a in lambda-HvWF4"
                     /replace="a"
BASE COUNT         1336 a         1812 c         1877 g         1335 t
ORIGIN      Chromosome 12pter-p12; 19 bp upstream of HindIII site.
        1 ctgggccacg tgccccggaa gcttgctgcc tgacgctgtc ctcagcagtc ccctgtctca
       61 tcgcagcaaa aggagcctat cctgtcggcc ccacatggtc aagctggtgt gtcccgctga
      121 caacctgcgg gctgaagggc tcgagtgtgc caaaacgtgc cagaactatg acctggagtg
      181 catgagcatg ggctctgtct ctggctgcct ctgccccccg ggcatggtcc ggcatgagaa
      241 cagatgtgtg gccctggaaa ggtgtccctg cttccatcag ggcaaggagt atgcccctgg
      301 agaaacagtg aagattggct gcaacacttg tgtctgtcgg gaccggaagt ggaactgcac
      361 agaccatgtg tgtgatgcca cgtgctccac gatcggcatg gcccactacc tcaccttcga
      421 cgggctcaaa tacctgttcc ccggggagtg ccagtacgtt ctggtgcagg attactgcgg
      481 cagtaaccct gggacctttc ggatcctagt ggggaataag ggatgcagcc acccctcagt
      541 gaaatgcaag aaacgggtca ccatcctggt ggagggagga gagattgagc tgtttgacgg
      601 ggaggtgaat gtgaagaggc ccatgaagga tgagactcac tttgaggtgg tggagtctgg
      661 ccggtacatc attctgctgc tgggcaaagc cctctccgtg gtctgggacc gccacctgag
      721 catctccgtg gtcctgaagc agacatacca ggagaaagtg tgtggcctgt gtgggaattt
      781 tgatggcatc cagaacaatg acctcaccag cagcaacctc caagtggagg aagaccctgt
      841 ggactttggg aactcctgga aagtgagctc gcagtgtgct gacaccagaa aagtgcctct
      901 ggactcatcc cctgccacct gccataacaa catcatgaag cagacgatgg tggattcctc
      961 ctgtagaatc cttaccagtg acgtcttcca ggactgcaac aagctggtgg accccgagcc
     1021 atatctggat gtctgcattt acgacacctg ctcctgtgag tccattgggg actgcgcctg
     1081 cttctgcgac accattgctg cctatgccca cgtgtgtgcc cagcatggca aggtggtgac
     1141 ctggaggacg gccacattgt gcccccagag ctgcgaggag aggaatctcc gggagaacgg
     1201 gtatgagtgt gagtggcgct ataatagctg tgcacctgcc tgtcaagtca cgtgtcagca
     1261 ccctgagcca ctggcctgcc ctgtgcagtg tgtggagggc tgccatgccc actgccctcc
     1321 agggaaaatc ctggatgagc ttttgcagac ctgcgttgac cctgaagact gtccagtgtg
     1381 tgaggtggct ggccggcgtt ttgcctcagg aaagaaagtc accttgaatc ccagtgaccc
     1441 tgagcactgc cagatttgcc actgtgatgt tgtcaacctc acctgtgaag cctgccagga
     1501 gccgggaggc ctggtggtgc ctcccacaga tgccccggtg agccccacca ctctgtatgt
     1561 ggaggacatc tcggaaccgc cgttgcacga tttctactgc agcaggctac tggacctggt
     1621 cttcctgctg gatggctcct ccaggctgtc cgaggctgag tttgaagtgc tgaaggcctt
     1681 tgtggtggac atgatggagc ggctgcgcat ctcccagaag tgggtccgcg tggccgtggt
     1741 ggagtaccac gacggctccc acgcctacat cgggctcaag gaccggaagc gaccgtcaga
     1801 gctgcggcgc attgccagcc aggtgaagta tgcgggcagc caggtggcct ccaccagcga
     1861 ggtcttgaaa tacacactgt tccaaatctt cagcaagatc gaccgccctg aagcctcccg
     1921 catcgccctg ctcctgatgg ccagccagga gccccaacgg atgtcccgga actttgtccg
     1981 ctacgtccag ggcctgaaga agaagaaggt cattgtgatc ccggtgggca ttgggcccca
     2041 tgccaacctc aagcagatcc gcctcatcga gaagcaggcc cctgagaaca aggccttcgt
     2101 gctgagcagt gtggatgagc tggagcagca aagggacgag atcgttagct acctctgtga
     2161 ccttgcccct gaagcccctc ctcctactct gccccccgac atggcacaag tcactgtggg
     2221 cccggggctc ttgggggttt cgaccctggg gcccaagagg aactccatgg ttctggatgt
     2281 ggcgttcgtc ctggaaggat cggacaaaat tggtgaagcc gacttcaaca ggagcaagga
     2341 gttcatggag gaggtgattc agcggatgga tgtgggccag gacagcatcc acgtcacggt
     2401 gctgcagtac tcctacatgg tgaccgtgga gtaccccttc agcgaggcac agtccaaagg
     2461 ggacatcctg cagcgggtgc gagagatccg ctaccagggc ggcaacagga ccaacactgg
     2521 gctggccctg cggtacctct ctgaccacag cttcttggtc agccagggtg accgggagca
     2581 ggcgcccaac ctggtctaca tggtcaccgg aaatcctgcc tctgatgaga tcaagaggct
     2641 gcctggagac atccaggtgg tgcccattgg agtgggccct aatgccaacg tgcaggagct
     2701 ggagaggatt ggctggccca atgcccctat cctcatccag gactttgaga cgctcccccg
     2761 agaggctcct gacctggtgc tgcagaggtg ctgctccgga gaggggctgc agatccccac
     2821 cctctcccca gcacctgact gcagccagcc cctggacgtg atccttctcc tggatggctc
     2881 ctccagtttc ccagcttctt attttgatga aatgaagagt ttcgccaagg ctttcatttc
     2941 aaaagccaat atagggcctc gtctcactca ggtgtcagtg ctgcagtatg gaagcatcac
     3001 caccattgac gtgccatgga acgtggtccc ggagaaagcc catttgctga gccttgtgga
     3061 cgtcatgcag cgggagggag gccccagcca aatcggggat gccttgggct ttgctgtgcg
     3121 atacttgact tcagaaatgc atggtgccag gccgggagcc tcaaaggcgg tggtcatcct
     3181 ggtcacggac gtctctgtgg attcagtgga tgcagcagct gatgccgcca ggtccaacag
     3241 agtgacagtg ttccctattg gaattggaga tcgctacgat gcagcccagc tacggatctt
     3301 ggcaggccca gcaggcgact ccaacgtggt gaagctccag cgaatcgaag acctccctac
     3361 catggtcacc ttgggcaatt ccttcctcca caaactgtgc tctggatttg ttaggatttg
     3421 catggatgag gatgggaatg agaagaggcc cggggacgtc tggaccttgc cagaccagtg
     3481 ccacaccgtg acttgccagc cagatggcca gaccttgctg aagagtcatc gggtcaactg
     3541 tgaccggggg ctgaggcctt cgtgccctaa cagccagtcc cctgttaaag tggaagagac
     3601 ctgtggctgc cgctggacct gcccctgcgt gtgcacaggc agctccactc ggcacatcgt
     3661 gacctttgat gggcagaatt tcaagctgac tggcagctgt tcttatgtcc tatttcaaaa
     3721 caaggagcag gacctggagg tgattctcca taatggtgcc tgcagccctg gagcaaggca
     3781 gggctgcatg aaatccatcg aggtgaagca cagtgccctc tccgtcgagc tgcacagtga
     3841 catggaggtg acggtgaatg ggagactggt ctctgttcct tacgtgggtg ggaacatgga
     3901 agtcaacgtt tatggtgcca tcatgcatga ggtcagattc aatcaccttg gtcacatctt
     3961 cacattcact ccacaaaaca atgagttcca actgcagctc agccccaaga cttttgcttc
     4021 aaagacgtat ggtctgtgtg ggatctgtga tgagaacgga gccaatgact tcatgctgag
     4081 ggatggcaca gtcaccacag actggaaaac acttgttcag gaatggactg tgcagcggcc
     4141 aggacagacg tgccagccca tcctggagga gcagtgtctt gtccccgaca gctcccactg
     4201 ccaggtcctc ctcttaccac tgtttgctga atgccacaag gtcctggctc cagccacatt
     4261 ctatgccatc tgccagcagg acagttcgca ccaggagcaa gtgtgtgagg tgatcgcctc
     4321 ttatgcccac ctctgtcgga ccaacggggt ctgcgttgac tggaggacac ctgatttctg
     4381 tgctatgtca tgcccaccat ctctggtcta caaccactgt gagcatggct gtccccggca
     4441 ctgtgatggc aacgtgagct cctgtgggga ccatccctcc gaaggctgtt tctgccctcc
     4501 agataaagtc atgttggaag gcagctgtgt ccctgaagag gcctgcactc agtgcattgg
     4561 tgaggatgga gtccagcacc agttcctgga agcctgggtc ccggaccacc agccctgtca
     4621 gatctgcaca tgcctcagcg ggcggaaggt caactgcaca acgcagccct gccccacggc
     4681 caaagctccc acgtgtggcc tgtgtgaagt agcccgcctc cgccagaatg cagaccagtg
     4741 ctgccccgag tatgagtgtg tgtgtgaccc agtgagctgt gacctgcccc cagtgcctca
     4801 ctgtgaacgt ggcctccagc ccacactgac caaccctggc gagtgcagac ccaacttcac
     4861 ctgcgcctgc aggaaggagg agtgcaaaag agtgtcccca ccctcctgcc ccccgcaccg
     4921 tttgcccacc cttcggaaga cccagtgctg tgatgagtat gagtgtgcct gcaactgtgt
     4981 caactccaca gtgagctgtc cccttgggta cttggcctca accgccacca atgactgtgg
     5041 ctgtaccaca accacctgcc ttcccgacaa ggtgtgtgtc caccgaagca ccatctaccc
     5101 tgtgggccag ttctgggagg agggctgcga tgtgtgcacc tgcaccgaca tggaggatgc
     5161 cgtgatgggc ctccgcgtgg cccagtgctc ccagaagccc tgtgaggaca gctgtcggtc
     5221 gggcttcact tacgttctgc atgaaggcga gtgctgtgga aggtgcctgc catctgcctg
     5281 tgaggtggtg actggctcac cgcgggggga ctcccagtct tcctggaaga gtgtcggctc
     5341 ccagtgggcc tccccggaga acccctgcct catcaatgag tgtgtccgag tgaaggagga
     5401 ggtctttata caacaaagga acgtctcctg cccccagctg gaggtccctg tctgcccctc
     5461 gggctttcag ctgagctgta agacctcagc gtgctgccca agctgtcgct gtgagcgcat
     5521 ggaggcctgc atgctcaatg gcactgtcat tgggcccggg aagactgtga tgatcgatgt
     5581 gtgcacgacc tgccgctgca tggtgcaggt gggggtcatc tctggattca agctggagtg
     5641 caggaagacc acctgcaacc cctgccccct gggttacaag gaagaaaata acacaggtga
     5701 atgttgtggg agatgtttgc ctacggcttg caccattcag ctaagaggag gacagatcat
     5761 gacactgaag cgtgatgaga cgctccagga tggctgtgat actcacttct gcaaggtcaa
     5821 tgagagagga gagtacttct gggagaagag ggtcacaggc tgcccaccct ttgatgaaca
     5881 caagtgtctg gctgagggag gtaaaattat gaaaattcca ggcacctgct gtgacacatg
     5941 tgaggagcct gagtgcaacg acatcactgc caggctgcag tatgtcaagg tgggaagctg
     6001 taagtctgaa gtagaggtgg atatccacta ctgccagggc aaatgtgcca gcaaagccat
     6061 gtactccatt gacatcaacg atgtgcagga ccagtgctcc tgctgctctc cgacacggac
     6121 ggagcccatg caggtggccc tgcactgcac caatggctct gttgtgtacc atgaggttct
     6181 caatgccatg gagtgcaaat gctcccccag gaagtgcagc aagtgaggct gctgcagctg
     6241 catgggtgcc tgctgctgcc tgccttggcc tgatggccag gccagagtgc tgccagtcct
     6301 ctgcatgttc tgctcttgtg cccttctgag cccacaataa aggctgagct cttatcttgc
//