LOCUS       HUMHSPG2B              14327 bp    mRNA    linear   HUM 08-NOV-1994
DEFINITION  Human heparan sulfate proteoglycan (HSPG2) mRNA, complete cds.
ACCESSION   M85289
VERSION     M85289.1
KEYWORDS    HSPG2 gene; heparan sulfate proteoglycan.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 14327)
  AUTHORS   Dodge,G.R., Kovalszky,I., Chu,M.L., Hassell,J.R., McBride,O.W.,
            Yi,H.F. and Iozzo,R.V.
  TITLE     Heparan sulfate proteoglycan of human colon: partial molecular
            cloning, cellular expression, and mapping of the gene (HSPG2) to
            the short arm of human chromosome 1
  JOURNAL   Genomics 10 (3), 673-680 (1991)
   PUBMED   1679749
REFERENCE   2  (bases 1 to 14327)
  AUTHORS   Murdoch,A.D., Dodge,G.R., Cohen,I., Tuan,R.S. and Iozzo,R.V.
  TITLE     Primary structure of the human heparan sulfate proteoglycan from
            basement membrane (HSPG2/perlecan). A chimeric molecule with
            multiple domains homologous to the low density lipoprotein
            receptor, laminin, neural cell adhesion molecules, and epidermal
            growth factor
  JOURNAL   J. Biol. Chem. 267 (12), 8544-8557 (1992)
   PUBMED   1569102
COMMENT     Original source text: Homo sapiens skin; colon cDNA to mRNA.
FEATURES             Location/Qualifiers
     source          1..14327
                     /db_xref="H-InvDB:HIT000196616"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /map="1p36.1-p35"
                     /cell_line="WiDr; CRL 1262"
                     /cell_type="fibroblast; amnion"
                     /tissue_type="skin; colon"
     gene            1..14327
                     /gene="HSPG2"
     CDS             81..13256
                     /gene="HSPG2"
                     /codon_start=1
                     /product="heparan sulfate proteoglycan"
                     /protein_id="AAA52700.1"
                     /db_xref="GDB:G00-126-372"
                     /translation="MGWRAPGALLLALLLHGRLLAVTHGLRAYDGLSLPEDIETVTAS
                     QMRWTHSYLSDDEYMLADSISGDDLGSGDLGSGDFQMVYFRALVNFTRSIEYSPQLED
                     AGSREFREVSEAVVDTLESEYLKIPGDQVVSVVFIKELDGWVFVELDVGSEGNADGAQ
                     IQEMLLRVISSGSVASYVTSPQGFQFRRLGTVPQFPRACTEAEFACHSYNECVALEYR
                     CDRRPDCRDMSDELNCEEPVLGISPTFSLLVETTSLPPRPETTIMRQPPVTHAPQPLL
                     PGSVRPLPCGPQEAACRNGHCIPRDYLCDGQEDCEDGSDELDCGPPPPCEPNEFPCGN
                     GHCALKLWRCDGDFDCEDRTDEANCPTKRPEEVCGPTQFRCVSTNMCIPASFHCDEES
                     DCPDRSDEFGCMPPQVVTPPRESIQASRGQTVTFTCVAIGVPTPIINWRLNWGHIPSH
                     PRVTVTSEGGRGTLIIRDVKESDQGAYTCEAMNARGMVFGIPDGVLELVPQRGPCPDG
                     HFYLEHSAACLPCFCFGITSVCQSTRRFRDQIRLRFDQPDDFKGVNVTMPAQPGTPPL
                     SSTQLQIDPSLHEFQLVDLSRRFLVHDSFWALPEQFLGNKVDSYGGSLRYNVRYELAR
                     GMLEPVQRPDVVLVGAGYRLLSRGHTPTQPGALNQRQVQFSEEHWVHESGRPVQRAEL
                     LQVLQSLEAVLIQTVYNTKMASVGLSDIAMDTTVTHATSHGRAHSVEECRCPIGYSGL
                     SCESCDAHFTRVPGGPYLGTCSGCSCNGHASSCDPVYGHCLNCQHNTEGPQCNKCKAG
                     FFGDAMKATATSCRPCPCPYIDASRRFSDTCFLDTDGQATCDACAPGYTGRRCESCAP
                     GYEGNPIQPGGKCRPVNQEIVRCDERGSMGTSGEACRCKNNVVGRLCNECADGSFHLS
                     TRNPDGCLKCFCMGVSRHCTSSSWSRAQLHGASEEPGHFSLTNAASTHTTNEGIFSPT
                     PGELGFSSFHRLLSGPYFWSLPSRFLGDKVTSYGGELRFTVTQRSQPGSTPLHGQPLV
                     VLQGNNIILEHHVAQEPSPGQPSTFIVPFREQAWQRPDGQPATREHLLMALAGIDTLL
                     IRASYAQQPAESRVSGISMDVAVPEETGQDPALEVEQCSCPPGYRGPSCQDCDTGYTR
                     TPSGLYLGTCERCSCHGHSEACEPETGACQGCQHHTEGPRCEQCQPGYYGDAQRGTPQ
                     DCQLCPCYGDPAAGQAAHTCFLDTDGHPTCDACSPGHSGRHCERCAPGYYGNPSQGQP
                     CQRDSQVPGPIGCNCDPQGSVSSQCDAAGQCQCKAQVEGLTCSHCRPHHFHLSASNPD
                     GCLPCFCMGITQQCASSAYTRHLISTHFAPGDFQGFALVNPQRNSRLTGEFTVEPVPE
                     GAQLSFGNFAQLGHESFYWQLPETYQGDKVAAYGGKLRYTLSYTAGPQGSPLSDPDVQ
                     ITGNNIMLVASQPALQGPERRSYEIMFREEFWRRPDGQPATREHLLMALADLDELLIR
                     ATFSSVPLVASISAVSLEVAQPGPSNRPRALEVEECRCPPGYIGLSCQDCAPGYTRTG
                     SGLYLGHCELCECNGHSDLCHPETGACSQCQHNAAGEFCELCAPGYYGDATAGTPEDC
                     QPCACPLTNPENMFSRTCESLGAGGYRCTACEPGYTGQYCEQCGPGYVGNPSVQGGQC
                     LPETNQAPLVVEVHPARSIVPQGGSHSLRCQVSGSPPHYFYWSREDGRPVPSGTQQRH
                     QGSELHFPSVQPSDAGVYICTCRNLHQSNTSRAELLVTEAPSKPITVTVEEQRSQSVR
                     PGADVTFICTAKSKSPAYTLVWTRLHNGKLPTRAMDFNGILTIRNVQLSDAGTYVCTG
                     SNMFAMDQGTATLHVQASGTLSAPVVSIHPPQLTVQPGQLAEFRCSATGSPTPTLEWT
                     GGPGGQLPAKAQIHGGILRLPAVEPTDQAQYLCRAHSSAGQQVARAVLHVHGGGGPRV
                     QVSPERTQVHAGRTVRLYCRAAGVPSATITWRKEGGSLPPQARSERTDIATLLIPAIT
                     TADAGFYLCVATSPAGTAQARMQVVVLSASDASPPGVKIESSSPSVTEGQTLDLNCVV
                     AGSAHAQVTWYRRGGSLPPHTQVHGSRLRLPQVSPADSGEYVCRVENGSGPKEASITV
                     SVLHGTHSGPSYTPVPGSTRPIRIEPSSSHVAEGQTLDLNCVVPGQAHAQVTWHKRGG
                     SLPARHQTHGSLLRLHQVTPADSGEYVCHVVGTSGPLEASVLVTIEASVIPGPIPPVR
                     IESSSSTVAEGQTLDLSCVVAGQAHAQVTWYKRGGSLPARHQVRGSRLYIFQASPADA
                     GQYVCRASNGMEASITVTVTGTQGANLAYPAGSTQPIRIEPSSSQVAEGQTLDLNCVV
                     PGQSHAQVTWHKRGGSLPVRHQTHGSLLRLYQASPADSGEYVCRVLGSSVPLEASVLV
                     TIEPAGSVPALGVTPTVRIESSSSQVAEGQTLDLNCLVAGQAHAQVTWHKRGGSLPAR
                     HQVHGSRLRLLQVTPADSGEYVCRVVGSSGTQEASVLVTIQQRLSGSHSQGVAYPVRI
                     ESSSASLANGHTLDLNCLVASQAPHTITWYKRGGSLPSRHQIVGSRLRIPQVTPADSG
                     EYVCHVSNGAGSRETSLIVTIQGSGSSHVPSVSPPIRIESSSPTVVEGQTLDLNCVVA
                     RQPQAIITWYKRGGSLPSRHQTHGSHLRLHQMSVADSGEYVCRANNNIDALEASIVIS
                     VSPSAGSPSAPGSSMPIRIESSSSHVAEGETLDLNCVVPGQAHAQVTWHKRGGSLPSH
                     HQTRGSRLRLHHVSPADSGEYVCRVMGSSGPLEASVLVTIEASGSSAVHVPAPGGAPP
                     IRIEPSSSRVAEGQTLDLKCVVPGQAHAQVTWHKRGGNLPARHQVHGPLLRLNQVSPA
                     DSGEYSCQVTGSSGTLEASVLVTIEPSSPGPIPAPGLAQPIYIEASSSHVTEGQTLDL
                     NCVVPGQAHAQVTWYKRGGSLPARHQTHGSQLRLHLVSPADSGEYVCRAASGPGPEQE
                     ASFTVTVPPSEGSSYRLRSPVISIDPPSSTVQQGQDASFKCLIHDGAAPISLEWKTRN
                     QELEDNVHISPNGSIITIVGTRPSNHGTYRCVASNAYGVAQSVVNLSVHGPPTVSVLP
                     EGPVWVKVGKAVTLECVSAGEPRSSARWTRISSTPAKLEQRTYGLMDSHAVLQISSAK
                     PSDAGTYVCLAQNALGTAQKQVEVIVDTGAMAPGAPQVQAEEAELTVEAGHTATLRCS
                     ATGSPAPTIHWSKLRSPLPWQHRLEGDTLIIPRVAQQDSGQYICNATSPAGHAEATII
                     LHVESPPYATTVPEHASVQAGETVQLQCLAHGTPPLTFQWSRVGSSLPGRATARNELL
                     HFERAAPEDSGRYRCRVTNKVGSAEAFAQLLVQGPPGSLPATSIPAGSTPTVQVTPQL
                     ETKSIGASVEFHCAVPSDQGTQLRWFKEGGQLPPGHSVQDGVLRIQNLDQSCQGTYIC
                     QAHGPWGKAQASAQLVIQALPSVLINIRTSVQTVVVGHAVEFECLALGDPKPQVTWSK
                     VGGHLRPGIVQSGGVVRIAHVELADAGQYRCTATNAAGTTQSHVLLLVQALPQISMPQ
                     EVRVPAGSAAVFPCIASGYPTPDISWSKLDGSLPPDSRLENNMLMLPSVRPQDAGTYV
                     CTATNRQGKVKAFAHLQVPERVVPYFTQTPYSFLPLPTIKDAYRKFEIKITFRPDSAD
                     GMLLYNGQKRVPGSPTNLANRQPDFISFGLVGGRPEFRFDAGSGMATIRHPTPLALGH
                     FHTVTLLRSLTQGSLIVGDLAPVNGTSQGKFQGLDLNEELYLGGYPDYGAIPKAGLSS
                     GFIGCVRELRIQGEEIVFHDLNLTAHGISHCPTCRDRPCQNGGQCHDSESSSYVCVCP
                     AGFTGSRCEHSQALHCHPEACGPDATCVNRPDGRGYTCRCHLGRSGLRCEEGVTVTTP
                     SLSGAGSYLALPALTNTHHELRLDVEFKPLAPDGVLLFSGGKSGPVEDFVSLAMVGGH
                     LEFRYELGSGLAVLRSAEPLALGRWHRVSAERLNKDGSLRVNGGRPVLRSSPGKSQGL
                     NLHTLLYLGGVEPSVPLSPATNMSAHFRGCVGEVSVNGKRLDLTYSFLGSQGIGQCYD
                     SSPCERQPCQHGATCMPAGEYEFQCLCRDGFKGDLCEHEENPCQLREPCLHGGTCQGT
                     RCLCLPGFSGPRCQQGSGHGIAESDWHLEGSGGNDAPGQYGAYFHDDGFLAFPGHVFS
                     RSLPEVPETIELEVRTSTASGLLLWQGVEVGEAGQGKDFISLGLQDGHLVFRYQLGSG
                     EARLVSEDPINDGEWHRVTALREGRRGSIQVDGEELVSGRSPGPNVAVNAKGSVYIGG
                     APDVATLTGGRFSSGITGCVKNLVLHSARPGAPPPQPLDLQHRAQAGANTRPCPS"
     sig_peptide     81..143
                     /gene="HSPG2"
                     /note="G00-126-372"
     mat_peptide     144..13253
                     /gene="HSPG2"
                     /product="heparan sulfate proteoglycan"
                     /note="G00-126-372"
     misc_feature    144..659
                     /gene="HSPG2"
                     /note="unique region with 3 glycosamnioglycan attachment
                     sites (domain I); G00-126-372"
     misc_feature    660..1289
                     /gene="HSPG2"
                     /note="shares homology with low density lipoprotein (LDL)
                     receptor (domain II); G00-126-372"
     misc_feature    1593..5108
                     /gene="HSPG2"
                     /note="shares homology with the short arm of laminin A
                     chain (domain III); G00-126-372"
     repeat_region   5109..11138
                     /gene="HSPG2"
                     /note="shares homology with immunoglobulin repeats of
                     N-CAM (domain IV); G00-126-372"
     misc_feature    11139..13253
                     /gene="HSPG2"
                     /note="shares homology with G-domain of laminin A chain
                     and EGF (domain V); G00-126-372"
     polyA_site      14327
                     /gene="HSPG2"
                     /note="G00-126-372"
BASE COUNT         2615 a         4940 c         4299 g         2473 t
ORIGIN      
        1 ggccggcgag cgggcggctg cgggcggcgc ggagcgggcg gcgcggagcg agcgagcgag
       61 agagcggcgc gggccgggcc atggggtggc gggcgccggg cgcgctgctg ctggcgctgc
      121 tgctgcacgg gcggctgctg gcggtgaccc atgggctgag ggcatacgat ggcttgtctc
      181 tgcctgagga catagagacc gtcacagcaa gccaaatgcg ctggacacat tcgtaccttt
      241 ctgatgatga gtacatgctg gctgacagca tctcaggaga cgacctgggc agtggggacc
      301 tgggcagcgg ggacttccag atggtttatt tccgagccct ggtgaatttc actcgctcca
      361 tcgagtacag ccctcagctg gaggatgcag gctccagaga gtttcgagag gtgtccgagg
      421 ctgtggtaga cacgctggag tcggagtact tgaaaattcc cggagaccag gttgtcagtg
      481 tggtgttcat caaggagctg gatggctggg tttttgtgga gctcgatgtg ggctcggaag
      541 ggaatgcgga tggtgctcag attcaggaga tgctgctcag ggtcatctcc agcggctctg
      601 tggcctccta cgtcacctct ccccagggat tccagttccg acgcctgggc acagtgcccc
      661 agttcccaag agcctgcacg gaggccgagt ttgcctgcca cagctacaat gagtgtgtgg
      721 ccctggagta tcgctgtgac cggcggcccg actgcaggga catgtctgat gagctcaatt
      781 gtgaggagcc agtcctgggt atcagcccca cattctctct ccttgtggag acgacatctt
      841 taccgccccg gccagagaca accatcatgc gacagccacc agtcacccac gctcctcagc
      901 ccctgcttcc cggttccgtc aggcccctgc cctgtgggcc ccaggaggcc gcatgccgca
      961 atgggcactg catccccaga gactacctct gcgacggaca ggaggactgc gaggacggca
     1021 gcgatgagct agactgtggc cccccgccac cctgtgagcc caacgagttc ccctgcggga
     1081 atggacattg tgccctcaag ctgtggcgct gcgatggtga ctttgactgt gaggaccgaa
     1141 ctgatgaagc caactgcccc accaagcgtc ctgaggaagt gtgcgggccc acacagttcc
     1201 gatgcgtctc taccaacatg tgcatcccag ccagcttcca ctgtgacgag gagagcgact
     1261 gtcctgaccg gagcgacgag tttggctgca tgccccccca ggtggtgaca cctccccggg
     1321 agtccatcca ggcttcccgg ggccagacag tgaccttcac ctgcgtggcc attggcgtcc
     1381 ccacccccat catcaattgg aggctcaact ggggccacat cccctctcat cccagggtga
     1441 cagtgaccag cgagggtggc cgtggcacac tgatcatccg tgatgtgaag gagtcagacc
     1501 agggtgccta cacctgtgag gccatgaacg cccggggcat ggtgtttggc attcctgacg
     1561 gtgtccttga gctcgtccca caacgaggcc cctgccctga cggccacttc tacctggagc
     1621 acagcgccgc ctgcctgccc tgcttctgct ttggcatcac cagcgtgtgc cagagcaccc
     1681 gccgcttccg ggaccagatc aggctgcgct ttgaccaacc cgatgacttc aagggtgtga
     1741 atgtgacaat gcctgcgcag cccggcacgc cacccctctc ctccacgcag ctgcagatcg
     1801 acccatccct gcacgagttc cagctagtag acctgtcccg ccgcttcctc gtccacgact
     1861 ccttctgggc tctgcctgaa cagttcctgg gcaacaaggt ggactcctat ggcggctccc
     1921 tgcgttacaa cgtgcgctac gagttggccc gtggcatgct ggagccagtg cagcggccgg
     1981 acgtggtcct cgtgggtgcc gggtaccgcc tcctctcccg aggccacaca cccacccaac
     2041 ctggtgctct gaaccagcgc caggtccagt tctctgagga gcactgggtc catgagtctg
     2101 gccggccggt gcagcgcgcg gagctgctgc aggtgctgca gagcctggag gccgtgctca
     2161 tccagaccgt gtacaacacc aagatggcta gcgtgggact tagcgacatc gccatggata
     2221 ccaccgtcac ccatgccacc agccatggcc gtgcccacag tgtggaggag tgcagatgcc
     2281 ccattggcta ttctggcttg tcctgcgaga gctgtgatgc ccacttcact cgggtgcctg
     2341 gtgggcccta cctgggcacc tgctctggtt gcagttgcaa tggccatgcc agctcctgtg
     2401 accctgtgta tggccactgc ctgaattgcc agcacaacac ggaggggcca cagtgcaaca
     2461 agtgcaaggc tggcttcttt ggggacgcca tgaaggccac ggccacttcc tgccggccct
     2521 gcccttgccc atacatcgat gcctcccgca gattctcaga cacttgcttc ctggacacgg
     2581 atggccaagc cacatgtgac gcctgtgccc caggctacac tggccgccgc tgtgagagct
     2641 gtgcccccgg atacgagggc aaccccatcc agcccggcgg gaagtgcagg cccgtcaacc
     2701 aggagattgt gcgctgtgac gagcgtggca gcatggggac ctccggggag gcctgccgct
     2761 gtaagaacaa tgtggtgggg cgcttgtgca atgaatgtgc tgacggctct ttccacctga
     2821 gtacccgaaa ccccgatggc tgcctcaagt gcttctgcat gggtgtcagt cgccactgca
     2881 ccagctcttc atggagccgt gcccagttgc atggggcctc tgaggagcct ggtcacttca
     2941 gcctgaccaa cgccgcaagc acccacacca ccaacgaggg catcttctcc cccacgcccg
     3001 gggaactggg attctcctcc ttccacagac tcttatctgg accctacttc tggagcctcc
     3061 cttcacgctt cctgggggac aaggtgacct cctatggagg agagctgcgc ttcacagtga
     3121 cccagaggtc ccagccgggc tccacacccc tgcacgggca gccgttggtg gtgctgcaag
     3181 gtaacaacat catcctagag caccatgtgg cccaggagcc cagccccggc cagcccagca
     3241 ccttcattgt gcctttccgg gagcaagcat ggcagcggcc cgatgggcag ccagccacac
     3301 gggagcacct gctgatggca ctggcaggca tcgacaccct cctgatccga gcatcctacg
     3361 cccagcagcc cgctgagagc agggtctctg gcatcagcat ggacgtggct gtgcccgagg
     3421 aaaccggcca ggaccccgcg ctggaagtgg aacagtgctc ctgcccaccc gggtaccgtg
     3481 ggccgtcctg ccaggactgt gacacaggct acacacgcac gcccagtggc ctctacctgg
     3541 gtacctgtga acgctgcagc tgccatggcc actcagaggc ctgcgagcca gaaacaggtg
     3601 cctgccaggg ctgccagcat cacacggagg gccctcggtg tgagcagtgc cagccaggat
     3661 actacgggga cgcccagcgg gggacaccac aggactgcca gctgtgcccc tgctacggag
     3721 accctgctgc cggccaggct gcccacactt gttttctgga cacagacggc caccccacct
     3781 gtgatgcgtg ctccccaggc cacagtgggc gtcactgtga gaggtgcgcc cctggctact
     3841 atggcaaccc cagccagggc cagccatgcc agagagacag ccaggtgcca gggcccatag
     3901 gctgcaactg tgacccccaa ggcagcgtca gcagccagtg tgatgctgct ggtcagtgcc
     3961 agtgcaaggc ccaggtagaa ggcctcactt gcagccactg ccggccccac cacttccacc
     4021 tgagtgccag caacccagac ggctgcctgc cctgcttctg tatgggcatc acccagcagt
     4081 gcgccagctc tgcctacaca cgccacctga tctccaccca ctttgcccct ggggacttcc
     4141 aaggctttgc cctggtgaac ccacagcgaa acagccgcct gacaggagaa ttcactgtgg
     4201 aacccgtgcc cgagggtgcc cagctctctt ttggcaactt tgcccaactc ggccatgagt
     4261 ccttctactg gcagctgccg gagacatacc agggagacaa ggtggcggcc tacggtggga
     4321 agttgcgata caccctctcc tacacagcag gcccacaggg cagcccactc tcggaccccg
     4381 atgtgcagat cacgggcaac aacatcatgc tagtggcctc ccagccagcg ctgcagggcc
     4441 cagagaggag gagctacgag atcatgttcc gagaggaatt ctggcgccgg cccgatgggc
     4501 agccggccac acgcgagcac ctcctgatgg cactggccga cctggatgag ctcctgatcc
     4561 gggccacgtt ctcctccgtg ccgctggtgg ccagcatcag cgcagtcagc ctggaggtcg
     4621 cccagccggg gccctcaaac agaccccgcg ccctcgaggt ggaggagtgc cgctgcccgc
     4681 caggctacat cggtctgtcc tgccaggact gtgcccccgg ctacacgcgc accgggagtg
     4741 ggctctacct cggccactgc gagctatgtg aatgcaatgg ccactcagac ctgtgccacc
     4801 cagagactgg ggcctgctcg caatgccagc acaacgccgc aggggagttc tgcgagcttt
     4861 gtgcccctgg ctactacgga gatgccacag ccgggacgcc tgaggactgc cagccctgtg
     4921 cctgcccact gaccaaccca gagaacatgt tttcccgcac ctgtgagagc ctgggagccg
     4981 gcgggtaccg ctgcacggcc tgcgaacccg gctacactgg ccagtactgt gagcagtgtg
     5041 gcccaggtta cgtgggtaac cccagtgtgc aagggggcca gtgcctgcca gagacaaacc
     5101 aagccccact ggtggtcgag gtccatcctg ctcgaagcat agtgccccaa ggtggctccc
     5161 actccctgcg gtgtcaggtc agtgggagcc caccccacta cttctattgg tcccgtgagg
     5221 atgggcggcc tgtgcccagc ggcacccagc agcgacatca aggctccgag ctccacttcc
     5281 ccagcgtcca gccctcggat gctggggtct acatttgcac ctgccgtaat ctccaccaat
     5341 ccaataccag ccgggcagag ctgctggtca ctgaggctcc aagcaagccc atcacagtga
     5401 ctgtggagga gcagcggagc cagagcgtgc gccccggagc tgacgtcacc ttcatctgca
     5461 cagccaaaag caagtcccca gcctataccc tggtgtggac ccgcctgcac aacgggaaac
     5521 tgcccacccg agccatggat ttcaatggca tcctgaccat tcgcaacgtc cagctgagtg
     5581 atgcaggcac ctacgtgtgc accggctcca acatgtttgc catggaccag ggcacagcca
     5641 ctctacatgt gcaggcctcg ggcaccttgt ccgcccccgt ggtctccatc catccgccac
     5701 agctcacagt gcagcccggg caactggcgg agttccgctg cagcgccaca gggagcccca
     5761 cgcccaccct cgagtggaca gggggccccg gcggccagct ccctgcgaag gcacaaatcc
     5821 acggcggcat cctgcgcctg ccagctgtcg agcccacgga tcaggcccag tacttgtgcc
     5881 gagcccacag cagcgctggg cagcaggtgg ccagggctgt gctccacgtg catgggggcg
     5941 gtgggcccag agtccaagtg agcccagaga ggacccaggt ccacgcaggc cggaccgtca
     6001 ggctgtactg cagggctgca ggcgtgccta gcgccaccat cacctggagg aaggaagggg
     6061 gcagcctccc accacaggcc cggtcagagc gcacagacat cgcgacactg ctcatcccag
     6121 ccatcacgac tgctgacgcc ggcttctacc tctgcgtggc caccagccct gcaggcactg
     6181 cccaggcccg gatgcaagtg gttgtccttt cagcctcaga tgccagccca ccgggggtca
     6241 agattgagtc ctcatcgcct tctgtgacag aagggcaaac actcgacctc aactgtgtgg
     6301 tggcagggtc agcccatgcc caggtcacct ggtacaggcg agggggtagc ctgcctcccc
     6361 acacccaggt gcacggctcc cgtctgcggc tcccccaggt ctcaccagct gattctggag
     6421 aatatgtgtg ccgtgtggag aatggatcgg gccccaagga ggcctccatt actgtgtctg
     6481 tgctccacgg cacccattct ggccccagct acaccccagt gcccggcagc acccggccca
     6541 tccgcatcga gccctcctcc tcacacgtgg cggaagggca gaccctggat ctgaactgcg
     6601 tggtgcccgg gcaggcccac gcccaggtca cgtggcacaa gcgtgggggc agcctccctg
     6661 cccggcacca gacccacggc tcgctgctgc ggctgcacca ggtgaccccg gccgactcag
     6721 gcgagtatgt gtgccatgtg gtgggcacct ccggccccct agaggcctca gtcctggtca
     6781 ccatcgaagc ctctgtcatc cctggaccca tcccacctgt caggatcgag tcttcatcct
     6841 ccacagtggc cgagggccag accctggatc tgagctgcgt ggtggcaggg caggcccacg
     6901 cccaggtcac atggtacaag cgtgggggca gcctccctgc ccggcaccag gttcgtggct
     6961 cccgcctgta catcttccag gcctcacctg ccgatgcggg acagtacgtc tgccgggcca
     7021 gcaacggcat ggaggcctcc atcacggtca cagtaactgg gacccagggg gccaacttag
     7081 cctaccctgc cggcagcacc cagcccatcc gcatcgagcc ctcctcctcg caagtggcgg
     7141 aagggcagac cctggatctg aactgcgtgg tgcccgggca gtcccatgcc caggtcacgt
     7201 ggcacaagcg tgggggcagc ctccctgtcc ggcaccagac ccacggctcc ctgctgagac
     7261 tctaccaagc gtcccccgcc gactcgggcg agtacgtgtg ccgagtgttg ggcagctccg
     7321 tgcctctaga ggcctctgtc ctggtcacca ttgagcctgc gggctcagtg cctgcacttg
     7381 gggtcacccc cacggtccgg atcgagtcat cgtcttcgca agtggccgag gggcagaccc
     7441 tggacctgaa ctgcctcgtt gctggtcagg cccatgccca ggtcacgtgg cacaagcgcg
     7501 ggggcagcct cccggcccgg caccaggtgc atggctcgag gctacgcctg ctccaggtga
     7561 ccccagctga ttcaggggag tacgtgtgcc gtgtggtcgg cagctcaggt acccaggaag
     7621 cctcagtcct tgtcaccatc cagcagcgcc ttagtggctc ccactcccag ggtgtggcgt
     7681 accccgtccg catcgagtcc tcctcagcct ccctggccaa tggacacacc ctggacctca
     7741 actgcctggt tgccagccag gctccccaca ccatcacctg gtataagcgt ggaggcagct
     7801 tacccagccg gcaccagatc gtgggctccc ggctgcggat ccctcaggtg actccggcag
     7861 actcgggcga gtacgtgtgt cacgtcagta acggtgcagg ctcccgggag acctcgctca
     7921 tcgtcaccat ccagggcagc ggttcctccc acgtgcccag cgtctcccca ccgatcagga
     7981 tcgagtcgtc ttcccccacg gtggtggaag ggcagacctt ggatctgaac tgcgtggtcg
     8041 ccaggcagcc ccaggctatc atcacatggt acaagcgtgg gggcagcctt ccctcccgac
     8101 accagaccca tggctcccac ctgcggttgc accaaatgtc tgtggctgac tcgggcgagt
     8161 atgtgtgccg ggccaacaac aacatcgatg ccctggaggc ctccatcgtc atctccgtct
     8221 cccctagcgc cggcagcccc tccgcccctg gcagctccat gcccatcaga attgagtcat
     8281 cctcctcaca cgtggccgaa ggggagaccc tggatctgaa ctgcgtggtc cccgggcagg
     8341 cccatgccca ggtcacttgg cacaagcgtg ggggcagcct ccccagtcac catcagaccc
     8401 gcggctcacg gctgcggctg caccatgtgt ccccggccga ctcgggtgaa tacgtgtgcc
     8461 gggtgatggg cagctctggc cccctggagg cctcagtcct ggtcaccatc gaagcctctg
     8521 gctcaagtgc tgtccacgtc cccgccccag gtggagcccc acccatccgc atcgagccct
     8581 cctcctcccg agtggcagaa gggcagaccc tggatctgaa gtgcgtggtg cccgggcagg
     8641 cccacgccca ggtcacatgg cacaagcgtg gaggaaacct ccctgcccgg caccaggtcc
     8701 acggcccact gctgaggctg aaccaggtgt ccccggctga ctctggcgag tactcgtgcc
     8761 aagtgaccgg aagctcaggc accctggagg catctgtcct ggtcacaatt gagccctcca
     8821 gcccaggacc cattcctgct ccaggactgg cccagcccat ctacatcgag gcctcctctt
     8881 cacacgtgac tgaagggcag actctggatc tgaactgtgt ggtgcccggg caggcccatg
     8941 cccaggtcac gtggtacaag cgcgggggca gcctccccgc ccggcaccag acccatggct
     9001 cccagctgcg gctccacctc gtctcccctg ccgactcagg cgagtatgtg tgtcgtgcag
     9061 ccagcggccc aggccctgag caagaagcct ccttcacagt caccgtcccg cccagtgagg
     9121 ggtcttccta ccgccttagg agcccggtca tctccatcga cccgcccagc agcaccgtgc
     9181 agcagggcca ggatgccagc ttcaagtgcc tcatccatga cggggcagcc cccatcagcc
     9241 tcgagtggaa gacccggaac caggagctgg aggacaacgt ccacatcagt cccaatggct
     9301 ccatcatcac catcgtgggc acccggccca gcaaccacgg tacctaccgc tgcgtggcct
     9361 ccaatgccta cggtgtggcc cagagtgtgg tgaacctcag tgtgcacggg ccccctacag
     9421 tgtccgtgct ccccgagggc cccgtgtggg tgaaagtggg aaaggctgtc accctggagt
     9481 gtgtcagtgc cggggagccc cgctcctctg ctcgttggac ccggatcagc agcacccctg
     9541 ccaagttgga gcagcggaca tatgggctca tggacagcca cgcggtgctg cagatttcat
     9601 cagctaaacc atcagatgcg ggcacttatg tgtgccttgc tcagaatgca ctaggcacag
     9661 cacagaagca ggtggaggtg atcgtggaca cgggcgccat ggccccaggg gcccctcagg
     9721 tccaagctga agaagctgag ctgactgtgg aggctggaca cacggccacc ttgcgctgct
     9781 cagccacagg cagccccgcg cccaccatcc actggtccaa gctgcgttcc ccactgccct
     9841 ggcagcaccg gctggaaggt gacacactca tcataccccg ggtagcccag caggactcgg
     9901 gccagtacat ctgcaatgcc actagccctg ctgggcacgc tgaggccacc atcatcctgc
     9961 acgtggagag cccaccatat gccaccacgg tcccagagca cgcttcggtg caggcagggg
    10021 agacggtgca gctccagtgc ctggctcacg ggacaccccc actcaccttc cagtggagcc
    10081 gcgtgggcag cagccttcct gggagggcga ccgccaggaa cgagctgctg cactttgagc
    10141 gtgcagcccc tgaggactca ggccgctacc gctgccgggt caccaacaag gtgggctcag
    10201 ccgaggcctt tgcccagctg ctcgtccaag gccctcccgg ctctctccct gccacctcca
    10261 tcccagcagg gtccacgccc accgtgcagg tcacgcctca gctagagacc aagagcattg
    10321 gggccagcgt tgagttccac tgtgctgtgc ccagcgacca gggtacccag ctccgttggt
    10381 tcaaggaagg gggtcagctg cctccgggtc acagcgtgca ggatggggtg ctccgaatcc
    10441 agaacttgga ccagagctgc caagggacgt atatatgcca ggcccatgga ccttggggga
    10501 aggcccaggc cagtgcccag ctggttatcc aagccctgcc ctcggtgctc atcaacatcc
    10561 ggacctctgt gcagaccgtg gtggttggcc acgccgtgga gttcgaatgc ctggcactgg
    10621 gtgaccccaa gcctcaggtg acatggagca aagttggagg gcacctgcgg ccaggcattg
    10681 tgcagagcgg aggtgtcgtc aggatcgccc acgtagagct ggctgatgcg ggacagtatc
    10741 gctgcactgc caccaacgca gctggcacca cacaatccca cgtcctgctg cttgtgcaag
    10801 ccttgcccca gatctcaatg ccccaagaag tccgtgtgcc tgctggttct gcagctgtct
    10861 tcccctgcat agcctcaggc taccccactc ctgacatcag ctggagcaag ctggatggca
    10921 gcctgccacc tgacagccgc ctggagaaca acatgctgat gctgccctca gtccgacccc
    10981 aggacgcagg tacctacgtc tgcaccgcca ctaaccgcca gggcaaggtc aaagcctttg
    11041 cccacctgca ggtgccagag cgggtggtgc cctacttcac gcagaccccc tactccttcc
    11101 taccgctgcc caccatcaag gatgcctaca ggaagttcga gatcaagatc accttccggc
    11161 ccgactcagc cgatgggatg ctgctgtaca atgggcagaa gcgagtccca gggagcccca
    11221 ccaacctggc caaccggcag cccgacttca tctccttcgg cctcgtgggg ggaaggcccg
    11281 agttccggtt cgatgcaggc tcaggcatgg ccaccatccg ccatcccaca ccactggccc
    11341 tgggccattt ccacaccgtg accctgctgc gcagcctcac ccagggctcc ctgattgtgg
    11401 gtgacctggc cccggtcaat gggacctccc agggcaagtt ccagggcctg gatctgaacg
    11461 aggaactcta cctgggtggc tatcctgact atggtgccat ccccaaggcg gggctgagca
    11521 gcggcttcat aggctgtgtc cgggagctgc gcatccaggg cgaggagatc gtcttccatg
    11581 acctcaacct cacggcgcac ggcatctccc actgccccac ctgtcgggac cggccctgcc
    11641 agaatggcgg tcagtgccat gactctgaga gcagcagcta cgtgtgcgtc tgcccagctg
    11701 gcttcaccgg gagccgctgt gagcactcgc aggccctgca ctgccatcca gaggcctgtg
    11761 ggcccgacgc cacctgtgtg aaccggcctg acggtcgagg ctacacctgc cgctgccacc
    11821 tgggccgctc ggggttgcgg tgtgaggaag gtgtgacagt gaccaccccc tcgctgtcgg
    11881 gtgctggctc ctacctggca ctgcccgccc tcaccaacac acaccacgag ctacgcctgg
    11941 acgtggagtt caagccactc gcccctgacg gggtcctgct gttcagcggg gggaagagcg
    12001 ggcctgtgga ggacttcgtg tccctggcga tggtgggcgg ccacctggag ttccgctatg
    12061 agttggggtc agggctggcc gttctgcgga gcgccgagcc gctggccctg ggccgctggc
    12121 accgtgtgtc tgcagagcgt ctcaacaagg acggcagcct gcgggtgaat ggtggacgcc
    12181 ctgtgctgcg ctcctcgccc ggcaagagcc agggcctcaa cctgcacacc ctgctctacc
    12241 tggggggtgt ggagccttcc gtgccactgt ccccggccac caacatgagc gctcacttcc
    12301 gcggctgtgt gggcgaggtg tcagtgaatg gcaaacggct ggacctcacc tacagtttcc
    12361 taggcagcca gggcatcggg caatgctatg atagctcccc atgtgagcgc cagccttgcc
    12421 aacatggtgc cacgtgcatg cccgctggcg agtatgagtt ccagtgcctg tgtcgagatg
    12481 gattcaaagg agacctgtgt gagcacgagg agaacccctg ccagctccgt gaaccctgtc
    12541 tgcatggggg cacctgccag ggcacccgct gcctctgcct ccctggcttc tctggcccac
    12601 gctgccaaca aggctctgga catggcatag cagagtccga ctggcatctt gaaggcagcg
    12661 ggggcaatga tgcccctggg cagtacggag cctatttcca cgatgatggc ttcctcgcct
    12721 tccctggcca tgtcttctcc aggagcctgc ccgaggtgcc cgagaccatc gagctggagg
    12781 ttcggaccag cacagccagt ggcctcctgc tctggcaggg tgtggaggtg ggagaggccg
    12841 gccaaggcaa ggacttcatc agcctcgggc ttcaagacgg gcaccttgtc ttcaggtacc
    12901 agctgggtag tggggaggcc cgcctggtct ctgaggaccc catcaatgac ggcgagtggc
    12961 accgggtgac agcactgcgg gagggccgca gaggttccat ccaagtcgac ggtgaggagc
    13021 tggtcagcgg ccggtcccca ggtcccaacg tggcagtcaa cgccaagggc agcgtctaca
    13081 tcggcggagc ccctgacgtg gccacgctga ccgggggcag attctcctcg ggcatcacag
    13141 gctgtgtcaa gaacctggtg ctgcactcgg cccgacccgg cgccccgccc ccacagcccc
    13201 tggacctgca gcaccgcgcc caggccgggg ccaacacacg cccctgcccc tcgtaggcac
    13261 ctgcctgccc cacacggact cccgggccac gccccagccc gacaatgtcg agtatattat
    13321 tattaatatt attatgaatt tttgtaagaa accgaggcga tgccacgctt tgctgctacc
    13381 gccctgggct ggactggagg tgggcatgcc accctcacac acacagctgg gcaaagccac
    13441 aaggctggcc agcaaggcag gttggatggg agtgggcacc tcagaaagtc accaggactt
    13501 ggggtcagga acagtggctg ggtgggccca gaactgcccc cactgtcccc ctacccaccg
    13561 atggagcccc cagatagagc tgggtggcct gtttctgcag cccttgggca gttctcactc
    13621 ctaggagagc caacctcggc ttgtgggctg gtgccccaca gctacctgag acgggcatcg
    13681 caggagtctc tgccacccac tcaggattgg gaattgtctt tagtgccggc tgtggagcaa
    13741 aaggcagctc acccctgggc aggcggtccc catccccacc agctcgtttt tcagcacccc
    13801 cacccacctc cacccagccc ctggcacctc ctctggcaga ctccccctcc taccacgtcc
    13861 tcctggcctg cattcccacc ccctcctgcc agcacacagc ctggggtccc tccctcaggg
    13921 gctgtaaggg aaggcccacc ccaactctta ccaggagctg ctacaggcag agcccagcac
    13981 tgatagggcc ccgcccaccg ggccccgccc accccaggcc acatccccac ccatctggaa
    14041 gtgaaggccc agggactcct ccaacagaca acggacggac ggatgccgct ggtgctcagg
    14101 aagagctagt gccttaggtg ggggaaggca ggactcacga ctgagagaga gaggaggggg
    14161 atatgaccac cctgccccat ctgcaggagc ctgaagatcc agctcaagtg ccatcctgcc
    14221 agtggccccc agactgtggg gttgggacgc ctggcctctg tgtcctagaa gggaccctcc
    14281 tgtggtcttt gtcttgattt ttcttaataa acggtgctat ccccgcc
//