LOCUS HUM17BHYD 21764 bp DNA linear HUM 16-OCT-2008
DEFINITION Human 17-beta-hydroxysteroid dehydrogenase (EDH17B1 and EDH17B2)
genes, complete coding regions and flanks.
ACCESSION M84472
VERSION M84472.1
KEYWORDS 17-beta-hydroxysteroid dehydrogenase; Alu repeat; estradiol
17-beta-dehydrogenase.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 21764)
AUTHORS Peltoketo,H., Isomaa,V., Maentausta,O. and Vihko,R.
TITLE Complete amino acid sequence of human placental 17
beta-hydroxysteroid dehydrogenase deduced from cDNA
JOURNAL FEBS Lett. 239 (1), 73-77 (1988)
PUBMED 2846351
REFERENCE 2 (bases 1 to 21764)
AUTHORS The,V.L., Labrie,C., Zhao,H.F., Couet,J., Lachance,Y., Simard,J.,
Leblanc,G., Cote,J., Berube,D., Gagne,R. and Labrie,F.
TITLE Characterization of cDNAs for human estradiol 17 beta-dehydrogenase
and assignment of the gene to chromosome 17: evidence of two mRNA
species with distinct 5'-termini in human placenta
JOURNAL Mol. Endocrinol. 3 (8), 1301-1309 (1989)
PUBMED 2779584
REFERENCE 3 (bases 1 to 21764)
AUTHORS Luu-The,V., Labrie,C., Simard,J., Lachance,Y., Zhao,H.F., Couet,J.,
Leblanc,G. and Labrie,F.
TITLE Structure of two in tandem human 17 beta-hydroxysteroid
dehydrogenase genes
JOURNAL Mol. Endocrinol. 4 (2), 268-275 (1990)
PUBMED 2330005
REFERENCE 4 (bases 1 to 21764)
AUTHORS Peltoketo,H., Isomaa,V. and Vihko,R.
TITLE Genomic organization and DNA sequences of human 17
beta-hydroxysteroid dehydrogenase genes and flanking regions.
Localization of multiple Alu sequences and putative cis-acting
elements
JOURNAL Eur. J. Biochem. 209 (1), 459-466 (1992)
PUBMED 1327779
COMMENT On May 11, 1995 this sequence version replaced gi:177126.
FEATURES Location/Qualifiers
source 1..21764
/organism="Homo sapiens"
/mol_type="genomic DNA"
/db_xref="taxon:9606"
/chromosome="17"
/map="17q11-q21"
/cell_type="leukocyte"
repeat_region 883..21598
/rpt_family="Alu"
/rpt_type=dispersed
/rpt_unit_range=10904..11038
repeat_region 883..1163
/rpt_family="Alu"
/rpt_type=direct
repeat_region 1874..2154
/rpt_family="Alu"
/rpt_type=direct
repeat_region 2457..2736
/rpt_family="Alu"
/rpt_type=direct
repeat_region complement(3960..4104)
/note="right arm"
/rpt_family="Alu"
/rpt_type=inverted
repeat_region complement(4122..4404)
/rpt_family="Alu"
/rpt_type=inverted
repeat_region complement(4405..4538)
/note="left arm"
/rpt_family="Alu"
/rpt_type=inverted
repeat_region complement(4553..4834)
/rpt_family="Alu"
/rpt_type=inverted
repeat_region complement(6631..6785)
/note="right arm"
/rpt_family="Alu"
/rpt_type=inverted
repeat_region complement(6792..7073)
/rpt_family="Alu"
/rpt_type=inverted
repeat_region complement(7545..7864)
/rpt_family="Alu"
/rpt_type=inverted
repeat_region 7730..20156
/note="duplicated regions D1 and D2; direct repeats"
/rpt_type=tandem
gene 7730..13913
/gene="EDH17B1"
repeat_region 7730..13913
/gene="EDH17B1"
/note="G00-119-860"
misc_difference 7802..7805
/gene="EDH17B1"
/note="present in clone lambda17-HSD5; absent in clone
lambda17-HSD3"
misc_difference 7809..7815
/gene="EDH17B1"
/note="present in clone lambda17-HSD5; absent in clone
lambda17-HSD3"
prim_transcript 7816..11014
/gene="EDH17B1"
/note="G00-119-860"
misc_difference 8095
/gene="EDH17B1"
/note="G in clone lambda17-HSD5; A in clone lambda17-HSD3;
G00-119-860"
misc_difference 8178
/gene="EDH17B1"
/note="C in clone lambda17-HSD5; T in clone lambda17-HSD3;
G00-119-860"
repeat_region 10880..11014
/gene="EDH17B1"
/note="left arm"
/rpt_family="Alu"
/rpt_type=direct
repeat_region complement(11393..11650)
/rpt_family="Alu"
/rpt_type=inverted
misc_difference 12267
/gene="EDH17B1"
/note="C in clone lambda17-HSD5; T in clone lambda17-HSD3"
repeat_region 13208..13490
/gene="EDH17B1"
/rpt_family="Alu"
/rpt_type=direct
repeat_region 13520..13794
/gene="EDH17B1"
/rpt_family="Alu"
/rpt_type=direct
repeat_region 13811..13913
/gene="EDH17B1"
/note="truncated"
/rpt_family="Alu"
/rpt_type=direct
gene 13914..20156
/gene="EDH17B2"
repeat_region 13914..20156
/gene="EDH17B2"
/note="G00-127-970"
repeat_region complement(13914..14049)
/rpt_family="Alu"
/rpt_type=inverted
mRNA join(13999..15062,15156..15323,15471..15650,15826..15919,
16436..16613,16700..17244)
/gene="EDH17B2"
/note="G00-127-970"
prim_transcript 13999..17244
/gene="EDH17B2"
/note="G00-127-970"
exon 13999..15062
/gene="EDH17B2"
/note="G00-127-970"
/number=1
regulatory complement(14865..14868)
/regulatory_class="CAAT_signal"
/note="G00-127-970"
protein_bind 14897..14898
/gene="EDH17B2"
/note="G00-127-970"
/bound_moiety="transcription factor AP-2"
regulatory 14902..14915
/regulatory_class="GC_signal"
/gene="EDH17B2"
/note="G00-127-970"
regulatory 14925..14931
/regulatory_class="TATA_box"
/gene="EDH17B2"
/note="G00-127-970"
CDS join(14966..15062,15156..15323,15471..15650,15826..15919,
16436..16613,16700..16969)
/gene="EDH17B2"
/EC_number="1.1.1.62"
/note="estradiol 17-beta-dehydrogenase"
/codon_start=1
/product="17-beta-hydroxysteroid dehydrogenase"
/protein_id="AAB16942.1"
/translation="MARTVVLITGCSSGIGLHLAVRLASDPSQSFKVYATLRDLKTQG
RLWEAARALACPPGSLETLQLDVRDSKSVAAARERVTEGRVDVLVCNAGLGLLGPLEA
LGEDAVASVLDVNVVGTVRMLQAFLPDMKRRGSGRVLVTGSVGGLMGLPFNDVYCASK
FALEGLCESLAVLLLPFGVHLSLIECGPVHTAFMEKVLGSPEEVLDRTDIHTFHRFYQ
YLAHSKQVFREAAQNPEEVAEVFLTALRAPKPTLRYFTTERFLPLLRMRLDDPSGSNY
VTAMHREVFGDVPAKAEAGAEAGGGAGPGAEDEAGRSAVGDPELGDPPAAPQ"
intron 15063..15155
/gene="EDH17B2"
/note="G00-127-970"
/number=1
exon 15156..15323
/gene="EDH17B2"
/note="G00-127-970"
/number=2
intron 15324..15470
/gene="EDH17B2"
/note="G00-127-970"
/number=2
exon 15471..15650
/gene="EDH17B2"
/note="G00-127-970"
/number=3
intron 15651..15825
/gene="EDH17B2"
/note="G00-127-970"
/number=3
exon 15826..15919
/gene="EDH17B2"
/note="G00-127-970"
/number=4
intron 15920..16435
/gene="EDH17B2"
/note="G00-127-970"
/number=4
exon 16436..16613
/gene="EDH17B2"
/note="G00-127-970"
/number=5
intron 16614..16699
/gene="EDH17B2"
/note="G00-127-970"
/number=5
exon 16700..17244
/gene="EDH17B2"
/note="G00-127-970"
/number=6
repeat_region 17102..17236
/gene="EDH17B2"
/note="left arm"
/rpt_family="Alu"
/rpt_type=direct
repeat_region complement(17612..17873)
/rpt_family="Alu"
/rpt_type=inverted
repeat_region 19437..19715
/gene="EDH17B2"
/rpt_family="Alu"
/rpt_type=direct
repeat_region 19731..20014
/gene="EDH17B2"
/rpt_family="Alu"
/rpt_type=direct
repeat_region 20062..20256
/note="truncated"
/rpt_family="Alu"
/rpt_type=direct
repeat_region 20273..20546
/rpt_family="Alu"
/rpt_type=direct
repeat_region 20547..20825
/rpt_family="Alu"
/rpt_type=direct
repeat_region complement(21076..21276)
/note="truncated"
/rpt_family="Alu"
/rpt_type=inverted
repeat_region 21308..21598
/rpt_family="Alu"
/rpt_type=direct
BASE COUNT 4884 a 5912 c 6084 g 4884 t
ORIGIN
1 aactcatggg cagctctaga tctgcctccc tggaagcttc tgtgctgcaa tggctgctcc
61 aggctctgct taagctcttc acacagttgc cctgcccttc catctggcac tcttgctcca
121 tgaagccttc taaggccttc ctcttggggg aaagcccctt tgtgccccat ctcctcaccc
181 atgcgacaaa ggcaacacag tgaactcacc tactcacagg tctctttcct ctgggctgtg
241 ggctccttga tggcagcgtt cggattttgt ctcagtagcc ctagcgccca gcacaaagaa
301 gcaatgagtg aatggttgtt gaatgaatga atgaatgaat gaagatgaat atatttctat
361 gtgtgggccc ttcttcctca ggtgtacctg gccttgggcc tgacccaggc agagatcaat
421 gagttcttta ctggtcctgc cttcctggcc tgggggcgaa tgggcaacct gcacacctgg
481 gatggccccc tgcccccctc ctggcacatc aagcagcttt acctgcaggt aaaaggatgg
541 aaaagggaag gggcagaatc ggtgatagat ggtcatgggc ccaggaaggg tggtattagg
601 ccggccccag ggctcttaac tgaggcgggg ggctgcgtgt atcctgggag atgagggcct
661 tctcatagga cagcagtggc catgctcacc acccttcctt ctgttcctcc agcaccgggt
721 cctggaccag atgcgctcct tcggcatgac cccagtgctg cctgcattcg cggggcatgt
781 tcccgaggct gtcaccaggt gaggttccgc tcaccccctc cacttagctc agagagggaa
841 ttttattccc ttctagaaca tgacttaaaa acttaagctc tgggccgggc gcagtggctc
901 acgcctgtaa tcccagcact ttgggaggcc gagttgggcg gatcacctga ggtcaggagt
961 tcgagaccag cctggccaac atggtgaaac cctgtctcta ctaaaaatat aaaaattagc
1021 tgggcatggt ggcacgcgcc tgtaatccca tctacttagg aggctgagac aggagaattg
1081 cttaaacctg ggaggcagac gttgcagtga gtcaagatca cgccattgca ctccagcctg
1141 ggtgacgagc gaaactctgt ctcaaacaaa caaacaagct ctggacgtag gcctgggttt
1201 gatttctgac tctgctacta attagctgtg tgacttcggg cagatgacat gactgctctg
1261 tgcctcagtt tccttacttg taaaatggga tctctaccca ctcgctgtag ggtttgtaat
1321 tatctctcga tctatctgtg actttgcaca gagtgctagc aaatggcagc ccctgggagt
1381 ggccgcaggg gtgctccagt gtcccttgtc ccttcctgtt cctcgggctt tcccagccat
1441 cctgtaaatg tcttgggaaa agtcttcaag gctcacctga gacctcccct ccttcaggaa
1501 gccttgctag tgccccgcat gacctccttt gcacctgcta atgtctggct cccatactct
1561 cgtaggactt aatgcatgcc agtggcctcc ctgcccgcct ctttgccccc atcaccaggt
1621 ggcaggaaac tcactcattc attcaataaa cttggtccag ctgtctgagg ctgccagaac
1681 tggctgtgct gggtcctggg aggcggcaag aaaggtgccc aagggcttac ccctgatagg
1741 agagatatgt tggctgaagg atacaatgtg gggacaagga caggaatata tgtgggttcc
1801 gctctcctct gccgggagag aggggcagga agggctcagg gcagagccca gccttgaaaa
1861 atgagtgttg cttggacgga cgcttggcta atgcttgtaa tcctagcgtt ttgggaggct
1921 gaggcgtatg gatcacctgc ggtcaggagt taaagaccag cctggccaac atggcgaaac
1981 cccatctcta ctaaaagtac aaaaattagc caggcgtggt ggcgggctcc tgtaatccca
2041 gctactcggt aggctgaggc atgagaatct cttgaagcca ggggccagag actgcagtga
2101 gccgagatca caccacttca ctccagcctg ggtgacagag tgagactccg tctcaaaaaa
2161 aaaaaaaaaa aaaggaaaga aaattaaaca cctcatgttc tcactcatag tgggagttga
2221 acaatgagaa caacatggac acaggaaggg gaacatcaca caccggggcc tttcgcggtg
2281 tgggggtcaa ggggaggagt agcattggga cagatactta atgcatgcgg ggctgaaaac
2341 ctagatgatg ggttgatggg tgcagcaaac caccatggca catgtatacc tatgcaacaa
2401 acctgcatgt tctgcacaga actgaactga aagtataatt aaaaaaaaaa aaaaaagctg
2461 ggtgcggtgg cccacacctg taatcccagc actttgggag gccgagacgg gcggatcaca
2521 aggtcagcag atcgagacca tcctggctaa cacagtgaaa ctcagtctct actaaaaata
2581 caaaaaatta gccgggtgtg gtggcgggca cctgtagtcc cagctactag ggaggctgag
2641 gcaggagaat ggcatgaacc tgggaggcag agcttgcagt gagctgagaa tgcgccactg
2701 cactccagcc tgggggacag agtgagactc tgcctcaaaa aaaaaaaaaa aaagaaagaa
2761 aaaggagcgt tgcttgtttc aggccacagg aaggggagag atagtgaaag tttttcagag
2821 aaggtggcca gggaaggaga agaaaggact gtaggcagag agcatagcct gtacaaagcc
2881 atagaggcaa gagaaaccag gagctgtaga gaagttggca aggctgttga acactatggt
2941 gaacactatg gcggcttcca tgaaatatct gagcttttgc tccccactag ggtgttccct
3001 caggtcaatg tcacgaagat gggcagttgg ggccacttta actgttccta ctcctgctcc
3061 ttccttctgg ctccggaaga ccccatattc cccatcatcg ggagcctctt cctgcgagag
3121 ctgatcaaag agtttggcac agaccacatc tatggggccg acactttcaa tgagatgcag
3181 ccaccttcct cagagccctc ctaccttgcc gcagccacca ctgccgtcta tgaggccatg
3241 actgcaggta cagtgcctgg gtggggtggg agagcccccc agaccctcaa aaagaaggga
3301 gtagcagatg tcagtagggg taggcagagg gactggaata atgcctcgcc ataacacaca
3361 gtactttata gtttaccaag cacgtgtaca catgcgttgt ctcagtgaat cccactgtgg
3421 ttgagaggtg agctctggaa gccaacaacc tgggtcacac ctcgcgctcc tatttcctgg
3481 ccgtgtgact tatgactcat gacctccttc ccagtgtctc gtttgctttt cctgtaaact
3541 gggactacct cataggtaga ataacgcctg gcccagagca aaggccacta agagctagct
3601 atgaacaagg attttgtttc atctctgcgt ggttgctgaa gtaggcactg caggcaggag
3661 gtgagtggat gtgcctaaag gcactaagtg cgcatcctgc tacaaaactg tgaagccagg
3721 gctccttcct gccacttaaa ggaggagtgg agcagagggc gcccaagtca ggaatgactt
3781 agtggagagg cgtctgtgtt ggccaggaag ggaacagatc agctcagcct ttcttgagca
3841 gtactgctcc aagtgtgacc caaaaccagc agcagcagca gcagcagccc gagctgtgag
3901 atggcaaatt ctcaggccct acccaagacc tgaaggagaa gctacatttt tttttttttg
3961 aagacagatt tcactctgtt gctgaggctg gagcacagtg gcacaatctc atctcactgc
4021 aaccttcgtc tcctaggttc aagcgattct cctgcctcag cctcccgagt agctgggact
4081 ataggcaccc gccaccacgc ccggcaattt ttgtttgttt tgagatagag tctcgctctg
4141 tcacccaggc tggagtgcag tggcacgatc tcagttcact gcaacctctg cttcctgagt
4201 tcaagcgatt ctcctgcctc agcctcctga gtagctggga ttacaggcgc cccccaacca
4261 cactcggcta atttttgtat ttttagtaga gacggggttt cgctatgtag gtcaagctgg
4321 tttcaaactc ctgacctcaa atgattcgcc cacttcagcc tcccaaagtg ctgggattac
4381 aggtgtgagc caccttgcct ggccaatttt tgtattttta gtagaaacag gtttcaccat
4441 ggtggccaga ctggtctcaa actcctgacc tcaggtgaac tgcccacctc agcctcccaa
4501 agtactggta ttacaggcgt gatccactgc gactggcctt gattttgttt ttgagacaga
4561 atcttactct gtcgcccaga ctggagtgca gtggcacaat ctcagctcac tgcaacttct
4621 gcctcatggg ttcaagtgat tcttgtgcct ctacctcccg agtagccggg attacaggca
4681 cctgccatta cgctaggcta atttttgtat ttttagtata gacagggttt ccccacattg
4741 gccaggctgg tctggaactc ctgggctcaa gtgatccacc tgcttcagcc cctcagagta
4801 ctgggattat aggtgtgggc caccacgccc attcagaaac ctccatgttt taaggagcct
4861 ctgggtaact ctcatgttca cccaagctgc tgaaccctgt cttggagttt tcagagggac
4921 gcgtatgtgc cacagacgtc ccgctggtgg gggtcatggg aagccatgac ctgggataga
4981 cagtcgtctg tagagtgggg tgaacattcc ctgggccctc tgtttcatca ctcctcttct
5041 ctgttccccc tacctcctgt ccacagtgga tactgaggct gtgtggctgc tccaaggctg
5101 gctcttccag caccagccgc agttctgggg gcccgcccag atcagggctg tgctgggagc
5161 tgtgccccgt ggccgcctcc tggttctgga cctgtttgct gagagccagc ctgtgtatac
5221 ccgcactgcc tccttccagg gccagccctt catctggtgc atgctgcaca actttggggg
5281 aaaccatggt ctttttggag ccctagaggc tgtgaacgga ggcccagaag ctgcccgcct
5341 cttccccaac tccaccatgg taggcacggg catggccccc gagggcatca gccagaacga
5401 agtggtctat tccctcatgg ctgagctggg ctggcgaaag gacccagtgc cagatttggc
5461 agcctgggtg accagctttg ccgcccggcg gtatggggtc tcccacccgg acgcaggggc
5521 agcgtggagg ctactgctcc ggagtgtgta caactgctcc ggggaggcct gcaggggcca
5581 caatcgtagc ccgctggtca ggcggccgtc cctacagatg aataccagca tctggtacaa
5641 ccgatctgat gtgtttgagg cctggcggct gctgctcaca tctgctccct ccctggccac
5701 cagccccgcc ttccgctacg acctgctgga cctcactcgg caggcagtgc aggagctggt
5761 cagcttgtac tatgaggagg caagaagcgc ctacctgagc aaggagctgg cctccctgtt
5821 gagggctgga ggcgtcctgg cctatgagct gctgccggca ctggacgagg tgctggctag
5881 tgacagccgc ttcttgctgg gcagctggct agagcaggcc cgagcagcgc agtcagtgag
5941 gccgaggccg atttctacga gcagaacagc cgctaccagc tgaccttgtg ggggccagaa
6001 ggcaacatcc tggactatgc caacaagcag ctggcggggt tggtggccaa ctactacacc
6061 cctcgctggc ggcttttcct ggaggcgctg gttgacagtg tggcccaggg catccctttc
6121 caacagcacc agtttgacaa aaatgtcttc caactggagc aggccttcgt tctcagcaag
6181 cagaggtacc ccagccagcc gcgaggagac actgtggacc tggccaagaa gatcttcctc
6241 aaatattacc ccggctgggt ggccggctct tggtgataga ttcgccacca ctgggccttg
6301 ttttccgcta attccagggc agattccagg gcccagagct ggacagacat cacaggataa
6361 cccaggcctg ggaggaggcc ccacggcctg ctggtggggt ctgacctggg gggattggag
6421 ggaaatgacc tgccctccac caccacccaa agtgtgggat taaagtactg ttttctttcc
6481 acttaaactg atgagtcccc tgggtctgtc aaaatgagaa ggtcactgct gccacgcttg
6541 ggaggactca gggctatagc atggccctgg ggtgggacct gttctcccat cccttgcctc
6601 acgtccctgt ttttgtttgt ttgtttgttt gtgacggagc cttggtctgt tgcccaggct
6661 tgagtacaat ggcacagtct cggctcactg caacctccgc ctcctgggtt caagcaattc
6721 ttgtgcctca gcctccccgg tagctgggac tataggcatg caccaccaca ccaggctaat
6781 tttttttttc caagatggag tcttgctctg tcgcccaggt tggagtttag tggcaccata
6841 ttggtttact gcaacctctg cctcccgggt tcaagcaatt ctcctgcctc agtctaccag
6901 ggagttagga ctacgggcct gtgccatcac gcctggctaa tttttgtatt tttcatagag
6961 ataaggtttc accatgttgg ccaggctggt ctttaactcc tgaactcaag tgatccacct
7021 gcctcggcct tccaaagtgc tgggattaca ggagtgagcc accgtgcccg gccatgtctc
7081 tctttttaac actaatgtta ccctgacctt tgaacgtaga atgcccttct gttgcaggaa
7141 aacctctttt caaaccatgt ttgtcctttg ctggcatgcc acagcaacag tcaccaacac
7201 agaagacttc tgtgaccaaa tatttggagg attttcccca cacacaccaa gcagcagaca
7261 tcagctgggt gtcctccaat tcagttccaa tgtaatcaac cagagacagc atcagatccc
7321 acagggttag ggtgcagatc catgagacca ccccctcctt cccaacggtt acaagtcctg
7381 atccctggaa cttctgacta actggcttca agttggagtt cccatgaccc ccttcccctc
7441 tttggagtca actcatttgc gacagtgacc cacgaaacac agggaaaccc ttattatgtt
7501 tattgcttta ttacagagga aaaaaatttt tttctttctt ttttgagaca gggtctcact
7561 ctgtcatcca gaatgactgc agtggcagga tctggctccg tcacccaggc tggagtgcag
7621 tggcatgatc tcggctcact acagcctcca tcccccccaa accccacgcc tcagcgcccc
7681 accccgcaag tggctgggac tctaagcata caccaccaca cccagctaat ttttttgtag
7741 tttttgcaaa gacggggtct cattctgttg ccctggctgg tcttgaactc ctgagctcca
7801 gcaatcccct tgtcttggcc tcccaaagtg ctgggattac aggcatcagc caccgtgccc
7861 aacctcaaag gatattttaa aggatagaaa taaacagcca tatgaagaga tacagacagg
7921 gcggtctgga agggcccaga gcaggagctt ctatctccat agagttgggg ttacatcacc
7981 ctccaggcac atggatgagt tcttcacctt ctgtcagcct ccacacgttc agctctcaga
8041 agcttcccga accctgtcct ttgggccttt tatggagaac tccattggct gtccgtgact
8101 gaagcatgga caactgtgat aatgtgattg ggcaaaaagg gtctgatcta agcccagcaa
8161 ggccagtcca gattcttcgg gcctttgtgc agcattcctt tctccagggt atggggcaag
8221 gacccactct ggaatgagga tcctacaacc cacaatcaga ttagagtcct gccttgggca
8281 gctgaaaaga ggacaggaga aggtcaaaga gaggaaaggc tgttttttga ggcctgaggc
8341 gccccaacat gacaacgaaa gactgtaacc atggtcatgt gagttatgag ctaggaaccc
8401 tggacgaaac caacacatat acaatcatct cccacctccc aacaccttta ctttcacagc
8461 ctctgcagca aactgcggtc actataatcg ctcctgtggc acagaggcat acccagggga
8521 atctgcccag ggggccactc tgtgcccaca tgggaaccca catctgcttg taaagcctcc
8581 cctccctctg accagaaacg aggacagttt gttgttccaa gcagtgggct catgtctgtt
8641 ttggctcaga acagggtggg gagagcgggc cagggacccg aggagggctt atccttgaga
8701 ttgcgtggga gacacaacaa ggggtggggg cccgcaggcg gggcggggcg aagcaggtga
8761 tatccagccc agagccccag cctctcccca cagtctcacc atggcctgca ccgtggtgct
8821 catcaccggc tgttcctcag gtattggcct gcacttggca atacatctgg ctttggaccc
8881 atcccagagc ttcaaaggta tagataggta gggacaggga gggagagaag ggaaaagccc
8941 ttggaggcca gaagagaagt cagatcttcc tcctctccca aaacctccag tgtatgccac
9001 gttgagggac ctgaaaacac agggccggct gtgggaggcg gcccgggccc tggcatgccc
9061 tcagggatcc ctggagaggt tgcagctgga tgtaaggaac tcaagctccc tggccgctgc
9121 ccgggaacgc gtgaccgagg gccgtgtgga tgtgctgggt gagcctcctg gaagcatatg
9181 ggctcctagg agccttctcc gccctgcgtt gaaaccaaca tgtccccagg cccctggagc
9241 atgaggggac aggccgtgct gagggtgatg ctgaggcggg ctggttgggc ctctgtcccc
9301 gcagtgtgta acacaggcct gggcctgctg gggccgctgg gggaggacgc cgtggcctct
9361 gtactggatg tgaatgtagt aggtactgtg cgggtgctgc aggacttcct gccagacatg
9421 aagcggccgg gttcgggacg cgtgttggtg actgggagca tgggaggatt gatgggtgag
9481 tggcagggac cgggccccgg agctccagat tctttgtgtg cagagctgag ccttgaaggc
9541 aggctcctta gggggtgggg tgcaatcagc ttggaggggc actgcctgcc gggggatgac
9601 cccctggccg ctgcgcctca ggaacctcat cttcccaccc aagggctgcc tttcaatgac
9661 gtttattgcg ccagcaagtt cgcgctcgaa ggcttatgcg agagtctggc ggttctgctg
9721 ctgccctttg gggtccagtg agtcaacacc cccgtctcct caaccctctt aactctgacc
9781 tagagatgcc gagcaccctg tcctgcggaa gccgctctgg tctctgcccg gcttacattt
9841 gctgcgtgcc aggcacttag gctggagcat tggcacgcat tgtgccactt gctgacctgg
9901 ctgctgaagt gttggtattg ttatggggaa gctccagcca ggagaggtta ggtgactggc
9961 ccaaggtcat gcagcggccg gggatcccgc caggttcgaa ttctgacacc agggctacct
10021 ggcagcctca gatgggtttg ggagggctgt cgagcaataa cccgctattc aaatgttctg
10081 gttatcccca gcgctctttc caccttcggg acgcagcggt gctgttctgg gtcgtggcca
10141 gggccggggt cggggccggt gctggggcag gagatgggac ttggcgcctg gtcgctccgt
10201 ccctgcccac ttgcggctct cgggccagca gcgtgagcct gatcgagtgc ggccctgtgg
10261 acaccgcctt catgcagaag gtgttgggcg gtcccgacca ggtgatggac cgcacgaaca
10321 cccggacctt ccgcctcttg cactaatacc tccaccacag caaggagatc taccgcgagg
10381 aggcgcagca ccctgaggag gtggtgaagg tgagcggggg gcgggactcc gggagcgggg
10441 gcggtgcgtc gtcctgcgcg cagccggggc cagagctcct ctcccgccgc cgcaggtctt
10501 cctcaccgct atgcgcgccc cgaagccgac cctgcgctac ttcacaacca ggcgcttcct
10561 gcaccagctg ctgatgcgcc tggacgaccc ctttggcttc gactacgccg ccgccatgca
10621 ccgggacgtg ttcgccgacg atcccgcaga ggccgaggct ggggccgggg ctggggccga
10681 ggccgggggc ggggccggtg ggatgggaga ccctgagctc agcgatcctc tggccgcccc
10741 gcaagaaagg ctccgtcagc cactgtctcc cgcgccctcc tttgtctcct gggcctgtgc
10801 ggtccctggg gatgggacgg cggtgacggc tgtggatggc taattaagac agatcacgtt
10861 agcccgttat atctgcgcgg ctaggcgcga tggctgtcgc ctataatccc agagctttgg
10921 aaggccgagg caggaggatc gctccaggcc aggagttcca gaccagcctg agcaacatag
10981 tgagacaccc catctctaaa ataaaaaaat tagcacagtg gcaccattcc ttgagctcag
11041 gagttggagg ctgcagtggg catgatcgag ctactgctct ccagcttggg tgatagagtg
11101 agacactgtc aattaattaa tctaatcaac caaccaaccc aacaacccag aaaccaaggt
11161 ccagaaagaa gccagcccag gatcatgcct caagtccaca gtaaagccca gacacagtca
11221 ctggataccc aaggggcatc tgcaggacga gttaggtggg acttgggttg gggtagagtc
11281 aggttgtgac ctgcactcca ttagccatga gacctcaggc aagttccttg ctttctctga
11341 gtgctttcct ttcctttccc ttttcttttc ttttttgttt ttcttttgtt ttgagatgga
11401 gtttcactct tgttgcccag cctggagtgc aatagcgcgg tctcagttca ccacaacctc
11461 tgcctcccgg gttcaagtgt agctaggatt gcaggcatgt gccaccacgc ccagctaatt
11521 ttgtgttttt agtagagacg aggtttctcc atgttggtca gactggtctt ggactcccga
11581 cctcaggtga tccgcccgcc ctggcctctc agagtgctgg gattacaggc atgacccacc
11641 ccatccggcc ctctgagtct ttcttatctg taaaatgggt ataataatac ctatctaatc
11701 ggttttagta atggtggaga taatgcctga aagtgctagt atagaggttt aatagccagc
11761 aaggactgtt attagaatga atagtaatat gactactgtc acattttgca aatgtgtaaa
11821 gaagaaagag ggctaagtga atcaaaggga gacagcccca ctcaccctgt tctgccccag
11881 aggactgagc gatccccacc atctggcaag ctgcctcacc aggaggtcca gctggggcta
11941 caggactggc tactttgcta caatggcccg tctttcctga gcccagtgga gggtcccagg
12001 ggggcagaag tcattgatag gggccagtag ggttgtagag ccactgtctg aacttctgtg
12061 gaggtctggt gcaggggagg tgtgaacaga taggaggcta ggtgaggatg cagcagagga
12121 agggcaggag tccgcaggag ggggcggtac cgaggcaggg attcagctgg gtctgagagg
12181 gaggcaaggc tgagggggat ggccctttgg agtgggcagg gacatgacca cagaaagcct
12241 gggaagtgga aacagacagg agcctgctgg agctcttgga gctttgtgtg gggcactagg
12301 ggaaggaggc aggcactcca ccctgacctg cccctacctc tggatgaggg tcttctctgc
12361 ctgttggatg atgtgctgcc cctgctcttg gaggaagagg cctcccagcc accccctcct
12421 gccagtcgcc ttgccctgct gacccagaca gatctgctat ccctacagca tccaggaagg
12481 ccacagggag ggggcccagg tggcatgggt ttctgaggcc tgggatctgc tctcagtccc
12541 gctctcgcca accccctttg cccctctgtg acagtgtgtg tacatatgcg tgcatgtgtg
12601 tacctgtgtg gtgtggctca caaggtcacc cctgggggtg ggataataag aggtaaagtg
12661 tgaccccctc tcttccgtac attcattttt tcaagcttgt gaatattcac attgttaata
12721 taatcttctg cagtcccatg acttcgaata gcacttagat gccgatgacc cccccaccaa
12781 agaaacccca ctgtctccca cagactccag gtctgtatct ccaactgcct actcaataca
12841 acacgtccac ctgatgcaat gaacttaatg tgtctaaacc ccaattccac atttccagtc
12901 ctcctaccgg ctcctccccc agtcttctct actttactgt atgacaactc catttttccc
12961 aatactggtt ttaaaaaccc tggagtagtc cttgactcca gtctctctca cctttcaacc
13021 aattcagcag caaatactga cagctctacc tttgctcatc aaagcatatc cccaaatcac
13081 cctattccag actgcacact gtcatctctc ccttcaatta cagcagtagc ctcctaacta
13141 gttttcttga ttccactctt gcccatcagc agtgaaatta ccccagagca gttaaaatga
13201 tcttttagga tgggcacagt ggctcacgcc tgtaatacca gcactttggg aggtggaggc
13261 tggctgatcc ccttaggttg ggagttcaag accatcctca ccaacaggga gaaaccccat
13321 ctctactaaa aatacaaaaa ttagccggat gtggtgggac gtgtctgtaa tcccaactac
13381 ttgggaggct gagacaggag aactgcttga acccgggagg tggaggttgc agtgagccag
13441 aatcgtgcca cagcactcca gcctgggcaa gaagcgcgaa actccgtctc aaaataaaat
13501 aatcttctaa aaatgacagg gccaggtggg gtggcacttt tttataatcc gagcactttg
13561 ggaggctgag gtgggcagat cgcttgacat caggggtttg agaccagcct ggccaacatg
13621 gtgaaacttc gtctctacta aaaatactaa aaattagctg ggcgtggtgg cgggtgcctg
13681 taatcccagc tactcgggag gctgatgcag gagaatcgcc tgaaccaggc agatgcagga
13741 tgcagtgagc caagattaag acactgcact ccaacctaag caatactttg tctcaaaaat
13801 aaaaaaaagc ctgggaaaca aagtgagacc ccgtctctac aaaaaagtca aaaaattagc
13861 tgggtatggt ggcagtgatg gcacacacct gtagtcccgg ctacttggga agctttttaa
13921 tattttttgc agagaccggg tctcactctg ttacctggcc tggtcttgaa ctcctgggct
13981 ccaacaatcc cctttcctgg gcctcccaaa gtgctgggat tagaggcatg agccaccgtg
14041 cccagcctca aagcatattt taaaggatag aaataaacag ccatatgaag agatacagac
14101 agggcggtct ggaagggtcc agagcaggag cttctatctc catagagttg gggttacgtc
14161 accctctggg cacattctgt cagcctccac acgttcagct ctcagaagct cccgaaccct
14221 gtcctttggg ccttttatgg agaactccat tggctgtcca tgactgaagc atggacaact
14281 gtgataatgt gattgggcaa aaagggtctg atctaagccc agcaaggcca gtccagattc
14341 tttgggcctt tgtgcagcat tcctttctcc agggtatggg gcaaggaccc actctggaat
14401 gaggatccta caacccacaa tcagattaga gtcctgcctt gggcagctga aaagaggaca
14461 ggagaaggtc agagagacga aaggctgttt tttgaggcct gaggcacccc aacatgacaa
14521 cgtaagactg taaccatggt catgtgagtt atgagctagg aaccctggac gaaaccaaca
14581 catatacaat catctcccac ctcccaacgc ctttactttc acagcctctg cagcaaactg
14641 cggtcactat aatcgctcct gtggcacaga ggcataccca ggggaatctg cccagggggc
14701 cactctgtgc ccacgtggga acccacacct gcttgtaaag cctcccctcc ctctgaccag
14761 caaccaggac agtttgttgt tccaagcagt gggctcatgt ctgttttggc tcagaacagg
14821 gtggggagag cgggccaggg acccgaggaa ggcttatcct tgagattgcg tgggagacac
14881 aacaaggggt gggggcccgc aggcggggcg gggcgaagca ggtgatatca agcccagagc
14941 cccagcctct ccccacagtc tcaccatggc ccgcaccgtg gtgctcatca ccggctgttc
15001 ctcgggcatc ggcctgcact tggccgtacg tctggcttca gatccatccc agagcttcaa
15061 aggtatagat aggcagggac agggagggag agaagggagg agcccttgga ggctagaagg
15121 gaagtcagat cttcctcctc tcccaaaacc tccagtgtat gccacgttga gggacctgaa
15181 aacacagggc cggctgtggg aggcggcccg ggccctggca tgccctccgg gatccctgga
15241 gacgttgcag ctggacgtaa gggactcaaa atccgtggcc gctgcccggg aacgcgtgac
15301 tgagggccgc gtggacgtgc tgggtgagcc tcctggaagc atatgggctc ctaggagcct
15361 tctccgccct gcgttgaaac caacatgttc ccaggcccag ggagcacgag gggacaggcc
15421 gtgctgaggg tgatgctgag gcgggctggt cgggcctctt gtctccgcag tgtgtaacgc
15481 aggcctgggc ctgctggggc cgctggaggc gctgggggag gacgccgtgg cctctgtgct
15541 ggacgtgaat gtagtaggga ctgtgcggat gctgcaggcc ttcctgccag acatgaagag
15601 gcgcggttcg ggacgcgtgt tggtgaccgg gagcgtggga ggattgatgg gtgagtggta
15661 gggagtggcc tcggcagctc cagattcttt gtgtgcggag ctgagccttg aaggcaggtt
15721 ccgcggggag ggtggagggg ggtgccgtca gcttggaggg gcaccgtctg cccggggatg
15781 accccctggc cgctgcgcct caggaacctc gtctccccac ctaagggctg cctttcaatg
15841 acgtttattg cgccagcaag ttcgcgctcg aaggcttatg cgagagtctg gcggttctgc
15901 tgctgccctt tggggtccag tgagtcaaca cccccgttcc ccgaaccctc ttaactctga
15961 cctagagatg ccgagcaccc tgtcctgcgg gagccgctct ggggcgatct ccctggccct
16021 ctctgcccgg ctcacattag ctgtgtgcca ggcacttggg ctgggcgcat ggcacgcatt
16081 gtgccacctg ctgacctagc tgctgaagtg ttggtattgt tatggggaag ctccagccca
16141 gagaggttag gtgactggcc caaggtcaca cagcggccag ggaccccgct agattcgaat
16201 cctgacacca gggctccctt gtagcctcag atggatttgg gagggctgct ccggcaggaa
16261 cccgcgtttc aaatgttctg gttatcccca gcgccctttc cgcctcactt cccagcgcac
16321 ggtgctgctc gcggtcgggg gccgggacgt ggttggggct gggactgggg ttggggctgg
16381 gactggggcc tggctggcgt ccgccccctc ccactcgttg ctctccggcc agcagcttga
16441 gcctgatcga gtgcggccca gtgcacaccg ccttcatgga gaaggtgttg ggcagcccag
16501 aggaggtgct ggaccgcacg gacatccaca ccttccaccg cttctaccaa tacctcgccc
16561 acagcaagca agtctttcgc gaggcggcgc agaaccctga ggaggtggcg gaggtgagcg
16621 ccgggctgga ctccaggagt gggggcggtg cgtcctccgg cgcgcagcgg tggccacagc
16681 tctcctcccg ccgccgcagg tcttcctcac cgctttgcgc gccccgaagc cgaccctgcg
16741 ctacttcacc accgagcgct tcctgcccct gctgcggatg cgcctggacg accccagcgg
16801 ctccaactac gtcaccgcca tgcaccggga agtgttcggc gacgttccgg caaaggccga
16861 ggctggggcc gaggctgggg gcggggccgg gcctggggca gaggacgagg ccgggcgcag
16921 tgcggtgggg gaccctgagc tcggcgatcc tccggccgcc ccgcagtaaa ggcttcctca
16981 gccgctgtct cccgcgccct tctttgtccc ctgggtctgt gtggtccctg gggatggggc
17041 ggcggtagca gctgtgggtg gctaattaag atagatcgcg ttagccagtt ttaccagcgc
17101 agctaggcgc gatggctgtc gcctgtaatg ccagcgcttt gggaggcgga ggcaggagga
17161 tcgctcaagc cccggagttg gagaccagcc agagcaacac agtgagaccc ccatctctac
17221 aaaaataaag aaaatttaaa aatcagcaca gtggcaccat tccttgagcc caggagttgg
17281 agggtgcagt gagcatgatg gggccactgc actccagcct gggtgacaga gtgagaccct
17341 gtcaattaat caaatgaacc aaccaaccga aaaactcaga aaccaaggtc cagaaagaag
17401 ccagcccagg atcacacctc aagtccatat taaaggccag acacagtctc tggataccca
17461 aggggcatct gcaggaggag ttaggtggga attggcttgg ggtagagtca ggttgtgact
17521 tggaccccat tagccatgag acctcaggca agttccttgc tttctctcag tctttctttt
17581 ctttctttcc tttctttttt ttttattttc tgagacagag tttcgctctt gttgcctagg
17641 ctggagtgca atggggcaat ctcggctcac cacaatctcc gcttcccaga ttcaagtgat
17701 tctcctgcct cagcctccca agtagctggg ataccggcta attttgtatg ttcggtagag
17761 acggggtttc tccatgttgg tcaggctggt ctcaaactct cgaccttagg tgatccgccc
17821 gcctcggcct ctcagattgc tgggcataca ggcatgagcc accgcgaccg gccctctgag
17881 tctttcttat ctgtaaaatg ggtataataa tacctatcta atcggtttta gtaatggtgg
17941 agataatgcc tgaaagtgct agtatagagg tttaatagcc agcaaggact gttattagaa
18001 tgaatagtaa tatgactact gtcacatttt gcaaatgtgt aaagaagagg gctaagtgaa
18061 tcaaagggag acagccccac tcaccctgtt ctgccccaga ggactgagcg atccccacca
18121 tctggcaagc tgcctcacca ggaggtccag ctggggctac aggactggct actttgctac
18181 aatggcctgt ctttcctgag cccagtggag ggtcccaggg gggcagaagt cattgatagg
18241 ggccagtagg gttgtagagc cactgtctga acttctgtgg aggtctggtg caggggaggt
18301 gtgaacagat aggaggctag gtgaggatgc agcagaggaa ggggcaggag tgcccaggag
18361 ggggcggtac cgaggcaggg attcagctgg gtctgagagg gaggcaaggc tgagggggat
18421 ggccctttgg agtgggcagg gacatgacca cagaaagcct gggaagtgga aacagacagg
18481 agcctgttgg agctcttgga gctttgtgtg gggcactggg ggaaggaggc aggcactcca
18541 ccctgacctg cccctacctc ggatgagggt cttctctgcc tgttggatga tgtgctgccc
18601 ctgctcttgg aggaagaggc ctcccagcca ccccctcctg ccagtcgccc tgacctgctg
18661 acccagacag atctgctatc cctacagcat ccaggaaggc cacagggagg gggcccaggt
18721 ggcatgggtt tctgaggcct gggatctgct ctcagtcccc gtctcgccaa ccccctttgc
18781 ccctctgtga cagtgtgtgt acatatgcgt gcatgtgtgt acctgtgtgg tgtggctcac
18841 aaggtcaccc ctgggggtgg gataataaga ggtaaagtgt gaccccctct cttccgtaca
18901 ttcatttttt caagcttgtg aatattcaca ttgttaatat aatcttctgc agtcccatga
18961 cttcgaatag cacttagatg ccgatgaccc ccccaccaaa gaaaccccac tgtctcccac
19021 agactccagg tctgtatctc caactgccta ctcaatacaa cacgtccacc tgatgcaatg
19081 aacttaatgt gtctaaaccc caattccaca tttccagtcc tcctaccggc tcctccccca
19141 gtcttctcta ctttactgta tgacaactcc atttttccca atactggttt taaaaaccct
19201 ggagtagtcc ttgactccag tctctctcac ctttcaacca attcagcagc aaatactgac
19261 agctctacct ttgctcatca aagcatattc caaaatcacc ctattcgaga ctgcacactg
19321 tcatctttcc cttcaattac agcagtagcc tcctaactag ttttcttgat tccactcttg
19381 cccatcagca gtaaaattag cacagtgcat ttaaaatgat cttttaaaaa tgacagggcc
19441 gggcgtgatg gctcatgcct ataatcacag cactttgtga ggctgaggcg gacgttattg
19501 cttgaggtca ggagtttgag accaacctgg ccaatatggt gaaaccctgt gtctactaaa
19561 aatataaaaa ttagccaggc atggtggtgg gcgcctgtaa tcgcagctgc tccagaggct
19621 gacaaaggag aatctcttga accagggagg tggaggctgc agtgagccaa gattgggaca
19681 ctgcactcca gcctgggcaa cagagacttt gtctcaaaaa aataaaaatt ggccgggcgt
19741 ggtggctcat gcctgtaatc ccagcacttt gggaggccga ggcgggtgga tcacctgagg
19801 tcaggagttc aagaccagcc tggccaacag ggtgaaacct tgtctctact aaaaacccaa
19861 aaaattagcc aggcgtggtg gcacgtgcct gtaatcccag ctactcaggc ggctgaggca
19921 ggggaattgc ttgaaccccg gagcggtacg ctgcagtgag tggagattgc gccattgtac
19981 tctagcctgg gcaacaagag tgaaactcca tctcaaaata ataataataa aattaaaaaa
20041 aatttaaaat aaaaaaggga gcctgggcaa caaagtgaga ccccgtctct acgaaaaaac
20101 aaaaaaaata gttgggtgtg gttgcacaca tctgtagtcc cagcttcttg ggaggctgac
20161 atgggaagac tgtttgagcc caggaggttg aggccacagt aagctgtggg cacaccactg
20221 cattccagcc tgggcaagag agggagaccc tgtctcataa aagaaagaaa gaggccaggt
20281 gtggtggcta cgcctgtaat cccaacactt tgggagccca aggccggtga atcacttgag
20341 atcaggagtt cgagaccagc ctggccaaca tggtaaaacc ccgtctctac taaaaataca
20401 aaaattagct gggcgtggta gtttgtgcct gtaatcccag ctactgggga ggctgaggca
20461 ggagaatcgc ttgaacccgg gagacgaagg ttgcagtgag ctgagatttc tccactgcac
20521 tccagcctgg gtaaagagtg agactcggcc gggtgcggtg gctcatgtct ttaatcccag
20581 cactttggga ggccgaggcg ggtggatcac ctgaggtcag gagttcaaga ccagcctggc
20641 caacctggtg aaagcctgtc tctactaaaa atagaaaaat tagctgggcg tggtggtggg
20701 tgcctgtaat cccagctact caggaggctg aagcaggaga atcacttgaa cctgggaggc
20761 agaggttgca gtgagccgag atcatgcgtt gcactccaac ctgggtgaca gtgagactct
20821 gtcttaaaac aaaacaaaag aagcaaagtt tattttgctc agagcttcac agttttaggg
20881 gctggaaagt gcaagaacct catgcagtat ctggcaaagg gctgtcccag gatggaaggg
20941 cagaaggtgg aagcaaatga gagacacgaa attgggccaa atgtcgtcca tccatcagga
21001 gccactccca tgataacagt atttatctat tcaagagggc agagccctca tgatctaatt
21061 atcttttttt tttttctcct gcctcggcct cttgagaagc tgggatttta ggcgcatgcc
21121 accacacccc gctaattttt gtacttttag tagagacggg gtttcaccac gttggttggg
21181 ctggtcgcaa actcctgatc tcaggtgatc cacctgagat ccaccacttt ggcttcccaa
21241 agtgctgaga ttacaggagt gagccacctc gcccagctga cctaattacc tcttaaaggc
21301 cccaccctgc ccagtgcagt ggctcaggcc tgtaatccca acactttggg aggctgaagt
21361 gggaggactg cttgagttca ggagttggaa accagcccag gcaacacagc aagatcctgt
21421 ctgtatgaaa aaaaaaattt tttttttaat tagccaggtg tggtggcaca catctgtagt
21481 tccagctgct caggaggcca aggcagagga ctgcttgagc ccaggcagtt gaggctgctg
21541 tgagccatga tcaacccact gcactccagc ctgggggaca agagtgagac cctgtctcaa
21601 agagaaaaaa agtcccatct cttaagacag tagcaattga attttttatt tcaacatgag
21661 ttgtggcagg gacattcgaa ccatagcacc aaggttctcc ccaaagctct ccaagacctt
21721 ctttgatctg gttcctgccc cctttctgat ccgttgacct gcag
//