LOCUS AH002741 20666 bp DNA linear HUM 10-JUN-2016 DEFINITION Homo sapiens alpha-2 type IV collagen gene, partial cds; and alpha-1 type IV collagen (COL4A1) gene, complete cds. ACCESSION AH002741 J04217 J05039 M26536-M26576 VERSION AH002741.2 KEYWORDS collagen. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2181) AUTHORS Soininen,R., Huotari,M., Hostikka,S.L., Prockop,D.J. and Tryggvason,K. TITLE The structural genes for alpha 1 and alpha 2 chains of human type IV collagen are divergently encoded on opposite DNA strands and have an overlapping promoter region JOURNAL J. Biol. Chem. 263 (33), 17217-17220 (1988) PUBMED 3182844 REFERENCE 2 (bases 1 to 20666) AUTHORS Soininen,R., Huotari,M., Ganguly,A., Prockop,D.J. and Tryggvason,K. TITLE Structural organization of the gene for the alpha 1 chain of human type IV collagen JOURNAL J. Biol. Chem. 264 (23), 13565-13571 (1989) PUBMED 2701944 REFERENCE 3 (bases 2282 to 2341; 2442 to 2682; 2783 to 3234; 3335 to 3739; 3840 to 4197; 4298 to 4543; 4644 to 4894; 4995 to 5360; 5461 to 5724; 5825 to 6070; 6171 to 6259; 6360 to 6759; 6860 to 7138; 7239 to 7429; 7530 to 8037; 8138 to 8370; 8471 to 8840; 8941 to 9225; 9326 to 9733; 9834 to 10333; 10434 to 10584; 10685 to 11002; 11103 to 11322; 11423 to 11615; 11716 to 11909; 12010 to 12852; 12953 to 13400; 13501 to 14263; 14364 to 14651; 14752 to 14948; 15049 to 15331; 15432 to 15777; 15878 to 16211; 16312 to 16544; 16645 to 16857; 16958 to 17056; 17157 to 17469; 17570 to 18360; 18461 to 18773; 18874 to 19157; 19258 to 20666) AUTHORS Tryggvason,K. JOURNAL Unpublished COMMENT On or before Jun 10, 2016 this sequence version replaced J04217.1, M26550.1, M26540.1, M26542.1, M26543.1, M26544.1, M26545.1, M26546.1, M26547.1, M26537.1, M26538.1, M26548.1, M26549.1, M26551.1, M26552.1, M26553.1, M26554.1, M26555.1, M26556.1, M26557.1, M26539.1, M26558.1, M26559.1, M26560.1, M26561.1, M26562.1, M26536.1, M26563.1, M26541.1, M26564.1, M26565.1, M26566.1, M26567.1, M26568.1, M26569.1, M26570.1, M26571.1, M26572.1, M26573.1, M26574.1, M26575.1, M26576.1, AH002741.1. FEATURES Location/Qualifiers source 1..20666 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" /map="13q34" /tissue_lib="ATCC 577281" /tissue_lib="E.Fritsch and R.Poulsom's, and Clontech" intron complement(<1..773) /note="alpha-2 intron C" CDS complement(join(<774..829,951..994)) /note="alpha-2 collagen type IV" /codon_start=1 /product="alpha-2 type IV collagen" /protein_id="AAA53097.1" /db_xref="GDB:G00-119-792" /translation="MGRDQRAVAGPALRRWLLLGTVTVGFLAQSVLA" intron complement(830..950) /note="G00-119-792" /number=2 exon complement(<951..994) /note="alpha-2 collagen type IV, (first expressed exon)" /number=2 intron complement(1039..1739) /note="alpha-2 intron A; G00-119-792" /number=1 exon complement(1740..>2181) /note="G00-119-792" /number=1 gene 1869..20661 /gene="COL4A1" CDS join(1869..1952,2282..2341,2450..2539,2638..2682, 2837..2881,2976..3038,3375..3428,3624..3650,3857..3940, 4027..4089,4508..4543,4810..4851,4995..5081,5474..5500, 5652..5702,5825..5869,5956..6009,6180..6221,6372..6456, 7011..7046,7239..7403,7934..8029,8211..8294,8749..8819, 9013..9204,9377..9545,9969..10061,10148..10252, 10434..10531,10836..10986,11117..11230,11423..11590, 11732..11821,12109..12261,12423..12521,12634..12723, 13178..13317,13778..13904,14003..14083,14364..14462, 14752..14802,15096..15281,15644..15777,16139..16211, 16350..16421,16645..16773,16958..17056,17257..17469, 17781..17958,18617..18731,18874..19046,19276..19357) /gene="COL4A1" /codon_start=1 /product="alpha-1 type IV collagen" /protein_id="AAA53098.1" /db_xref="GDB:G00-119-791" /translation="MGPRLSVWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGV KGQKGERGLPGLQGVIGFPGMQGPEGPQGPPGQKGDTGEPGLPGTKGTRGPPGASGYP GNPGLPGIPGQDGPPGPPGIPGCNGTKGERGPLGPPGLPGFAGNPGPPGLPGMKGDPG EILGHVPGMLLKGERGFPGIPGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKG QMGLSFQGPKGDKGDQGVSGPPGVPGQAQVQEKGDFATKGEKGQKGEPGFQGMPGVGE KGEPGKPGPRGKPGKDGDKGEKGSPGFPGEPGYPGLIGRQGPQGEKGEAGPPGPPGIV IGTGPLGEKGERGYPGTPGPRGEPGPKGFPGLPGQPGPPGLPVPGQAGAPGFPGERGE KGDRGFPGTSLPGPSGRDGLPGPPGSPGPPGQPGYTNGIVECQPGPPGDQGPPGIPGQ PGFIGEIGEKGQKGESCLICDIDGYRGPPGPQGPPGEIGFPGQPGAKGDRGLPGRDGV AGVPGPQGTPGLIGQPGAKGEPGEFYFDLRLKGDKGDPGFPGQPGMPGRAGSPGRDGH PGLPGPKGSPGSVGLKGERGPPGGVGFPGSRGDTGPPGPPGYGPAGPIGDKGQAGFPG GPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGFPGPQGDRGFPGTPGRPGLPGEKG AVGQPGIGFPGPPGPKGVDGLPGDMGPPGTPGRPGFNGLPGNPGVQGQKGEPGVGLPG LKGLPGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGIRGEPGPPGLPGSVGSPGV PGIGPPGARGPPGGQGPPGLSGPPGIKGEKGFPGFPGLDMPGPKGDKGAQGLPGITGQ SGLPGLPGQQGAPGIPGFPGSKGEMGVMGTPGQPGSPGPVGAPGLPGEKGDHGFPGSS GPRGDPGLKGDKGDVGLPGKPGSMDKVDMGSMKGQKGDQGEKGQIGPIGEKGSRGDPG TPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLPGPKGSVGGMGLPGTPGEKGVPGI PGPQGSPGLPGDKGAKGEKGQAGPPGIGIPGLRGEKGDQGIAGFPGSPGEKGEKGSIG IPGMPGSPGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPGTPGPTGP AGQKGEPGSDGIPGSAGEKGEPGLPGRGFPGFPGAKGDKGSKGEVGFPGLAGSPGIPG SKGEQGFMGPPGPQGQPGLPGSPGHATEGPKGDRGPQGQPGLPGLPGPMGPPGLPGID GVKGDKGNPGWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGFQGPKG LPGLQGIKGDQGDQGVPGAKGLPGPPGPPGPYDIIKGEPGLPGPEGPPGLKGLQGLPG PKGQQGVTGLVGIPGPPGIPGFDGAPGQKGEMGPAGPTGPRGFPGPPGPDGLPGSMGP PGTPSVDHGFLVTRHSQTIDDPQCPSGTKILYHGYSLLYVQGNERAHGQDLGTAGSCL RKFSTMPFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGENIRPFISRCAVCE APAMVMAVHSQTIQIPPCPSGWSSLWIGYSFVMHTSAGAEGSGQALASPGSCLEEFRS APFIECHGRGTCNYYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMRR T" sig_peptide 1869..1949 /gene="COL4A1" /note="G00-119-791" mat_peptide join(1950..1952,2282..2341,2450..2539,2638..2682, 2837..2881,2976..3038,3375..3428,3624..3650,3857..3940, 4027..4089,4508..4543,4810..4851,4995..5081,5474..5500, 5652..5702,5825..5869,5956..6009,6180..6221,6372..6456, 7011..7046,7239..7403,7934..8029,8211..8294,8749..8819, 9013..9204,9377..9545,9969..10061,10148..10252, 10434..10531,10836..10986,11117..11230,11423..11590, 11732..11821,12109..12261,12423..12521,12634..12723, 13178..13317,13778..13904,14003..14083,14364..14462, 14752..14802,15096..15281,15644..15777,16139..16211, 16350..16421,16645..16773,16958..17056,17257..17469, 17781..17958,18617..18731,18874..19046,19276..19354) /gene="COL4A1" /product="alpha-1 type IV collagen" /note="G00-119-791" intron 1953..>2181 /gene="COL4A1" /note="G00-119-791" /number=1 gap 2182..2281 /estimated_length=unknown exon 2282..2341 /gene="COL4A1" /note="G00-119-791" /number=2 gap 2342..2441 /estimated_length=unknown intron <2442..2449 /gene="COL4A1" /note="G00-119-791" /number=2 exon 2450..2539 /gene="COL4A1" /note="G00-119-791" /number=3 intron 2540..2637 /gene="COL4A1" /note="G00-119-791" /number=3 exon 2638..2682 /gene="COL4A1" /note="G00-119-791" /number=4 gap 2683..2782 /estimated_length=unknown intron <2783..2836 /gene="COL4A1" /note="G00-119-791" /number=4 exon 2837..2881 /gene="COL4A1" /note="G00-119-791" /number=5 intron 2882..2975 /gene="COL4A1" /note="G00-119-791" /number=5 exon 2976..3038 /gene="COL4A1" /note="G00-119-791" /number=6 intron 3039..>3234 /gene="COL4A1" /note="G00-119-791" /number=6 gap 3235..3334 /estimated_length=unknown intron <3335..3374 /gene="COL4A1" /note="G00-119-791" /number=6 exon 3375..3428 /gene="COL4A1" /note="G00-119-791" /number=7 intron 3429..3623 /gene="COL4A1" /note="G00-119-791" /number=7 exon 3624..3650 /gene="COL4A1" /note="G00-119-791" /number=8 intron 3651..>3739 /gene="COL4A1" /note="G00-119-791" /number=8 gap 3740..3839 /estimated_length=unknown intron <3840..3856 /gene="COL4A1" /note="G00-119-791" /number=8 exon 3857..3940 /gene="COL4A1" /note="G00-119-791" /number=9 intron 3941..4026 /gene="COL4A1" /note="G00-119-791" /number=9 exon 4027..4089 /gene="COL4A1" /note="G00-119-791" /number=10 intron 4090..>4197 /gene="COL4A1" /note="G00-119-791" /number=10 gap 4198..4297 /estimated_length=unknown intron <4298..4507 /gene="COL4A1" /note="G00-119-791" /number=10 exon 4508..4543 /gene="COL4A1" /note="G00-119-791" /number=11 gap 4544..4643 /estimated_length=unknown intron <4644..4809 /gene="COL4A1" /note="G00-119-791" /number=11 exon 4810..4851 /gene="COL4A1" /note="G00-119-791" /number=12 intron 4852..>4894 /gene="COL4A1" /note="G00-119-791" /number=12 gap 4895..4994 /estimated_length=unknown exon 4995..5081 /gene="COL4A1" /note="G00-119-791" /number=13 intron 5082..>5360 /gene="COL4A1" /note="G00-119-791" /number=13 gap 5361..5460 /estimated_length=unknown intron <5461..5473 /gene="COL4A1" /note="G00-119-791" /number=13 exon 5474..5500 /gene="COL4A1" /note="alpha-1 collagen type IV; G00-119-791" /number=14 intron 5501..5651 /gene="COL4A1" /note="G00-119-791" /number=14 exon 5652..5702 /gene="COL4A1" /note="G00-119-791" /number=15 intron 5703..>5724 /gene="COL4A1" /note="G00-119-791" /number=15 gap 5725..5824 /estimated_length=unknown exon 5825..5869 /gene="COL4A1" /note="G00-119-791" /number=16 intron 5870..5955 /gene="COL4A1" /note="G00-119-791" /number=16 exon 5956..6009 /gene="COL4A1" /note="G00-119-791" /number=17 intron 6010..>6070 /gene="COL4A1" /note="G00-119-791" /number=17 gap 6071..6170 /estimated_length=unknown intron <6171..6179 /gene="COL4A1" /note="G00-119-791" /number=17 exon 6180..6221 /gene="COL4A1" /note="G00-119-791" /number=18 intron 6222..>6259 /gene="COL4A1" /note="G00-119-791" /number=18 gap 6260..6359 /estimated_length=unknown intron <6360..6371 /gene="COL4A1" /note="G00-119-791" /number=18 exon 6372..6456 /gene="COL4A1" /note="G00-119-791" /number=19 intron 6457..>6759 /gene="COL4A1" /note="G00-119-791" /number=19 gap 6760..6859 /estimated_length=unknown intron <6860..7010 /gene="COL4A1" /note="G00-119-791" /number=19 exon 7011..7046 /gene="COL4A1" /note="G00-119-791" /number=20 intron 7047..>7138 /gene="COL4A1" /note="G00-119-791" /number=20 gap 7139..7238 /estimated_length=unknown exon 7239..7403 /gene="COL4A1" /note="alpha-1 collagen type IV; G00-119-791" /number=21 intron 7404..>7429 /gene="COL4A1" /note="G00-119-791" /number=21 gap 7430..7529 /estimated_length=unknown intron <7530..7933 /gene="COL4A1" /note="G00-119-791" /number=21 exon 7934..8029 /gene="COL4A1" /note="G00-119-791" /number=22 intron 8030..>8037 /gene="COL4A1" /note="G00-119-791" /number=22 gap 8038..8137 /estimated_length=unknown intron <8138..8210 /gene="COL4A1" /note="G00-119-791" /number=22 exon 8211..8294 /gene="COL4A1" /note="G00-119-791" /number=23 intron 8295..>8370 /gene="COL4A1" /note="G00-119-791" /number=23 gap 8371..8470 /estimated_length=unknown intron <8471..8748 /gene="COL4A1" /note="G00-119-791" /number=23 exon 8749..8819 /gene="COL4A1" /note="G00-119-791" /number=24 intron 8820..>8840 /gene="COL4A1" /note="G00-119-791" /number=24 gap 8841..8940 /estimated_length=unknown intron <8941..9012 /gene="COL4A1" /note="G00-119-791" /number=24 exon 9013..9204 /gene="COL4A1" /note="G00-119-791" /number=25 intron 9205..>9225 /gene="COL4A1" /note="G00-119-791" /number=25 gap 9226..9325 /estimated_length=unknown intron <9326..9376 /gene="COL4A1" /note="G00-119-791" /number=25 exon 9377..9545 /gene="COL4A1" /note="G00-119-791" /number=26 intron 9546..>9733 /gene="COL4A1" /note="G00-119-791" /number=26 gap 9734..9833 /estimated_length=unknown intron <9834..9968 /gene="COL4A1" /note="G00-119-791" /number=26 exon 9969..10061 /gene="COL4A1" /note="G00-119-791" /number=27 intron 10062..10147 /gene="COL4A1" /note="G00-119-791" /number=27 exon 10148..10252 /gene="COL4A1" /note="G00-119-791" /number=28 intron 10253..>10333 /gene="COL4A1" /note="G00-119-791" /number=28 gap 10334..10433 /estimated_length=unknown exon 10434..10531 /gene="COL4A1" /note="G00-119-791" /number=29 intron 10532..>10584 /gene="COL4A1" /note="G00-119-791" /number=29 gap 10585..10684 /estimated_length=unknown intron <10685..10835 /gene="COL4A1" /note="G00-119-791" /number=29 exon 10836..10986 /gene="COL4A1" /note="G00-119-791" /number=30 intron 10987..>11002 /gene="COL4A1" /note="G00-119-791" /number=30 gap 11003..11102 /estimated_length=unknown intron <11103..11116 /gene="COL4A1" /note="G00-119-791" /number=30 exon 11117..11230 /gene="COL4A1" /note="G00-119-791" /number=31 intron 11231..>11322 /gene="COL4A1" /note="G00-119-791" /number=31 gap 11323..11422 /estimated_length=unknown exon 11423..11590 /gene="COL4A1" /note="G00-119-791" /number=32 intron 11591..>11615 /gene="COL4A1" /note="G00-119-791" /number=32 gap 11616..11715 /estimated_length=unknown intron <11716..11731 /gene="COL4A1" /note="G00-119-791" /number=32 exon 11732..11821 /gene="COL4A1" /note="G00-119-791" /number=33 intron 11822..>11909 /gene="COL4A1" /note="G00-119-791" /number=33 gap 11910..12009 /estimated_length=unknown intron <12010..12108 /gene="COL4A1" /note="G00-119-791" /number=33 exon 12109..12261 /gene="COL4A1" /note="G00-119-791" /number=34 intron 12266..12422 /gene="COL4A1" /note="G00-119-791" /number=34 exon 12423..12521 /gene="COL4A1" /note="G00-119-791" /number=35 intron 12522..12633 /gene="COL4A1" /note="G00-119-791" /number=35 exon 12634..12723 /gene="COL4A1" /note="G00-119-791" /number=36 intron 12724..>12852 /gene="COL4A1" /note="G00-119-791" /number=36 gap 12853..12952 /estimated_length=unknown intron <12953..13177 /gene="COL4A1" /note="G00-119-791" /number=36 exon 13178..13317 /gene="COL4A1" /note="G00-119-791" /number=37 intron 13318..>13400 /gene="COL4A1" /note="G00-119-791" /number=37 gap 13401..13500 /estimated_length=unknown intron <13501..13777 /gene="COL4A1" /note="G00-119-791" /number=37 exon 13778..13904 /gene="COL4A1" /note="G00-119-791" /number=38 intron 13905..14002 /gene="COL4A1" /note="G00-119-791" /number=38 exon 14003..14083 /gene="COL4A1" /note="G00-119-791" /number=39 intron 14084..>14263 /gene="COL4A1" /note="G00-119-791" /number=39 gap 14264..14363 /estimated_length=unknown exon 14364..14462 /gene="COL4A1" /note="G00-119-791" /number=40 intron 14463..>14651 /gene="COL4A1" /note="G00-119-791" /number=40 gap 14652..14751 /estimated_length=unknown exon 14752..14802 /gene="COL4A1" /note="G00-119-791" /number=41 intron 14803..>14948 /gene="COL4A1" /note="G00-119-791" /number=41 gap 14949..15048 /estimated_length=unknown intron <15049..15095 /gene="COL4A1" /note="G00-119-791" /number=41 exon 15096..15281 /gene="COL4A1" /note="G00-119-791" /number=42 intron 15282..>15331 /gene="COL4A1" /note="G00-119-791" /number=42 gap 15332..15431 /estimated_length=unknown intron <15432..15643 /gene="COL4A1" /note="G00-119-791" /number=42 exon 15644..15777 /gene="COL4A1" /note="G00-119-791" /number=43 gap 15778..15877 /estimated_length=unknown intron <15878..16138 /gene="COL4A1" /note="G00-119-791" /number=43 exon 16139..16211 /gene="COL4A1" /note="G00-119-791" /number=44 gap 16212..16311 /estimated_length=unknown intron <16312..16349 /gene="COL4A1" /note="G00-119-791" /number=44 exon 16350..16421 /gene="COL4A1" /note="G00-119-791" /number=45 intron 16422..>16544 /gene="COL4A1" /note="alpha-1 intron AQ" gap 16545..16644 /estimated_length=unknown exon 16645..16773 /gene="COL4A1" /note="alpha-1 collagen type IV, 6; G00-119-791" /number=46 intron 16774..>16857 /gene="COL4A1" /note="G00-119-791" /number=46 gap 16858..16957 /estimated_length=unknown exon 16958..17056 /gene="COL4A1" /note="alpha-1 collagen type IV, 7; G00-119-791" /number=47 gap 17057..17156 /estimated_length=unknown intron <17157..17256 /gene="COL4A1" /note="G00-119-791" /number=47 exon 17257..17469 /gene="COL4A1" /note="G00-119-791" /number=48 gap 17470..17569 /estimated_length=unknown intron <17570..17780 /gene="COL4A1" /note="G00-119-791" /number=48 exon 17781..17958 /gene="COL4A1" /note="G00-119-791" /number=49 intron 17959..>18360 /gene="COL4A1" /note="G00-119-791" /number=49 gap 18361..18460 /estimated_length=unknown intron <18461..18616 /gene="COL4A1" /note="G00-119-791" /number=49 exon 18617..18731 /gene="COL4A1" /note="G00-119-791" /number=50 intron 18732..>18773 /gene="COL4A1" /note="G00-119-791" /number=50 gap 18774..18873 /estimated_length=unknown exon 18874..19046 /gene="COL4A1" /note="G00-119-791" /number=51 intron 19047..>19157 /gene="COL4A1" /note="G00-119-791" /number=51 gap 19158..19257 /estimated_length=unknown intron <19258..19275 /gene="COL4A1" /note="G00-119-791" /number=51 exon 19276..20661 /gene="COL4A1" /note="G00-119-791" /number=1 BASE COUNT 3883 a 4123 c 4373 g 4187 t ORIGIN Chromosome 13q34. 1 gaattcacct accaaacctc ttcattttag ccatggttac ataaagctca gtaaaagtgg 61 ctcctactct ttaaacaaaa cgtataaaat ggacaaagcc tcgcagtgca gtcgccagct 121 gcacccacgg tatcggcagg gctgtgcacg gccggcgtgg aggggcaggc tcggcgaaca 181 ggatcccagt gtggcgcccg gagcagctct tccagggccg gaacaggact tgggagaaaa 241 cgcggttttt ctgccattgt tccctcgcaa tgcttcaatt caaacaatac taaggttgtc 301 tcgttcgcct cctcgccccg cccctttacc cagagactcc catttaaaaa cacaaaaaac 361 cctcagtccg tgacagagac tccccaccag ctcccgagtt ccaggaaaac ttttttcgcc 421 ctaagcctgg cgtctcactg cctagtgact gacacacttt cctggcctct acggctgcag 481 ctttgcccaa agcaacgaaa acggtgcctc aggctacatt ttaagttccc agttcccgtg 541 aaaggccgac gcttgcaaaa cacaggattg acgctgggaa cccaagggga aggaaaggtg 601 gaaagcatga tgagggaaga gcaagctttc tcagaacctt ggtgggcagc ctggcgaaca 661 tccacacgca cacacacact cgggagcgca cggacgagct gccttctcca aggccactca 721 aacgtcccaa ccactctcgg gcgcgcaagt ccaagcgcgg gagccaggac ttaccgccaa 781 gacgctctgg gcgaggaacc ccacggtcac tgtccccagc agcagccacc tgcatgggaa 841 agggaggaag agagacgtga acgtgcgggc ccggcttgtc agtccccgag agttacaccg 901 aagggtccat gcgcgcgtga cccacgggga ccaggcagaa agtcgcttac cgccgtaggg 961 cagggccggc caccgcgcgc tggtctctcc ccatgctggc ggttcgtcca ctctgggccc 1021 cggtcagtcc cacttagcct agaggcgaca gacaaagcga gtttagcgca ggatgaggga 1081 ggcagcccat cctcaccgcc ggtctcggtc cgcgagacgc ggggacagcg cggtgcgcgg 1141 cccgcatgca ggggtgaccg gaggccgtcc cccccacgac ccggaaagaa ggagagcctc 1201 ccgttaggcc cctgtgggtg ctccttggcc gaggagctcg gtcttcgctc tcccaccctc 1261 cccctttcta ctcccaagca ggagagcgtg cagccctagc ctgcacaagg cgctccaagt 1321 cggtgctctc gggccagggt ctgggcgccg ctcgggggtc gctctcacct cgggctcggc 1381 ttcaggggct gctgcccgaa cgcattggcc cttccagaag cacccgccgg cggcacaccg 1441 gcagggcggc cggccttgct gggctctctg ggcgccggcc ccgggggctc gggcggcccc 1501 tttcggtcct cggcctggat ccgcgagcgc gcgggcgcaa ggggttggga cgcggcagcc 1561 tcttgagtgc ggagcgcgga gccctggtgt cccggcgcac ggcagccaca ctcccgggcc 1621 gcgcgctccc gccgcctctt acccgcgccg cagggtcctc ccctttgagg cgccgcccgc 1681 gcaccgccgg gggggagggg gcagcgccaa caaattgggg agctcggccc gccgcgctca 1741 ggtctccgct tggagccgcc gcacccggga cggtgcgtat cgctggaagt ccggccttcc 1801 gagagctagc tgtccgccgc ggcccccgca cgccgggcag ccgtccctcg cgcctcgggc 1861 gcgccaccat ggggccccgg ctcagcgtct ggctgctgct gctgcccgcc gcccttctgc 1921 tccacgagga gcacagccgg gccgctgcga aggtgagttc ccggccagct ccgctcccgg 1981 cgtcccgccc cgagcttggg cgccccgaga ggcccctttg tccgcgcctg gacccgtccg 2041 cctgcccctc gggggtcgcg cgtggcacgg ccaggtgcat tctctgggcc ggggttcgtt 2101 gggggtccct gtaggctacg atcgcgcatt ggtggaccga gcctcctttg ttatggtatg 2161 ggtactggag agttaaggaa tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2221 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2281 nggtggctgt gctggctctg gctgtggcaa atgtgactgc catggagtga agggacaaaa 2341 gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nttctgtagg gtgaaagagg 2461 cctcccgggg ttacaaggtg tcattgggtt tcctggaatg caaggacctg aggggccaca 2521 gggaccacca ggacaaaagg taagcacggc tgtgggattg ggggtgggtg gatgtaagat 2581 tgcccgattc ttgagatagc ggtgatcaat gacaatggca tttctattct gttccagggt 2641 gatactggag aaccaggact acctggaaca aaagggacaa gannnnnnnn nnnnnnnnnn 2701 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2761 nnnnnnnnnn nnnnnnnnnn nngagaagtg aggcttcccg gctcttcact gacacgcttt 2821 tacttccgcc tcacagggac ctccgggagc atctggctac cctggaaacc caggacttcc 2881 cgtatgtata gaaaacgtgc tctacttctt ttatgaaata ttcttccatc aggcgatttt 2941 ctgcctgggt taaattttca ctttcttcat tgcagggaat tcctggccaa gacggcccgc 3001 caggcccccc aggtattcca ggatgcaatg gcacaaaggt aaatccagaa ccgagaccct 3061 cctttttgtg tgtgtttacg taatttttgc atattaagga gtcaggtagt gtgattctgt 3121 taatagagtt ttatttgcca caattggaaa gttgcttgtc ttaaagtttg ctttatttag 3181 taaggaaata cagttttccc atatttagtg taccagaaag atataattgg tcccnnnnnn 3241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3301 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnctgcca cacacagtgc gtatgaatga 3361 tgctctgttc ccagggggag agagggccgc tcgggcctcc tggcttgcct ggtttcgcag 3421 gaaatcccgt gagtagaggt tatttagggc agaacttctt tcttttagtt atgacttctc 3481 tcttttcatt ccatttcctt ttctttcttt ctttgcttat gttttaataa cttatatttt 3541 atatataata ttacatgata acaattttaa tgctaatgat attttgagaa aaacaaaact 3601 aacaccaaaa ctttcttttt tagggaccac caggcttacc agggatgaag gtacatttta 3661 ttttattttg ctttattgga ttcattacag ggaatctgtt tacaacaagt gccatagaag 3721 cacattctct ttaaaccacn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc 3841 tttcttcatc ctctagggtg atccaggtga gatacttggc catgtgcccg ggatgctgtt 3901 gaaaggtgaa agaggatttc ccggaatccc agggactcca gtaagcattt gctgaatcat 3961 tatgaacatg tgccacctca ccctcccagt tcgcatctga actcatttct ttctcatgca 4021 ttctagggcc caccaggact gccagggctt caaggtcctg ttgggcctcc aggatttacc 4081 ggaccaccag taagttttgg gggctgtctc tccgaggcaa tcatttaaaa aacaactata 4141 ttgaggtttg aaaataaata ataatttagt aaaaacttta cacagttgtg gtcgctcnnn 4201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnggg agcagggaga tggattggta 4321 ttggtatggt cacacctgag ggtcctgcac ctgctaggag tggggaggga ggggaagccc 4381 aggaaaaata tttcacaaat aaattatcat gttgaccaga gaatcttaag ataacgtcag 4441 cctgaagaag ggcttaaagc tccctagagt tctgaatcct atttaattaa aatgctctct 4501 ctcccagggt cccccaggcc ctcccggccc tccaggtgaa aagnnnnnnn nnnnnnnnnn 4561 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4621 nnnnnnnnnn nnnnnnnnnn nnngaattcc agcatgatca gttactttcc agaaaatttt 4681 catgagagtt atgggacaaa gctattgcct gaaatctacc atcttattgc attttgtgtg 4741 aggtttgtag aatcgacttt gaacgaagta gacaaatcta ataatgagag cctaattttt 4801 aatccacagg gacaaatggg cttaagtttt caaggaccaa aaggtgacaa ggtgagtgca 4861 tattgctctg gagtgccttt ccatgttcgg aaaannnnnn nnnnnnnnnn nnnnnnnnnn 4921 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 4981 nnnnnnnnnn nnnnggtgac caaggggtca gtgggcctcc aggagtacca ggacaagctc 5041 aagttcaaga aaaaggagac ttcgccacca agggagaaaa ggtatgaatg ggcttcatca 5101 gtgaatactg attttctaat tttggacaaa ttccaaaaga caaagagaaa atgggtcaaa 5161 actcagtttt tcagctacat tcaatcctgg tctggaaaac agatttatag actgttcagt 5221 ccataaaata cgagcccctt tgagagcaga gacagtggaa ctggattggt ccaagcctga 5281 tctgaggctc acccggctgg atcagggctt cttaaatgtg ccccaagttg gagatggctt 5341 ggtcaggagg agggagtggg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5461 gaattccttt cagggccaaa aaggtgaacc tggatttcag gtcagtactc actttctgcc 5521 tatcattttt aggtccaaag acataaagat ttaagtttca agtttttttc accttaagtt 5581 taatatctat ctaatttaaa ttttgcctta aatagacgaa ctaaatctct ttgtcttttt 5641 attcctttaa ggggatgcca ggggtcggag agaaaggtga acccggaaaa ccaggaccca 5701 gagtaagtgc cttttccaaa gtccnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5761 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5821 nnnnggcaaa cccggaaaag atggtgacaa aggggaaaaa gggagtcccg taagtgtttc 5881 tctgtgattt ttacaagcag gctcactgtt ccaccactgg cctctgacgg ttactgcttc 5941 tctcttcggt ttcagggttt tcctggtgaa cccgggtacc caggactcat aggccgccag 6001 ggcccgcagg taaacgcagg cactattgtg agccctttat cttcactttc caattcaacc 6061 cagaagtctg nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn cttttgtagg 6181 gagaaaaggg tgaagcaggt cctcctggcc cacctggaat tgtgagtaag agtgctgtcc 6241 ctgggtctgt gagagcgctn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6301 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt 6361 tctcgattca ggttataggc acaggacctt tgggagaaaa aggagagagg ggctaccctg 6421 gaactccggg gccaagagga gagccaggcc caaaaggtag ctttttcttg cctttttttc 6481 ccccctcctc ctactccccc tccttctctc ctcctcctct tcctcctcct cgtcctcttc 6541 ctcctcctcc tcctgttcct cctcctcctt cctcctcccc ctccttctct tcctccgccc 6601 cctccttctc ttctcttcct cctccccctc ctcctctttc tcctctttct cctcttcctc 6661 ctcctcctcc ttctctctct ctccccacgc tttctatttc ttcccctctt tttgtctagt 6721 tttgacctgt gttcctccat attttggaag catctgcagn nnnnnnnnnn nnnnnnnnnn 6781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6841 nnnnnnnnnn nnnnnnnnnc ctcgggttgg tctcccgctc tgttttctac gttccctttg 6901 gccatggaca cttaccaatg caccaagcaa ggctttcagt agagaaataa gaagtgtata 6961 caaacgggtg aactgttctg tttgatccaa cactctcttt ctcttttcag gtttcccagg 7021 actaccaggc caacccggac ctccaggtga gactccttaa ctgatttgtt atagcatact 7081 tgcacacctt ctgtgctgtt tatattctga atatataatt tatcacgaat gttagatann 7141 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnngc ctccctgtac ctgggcaggc 7261 tggtgcccct ggcttccctg gtgaaagagg agaaaaaggt gaccgaggat ttcctggtac 7321 atctctgcca ggaccaagtg gaagagatgg gctcccgggt cctcctggtt cccccgggcc 7381 ccctgggcag cctggctaca caagtgagtt ccctcagaaa tggtttaatn nnnnnnnnnn 7441 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7501 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt tttctagtct cagctttgac tagatgcatg 7561 accctggaca aatcacttac ctttcttgag cctcagtttc ttcatttgta aaataagggt 7621 gatggtaggc acttcgcaag atgaaatgaa atagtataca ggacagggct gatgcaccta 7681 cagatacttc agaatttgtg aaatagaaat gttggcagtg atggcttggg atggtgagtg 7741 ggtggtgggt ggtgtgtggt gattatgttt tgcaagtggt ggatagccta agatgatttt 7801 gaaacaaact aaagctgaat ttctaacact tatcaaatag ccgtaaaata ttttggtttt 7861 ataatagagg ttgagttagg agagggtaaa aacacctagc tatgatccta taactctgac 7921 aactcaattt cagatggaat tgtggaatgt cagcccggac ctccaggtga ccagggtcct 7981 cctggaattc cagggcagcc aggatttata ggcgaaattg gagagaaagg taagaaannn 8041 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8101 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnctg ttctctcagt gcactgtgga 8161 gacttctggt cacatgttac acgtaagcac ctttgtgttt tcttttttag gtcaaaaagg 8221 agagagttgc ctcatctgtg atatagacgg atatcggggg cctcccgggc cacagggacc 8281 cccgggagaa ataggtaaga cccacatgtg aaaaggacca ggtcagagtt tgctttggtg 8341 tgttggcatc tttcctgtga aagggatttc nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8461 nnnnnnnnnn gaattcatgt ttttccttta agtggtccct gggtgtgtgg gcaggggggc 8521 aggtgtgggg tgctggaagc attcctgctt ccgggtgtac atgtgcgtcg ggtcacctat 8581 tccctttctg agtccgtctt gggcatttta gttattactc tctttggatt atccaagcaa 8641 tcatccactt cctatacaaa tttttatgct tttgagtctt aagaaaaggt tttgagaagt 8701 gtctcatgaa tgctgtgttg ctgtcttttt ccctctctct cacgaaaggt ttcccagggc 8761 agccaggggc caagggcgac agaggtttgc ctggcagaga tggtgttgca ggagtgccag 8821 taagtaaacc tgtctgagtt nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8881 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8941 cttgtgttac agtcacactg gtgtgcttca tggatgggtt cttctgtgga tcacaaggct 9001 ttgttcttac agggccctca aggtacacca gggctgatag gccagccagg agccaagggg 9061 gagcctggtg agttttattt cgacttgcgg ctcaaaggtg acaaaggaga cccaggcttt 9121 ccaggacagc ccggcatgcc agggagagcg ggttctcctg gaagagatgg ccatccgggt 9181 cttcctggcc ccaagggctc gccggtatgt ttatctccag actttnnnnn nnnnnnnnnn 9241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9301 nnnnnnnnnn nnnnnnnnnn nnnnngggtg ttggtgtgtc tctgataacg gtgtcttttt 9361 cttcttctca catcagggtt ctgtaggatt gaaaggagag cgtggccccc ctggaggagt 9421 tggattccca ggcagtcgtg gtgacaccgg cccccctggg cctccaggat atggtcctgc 9481 tggtcccatt ggtgacaaag gacaagcagg ctttcctgga ggccctggat ccccaggcct 9541 gccaggtgag gcctgagaaa ctcatgcagc gtgaagttgt aaacggaggt tagggcaggc 9601 acctcgggtg cacacagagg cctggggctc gcataggcca gggcgcacac ttggctttga 9661 tccttccagg aattggggca ggacctgggc aagccttcag ccttttgtgc ctccctttcc 9721 tcatgtatgt gatnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 9781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnaggcaga 9841 ggctctgctc tccgggcact cgtggtgctg ggtaggatgt ggcgtgcagc cctccactgc 9901 gtgtcctccc tgtgttccgt gtggtcctca ttccttcatc gtttgtcttt gctctttttt 9961 tcctctaggt ccaaagggtg aaccaggaaa aattgttcct ttaccaggcc cccctggagc 10021 agaaggactg ccggggtccc caggcttccc aggtccccaa ggtacgattg gaattccagc 10081 tcaccgggat accacgagag tttcctcacg tttttctcac gctgactgtt ggtctttctc 10141 accacaggag accgaggctt tcccggaacc ccaggaaggc caggcctgcc aggagagaag 10201 ggcgctgtgg gccagccagg cattggattt ccagggcccc ccggccccaa aggtaaccct 10261 gccagacgga cccgagcctt tggtccactg tgatttgtga gaaaagagga ggtggtagac 10321 atcaaaaaca ccannnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10381 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnngtgttga 10441 cggcttacct ggagacatgg ggccaccggg gactccaggt cgcccgggat ttaatggctt 10501 acctgggaac ccaggtgtgc agggccagaa ggtgagtgcc aagcattccc tcctgaccct 10561 tctcccccat ccttgttatt agttnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10621 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10681 nnnnaagctt tgcaaacgct tgaaaagggt tgagcagata atattcttac taaatgcatt 10741 ggcagtatct gttaagtctt cacttacgct attggtttga taaaaatact gactttgcaa 10801 atcaatttgc ttctctgtgc tttggggatt ttcagggaga gcctggagtt ggtctaccgg 10861 gactcaaagg tttgccaggt cttcccggca ttcctggcac acccggggag aaggggagca 10921 ttggggtacc aggcgttcct ggagaacatg gagcgatcgg accccctggg cttcagggga 10981 tcagaggtaa cttcatgcag atnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11041 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11101 nntttttctt ctccaggtga accgggacct cctggattgc caggctccgt ggggtctcca 11161 ggagttccag gaataggccc ccctggagct aggggtcccc ctggaggaca gggaccaccg 11221 gggttgtcag gtgagtgaca tgcttctaga attatctgtt cccaaaagca tgcttttctg 11281 aagcatgggg gatgaggccg gggtcccaat aaagtgaagc ttnnnnnnnn nnnnnnnnnn 11341 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11401 nnnnnnnnnn nnnnnnnnnn nngccctcct ggaataaaag gagagaaggg tttccccgga 11461 ttccctggac tggacatgcc gggccctaaa ggagataaag gggctcaagg actccctggc 11521 ataacgggac agtcggggct ccctggcctt cctggacagc agggggctcc tgggattcct 11581 gggtttccag gtaagtgatt tttgaacttc tgcctnnnnn nnnnnnnnnn nnnnnnnnnn 11641 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11701 nnnnnnnnnn nnnnntacgt tttggccaca ggttccaagg gagaaatggg cgtcatgggg 11761 acccccgggc agccgggctc accaggacca gtgggtgctc ctggattacc gggtgaaaaa 11821 ggtaggggag ctgataaccc tagaagctca ttttgtgtct atttctcgat gccacttggg 11881 aaattctcaa gttgtcatag atcatatttn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11941 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12001 nnnnnnnnng aattcatgct gctgtccctg tcaacaagaa cgtgagcctt ttgagcacct 12061 ctgtgacact ggccccaggc tcaccagtgc cttctctttc ctctgcaggg gaccatggct 12121 ttccgggctc ctcaggaccc aggggagacc ctggcttgaa aggtgataag ggggatgtcg 12181 gtctccctgg caagcctggc tccatggata aggtggacat gggcagcatg aagggccaga 12241 aaggagacca aggagagaaa ggcaagtcca ggaggccttc tcatggccaa gctcggaacc 12301 acaatgtgcc tttcctgggt tatcgggtcc tccataaata ataaccaaac caacctgcgt 12361 ttctgcagca acttgcttgc aagcaaatgc gggtcattgc atttttcttc tctgatgtgt 12421 aggacaaatt ggaccaattg gtgagaaggg atcccgagga gaccctggga ccccaggagt 12481 gcctggaaag gacgggcagg caggacagcc tgggcagcca ggtacagtgt ggagctccgg 12541 tccccagtgt ggcaagggct caggatgagg cctgtgttcc ccagctcttt ctgtagatgt 12601 caacctgtca gactgctttc tttatggttc taggacctaa aggtgatcca ggtataagtg 12661 gaaccccagg tgctccagga cttccgggac caaaaggatc tgttggtgga atgggcttgc 12721 caggtaaact ggatttagaa gaggattctt tagcatgtgt gcgtgtgata ctggtagggc 12781 atgtctgcag aatgagatcc tcttcagctg agacggacag aatccaggca cctgtgtctt 12841 tcacatgaat tcnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12901 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nncctgcatc 12961 ccttgcccac aacgttcacc tgcttacctg agacaaagtg caaaaggaga gaaagggcag 13021 gcaggcccac ctggcatagg catcccagga cttgcgtggt gaaaaggtaa ccagcgctct 13081 gcgtggaaag gtccccttct cccactcagt ggagatgctg cctgagaaat ctcctagcta 13141 agcccaaatc aagtttcaat ttggtttgtg tttataggaa cacctggaga gaaaggtgtg 13201 cctggcatcc ctggcccaca aggttcacct ggcttacctg gagacaaagg tgcaaaagga 13261 gagaaagggc aggcaggccc acctggcata ggcatcccag gactgcgtgg tgaaaaggta 13321 accagcgctc tggctggaaa ggtccccttc tcccagctca gtggagatgc tgcctgagaa 13381 atgctcagca aatatctcag nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13441 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13501 tgtcaaggct aaggtgggag acagctgtca gtgggataaa acagagagga tggagggcaa 13561 gaggaatggc gggtaggaaa ccagatggtt ggctctactg ctcttaagtt ttcctctcca 13621 ggtcagaagt ggactttgat tcttgatgag aaatctggaa aggagttttg caatgcaata 13681 ttcattgcac caaaatgtat accattatct ataacatgcc tacagatgca tgcccatgga 13741 caaatgtaaa gttaatatgc ctgactttta cttgcaggga gatcaaggga tagcgggttt 13801 cccaggaagc cctggagaga agggagaaaa aggaagcatt gggatcccag gaatgccagg 13861 gtccccaggc cttaaagggt ctcccgggag tgttggctat ccaggtaggt gaaggggggc 13921 ttctcttgga tttggggaat ctaagaaatt caaagaaaag cccaaactta tggatgttct 13981 ggtgttttgt ttttttcttt aggaagtcct gggctacctg gagaaaaagg tgacaaaggc 14041 ctcccaggat tggatggcat ccctggtgtc aaaggagaag caggtagagg gccctctttg 14101 ggacacaccc gggcacatag agaacagtgg ccaggggaat gggtaggccc caagtgaagt 14161 tccatcacag acgggggtag caggggtagg cctaactgtc cctgtctccc atcttcagat 14221 ggccaaactg atctagtcag gttacagatc ttctctcagg tctnnnnnnn nnnnnnnnnn 14281 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14341 nnnnnnnnnn nnnnnnnnnn nnngtcttcc tgggactcct ggccccacag gcccagctgg 14401 ccagaaaggg gagccaggca gtgatggaat cccggggtca gcaggagaga agggtgaacc 14461 aggtatggcc caacgcccca ttccctatca gaatcagcag tgtccactcc cagagacctg 14521 cagactgcct gtgttcagtg aataaagctg ggcacagaat ggcccctaga ctgcacatcc 14581 ctgcaactac ttaaacactg atgtggatgt gtggatagag tgggatgagt ggaggggaag 14641 aagggaaggg gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14701 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngtctaccag 14761 gaagaggatt cccagggttt ccaggggcca aaggagacaa aggtaatctt tgctcactgt 14821 gctcttcctt ttcaaccaac cgctgctgcc tgattagcag aacaaggggc agtgtcttca 14881 ggatgaagcc ccatccctgt tggccaggca gcaacgggat ggaggatggg tgctgtgccc 14941 agacgagann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15001 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnntt ctgctgctgc 15061 gtttcccatc acctcaaact gtttttcttg ttcaggttca aagggtgagg tgggtttccc 15121 aggattagcc gggagcccag gaattcctgg atccaaagga gagcaaggat tcatgggtcc 15181 tccggggccc cagggacagc cggggttacc gggatcccca ggccatgcca cggaggggcc 15241 caaaggagac cgcggacctc agggccagcc tggcctgcca ggtaagggca tctcgccagg 15301 ggccagggct gaggactggg acagaattca cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15421 nnnnnnnnnn ntttctattc cctccctccc tcccttcctc cctttcctcc ctccctcctt 15481 ccttccttcc tggacctgcc tcgatttctg tctcaagggt tgtcactgtt ggtccagtgt 15541 tgtatcagtg aggggccgtg ggtggcagta ttgatagccc cacagcccag ggacacttgt 15601 gcatatttaa ccccaaatgt gtggctttct tctcttccat taggacttcc gggacccatg 15661 gggcctccag ggcttcctgg gattgatgga gttaaaggtg acaaaggaaa tccaggctgg 15721 ccaggagcac ccggtgtccc agggcccaag ggagaccctg gattccaggg catgcctnnn 15781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15841 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncct caggcgcttc ccctgtggat 15901 ggacaagcag aggagataca tgggagggaa acaaatgctt ctactttcca gcttcccgat 15961 tgaggcacat gcaaaggtgt tgttattctg tcatttaaga aaccacaagg caccatttgt 16021 tcacaaatgt gctttccaga acattgaggg gctgtttcca ttttccaact agcaggttac 16081 tggagagact taacctacat atcctgagca attcccaacc tcctctttgt gtgtctaggg 16141 tattggtggc tctccaggaa tcacaggctc taagggtgat atggggcctc caggagttcc 16201 aggatttcaa gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngaattctgt 16321 gcaagacagt gctttccctc tgcttttagg tccaaaaggt cttcctggcc tccagggaat 16381 taaaggtgat caaggcgatc aaggcgtccc gggagctaaa ggtaggagag tttgttgatc 16441 tgtggaaccc ttactggtgc tttgtgaaaa tgtaaaagcc aggaatgcac agaattgggg 16501 tgtttggttt ttcatatgtg aaatgactca aaaatcatta aaaannnnnn nnnnnnnnnn 16561 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16621 nnnnnnnnnn nnnnnnnnnn nnnngtctcc cgggtcctcc tggcccccca ggtccttacg 16681 acatcatcaa aggggagccc gggctccctg gtcctgaggg ccccccaggg ctgaaagggc 16741 ttcagggact gccaggcccg aaaggccagc aaggtgagaa ggcttggctg tgcagggggt 16801 atggggagcc cagaggagtg gtcagagttc tcctacccat ctgatctaaa ataagttnnn 16861 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16921 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnngtg ttacaggatt ggtgggtata 16981 cctggacctc caggtattcc tgggtttgac ggtgcccctg gccagaaagg agagatggga 17041 cctgccgggc ctactgnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17101 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnntgga 17161 aatataaaat attaaaaact ctttggacaa atcattttaa agcctttagg gacctgtttg 17221 taattaggga aacctaattt ctctcctatc ttccaggtcc aagaggattt ccaggtccac 17281 caggccccga tgggttgcca ggatccatgg ggcccccagg caccccatct gttgatcacg 17341 gcttccttgt gaccaggcat agtcaaacaa tagatgaccc acagtgtcct tctgggacca 17401 aaattcttta ccacgggtac tctttgctct acgtgcaagg caatgaacgg gcccatggac 17461 aggacttggn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17521 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt tcagcttagg 17581 aaaaggtttc tttgtgctgg taaactggtt ttgtttcctt tcatcaaaaa tgtcctacat 17641 aatcctttat acatttccat ttcagtgatt atgttgatta tatgtttttg cttgttgtcc 17701 cagctgaaat ccactttaca tagagataga ttgtgaatta taaaggaaac attctcacaa 17761 ttgtcttctt ccttgtctag gcacggccgg cagctgcctg cgcaagttca gcacaatgcc 17821 cttcctgttc tgcaatatta acaacgtgtg caactttgca tcacgaaatg actactcgta 17881 ctggctgtcc acccctgagc ccatgcccat gtcaatggca cccatcacgg gggaaaacat 17941 aagaccattt attagtaggt gagtcgagca tctgtaggaa caaatgcaaa attgaagagg 18001 aaaagtctct agtctggaca agcatatgtt ttctattttt cttccaaaac gttgaatcca 18061 ttgccattta gcttattatt tgtgataatt ccggagtttt gtaatggaaa tctaccataa 18121 aaattatttg acataaaagt cagttggctg ggcacagtgg ctcatgccgt aatcccaaca 18181 ctttgggaag ccgaggcgag agggttgctt gagctcagga gttcaagacc agcctgggca 18241 agatgacgag acttcatctc cacaaaaata caaaagttag ctgggcgtgg tggtgttgag 18301 ccttgtggtt cccagctacg agggaggctg aggtgggaga tccattaagc ctgggatgtt 18361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ccttaaaggc caaatggggc 18481 aacaatgttt tctacttaat atgtaattgc accagaaaga ctgagtgagg atgttgagtg 18541 taggtaacgg ggcagtgcat gagggcgcct ctgccctgca ccccggctgt gctgagtgtc 18601 tctgctccac ttccaggtgt gctgtgtgtg aggcgcctgc catggtgatg gccgtgcaca 18661 gccagaccat tcagatccca ccgtgcccca gcgggtggtc ctcgctgtgg atcggctact 18721 cttttgtgat ggtaagtgtc tggggagaag ccatatttcc cgggaagagc cccnnnnnnn 18781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18841 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnncacacca gcgctggtgc agaaggctct 18901 ggccaagccc tggcgtcccc cggctcctgc ctggaggagt ttagaagtgc gccattcatc 18961 gagtgtcacg gccgtgggac ctgcaattac tacgcaaacg cttacagctt ttggctcgcc 19021 accatagaga ggagcgagat gttcaagtaa gtgggagcac tgcttttttg caggctgctg 19081 gcccctttgt gactttattt tttaatcatc cgcgatccta acacttccaa tgcaaagtgt 19141 caaagatggt tagcaatnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 19201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnncct 19261 gtgtcgtgtt tttaggaagc ctacgccgtc caccttgaag gcaggggagc tgcgcacgca 19321 cgtcagccgc tgccaagtct gtatgagaag aacataatga agcctgactc agctaatgtc 19381 acaacatggt gctacttctt cttctttttg ttaacagcaa cgaaccctag aaatatatcc 19441 tgtgtacctc actgtccaat atgaaaaccg taaagtgcct tataggaatt tgcgtaacta 19501 acacaccctg cttcattgac ctctacttgc tgaaggagaa aaagacagcg ataagcttca 19561 atagtggcat accaaatggc acttttgatg aaataaaata tcaatatttt ctgcaatcca 19621 atgcactgat gtgtgaagtg agaactccat cagaaaacca aagggtgcta ggaggtgtgg 19681 gtgccttcca tactgtttgc ccattttcat tcttgtatta taattaattt tctaccccca 19741 gagataaatg tttgtttata tcactgtcta gctgtttcaa aatttaggtc ccttggtctg 19801 tacaaataat agcaatgtaa aaatggtttt ttgaacctcc aaatggaatt acagactcag 19861 tagccatatc ttccaacccc ccagtataaa tttctgtctt tctgctatgt gtggtacttt 19921 gcagctgctt ttgcagaaat cacaattttc ctgtggaata aagatggtcc aaaaatagtc 19981 aaaaattaaa tatatatata tattagtaat ttatatagat gtcagcaatt aggcagatca 20041 aggtttagtt taacttccac tgttaaaata aagcttacat agttttcttc ctttgaaaga 20101 ctgtgctgtc ctttaacata ggtttttaaa gactaggata ttgaatgtga aacatccgtt 20161 ttcattgttc acttctaaac caaaaattat gtgttgccaa aaccaaaccc aggttcatga 20221 atatggtgtc tattatagtg aaacatgtac tttgagctta ttgtttttat tctgtattaa 20281 atattttcag ggttttaaac actaatcaca aactgaatga cttgacttca aaagcaacaa 20341 ccttaaaggc cgtcatttca ttagtattcc tcattctgca tcctggcttg aaaaacagct 20401 ctgttgaatc acagtatcag tattttcaca cgtaagcaca ttcgggccat ttccgtggtt 20461 tctcatgagc tgtgttcaca gacctcagca gggcatcgca tggaccgcag gagggcagat 20521 tcggaccact aggcctgaaa tgacatttca ctaaaagtct ccaaaacatt tctaagacta 20581 ctaaggcctt ttatgtaatt tctttaaatg tgtatttctt aagaattcaa atttgtaata 20641 aaactatttg tataaaaatt aagctt //