LOCUS AH001430 4501 bp DNA linear HUM 01-AUG-2016 DEFINITION Homo sapiens carbonic anhydrase IV gene, complete cds, alternatively spliced. ACCESSION AH001430 L10951 L10952 L10953 L10954 L10955 VERSION AH001430.2 KEYWORDS carbonic anhydrase IV. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4501) AUTHORS Okuyama,T., Batanian,J.R. and Sly,W.S. TITLE Genomic organization and localization of gene for human carbonic anhydrase IV to chromosome 17q JOURNAL Genomics 16 (3), 678-684 (1993) PUBMED 8325641 COMMENT On or before Aug 1, 2016 this sequence version replaced L10951.1, L10952.1, L10953.1, L10954.1, L10955.1, AH001430.1. FEATURES Location/Qualifiers source 1..4501 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" /cell_type="fibroblast" /tissue_lib="lambda FIX" intron <1..978 regulatory 801..806 /regulatory_class="GC_signal" regulatory 935..950 /regulatory_class="TATA_box" exon 979..1083 /note="label: 1a" /number=1 CDS join(1026..1083,1976..2131,2578..2723,2780..2878, 3150..3216,3371..3534,4055..4249) /EC_number="4.2.1.1" /note="membrane associated with glycosyl-phosphatidyl-inositol linkage" /codon_start=1 /product="carbonic anhydrase IV" /protein_id="AAA35625.1" /translation="MRMLLALLALSAARPSASAVPVKWGGNCQKDRQSPINIVTTKAK VDKKLGRFFFSGYDKKQTWTVQNNGHSVMMLLENKASISGGGLPAPYQAKQLHLHWSD LPYKGSEHSLDGEHFAMEMHIVHEKEKGTSRNVKEAQDPEDEIAVLAFLVEAGTQVNE GFQPLVEALSNIPKPEMSTTMAESSLLDLLPKEEKLRHYFRYLGSLTTPTCDEKVVWT VFREPIQLHREQILAFSQKLYYDKEQTVSMKDNVRPLQQLGQRTVIKSGAPGRPLPWA LPALLGPMLACLLAGFLR" intron order(1084..>1203,<1304..1524) gap 1204..1303 /estimated_length=unknown exon 1525..1578 /note="label: 1b" /number=1 intron order(1579..>1807,<1908..1975) gap 1808..1907 /estimated_length=unknown exon 1976..2131 /number=2 intron order(2132..>2341,<2442..2577) gap 2342..2441 /estimated_length=unknown exon 2578..2723 /number=3 CDS join(2580..2723,2780..2878,3150..3216,3371..3534, 4055..4249) /EC_number="4.2.1.1" /note="membrane associated with glycosyl-phosphatidyl-inositol linkage" /codon_start=1 /product="carbonic anhydrase IV" /protein_id="AAA35626.1" /translation="MMLLENKASISGGGLPAPYQAKQLHLHWSDLPYKGSEHSLDGEH FAMEMHIVHEKEKGTSRNVKEAQDPEDEIAVLAFLVEAGTQVNEGFQPLVEALSNIPK PEMSTTMAESSLLDLLPKEEKLRHYFRYLGSLTTPTCDEKVVWTVFREPIQLHREQIL AFSQKLYYDKEQTVSMKDNVRPLQQLGQRTVIKSGAPGRPLPWALPALLGPMLACLLA GFLR" intron 2724..2779 exon 2780..2878 /number=4 intron 2879..3149 exon 3150..3216 /number=5 intron 3217..3370 exon 3371..3534 /number=6 intron order(3535..>3746,<3847..4054) gap 3747..3846 /estimated_length=unknown exon 4055..>4348 /number=7 regulatory 4343..4348 /regulatory_class="polyA_signal_sequence" BASE COUNT 867 a 1259 c 1189 g 786 t ORIGIN 1 gtagcaggag ggcacagcta agagaggaga caagaggcag cttgggggca tgggggggac 61 cccatttccc caaatgttgg agccgccctt aagggaactt cccctggact cctctggagt 121 cttgccagat gggagaacct ggggtccagg ggtccattca gaccccagtc tcttctcttc 181 ccatctcatc tctccacctg gggggctcaa agaacctagc tcctcccctt tgaccccatc 241 agcttctccc tcaccgaatc ctcttacaat ccatttctta attctgccca tcaaccctcc 301 ctggtcatca cataaacacc agcaagggca gtagtttagg gacagctttg ctctgtattc 361 gaagaaaggt ctttggtctg cttagaaatt aagtttggag cagcctggga ggaaatagaa 421 gagatgagac ccagagaatc aatggccact gccaaagaga gataggagag cctttctgcg 481 gccactgatt caacaaacgg catcacacgc ccctcagcct ctcatctctt cgaggccatc 541 tccaccccca cccccacccc caccccaagg ccattccaag gtcccattca agaccactgc 601 gcaggggtgg cgccagaatg atcaaagtca tcatggggca ctcatctttg tgtgggtcaa 661 gcactttgtg gtgtcgtgtc cctgttccca aaccagggac caggagtagg atgggggagg 721 ggcagggcgc cgctgctgag tggagaggaa cgcggagtgg gagcgggggc cttgcgcaca 781 gggcaagagg tggccaggca gggcggggac cggggagagc ggaggatttc agagcaggga 841 caggggaagg tggagagggg agggcaggcc ccgccagggg gaggagccgc ctcccgcgtc 901 cagctgtcgg ggctccctcc cgtcgcggct ccgctataaa acccaggccg gcaggatcgc 961 tgcacccgcg gcggcctcct cggtgcgcga cccccggctc agaggactct ttgctgtccc 1021 gcaagatgcg gatgctgctg gcgctcctgg ccctctccgc ggcgcggcca tcggccagtg 1081 caggtgagct cccgggctcc ggccccaggt gcccctcggc ggtcccctcc gtgcccccag 1141 ctcccgcccc tgcagaggat ccccccgcgg gcgaccggtg agcgtcggtg gcgctggggt 1201 cccnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1261 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnntgcctgg ggaacagagc 1321 ctatgccagt gacctgttga tcagcccctg cttggttttc aattgagtct ctttttctgg 1381 gccctcaatc ctgctgccag gaacactcca tcccagccca agaggggctc caaccccagg 1441 ggtaggccca gcctctgatc ctcctgtgtg tgtgagcaac aagacacacg cacgcacact 1501 tcacaccctt cctctctgct ccagagtcac actggtgcta cgaggttcaa gccgagtcct 1561 ccaactaccc ctgcttgggt gagtacagcc agtcctgggg actgctcttt gtgcatggtg 1621 ggcaccacgc aagccgaaat ggagaccccg gaagagtggg aaggggaggg gtgatggtgg 1681 cttgcccagg cagatatcag ttcccagcat acacacacac acacacacac acacactctc 1741 tctctctctc tcacacacac acacacagat atacaaacac acacactcac acagaaacac 1801 acacacannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1861 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnttc agtggggtgg 1921 tgggggctac acctttggcc agcccccact ctcacccacc tctctccctg ctcagtgcca 1981 gtcaagtggg gtggaaactg ccagaaggac cgccagtccc ccatcaacat cgtcaccacc 2041 aaggcaaagg tggacaaaaa actgggacgc ttcttcttct ctggctacga taagaagcaa 2101 acgtggactg tccaaaataa cgggcactca ggtgggctgg atggaggcca ggcctggcac 2161 ccagttcccc aaggactgag aggatggggc tcctcccagg agggtgtgcc agacccaggc 2221 ccatctgtgc tgtgaggtgg ctgaaaatcc catgggggag gacagcttcc aggaggagag 2281 agcactctag tatgttttcg ttacttttgt cagaccagtc tgggatgtgg gggcaggaaa 2341 cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nacccctgca ggccagaacc 2461 agagctcatg aaggttggga ggcaggagac aatgtcccat ctgggtgaag ctgggatgaa 2521 gagctagagg aggctgaggg aggctggttc gaggactctc cccttctgtg ctcccagtga 2581 tgatgttgct ggagaacaag gccagcattt ctggaggagg actgcctgcc ccataccagg 2641 ccaaacagtt gcacctgcac tggtccgact tgccatataa gggctcggag cacagcctcg 2701 atggggagca ctttgccatg gaggtgaggg tagtccaggc ccttcatagg tccccttttc 2761 acccctccac cccgaccaga tgcacatagt acatgagaaa gagaagggga catcgaggaa 2821 tgtgaaagag gcccaggacc ctgaagacga aattgcggtg ctggcctttc tggtggaggt 2881 gggactccca tcccccactt cccggggaag gccgccctga gagcttcttc ttaggattca 2941 gagacctggg actcagcgag gcaggagggg gcggggagac tccaacttcc gcctctgttt 3001 ctggggttgc atgtccccgg gccaggtggg gagcccagag cctcaatccc agaagctgcc 3061 tggccttccg cacccagatc gggagaatga actggccacc accactggct ccctgcagac 3121 tttctcaaga cccttccctc cctttccagg ctggaaccca ggtgaacgag ggcttccagc 3181 cactggtgga ggcactgtct aatatcccca aacctggtga gtcaggatgg gggagaaggg 3241 cttggggtga ggggggggat tcctcccaca aaggaagggg tgggtgtgcg gggagctggg 3301 ctctcagagt gcaggggaag aggggctcct tctccaccct cactgacagt gtcctctgcc 3361 cctatctcag agatgagcac tacgatggca gagagcagcc tgttggacct gctccccaag 3421 gaggagaaac tgaggcacta cttccgctac ctgggctcac tcaccacacc gacctgcgat 3481 gagaaggtcg tctggactgt gttccgggag cccattcagc ttcacagaga acaggtgcac 3541 agggcctggg gcagggcatg ggctcccact gcctggctcc ccagaaatta tccctctgtc 3601 tgccctcaga ggtccctcag gatacaggtg gggagcccag gtaactgaag tccgttgtta 3661 atcatcgaca ttcactgaag acaggcaaga aaagcctgag ctgttccatc accagattgg 3721 gggctagaca ggaggcaggg gaaggtnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3841 nnnnnnccaa tggccattag ccttcaccca tccgcacgac ctcatttaca tcccctattc 3901 ttatcatctt ccagaccacc tcgagagcca ggggttcaga gcccctcttt cctaatgagg 3961 gctcccagga caggatgagg tgcctgcctg aggtcacacg gcagggagtg cagctccccc 4021 tgccccgacc tgctgagccc catcacttcc gcagatcctg gcattctctc agaagctgta 4081 ctacgacaag gaacagacag tgagcatgaa ggacaatgtc aggcccctgc agcagctggg 4141 gcagcgcacg gtgataaagt ccggggcccc gggtcggccg ctgccctggg ccctgcctgc 4201 cctgctgggc cccatgctgg cctgcctgct ggccggcttc ctgcgatgat ggctcacttc 4261 tgcacgcagc ctctctgttg cctcagctct ccaagttcca ggcttccggt ccttagcctt 4321 cccaggtggg actttaggca tgattaaaat atggacatat ttttggagaa acctttctca 4381 agtgtgtttt tagccttcca caactacccc accctgtccc cctccaccca cccctgttcc 4441 tcctgttcca gggcgggggc tttaaggcca ggagatttct ccaagcaggt accaccaggt 4501 g //