LOCUS       AF288385                2926 bp    mRNA    linear   HUM 22-FEB-2001
DEFINITION  Homo sapiens carbonic anhydrase-related protein X mRNA, complete
            cds.
ACCESSION   AF288385
VERSION     AF288385.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2926)
  AUTHORS   Hewett-Emmett,D. and Tashian,R.E.
  TITLE     Functional diversity, conservation, and convergence in the
            evolution of the alpha-, beta-, and gamma-carbonic anhydrase gene
            families
  JOURNAL   Mol. Phylogenet. Evol. 5 (1), 50-77 (1996)
   PUBMED   8673298
REFERENCE   2  (bases 1 to 2926)
  AUTHORS   Hewett-Emmett,D., Shimmin,L.C., Wiebauer,K., Tashian,R.E. and
            Porter,C.A.
  TITLE     Structure and expression of the human CA10 gene: Evolution of the
            highly conserved alpha-carbonic anhydrase-related proteins, CA-RP
            X and CA-RP XI
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 2926)
  AUTHORS   Shimmin,L.C. and Hewett-Emmett,D.
  TITLE     Direct Submission
  JOURNAL   Submitted (19-JUL-2000) Human Genetics Center, School of Public
            Health, University of Texas-Houston Health Science Center, P.O.
            Box 20334, Houston, TX 77225-0334, USA
FEATURES             Location/Qualifiers
     source          1..2926
                     /db_xref="H-InvDB:HIT000076340"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="17"
                     /clone="IMAGE 2517501"
                     /sex="male"
                     /tissue_type="frontal lobe"
                     /dev_stage="5 months post-conception"
                     /lab_host="DH10B"
                     /tissue_lib="Schneider fetal brain 00004"
     exon            1..85
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC026955; alternative exon 1b is present
                     in the cDNA sequences of GenBank Accession Numbers
                     AF064854 and AB036836"
                     /number=1a
     exon            86..729
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC026955"
                     /number=2
     CDS             669..1655
                     /note="CA-RP X; CA10; member of the alpha-carbonic
                     anhydrase family"
                     /codon_start=1
                     /product="carbonic anhydrase-related protein X"
                     /protein_id="AAK11974.1"
                     /translation="MEIVWEVLFLLQANFIVCISAQQNSPKIHEGWWAYKEVVQGSFV
                     PVPSFWGLVNSAWNLCSVGKRQSPVNIETSHMIFDPFLTPLRINTGGRKVSGTMYNTG
                     RHVSLRLDKEHLVNISGGPMTYSHRLEEIRLHFGSEDSQGSEHLLNGQAFSGEVQLIH
                     YNHELYTNVTEAAKSPNGLVVVSIFIKVSDSSNPFLNRMLNRDTITRITYKNDAYLLQ
                     GLNIEELYPETSSFITYDGSMTIPPCYETASWIIMNKPVYITRMQMHSLRLLSQNQPS
                     QIFLSMSDNFRPVQPLNNRCIRTNINFSLQGKDCPNNRAQKLQYRVNEWLLK"
     exon            730..804
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC006083"
                     /number=3
     exon            805..947
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC006083"
                     /number=4
     exon            948..1133
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC002090"
                     /number=5
     misc_feature    1056..1058
                     /note="encodes active-site glutamic acid instead of
                     glutamine found in catalytically active alpha-carbonic
                     anhydrase family members"
     misc_feature    1062..1064
                     /note="encodes active-site arginine instead of histidine
                     found in catalytically active alpha-carbonic anhydrase
                     family members"
     exon            1134..1229
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC002090"
                     /number=6
     misc_feature    1137..1139
                     /note="encodes active-site glutamine instead of histidine
                     found in catalytically active alpha-carbonic anhydrase
                     family members"
     exon            1230..1302
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC002090"
                     /number=7
     exon            1303..1457
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC002090"
                     /number=8
     variation       1373
                     /note="compared to the sequence of GenBank Accession
                     Number AC002090"
                     /replace="c"
     exon            1458..1632
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC005883"
                     /number=9
     exon            1633..2926
                     /note="span derived from the genomic sequence of GenBank
                     Accession Number AC005883"
                     /number=10
     variation       1912
                     /note="compared to the sequence of GenBank Accession
                     Number AC005883"
                     /replace="t"
     regulatory      2444..2449
                     /regulatory_class="polyA_signal_sequence"
                     /note="alternative"
     regulatory      2898..2903
                     /regulatory_class="polyA_signal_sequence"
BASE COUNT      774 a    723 c    671 g    758 t
ORIGIN
        1 aatcgggaca agcgcgccga gaaatcctgg atcttcccag ggattccctc tacgctccct
       61 cctcttcttc aataacccca aagaggagaa gaactgcccc ggagcgcagc agccaccctc
      121 cgaccatgcc ccggtgaggg gggcggactt cgagggcaac ttgccgcgga ctgcctgggc
      181 ttagccagcg agctacgcgc tcccgggagc ccggaattgc acggcgcagc ccggcggggg
      241 gctatcgtct atgtcttctt ggggcgccag acgaatcggg gtctcgtttt tgctggaaga
      301 gcccagtgtt ggtggcttca ggtggctgct gccgccgccg ccgccgccgc cgctgctagt
      361 gcggtttccg ccgctggtgc gaagagaaga gacacgcgag cggggagacc tccaaggcag
      421 cgaggcatcg gacatgtgtc agcacatctg gggcgcacat ccgtcgagcc cgaggggaga
      481 tttgccggaa caattcaaac tgcgatattg atcttggggg tgactgtccc tggccggctg
      541 tcgggtggga gtgcgagtgt gcactcgctc ggaagtgtgt gcgagtgtgt atgtgtgtgt
      601 gccgtgtcgg gctcccccct tccccccgtt ttcccgtcga gtgatgcact tggaatgaga
      661 atcagaggat ggaaatagtc tgggaggtgc tttttcttct tcaagccaat ttcatcgtct
      721 gcatatcagc tcaacagaat tcaccaaaaa tccatgaagg ctggtgggca tacaaggagg
      781 tggtccaggg aagctttgtt ccagttcctt ctttctgggg attggtgaac tcagcttgga
      841 atctttgctc tgtggggaaa cggcagtcgc cagtcaacat agagaccagt cacatgatct
      901 tcgacccctt tctgacacct cttcgcatca acacgggggg caggaaggtc agtgggacca
      961 tgtacaacac tggaagacac gtatcccttc gcctggacaa ggagcacttg gtcaacatat
     1021 ctggagggcc catgacatac agccaccggc tggaggagat ccgactacac tttgggagtg
     1081 aggacagcca agggtcggag cacctcctca atggacaggc cttctctggg gaggtgcagc
     1141 tcatccacta taaccatgag ctatatacga atgtcacaga agctgcaaag agtccaaatg
     1201 gattggtggt agtttctata tttataaaag tttctgattc atcaaaccca tttcttaatc
     1261 gaatgctcaa cagagatact atcacaagaa taacatataa aaatgatgca tatttactac
     1321 aggggcttaa tatagaggaa ctatatccag agacctctag tttcatcact tatgatgggt
     1381 cgatgactat cccaccctgc tatgagacag caagttggat cataatgaac aaacctgtct
     1441 atataaccag gatgcagatg cattccttgc gcctgctcag ccagaaccag ccatctcaga
     1501 tctttctgag catgagtgac aacttcaggc ctgtccagcc actcaacaac cgctgcatcc
     1561 gcaccaatat caacttcagt ttacagggga aggactgtcc aaacaaccga gcccagaagc
     1621 ttcagtatag agtaaatgaa tggctcctca agtagggaac aaagccaaga agaatcccac
     1681 ctcagtgaaa tgctacaact gtgaattgac gtaacctaga atgtccccct tcttgcttct
     1741 ctctccttct ttcccccaag cctcattcat tcttgggatt ggccctttct tcatgaaaag
     1801 tgtctgcaaa accatggcag aggaatacat ctctcacaca tactcacaaa cacacacaca
     1861 agcacttgca catacataca aacacatgca aacataccta cacacacaca cactcttaca
     1921 acctccatca tgggaagtca agtttcagaa acaaaagtct cattcataag aggtcttaga
     1981 agaaaataac cagttaacct gatttcaatt ttgataccgt tttcctgaac taataaatct
     2041 acccaatgag acttttcagc ctttgtacat acaaaattct tccaaaagag agaggagaaa
     2101 atacagctct gatggcatca aacggacttt gcatcaagta atttcagata gtgtcctagg
     2161 atcctttgag ggtgctggta gcaggtgagc aggacaaagt tgaccaagga cacttatttc
     2221 tagattatga ttcttctgtt tactcaacaa tttacaaaga aaaaaaggac agacattgaa
     2281 gagctacaca ttgtatatat atcaccacag actataagga aatggaatta tttccctctt
     2341 tgtcacatat ctgtagtagg atttgccaag atcagaaatg atccatttgc tgtttcttgt
     2401 tttccaaagg tcatacattg tgtttggtta ttgttaccag ctcaataaat gtgtttaacg
     2461 agttaatttc atttttctgg ctttggtctg ttctccttcc ttacaggcta agccctggct
     2521 ccatgcaact gcattctttg atttcacttg ttccttcatc tacatgtttt gttcatttgc
     2581 agccagtttt tactgagttt gtggcaatca ggaatgcatt tgctaagcaa gtatgacttt
     2641 aattccactc catggctcaa tcattcacat gaggtgagct tcagcctgag atagcaggcg
     2701 acagacttct tgcgtttcaa aactgccatg cccccctgtg atgctcccgt gaaggaatgc
     2761 actttgcctt gtaagttcct gggaaagggg tatgttttct ctccaggtgc agccagatct
     2821 cacaaagtac aaaacgaatg cctttctttt cttgtttata atggtcactc actgtgtttg
     2881 gttactgtca agaaatcaat aaatgtgttt aacaagttac ccagta
//