LOCUS AF107044 8372 bp DNA linear HUM 13-DEC-1998 DEFINITION Homo sapiens clone pCL4 DNA-binding protein SOX21 (SOX21) gene, complete cds. ACCESSION AF107044 VERSION AF107044.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 8372) AUTHORS Malas,S., Duthie,S. and Episkopou,V. TITLE The cloning and chromosomal localization of human SOX14 and SOX21; two members of the SOX gene family related to SOX1, SOX2 and SOX3 JOURNAL Unpublished REFERENCE 2 (bases 1 to 8372) AUTHORS Malas,S., Duthie,S. and Episkopou,V. TITLE Direct Submission JOURNAL Submitted (17-NOV-1998) Clinical Sciences Centre, Medical Research Council, Du Cane Rd, London W12 ONN, UK FEATURES Location/Qualifiers source 1..8372 /organism="Homo sapiens" /mol_type="genomic DNA" /db_xref="taxon:9606" /chromosome="13" /map="13q32-q33" /clone="pCL4" gene <1167..>1997 /gene="SOX21" mRNA <1167..>1997 /gene="SOX21" /product="DNA-binding protein SOX21" CDS 1167..1997 /gene="SOX21" /note="Sry related HMG-box containing gene 21" /codon_start=1 /product="DNA-binding protein SOX21" /protein_id="AAC95381.1" /translation="MSKPVDHVKRPMNAFMVWSRAQRRKMAQENPKMHNSEISKRLGA EWKLLTESEKRPFIDEAKRLRAMHMKEHPDYKYRPRRKPKTLLKKDKFAFPVPYGLGG VADAEHPALKAGAGLHAGAGGGLVPESLLANPEKAAAAAAAAAARVFFPQSAAAAAAA AAAAAAGSPYSLLDLGSKMAEISSSSSGLPYASSLGYPTAGAGAFHGAAAAAAAAAAA AGGHTHSHPSPGNPGYMIPCNCSAWPSPGLQPPLAYILLPGMGKPQLDPYPAAYAAAL " BASE COUNT 1996 a 2143 c 2104 g 2124 t ORIGIN 1 aagcttggtg ccatctattt tggactatgc cttgcataca gctttatggg aacatttgtc 61 aggcaaaagt ataataatgg caaactctac gccttttatt ttaaattaga ttggtgtgat 121 ttgatgctga cgggagtgag agtaatggcc ttatcctgct gcaggctgtg ctgaggatgg 181 cctggtctgc caccctcctc gagtagcatt ttgcatgtgt aacagggtct cccctctggg 241 gcacaacaac aaagagaagt tgctaaggac aagaagcagg tgcggaaatg catctcccat 301 tggaacagcc ctgggcttac tccaatggct gagagaggtg ctatggccag tcctcccaga 361 gctctgcagc tgcacttggg ggtggacagt ctcgtgcttg tcctgcgtga taacggccgt 421 gaaagccagc caactgctgc ccaaaatcac ccagccgatt gggggtttcc catcggcgca 481 ccctgcccgg agccaagaag acaggctggt gctgctgtat ttgtatttat atccattgct 541 gcgctctgcg ttctcgtggc acgcctggac actcctccgc ctccccctcc tcttcctcct 601 ccagggccac ctccccgcct tccccacccc catctgcttc tgtcaaatga gaaagtcacc 661 gaggagaacc caaacactcc agccgctgag agcccccttt ggcacttggc agcacgcggc 721 ggcgggctcc tcggctcaac ttcgaggagt ctccgcgacg caacttttgg ggacgctttg 781 catttaagag agaacgaccg aggaggagga gcgctctgcc cggccgccgc tacctgcggg 841 gagctcacca gcaaacgcca ctgcagacga aggacccaaa gaacgtaaag ggcaaactgc 901 cgccgcgggg agggggcacc gccgagaagt tagagtgtcc cagagacaac ctgctcgagc 961 gctcggccgg agacactaag gcggcccggg gcgcggcgtg gccctggctg gtcccccagc 1021 cccctcctcc ggggcgggag cgacgccggg gcgcgacgag ccccggccgg ccgagcgggt 1081 ctccgcgggc agccaacatt gatttcctcc gggccgaggg cgagggcccg ggcggcggcg 1141 ggctgcagcc gcggcagggc gagagcatgt ccaagccggt ggaccacgtc aagcggccca 1201 tgaacgcctt catggtgtgg tcgcgggctc agcggcgcaa gatggcccag gagaacccca 1261 agatgcacaa ctcggagatc agcaagcgct tgggcgccga gtggaaactg ctcacagagt 1321 cggagaagcg gccgttcatc gacgaggcca agcgtctacg cgccatgcac atgaaggagc 1381 accccgacta caagtaccgg ccgcggcgca agcccaagac gctcctcaag aaggacaagt 1441 tcgccttccc ggtgccctac ggcctgggcg gcgtggcgga cgccgagcac cctgcgctca 1501 aggcgggcgc cgggctgcac gcgggggcgg gcggcggcct ggtgcctgag tcgctgctcg 1561 ccaatcccga gaaggcggcc gcggccgccg ccgctgccgc cgcacgcgtc ttcttcccgc 1621 agtcggccgc tgccgccgcc gctgccgccg ccgccgccgc cgcgggcagc ccctactcgc 1681 tgctcgacct gggctccaaa atggcagaga tctcgtcgtc ctcgtccggc ctcccgtacg 1741 cgtcgtcgct gggctacccg accgcgggcg cgggcgcctt ccacggcgcg gcggcggcgg 1801 ctgcagcggc ggccgccgcc gccggggggc acacgcactc gcaccccagc ccgggcaacc 1861 cgggctacat gatcccgtgc aactgcagcg cgtggcccag ccccgggctg cagccgccgc 1921 tcgcctacat cctgctgccg ggcatgggca agccccagct ggacccctac cccgcggcct 1981 acgctgccgc gctatgaccc cgcggggccg cctcgcgagg accggtgtgc acacgtgtac 2041 atatgtatag gtacgagcgc tgcggcctcc ccgtgcgccc tcccgcgacc gggggcccgg 2101 tttgtatgta catagaatgt ataggtgcca ggtagaggca gagaggccag gcggggcagg 2161 agtggccaag cgcgcaaggg cgcgggcgag caggcctgtg aattcgcagg atcatttcag 2221 acccgcactt cggcagccaa ctcgaaagca ggcggttgtg tgcggcagca gttggcgttt 2281 gctttgcact tcggaacctg ttgcgttttg acccacggag gtggaggagt aactttttga 2341 catgttggcc tttccagttt tgttggaagt ttcatggtcg gttttgtttt tgtttctcat 2401 tcttcttcct cgcccctcag ccccccaacc cccaaccccc tcccggtccg tgttgcatgc 2461 acgctgttca aatgtgaggt ctgaaatggc tggcacacgg gaaaagctgc ttgtgtcatt 2521 cgtttctggg agtgggatgg ctctgagcag cctcgcctcc ctgtttgtac tatttgaact 2581 ttgcagatct ctgttctctc aagcagaact cccaaccaga tccattcttg accagtgacc 2641 ggctcgaatc tggccttttg tgtgagatga tcacggnttc ttttgtttat cacgccattt 2701 gcaaatcaga gcaagagctc tttctcaagg gcaagaaacg caaacaagaa atatttgtga 2761 gatgaaagtt gtcaattgga ttttcttcct aaacaaacaa caacaacaaa ctactagaag 2821 tctccctgag tccactcgct tggatttctg acacagttta caaaaaagga aaaaggcact 2881 gctcctattt tcccttatgg ctgagttcac cttaagattg taaatgtgta tatgtcagtg 2941 aaaacattga ggcttggaaa atgtgttatt ttcgttgccc taagtttgag tcgactttag 3001 actcaaaaac attttgagcg aatatcaaag ttaactttta aaaattgcga aactatttca 3061 gaatcgcaat tttatcgaag attaaatcag acttttttgt ctggtaatta tatatttatt 3121 atttagcaaa actgaagaaa aaaagcacag aattgtttca acagatgtct ctcattttca 3181 gctagcattt ctctcccaag ttgagctggt ttaatgtgtt ttggatttcc ctcctcaatt 3241 ggcttatttt ttagatcacc tgcaattcat ttgcaaattg caataaaaca cattttagaa 3301 aaaaggaacc ttcaattatt agctttgttt ctttttaaat gtatatattt tgactaatgt 3361 ttgtgaatga agttggctaa catgtattta gtttcatttt ggctttatgt aatataaagt 3421 ttttaaaatt ttaaatatgg ttttaacctt tatgtgtaaa tgattttcta gtgtgacctt 3481 ctaatttaat attagacgtc taaggtatat ctgtaaatta gaatccgact atcactctgt 3541 tcattttttt tgaacaaaga gtttaaataa agcctgaacc agggaaaaga aaaatcttct 3601 atttcttgtt gagttcctaa caagattttt atctgaattg cccttacgtg cctggtccag 3661 gtgaagtgta aggtatcctc caaaggcacc ctttgtttca cttttgaata gatttactag 3721 gaaatctaaa tcaagccatt gttattcaga gccaaaaacc tgatttatca catttttaat 3781 cgtgaatagg aaagaagatt tttaaaaagc ccaagtcgtt gtattagctt taacaacaac 3841 aaaaaaaagg cattcatgaa ccagtagaac agagcccatt gaaaacatcc agacctttca 3901 aagcatttca ccagtttcta gtaacatttt aagaggggaa agttgcttga ccactttatc 3961 ttgttagttg aagagcccca ccacttaaat cagtgtaatt tgttctccta tctttggggt 4021 attccttgtt gacaccttaa ggttttattt ggaaggataa tcactactaa cgacaaagta 4081 caaattttgg cctctttagg acttaatttt gttatgctaa tcgcattaaa gtagaagtat 4141 aacattcaaa tggagagggt tggatttcta gggctagaca aattgctact aaagtttgaa 4201 aaatcataaa ggattttaat tttagacaag aaatagaaga ctgtcagaaa aaaaaaaata 4261 ggaagatctc gcccccccgc aaccaaaatg gaaattctca agatactata tacaagtctt 4321 aaaccagttt ccccattgag accatctctg gagctgcacg tctttataaa cgacccaagt 4381 ctttaaagtc attgttttcc cccaacggaa taatatttta aaaaccatga aaagttttgg 4441 aaatgtgaga aataggctct gctggtttga ccctgattca ctaattaaaa tgatccctct 4501 cctgttattc cctgagctct ttgcaatatt ataagttaat tcatatggtt ctgagcgatt 4561 atgcaaaact aatttggact gtccaggggt aattatccct gacacggtta attaaatcct 4621 ttcaaggctt cgtctttccc ttttgtagca gcccatccct tctcaacacg gaacttctgc 4681 ggctcgctgg aaatcacccc agccctaaat cttagttacc accctgagcc ttccagctcg 4741 gccgcctcct cggcctgaag actccccgcc tcctcccgcc ccctcccctt ttcccaaaga 4801 tcagcgtttt ctgggagaaa cgctccggag ttgttgatga atgagaagag gactggaaag 4861 atgggtaaga ggaggggtga ggatgccgag ggggagcacc gaggtcatat cgccaacaga 4921 ttgtgcggct gtttgaggac ctccacaggc cccacagact cgtttatcac ccattctgac 4981 tccaatggtc ttgctaacaa gttggcgggt tttgcgcctg cagagagcct cctgccaagt 5041 tagactgtgc agaagtaagg ggttggagcg gggggagcgg ctccggggca agagggcgta 5101 gagaaaggcc cggggnnggg nggtgtaagc gtctgaaagt ggcccacaaa tgcagcgctg 5161 tgattgggca gagagctgct gctggctcgc gatctctatc tccatctctt tatctatctc 5221 cgtctctctc cctgtttctc catttttctt tctttccttc tctctccttc cttccttcca 5281 tctttcttct ttcccttcct tttattcttc tattttcgtt tcttttcaag gtttttttta 5341 aagccatgat gcaatttctt tggtattcac cgttgtccca aaacttgaag caagcctcgt 5401 atccaagggg ccaggcatgt tgcttcgggc tttgtgcaaa caggtggaat tgcgctgtgt 5461 aagcagtaag aactggtgct ggggagctgt cgcgcgaggg ggtggctttg ggagagcagg 5521 gttgctggcc gcgattgtta cttcccttga caatttcctc ctccccctcc cccaagaaga 5581 taggagaaag caccgcggat ctccctctca ccccaggctc ggggcgcaga agatggagag 5641 aagattccac tctccccgga gcagataggg acggtcgcgc cagccaatca gagcgcggct 5701 cggcgccggc gctcccggcc gcctgggccg ccgtgtcctc caggcaagcg aagttcccgc 5761 aactcgtccg cctcgagggt ccgcgtcttt cttgcgcccg cggcccagcg gaggccgagg 5821 gagccgtcca aactttatta atctctcctc ctttctttct ccctcagccc agtgcatctc 5881 aaaggtcagc cctcttcttt taaaagactg atattattaa tgcactgaca attcctcccc 5941 cccttttctt ttttctctct tgcagggggg aaaaaaaggg aaatggtgaa aagagctttt 6001 tttatccttt tttttttttt gtccttcagt gggagcgttt agacagtcga ggaggttttg 6061 tccgagaaca aaacgcaggg ttgggaggtt ttgtgagagt gttgtttgtt gaagtggagc 6121 taagaaaaag cggcggcttt ctcctcattg tgaagaaacc aatcagtggt atttggaaaa 6181 ctgttagcat tgtgcacttc ttctgtgtcc attgtgaggc gtttcttttc acaaggtttt 6241 tttttcagcc gatccagctg gccggaatga atagcggtgc aatgtgtaca cgctttgtcc 6301 ctccggcctt caagtagccc ccattgaata gactaagttg acctgcgtga cagtgaaaca 6361 acataataaa aaatacatga gcccctgaat aggagcaggc gcataaataa ataaaatggg 6421 tgaccaaaac tggataaact gaatgacaaa acggtgaaag gggaacaaaa agatatttaa 6481 cacgctagat tagcattaga atgcgatcta caaggcagaa caattgatga ataggtttac 6541 cggccaagaa agaaatggac taaatgccct ttgaatagat atgctttttg caagggcttt 6601 gaatagatat gcttttgcaa gggctgaatg ggaaaaggta aagatgaagc tatgcaaatg 6661 agccggggaa ctttttatat atattcttta aacacacaca cacactgcgg ggggaagagt 6721 gctgcctcgg gatgtttata gaagcaataa ttgccattat tagcattgtc tgcggcagat 6781 agaaattgaa caggttggga taatataggg tagcagtaat tattcttcta attaatggtc 6841 ctttgctact tgaaaaaaga aaaaaggaaa gaagtagtaa aagttatgca gaagttatgt 6901 ttccttgtgt ccatttgccc agcgctggaa tctgtggagc aggaagcctg gcaattccaa 6961 gatacgcgat gatcytcaaa cattcccggg agccagtcct gaggctctgg cttcagggcc 7021 tagtttccat ttatgccgcg tttttgagag tctaatactg tgtctggcac atggtaggtg 7081 ctcactgaat agtcgtggta tgaatgaatg aacgaatgaa tgaatgaatg aatgaatata 7141 agtttaatgg gggaaacccg ggcctcctaa taaaggtagg ggctggggga tacctagggg 7201 cttccccagg aggatttctt ttttcatcat cccacccctg ggagaaaggt ccacgcagga 7261 tggtcgcttc ccccttgctg agagttttgc cttcagccta tctgggccgc tggaaaagag 7321 gagaagaata aacaagagac aagcaactac tcccctaccg gcgttccgtc cttgtcctca 7381 ctgccaaatc cactccaaag ccgaggatgg tgagactgtg aagttgcaaa gaaacacaga 7441 gcccaccccc ttaaagaatt acgatatatt taaagtttgc ctctttcagg tttctctcct 7501 tggctcctgc ccctttcccc tcccggctcc ttgtccttga ctgaacctca tgggacagag 7561 aacctcctgt cccccacgag gcaaggcgcg aacccgcaga gatctggggt gccctttggt 7621 tccctgcgct gccctggagg cgtccataga ggcctttgcc gccaaggaca gcaattgttt 7681 tattttcgat ggttgctcgc caggctgcgg gtcgcgggcc cacccagccg tcgaactttc 7741 cagtcgttat cagcgctgct cctaacttaa tggaataatg caaattatag cctgcccagc 7801 tgacacgtcc ctgcgaatgc gccggggctg agctctggcc agccgctctc tcgacgtcct 7861 ggacggccgg agggaatgaa gctctgaatt gtgacaaaag tggggggggc accccaaatt 7921 ctcaaagcaa tgttcttttt tttttctttt ttcttaagca attgagcctt accaaatgtc 7981 ggggccggcc gcacggaagc cttgcatatt ttaaagtgta acctgagcct tcgcggtttc 8041 agcttcactt aaaacatgca aattcttgaa attgaaaaat ctgaaaaact tccgaagagt 8101 tctatctgaa taaatccaaa tccattggga gtcgctttga ggagacaaaa cgcacagcga 8161 tttggggtga gggatatttg tggggaggca ggacgtgctg gattgggttt ccagggtcaa 8221 ggtgtctctg ggccttcgac gatagcctta gcgcagagca gggaagtggc accgctaggc 8281 agcaagctca gttgctctac ttttgtgacc catcccccca ccccccccac cgccaccctt 8341 gcctccgggc cactgcccct ctctgcaagc tt //