LOCUS MUSHBBMAJ 6532 bp DNA linear ROD 22-MAR-2001 DEFINITION Mouse beta-globin major gene. ACCESSION J00413 K01748 K03545 VERSION J00413.1 KEYWORDS beta-globin; globin. SOURCE Mus musculus (house mouse) ORGANISM Mus musculus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus. REFERENCE 1 (bases 2588 to 4160) AUTHORS Konkel,D.A., Tilghman,S.M. and Leder,P. TITLE The sequence of the chromosomal mouse beta-globin major gene: homologies in capping, splicing and poly(A) sites JOURNAL Cell 15 (4), 1125-1132 (1978) PUBMED 569555 REFERENCE 2 (bases 2489 to 4211) AUTHORS Konkel,D.A., Maizel,J.V. Jr. and Leder,P. TITLE The evolution and sequence comparison of two recently diverged mouse chromosomal beta--globin genes JOURNAL Cell 18 (3), 865-873 (1979) PUBMED 519759 REFERENCE 3 (bases 2540 to 3833) AUTHORS van Ooyen,A., van den Berg,J., Mantei,N. and Weissmann,C. TITLE Comparison of total sequence of a cloned rabbit beta-globin gene and its flanking regions with a homologous mouse sequence JOURNAL Science 206 (4416), 337-344 (1979) PUBMED 482942 REFERENCE 4 (bases 1257 to 2813) AUTHORS Gilmour,R.S., Spandidos,D.A., Vass,J.K., Gow,J.W. and Paul,J. TITLE A negative regulatory sequence near the mouse beta-maj globin gene associated with a region of potential Z-DNA JOURNAL EMBO J. 3 (6), 1263-1272 (1984) PUBMED 6086313 REFERENCE 5 (bases 4651 to 6064) AUTHORS Citron,B., Falck-Pedersen,E., Salditt-Georgieff,M. and Darnell,J.E. Jr. TITLE Transcription termination occurs within a 1000 base pair region downstream from the poly(A) site of the mouse beta-globin (major) gene JOURNAL Nucleic Acids Res. 12 (22), 8723-8731 (1984) PUBMED 6095210 REFERENCE 6 (sites) AUTHORS Goldberg,S.Z., Kuebbing,D., Trauber,D., Schafer,M.P., Lewis,S.E., Popp,R.A. and Anderson,W.F. TITLE A 66-base pair insert bridges the deletion responsible for a mouse model of beta-thalassemia JOURNAL J. Biol. Chem. 261 (26), 12368-12374 (1986) PUBMED 3017971 REFERENCE 7 (bases 1 to 6532) AUTHORS Kuebbing,D., Trauber,D. and Anderson,W.F. JOURNAL Unpublished COMMENT [2] revisies [1]. [6] sites; deletion causing beta-thalassemia. [7] revises [7]. Draft entry and computer-readable sequence for [1]-[7] kindly provided by D.Kuebbing, 08-JUN-1987. The sequence from [7] is presented below. There is homology between sequences of the beta-globin major and minor genes. Location of the intron in both genes is also preserved [1]. The beta-thalassemia deletion starts at position 685 and ends at 4391. A novel 66 (+/- 2 bp) sequence including a DAI element and ending in a stretch of 25 'a's was found to bridge the deletion [6] (see entries with accession numbers M14274 and M14275). The sequence sent to us from D.Kuebbing [7] got jumbled in transmission (10872 base pairs were added to the end of the sequence). The corrected sequence was kindly provided by him 04-OCT-1987 and is given as [7]. revision 6532 6532 c in [7]; c...10871 bp...c in [7]. FEATURES Location/Qualifiers source 1..6532 /organism="Mus musculus" /mol_type="genomic DNA" /strain="BALB/c" /db_xref="taxon:10090" misc_feature 685..4391 /note="beta-thalassemia-causing deletion" prim_transcript 2666..6065 /note="beta-globin mRNA (alt.)" prim_transcript 2666..4652 /note="beta-globin mRNA (alt.)" prim_transcript 2666..4061 /note="beta-globin mRNA (alt.)" CDS join(2718..2809,2926..3148,3802..3930) /codon_start=1 /product="beta-globin major" /protein_id="AAA37791.1" /translation="MVHLTDAEKAAVSCLWGKVNSDEVGGEALGRLLVVYPWTQRYFD SFGDLSSASAIMGNAKVKAHGKKVITAFNDGLNHLDSLKGTFASLSELHCDKLHVDPE NFRLLGNMIVIVLGHHLGKDFTPAAQAAFQKVVAGVATALAHKYH" exon <2718..2809 /note="beta-globin major" /number=1 exon 2926..3148 /number=2 exon 3802..>3930 /note="beta-globin major" /number=3 BASE COUNT 1957 a 1335 c 1250 g 1987 t ORIGIN 1 gatctaaata ttgttcttct ttgaatttat tccagtgatg ttccagcctc aaatttggca 61 agtgtcccta agacctctca taacaaccga atgttcataa tacccccttc caccaacttc 121 acattttatc cacacacagg acatatgaaa atttcacttt cacatcacat tatcacactg 181 ttaggaagtg gatgccatga ccagcatcac agttctgaca gggcaggaca agctgactct 241 cttagatagt ctactagaca caggccagat ataactgaan atattccagt cacaaatgca 301 caactgagac tccaggttgc atttagtatg ctttgtattg tagaatataa aatagcccct 361 ctacggaatg ttatggtcac ccgcgaggca gtcacagcct ctcctgactc ggtatcctgc 421 tatttttggg ttgtaccaaa aaataaacaa ccaataaatg atttaacctc ttaaaaaaac 481 atttatactt aaataatgag atgagtttgg atttcctcac cttttaaaaa tgttgctaga 541 gctactaaaa aaagcttgca tttacaagta gttgataaaa atattcctct ggattgtgca 601 agaagggagg cgggaccact aacagacatg atggatgctt agttggactt ggcttctttc 661 tctttttcca gaggctggac tctggtttcc tgtctctccg ttttctgcag gcagtttctc 721 tagtttgctg ttcaccattc acctgtttgg ctggtcagcc acttcagcct gtttgctctt 781 ctctcccctc ttccttttat ctgcattttt tcgtctgatg ctttattctt tcccgaggcc 841 cctttgggct tcctgtcctt agtaagggca ggcttggcta anngccttgc agaatggtgc 901 ttgggatatg cctttgctgc tccatctcac ttcctcttgg gaatcatagc tgtgccgggt 961 gagagataca tccatcgctg gacctgtgca aggagtcgca ctgcctagcc tccaccacac 1021 gaaatgccag taattttttt caactgtgct ttctatgcac aaataactct gacttgtgtc 1081 aagttgacaa agcaaaatcc aaccagaggc ttggggtgaa aaatttgcag tgagaaaaat 1141 gcatttacta tacttacact tagtacactt agtgtcaagc agacacagaa actgactgac 1201 tgactgaatg aacaaatgaa gaaatgaatg aatgagtcag tcagttagtt actatagaat 1261 tcctttctac cttcttccta cttggctcca ggcacatttc cacatcaaga taaagagaaa 1321 caggtatagc tacatagtct ctacttttag acaatcgagt ctactaatgt ttcaaaacac 1381 ttgaggttct aaataatttt atactgattc tatgtcttta gaattgagat ttctccattt 1441 gcatctgcag atcccaaaaa ttgaatagaa tcactactgc acagggtatg ctcaagttca 1501 aatgtacctt atctttaaaa cattatttag ctatgtgtaa aagtgaaata aagaggtaag 1561 ttcacataca aatagagagg catacaaaca tacatgcata catacacata catacataca 1621 tacatacata catacacaca gactcacaga cacacagata tacagagata cagacacaca 1681 gagacacaca cacaggcaca tatacaaata tacacacata gacatattca cagatataca 1741 gagatacaga cacacactca gagacataca catggacatg catacacaca cacagatata 1801 cacagataca cacacacaca tgggcagcat gtgctgagga cttggttcag taaataaagg 1861 tcaaggctgc ctgcctttaa ttcaaaagcg tggaaggaca ggacaatccc tgaaaaagca 1921 ggttatccaa gctaaccaga tttgtgagct cagggtttac ttgagagatc ctgactcaac 1981 aataaggtat agagcaatca agggcgattc tctgaaggca gttattgaac tccttgtaca 2041 ctcttcccac acacacgtgt tcagagaact tgataaaaca catatataca agtaagccac 2101 acaaacattc acatgagaga agaaaaacaa gagcaaacta agtaagatgc attttcttat 2161 caggaagttt agttgacacc agaaagaagt catatttgga atcaaaatgg aatcatcatc 2221 atgtatgcta aagatgtttt tttcacattc ttgagcaatg tggacagaga aggagattca 2281 tccatgcact caaactggga aacaaagaaa agaaatcctc ttctaagctt tgcttctcaa 2341 tttcttattt gcataatgag aaaaaaagga aaattaattt taacacaatt cagtagttga 2401 ttgagcaaat gcgttcgcca aaaaggatgc tttagagaca gtgttctctg cacagataag 2461 gacaaacatt attcagaggg agtcccagag ctgagacgtc ctaagccagt gagtggcaca 2521 gcatgtccag ggagaaatat cgcttcgtcc tcaccgaagc ctgattccgt agagccacac 2581 cctgaagggc caatctgctc acacaggata gagagggcag gagccagggc agagcatata 2641 aggtgaggta ggatcagttg ctcctacatt tgcttctgac atagttgtgt tgactcacaa 2701 ccccagaaac agacatcatg gtgcacctga ctgatgctga gaaggctgct gtctcttgcc 2761 tgtggggaaa ggtgaactcc gatgaagttg gtggtgaggc cctgggcagg ttggtatcca 2821 ggttacaagg cagctcacaa gaagaagttg ggtgcttgga gacagaggtc tgctttccag 2881 cagacactaa ctttcagtgt cccctgtcta tgtttccctt tttaggctgc tggttgtcta 2941 cccttggacc cagcggtact ttgatagctt tggagaccta tcctctgcct ctgctatcat 3001 gggtaatgcc aaagtgaagg cccatggcaa gaaggtgata actgccttta acgatggcct 3061 gaatcacttg gacagcctca agggcacctt tgccagcctc agtgagctcc actgtgacaa 3121 gctgcatgtg gatcctgaga acttcagggt gagtctgatg ggcacctcct gggtttcctt 3181 cccctggcta ttctgctcaa ccttcctatc agaaaaaaag gggaagcgat tctagggagc 3241 agtctccatg actgtgtgtg gagtgttgac aagagttcgg atattttatt ctctactcag 3301 aattgctgct ccccctcact ctgttctgtg ttgtcatttc ctctttcttt ggtaagcttt 3361 taatttccag ttgcatttta ctaaattaat taagctctgg ttatttactt cccatcctga 3421 tatcagcttc ccctcctcct ttcctcccag tccttctctc tctcctctct ctttctctaa 3481 tcctttcctt tccctcagtt cattctctct tgatctacgt ttgtttgtct ttttaaatat 3541 tgccttgtaa cttgctcaga ggacaaggaa gatatgtccc tgtttcttct catagctctc 3601 aagaatagta gcataattgg cttttatgcc agggtgacag gggaagaata tattttacat 3661 ataaattctg tttgacatag gattcttata ataatttgtc agtagtttaa ggttgcaaac 3721 aaatgtcttt gtaaataagc ctgcaggtat ctggtatttt tgctctacag ttatgttgat 3781 ggttcttcca tattcccaca gctcctgggc aatatgatcg tgattgtgct gggccaccac 3841 cttggcaagg atttcacccc cgctgcacag gctgccttcc agaaggtggt ggctggagtg 3901 gccactgcct tggctcacaa gtaccactaa accccctttc ctgctcttgc ctgtgaacaa 3961 tggttaattg ttcccaagag agcatctgtc agttgttggc aaaatgatag acatttgaaa 4021 atctgtcttc tgacaaataa aaagcattta tgttcactgc aatgatgttt taaattattt 4081 gtctgtgtca tagaagggtt tatgctaagt tttcaagata caaagaagtg agggttcagg 4141 tctgaccttg gggaaataaa tgaattacac ttcaaattgt gttgtcagct aagcagcagt 4201 agccacagat cctattgcca tgccctaaac actcagagaa aaattcaaca aatggtttca 4261 tttacacact acattatgat tacattttat gtaaattatt tgtttttttc tactcttcca 4321 cataaatgtc tttttttcct cttacctacc cagcacttca cagttctcaa gccaataatt 4381 tttcttttgt aaaactacca ttattctcta aacttttccc tctgtgttta ccaagcaaca 4441 ttatttatct tttcataaat cctgttgcct tagacagctt cagtagcaat agaggtagga 4501 ttaaggagag aatagaagtg ccctgtttgt cataccatgc ctgcacagtc aatagtcact 4561 atgggatttc aaatggcact ttgcctggga cctttacact tcacaccata ctctggcttg 4621 agttaggagt taagaatgag agaaatataa tctagagaga ataagaatat ctagttttta 4681 aggctcatta ctggggtctt atgaaatttc cataataccc tgtaaatgga agcatttatt 4741 ttttcaataa atctatcttg aatatccagt gtgggttagg attaaatctc tccttcatac 4801 agttggactg cttttattta tatggagtta ctagagttaa cacaataagt aatataccct 4861 tgatttgttt ttctttccat aaccaccagg ttatgcgcaa ttccggaaat aaaatgtgtg 4921 ttccaagagt tctttacgct actctctggt acagttttag tgagattttg aaatgactac 4981 atataataag tggcctttaa ttacagaatg gtttgtgtag gtacagaata aaatacacca 5041 aatattatga gtttgagtca ttgtcatgag tcataaaaat gcagctccaa acgaagtaaa 5101 gagttagagt atggtgagaa attataaacc atcaagaaaa aaatacagga cccataaagg 5161 tagttgtgcg gccaggtatt tcgtgcatat ttataatcct tatttattat tactaagaag 5221 ccaagcagta tttataaaat atggtcctct ctgaatgcaa tgtccaatgg tctaaaaccc 5281 atatcttagt gttctcagag cagtatcttc tgtttgcaaa tagaactgaa ttttttataa 5341 ctgtctcata atttatgtaa gacttttgcc tagccataaa gataggatga gcaattcttt 5401 ttgcagtagg tagaaccctt gcctgttttt tcttgactta atgaagatca gtaatagatc 5461 ttggtttata gagagttagt tggtagtaga catcatttca cagttgcatt cctcgactga 5521 atcctaatta aaatgtatta gttttgtttt ctctgaggca gacagtgctg gtggcttttc 5581 ccattgtgct tccaggtact ggtaccatag gttctagtcg aaagttgctc tctggatctg 5641 gaagaagagg cagaggctgg ttgaatgtta ccctctggat gaactgtagc tggaggcatg 5701 tatgcaaagt tcttccccac tactttccta agaaacaaat acactttgca tatgtgtttt 5761 aaaagttata tggtaagtac acacccaata gttagctgta tctgtcagca gccagatcat 5821 gcaatgtgct gatgtgttga tacaaaaaca ttactgctga ctgtctcatc gtgatagaaa 5881 tatttgttga gttacttaac tagagtacca aaaatgaaaa taagctagca ttcaattata 5941 ttcatctcaa tactctatta tttcagtacc acatatcttg ctagctacac aaacaggaag 6001 ggacttaaca gtgtagttta gctttttttt tttttttaca agccctctca atggacactg 6061 atcagagaat gaataatatt aaatatcatg gaaatggaaa caaatgcaga aagtttcatc 6121 agtccagaca aaatgagggt taggacacaa gaaagtgaaa gatagaaggt atgtaaggtg 6181 acttgagaag aggaacatgc agctcacaca cacatacaca tacagttcca cacagaggaa 6241 gggtagggag agagaatgct gctaacccag gaagtcaagt aggggcttat aatgggaaga 6301 ttctaatgat ttaaaaggag catgacttta tgtaccactt agcaagctaa ctgttcatgc 6361 attattttgg gcaaaagtct ttaagccaac aatgagttta cgtagtaaca tagtacaagg 6421 tccataacat tacttgttct tcttatcatc aggctgtttc tactgttacc tgtattcttc 6481 aaattctcct aatattttta aaataaagaa atatttataa agctctatga tc //