LOCUS       MUSHBBMAJ               6532 bp    DNA     linear   ROD 22-MAR-2001
DEFINITION  Mouse beta-globin major gene.
ACCESSION   J00413 K01748 K03545
VERSION     J00413.1
KEYWORDS    beta-globin; globin.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha;
            Muroidea; Muridae; Murinae; Mus; Mus.
REFERENCE   1  (bases 2588 to 4160)
  AUTHORS   Konkel,D.A., Tilghman,S.M. and Leder,P.
  TITLE     The sequence of the chromosomal mouse beta-globin major gene:
            homologies in capping, splicing and poly(A) sites
  JOURNAL   Cell 15 (4), 1125-1132 (1978)
   PUBMED   569555
REFERENCE   2  (bases 2489 to 4211)
  AUTHORS   Konkel,D.A., Maizel,J.V. Jr. and Leder,P.
  TITLE     The evolution and sequence comparison of two recently diverged
            mouse chromosomal beta--globin genes
  JOURNAL   Cell 18 (3), 865-873 (1979)
   PUBMED   519759
REFERENCE   3  (bases 2540 to 3833)
  AUTHORS   van Ooyen,A., van den Berg,J., Mantei,N. and Weissmann,C.
  TITLE     Comparison of total sequence of a cloned rabbit beta-globin gene
            and its flanking regions with a homologous mouse sequence
  JOURNAL   Science 206 (4416), 337-344 (1979)
   PUBMED   482942
REFERENCE   4  (bases 1257 to 2813)
  AUTHORS   Gilmour,R.S., Spandidos,D.A., Vass,J.K., Gow,J.W. and Paul,J.
  TITLE     A negative regulatory sequence near the mouse beta-maj globin gene
            associated with a region of potential Z-DNA
  JOURNAL   EMBO J. 3 (6), 1263-1272 (1984)
   PUBMED   6086313
REFERENCE   5  (bases 4651 to 6064)
  AUTHORS   Citron,B., Falck-Pedersen,E., Salditt-Georgieff,M. and Darnell,J.E.
            Jr.
  TITLE     Transcription termination occurs within a 1000 base pair region
            downstream from the poly(A) site of the mouse beta-globin (major)
            gene
  JOURNAL   Nucleic Acids Res. 12 (22), 8723-8731 (1984)
   PUBMED   6095210
REFERENCE   6  (sites)
  AUTHORS   Goldberg,S.Z., Kuebbing,D., Trauber,D., Schafer,M.P., Lewis,S.E.,
            Popp,R.A. and Anderson,W.F.
  TITLE     A 66-base pair insert bridges the deletion responsible for a mouse
            model of beta-thalassemia
  JOURNAL   J. Biol. Chem. 261 (26), 12368-12374 (1986)
   PUBMED   3017971
REFERENCE   7  (bases 1 to 6532)
  AUTHORS   Kuebbing,D., Trauber,D. and Anderson,W.F.
  JOURNAL   Unpublished
COMMENT     [2]  revisies [1].
            [6]  sites; deletion causing beta-thalassemia.
            [7]  revises [7].
            Draft entry and computer-readable sequence for [1]-[7] kindly
            provided by D.Kuebbing, 08-JUN-1987.
            The sequence from [7] is presented below.
            There is homology between sequences of the beta-globin major and
            minor genes. Location of the intron in both genes is also preserved
            [1].
            The beta-thalassemia deletion starts at position 685 and ends at
            4391.  A novel 66 (+/- 2 bp) sequence including a DAI element and
            ending in a stretch of 25 'a's was found to bridge the deletion [6]
            (see entries with accession numbers M14274 and M14275). The
            sequence sent to us from D.Kuebbing [7] got jumbled in transmission
            (10872 base pairs were added to the end of the sequence).  The
            corrected sequence was kindly provided by him 04-OCT-1987 and is
            given as [7].
            revision   6532     6532     c in [7]; c...10871 bp...c in [7].
FEATURES             Location/Qualifiers
     source          1..6532
                     /organism="Mus musculus"
                     /mol_type="genomic DNA"
                     /strain="BALB/c"
                     /db_xref="taxon:10090"
     misc_feature    685..4391
                     /note="beta-thalassemia-causing deletion"
     prim_transcript 2666..6065
                     /note="beta-globin mRNA (alt.)"
     prim_transcript 2666..4652
                     /note="beta-globin mRNA (alt.)"
     prim_transcript 2666..4061
                     /note="beta-globin mRNA (alt.)"
     CDS             join(2718..2809,2926..3148,3802..3930)
                     /codon_start=1
                     /product="beta-globin major"
                     /protein_id="AAA37791.1"
                     /translation="MVHLTDAEKAAVSCLWGKVNSDEVGGEALGRLLVVYPWTQRYFD
                     SFGDLSSASAIMGNAKVKAHGKKVITAFNDGLNHLDSLKGTFASLSELHCDKLHVDPE
                     NFRLLGNMIVIVLGHHLGKDFTPAAQAAFQKVVAGVATALAHKYH"
     exon            <2718..2809
                     /note="beta-globin major"
                     /number=1
     exon            2926..3148
                     /number=2
     exon            3802..>3930
                     /note="beta-globin major"
                     /number=3
BASE COUNT         1957 a         1335 c         1250 g         1987 t
ORIGIN      
        1 gatctaaata ttgttcttct ttgaatttat tccagtgatg ttccagcctc aaatttggca
       61 agtgtcccta agacctctca taacaaccga atgttcataa tacccccttc caccaacttc
      121 acattttatc cacacacagg acatatgaaa atttcacttt cacatcacat tatcacactg
      181 ttaggaagtg gatgccatga ccagcatcac agttctgaca gggcaggaca agctgactct
      241 cttagatagt ctactagaca caggccagat ataactgaan atattccagt cacaaatgca
      301 caactgagac tccaggttgc atttagtatg ctttgtattg tagaatataa aatagcccct
      361 ctacggaatg ttatggtcac ccgcgaggca gtcacagcct ctcctgactc ggtatcctgc
      421 tatttttggg ttgtaccaaa aaataaacaa ccaataaatg atttaacctc ttaaaaaaac
      481 atttatactt aaataatgag atgagtttgg atttcctcac cttttaaaaa tgttgctaga
      541 gctactaaaa aaagcttgca tttacaagta gttgataaaa atattcctct ggattgtgca
      601 agaagggagg cgggaccact aacagacatg atggatgctt agttggactt ggcttctttc
      661 tctttttcca gaggctggac tctggtttcc tgtctctccg ttttctgcag gcagtttctc
      721 tagtttgctg ttcaccattc acctgtttgg ctggtcagcc acttcagcct gtttgctctt
      781 ctctcccctc ttccttttat ctgcattttt tcgtctgatg ctttattctt tcccgaggcc
      841 cctttgggct tcctgtcctt agtaagggca ggcttggcta anngccttgc agaatggtgc
      901 ttgggatatg cctttgctgc tccatctcac ttcctcttgg gaatcatagc tgtgccgggt
      961 gagagataca tccatcgctg gacctgtgca aggagtcgca ctgcctagcc tccaccacac
     1021 gaaatgccag taattttttt caactgtgct ttctatgcac aaataactct gacttgtgtc
     1081 aagttgacaa agcaaaatcc aaccagaggc ttggggtgaa aaatttgcag tgagaaaaat
     1141 gcatttacta tacttacact tagtacactt agtgtcaagc agacacagaa actgactgac
     1201 tgactgaatg aacaaatgaa gaaatgaatg aatgagtcag tcagttagtt actatagaat
     1261 tcctttctac cttcttccta cttggctcca ggcacatttc cacatcaaga taaagagaaa
     1321 caggtatagc tacatagtct ctacttttag acaatcgagt ctactaatgt ttcaaaacac
     1381 ttgaggttct aaataatttt atactgattc tatgtcttta gaattgagat ttctccattt
     1441 gcatctgcag atcccaaaaa ttgaatagaa tcactactgc acagggtatg ctcaagttca
     1501 aatgtacctt atctttaaaa cattatttag ctatgtgtaa aagtgaaata aagaggtaag
     1561 ttcacataca aatagagagg catacaaaca tacatgcata catacacata catacataca
     1621 tacatacata catacacaca gactcacaga cacacagata tacagagata cagacacaca
     1681 gagacacaca cacaggcaca tatacaaata tacacacata gacatattca cagatataca
     1741 gagatacaga cacacactca gagacataca catggacatg catacacaca cacagatata
     1801 cacagataca cacacacaca tgggcagcat gtgctgagga cttggttcag taaataaagg
     1861 tcaaggctgc ctgcctttaa ttcaaaagcg tggaaggaca ggacaatccc tgaaaaagca
     1921 ggttatccaa gctaaccaga tttgtgagct cagggtttac ttgagagatc ctgactcaac
     1981 aataaggtat agagcaatca agggcgattc tctgaaggca gttattgaac tccttgtaca
     2041 ctcttcccac acacacgtgt tcagagaact tgataaaaca catatataca agtaagccac
     2101 acaaacattc acatgagaga agaaaaacaa gagcaaacta agtaagatgc attttcttat
     2161 caggaagttt agttgacacc agaaagaagt catatttgga atcaaaatgg aatcatcatc
     2221 atgtatgcta aagatgtttt tttcacattc ttgagcaatg tggacagaga aggagattca
     2281 tccatgcact caaactggga aacaaagaaa agaaatcctc ttctaagctt tgcttctcaa
     2341 tttcttattt gcataatgag aaaaaaagga aaattaattt taacacaatt cagtagttga
     2401 ttgagcaaat gcgttcgcca aaaaggatgc tttagagaca gtgttctctg cacagataag
     2461 gacaaacatt attcagaggg agtcccagag ctgagacgtc ctaagccagt gagtggcaca
     2521 gcatgtccag ggagaaatat cgcttcgtcc tcaccgaagc ctgattccgt agagccacac
     2581 cctgaagggc caatctgctc acacaggata gagagggcag gagccagggc agagcatata
     2641 aggtgaggta ggatcagttg ctcctacatt tgcttctgac atagttgtgt tgactcacaa
     2701 ccccagaaac agacatcatg gtgcacctga ctgatgctga gaaggctgct gtctcttgcc
     2761 tgtggggaaa ggtgaactcc gatgaagttg gtggtgaggc cctgggcagg ttggtatcca
     2821 ggttacaagg cagctcacaa gaagaagttg ggtgcttgga gacagaggtc tgctttccag
     2881 cagacactaa ctttcagtgt cccctgtcta tgtttccctt tttaggctgc tggttgtcta
     2941 cccttggacc cagcggtact ttgatagctt tggagaccta tcctctgcct ctgctatcat
     3001 gggtaatgcc aaagtgaagg cccatggcaa gaaggtgata actgccttta acgatggcct
     3061 gaatcacttg gacagcctca agggcacctt tgccagcctc agtgagctcc actgtgacaa
     3121 gctgcatgtg gatcctgaga acttcagggt gagtctgatg ggcacctcct gggtttcctt
     3181 cccctggcta ttctgctcaa ccttcctatc agaaaaaaag gggaagcgat tctagggagc
     3241 agtctccatg actgtgtgtg gagtgttgac aagagttcgg atattttatt ctctactcag
     3301 aattgctgct ccccctcact ctgttctgtg ttgtcatttc ctctttcttt ggtaagcttt
     3361 taatttccag ttgcatttta ctaaattaat taagctctgg ttatttactt cccatcctga
     3421 tatcagcttc ccctcctcct ttcctcccag tccttctctc tctcctctct ctttctctaa
     3481 tcctttcctt tccctcagtt cattctctct tgatctacgt ttgtttgtct ttttaaatat
     3541 tgccttgtaa cttgctcaga ggacaaggaa gatatgtccc tgtttcttct catagctctc
     3601 aagaatagta gcataattgg cttttatgcc agggtgacag gggaagaata tattttacat
     3661 ataaattctg tttgacatag gattcttata ataatttgtc agtagtttaa ggttgcaaac
     3721 aaatgtcttt gtaaataagc ctgcaggtat ctggtatttt tgctctacag ttatgttgat
     3781 ggttcttcca tattcccaca gctcctgggc aatatgatcg tgattgtgct gggccaccac
     3841 cttggcaagg atttcacccc cgctgcacag gctgccttcc agaaggtggt ggctggagtg
     3901 gccactgcct tggctcacaa gtaccactaa accccctttc ctgctcttgc ctgtgaacaa
     3961 tggttaattg ttcccaagag agcatctgtc agttgttggc aaaatgatag acatttgaaa
     4021 atctgtcttc tgacaaataa aaagcattta tgttcactgc aatgatgttt taaattattt
     4081 gtctgtgtca tagaagggtt tatgctaagt tttcaagata caaagaagtg agggttcagg
     4141 tctgaccttg gggaaataaa tgaattacac ttcaaattgt gttgtcagct aagcagcagt
     4201 agccacagat cctattgcca tgccctaaac actcagagaa aaattcaaca aatggtttca
     4261 tttacacact acattatgat tacattttat gtaaattatt tgtttttttc tactcttcca
     4321 cataaatgtc tttttttcct cttacctacc cagcacttca cagttctcaa gccaataatt
     4381 tttcttttgt aaaactacca ttattctcta aacttttccc tctgtgttta ccaagcaaca
     4441 ttatttatct tttcataaat cctgttgcct tagacagctt cagtagcaat agaggtagga
     4501 ttaaggagag aatagaagtg ccctgtttgt cataccatgc ctgcacagtc aatagtcact
     4561 atgggatttc aaatggcact ttgcctggga cctttacact tcacaccata ctctggcttg
     4621 agttaggagt taagaatgag agaaatataa tctagagaga ataagaatat ctagttttta
     4681 aggctcatta ctggggtctt atgaaatttc cataataccc tgtaaatgga agcatttatt
     4741 ttttcaataa atctatcttg aatatccagt gtgggttagg attaaatctc tccttcatac
     4801 agttggactg cttttattta tatggagtta ctagagttaa cacaataagt aatataccct
     4861 tgatttgttt ttctttccat aaccaccagg ttatgcgcaa ttccggaaat aaaatgtgtg
     4921 ttccaagagt tctttacgct actctctggt acagttttag tgagattttg aaatgactac
     4981 atataataag tggcctttaa ttacagaatg gtttgtgtag gtacagaata aaatacacca
     5041 aatattatga gtttgagtca ttgtcatgag tcataaaaat gcagctccaa acgaagtaaa
     5101 gagttagagt atggtgagaa attataaacc atcaagaaaa aaatacagga cccataaagg
     5161 tagttgtgcg gccaggtatt tcgtgcatat ttataatcct tatttattat tactaagaag
     5221 ccaagcagta tttataaaat atggtcctct ctgaatgcaa tgtccaatgg tctaaaaccc
     5281 atatcttagt gttctcagag cagtatcttc tgtttgcaaa tagaactgaa ttttttataa
     5341 ctgtctcata atttatgtaa gacttttgcc tagccataaa gataggatga gcaattcttt
     5401 ttgcagtagg tagaaccctt gcctgttttt tcttgactta atgaagatca gtaatagatc
     5461 ttggtttata gagagttagt tggtagtaga catcatttca cagttgcatt cctcgactga
     5521 atcctaatta aaatgtatta gttttgtttt ctctgaggca gacagtgctg gtggcttttc
     5581 ccattgtgct tccaggtact ggtaccatag gttctagtcg aaagttgctc tctggatctg
     5641 gaagaagagg cagaggctgg ttgaatgtta ccctctggat gaactgtagc tggaggcatg
     5701 tatgcaaagt tcttccccac tactttccta agaaacaaat acactttgca tatgtgtttt
     5761 aaaagttata tggtaagtac acacccaata gttagctgta tctgtcagca gccagatcat
     5821 gcaatgtgct gatgtgttga tacaaaaaca ttactgctga ctgtctcatc gtgatagaaa
     5881 tatttgttga gttacttaac tagagtacca aaaatgaaaa taagctagca ttcaattata
     5941 ttcatctcaa tactctatta tttcagtacc acatatcttg ctagctacac aaacaggaag
     6001 ggacttaaca gtgtagttta gctttttttt tttttttaca agccctctca atggacactg
     6061 atcagagaat gaataatatt aaatatcatg gaaatggaaa caaatgcaga aagtttcatc
     6121 agtccagaca aaatgagggt taggacacaa gaaagtgaaa gatagaaggt atgtaaggtg
     6181 acttgagaag aggaacatgc agctcacaca cacatacaca tacagttcca cacagaggaa
     6241 gggtagggag agagaatgct gctaacccag gaagtcaagt aggggcttat aatgggaaga
     6301 ttctaatgat ttaaaaggag catgacttta tgtaccactt agcaagctaa ctgttcatgc
     6361 attattttgg gcaaaagtct ttaagccaac aatgagttta cgtagtaaca tagtacaagg
     6421 tccataacat tacttgttct tcttatcatc aggctgtttc tactgttacc tgtattcttc
     6481 aaattctcct aatattttta aaataaagaa atatttataa agctctatga tc
//