LOCUS       BC009797                1174 bp    mRNA    linear   HUM 01-SEP-2006
DEFINITION  Homo sapiens GATA binding protein 1 (globin transcription factor
            1), mRNA (cDNA clone MGC:13628 IMAGE:4048082), complete cds.
ACCESSION   BC009797
VERSION     BC009797.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1174)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 19 Row: b Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 32483408.
FEATURES             Location/Qualifiers
     source          1..1174
                     /db_xref="H-InvDB:HIT000034585"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:13628 IMAGE:4048082"
                     /tissue_type="Bone marrow, chronic myelogenous leukemia"
                     /clone_lib="NIH_MGC_54"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     gene            1..1174
                     /gene="GATA1"
                     /gene_synonym="ERYF1"
                     /gene_synonym="NFE1"
                     /db_xref="GeneID:2623"
                     /db_xref="HGNC:HGNC:4170"
                     /db_xref="MIM:305371"
     CDS             95..1102
                     /gene="GATA1"
                     /gene_synonym="ERYF1"
                     /gene_synonym="NFE1"
                     /codon_start=1
                     /product="GATA1 protein"
                     /protein_id="AAH09797.1"
                     /db_xref="GeneID:2623"
                     /db_xref="HGNC:HGNC:4170"
                     /db_xref="MIM:305371"
                     /translation="MEFPGLGSLGTSEPLPQFVDPALVSSTPESGVFFPSGPEGLDAA
                     ASSTAPSTATAAAAALAYYRDAEAYRHSPVFQVYPLLNCMEGIPGGSPYAGWAYGKTG
                     LYPASTVCPTREDSPPQAVEDLDGKGSTSFLETLKTERLSPDLLTLGPALPSSLPVPN
                     SAYGGPDFSSTFFSPTGSPLNSAAYSSPKLRGTLPLPPCEARECVNCGATATPLWRRD
                     RTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCTNCQTTTTTLWRRNASGD
                     PVCNACGLYYKLHHQHYCGGSAQLMRAQSMASRGGVVSFSSCSQNSGQPKSLGPRHPL
                     A"
BASE COUNT          264 a          391 c          297 g          222 t
ORIGIN      
        1 ggggacaccc cctgggatca cactgagctt gccacatccc caaggcggcc gaaccctccg
       61 caaccaccag cccaggttaa tccccagagg ctccatggag ttccctggcc tggggtccct
      121 ggggacctca gagcccctcc cccagtttgt ggatcctgct ctggtgtcct ccacaccaga
      181 atcaggggtt ttcttcccct ctgggcctga gggcttggat gcagcagctt cctccactgc
      241 cccgagcaca gccaccgctg cagctgcggc actggcctac tacagggacg ctgaggccta
      301 cagacactcc ccagtctttc aggtgtaccc attgctcaac tgtatggagg ggatcccagg
      361 gggctcacca tatgccggct gggcctacgg caagacgggg ctctaccctg cctcaactgt
      421 gtgtcccacc cgcgaggact ctcctcccca ggccgtggaa gatctggatg gaaaaggcag
      481 caccagcttc ctggagactt tgaagacaga gcggctgagc ccagacctcc tgaccctggg
      541 acctgcactg ccttcatcac tccctgtccc caatagtgct tatgggggcc ctgacttttc
      601 cagtaccttc ttttctccca ccgggagccc cctcaattca gcagcctatt cctctcccaa
      661 gcttcgtgga actctccccc tgcctccctg tgaggccagg gagtgtgtga actgcggagc
      721 aacagccact ccactgtggc ggagggacag gacaggccac tacctatgca acgcctgcgg
      781 cctctatcac aagatgaatg ggcagaacag gcccctcatc cggcccaaga agcgcctgat
      841 tgtcagtaaa cgggcaggta ctcagtgcac caactgccag acgaccacca cgacactgtg
      901 gcggagaaat gccagtgggg atcccgtgtg caatgcctgc ggcctctact acaagctaca
      961 ccaccagcac tactgtggtg gctccgctca gctcatgagg gcacagagca tggcctccag
     1021 aggaggggtg gtgtccttct cctcttgtag ccagaattct ggacaaccca agtctctggg
     1081 ccccaggcac cccctggctt gaaccttcaa agcttttgta aaataaaacc accaaagtcc
     1141 tgataaaaaa aaaaaaaaaa aaaaaaaaaa aaaa
//