LOCUS       BC015577                2549 bp    mRNA    linear   HUM 06-OCT-2003
DEFINITION  Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:23124
            IMAGE:4127146), complete cds.
ACCESSION   BC015577
VERSION     BC015577.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2549)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2549)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (04-OCT-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC015577.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 33 Row: a Column: 23
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 20149534.
FEATURES             Location/Qualifiers
     source          1..2549
                     /db_xref="H-InvDB:HIT000037196"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:23124 IMAGE:4127146"
                     /tissue_type="Brain, neuroblastoma"
                     /clone_lib="NIH_MGC_19"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2549
                     /gene="GATA2"
                     /gene_synonym="MGC2306"
                     /gene_synonym="NFE1B"
                     /db_xref="GeneID:2624"
                     /db_xref="MIM:137295"
     CDS             252..1652
                     /gene="GATA2"
                     /gene_synonym="MGC2306"
                     /gene_synonym="NFE1B"
                     /codon_start=1
                     /product="GATA2 protein"
                     /protein_id="AAH15577.1"
                     /db_xref="GeneID:2624"
                     /db_xref="MIM:137295"
                     /translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE
                     VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK
                     AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT
                     PTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT
                     ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS
                     SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK
                     PKRRLTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEGIQTRNRKMSNKSKK
                     SKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSGHILPTPTPIHPSSS
                     LSFGHPHPSSMVTAMG"
     misc_feature    1122..1241
                     /gene="GATA2"
                     /gene_synonym="MGC2306"
                     /gene_synonym="NFE1B"
                     /note="GATA; Region: GATA zinc finger. This domain uses
                     four cysteine residues to coordinate a zinc ion. This
                     domain binds to DNA. Two GATA zinc fingers are found in
                     the GATA transcription factors. However there are several
                     proteins which only contains a single copy of the domain"
                     /db_xref="CDD:pfam00320"
BASE COUNT          531 a          881 c          711 g          426 t
ORIGIN      
        1 ggcagctggc gccagggcgg ccggaggatg ccgaggggcc ggagccgggc gggcccgagg
       61 ccgaggcgca ctctaccccc agctcctacc ctgtaagccc cgccagcctc cggacgtgct
      121 gtccctgggc ccgtcgccct ctgggctccc gccggaactc cttcactctc agaggccgag
      181 tccctcccct ccccacggct gcgtgtggcc gttgccgtct gcacccagac cctgagccgc
      241 cgccgccggc catggaggtg gcgccggagc agccgcgctg gatggcgcac ccggccgtgc
      301 tgaatgcgca gcaccccgac tcacaccacc cgggcctggc gcacaactac atggaacccg
      361 cgcagctgct gcctccagac gaggtggacg tcttcttcaa tcacctcgac tcgcagggca
      421 acccctacta tgccaacccc gctcacgcgc gggcgcgcgt ctcctacagc cccgcgcacg
      481 cccgcctgac cggaggccag atgtgccgcc cacacttgtt gcacagcccg ggtttgccct
      541 ggctggacgg gggcaaagca gccctctctg ccgctgcggc ccaccaccac aacccctgga
      601 ccgtgagccc cttctccaag acgccactgc acccctcagc tgctggaggc cctggaggcc
      661 cactctctgt gtacccaggg gctgggggtg ggagcggggg aggcagcggg agctcagtgg
      721 cctccctcac ccctacagca gcccactctg gctcccacct tttcggcttc ccacccacgc
      781 cacccaaaga agtgtctcct gaccctagca ccacgggggc tgcgtctcca gcctcatctt
      841 ccgcgggggg tagtgcagcc cgaggagagg acaaggacgg cgtcaagtac caggtgtcac
      901 tgacggagag catgaagatg gaaagtggca gtcccctgcg cccaggccta gctactatgg
      961 gcacccagcc tgctacacac caccccatcc ccacctaccc ctcctatgtg ccggcggctg
     1021 cccacgacta cagcagcgga ctcttccacc ccggaggctt cctgggggga ccggcctcca
     1081 gcttcacccc taagcagcgc agcaaggctc gttcctgttc agaaggccgg gagtgtgtca
     1141 actgtggggc cacagccacc cctctctggc ggcgggacgg caccggccac tacctgtgca
     1201 atgcctgtgg cctctaccac aagatgaatg ggcagaaccg accactcatc aagcccaagc
     1261 gaagactgac gacaaccacc accttatggc gccgaaacgc caacggggac cctgtctgca
     1321 acgcctgtgg cctctactac aagctgcaca atgttaacag gccactgacc atgaagaagg
     1381 aagggatcca gactcggaac cggaagatgt ccaacaagtc caagaagagc aagaaagggg
     1441 cggagtgctt cgaggagctg tcaaagtgca tgcaggagaa gtcatccccc ttcagtgcag
     1501 ctgccctggc tggacacatg gcacctgtgg gccacctccc gcccttcagc cactccggac
     1561 acatcctgcc cactccgacg cccatccacc cctcctccag cctctccttc ggccaccccc
     1621 acccgtccag catggtgacc gccatgggct agggaacaga tggacgtcga ggaccgggca
     1681 ctcccgggat gggtggacca aacccttagc agcccagcat ttcccgaagg ccgacaccac
     1741 tcctgccagc ccggctcggc ccagcacccc ctctcctgga gggcgcccag cagcctgcca
     1801 gcagttactg tgaatgttcc ccaccgctga gaggctgcct ccgcacctga ccgctgccca
     1861 ggtggggttt cctgcatgga cagttgtttg gagaacaaca aggacaactt tatgtagaga
     1921 aaaggagggg acgggacaga cgaaggcaac catttttaga aggaaaaagg attaggcaaa
     1981 aataatttat tttgctcttg tttctaacaa ggacttggag acttggtggt ctgagctgtc
     2041 ccaagtcctc cggttcttcc tcgggattgg cgggtccact tgccagggct ctgggggcag
     2101 atttgtgggg acctcagcct gcaccctctt ctcctctggc ttccctctct gaaatagccg
     2161 aactccaggc tgggctgagc caaagccaga gtggccacgg cccagggagg gtgagctggt
     2221 gcctgctttg acgggccagg ccctggaggg cagagacaat cacgggcggt cctgcacaga
     2281 ttcccaggcc agggctgggt cacaggaagg aaacaacatt ttcttgaaag gggaaacgtc
     2341 tcccagatcg ctcccttggc tttgaggccg aagctgctgt gactgtgtcc ccttactgag
     2401 cgcaagccac agcctgtctt gtcaggtgga ccctgtaaat acatcctttt tctgctaacc
     2461 cttcaacccc ctcgcctcct actctgagac aaaagaaaaa atattaaaaa aatgcatagg
     2521 cttaaaaaaa aaaaaaaaaa aaaaaaaaa
//