LOCUS       BC006793                2416 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens GATA binding protein 3, mRNA (cDNA clone MGC:5199
            IMAGE:2985843), complete cds.
ACCESSION   BC006793
VERSION     BC006793.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2416)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2416)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (27-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 3 Row: p Column: 9
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 50541957.
FEATURES             Location/Qualifiers
     source          1..2416
                     /db_xref="H-InvDB:HIT000032940"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:5199 IMAGE:2985843"
                     /tissue_type="Cervix, carcinoma"
                     /clone_lib="NIH_MGC_12"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2416
                     /gene="GATA3"
                     /gene_synonym="HDR"
                     /gene_synonym="MGC2346"
                     /gene_synonym="MGC5199"
                     /gene_synonym="MGC5445"
                     /db_xref="GeneID:2625"
                     /db_xref="HGNC:HGNC:4172"
                     /db_xref="MIM:131320"
     CDS             560..1894
                     /gene="GATA3"
                     /gene_synonym="HDR"
                     /gene_synonym="MGC2346"
                     /gene_synonym="MGC5199"
                     /gene_synonym="MGC5445"
                     /codon_start=1
                     /product="GATA binding protein 3"
                     /protein_id="AAH06793.1"
                     /db_xref="GeneID:2625"
                     /db_xref="HGNC:HGNC:4172"
                     /db_xref="MIM:131320"
                     /translation="MEVTADQPRWVSHHHPAVLNGQHPDTHHPGLSHSYMDAAQYPLP
                     EEVDVLFNIDGQGNHVPPYYGNSVRATVQRYPPTHHGSQVCRPPLLHGSLPWLDGGKA
                     LGSHHTASPWNLSPFSKTSIHHGSPGPLSVYPPASSSSLSGGHASPHLFTFPPTPPKD
                     VSPDPSLSTPGSAGSARQDEKECLKYQVPLPDSMKLESSHSRGSMTALGGASSSTHHP
                     ITTYPPYVPEYSSGLFPPSSLLGGSPTGFGCKSRPKARSSTEGRECVNCGATSTPLWR
                     RDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTTTTTLWRRNAN
                     GDPVCNACGLYYKLHNINRPLTMKKEGIQTRNRKMSSKSKKCKKVHDSLEDFPKNSSF
                     NPAALSRHMSSLSHISPFSHSSHMLTTPTPMHPPSSLSFGPHHPSSMVTAMG"
BASE COUNT          590 a          793 c          589 g          444 t
ORIGIN      
        1 ggcgccgtct tgatactttc agaaagaatg cattccctgt aaaaaaaaaa aaaaaaaata
       61 ctgagagagg gagagagaga gagaagaaga gagagagacg gagggagagc gagacagagc
      121 gagcaacgca atctgaccga gcaggtcgta cgccgccgcc tcctcctcct ctctgctctt
      181 cgctacccag gtgacccgag gagggactcc gcctccgagc ggctgaggac cccggtgcag
      241 aggagcctgg ctcgcagaat tgcagagtcg tcgccccttt ttacaacctg gtcccgtttt
      301 attctgccat acccagtttt tggatttttg tcttcccctt cttctctttg ctaaacgacc
      361 cctccaagat aatttttaaa aaaccttctc ctttgctcac ctttgcttcc cagccttccc
      421 atccccccac cgaaagcaaa tcattcaacg acccccgacc ctccgacggc aggagccccc
      481 cgacctccca ggcggaccgc cctccctccc cgcgcgcggg ttccgggccc ggcgagaggg
      541 cgcgagcaca gccgaggcca tggaggtgac ggcggaccag ccgcgctggg tgagccacca
      601 ccaccccgcc gtgctcaacg ggcagcaccc ggacacgcac cacccgggcc tcagccactc
      661 ctacatggac gcggcgcagt acccgctgcc ggaggaggtg gatgtgcttt ttaacatcga
      721 cggtcaaggc aaccacgtcc cgccctacta cggaaactcg gtcagggcca cggtgcagag
      781 gtaccctccg acccaccacg ggagccaggt gtgccgcccg cctctgcttc atggatccct
      841 accctggctg gacggcggca aagccctggg cagccaccac accgcctccc cctggaatct
      901 cagccccttc tccaagacgt ccatccacca cggctccccg gggcccctct ccgtctaccc
      961 cccggcctcg tcctcctcct tgtcgggggg ccacgccagc ccgcacctct tcaccttccc
     1021 gcccaccccg ccgaaggacg tctccccgga cccatcgctg tccaccccag gctcggccgg
     1081 ctcggcccgg caggacgaga aagagtgcct caagtaccag gtgcccctgc ccgacagcat
     1141 gaagctggag tcgtcccact cccgtggcag catgaccgcc ctgggtggag cctcctcgtc
     1201 gacccaccac cccatcacca cctacccgcc ctacgtgccc gagtacagct ccggactctt
     1261 cccccccagc agcctgctgg gcggctcccc caccggcttc ggatgcaagt ccaggcccaa
     1321 ggcccggtcc agcacagaag gcagggagtg tgtgaactgt ggggcaacct cgaccccact
     1381 gtggcggcga gatggcacgg gacactacct gtgcaacgcc tgcgggctct atcacaaaat
     1441 gaacggacag aaccggcccc tcattaagcc caagcgaagg ctgtctgcag ccaggagagc
     1501 agggacgtcc tgtgcgaact gtcagaccac cacaaccaca ctctggagga ggaatgccaa
     1561 tggggaccct gtctgcaatg cctgtgggct ctactacaag cttcacaata ttaacagacc
     1621 cctgactatg aagaaggaag gcatccagac cagaaaccga aaaatgtcta gcaaatccaa
     1681 aaagtgcaaa aaagtgcatg actcactgga ggacttcccc aagaacagct cgtttaaccc
     1741 ggccgccctc tccagacaca tgtcctccct gagccacatc tcgcccttca gccactccag
     1801 ccacatgctg accacgccca cgccgatgca cccgccatcc agcctgtcct ttggaccaca
     1861 ccacccctcc agcatggtca ccgccatggg ttagagccct gctcgatgct cacagggccc
     1921 ccagcgagag tccctgcagt ccctttcgac ttgcattttt gcaggagcag tatcatgaag
     1981 cctaaacgcg atggatatat gtttttgaag gcagaaagca aaattatgtt tgccactttg
     2041 caaaggagct cactgtggtg tctgtgttcc aaccactgaa tctggacccc atctgtgaat
     2101 aagccattct gactcatatc ccctatttaa cagggtctct agtgctgtga aaaaaaaaaa
     2161 tgctgaacat tgcatataac ttatattgta agaaatactg tacaatgact ttattgcatc
     2221 tgggtagctg taaggcatga aggatgccaa gaagtttaag gaatatggga gaaatagtgt
     2281 ggaaattaag aagaaactag gtctgatatt caaatggaca aactgccagt tttgtttcct
     2341 ttcactggcc acagttgttt gatgcattaa aagaaaataa aaaaaagaaa aaagagaaaa
     2401 gaaaaaaaaa aaaaaa
//