LOCUS       BC003070                2306 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens GATA binding protein 3, mRNA (cDNA clone MGC:2346
            IMAGE:3504200), complete cds.
ACCESSION   BC003070
VERSION     BC003070.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2306)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2306)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (12-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC003070.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 7 Row: a Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 50541957.
FEATURES             Location/Qualifiers
     source          1..2306
                     /db_xref="H-InvDB:HIT000031403"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:2346 IMAGE:3504200"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2306
                     /gene="GATA3"
                     /gene_synonym="HDR"
                     /gene_synonym="MGC2346"
                     /gene_synonym="MGC5199"
                     /gene_synonym="MGC5445"
                     /db_xref="GeneID:2625"
                     /db_xref="HGNC:HGNC:4172"
                     /db_xref="MIM:131320"
     CDS             452..1783
                     /gene="GATA3"
                     /gene_synonym="HDR"
                     /gene_synonym="MGC2346"
                     /gene_synonym="MGC5199"
                     /gene_synonym="MGC5445"
                     /codon_start=1
                     /product="GATA binding protein 3"
                     /protein_id="AAH03070.1"
                     /db_xref="GeneID:2625"
                     /db_xref="HGNC:HGNC:4172"
                     /db_xref="MIM:131320"
                     /translation="MEVTADQPRWVSHHHPAVLNGQHPDTHHPGLSHSYMDAAQYPLP
                     EEVDVLFNIDGQGNHVPPYYGNSVRATVQRYPPTHHGSQVCRPPLLHGSLPWLDGGKA
                     LGSHHTASPWNLSPFSKTSIHHGSPGPLSVYPPASSSSLSGGHASPHLFTFPPTPPKD
                     VSPDPSLSTPGSAGSARQDEKECLKYQVPLPDSMKLESSHSRGSMTALGGASSSTHHP
                     ITTYPPYVPEYSSGLFPPSSLLGGSPTGFGCKSRPKARSSTGRECVNCGATSTPLWRR
                     DGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTTTTTLWRRNANG
                     DPVCNACGLYYKLHNINRPLTMKKEGIQTRNRKMSSKSKKCKKVHDSLEDFPKNSSFN
                     PAALSRHMSSLSHISPFSHSSHMLTTPTPMHPPSSLSFGPHHPSSMVTAMG"
BASE COUNT      541 a    781 c    554 g    430 t
ORIGIN
        1 gcgagacaga gcgagcaacg caatctgacc gagcaggtcg tacgccgccg cctcctcctc
       61 ctctctgctc ttcgctaccc aggtgacccg aggagggact ccgcctccga gcggctgagg
      121 accccggtgc agaggagcct ggctcgcaga attgcagagt cgtcgcccct ttttacaacc
      181 tggtcccgtt ttattctgcc atacccagtt tttggatttt tgtcttcccc ttcttctctt
      241 tgctaaacga cccctccaag ataattttta aaaaaccttc tcctttgctc acctttgctt
      301 cccagccttc ccatcccccc accgaaagca aatcattcaa cgacccccga ccctccgacg
      361 gcaggagccc cccgacctcc caggcggacc gccctccctc cccgcgcgcg ggttccgggc
      421 ccggcgagag ggcgcgagca cagccgaggc catggaggtg acggcggacc agccgcgctg
      481 ggtgagccac caccaccccg ccgtgctcaa cgggcagcac ccggacacgc accacccggg
      541 cctcagccac tcctacatgg acgcggcgca gtacccgctg ccggaggagg tggatgtgct
      601 ttttaacatc gacggtcaag gcaaccacgt cccgccctac tacggaaact cggtcagggc
      661 cacggtgcag aggtaccctc cgacccacca cgggagccag gtgtgccgcc cgcctctgct
      721 tcatggatcc ctaccctggc tggacggcgg caaagccctg ggcagccacc acaccgcctc
      781 cccctggaat ctcagcccct tctccaagac gtccatccac cacggctccc cggggcccct
      841 ctccgtctac cccccggcct cgtcctcctc cttgtcgggg ggccacgcca gcccgcacct
      901 cttcaccttc ccgcccaccc cgccgaagga cgtctccccg gacccatcgc tgtccacccc
      961 aggctcggcc ggctcggccc ggcaggacga gaaagagtgc ctcaagtacc aggtgcccct
     1021 gcccgacagc atgaagctgg agtcgtccca ctcccgtggc agcatgaccg ccctgggtgg
     1081 agcctcctcg tcgacccacc accccatcac cacctacccg ccctacgtgc ccgagtacag
     1141 ctccggactc ttccccccca gcagcctgct gggcggctcc cccaccggct tcggatgcaa
     1201 gtccaggccc aaggcccggt ccagcacagg cagggagtgt gtgaactgtg gggcaacctc
     1261 gaccccactg tggcggcgag atggcacggg acactacctg tgcaacgcct gcgggctcta
     1321 tcacaaaatg aacggacaga accggcccct cattaagccc aagcgaaggc tgtctgcagc
     1381 caggagagca gggacgtcct gtgcgaactg tcagaccacc acaaccacac tctggaggag
     1441 gaatgccaat ggggaccctg tctgcaatgc ctgtgggctc tactacaagc ttcacaatat
     1501 taacagaccc ctgactatga agaaggaagg catccagacc agaaaccgaa aaatgtctag
     1561 caaatccaaa aagtgcaaaa aagtgcatga ctcactggag gacttcccca agaacagctc
     1621 gtttaacccg gccgccctct ccagacacat gtcctccctg agccacatct cgcccttcag
     1681 ccactccagc cacatgctga ccacgcccac gccgatgcac ccgccatcca gcctgtcctt
     1741 tggaccacac cacccctcca gcatggtcac cgccatgggt tagagccctg ctcgatgctc
     1801 acagggcccc cagcgagagt ccctgcagtc cctttcgact tgcatttttg caggagcagt
     1861 atcatgaagc ctaaacgcga tggatatatg tttttgaagg cagaaagcaa aattatgttt
     1921 gccactttgc aaaggagctc actgtggtgt ctgtgttcca accactgaat ctggacccca
     1981 tctgtgaata agccattctg actcatatcc cctatttaac agggtctcta gtgctgtgaa
     2041 aaaaaaaaaa tgctgaacat tgcatataac ttatattgta agaaatactg tacaatgact
     2101 ttattgcatc tgggtagctg taaggcatga aggatgccaa gaagtttaag gaatatggga
     2161 gaaatagtgt ggaaattaag aagaaactag gtctgatatt caaatggaca aactgccagt
     2221 tttgtttcct ttcactggcc acagttgttt gatgcattaa aagaaaataa aaaaaagaaa
     2281 aaagagaaaa aaaaaaaaaa aaaaaa
//