LOCUS BC006793 2416 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens GATA binding protein 3, mRNA (cDNA clone MGC:5199 IMAGE:2985843), complete cds. ACCESSION BC006793 VERSION BC006793.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2416) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2416) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (27-APR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 3 Row: p Column: 9 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 50541957. FEATURES Location/Qualifiers source 1..2416 /db_xref="H-InvDB:HIT000032940" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:5199 IMAGE:2985843" /tissue_type="Cervix, carcinoma" /clone_lib="NIH_MGC_12" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2416 /gene="GATA3" /gene_synonym="HDR" /gene_synonym="MGC2346" /gene_synonym="MGC5199" /gene_synonym="MGC5445" /db_xref="GeneID:2625" /db_xref="HGNC:HGNC:4172" /db_xref="MIM:131320" CDS 560..1894 /gene="GATA3" /gene_synonym="HDR" /gene_synonym="MGC2346" /gene_synonym="MGC5199" /gene_synonym="MGC5445" /codon_start=1 /product="GATA binding protein 3" /protein_id="AAH06793.1" /db_xref="GeneID:2625" /db_xref="HGNC:HGNC:4172" /db_xref="MIM:131320" /translation="MEVTADQPRWVSHHHPAVLNGQHPDTHHPGLSHSYMDAAQYPLP EEVDVLFNIDGQGNHVPPYYGNSVRATVQRYPPTHHGSQVCRPPLLHGSLPWLDGGKA LGSHHTASPWNLSPFSKTSIHHGSPGPLSVYPPASSSSLSGGHASPHLFTFPPTPPKD VSPDPSLSTPGSAGSARQDEKECLKYQVPLPDSMKLESSHSRGSMTALGGASSSTHHP ITTYPPYVPEYSSGLFPPSSLLGGSPTGFGCKSRPKARSSTEGRECVNCGATSTPLWR RDGTGHYLCNACGLYHKMNGQNRPLIKPKRRLSAARRAGTSCANCQTTTTTLWRRNAN GDPVCNACGLYYKLHNINRPLTMKKEGIQTRNRKMSSKSKKCKKVHDSLEDFPKNSSF NPAALSRHMSSLSHISPFSHSSHMLTTPTPMHPPSSLSFGPHHPSSMVTAMG" BASE COUNT 590 a 793 c 589 g 444 t ORIGIN 1 ggcgccgtct tgatactttc agaaagaatg cattccctgt aaaaaaaaaa aaaaaaaata 61 ctgagagagg gagagagaga gagaagaaga gagagagacg gagggagagc gagacagagc 121 gagcaacgca atctgaccga gcaggtcgta cgccgccgcc tcctcctcct ctctgctctt 181 cgctacccag gtgacccgag gagggactcc gcctccgagc ggctgaggac cccggtgcag 241 aggagcctgg ctcgcagaat tgcagagtcg tcgccccttt ttacaacctg gtcccgtttt 301 attctgccat acccagtttt tggatttttg tcttcccctt cttctctttg ctaaacgacc 361 cctccaagat aatttttaaa aaaccttctc ctttgctcac ctttgcttcc cagccttccc 421 atccccccac cgaaagcaaa tcattcaacg acccccgacc ctccgacggc aggagccccc 481 cgacctccca ggcggaccgc cctccctccc cgcgcgcggg ttccgggccc ggcgagaggg 541 cgcgagcaca gccgaggcca tggaggtgac ggcggaccag ccgcgctggg tgagccacca 601 ccaccccgcc gtgctcaacg ggcagcaccc ggacacgcac cacccgggcc tcagccactc 661 ctacatggac gcggcgcagt acccgctgcc ggaggaggtg gatgtgcttt ttaacatcga 721 cggtcaaggc aaccacgtcc cgccctacta cggaaactcg gtcagggcca cggtgcagag 781 gtaccctccg acccaccacg ggagccaggt gtgccgcccg cctctgcttc atggatccct 841 accctggctg gacggcggca aagccctggg cagccaccac accgcctccc cctggaatct 901 cagccccttc tccaagacgt ccatccacca cggctccccg gggcccctct ccgtctaccc 961 cccggcctcg tcctcctcct tgtcgggggg ccacgccagc ccgcacctct tcaccttccc 1021 gcccaccccg ccgaaggacg tctccccgga cccatcgctg tccaccccag gctcggccgg 1081 ctcggcccgg caggacgaga aagagtgcct caagtaccag gtgcccctgc ccgacagcat 1141 gaagctggag tcgtcccact cccgtggcag catgaccgcc ctgggtggag cctcctcgtc 1201 gacccaccac cccatcacca cctacccgcc ctacgtgccc gagtacagct ccggactctt 1261 cccccccagc agcctgctgg gcggctcccc caccggcttc ggatgcaagt ccaggcccaa 1321 ggcccggtcc agcacagaag gcagggagtg tgtgaactgt ggggcaacct cgaccccact 1381 gtggcggcga gatggcacgg gacactacct gtgcaacgcc tgcgggctct atcacaaaat 1441 gaacggacag aaccggcccc tcattaagcc caagcgaagg ctgtctgcag ccaggagagc 1501 agggacgtcc tgtgcgaact gtcagaccac cacaaccaca ctctggagga ggaatgccaa 1561 tggggaccct gtctgcaatg cctgtgggct ctactacaag cttcacaata ttaacagacc 1621 cctgactatg aagaaggaag gcatccagac cagaaaccga aaaatgtcta gcaaatccaa 1681 aaagtgcaaa aaagtgcatg actcactgga ggacttcccc aagaacagct cgtttaaccc 1741 ggccgccctc tccagacaca tgtcctccct gagccacatc tcgcccttca gccactccag 1801 ccacatgctg accacgccca cgccgatgca cccgccatcc agcctgtcct ttggaccaca 1861 ccacccctcc agcatggtca ccgccatggg ttagagccct gctcgatgct cacagggccc 1921 ccagcgagag tccctgcagt ccctttcgac ttgcattttt gcaggagcag tatcatgaag 1981 cctaaacgcg atggatatat gtttttgaag gcagaaagca aaattatgtt tgccactttg 2041 caaaggagct cactgtggtg tctgtgttcc aaccactgaa tctggacccc atctgtgaat 2101 aagccattct gactcatatc ccctatttaa cagggtctct agtgctgtga aaaaaaaaaa 2161 tgctgaacat tgcatataac ttatattgta agaaatactg tacaatgact ttattgcatc 2221 tgggtagctg taaggcatga aggatgccaa gaagtttaag gaatatggga gaaatagtgt 2281 ggaaattaag aagaaactag gtctgatatt caaatggaca aactgccagt tttgtttcct 2341 ttcactggcc acagttgttt gatgcattaa aagaaaataa aaaaaagaaa aaagagaaaa 2401 gaaaaaaaaa aaaaaa //