LOCUS       BC002557                2664 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:2306
            IMAGE:3139044), complete cds.
ACCESSION   BC002557
VERSION     BC002557.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2664)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M.,
            Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A.,
            Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A.,
            Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2664)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC002557.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,
            McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S.,
            Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 4 Row: l Column: 11
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 31982886.
FEATURES             Location/Qualifiers
     source          1..2664
                     /db_xref="H-InvDB:HIT000030989"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:2306 IMAGE:3139044"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_21"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..2664
                     /gene="GATA2"
                     /gene_synonym="MGC2306"
                     /gene_synonym="NFE1B"
                     /db_xref="GeneID:2624"
                     /db_xref="HGNC:HGNC:4171"
                     /db_xref="MIM:137295"
     CDS             335..1777
                     /gene="GATA2"
                     /gene_synonym="MGC2306"
                     /gene_synonym="NFE1B"
                     /codon_start=1
                     /product="GATA binding protein 2"
                     /protein_id="AAH02557.1"
                     /db_xref="GeneID:2624"
                     /db_xref="HGNC:HGNC:4171"
                     /db_xref="MIM:137295"
                     /translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE
                     VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK
                     AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT
                     PTATHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT
                     ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS
                     SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK
                     PKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEG
                     IQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSG
                     HILPTPTPIHPSSSLSFGHPHPSSMVTAMG"
BASE COUNT      558 a    932 c    738 g    436 t
ORIGIN
        1 gtgagcgcca ggaaggtagc gaggccagcg tcgccccggg actcgctgct caagtctgtc
       61 tattgcctgc cgccacatcc atcctagcag ggccccgtcg cccaccaggc ggacaaaagc
      121 ggtccgctga acaccatgcg gccgctcggc gtgccgccca ggctctgctg gtgagcgccg
      181 ccaccccgcg cccaggtccc gcgagcccgc ctgccgcgca cctcgccctg ctcccagctc
      241 tactccaggc cccgtccgcc cgggggcgcc gcccaccgcg cctcgctcgg gccgttgccg
      301 tctgcaccca gaccctgagc cgccgccgcc ggccatggag gtggcgccgg agcagccgcg
      361 ctggatggcg cacccggccg tgctgaatgc gcagcacccc gactcacacc acccgggcct
      421 ggcgcacaac tacatggaac ccgcgcagct gctgcctcca gacgaggtgg acgtcttctt
      481 caatcacctc gactcgcagg gcaaccccta ctatgccaac cccgctcacg cgcgggcgcg
      541 cgtctcctac agccccgcgc acgcccgcct gaccggaggc cagatgtgcc gcccacactt
      601 gttgcacagc ccgggtttgc cctggctgga cgggggcaaa gcagccctct ctgccgctgc
      661 ggcccaccac cacaacccct ggaccgtgag ccccttctcc aagacgccac tgcacccctc
      721 agctgctgga ggccctggag gcccactctc tgtgtaccca ggggctgggg gtgggagcgg
      781 gggaggcagc gggagctcag tggcctccct cacccctaca gcaacccact ctggctccca
      841 ccttttcggc ttcccaccca cgccacccaa agaagtgtct cctgacccta gcaccacggg
      901 ggctgcgtct ccagcctcat cttccgcggg gggtagtgca gcccgaggag aggacaagga
      961 cggcgtcaag taccaggtgt cactgacgga gagcatgaag atggaaagtg gcagtcccct
     1021 gcgcccaggc ctagctacta tgggcaccca gcctgctaca caccacccca tccccaccta
     1081 cccctcctat gtgccggcgg ctgcccacga ctacagcagc ggactcttcc accccggagg
     1141 cttcctgggg ggaccggcct ccagcttcac ccctaagcag cgcagcaagg ctcgttcctg
     1201 ttcagaaggc cgggagtgtg tcaactgtgg ggccacagcc acccctctct ggcggcggga
     1261 cggcaccggc cactacctgt gcaatgcctg tggcctctac cacaagatga atgggcagaa
     1321 ccgaccactc atcaagccca agcgaagact gtcggccgcc agaagagccg gcacctgttg
     1381 tgcaaattgt cagacgacaa ccaccacctt atggcgccga aacgccaacg gggaccctgt
     1441 ctgcaacgcc tgtggcctct actacaagct gcacaatgtt aacaggccac tgaccatgaa
     1501 gaaggaaggg atccagactc ggaaccggaa gatgtccaac aagtccaaga agagcaagaa
     1561 aggggcggag tgcttcgagg agctgtcaaa gtgcatgcag gagaagtcat cccccttcag
     1621 tgcagctgcc ctggctggac acatggcacc tgtgggccac ctcccgccct tcagccactc
     1681 cggacacatc ctgcccactc cgacgcccat ccacccctcc tccagcctct ccttcggcca
     1741 cccccacccg tccagcatgg tgaccgccat gggctaggga acagatggac gtcgaggacc
     1801 gggcactccc gggatgggtg gaccaaaccc ttagcagccc agcatttccc gaaggccgac
     1861 accactcctg ccagcccggc tcggcccagc accccctctc ctggagggcg cccagcagcc
     1921 tgccagcagt tactgtgaat gttccccacc gctgagaggc tgcctccgca cctgaccgct
     1981 gcccaggtgg ggtttcctgc atggacagtt gtttggagaa caacaaggac aactttatgt
     2041 agagaaaagg aggggacggg acagacgaag gcaaccattt ttagaaggaa aaaggattag
     2101 gcaaaaataa tttattttgc tcttgtttct aacaaggact tggagacttg gtggtctgag
     2161 ctgtcccaag tcctccggtt cttcctcggg attggcgggt ccacttgcca gggctctggg
     2221 ggcagatttg tggggacctc agcctgcacc ctcttctcct ctggcttccc tctctgaaat
     2281 agccgaactc caggctgggc tgagccaaag ccagagtggc cacggcccag ggagggtgag
     2341 ctggtgcctg ctttgacggg ccaggccctg gagggcagag acaatcacgg gcggtcctgc
     2401 acagattccc aggccagggc tgggtcacag gaaggaaaca acattttctt gaaaggggaa
     2461 acgtctccca gatcgctccc ttggctttga ggccgaagct gctgtgactg tgtcccctta
     2521 ctgagcgcaa gccacagcct gtcttgtcag gtggaccctg taaatacatc ctttttctgc
     2581 taacccttca accccctcgc ctcctactct gagacaaaaa aaaaaaaaaa aaaaaaaaaa
     2641 aaaaaaaaaa aaaaaaaaaa aaaa
//