LOCUS BC051342 3383 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:59669 IMAGE:6341380), complete cds. ACCESSION BC051342 VERSION BC051342.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3383) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3383) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (25-APR-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 47 Row: k Column: 20 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31982886. FEATURES Location/Qualifiers source 1..3383 /db_xref="H-InvDB:HIT000053633" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:59669 IMAGE:6341380" /tissue_type="Brain, neuroblastoma" /clone_lib="NIH_MGC_47" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..3383 /gene="GATA2" /gene_synonym="MGC2306" /gene_synonym="NFE1B" /db_xref="GeneID:2624" /db_xref="HGNC:HGNC:4171" /db_xref="MIM:137295" CDS 333..1775 /gene="GATA2" /gene_synonym="MGC2306" /gene_synonym="NFE1B" /codon_start=1 /product="GATA binding protein 2" /protein_id="AAH51342.1" /db_xref="GeneID:2624" /db_xref="HGNC:HGNC:4171" /db_xref="MIM:137295" /translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT PTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK PKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEG IQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSG HILPTPTPIHPSSSLSFGHPHPSSMVTAMG" BASE COUNT 710 a 1059 c 950 g 664 t ORIGIN 1 gagcgccagg aaggtagcga ggccagcgtc gccccgggac tcgctgctca agtctgtcta 61 ttgcctgccg ccacatccat cctagcaggg ccccgtcgcc caccaggcgg acaaaagcgg 121 tccgctgaac accatgcggc cgctcggcgt gccgcccagg ctctgctggt gagcgccgcc 181 accccgcgcc caggtcccgc gagcccgcct gccgcgcacc tcgccctgct cccagctcta 241 ctccaggccc cgtccgcccg ggggcgccgc ccaccgcgcc tcgctcgggc cgttgccgtc 301 tgcacccaga ccctgagccg ccgccgccgg ccatggaggt ggcgccggag cagccgcgct 361 ggatggcgca cccggccgtg ctgaatgcgc agcaccccga ctcacaccac ccgggcctgg 421 cgcacaacta catggaaccc gcgcagctgc tgcctccaga cgaggtggac gtcttcttca 481 atcacctcga ctcgcagggc aacccctact atgccaaccc cgctcacgcg cgggcgcgcg 541 tctcctacag ccccgcgcac gcccgcctga ccggaggcca gatgtgccgc ccacacttgt 601 tgcacagccc gggtttgccc tggctggacg ggggcaaagc agccctctct gccgctgcgg 661 cccaccacca caacccctgg accgtgagcc ccttctccaa gacgccactg cacccctcag 721 ctgctggagg ccctggaggc ccactctctg tgtacccagg ggctgggggt gggagcgggg 781 gaggcagcgg gagctcagtg gcctccctca cccctacagc agcccactct ggctcccacc 841 ttttcggctt cccacccacg ccacccaaag aagtgtctcc tgaccctagc accacggggg 901 ctgcgtctcc agcctcatct tccgcggggg gtagtgcagc ccgaggagag gacaaggacg 961 gcgtcaagta ccaggtgtca ctgacggaga gcatgaagat ggaaagtggc agtcccctgc 1021 gcccaggcct agctactatg ggcacccagc ctgctacaca ccaccccatc cccacctacc 1081 cctcctatgt gccggcggct gcccacgact acagcagcgg actcttccac cccggaggct 1141 tcctgggggg accggcctcc agcttcaccc ctaagcagcg cagcaaggct cgttcctgtt 1201 cagaaggccg ggagtgtgtc aactgtgggg ccacagccac ccctctctgg cggcgggacg 1261 gcaccggcca ctacctgtgc aatgcctgtg gcctctacca caagatgaat gggcagaacc 1321 gaccactcat caagcccaag cgaagactgt cggccgccag aagagccggc acctgttgtg 1381 caaattgtca gacgacaacc accaccttat ggcgccgaaa cgccaacggg gaccctgtct 1441 gcaacgcctg tggcctctac tacaagctgc acaatgttaa caggccactg accatgaaga 1501 aggaagggat ccagactcgg aaccggaaga tgtccaacaa gtccaagaag agcaagaaag 1561 gggcggagtg cttcgaggag ctgtcaaagt gcatgcagga gaagtcatcc cccttcagtg 1621 cagctgccct ggctggacac atggcacctg tgggccacct cccgcccttc agccactccg 1681 gacacatcct gcccactccg acgcccatcc acccctcctc cagcctctcc ttcggccacc 1741 cccacccgtc cagcatggtg accgccatgg gctagggaac agatggacgt cgaggaccgg 1801 gcactcccgg gatgggtgga ccaaaccctt agcagcccag catttcccga aggccgacac 1861 cactcctgcc agcccggctc ggcccagcac cccctctcct ggagggcgcc cagcagcctg 1921 ccagcagtta ctgtgaatgt tccccaccgc tgagaggctg cctccgcacc tgactgctgc 1981 ccaggtgggg tttcctgcat ggacagttgt ttggagaaca acaaggacaa ctttatgtag 2041 agaaaaggag gggacgggac agacgaaggc aaccattttt agaaggaaaa aggattaggc 2101 aaaaataatt tattttgctc ttgtttctaa caaggacttg gagacttggt ggtctgagct 2161 gtcccaagtc ctccggttct tcctcgggat tggcgggtcc acttgccagg gctctggggg 2221 cagatttgtg gggacctcag cctgcaccct cttctcttct ggcttccctc tctgaaatag 2281 ccgaactcca ggctgggctg agccaaagcc agagtggcca cggcccaggg agggtgagct 2341 ggtgcctgct ttgacgggcc aggccctgga gggcagagac aatcacgggc ggtcctgcac 2401 agattcccag gccagggctg ggtcacagga aggaaacaac attttcttga aaggggaaac 2461 gtctcccaga tcgctccctt ggctttgagg ccgaagctgc tgtgactgtg tccccttact 2521 gagcgcaagc cacagcctgt cttgtcaggt ggaccctgta aatacatcct ttttctgcta 2581 acccttcaac cccctcgcct cctactctga gacaaaagaa aaaatattaa aaaaatgcat 2641 aggcttaact cgctgatgag ttaattgttt tatttttaaa ctctttttgg gtccagttga 2701 ttgtacgtag ccacaggagc cctgctatga aaggaataaa acctacacac aaggttggag 2761 ctttgcaatt ctttttggaa aagagctggg atcccacagc cctagtatga aagctggggg 2821 tggggagggg cctttgctgc ccttggtttc tgggggctgg ttggcatttg ctggcctggc 2881 agggggtgaa ggcaggagtt gggggcaggt caggaccagg acccagggag aggctgtgtc 2941 cctgctgggg tctcaggtcc agctttactg tggctgtctg gatccttccc aaggtacagc 3001 tgtatataaa cgtgtcccga gcttagattc tgtatgcggt gacggcgggg tgtggtggcc 3061 tgtgaggggc ccctggccca ggaggaggat tgtgctgatg tagtgaccaa gtgcaatatg 3121 ggcgggcagt cgctgcaggg agcaccacgg ccagaagtaa cttattttgt actagtgtcc 3181 gcataagaaa aagaatcggc agtattttct gtttttatgt tttatttggc ttgttttatt 3241 ttggattagt gaactaagtt attgttaatt atgtacaaca tttatatatt gtctgtaaaa 3301 aatgtatgct atcctcttat tcctttaaag tgagtactgt taagaataat aaaatacttt 3361 ttgtgaaaaa aaaaaaaaaa aaa //