LOCUS BC015577 2549 bp mRNA linear HUM 06-OCT-2003 DEFINITION Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:23124 IMAGE:4127146), complete cds. ACCESSION BC015577 VERSION BC015577.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2549) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2549) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (04-OCT-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC015577.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 33 Row: a Column: 23 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 20149534. FEATURES Location/Qualifiers source 1..2549 /db_xref="H-InvDB:HIT000037196" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:23124 IMAGE:4127146" /tissue_type="Brain, neuroblastoma" /clone_lib="NIH_MGC_19" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2549 /gene="GATA2" /gene_synonym="MGC2306" /gene_synonym="NFE1B" /db_xref="GeneID:2624" /db_xref="MIM:137295" CDS 252..1652 /gene="GATA2" /gene_synonym="MGC2306" /gene_synonym="NFE1B" /codon_start=1 /product="GATA2 protein" /protein_id="AAH15577.1" /db_xref="GeneID:2624" /db_xref="MIM:137295" /translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT PTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK PKRRLTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEGIQTRNRKMSNKSKK SKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSGHILPTPTPIHPSSS LSFGHPHPSSMVTAMG" misc_feature 1122..1241 /gene="GATA2" /gene_synonym="MGC2306" /gene_synonym="NFE1B" /note="GATA; Region: GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the domain" /db_xref="CDD:pfam00320" BASE COUNT 531 a 881 c 711 g 426 t ORIGIN 1 ggcagctggc gccagggcgg ccggaggatg ccgaggggcc ggagccgggc gggcccgagg 61 ccgaggcgca ctctaccccc agctcctacc ctgtaagccc cgccagcctc cggacgtgct 121 gtccctgggc ccgtcgccct ctgggctccc gccggaactc cttcactctc agaggccgag 181 tccctcccct ccccacggct gcgtgtggcc gttgccgtct gcacccagac cctgagccgc 241 cgccgccggc catggaggtg gcgccggagc agccgcgctg gatggcgcac ccggccgtgc 301 tgaatgcgca gcaccccgac tcacaccacc cgggcctggc gcacaactac atggaacccg 361 cgcagctgct gcctccagac gaggtggacg tcttcttcaa tcacctcgac tcgcagggca 421 acccctacta tgccaacccc gctcacgcgc gggcgcgcgt ctcctacagc cccgcgcacg 481 cccgcctgac cggaggccag atgtgccgcc cacacttgtt gcacagcccg ggtttgccct 541 ggctggacgg gggcaaagca gccctctctg ccgctgcggc ccaccaccac aacccctgga 601 ccgtgagccc cttctccaag acgccactgc acccctcagc tgctggaggc cctggaggcc 661 cactctctgt gtacccaggg gctgggggtg ggagcggggg aggcagcggg agctcagtgg 721 cctccctcac ccctacagca gcccactctg gctcccacct tttcggcttc ccacccacgc 781 cacccaaaga agtgtctcct gaccctagca ccacgggggc tgcgtctcca gcctcatctt 841 ccgcgggggg tagtgcagcc cgaggagagg acaaggacgg cgtcaagtac caggtgtcac 901 tgacggagag catgaagatg gaaagtggca gtcccctgcg cccaggccta gctactatgg 961 gcacccagcc tgctacacac caccccatcc ccacctaccc ctcctatgtg ccggcggctg 1021 cccacgacta cagcagcgga ctcttccacc ccggaggctt cctgggggga ccggcctcca 1081 gcttcacccc taagcagcgc agcaaggctc gttcctgttc agaaggccgg gagtgtgtca 1141 actgtggggc cacagccacc cctctctggc ggcgggacgg caccggccac tacctgtgca 1201 atgcctgtgg cctctaccac aagatgaatg ggcagaaccg accactcatc aagcccaagc 1261 gaagactgac gacaaccacc accttatggc gccgaaacgc caacggggac cctgtctgca 1321 acgcctgtgg cctctactac aagctgcaca atgttaacag gccactgacc atgaagaagg 1381 aagggatcca gactcggaac cggaagatgt ccaacaagtc caagaagagc aagaaagggg 1441 cggagtgctt cgaggagctg tcaaagtgca tgcaggagaa gtcatccccc ttcagtgcag 1501 ctgccctggc tggacacatg gcacctgtgg gccacctccc gcccttcagc cactccggac 1561 acatcctgcc cactccgacg cccatccacc cctcctccag cctctccttc ggccaccccc 1621 acccgtccag catggtgacc gccatgggct agggaacaga tggacgtcga ggaccgggca 1681 ctcccgggat gggtggacca aacccttagc agcccagcat ttcccgaagg ccgacaccac 1741 tcctgccagc ccggctcggc ccagcacccc ctctcctgga gggcgcccag cagcctgcca 1801 gcagttactg tgaatgttcc ccaccgctga gaggctgcct ccgcacctga ccgctgccca 1861 ggtggggttt cctgcatgga cagttgtttg gagaacaaca aggacaactt tatgtagaga 1921 aaaggagggg acgggacaga cgaaggcaac catttttaga aggaaaaagg attaggcaaa 1981 aataatttat tttgctcttg tttctaacaa ggacttggag acttggtggt ctgagctgtc 2041 ccaagtcctc cggttcttcc tcgggattgg cgggtccact tgccagggct ctgggggcag 2101 atttgtgggg acctcagcct gcaccctctt ctcctctggc ttccctctct gaaatagccg 2161 aactccaggc tgggctgagc caaagccaga gtggccacgg cccagggagg gtgagctggt 2221 gcctgctttg acgggccagg ccctggaggg cagagacaat cacgggcggt cctgcacaga 2281 ttcccaggcc agggctgggt cacaggaagg aaacaacatt ttcttgaaag gggaaacgtc 2341 tcccagatcg ctcccttggc tttgaggccg aagctgctgt gactgtgtcc ccttactgag 2401 cgcaagccac agcctgtctt gtcaggtgga ccctgtaaat acatcctttt tctgctaacc 2461 cttcaacccc ctcgcctcct actctgagac aaaagaaaaa atattaaaaa aatgcatagg 2521 cttaaaaaaa aaaaaaaaaa aaaaaaaaa //