LOCUS BC015613 3135 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:23183 IMAGE:4811349), complete cds. ACCESSION BC015613 VERSION BC015613.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3135) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3135) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (04-OCT-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC015613.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 33 Row: m Column: 11 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31982886. FEATURES Location/Qualifiers source 1..3135 /db_xref="H-InvDB:HIT000037215" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:23183 IMAGE:4811349" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..3135 /gene="GATA2" /gene_synonym="MGC2306" /gene_synonym="NFE1B" /db_xref="GeneID:2624" /db_xref="HGNC:HGNC:4171" /db_xref="MIM:137295" CDS 82..1524 /gene="GATA2" /gene_synonym="MGC2306" /gene_synonym="NFE1B" /codon_start=1 /product="GATA binding protein 2" /protein_id="AAH15613.1" /db_xref="GeneID:2624" /db_xref="HGNC:HGNC:4171" /db_xref="MIM:137295" /translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT PTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK PKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEG IQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSG HILPTPTPIHPSSSLSFGHPHPSSMVTAMG" BASE COUNT 677 a 950 c 877 g 631 t ORIGIN 1 gtccgcccgg gggcgccgcc caccgcgcct cgctcgggcc gttgccgtct gcacccagac 61 cctgagccgc cgccgccggc catggaggtg gcgccggagc agccgcgctg gatggcgcac 121 ccggccgtgc tgaatgcgca gcaccccgac tcacaccacc cgggcctggc gcacaactac 181 atggaacccg cgcagctgct gcctccagac gaggtggacg tcttcttcaa tcacctcgac 241 tcgcagggca acccctacta tgccaacccc gctcacgcgc gggcgcgcgt ctcctacagc 301 cccgcgcacg cccgcctgac cggaggccag atgtgccgcc cacacttgtt gcacagcccg 361 ggtttgccct ggctggacgg gggcaaagca gccctctctg ccgctgcggc ccaccaccac 421 aacccctgga ccgtgagccc cttctccaag acgccactgc acccctcagc tgctggaggc 481 cctggaggcc cactctctgt gtacccaggg gctgggggtg ggagcggggg aggcagcggg 541 agctcagtgg cctccctcac ccctacagca gcccactctg gctcccacct tttcggcttc 601 ccacccacgc cacccaaaga agtgtctcct gaccctagca ccacgggggc tgcgtctcca 661 gcctcatctt ccgcgggggg tagtgcagcc cgaggagagg acaaggacgg cgtcaagtac 721 caggtgtcac tgacggagag catgaagatg gaaagtggca gtcccctgcg cccaggccta 781 gctactatgg gcacccagcc tgctacacac caccccatcc ccacctaccc ctcctatgtg 841 ccggcggctg cccacgacta cagcagcgga ctcttccacc ccggaggctt cctgggggga 901 ccggcctcca gcttcacccc taagcagcgc agcaaggctc gttcctgttc agaaggccgg 961 gagtgtgtca actgtggggc cacagccacc cctctctggc ggcgggacgg caccggccac 1021 tacctgtgca atgcctgtgg cctctaccac aagatgaatg ggcagaaccg accactcatc 1081 aagcccaagc gaagactgtc ggccgccaga agagccggca cctgttgtgc aaattgtcag 1141 acgacaacca ccaccttatg gcgccgaaac gccaacgggg accctgtctg caacgcctgt 1201 ggcctctact acaagctgca caatgttaac aggccactga ccatgaagaa ggaagggatc 1261 cagactcgga accggaagat gtccaacaag tccaagaaga gcaagaaagg ggcggagtgc 1321 ttcgaggagc tgtcaaagtg catgcaggag aagtcatccc ccttcagtgc agctgccctg 1381 gctggacaca tggcacctgt gggccacctc ccgcccttca gccactccgg acacatcctg 1441 cccactccga cgcccatcca cccctcctcc agcctctcct tcggccaccc ccacccgtcc 1501 agcatggtga ccgccatggg ctagggaaca gatggacgtc gaggaccggg cactcccggg 1561 atgggtggac caaaccctta gcagcccagc atttcccgaa ggccgacacc actcctgcca 1621 gcccggctcg gcccagcacc ccctctcctg gagggcgccc agcagcctgc cagcagttac 1681 tgtgaatgtt ccccaccgct gagaggctgc ctccgcacct gactgctgcc caggtggggt 1741 ttcctgcatg gacagttgtt tggagaacaa caaggacaac tttatgtaga gaaaaggagg 1801 ggacgggaca gacgaaggca accattttta gaaggaaaaa ggattaggca aaaataattt 1861 attttgctct tgtttctaac aaggacttgg agacttggtg gtctgagctg tcccaagtcc 1921 tccggttctt cctcgggatt ggcgggtcca cttgccaggg ctctgggggc agatttgtgg 1981 ggacctcagc ctgcaccctc ttctcttctg gcttccctct ctgaaatagc cgaactccag 2041 gctgggctga gccaaagcca gagtggccac ggcccaggga gggtgagctg gtgcctgctt 2101 tgacgggcca ggccctggag ggcagagaca atcacgggcg gtcctgcaca gattcccagg 2161 ccagggctgg gtcacaggaa ggaaacaaca ttttcttgaa aggggaaacg tctcccagat 2221 cgctcccttg gctttgaggc cgaagctgct gtgactgtgt ccccttactg agcgcaagcc 2281 acagcctgtc ttgtcaggtg gaccctgtaa atacatcctt tttctgctaa cccttcaacc 2341 ccctcgcctc ctactctgag acaaaagaaa aaatattaaa aaaatgcata ggcttaactc 2401 gctgatgagt taattgtttt atttttaaac tctttttggg tccagttgat tgtacgtagc 2461 cacaggagcc ctgctatgaa aggaataaaa cctacacaca aggttggagc tttgcaattc 2521 tttttggaaa agagctggga tcccacagcc ctagtatgaa agctgggggt ggggaggggc 2581 ctttgctgcc cttggtttct gggggctggt tggcatttgc tggcctggca gggggtgaag 2641 gcaggagttg ggggcaggtc aggaccagga cccagggaga ggctgtgtcc ctgctggggt 2701 ctcaggtcca gctttactgt ggctgtctgg atccttccca aggtacagct gtatataaac 2761 gtgtcccgag cttagattct gtatgcggtg acggcggggt gtggtggcct gtgaggggcc 2821 cctggcccag gaggaggatt gtgctgatgt agtgaccaag tgcaatatgg gcgggcagtc 2881 gctgcaggga gcaccacggc cagaagtaac ttattttgta ctagtgtccg cataagaaaa 2941 agaatcggca gtattttctg tttttatgtt ttatttggct tgttttattt tggattagtg 3001 aactaagtta ttgttaatta tgtacaacat ttatatattg tctgtaaaaa atgtatgcta 3061 tcctcttatt cctttaaagt gagtactgtt aagaataata aaatactttt tgtgaaaaaa 3121 aaaaaaaaaa aaaaa //