LOCUS BC002557 2664 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:2306
IMAGE:3139044), complete cds.
ACCESSION BC002557
VERSION BC002557.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2664)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2664)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC002557.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 4 Row: l Column: 11
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31982886.
FEATURES Location/Qualifiers
source 1..2664
/db_xref="H-InvDB:HIT000030989"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:2306 IMAGE:3139044"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_21"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2664
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/db_xref="GeneID:2624"
/db_xref="HGNC:HGNC:4171"
/db_xref="MIM:137295"
CDS 335..1777
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/codon_start=1
/product="GATA binding protein 2"
/protein_id="AAH02557.1"
/db_xref="GeneID:2624"
/db_xref="HGNC:HGNC:4171"
/db_xref="MIM:137295"
/translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE
VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK
AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT
PTATHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT
ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS
SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK
PKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEG
IQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSG
HILPTPTPIHPSSSLSFGHPHPSSMVTAMG"
BASE COUNT 558 a 932 c 738 g 436 t
ORIGIN
1 gtgagcgcca ggaaggtagc gaggccagcg tcgccccggg actcgctgct caagtctgtc
61 tattgcctgc cgccacatcc atcctagcag ggccccgtcg cccaccaggc ggacaaaagc
121 ggtccgctga acaccatgcg gccgctcggc gtgccgccca ggctctgctg gtgagcgccg
181 ccaccccgcg cccaggtccc gcgagcccgc ctgccgcgca cctcgccctg ctcccagctc
241 tactccaggc cccgtccgcc cgggggcgcc gcccaccgcg cctcgctcgg gccgttgccg
301 tctgcaccca gaccctgagc cgccgccgcc ggccatggag gtggcgccgg agcagccgcg
361 ctggatggcg cacccggccg tgctgaatgc gcagcacccc gactcacacc acccgggcct
421 ggcgcacaac tacatggaac ccgcgcagct gctgcctcca gacgaggtgg acgtcttctt
481 caatcacctc gactcgcagg gcaaccccta ctatgccaac cccgctcacg cgcgggcgcg
541 cgtctcctac agccccgcgc acgcccgcct gaccggaggc cagatgtgcc gcccacactt
601 gttgcacagc ccgggtttgc cctggctgga cgggggcaaa gcagccctct ctgccgctgc
661 ggcccaccac cacaacccct ggaccgtgag ccccttctcc aagacgccac tgcacccctc
721 agctgctgga ggccctggag gcccactctc tgtgtaccca ggggctgggg gtgggagcgg
781 gggaggcagc gggagctcag tggcctccct cacccctaca gcaacccact ctggctccca
841 ccttttcggc ttcccaccca cgccacccaa agaagtgtct cctgacccta gcaccacggg
901 ggctgcgtct ccagcctcat cttccgcggg gggtagtgca gcccgaggag aggacaagga
961 cggcgtcaag taccaggtgt cactgacgga gagcatgaag atggaaagtg gcagtcccct
1021 gcgcccaggc ctagctacta tgggcaccca gcctgctaca caccacccca tccccaccta
1081 cccctcctat gtgccggcgg ctgcccacga ctacagcagc ggactcttcc accccggagg
1141 cttcctgggg ggaccggcct ccagcttcac ccctaagcag cgcagcaagg ctcgttcctg
1201 ttcagaaggc cgggagtgtg tcaactgtgg ggccacagcc acccctctct ggcggcggga
1261 cggcaccggc cactacctgt gcaatgcctg tggcctctac cacaagatga atgggcagaa
1321 ccgaccactc atcaagccca agcgaagact gtcggccgcc agaagagccg gcacctgttg
1381 tgcaaattgt cagacgacaa ccaccacctt atggcgccga aacgccaacg gggaccctgt
1441 ctgcaacgcc tgtggcctct actacaagct gcacaatgtt aacaggccac tgaccatgaa
1501 gaaggaaggg atccagactc ggaaccggaa gatgtccaac aagtccaaga agagcaagaa
1561 aggggcggag tgcttcgagg agctgtcaaa gtgcatgcag gagaagtcat cccccttcag
1621 tgcagctgcc ctggctggac acatggcacc tgtgggccac ctcccgccct tcagccactc
1681 cggacacatc ctgcccactc cgacgcccat ccacccctcc tccagcctct ccttcggcca
1741 cccccacccg tccagcatgg tgaccgccat gggctaggga acagatggac gtcgaggacc
1801 gggcactccc gggatgggtg gaccaaaccc ttagcagccc agcatttccc gaaggccgac
1861 accactcctg ccagcccggc tcggcccagc accccctctc ctggagggcg cccagcagcc
1921 tgccagcagt tactgtgaat gttccccacc gctgagaggc tgcctccgca cctgaccgct
1981 gcccaggtgg ggtttcctgc atggacagtt gtttggagaa caacaaggac aactttatgt
2041 agagaaaagg aggggacggg acagacgaag gcaaccattt ttagaaggaa aaaggattag
2101 gcaaaaataa tttattttgc tcttgtttct aacaaggact tggagacttg gtggtctgag
2161 ctgtcccaag tcctccggtt cttcctcggg attggcgggt ccacttgcca gggctctggg
2221 ggcagatttg tggggacctc agcctgcacc ctcttctcct ctggcttccc tctctgaaat
2281 agccgaactc caggctgggc tgagccaaag ccagagtggc cacggcccag ggagggtgag
2341 ctggtgcctg ctttgacggg ccaggccctg gagggcagag acaatcacgg gcggtcctgc
2401 acagattccc aggccagggc tgggtcacag gaaggaaaca acattttctt gaaaggggaa
2461 acgtctccca gatcgctccc ttggctttga ggccgaagct gctgtgactg tgtcccctta
2521 ctgagcgcaa gccacagcct gtcttgtcag gtggaccctg taaatacatc ctttttctgc
2581 taacccttca accccctcgc ctcctactct gagacaaaaa aaaaaaaaaa aaaaaaaaaa
2641 aaaaaaaaaa aaaaaaaaaa aaaa
//