LOCUS BC018988 2549 bp mRNA linear HUM 04-OCT-2003
DEFINITION Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:20306
IMAGE:4128414), complete cds.
ACCESSION BC018988
VERSION BC018988.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2549)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2549)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (07-DEC-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC018988.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 28 Row: j Column: 7.
FEATURES Location/Qualifiers
source 1..2549
/db_xref="H-InvDB:HIT000038495"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:20306 IMAGE:4128414"
/tissue_type="Brain, neuroblastoma"
/clone_lib="NIH_MGC_19"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2549
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/db_xref="GeneID:2624"
/db_xref="MIM:137295"
CDS 252..1652
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/codon_start=1
/product="GATA2 protein"
/protein_id="AAH18988.1"
/db_xref="GeneID:2624"
/db_xref="MIM:137295"
/translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE
VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK
AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT
PTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT
ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS
SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK
PKRRLTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEGIQTRNRKMSNKSKK
SKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSGHILPTPTPIHPSSS
LSFGHPHPSSMVTAMG"
misc_feature 1122..1241
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/note="GATA; Region: GATA zinc finger. This domain uses
four cysteine residues to coordinate a zinc ion. This
domain binds to DNA. Two GATA zinc fingers are found in
the GATA transcription factors. However there are several
proteins which only contains a single copy of the domain"
/db_xref="CDD:pfam00320"
BASE COUNT 531 a 881 c 711 g 426 t
ORIGIN
1 ggcagctggc gccagggcgg ccggaggatg ccgaggggcc ggagccgggc gggcccgagg
61 ccgaggcgca ctctaccccc agctcctacc ctgtaagccc cgccagcctc cggacgtgct
121 gtccctgggc ccgtcgccct ctgggctccc gccggaactc cttcactctc agaggccgag
181 tccctcccct ccccacggct gcgtgtggcc gttgccgtct gcacccagac cctgagccgc
241 cgccgccggc catggaggtg gcgccggagc agccgcgctg gatggcgcac ccggccgtgc
301 tgaatgcgca gcaccccgac tcacaccacc cgggcctggc gcacaactac atggaacccg
361 cgcagctgct gcctccagac gaggtggacg tcttcttcaa tcacctcgac tcgcagggca
421 acccctacta tgccaacccc gctcacgcgc gggcgcgcgt ctcctacagc cccgcgcacg
481 cccgcctgac cggaggccag atgtgccgcc cacacttgtt gcacagcccg ggtttgccct
541 ggctggacgg gggcaaagca gccctctctg ccgctgcggc ccaccaccac aacccctgga
601 ccgtgagccc cttctccaag acgccactgc acccctcagc tgctggaggc cctggaggcc
661 cactctctgt gtacccaggg gctgggggtg ggagcggggg aggcagcggg agctcagtgg
721 cctccctcac ccctacagca gcccactctg gctcccacct tttcggcttc ccacccacgc
781 cacccaaaga agtgtctcct gaccctagca ccacgggggc tgcgtctcca gcctcatctt
841 ccgcgggggg tagtgcagcc cgaggagagg acaaggacgg cgtcaagtac caggtgtcac
901 tgacggagag catgaagatg gaaagtggca gtcccctgcg cccaggccta gctactatgg
961 gcacccagcc tgctacacac caccccatcc ccacctaccc ctcctatgtg ccggcggctg
1021 cccacgacta cagcagcgga ctcttccacc ccggaggctt cctgggggga ccggcctcca
1081 gcttcacccc taagcagcgc agcaaggctc gttcctgttc agaaggccgg gagtgtgtca
1141 actgtggggc cacagccacc cctctctggc ggcgggacgg caccggccac tacctgtgca
1201 atgcctgtgg cctctaccac aagatgaatg ggcagaaccg accactcatc aagcccaagc
1261 gaagactgac gacaaccacc accttatggc gccgaaacgc caacggggac cctgtctgca
1321 acgcctgtgg cctctactac aagctgcaca atgttaacag gccactgacc atgaagaagg
1381 aagggatcca gactcggaac cggaagatgt ccaacaagtc caagaagagc aagaaagggg
1441 cggagtgctt cgaggagctg tcaaagtgca tgcaggagaa gtcatccccc ttcagtgcag
1501 ctgccctggc tggacacatg gcacctgtgg gccacctccc gcccttcagc cactccggac
1561 acatcctgcc cactccgacg cccatccacc cctcctccag cctctccttc ggccaccccc
1621 acccgtccag catggtgacc gccatgggct agggaacaga tggacgtcga ggaccgggca
1681 ctcccgggat gggtggacca aacccttagc agcccagcat ttcccgaagg ccgacaccac
1741 tcctgccagc ccggctcggc ccagcacccc ctctcctgga gggcgcccag cagcctgcca
1801 gcagttactg tgaatgttcc ccaccgctga gaggctgcct ccgcacctga ccgctgccca
1861 ggtggggttt cctgcatgga cagttgtttg gagaacaaca aggacaactt tatgtagaga
1921 aaaggagggg acgggacaga cgaaggcaac catttttaga aggaaaaagg attaggcaaa
1981 aataatttat tttgctcttg tttctaacaa ggacttggag acttggtggt ctgagctgtc
2041 ccaagtcctc cggttcttcc tcgggattgg cgggtccact tgccagggct ctgggggcag
2101 atttgtgggg acctcagcct gcaccctctt ctcctctggc ttccctctct gaaatagccg
2161 aactccaggc tgggctgagc caaagccaga gtggccacgg cccagggagg gtgagctggt
2221 gcctgctttg acgggccagg ccctggaggg cagagacaat cacgggcggt cctgcacaga
2281 ttcccaggcc agggctgggt cacaggaagg aaacaacatt ttcttgaaag gggaaacgtc
2341 tcccagatcg ctcccttggc tttgaggccg aagctgctgt gactgtgtcc ccttactgag
2401 cgcaagccac agcctgtctt gtcaggtgga ccctgtaaat acatcctttt tctgctaacc
2461 cttcaacccc ctcgcctcct actctgagac aaaagaaaaa atattaaaaa aatgcatagg
2521 cttaaaaaaa aaaaaaaaaa aaaaaaaaa
//