LOCUS BC051342 3383 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:59669
IMAGE:6341380), complete cds.
ACCESSION BC051342
VERSION BC051342.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3383)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3383)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (25-APR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 47 Row: k Column: 20
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31982886.
FEATURES Location/Qualifiers
source 1..3383
/db_xref="H-InvDB:HIT000053633"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:59669 IMAGE:6341380"
/tissue_type="Brain, neuroblastoma"
/clone_lib="NIH_MGC_47"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3383
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/db_xref="GeneID:2624"
/db_xref="HGNC:HGNC:4171"
/db_xref="MIM:137295"
CDS 333..1775
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/codon_start=1
/product="GATA binding protein 2"
/protein_id="AAH51342.1"
/db_xref="GeneID:2624"
/db_xref="HGNC:HGNC:4171"
/db_xref="MIM:137295"
/translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE
VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK
AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT
PTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT
ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS
SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK
PKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEG
IQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSG
HILPTPTPIHPSSSLSFGHPHPSSMVTAMG"
BASE COUNT 710 a 1059 c 950 g 664 t
ORIGIN
1 gagcgccagg aaggtagcga ggccagcgtc gccccgggac tcgctgctca agtctgtcta
61 ttgcctgccg ccacatccat cctagcaggg ccccgtcgcc caccaggcgg acaaaagcgg
121 tccgctgaac accatgcggc cgctcggcgt gccgcccagg ctctgctggt gagcgccgcc
181 accccgcgcc caggtcccgc gagcccgcct gccgcgcacc tcgccctgct cccagctcta
241 ctccaggccc cgtccgcccg ggggcgccgc ccaccgcgcc tcgctcgggc cgttgccgtc
301 tgcacccaga ccctgagccg ccgccgccgg ccatggaggt ggcgccggag cagccgcgct
361 ggatggcgca cccggccgtg ctgaatgcgc agcaccccga ctcacaccac ccgggcctgg
421 cgcacaacta catggaaccc gcgcagctgc tgcctccaga cgaggtggac gtcttcttca
481 atcacctcga ctcgcagggc aacccctact atgccaaccc cgctcacgcg cgggcgcgcg
541 tctcctacag ccccgcgcac gcccgcctga ccggaggcca gatgtgccgc ccacacttgt
601 tgcacagccc gggtttgccc tggctggacg ggggcaaagc agccctctct gccgctgcgg
661 cccaccacca caacccctgg accgtgagcc ccttctccaa gacgccactg cacccctcag
721 ctgctggagg ccctggaggc ccactctctg tgtacccagg ggctgggggt gggagcgggg
781 gaggcagcgg gagctcagtg gcctccctca cccctacagc agcccactct ggctcccacc
841 ttttcggctt cccacccacg ccacccaaag aagtgtctcc tgaccctagc accacggggg
901 ctgcgtctcc agcctcatct tccgcggggg gtagtgcagc ccgaggagag gacaaggacg
961 gcgtcaagta ccaggtgtca ctgacggaga gcatgaagat ggaaagtggc agtcccctgc
1021 gcccaggcct agctactatg ggcacccagc ctgctacaca ccaccccatc cccacctacc
1081 cctcctatgt gccggcggct gcccacgact acagcagcgg actcttccac cccggaggct
1141 tcctgggggg accggcctcc agcttcaccc ctaagcagcg cagcaaggct cgttcctgtt
1201 cagaaggccg ggagtgtgtc aactgtgggg ccacagccac ccctctctgg cggcgggacg
1261 gcaccggcca ctacctgtgc aatgcctgtg gcctctacca caagatgaat gggcagaacc
1321 gaccactcat caagcccaag cgaagactgt cggccgccag aagagccggc acctgttgtg
1381 caaattgtca gacgacaacc accaccttat ggcgccgaaa cgccaacggg gaccctgtct
1441 gcaacgcctg tggcctctac tacaagctgc acaatgttaa caggccactg accatgaaga
1501 aggaagggat ccagactcgg aaccggaaga tgtccaacaa gtccaagaag agcaagaaag
1561 gggcggagtg cttcgaggag ctgtcaaagt gcatgcagga gaagtcatcc cccttcagtg
1621 cagctgccct ggctggacac atggcacctg tgggccacct cccgcccttc agccactccg
1681 gacacatcct gcccactccg acgcccatcc acccctcctc cagcctctcc ttcggccacc
1741 cccacccgtc cagcatggtg accgccatgg gctagggaac agatggacgt cgaggaccgg
1801 gcactcccgg gatgggtgga ccaaaccctt agcagcccag catttcccga aggccgacac
1861 cactcctgcc agcccggctc ggcccagcac cccctctcct ggagggcgcc cagcagcctg
1921 ccagcagtta ctgtgaatgt tccccaccgc tgagaggctg cctccgcacc tgactgctgc
1981 ccaggtgggg tttcctgcat ggacagttgt ttggagaaca acaaggacaa ctttatgtag
2041 agaaaaggag gggacgggac agacgaaggc aaccattttt agaaggaaaa aggattaggc
2101 aaaaataatt tattttgctc ttgtttctaa caaggacttg gagacttggt ggtctgagct
2161 gtcccaagtc ctccggttct tcctcgggat tggcgggtcc acttgccagg gctctggggg
2221 cagatttgtg gggacctcag cctgcaccct cttctcttct ggcttccctc tctgaaatag
2281 ccgaactcca ggctgggctg agccaaagcc agagtggcca cggcccaggg agggtgagct
2341 ggtgcctgct ttgacgggcc aggccctgga gggcagagac aatcacgggc ggtcctgcac
2401 agattcccag gccagggctg ggtcacagga aggaaacaac attttcttga aaggggaaac
2461 gtctcccaga tcgctccctt ggctttgagg ccgaagctgc tgtgactgtg tccccttact
2521 gagcgcaagc cacagcctgt cttgtcaggt ggaccctgta aatacatcct ttttctgcta
2581 acccttcaac cccctcgcct cctactctga gacaaaagaa aaaatattaa aaaaatgcat
2641 aggcttaact cgctgatgag ttaattgttt tatttttaaa ctctttttgg gtccagttga
2701 ttgtacgtag ccacaggagc cctgctatga aaggaataaa acctacacac aaggttggag
2761 ctttgcaatt ctttttggaa aagagctggg atcccacagc cctagtatga aagctggggg
2821 tggggagggg cctttgctgc ccttggtttc tgggggctgg ttggcatttg ctggcctggc
2881 agggggtgaa ggcaggagtt gggggcaggt caggaccagg acccagggag aggctgtgtc
2941 cctgctgggg tctcaggtcc agctttactg tggctgtctg gatccttccc aaggtacagc
3001 tgtatataaa cgtgtcccga gcttagattc tgtatgcggt gacggcgggg tgtggtggcc
3061 tgtgaggggc ccctggccca ggaggaggat tgtgctgatg tagtgaccaa gtgcaatatg
3121 ggcgggcagt cgctgcaggg agcaccacgg ccagaagtaa cttattttgt actagtgtcc
3181 gcataagaaa aagaatcggc agtattttct gtttttatgt tttatttggc ttgttttatt
3241 ttggattagt gaactaagtt attgttaatt atgtacaaca tttatatatt gtctgtaaaa
3301 aatgtatgct atcctcttat tcctttaaag tgagtactgt taagaataat aaaatacttt
3361 ttgtgaaaaa aaaaaaaaaa aaa
//