LOCUS BC015613 3135 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens GATA binding protein 2, mRNA (cDNA clone MGC:23183
IMAGE:4811349), complete cds.
ACCESSION BC015613
VERSION BC015613.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3135)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3135)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (04-OCT-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC015613.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 33 Row: m Column: 11
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31982886.
FEATURES Location/Qualifiers
source 1..3135
/db_xref="H-InvDB:HIT000037215"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:23183 IMAGE:4811349"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_21"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..3135
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/db_xref="GeneID:2624"
/db_xref="HGNC:HGNC:4171"
/db_xref="MIM:137295"
CDS 82..1524
/gene="GATA2"
/gene_synonym="MGC2306"
/gene_synonym="NFE1B"
/codon_start=1
/product="GATA binding protein 2"
/protein_id="AAH15613.1"
/db_xref="GeneID:2624"
/db_xref="HGNC:HGNC:4171"
/db_xref="MIM:137295"
/translation="MEVAPEQPRWMAHPAVLNAQHPDSHHPGLAHNYMEPAQLLPPDE
VDVFFNHLDSQGNPYYANPAHARARVSYSPAHARLTGGQMCRPHLLHSPGLPWLDGGK
AALSAAAAHHHNPWTVSPFSKTPLHPSAAGGPGGPLSVYPGAGGGSGGGSGSSVASLT
PTAAHSGSHLFGFPPTPPKEVSPDPSTTGAASPASSSAGGSAARGEDKDGVKYQVSLT
ESMKMESGSPLRPGLATMGTQPATHHPIPTYPSYVPAAAHDYSSGLFHPGGFLGGPAS
SFTPKQRSKARSCSEGRECVNCGATATPLWRRDGTGHYLCNACGLYHKMNGQNRPLIK
PKRRLSAARRAGTCCANCQTTTTTLWRRNANGDPVCNACGLYYKLHNVNRPLTMKKEG
IQTRNRKMSNKSKKSKKGAECFEELSKCMQEKSSPFSAAALAGHMAPVGHLPPFSHSG
HILPTPTPIHPSSSLSFGHPHPSSMVTAMG"
BASE COUNT 677 a 950 c 877 g 631 t
ORIGIN
1 gtccgcccgg gggcgccgcc caccgcgcct cgctcgggcc gttgccgtct gcacccagac
61 cctgagccgc cgccgccggc catggaggtg gcgccggagc agccgcgctg gatggcgcac
121 ccggccgtgc tgaatgcgca gcaccccgac tcacaccacc cgggcctggc gcacaactac
181 atggaacccg cgcagctgct gcctccagac gaggtggacg tcttcttcaa tcacctcgac
241 tcgcagggca acccctacta tgccaacccc gctcacgcgc gggcgcgcgt ctcctacagc
301 cccgcgcacg cccgcctgac cggaggccag atgtgccgcc cacacttgtt gcacagcccg
361 ggtttgccct ggctggacgg gggcaaagca gccctctctg ccgctgcggc ccaccaccac
421 aacccctgga ccgtgagccc cttctccaag acgccactgc acccctcagc tgctggaggc
481 cctggaggcc cactctctgt gtacccaggg gctgggggtg ggagcggggg aggcagcggg
541 agctcagtgg cctccctcac ccctacagca gcccactctg gctcccacct tttcggcttc
601 ccacccacgc cacccaaaga agtgtctcct gaccctagca ccacgggggc tgcgtctcca
661 gcctcatctt ccgcgggggg tagtgcagcc cgaggagagg acaaggacgg cgtcaagtac
721 caggtgtcac tgacggagag catgaagatg gaaagtggca gtcccctgcg cccaggccta
781 gctactatgg gcacccagcc tgctacacac caccccatcc ccacctaccc ctcctatgtg
841 ccggcggctg cccacgacta cagcagcgga ctcttccacc ccggaggctt cctgggggga
901 ccggcctcca gcttcacccc taagcagcgc agcaaggctc gttcctgttc agaaggccgg
961 gagtgtgtca actgtggggc cacagccacc cctctctggc ggcgggacgg caccggccac
1021 tacctgtgca atgcctgtgg cctctaccac aagatgaatg ggcagaaccg accactcatc
1081 aagcccaagc gaagactgtc ggccgccaga agagccggca cctgttgtgc aaattgtcag
1141 acgacaacca ccaccttatg gcgccgaaac gccaacgggg accctgtctg caacgcctgt
1201 ggcctctact acaagctgca caatgttaac aggccactga ccatgaagaa ggaagggatc
1261 cagactcgga accggaagat gtccaacaag tccaagaaga gcaagaaagg ggcggagtgc
1321 ttcgaggagc tgtcaaagtg catgcaggag aagtcatccc ccttcagtgc agctgccctg
1381 gctggacaca tggcacctgt gggccacctc ccgcccttca gccactccgg acacatcctg
1441 cccactccga cgcccatcca cccctcctcc agcctctcct tcggccaccc ccacccgtcc
1501 agcatggtga ccgccatggg ctagggaaca gatggacgtc gaggaccggg cactcccggg
1561 atgggtggac caaaccctta gcagcccagc atttcccgaa ggccgacacc actcctgcca
1621 gcccggctcg gcccagcacc ccctctcctg gagggcgccc agcagcctgc cagcagttac
1681 tgtgaatgtt ccccaccgct gagaggctgc ctccgcacct gactgctgcc caggtggggt
1741 ttcctgcatg gacagttgtt tggagaacaa caaggacaac tttatgtaga gaaaaggagg
1801 ggacgggaca gacgaaggca accattttta gaaggaaaaa ggattaggca aaaataattt
1861 attttgctct tgtttctaac aaggacttgg agacttggtg gtctgagctg tcccaagtcc
1921 tccggttctt cctcgggatt ggcgggtcca cttgccaggg ctctgggggc agatttgtgg
1981 ggacctcagc ctgcaccctc ttctcttctg gcttccctct ctgaaatagc cgaactccag
2041 gctgggctga gccaaagcca gagtggccac ggcccaggga gggtgagctg gtgcctgctt
2101 tgacgggcca ggccctggag ggcagagaca atcacgggcg gtcctgcaca gattcccagg
2161 ccagggctgg gtcacaggaa ggaaacaaca ttttcttgaa aggggaaacg tctcccagat
2221 cgctcccttg gctttgaggc cgaagctgct gtgactgtgt ccccttactg agcgcaagcc
2281 acagcctgtc ttgtcaggtg gaccctgtaa atacatcctt tttctgctaa cccttcaacc
2341 ccctcgcctc ctactctgag acaaaagaaa aaatattaaa aaaatgcata ggcttaactc
2401 gctgatgagt taattgtttt atttttaaac tctttttggg tccagttgat tgtacgtagc
2461 cacaggagcc ctgctatgaa aggaataaaa cctacacaca aggttggagc tttgcaattc
2521 tttttggaaa agagctggga tcccacagcc ctagtatgaa agctgggggt ggggaggggc
2581 ctttgctgcc cttggtttct gggggctggt tggcatttgc tggcctggca gggggtgaag
2641 gcaggagttg ggggcaggtc aggaccagga cccagggaga ggctgtgtcc ctgctggggt
2701 ctcaggtcca gctttactgt ggctgtctgg atccttccca aggtacagct gtatataaac
2761 gtgtcccgag cttagattct gtatgcggtg acggcggggt gtggtggcct gtgaggggcc
2821 cctggcccag gaggaggatt gtgctgatgt agtgaccaag tgcaatatgg gcgggcagtc
2881 gctgcaggga gcaccacggc cagaagtaac ttattttgta ctagtgtccg cataagaaaa
2941 agaatcggca gtattttctg tttttatgtt ttatttggct tgttttattt tggattagtg
3001 aactaagtta ttgttaatta tgtacaacat ttatatattg tctgtaaaaa atgtatgcta
3061 tcctcttatt cctttaaagt gagtactgtt aagaataata aaatactttt tgtgaaaaaa
3121 aaaaaaaaaa aaaaa
//