LOCUS BC009797 1174 bp mRNA linear HUM 01-SEP-2006
DEFINITION Homo sapiens GATA binding protein 1 (globin transcription factor
1), mRNA (cDNA clone MGC:13628 IMAGE:4048082), complete cds.
ACCESSION BC009797
VERSION BC009797.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1174)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1174)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (02-JUL-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: CLONTECH Laboratories, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 19 Row: b Column: 1
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 32483408.
FEATURES Location/Qualifiers
source 1..1174
/db_xref="H-InvDB:HIT000034585"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:13628 IMAGE:4048082"
/tissue_type="Bone marrow, chronic myelogenous leukemia"
/clone_lib="NIH_MGC_54"
/lab_host="DH10B"
/note="Vector: pDNR-LIB"
gene 1..1174
/gene="GATA1"
/gene_synonym="ERYF1"
/gene_synonym="NFE1"
/db_xref="GeneID:2623"
/db_xref="HGNC:HGNC:4170"
/db_xref="MIM:305371"
CDS 95..1102
/gene="GATA1"
/gene_synonym="ERYF1"
/gene_synonym="NFE1"
/codon_start=1
/product="GATA1 protein"
/protein_id="AAH09797.1"
/db_xref="GeneID:2623"
/db_xref="HGNC:HGNC:4170"
/db_xref="MIM:305371"
/translation="MEFPGLGSLGTSEPLPQFVDPALVSSTPESGVFFPSGPEGLDAA
ASSTAPSTATAAAAALAYYRDAEAYRHSPVFQVYPLLNCMEGIPGGSPYAGWAYGKTG
LYPASTVCPTREDSPPQAVEDLDGKGSTSFLETLKTERLSPDLLTLGPALPSSLPVPN
SAYGGPDFSSTFFSPTGSPLNSAAYSSPKLRGTLPLPPCEARECVNCGATATPLWRRD
RTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCTNCQTTTTTLWRRNASGD
PVCNACGLYYKLHHQHYCGGSAQLMRAQSMASRGGVVSFSSCSQNSGQPKSLGPRHPL
A"
BASE COUNT 264 a 391 c 297 g 222 t
ORIGIN
1 ggggacaccc cctgggatca cactgagctt gccacatccc caaggcggcc gaaccctccg
61 caaccaccag cccaggttaa tccccagagg ctccatggag ttccctggcc tggggtccct
121 ggggacctca gagcccctcc cccagtttgt ggatcctgct ctggtgtcct ccacaccaga
181 atcaggggtt ttcttcccct ctgggcctga gggcttggat gcagcagctt cctccactgc
241 cccgagcaca gccaccgctg cagctgcggc actggcctac tacagggacg ctgaggccta
301 cagacactcc ccagtctttc aggtgtaccc attgctcaac tgtatggagg ggatcccagg
361 gggctcacca tatgccggct gggcctacgg caagacgggg ctctaccctg cctcaactgt
421 gtgtcccacc cgcgaggact ctcctcccca ggccgtggaa gatctggatg gaaaaggcag
481 caccagcttc ctggagactt tgaagacaga gcggctgagc ccagacctcc tgaccctggg
541 acctgcactg ccttcatcac tccctgtccc caatagtgct tatgggggcc ctgacttttc
601 cagtaccttc ttttctccca ccgggagccc cctcaattca gcagcctatt cctctcccaa
661 gcttcgtgga actctccccc tgcctccctg tgaggccagg gagtgtgtga actgcggagc
721 aacagccact ccactgtggc ggagggacag gacaggccac tacctatgca acgcctgcgg
781 cctctatcac aagatgaatg ggcagaacag gcccctcatc cggcccaaga agcgcctgat
841 tgtcagtaaa cgggcaggta ctcagtgcac caactgccag acgaccacca cgacactgtg
901 gcggagaaat gccagtgggg atcccgtgtg caatgcctgc ggcctctact acaagctaca
961 ccaccagcac tactgtggtg gctccgctca gctcatgagg gcacagagca tggcctccag
1021 aggaggggtg gtgtccttct cctcttgtag ccagaattct ggacaaccca agtctctggg
1081 ccccaggcac cccctggctt gaaccttcaa agcttttgta aaataaaacc accaaagtcc
1141 tgataaaaaa aaaaaaaaaa aaaaaaaaaa aaaa
//