LOCUS       BC009797                1174 bp    mRNA    linear   HUM 01-SEP-2006
DEFINITION  Homo sapiens GATA binding protein 1 (globin transcription factor 1),
            mRNA (cDNA clone MGC:13628 IMAGE:4048082), complete cds.
ACCESSION   BC009797
VERSION     BC009797.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1174)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1174)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUL-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 19 Row: b Column: 1
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 32483408.
FEATURES             Location/Qualifiers
     source          1..1174
                     /db_xref="H-InvDB:HIT000034585"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:13628 IMAGE:4048082"
                     /tissue_type="Bone marrow, chronic myelogenous leukemia"
                     /clone_lib="NIH_MGC_54"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     gene            1..1174
                     /gene="GATA1"
                     /gene_synonym="ERYF1"
                     /gene_synonym="NFE1"
                     /db_xref="GeneID:2623"
                     /db_xref="HGNC:HGNC:4170"
                     /db_xref="MIM:305371"
     CDS             95..1102
                     /gene="GATA1"
                     /gene_synonym="ERYF1"
                     /gene_synonym="NFE1"
                     /codon_start=1
                     /product="GATA1 protein"
                     /protein_id="AAH09797.1"
                     /db_xref="GeneID:2623"
                     /db_xref="HGNC:HGNC:4170"
                     /db_xref="MIM:305371"
                     /translation="MEFPGLGSLGTSEPLPQFVDPALVSSTPESGVFFPSGPEGLDAA
                     ASSTAPSTATAAAAALAYYRDAEAYRHSPVFQVYPLLNCMEGIPGGSPYAGWAYGKTG
                     LYPASTVCPTREDSPPQAVEDLDGKGSTSFLETLKTERLSPDLLTLGPALPSSLPVPN
                     SAYGGPDFSSTFFSPTGSPLNSAAYSSPKLRGTLPLPPCEARECVNCGATATPLWRRD
                     RTGHYLCNACGLYHKMNGQNRPLIRPKKRLIVSKRAGTQCTNCQTTTTTLWRRNASGD
                     PVCNACGLYYKLHHQHYCGGSAQLMRAQSMASRGGVVSFSSCSQNSGQPKSLGPRHPL
                     A"
BASE COUNT      264 a    391 c    297 g    222 t
ORIGIN
        1 ggggacaccc cctgggatca cactgagctt gccacatccc caaggcggcc gaaccctccg
       61 caaccaccag cccaggttaa tccccagagg ctccatggag ttccctggcc tggggtccct
      121 ggggacctca gagcccctcc cccagtttgt ggatcctgct ctggtgtcct ccacaccaga
      181 atcaggggtt ttcttcccct ctgggcctga gggcttggat gcagcagctt cctccactgc
      241 cccgagcaca gccaccgctg cagctgcggc actggcctac tacagggacg ctgaggccta
      301 cagacactcc ccagtctttc aggtgtaccc attgctcaac tgtatggagg ggatcccagg
      361 gggctcacca tatgccggct gggcctacgg caagacgggg ctctaccctg cctcaactgt
      421 gtgtcccacc cgcgaggact ctcctcccca ggccgtggaa gatctggatg gaaaaggcag
      481 caccagcttc ctggagactt tgaagacaga gcggctgagc ccagacctcc tgaccctggg
      541 acctgcactg ccttcatcac tccctgtccc caatagtgct tatgggggcc ctgacttttc
      601 cagtaccttc ttttctccca ccgggagccc cctcaattca gcagcctatt cctctcccaa
      661 gcttcgtgga actctccccc tgcctccctg tgaggccagg gagtgtgtga actgcggagc
      721 aacagccact ccactgtggc ggagggacag gacaggccac tacctatgca acgcctgcgg
      781 cctctatcac aagatgaatg ggcagaacag gcccctcatc cggcccaaga agcgcctgat
      841 tgtcagtaaa cgggcaggta ctcagtgcac caactgccag acgaccacca cgacactgtg
      901 gcggagaaat gccagtgggg atcccgtgtg caatgcctgc ggcctctact acaagctaca
      961 ccaccagcac tactgtggtg gctccgctca gctcatgagg gcacagagca tggcctccag
     1021 aggaggggtg gtgtccttct cctcttgtag ccagaattct ggacaaccca agtctctggg
     1081 ccccaggcac cccctggctt gaaccttcaa agcttttgta aaataaaacc accaaagtcc
     1141 tgataaaaaa aaaaaaaaaa aaaaaaaaaa aaaa
//