LOCUS BC047790 2658 bp mRNA linear HUM 22-APR-2003
DEFINITION Homo sapiens GATA binding protein 5, mRNA (cDNA clone
IMAGE:6464800), partial cds.
ACCESSION BC047790
VERSION BC047790.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2658)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2658)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (03-MAR-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 98 Row: o Column: 16
This clone was selected for full length sequencing because it
passed the following selection criteria: Similarity but not
identity to protein.
FEATURES Location/Qualifiers
source 1..2658
/db_xref="H-InvDB:HIT000098800"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:6464800"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene <1..2658
/gene="GATA5"
/gene_synonym="bB379O24.1"
/db_xref="GeneID:140628"
CDS <1..1256
/gene="GATA5"
/gene_synonym="bB379O24.1"
/codon_start=3
/product="GATA5 protein"
/protein_id="AAH47790.1"
/db_xref="GeneID:140628"
/translation="TATAVPCRPPCPLVKTTPGRMYQSLALAASPRQAAYADSGSFLH
APGAGSPMFVPPARVPSMLSYLSGCEPSPQPPELAARPGWAQTATADSSAFGPGSPHP
PAAHPPGATTFPFAHSPSGPGSGGSAGGRDGSAYQGALLPREQFAAPLGRPVGTSYSA
TYPAYVSPDVAQSWTAGPFDGSVLHGLPGRRPTFVSDFLEEFPGEGRECVNCGALSTP
LWRRDGTGHYLCNACGLYHKMNGVNRPLVRPQKRLSSSRRAGLCCTNCHTTNTTLWRR
NSEGEPVCNACGLYMKLHGVPRPLAMKKESIQTRKRKPKTIAKARGSSGSTRNASASP
SAVASTDSSAATSKAKPSLASPVCPGPSMAPQASGQEDDSLAPGHLEFKFEPEDFAFP
STAPSPQAGLRGALRQEAWCALALA"
BASE COUNT 532 a 885 c 782 g 459 t
ORIGIN
1 ccaccgccac cgccgtgccc tgccgccctc cctgcccgct ggtcaagacc acgcctggga
61 ggatgtacca gagcctggcg ctggccgcga gcccccgcca ggccgcctac gccgactcgg
121 gctccttcct gcacgctccg ggcgccggct ctccgatgtt tgtgccgccg gcgcgcgtcc
181 cctcgatgct gtcctacctg tccgggtgtg agccgagccc gcagcccccc gagctcgctg
241 cgcgccccgg ctgggcgcag acagccaccg cggattcgtc ggccttcggc ccgggcagtc
301 cgcacccccc agccgcgcac ccgcccgggg ccaccacctt ccctttcgcg cacagcccct
361 cggggcccgg cagcggcggc agcgcggggg gccgagacgg cagtgcctac cagggcgcgc
421 tgttgcctcg agaacagttc gcggccccgc ttgggcggcc ggtggggacc tcgtactccg
481 ccacctaccc ggcctacgtg agccccgacg tggcccagtc ctggactgcc gggcccttcg
541 atggcagcgt cctgcacggc ctcccaggcc gcaggcccac cttcgtgtcc gacttcttgg
601 aggagttccc gggtgagggt cgtgagtgtg tcaactgcgg ggccctgtcc acaccgctgt
661 ggcgccgaga tggcaccggc cactacctgt gcaatgcctg cggcctctac cacaagatga
721 atggcgtcaa ccggccgctc gttcggcctc agaagcgcct gtcctcgtcc cgccgcgccg
781 gcctctgctg caccaactgc cacacgacca acaccacgct gtggcggcgg aactcggagg
841 gggagcccgt gtgcaatgcc tgcggcctct acatgaagct gcacggggtg ccgcggcctc
901 tggctatgaa gaaagaaagc atccagacac ggaagcggaa gccaaagacc atcgccaagg
961 ccaggggctc ctcaggatcc acaaggaatg cctcggcctc cccatctgct gtcgccagca
1021 ctgacagctc agcagccact tccaaagcca agcccagcct ggcgtcccca gtgtgccctg
1081 ggcccagcat ggccccccag gcctctggcc aggaggatga ctctcttgcc cccggccact
1141 tggagttcaa gttcgagcct gaggactttg ccttcccctc cacggccccg agcccccagg
1201 ctggcctcag gggggctctg cgccaagagg cctggtgtgc gctggccttg gcctaggtcc
1261 ccaggccagc ccatgtcagg ggaacagcct ggaacagacc acccactgag tcacctccgt
1321 gcctgctttg ctccagcaca gcagagacca gcaggccccc caacccagag actgggtctg
1381 ctggagtctc cacacagtgg tggggaggcc ttctggacag acggcagtcg ggccccagag
1441 caagaaggct ggtgagggaa gggctcagct tcccacccca cgtacagcaa gggactcccc
1501 aggtgcggcc caaggctccg gaccacactg gccccctgcg gcggaggcca acgcagggca
1561 ccaccaccac caacttgaat tccgtcatca atgctcaccg tcaatatgtt tacaagttgt
1621 agcagttggg ggaaaacagt caacctccca gtgtaaaacc aagattccca gtgaagcacc
1681 tgaggccaag caggggagag gaatgagggg agcagctgga catgggcctc ctgaggcctc
1741 ggggctgtcc ttcattgccc acatggatag acggagctgt ggtgcagaga acttttcccg
1801 caacaggtgc aggactgcca gggatcggag tgcgggccgc gcacggtgcc aggattccgc
1861 cgaggggaag ccgctcacat tgcagtcatc acagacttac gcacttgttt ggacagtttt
1921 tccagagggg atgggaaagg gccttgttct agctgaatct gtgtatcatg accatttctg
1981 acaggcagaa tgaattgtct ggtagccctg tcctgaccca tccaagcgct gttggggctg
2041 gtggtgacgt ggtcacatgt cctggcatat ctggggccac gcagtttagt ctcttgtccc
2101 aggagaattg ttagtgaccc ctctttctct tgcaagcccc ctccacactg ggttggatga
2161 taccttaatg agtgacgctg gcgagaggca ccctacccga cgcagctgtg aatggccggt
2221 gatgtatgtc aggaggccac agggagcgga ggagcggggc aggcagccac agggagcgga
2281 ggagcggggc aggcagccac agggccctgc ggggagcaca tcctcgcctc cgtccggctg
2341 ctgcccttca acaacaagcc ctgatttttc cagcaatgcc agaaacctgg attttaagtc
2401 ttccaatttg attcaaaaat atttttaaca ttgtgagcca gctagacccc cagtgcacca
2461 ccccatattg aaaaacagtt gtctggcatc agcttcagga gcggtccggt cattctgaaa
2521 ctgtccctcc agaggttctt ccagccccac ttctatgcga tgtcatcttt tctaaaagag
2581 acaaatgaag ccacagggaa agtgaaataa agccttgaac ctcaaaaaaa aaaaaaaaaa
2641 aaaaaaaaaa aaaaaaaa
//