LOCUS BC018136 3203 bp mRNA linear HUM 24-JUL-2006
DEFINITION Homo sapiens GTF2I repeat domain containing 1, mRNA (cDNA clone
MGC:9316 IMAGE:3913745), complete cds.
ACCESSION BC018136
VERSION BC018136.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3203)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3203)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (03-DEC-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 22 Row: l Column: 7
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 15011923.
FEATURES Location/Qualifiers
source 1..3203
/db_xref="H-InvDB:HIT000038289"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:9316 IMAGE:3913745"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..3203
/gene="GTF2IRD1"
/gene_synonym="CREAM1"
/gene_synonym="GTF3"
/gene_synonym="hMusTRD1alpha1"
/gene_synonym="MUSTRD1"
/gene_synonym="RBAP2"
/gene_synonym="WBSCR12"
/db_xref="GeneID:9569"
/db_xref="HGNC:HGNC:4661"
/db_xref="MIM:604318"
CDS 159..3038
/gene="GTF2IRD1"
/gene_synonym="CREAM1"
/gene_synonym="GTF3"
/gene_synonym="hMusTRD1alpha1"
/gene_synonym="MUSTRD1"
/gene_synonym="RBAP2"
/gene_synonym="WBSCR12"
/codon_start=1
/product="GTF2I repeat domain containing 1"
/protein_id="AAH18136.1"
/db_xref="GeneID:9569"
/db_xref="HGNC:HGNC:4661"
/db_xref="MIM:604318"
/translation="MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSA
LSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCRGPPWKDPEAEHPK
KVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLRE
PGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELN
GVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIRELKQEAPS
CPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDK
SEKWDAFIKETEDINTLRECVQILFNSRYAEALGLDHMVPVPYRKIACDPEAVEIVGI
PDKIPFKRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPP
EDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGGLRPIKIEPEDLDIIQVTV
PDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLR
KQVELLFNTRYAKAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLR
KILEASNSIQFVIKRPELLTEGVKEPIVDSQGTASSLGFSPPALPPERDSGDPLVDES
LKRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGS
VIIEGLPPGIPFRKPCTFGSQNLERILAVADKIKFTVTRPFQGLIPKPDEDDANRLGE
KVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYD
IHRLEKILKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSS
SSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQLPGPLNY"
BASE COUNT 720 a 975 c 910 g 598 t
ORIGIN
1 cccgccctcg ccgcgccgcc gtcctcgcct ccctctgcct ctccttcccc cattctcccg
61 gattaattaa ggaggcagcg gcaggaggct gagtcctggc cgcgggccgg ggccggggcg
121 ccgctggcag gagcgcttgg ggatcctcca aggcgaccat ggccttgctg ggtaagcgct
181 gtgacgtccc caccaacggc tgcggacccg accgctggaa ctccgcgttc acccgcaaag
241 acgagatcat caccagcctc gtgtctgcct tagactccat gtgctcagcg ctgtccaaac
301 tgaacgccga ggtggcctgt gtcgccgtgc acgatgagag cgcctttgtg gtgggcacag
361 agaaggggag aatgttcctg aatgcccgga aggagctaca gtcagacttc ctcaggttct
421 gccgagggcc cccgtggaag gatccggagg cagagcaccc caagaaggtg cagcggggcg
481 agggtggagg ccgtagcctc cctcggtcct ccctggaaca tggctcagat gtgtaccttc
541 tgcggaagat ggtagaggag gtgtttgatg ttctttatag cgaggccctg ggaagggcca
601 gtgtggtgcc actgccctat gagaggctgc tcagggagcc agggctgctg gccgtgcagg
661 ggctgcccga gggcctggcc ttccgaaggc cagccgagta tgaccccaag gccctcatgg
721 ccatcctgga acacagccac cgcatccgct tcaagctcaa gaggccactt gaggatggcg
781 ggcgggactc gaaggccctg gtggagctga acggtgtctc cctgattccc aaggggtcac
841 gggactgtgg cctgcatggc caggccccca aggtgccacc ccaggacctg cccccaaccg
901 ccacctcctc ctccatggcc agcttcctgt acagcacggc gctccccaac cacgccatcc
961 gagagctcaa gcaggaagca ccttcctgcc cccttgcccc cagcgacctg ggcctgagtc
1021 ggcccatgcc agagcccaag gccaccggtg cccaagactt ctccgactgt tgtggacaga
1081 agcccactgg gcctggtggg cctctcatcc agaacgtcca tgcctccaag cgcattctct
1141 tctccatcgt ccatgacaag tcagagaagt gggacgcctt cataaaggaa accgaggaca
1201 tcaacacgct ccgggagtgt gtgcagatcc tgtttaacag cagatatgcg gaagccctgg
1261 gcctggacca catggtcccc gtgccctacc ggaagattgc ctgtgacccg gaggctgtgg
1321 agatcgtggg catcccggac aagatcccct tcaagcgccc ctgcacttac ggagtcccca
1381 agctgaagcg gatcctggag gagcgccata gtatccactt catcattaag aggatgtttg
1441 atgagcgaat tttcacaggg aacaagttta ccaaagacac cacgaagctg gagccagcca
1501 gcccgccaga ggacacctct gcagaggtct ctagggccac cgtccttgac cttgctggga
1561 atgctcggtc agacaagggc agcatgtctg aagactgtgg gccaggaacc tccggggagc
1621 tgggcgggct gaggccgatc aaaattgagc cagaggatct ggacatcatt caggtcaccg
1681 tcccagaccc ctcgccaacc tctgaggaaa tgacagactc gatgcctggg cacctgccat
1741 cggaggattc tggttatggg atggagatgc tgacagacaa aggtctgagt gaggacgcgc
1801 ggcccgagga gaggcccgtg gaggacagcc acggtgacgt gatccggccc ctgcggaagc
1861 aggtggagct gctcttcaac acacgatacg ccaaggccat tggcatctcg gagcccgtca
1921 aggtgccgta ctccaagttt ctgatgcacc cggaggagct gtttgtggtg ggactgcctg
1981 aaggcatctc cctccgcagg cccaactgct tcgggatcgc caagctccgg aagattctgg
2041 aggccagcaa cagcatccag tttgtcatca agaggcccga gctgctcact gagggagtca
2101 aagagcccat cgtggatagt caaggaactg cctcctcact tggcttctct ccccctgccc
2161 tgcccccaga gagggattcc ggggaccctc tggtggacga gagcctgaag agacagggct
2221 ttcaagaaaa ttatgacgcg aggctctcac ggatcgacat cgccaacaca ctaagggagc
2281 aggtccagga ccttttcaat aagaaatacg gggaagcctt gggcatcaag tacccggtcc
2341 aggtccccta caagcggatc aagagtaacc ccggctccgt gatcatcgag gggctgcccc
2401 caggaatccc gttccgaaag ccctgtacct tcggctccca gaacctggag aggattcttg
2461 ctgtggctga caagatcaag ttcacagtca ccaggccttt ccaaggactc atcccaaagc
2521 ctgatgaaga tgacgccaac agactcgggg agaaggtgat cctgcgggag caggtgaagg
2581 aactcttcaa cgagaaatac ggtgaggccc tgggcctgaa ccggccggtg ctggtccctt
2641 ataaactaat ccgggacagc ccagacgccg tggaggtcac gggtctgcct gatgacatcc
2701 ccttccggaa ccccaacacg tacgacatcc accggctgga gaagatcctg aaggcccgag
2761 agcatgtccg catggtcatc attaaccagc tccaaccctt tgcagaaatc tgcaatgatg
2821 ccaaggtgcc agccaaagac agcagcattc ccaagcgcaa gagaaagcgg gtctcggaag
2881 gaaattccgt ctcctcttcc tcctcgtctt cctcttcctc gtcctctaac ccggattcag
2941 tggcatcggc caaccagatc tcactcgtgc aatggccaat gtacatggtg gactatgccg
3001 gcctgaacgt gcagctcccg ggacctctta attactagac ctcagtactg aatcaggacc
3061 tcactcagaa agactaaagg aaatgtaatt tatgtacaaa atgtatattc ggatatgtat
3121 cgatgccttt tagtttttcc aatgattttt acactatatt cctgccacca aggccttttt
3181 aaataagtaa aaaaaaaaaa aaa
//