LOCUS BC018136 3203 bp mRNA linear HUM 24-JUL-2006 DEFINITION Homo sapiens GTF2I repeat domain containing 1, mRNA (cDNA clone MGC:9316 IMAGE:3913745), complete cds. ACCESSION BC018136 VERSION BC018136.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3203) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3203) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (03-DEC-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 22 Row: l Column: 7 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 15011923. FEATURES Location/Qualifiers source 1..3203 /db_xref="H-InvDB:HIT000038289" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:9316 IMAGE:3913745" /tissue_type="Uterus, leiomyosarcoma" /clone_lib="NIH_MGC_71" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..3203 /gene="GTF2IRD1" /gene_synonym="CREAM1" /gene_synonym="GTF3" /gene_synonym="hMusTRD1alpha1" /gene_synonym="MUSTRD1" /gene_synonym="RBAP2" /gene_synonym="WBSCR12" /db_xref="GeneID:9569" /db_xref="HGNC:HGNC:4661" /db_xref="MIM:604318" CDS 159..3038 /gene="GTF2IRD1" /gene_synonym="CREAM1" /gene_synonym="GTF3" /gene_synonym="hMusTRD1alpha1" /gene_synonym="MUSTRD1" /gene_synonym="RBAP2" /gene_synonym="WBSCR12" /codon_start=1 /product="GTF2I repeat domain containing 1" /protein_id="AAH18136.1" /db_xref="GeneID:9569" /db_xref="HGNC:HGNC:4661" /db_xref="MIM:604318" /translation="MALLGKRCDVPTNGCGPDRWNSAFTRKDEIITSLVSALDSMCSA LSKLNAEVACVAVHDESAFVVGTEKGRMFLNARKELQSDFLRFCRGPPWKDPEAEHPK KVQRGEGGGRSLPRSSLEHGSDVYLLRKMVEEVFDVLYSEALGRASVVPLPYERLLRE PGLLAVQGLPEGLAFRRPAEYDPKALMAILEHSHRIRFKLKRPLEDGGRDSKALVELN GVSLIPKGSRDCGLHGQAPKVPPQDLPPTATSSSMASFLYSTALPNHAIRELKQEAPS CPLAPSDLGLSRPMPEPKATGAQDFSDCCGQKPTGPGGPLIQNVHASKRILFSIVHDK SEKWDAFIKETEDINTLRECVQILFNSRYAEALGLDHMVPVPYRKIACDPEAVEIVGI PDKIPFKRPCTYGVPKLKRILEERHSIHFIIKRMFDERIFTGNKFTKDTTKLEPASPP EDTSAEVSRATVLDLAGNARSDKGSMSEDCGPGTSGELGGLRPIKIEPEDLDIIQVTV PDPSPTSEEMTDSMPGHLPSEDSGYGMEMLTDKGLSEDARPEERPVEDSHGDVIRPLR KQVELLFNTRYAKAIGISEPVKVPYSKFLMHPEELFVVGLPEGISLRRPNCFGIAKLR KILEASNSIQFVIKRPELLTEGVKEPIVDSQGTASSLGFSPPALPPERDSGDPLVDES LKRQGFQENYDARLSRIDIANTLREQVQDLFNKKYGEALGIKYPVQVPYKRIKSNPGS VIIEGLPPGIPFRKPCTFGSQNLERILAVADKIKFTVTRPFQGLIPKPDEDDANRLGE KVILREQVKELFNEKYGEALGLNRPVLVPYKLIRDSPDAVEVTGLPDDIPFRNPNTYD IHRLEKILKAREHVRMVIINQLQPFAEICNDAKVPAKDSSIPKRKRKRVSEGNSVSSS SSSSSSSSSNPDSVASANQISLVQWPMYMVDYAGLNVQLPGPLNY" BASE COUNT 720 a 975 c 910 g 598 t ORIGIN 1 cccgccctcg ccgcgccgcc gtcctcgcct ccctctgcct ctccttcccc cattctcccg 61 gattaattaa ggaggcagcg gcaggaggct gagtcctggc cgcgggccgg ggccggggcg 121 ccgctggcag gagcgcttgg ggatcctcca aggcgaccat ggccttgctg ggtaagcgct 181 gtgacgtccc caccaacggc tgcggacccg accgctggaa ctccgcgttc acccgcaaag 241 acgagatcat caccagcctc gtgtctgcct tagactccat gtgctcagcg ctgtccaaac 301 tgaacgccga ggtggcctgt gtcgccgtgc acgatgagag cgcctttgtg gtgggcacag 361 agaaggggag aatgttcctg aatgcccgga aggagctaca gtcagacttc ctcaggttct 421 gccgagggcc cccgtggaag gatccggagg cagagcaccc caagaaggtg cagcggggcg 481 agggtggagg ccgtagcctc cctcggtcct ccctggaaca tggctcagat gtgtaccttc 541 tgcggaagat ggtagaggag gtgtttgatg ttctttatag cgaggccctg ggaagggcca 601 gtgtggtgcc actgccctat gagaggctgc tcagggagcc agggctgctg gccgtgcagg 661 ggctgcccga gggcctggcc ttccgaaggc cagccgagta tgaccccaag gccctcatgg 721 ccatcctgga acacagccac cgcatccgct tcaagctcaa gaggccactt gaggatggcg 781 ggcgggactc gaaggccctg gtggagctga acggtgtctc cctgattccc aaggggtcac 841 gggactgtgg cctgcatggc caggccccca aggtgccacc ccaggacctg cccccaaccg 901 ccacctcctc ctccatggcc agcttcctgt acagcacggc gctccccaac cacgccatcc 961 gagagctcaa gcaggaagca ccttcctgcc cccttgcccc cagcgacctg ggcctgagtc 1021 ggcccatgcc agagcccaag gccaccggtg cccaagactt ctccgactgt tgtggacaga 1081 agcccactgg gcctggtggg cctctcatcc agaacgtcca tgcctccaag cgcattctct 1141 tctccatcgt ccatgacaag tcagagaagt gggacgcctt cataaaggaa accgaggaca 1201 tcaacacgct ccgggagtgt gtgcagatcc tgtttaacag cagatatgcg gaagccctgg 1261 gcctggacca catggtcccc gtgccctacc ggaagattgc ctgtgacccg gaggctgtgg 1321 agatcgtggg catcccggac aagatcccct tcaagcgccc ctgcacttac ggagtcccca 1381 agctgaagcg gatcctggag gagcgccata gtatccactt catcattaag aggatgtttg 1441 atgagcgaat tttcacaggg aacaagttta ccaaagacac cacgaagctg gagccagcca 1501 gcccgccaga ggacacctct gcagaggtct ctagggccac cgtccttgac cttgctggga 1561 atgctcggtc agacaagggc agcatgtctg aagactgtgg gccaggaacc tccggggagc 1621 tgggcgggct gaggccgatc aaaattgagc cagaggatct ggacatcatt caggtcaccg 1681 tcccagaccc ctcgccaacc tctgaggaaa tgacagactc gatgcctggg cacctgccat 1741 cggaggattc tggttatggg atggagatgc tgacagacaa aggtctgagt gaggacgcgc 1801 ggcccgagga gaggcccgtg gaggacagcc acggtgacgt gatccggccc ctgcggaagc 1861 aggtggagct gctcttcaac acacgatacg ccaaggccat tggcatctcg gagcccgtca 1921 aggtgccgta ctccaagttt ctgatgcacc cggaggagct gtttgtggtg ggactgcctg 1981 aaggcatctc cctccgcagg cccaactgct tcgggatcgc caagctccgg aagattctgg 2041 aggccagcaa cagcatccag tttgtcatca agaggcccga gctgctcact gagggagtca 2101 aagagcccat cgtggatagt caaggaactg cctcctcact tggcttctct ccccctgccc 2161 tgcccccaga gagggattcc ggggaccctc tggtggacga gagcctgaag agacagggct 2221 ttcaagaaaa ttatgacgcg aggctctcac ggatcgacat cgccaacaca ctaagggagc 2281 aggtccagga ccttttcaat aagaaatacg gggaagcctt gggcatcaag tacccggtcc 2341 aggtccccta caagcggatc aagagtaacc ccggctccgt gatcatcgag gggctgcccc 2401 caggaatccc gttccgaaag ccctgtacct tcggctccca gaacctggag aggattcttg 2461 ctgtggctga caagatcaag ttcacagtca ccaggccttt ccaaggactc atcccaaagc 2521 ctgatgaaga tgacgccaac agactcgggg agaaggtgat cctgcgggag caggtgaagg 2581 aactcttcaa cgagaaatac ggtgaggccc tgggcctgaa ccggccggtg ctggtccctt 2641 ataaactaat ccgggacagc ccagacgccg tggaggtcac gggtctgcct gatgacatcc 2701 ccttccggaa ccccaacacg tacgacatcc accggctgga gaagatcctg aaggcccgag 2761 agcatgtccg catggtcatc attaaccagc tccaaccctt tgcagaaatc tgcaatgatg 2821 ccaaggtgcc agccaaagac agcagcattc ccaagcgcaa gagaaagcgg gtctcggaag 2881 gaaattccgt ctcctcttcc tcctcgtctt cctcttcctc gtcctctaac ccggattcag 2941 tggcatcggc caaccagatc tcactcgtgc aatggccaat gtacatggtg gactatgccg 3001 gcctgaacgt gcagctcccg ggacctctta attactagac ctcagtactg aatcaggacc 3061 tcactcagaa agactaaagg aaatgtaatt tatgtacaaa atgtatattc ggatatgtat 3121 cgatgccttt tagtttttcc aatgattttt acactatatt cctgccacca aggccttttt 3181 aaataagtaa aaaaaaaaaa aaa //