LOCUS       BC042297                4626 bp    mRNA    linear   HUM 12-OCT-2006
DEFINITION  Homo sapiens upstream binding transcription factor, RNA polymerase
            I, mRNA (cDNA clone MGC:48801 IMAGE:4509695), complete cds.
ACCESSION   BC042297
VERSION     BC042297.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4626)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4626)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 84 Row: c Column: 4
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 7657670.
FEATURES             Location/Qualifiers
     source          1..4626
                     /db_xref="H-InvDB:HIT000052682"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:48801 IMAGE:4509695"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..4626
                     /gene="UBTF"
                     /gene_synonym="NOR-90"
                     /gene_synonym="UBF"
                     /db_xref="GeneID:7343"
                     /db_xref="HGNC:HGNC:12511"
                     /db_xref="MIM:600673"
     CDS             202..2385
                     /gene="UBTF"
                     /gene_synonym="NOR-90"
                     /gene_synonym="UBF"
                     /codon_start=1
                     /product="upstream binding transcription factor, RNA
                     polymerase I"
                     /protein_id="AAH42297.1"
                     /db_xref="GeneID:7343"
                     /db_xref="HGNC:HGNC:12511"
                     /db_xref="MIM:600673"
                     /translation="MNGEADCPTDLEMAAPKGQDRWSQEDMLTLLECMKNNLPSNDSS
                     KFKTTESHMDWEKVAFKDFSGDMCKLKWVEISNEVRKFRTLTELILDAQEHVKNPYKG
                     KKLKKHPDFPKKPLTPYFRFFMEKRAKYAKLHPEMSNLDLTKILSKKYKELPEKKKMK
                     YIQDFQREKQEFERNLARFREDHPDLIQNAKKSDIPEKPKTPQQLWYTHEKKVYLKVR
                     PDEIMRDYIQKHPELNISEEGITKSTLTKAERQLKDKFDGRPTKPPPNSYSLYCAELM
                     ANMKDVPSTERMVLCSQQWKLLSQKEKDAYHKKCDQKKKDYEVELLRFLESLPEEEQQ
                     RVLGEEKMLNINKKQATSPASKKPAQEGGKGGSEKPKRPVSAMFIFSEEKRRQLQEER
                     PELSESELTRLLARMWNDLSEKKKAKYKAREAALKAQSERKPGGEREERGKLPESPKR
                     AEEIWQQSVIGDYLARFKNDRVKALKAMEMTWNNMEKKEKLMWIKKAAEDQKRYEREL
                     SEMRAPPAATNSSKKMKFQGEPKKPPMNGYQKFSQELLSNGELNHLPLKERMVEIGSR
                     WQRISQSQKEHYKKLAEEQQKQYKVHLDLWVKSLSPQDRAAYKEYISNKRKSMTKLRG
                     PNPKSSRTTLQSKSESEEDDEEDEDDEDEDEEEEDDENGDSSEDGGDSSESSSEDESE
                     DGDENEEDDEDEDDDEDDDEDEDNESEGSSSSSSSSGDSSDSDSN"
BASE COUNT     1186 a   1188 c   1306 g    946 t
ORIGIN
        1 ccacgcgtcc ggccatctcg ggctttgtct ggcgactcgc tgccccggcg tcggctgcag
       61 cggagctgcg gctcgactgt tcggcccgcc gtgctcccag gtgcccccgg cctgcgctcc
      121 catccacacg ctcgggtgag gtggctttga ccccgggttg cccggccagc acgaccgagg
      181 aggtggctgg acagctggag gatgaacgga gaagccgact gccccacaga cctggaaatg
      241 gccgccccca aaggccaaga ccgttggtcc caggaagaca tgctgacttt gctggaatgc
      301 atgaagaaca accttccatc caatgacagc tccaagttca aaaccaccga atcacacatg
      361 gactgggaaa aagtagcatt taaagacttt tctggagaca tgtgcaagct caaatgggtg
      421 gagatttcta atgaggtgag gaagttccgt acattgacag aattgatcct cgatgctcag
      481 gaacatgtta aaaatcctta caaaggcaaa aaactcaaga aacacccaga cttcccaaag
      541 aagcccctga ccccttattt ccgcttcttc atggagaagc gggccaagta tgcgaaactc
      601 caccctgaga tgagcaacct ggacctaacc aagattctgt ccaagaaata caaggagctt
      661 ccggagaaga agaagatgaa atatattcag gacttccaga gagagaaaca ggagttcgag
      721 cgaaacctgg cccgattcag ggaggatcac cccgacctaa tccagaatgc caagaaatcg
      781 gacatcccag agaagcccaa aaccccccag cagctgtggt acacccacga gaagaaggtg
      841 tatctcaaag tgcggccaga tgagatcatg agagactata tccagaagca cccagagctg
      901 aacatcagtg aggagggtat caccaagtcc accctcacca aggccgaacg ccagctcaag
      961 gacaagtttg acgggcgacc caccaagcca cctccgaaca gctactcgct gtactgcgca
     1021 gagctcatgg ccaacatgaa ggacgtgccc agcacagagc gcatggtgct gtgcagccag
     1081 cagtggaagc tgctgtccca gaaggagaag gacgcctatc acaagaagtg tgatcagaaa
     1141 aagaaagatt acgaggtgga gctgctccgt ttcctcgaga gcctgcctga ggaggagcag
     1201 cagcgggtct tgggggaaga gaagatgctg aacatcaaca agaagcaggc caccagcccc
     1261 gcctccaaga agccagccca ggaagggggc aagggcggct ccgagaagcc caagcggccc
     1321 gtgtcggcca tgttcatctt ctcggaggag aaacggcggc agctgcagga ggagcggcct
     1381 gagctctccg agagcgagct gacccgcctg ctggcccgaa tgtggaacga cctgtctgag
     1441 aagaagaagg ccaagtacaa ggcccgagag gcggcgctca aggctcagtc ggagaggaag
     1501 cccggcgggg agcgcgagga acggggcaag ctgcccgagt cccccaaaag agctgaggag
     1561 atctggcaac agagcgttat cggcgactac ctggcccgct tcaagaatga ccgggtgaag
     1621 gccttgaaag ccatggaaat gacctggaat aacatggaaa agaaggagaa actgatgtgg
     1681 attaagaagg cagccgaaga ccaaaagcga tatgagagag agctgagtga gatgcgggca
     1741 cctccagctg ctacaaattc ttccaagaag atgaaattcc agggagaacc caagaagcct
     1801 cccatgaacg gttaccagaa gttctcccag gagctgctgt ccaatgggga gctgaaccac
     1861 ctgccgctga aggagcgcat ggtggagatc ggcagtcgct ggcagcgcat ctcccagagc
     1921 cagaaggagc actacaaaaa gctggccgag gagcagcaaa agcagtacaa ggtgcacctg
     1981 gacctctggg ttaagagcct gtctccccag gaccgtgcag catataaaga gtacatctcc
     2041 aataaacgta agagcatgac caagctgcga ggcccaaacc ccaaatccag ccggactact
     2101 ctgcagtcca agtcggagtc cgaggaggat gatgaagagg atgaggatga cgaggacgag
     2161 gatgaagaag aggaagatga tgagaatggg gactcctctg aagatggcgg cgactcctct
     2221 gagtccagca gcgaggacga gagcgaggat ggggatgaga atgaagagga tgacgaggac
     2281 gaagacgacg acgaggatga cgatgaggat gaagataatg agtccgaggg cagcagctcc
     2341 agctcctcct cctcagggga ctcctcagac tctgactcca actgaggctc agccccaccc
     2401 cagggcagcc agggagagcc caggagctcc cctccccaac tgaccacctt tgtttctccc
     2461 ccatgttctg tcccttgccc ccctggcctc ccccactttc tttctttctt taaaaaaaaa
     2521 aaaaaatacg gtgggggtag ggggctggag gagcccaggc caggactctg cagcctcaga
     2581 gacatcagcc cttgggggtc ctcctccagg gacagcaact atcagactaa gccagcaccg
     2641 gaccagcctg gcccacccca cccacttctg cacttgcggt tccggcatgg acaatggacc
     2701 ggagagtggg ggtggggggt cccaaagagt ttgatgaggc cctccacacc tgcggcccaa
     2761 tccaaggtgg ggtggaagct tggggaagac ccattccttc ccagaggggc ctgccacctg
     2821 gacccctgca ttggaactgg aggcagggaa catggggagg aggagggtcg gtgccttcaa
     2881 gaaaacaggc actgccctgg tggccctctc ccctgccccc tgcaggaagg agctgcctgg
     2941 acccgttcat gggggagggg gcagaagtgt tttttatata tgtgtatata ttttttttta
     3001 agctctgagc tgtcaacgag acgtttccta ccgatctcgg ctgccgtctc tgttgtcatt
     3061 tctgggagag ggagggttta ggggtagatt tggaaccttt taaaaatggt cttgatgtat
     3121 gtggaagaga gtatgtgtat gtgtgttcct gtacatagca tgggtgcagc tgtggatgtg
     3181 tgcaaaagag tgtgagtgtg tgtgtgtgtg tgtaaagggg tctgtcctag agcccacatc
     3241 agtttgttgt gaatctggaa aaagggtcgg tgagggccgg gagatgttga ccctggtggg
     3301 agcaggctga ggctgccccg ttctccacat cctctgtttt gcccagtctc tgattccatt
     3361 agggggagtg tgctgaagcc attctcggat gcttcccaga ccaggctccc tctgccagag
     3421 tcacatgcat ccgagctgct ggtctccatt gtccagcagg aaggcggaaa ggcaggcaag
     3481 atggtgtgaa gcttaaagct tgtatttgat ggaaaaggtc tcccctgttc atctgagagg
     3541 ccaagcctgg ccaccccagg ctcagaacct gggcttcaag aaatgtgctg ggagctccta
     3601 acttacacat ccctccagcc ttccttgaat cctcccacca ccccctattt cctttaattt
     3661 ctcaggtctg ctccctcctc ccccaacccc acagctgggc aagaagtctg caaaagctgc
     3721 atctgcagct gtctctaact cttcccagcc atctcccgta ttttttggta ccttgattcc
     3781 ttgactctta ataagccaag ccaccttatc tctgtagttc ttattttttt gttgactaaa
     3841 tttggggggt tcttttttat ggtcatgtca ctgacctatt aaattggggc ttggtgcttt
     3901 tccaccttcc ccctctgaat gaaagccaag gaatggggga agagcgggaa ctctgccgcg
     3961 gaggtggagc aagaacggtg aagggccctg gtcccagaga ggctggtggg tccctctccc
     4021 aaaggaaggc agacagtctc tgctttgcct tggaccttgg tgctgggggt ggggaggcct
     4081 gggggggaca ctccccactc ccattcccct tcctttgtcc taatcctgga attaagtaca
     4141 ggggtttata ggttctattt cttcccaaga gccctgcaaa gaaccccagt ttcctatttg
     4201 gatgccccta cactgttgtg tttcagtgga atgtattttc atttaaaaac aactttgaat
     4261 ggggcacttt ttctttcctg ttttaaaaat tgaaaaattc ttacagtaca aacaggactg
     4321 tcagggtggg ggtgttggtg ctgtaagagg tcactcttga gtgcattttg gcactgggat
     4381 gggatggctg gggtgggaag acccccatcc ccacccccaa cttcttttct aatatttaag
     4441 gagtgttttg taggattcaa caaccaccac aacttgaatt tgtatcatgg gaggtgggag
     4501 ggagtggctt agaggtgtct gcctatgctt aaagccaact gtggaagttt tgttttccct
     4561 tttttgtata ataaagtgaa aaacaaaggt ttgaaaaaaa aaaaaaaaaa aaaaaaaaaa
     4621 aaaaaa
//