LOCUS       BC030574                3043 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens U2 small nuclear RNA auxiliary factor 2, mRNA (cDNA
            clone MGC:16243 IMAGE:3687471), complete cds.
ACCESSION   BC030574
VERSION     BC030574.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3043)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J.,
            Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S.,
            Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M.,
            Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E.,
            Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M.,
            Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W.,
            Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C.,
            Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S.,
            Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E.,
            Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3043)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-MAY-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 9, 2003 this sequence version replaced BC030574.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Louis M. Staudt, M.D., Ph.D.
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 24 Row: m Column: 16
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 60279266.
FEATURES             Location/Qualifiers
     source          1..3043
                     /db_xref="H-InvDB:HIT000092158"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:16243 IMAGE:3687471"
                     /tissue_type="Lymph, Burkitt lymphoma"
                     /clone_lib="NIH_MGC_8"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3043
                     /gene="U2AF2"
                     /gene_synonym="U2AF65"
                     /db_xref="GeneID:11338"
                     /db_xref="HGNC:HGNC:23156"
                     /db_xref="MIM:191318"
     CDS             960..2375
                     /gene="U2AF2"
                     /gene_synonym="U2AF65"
                     /codon_start=1
                     /product="U2 small nuclear RNA auxiliary factor 2"
                     /protein_id="AAH30574.1"
                     /db_xref="GeneID:11338"
                     /db_xref="HGNC:HGNC:23156"
                     /db_xref="MIM:191318"
                     /translation="MSDFDEFERQLNENKQERDKENRHRKRSHSRSRSRDRKRRSRSR
                     DRRNRDQRSASRDRRRRSKPLTRGAKEEHGGLIRSPRHEKKKKVRKYWDVPPPGFEHI
                     TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGI
                     TEEAMMDFFNAQMRLGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGII
                     FQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVK
                     ELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQR
                     ASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPEELLDDEEYE
                     EIVEDVRDECSKYGLVKSIEIPRPVDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKF
                     ANRVVVTKYCDPDSYHRRDFW"
BASE COUNT      642 a    892 c    953 g    556 t
ORIGIN
        1 gcaacctccg cagcggtcgc gcgcaccctt cacccctctt gctgcttttg tctggcacaa
       61 ccttctcagg ccctccgcga ggccggccct ttttttcttt cactttcccc acccggagag
      121 ccaggtgccg agaactagga cctcgtcccc ttctctatcc cttcccctga tccatctcgt
      181 ctctgttctc cttccctttt ccgttcccga agccggaagg agccgaggta cccacggaaa
      241 aagtcgaagc tgtccccgga gagtgaggcc ttcacgaagc ggtggcggaa ggagccgaag
      301 ttctctgatc ggaggcgtga gcaagtggca tctccggaaa gagccgaaac acggactcga
      361 gcttaatccc aggaggggcc cgagggcgag atccggaaat ctccggaaag agccgaaggc
      421 cgggacggtt aggattgtcg gaagtggccg attgcttgga cagggccggc ggagaagatc
      481 ggagcaagtc cgtggaagaa gccaaagact gggacggatt gaattgttgg aagaacccga
      541 actcgcagag gggactgggc gcagtggcac acgaggacac acggaaacct ccgaacgttg
      601 aggagataat ctcggaaaag ttacggaaac ctgggaagtc ggcgaaaccc tacgttcggg
      661 tgttttcgga agcagccgaa gcggccgcct gcttcgcctt cgtccagtat ccggagatag
      721 ccgaggcgct tcggagccag gctcgggggg aggggcaagc tcgcgggcgg gcgggcgggt
      781 gagggggcgg agcccgggcg gggcggggag ggctgccgga ggcttgatta ggtaggaggt
      841 ggcgaagcgc cggcggcggc cggaagtagc cgaagccagc ggcggaagta gccgaagcgg
      901 ctggagcggg cggcaaggcg aggcgaaagc tgcacagggc cctacgcggc cgcctcagca
      961 tgtcggactt cgacgagttc gagcggcagc tcaacgagaa taaacaagag cgggacaagg
     1021 agaaccggca tcggaagcgc agccacagcc gctctcggag ccgggaccgc aaacgccgga
     1081 gccggagccg cgaccggcgc aaccgggacc agcggagcgc ctcccgggac aggcgacgac
     1141 gcagcaaacc tttgaccaga ggcgctaaag aggagcacgg tggactgatt cgttcccccc
     1201 gccacgagaa gaagaagaag gtccgtaaat actgggacgt gccaccccca ggctttgagc
     1261 acatcacccc aatgcagtac aaggccatgc aagctgcggg tcagattcca gccactgctc
     1321 ttctccccac catgacccct gacggtctgg ctgtgacccc aacgccggtg cccgtggtcg
     1381 ggagccagat gaccagacaa gcccggcgcc tctacgtggg caacatcccc tttggcatca
     1441 ctgaggaggc catgatggat ttcttcaacg cccagatgcg cctggggggg ctgacccagg
     1501 cccctggcaa cccagtgttg gctgtgcaga ttaaccagga caagaatttt gcctttttgg
     1561 agttccgctc agtggacgag actacccagg ctatggcctt tgatggcatc atcttccagg
     1621 gccagtcact aaagatccgc aggcctcacg actaccagcc gcttcctggc atgtcagaga
     1681 acccctccgt ctatgtgcct ggggttgtgt ccactgtggt ccccgactct gcccacaagc
     1741 tgttcatcgg gggcttaccc aactacctga acgatgacca ggtcaaagag ctgctgacat
     1801 cctttgggcc cctcaaggcc ttcaacctgg tcaaggacag tgccacgggg ctctccaagg
     1861 gctacgcctt ctgtgagtac gtggacatca acgtcacgga tcaggccatt gcggggctga
     1921 acggcatgca gctgggggat aagaagctgc tggtccagag ggcgagtgtg ggagccaaga
     1981 atgccacgct gagcaccatc aatcagacgc ctgtgaccct gcaagtgccg ggcttgatga
     2041 gctcccaggt gcagatgggc ggccacccga ctgaggtcct gtgcctcatg aacatggtgc
     2101 tgcctgagga gctgctggac gacgaggagt atgaggagat cgtggaggat gtgcgggacg
     2161 agtgcagcaa gtacgggctt gtcaagtcca tcgagatccc ccggcctgtg gacggcgtcg
     2221 aggtgcccgg ctgcggaaag atctttgtgg agttcacctc tgtgtttgac tgccagaaag
     2281 ccatgcaggg cctgacgggc cgcaagttcg ccaacagagt ggttgtcaca aaatactgtg
     2341 accccgactc ttatcaccgc cgggacttct ggtagaggcg gctgggggag ggtgggggca
     2401 gggctggctg ggggcttctc cccactcccg cccccccctt atccccctct gaagacgatg
     2461 ggcagaggag tgacagccgc agacacacga cagccggcag caactggaat ggcagcaatt
     2521 aagggtgggg ggcgggggtt ggggggttgg ggggttaggg cagggatggg actggggaag
     2581 tgcgcacaca gcccacacag acaacacgca cccacacaga cacagaggga aggggttggg
     2641 atggggacag ggtgcacagc agggcggggt aggaccccag cccctcccaa aacagcctct
     2701 ccttctccca tagacccctt tcttctcccc ttccccacgg taggaacata gcgtgtttat
     2761 attttatggc caaactattt tgaattttgt tgtccggccc tcagtgccct gccctctccc
     2821 ttaccaggac cacagctctg ttccttcggc ctctggtcct ctctggtccc ctcctgggtt
     2881 tcttacgtag ttgatttttc ctctttagtc tcccccgacc tgcgcccagc cccgtggccc
     2941 ctgcccctct cctactctct gtggcagttt catatttgct aagacgaatt tgctcattaa
     3001 acattttgtt gtattttact ttaaaaaaaa aaaaaaaaaa aaa
//