LOCUS       BC030574                3043 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens U2 small nuclear RNA auxiliary factor 2, mRNA (cDNA
            clone MGC:16243 IMAGE:3687471), complete cds.
ACCESSION   BC030574
VERSION     BC030574.2
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3043)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3043)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-MAY-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Dec 9, 2003 this sequence version replaced BC030574.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Louis M. Staudt, M.D., Ph.D.
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 24 Row: m Column: 16
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 60279266.
FEATURES             Location/Qualifiers
     source          1..3043
                     /db_xref="H-InvDB:HIT000092158"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:16243 IMAGE:3687471"
                     /tissue_type="Lymph, Burkitt lymphoma"
                     /clone_lib="NIH_MGC_8"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            1..3043
                     /gene="U2AF2"
                     /gene_synonym="U2AF65"
                     /db_xref="GeneID:11338"
                     /db_xref="HGNC:HGNC:23156"
                     /db_xref="MIM:191318"
     CDS             960..2375
                     /gene="U2AF2"
                     /gene_synonym="U2AF65"
                     /codon_start=1
                     /product="U2 small nuclear RNA auxiliary factor 2"
                     /protein_id="AAH30574.1"
                     /db_xref="GeneID:11338"
                     /db_xref="HGNC:HGNC:23156"
                     /db_xref="MIM:191318"
                     /translation="MSDFDEFERQLNENKQERDKENRHRKRSHSRSRSRDRKRRSRSR
                     DRRNRDQRSASRDRRRRSKPLTRGAKEEHGGLIRSPRHEKKKKVRKYWDVPPPGFEHI
                     TPMQYKAMQAAGQIPATALLPTMTPDGLAVTPTPVPVVGSQMTRQARRLYVGNIPFGI
                     TEEAMMDFFNAQMRLGGLTQAPGNPVLAVQINQDKNFAFLEFRSVDETTQAMAFDGII
                     FQGQSLKIRRPHDYQPLPGMSENPSVYVPGVVSTVVPDSAHKLFIGGLPNYLNDDQVK
                     ELLTSFGPLKAFNLVKDSATGLSKGYAFCEYVDINVTDQAIAGLNGMQLGDKKLLVQR
                     ASVGAKNATLSTINQTPVTLQVPGLMSSQVQMGGHPTEVLCLMNMVLPEELLDDEEYE
                     EIVEDVRDECSKYGLVKSIEIPRPVDGVEVPGCGKIFVEFTSVFDCQKAMQGLTGRKF
                     ANRVVVTKYCDPDSYHRRDFW"
BASE COUNT          642 a          892 c          953 g          556 t
ORIGIN      
        1 gcaacctccg cagcggtcgc gcgcaccctt cacccctctt gctgcttttg tctggcacaa
       61 ccttctcagg ccctccgcga ggccggccct ttttttcttt cactttcccc acccggagag
      121 ccaggtgccg agaactagga cctcgtcccc ttctctatcc cttcccctga tccatctcgt
      181 ctctgttctc cttccctttt ccgttcccga agccggaagg agccgaggta cccacggaaa
      241 aagtcgaagc tgtccccgga gagtgaggcc ttcacgaagc ggtggcggaa ggagccgaag
      301 ttctctgatc ggaggcgtga gcaagtggca tctccggaaa gagccgaaac acggactcga
      361 gcttaatccc aggaggggcc cgagggcgag atccggaaat ctccggaaag agccgaaggc
      421 cgggacggtt aggattgtcg gaagtggccg attgcttgga cagggccggc ggagaagatc
      481 ggagcaagtc cgtggaagaa gccaaagact gggacggatt gaattgttgg aagaacccga
      541 actcgcagag gggactgggc gcagtggcac acgaggacac acggaaacct ccgaacgttg
      601 aggagataat ctcggaaaag ttacggaaac ctgggaagtc ggcgaaaccc tacgttcggg
      661 tgttttcgga agcagccgaa gcggccgcct gcttcgcctt cgtccagtat ccggagatag
      721 ccgaggcgct tcggagccag gctcgggggg aggggcaagc tcgcgggcgg gcgggcgggt
      781 gagggggcgg agcccgggcg gggcggggag ggctgccgga ggcttgatta ggtaggaggt
      841 ggcgaagcgc cggcggcggc cggaagtagc cgaagccagc ggcggaagta gccgaagcgg
      901 ctggagcggg cggcaaggcg aggcgaaagc tgcacagggc cctacgcggc cgcctcagca
      961 tgtcggactt cgacgagttc gagcggcagc tcaacgagaa taaacaagag cgggacaagg
     1021 agaaccggca tcggaagcgc agccacagcc gctctcggag ccgggaccgc aaacgccgga
     1081 gccggagccg cgaccggcgc aaccgggacc agcggagcgc ctcccgggac aggcgacgac
     1141 gcagcaaacc tttgaccaga ggcgctaaag aggagcacgg tggactgatt cgttcccccc
     1201 gccacgagaa gaagaagaag gtccgtaaat actgggacgt gccaccccca ggctttgagc
     1261 acatcacccc aatgcagtac aaggccatgc aagctgcggg tcagattcca gccactgctc
     1321 ttctccccac catgacccct gacggtctgg ctgtgacccc aacgccggtg cccgtggtcg
     1381 ggagccagat gaccagacaa gcccggcgcc tctacgtggg caacatcccc tttggcatca
     1441 ctgaggaggc catgatggat ttcttcaacg cccagatgcg cctggggggg ctgacccagg
     1501 cccctggcaa cccagtgttg gctgtgcaga ttaaccagga caagaatttt gcctttttgg
     1561 agttccgctc agtggacgag actacccagg ctatggcctt tgatggcatc atcttccagg
     1621 gccagtcact aaagatccgc aggcctcacg actaccagcc gcttcctggc atgtcagaga
     1681 acccctccgt ctatgtgcct ggggttgtgt ccactgtggt ccccgactct gcccacaagc
     1741 tgttcatcgg gggcttaccc aactacctga acgatgacca ggtcaaagag ctgctgacat
     1801 cctttgggcc cctcaaggcc ttcaacctgg tcaaggacag tgccacgggg ctctccaagg
     1861 gctacgcctt ctgtgagtac gtggacatca acgtcacgga tcaggccatt gcggggctga
     1921 acggcatgca gctgggggat aagaagctgc tggtccagag ggcgagtgtg ggagccaaga
     1981 atgccacgct gagcaccatc aatcagacgc ctgtgaccct gcaagtgccg ggcttgatga
     2041 gctcccaggt gcagatgggc ggccacccga ctgaggtcct gtgcctcatg aacatggtgc
     2101 tgcctgagga gctgctggac gacgaggagt atgaggagat cgtggaggat gtgcgggacg
     2161 agtgcagcaa gtacgggctt gtcaagtcca tcgagatccc ccggcctgtg gacggcgtcg
     2221 aggtgcccgg ctgcggaaag atctttgtgg agttcacctc tgtgtttgac tgccagaaag
     2281 ccatgcaggg cctgacgggc cgcaagttcg ccaacagagt ggttgtcaca aaatactgtg
     2341 accccgactc ttatcaccgc cgggacttct ggtagaggcg gctgggggag ggtgggggca
     2401 gggctggctg ggggcttctc cccactcccg cccccccctt atccccctct gaagacgatg
     2461 ggcagaggag tgacagccgc agacacacga cagccggcag caactggaat ggcagcaatt
     2521 aagggtgggg ggcgggggtt ggggggttgg ggggttaggg cagggatggg actggggaag
     2581 tgcgcacaca gcccacacag acaacacgca cccacacaga cacagaggga aggggttggg
     2641 atggggacag ggtgcacagc agggcggggt aggaccccag cccctcccaa aacagcctct
     2701 ccttctccca tagacccctt tcttctcccc ttccccacgg taggaacata gcgtgtttat
     2761 attttatggc caaactattt tgaattttgt tgtccggccc tcagtgccct gccctctccc
     2821 ttaccaggac cacagctctg ttccttcggc ctctggtcct ctctggtccc ctcctgggtt
     2881 tcttacgtag ttgatttttc ctctttagtc tcccccgacc tgcgcccagc cccgtggccc
     2941 ctgcccctct cctactctct gtggcagttt catatttgct aagacgaatt tgctcattaa
     3001 acattttgtt gtattttact ttaaaaaaaa aaaaaaaaaa aaa
//