LOCUS BC007577 4220 bp mRNA linear HUM 18-MAR-2009
DEFINITION Homo sapiens small nuclear ribonucleoprotein 200kDa (U5), mRNA
(cDNA clone IMAGE:3139787), partial cds.
ACCESSION BC007577
VERSION BC007577.1
KEYWORDS .
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4220)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4220)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (10-MAY-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 22 Row: h Column: 20.
FEATURES Location/Qualifiers
source 1..4220
/db_xref="H-InvDB:HIT000086850"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:3139787"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_21"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene <1..4220
/gene="SNRNP200"
/gene_synonym="BRR2"
/gene_synonym="HELIC2"
/gene_synonym="U5-200KD"
/db_xref="GeneID:23020"
/db_xref="HGNC:HGNC:30859"
/db_xref="MIM:601664"
CDS <1..3927
/gene="SNRNP200"
/gene_synonym="BRR2"
/gene_synonym="HELIC2"
/gene_synonym="U5-200KD"
/codon_start=1
/product="SNRNP200 protein"
/protein_id="AAH07577.1"
/db_xref="GeneID:23020"
/db_xref="HGNC:HGNC:30859"
/db_xref="MIM:601664"
/translation="KGTQVYSPEKGRWTELGALDILQMLGRAGRPQYDTKGEGILITS
HGELQYYLSLLNQQLPIESQMVSKLPDMLNAEIVLGNVQNAKDAVNWLGYAYLYIRML
RSPTLYGISHDDLKGDPLLDQRRLDLVHTAALMLDKNNLVKYDKKTGNFQVTELGRIA
SHYYITNDTVQTYNQLLKPTLSEIELFRVFSLSSEFKNITVREEEKLELQKLLERVPI
PVKESIEEPSAKINVLLQAFISQLKLEGFALMADMVYVTQSAGRLMRAIFEIVLNRGW
AQLTDKTLNLCKMIDKRMWQSMCPLRQFRKLPEEVVKKIEKKNFPFERLYDLNHNEIG
ELIRMPKMGKTIHKYVHLFPKLELSVHLQPITRSTLKVELTITPDFQWDEKVHGSSEA
FWILVEDVDSEVILHHEYFLLKAKYAQDEHLITFFVPVFEPLPPQYFIRVVSDRWLSC
ETQLPVSFRHLILPEKYPPPTELLDLQPLPVSALRNSAFESLYQDKFPFFNPIQTQVF
NTVYNSDDNVFVGAPTGSGKTICAEFAILRMLLQSSEGRCVYITPMEALAEQVYMDWY
EKFQDRLNKKVVLLTGETSTDLKLLGKGNIIISTPEKWDILSRRWKQRKNVQNINLFV
VDEVHLIGGENGPVLEVICSRMRYISSQIERPIRIVALSSSLSNAKDVAHWLGCSATS
TFNFHPNVRPVPLELHIQGFNISHTQTRLLSMAKPVYHAITKHSPKKPVIVFVPSRKQ
TRLTAIDILTTCAADIQRQRFLHCTEKDLIPYLEKLSDSTLKETLLNGVGYLHEGLSP
MERRLVEQLFSSGAIQVVVASRSLCWGMNVAAHLVIIMDTQYYNGKIHAYVDYPIYDV
LQMVGHANRPLQDDEGRCVIMCQGSKKDFFKKFLYEPLPVESHLDHCMHDHFNAEIVT
KTIENKQDAVDYLTWTFLYRRMTQNPNYYNLQGISHRHLSDHLSELVEQTLSDLEQSK
CISIEDEMDVAPLNLGMIAAYYYINYTTIELFSMSLNAKTKVRGLIEIISNAAEYENI
PIRHHEDNLLRQLAQKVPHKLNNPKFNDPHVKTNLLLQAHLSRMQLSAELQSDTEEIL
SKAIRLIQACVDVLSSNGWLSPALAAMELAQMVTQAMWSKDSYLKQLPHFTSEHIKRC
TDKGVESVFDIMEMEDEERNALLQLTDSQIADVARFCNRYPNIELSYEVVDKDSIRSG
GPVVVLVQLEREEEVTGPVIAPLFPQKREEGWWVVIGDAKSNSLISIKRLTLQQKAKV
KLDFVAPATGAHNYTLYFMSDAYMGCDQEYKFSVDVKEAETDSDSD"
BASE COUNT 1040 a 1128 c 1102 g 950 t
ORIGIN
1 aaaggcaccc aggtgtacag tccagagaag gggcgttgga cagaactggg agcactggac
61 attctgcaga tgctgggacg tgccggaaga ccccagtatg acaccaaggg tgaaggcata
121 ctcatcacat ctcatgggga gctacagtac tacctgtccc tcctcaatca acaacttcct
181 attgaaagcc agatggtttc aaagcttcct gacatgctca atgcagaaat cgtgctagga
241 aatgtccaga atgccaagga tgcggtgaac tggctgggct atgcctacct ctatatccga
301 atgctgcgat ccccaaccct ctatggcatc tctcatgatg acctcaaggg agatcccctg
361 ctggaccagc gccgactaga tctggttcat acagctgccc tgatgctgga caagaacaat
421 ctggtcaagt acgacaagaa gacgggcaac ttccaggtga cagaactggg ccgtatagcc
481 agccactact acatcaccaa tgatacagtg cagacttaca accagctgct gaagcccacc
541 ctgagtgaga ttgagctttt cagggtcttc tcattgtcct ctgagttcaa gaacatcaca
601 gtgagagagg aggagaagct ggagctgcag aagttgctgg agagggtgcc tatccctgta
661 aaggagagca ttgaggaacc cagtgctaag atcaacgttc ttctgcaagc cttcatctca
721 cagctgaaat tggagggctt tgcactgatg gctgacatgg tgtatgtcac acagtcggct
781 ggccggttga tgcgagcgat atttgaaatt gtcctgaacc gaggttgggc acagcttaca
841 gacaagaccc tgaacctctg caagatgatc gacaaacgca tgtggcagtc catgtgtcct
901 ctgcgccagt tccggaaact ccctgaggaa gtagtgaaga agattgagaa gaagaatttc
961 ccctttgagc gtctgtacga cctgaatcat aatgagattg gggagcttat ccgcatgcca
1021 aagatgggga agaccatcca caaatatgtc catctgtttc ccaagttgga gttgtcagtg
1081 cacctgcagc ctatcacacg ctccaccctg aaggtggagc tgaccatcac gccagacttc
1141 cagtgggatg aaaaggtgca tggttcatcc gaggcttttt ggattctggt ggaggatgtg
1201 gacagcgagg tgattctgca ccatgagtat tttctcctca aggccaagta cgcccaggac
1261 gagcacctca ttacattctt cgtgcctgtc tttgaaccgc tgccccctca gtacttcatc
1321 cgagtggtgt ctgaccgctg gctctcttgt gagacccagc tgcctgtctc cttccggcac
1381 ctgatcttgc cggagaagta cccccctcca accgaacttt tggacctgca gcccttgccc
1441 gtgtctgctc tgagaaacag tgcctttgag agtctttacc aagataaatt tcctttcttc
1501 aatcccatcc agacccaggt gtttaacact gtatacaaca gtgacgacaa cgtgtttgtg
1561 ggggccccca cgggcagcgg gaagactatt tgtgcagagt ttgccatcct gcgaatgctg
1621 ctgcagagct cggaggggcg ctgtgtgtac atcaccccca tggaggccct ggcagagcag
1681 gtatacatgg actggtacga gaagttccag gacaggctca acaagaaggt ggtactcctg
1741 acaggcgaga ccagcacaga cctgaagctg ctgggcaaag ggaacattat catcagcacc
1801 cctgagaagt gggacatact ttcccggcga tggaagcagc gcaagaacgt gcagaacatc
1861 aacctcttcg tggtggatga ggtccacctt atcgggggcg agaatgggcc tgtcttagaa
1921 gtgatctgct cccgaatgcg ctacatctcc tcccagattg agcggcccat tcgcattgtg
1981 gcactcagct cttcgctctc caatgccaag gatgtggccc actggctggg ctgcagtgcc
2041 acctccacct tcaacttcca tcccaatgtg cgtcccgtcc ccttggagct gcacatccag
2101 ggcttcaaca tcagccatac acaaacccgc ctgctctcca tggccaagcc tgtgtaccat
2161 gctatcacca agcactcgcc caagaagcct gtcattgtct ttgtgccgtc tcgcaagcag
2221 acccgcctca ctgccattga catcctcacc acctgtgcag cagacatcca acggcagagg
2281 ttcttgcact gcaccgagaa ggatctgatt ccgtacctgg agaagctaag tgacagcacg
2341 ctcaaggaaa cgctgctaaa tggggtgggc tacctgcatg aggggctcag ccccatggag
2401 cgacgcctgg tggagcagct cttcagctca ggggctatcc aggtggtggt ggcttctcgg
2461 agtctctgct ggggcatgaa cgtggctgcc cacctggtaa tcatcatgga tacccagtac
2521 tacaatggca agatccacgc ctatgtggat taccccatct atgacgtgct tcagatggtg
2581 ggccacgcca accgcccttt gcaggacgat gaggggcgct gtgtcatcat gtgtcagggc
2641 tccaagaagg atttcttcaa gaagttctta tatgagccat tgccagtaga atctcacctg
2701 gaccactgta tgcatgacca cttcaatgct gagatcgtca ccaagaccat tgagaacaag
2761 caggatgctg tggactacct cacctggacc tttctgtacc gccgcatgac acagaacccc
2821 aattactaca acctgcaggg catctcccat cgtcacttgt cggaccactt gtcagagctg
2881 gtggagcaga ccctgagtga cctggagcag tccaagtgca tcagcatcga ggacgagatg
2941 gacgtggcgc ctctgaacct aggcatgatc gccgcctact attacatcaa ctacaccacc
3001 attgagctct tcagcatgtc cctcaatgcc aagaccaagg tgcgagggct tatcgagatc
3061 atctccaatg cagcagagta tgagaacatt cccatccggc accatgaaga caatctcctg
3121 aggcagttgg ctcagaaggt cccccacaag ctgaataacc ctaagttcaa tgatccgcac
3181 gtcaagacca acctgctcct gcaggctcac ttgtctcgca tgcagctgag tgctgagttg
3241 cagtcagata cggaggaaat ccttagtaag gcaatccggc ttatccaggc ctgcgtggat
3301 gtcctttcca gcaatgggtg gctcagccct gctctggcag ctatggaact ggcccagatg
3361 gtcacccaag ccatgtggtc caaggactca tacctgaagc agctgccaca cttcacctct
3421 gagcatatca aacgttgcac agacaaggga gtggagagtg ttttcgacat catggagatg
3481 gaggatgaag aacggaacgc gttgcttcag ctgactgaca gccagattgc agatgtggct
3541 cgcttttgta accgctaccc taatatcgaa ctatcttatg aggtggtaga taaggacagc
3601 atccgcagtg gcgggccagt tgtggtgctg gtgcagctgg agcgagagga ggaagtcaca
3661 ggccctgtca ttgcgcctct cttcccgcag aaacgtgaag agggctggtg ggtggtgatt
3721 ggagatgcca agtccaatag cctcatctcc atcaagaggc tgaccttgca gcagaaggcc
3781 aaggtgaagt tggactttgt ggccccagcc actggtgccc acaactacac tctgtacttc
3841 atgagtgacg cttacatggg atgtgaccag gagtacaaat tcagcgtgga tgtgaaagaa
3901 gctgagacag acagtgattc agattgagtc ctgaggcatt tacttttggg taaaggagag
3961 ttgagcctga attaggaatg tgtacattgt aggaatcctg gttgtgggga ccaggtctgt
4021 gggcctcagg tctggccagc cagggctggt gctgtccccg cctacctcca cttcctttcc
4081 cttgctcact ctggatccag tgacagcagg tgtcatgggt caagcataaa tcatatatag
4141 cattttcagg catgttcctg gtagttcttt tgagtctgac attctaataa aataatttgt
4201 agaaaaaaaa aaaaaaaaaa
//