LOCUS BC003621 2634 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens heterogeneous nuclear ribonucleoprotein U (scaffold
attachment factor A), mRNA (cDNA clone MGC:1992 IMAGE:2966453),
complete cds.
ACCESSION BC003621
VERSION BC003621.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2634)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2634)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (26-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC003621.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Louis M. Staudt, M.D., Ph.D.
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 3 Row: a Column: 7
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 14141160.
FEATURES Location/Qualifiers
source 1..2634
/db_xref="H-InvDB:HIT000031649"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:1992 IMAGE:2966453"
/tissue_type="Lymph, Burkitt lymphoma"
/clone_lib="NIH_MGC_8"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2634
/gene="HNRPU"
/gene_synonym="HNRNPU"
/gene_synonym="SAF-A"
/gene_synonym="U21.1"
/db_xref="GeneID:3192"
/db_xref="HGNC:HGNC:5048"
/db_xref="MIM:602869"
CDS 152..2572
/gene="HNRPU"
/gene_synonym="HNRNPU"
/gene_synonym="SAF-A"
/gene_synonym="U21.1"
/codon_start=1
/product="heterogeneous nuclear ribonucleoprotein U
(scaffold attachment factor A)"
/protein_id="AAH03621.1"
/db_xref="GeneID:3192"
/db_xref="HGNC:HGNC:5048"
/db_xref="MIM:602869"
/translation="MSSSPVNVKKLKVSELKEELKKRRLSDKGLKAELMERLQAALDD
EEAGGRPAMEPGNGSLDLGGDSAGRSGAGLEQEAAAGGDEEEEEEEEEEEGISALDGD
QMELGEENGAAGAADSGPMEEEEAASEDENGDDQGFQEGEDELGDEEEGAGDENGHGE
QQPQPPATQQQQPQQQRGAAKEAAGKSSGPTSLFAVTVAPPGARQGQQQAGGDGKTEQ
KGGDKKRGVKRPREDHGRGYFEYIEENKYSRAKSPQPPVEEEDEHFDDTVVCLDTYNC
DLHFKISRDRLSASSLTMESFAFLWAGGRASYGVSKGKVCFEMKVTEKIPVRHLYTKD
IDIHEVRIGWSLTTSGMLLGEEEFSYGYSLKGIKTCNCETEDYGEKFDENDVITCFAN
FESDEVELSYAKNGQDLGVAFKISKEVLAGRPLFPHVLCHNCAVEFNFGQKEKPYFPI
PEEYTFIQNVPLEDRVRGPKGPEEKKDCEVVMMIGLPGAGKTTWVTKHAAENPGKYNI
LGTNTIMDKMMVAGFKKQMADTGKLNTLLQRAPQCLGKFIEIAARKKRNFILDQTNVS
AAAQRRKMCLFAGFQRKAVVVCPKDEDYKQRTQKKAEVEGKDLPEHAVLKMKGNFTLP
EVAECFDEITYVELQKEEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFN
RGGGHRGRGGFNMRGGNFRGGAPGNRGGYNRRGNMPQRGGGGGGSGGIGYPYPRAPVF
PGRGSYSNRGNYNRGGMPNRGNYNQNFRGRGNNRGYKNQSQGYNQWQQGQFWGQKPWS
QHYHQGYY"
BASE COUNT 809 a 547 c 779 g 499 t
ORIGIN
1 caggcggatc cgctctgcag cacgaaccca tctccagccg cagccgcagc cgccgcccgg
61 gccgaggagc agccgcagca gccgccacca gtggccgagt gagcggagcc gagtttgagg
121 cagcgcctag cggtgaatcg gggccctcac catgagttcc tcgcctgtta atgtaaaaaa
181 gctgaaggtg tcggagctga aagaggagct caagaagcga cgcctttctg acaagggtct
241 caaggccgag ctcatggagc gactccaggc tgcgctggac gacgaggagg ccgggggccg
301 ccccgccatg gagcccggga acggcagcct agacctgggc ggggattccg ctgggcgctc
361 gggagcaggc ctcgagcagg aggccgcggc cggcggcgat gaagaggagg aagaagagga
421 agaggaggag gaaggaatct ccgctctgga cggcgaccag atggagctag gagaggagaa
481 cggggccgcg ggggcggccg actcgggccc gatggaggag gaggaggccg cctcggaaga
541 cgagaacggc gacgatcagg gtttccagga aggggaagat gagctcgggg acgaagagga
601 aggcgcgggc gacgagaacg ggcacgggga gcagcagcct caaccgccgg cgacgcagca
661 gcaacagccc caacagcagc gcggggccgc caaggaggcc gcggggaaga gcagcggccc
721 cacctcgctg ttcgcggtga cggtggcgcc gcccggggcg aggcagggcc agcagcaggc
781 gggaggggac ggcaaaacag aacagaaagg cggagataaa aagaggggtg ttaaaagacc
841 acgagaagat catggccgtg gatattttga gtacattgaa gagaacaagt atagcagagc
901 caaatctcct cagccacctg ttgaagaaga agatgaacac ttcgatgaca cagtggtttg
961 tcttgatact tataattgtg atctacattt taaaatatca agagatcgtc tcagtgcttc
1021 ttcccttaca atggagagtt ttgcttttct ttgggctgga ggaagagcat cctatggtgt
1081 gtcaaaaggc aaagtgtgtt ttgagatgaa ggttacagag aagatcccag taaggcattt
1141 atatacaaaa gatattgaca tacatgaagt tcgtattggc tggtcactaa ctacaagtgg
1201 aatgttactt ggtgaagaag aattttctta tgggtattct ctaaaaggaa taaaaacatg
1261 caactgtgag actgaagatt atggagaaaa gtttgatgaa aatgatgtga ttacatgttt
1321 tgctaacttt gaaagtgatg aagtagaact ctcgtatgct aagaatggac aagatcttgg
1381 cgttgccttc aaaatcagta aggaagttct tgctggacgg ccactgttcc cgcatgttct
1441 ctgccacaac tgtgcagttg aatttaattt tggtcagaag gaaaagccat attttccaat
1501 acctgaagag tatactttca tccagaacgt ccccttagag gatcgagtta gaggaccaaa
1561 ggggcctgaa gagaagaaag attgtgaagt tgtgatgatg attggcttgc caggagctgg
1621 aaaaactacc tgggttacta aacatgcagc agaaaatcca gggaaatata acattcttgg
1681 cacaaatact attatggata agatgatggt ggcaggtttt aagaagcaaa tggcagatac
1741 tggaaaactg aacacactgt tgcagagagc cccccagtgt cttgggaaat ttattgagat
1801 tgctgcccga aagaagcgaa attttattct ggatcagaca aatgtgtctg ctgctgccca
1861 gaggagaaaa atgtgcctgt ttgcaggctt ccagcgaaaa gctgttgtag tttgcccaaa
1921 agatgaagac tataagcaaa gaacacagaa gaaagcagaa gtagagggga aagacctacc
1981 agaacatgcg gtcctcaaaa tgaaaggaaa ctttaccctc ccagaggtag ctgagtgctt
2041 tgatgaaata acctatgttg aacttcagaa ggaagaagcc caaaaactct tggagcaata
2101 taaggaagaa agcaaaaagg ctcttccacc agaaaagaaa cagaacactg gctcaaagaa
2161 aagcaataaa aataagagtg gcaagaacca gtttaacaga ggtggtggcc atagaggacg
2221 tggaggattc aatatgcgtg gtggaaattt cagaggagga gcccctggga atcgtggcgg
2281 atataatagg aggggcaaca tgccacagag aggtggtggc ggtggaggaa gtggtggaat
2341 cggctatcca taccctcgtg cccctgtttt tcctggccgt ggtagttact caaacagagg
2401 gaactacaac agaggtggaa tgcccaacag agggaactac aaccagaact tcagaggacg
2461 aggaaacaat cgtggctaca aaaatcaatc tcagggctac aaccagtggc agcagggtca
2521 attctggggt cagaagccat ggagtcagca ttatcaccaa ggatattatt gaatacccaa
2581 ataaaacgaa ctgatacata tttctccaaa accttcaaaa aaaaaaaaaa aaaa
//