LOCUS BC024767 2926 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens heterogeneous nuclear ribonucleoprotein U (scaffold
attachment factor A), mRNA (cDNA clone MGC:29899 IMAGE:4993703),
complete cds.
ACCESSION BC024767
VERSION BC024767.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2926)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2926)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-MAR-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 42 Row: j Column: 10
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis, Similarity but not identity to protein.
FEATURES Location/Qualifiers
source 1..2926
/db_xref="H-InvDB:HIT000039837"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:29899 IMAGE:4993703"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_10"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2926
/gene="HNRPU"
/gene_synonym="HNRNPU"
/gene_synonym="SAF-A"
/gene_synonym="U21.1"
/db_xref="GeneID:3192"
/db_xref="HGNC:HGNC:5048"
/db_xref="MIM:602869"
CDS 152..2572
/gene="HNRPU"
/gene_synonym="HNRNPU"
/gene_synonym="SAF-A"
/gene_synonym="U21.1"
/codon_start=1
/product="heterogeneous nuclear ribonucleoprotein U
(scaffold attachment factor A)"
/protein_id="AAH24767.1"
/db_xref="GeneID:3192"
/db_xref="HGNC:HGNC:5048"
/db_xref="MIM:602869"
/translation="MSSSPVNVKKLKVSELKEELKKRRLSDKGLKAELMERLQAALDD
EEAGGRPAMEPGNGSLDLGGDSAGRSGAGLEQEAAAGGDEEEEEEEEEEEGISALDGD
QMELGEENGAAGAADSGPMEEEEAASEDENGDDQGFQEGEDELGDEEEGAGDENGHGE
QQPQPPATQQQQPQQQRGAAKEAAGKSSGPTSLFAVTVAPPGARQGQQQAGGDGKTEQ
KGGDKKRGVKRPREDHGRGYFEYIEENKYSRAKSPQPPVEEEDEHFDDTVVCLDTYNC
DLHFKISRDRLSASSLTMESFAFLWAGGRASYGVSKGKVCFEMKVTEKIPVRHLYTKD
IDIHEVRIGWSLTTSGMLLGEEEFSYGYSLKGIKTCNCETEDYGEKFDENDVITCFAN
FESDEVELSYAKNGQDLGVAFKISKEVLAGRPLFPHVLCHNCAVEFNFGQKEKPYFPI
PEEYTFIQNVPLEDRVRGPKGPEEKKDCEVVMMIGLPGAGKTTWVTKHAAENPGKYNI
LGTNTIMDKMMVAGFKKQMADTGKLNTLLQRAPQCLGKFIEIAARKKRNFILDQTNVS
AAAQRRKMCLFAGFQRKAVVVCPKDEDYKQRTQKKAEVEGKDLPEHAVLKMKGNFTLP
EVAECFDEITYVELQKEEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFN
RGGGHRGRGGFNMRGGNFRGGAPGNRGGYNRRGNMPQRGGGGGGSGGIGYPYPRAPVF
PGRGSYSNRGNYNRGGMPNRGNYNQNFRGRGNNRGYKNQSQGYNQWQQGQFWGQKPWS
QHYHQGYY"
BASE COUNT 898 a 584 c 831 g 613 t
ORIGIN
1 caggcggatc cgctctgcag cacgaaccca tctccagccg cagccgcagc cgccgcccgg
61 gccgaggagc agccgcagca gccgccacca gtggccgagt gagcggagcc gagtttgagg
121 cagcgcctag cggtgaatcg gggccctcac catgagttcc tcgcctgtta atgtaaaaaa
181 gctgaaggtg tcggagctga aagaggagct caagaagcga cgcctttctg acaagggtct
241 caaggccgag ctcatggagc gactccaggc tgcgctggac gacgaggagg ccgggggccg
301 ccccgccatg gagcccggga acggcagcct agacctgggc ggggattccg ctgggcgctc
361 gggagcaggc ctcgagcagg aggccgcggc cggcggcgat gaagaggagg aggaagagga
421 agaggaggag gaaggaatct ccgctctgga cggcgaccag atggagctag gagaggagaa
481 cggggccgcg ggggcggccg actcgggccc gatggaggag gaggaggccg cctcggaaga
541 cgagaacggc gacgatcagg gtttccagga aggggaagat gagctcgggg acgaagagga
601 aggcgcgggc gacgagaacg ggcacgggga gcagcagcct caaccgccgg cgacgcagca
661 gcaacagccc caacagcagc gcggggccgc caaggaggcc gcggggaaga gcagcggccc
721 cacctcgctg ttcgcggtga cggtggcgcc gcccggggcg aggcagggcc agcagcaggc
781 gggaggggac ggcaaaacag aacagaaagg cggagataaa aagaggggtg ttaaaagacc
841 acgagaagat catggccgtg gatattttga gtacattgaa gagaacaagt atagcagagc
901 caaatctcct cagccacctg ttgaagaaga agatgaacac ttcgatgaca cagtggtttg
961 tcttgatact tataattgtg atctacattt taaaatatca agagatcgtc tcagtgcttc
1021 ttcccttaca atggagagtt ttgcttttct ttgggctgga ggaagagcat cctatggtgt
1081 gtcaaaaggc aaagtgtgtt ttgagatgaa ggttacagag aagatcccag taaggcattt
1141 atatacaaaa gatattgaca tacatgaagt tcgtattggc tggtcactaa ctacaagtgg
1201 aatgttactt ggtgaagaag aattttctta tgggtattct ctaaaaggaa taaaaacatg
1261 caactgtgag actgaagatt atggagaaaa gtttgatgaa aatgatgtga ttacatgttt
1321 tgctaacttt gaaagtgatg aagtagaact ctcgtatgct aagaatggac aagatcttgg
1381 cgttgccttc aaaatcagta aggaagttct tgctggacgg ccactgttcc cgcatgttct
1441 ctgccacaac tgtgcagttg aatttaattt tggtcagaag gaaaagccat attttccaat
1501 acctgaagag tatactttca tccagaacgt ccccttagag gatcgagtta gaggaccaaa
1561 ggggcctgaa gagaagaaag attgtgaagt tgtgatgatg attggcttgc caggagctgg
1621 aaaaactacc tgggttacta aacatgcagc agaaaatcca gggaaatata acattcttgg
1681 cacaaatact attatggata agatgatggt ggcaggtttt aagaagcaaa tggcagatac
1741 tggaaaactg aacacactgt tgcagagagc cccccagtgt cttgggaaat ttattgagat
1801 tgctgcccga aagaagcgaa attttattct ggatcagaca aatgtgtctg ctgctgccca
1861 gaggagaaaa atgtgcctgt ttgcaggctt ccagcgaaaa gctgttgtag tttgcccaaa
1921 agatgaagac tataagcaaa gaacacagaa gaaagcagaa gtagagggga aagacctacc
1981 agaacatgcg gtcctcaaaa tgaaaggaaa ctttaccctc ccagaggtag ctgagtgctt
2041 tgatgaaata acctatgttg aacttcagaa ggaagaagcc caaaaactct tggagcaata
2101 taaggaagaa agcaaaaagg ctcttccacc agaaaagaaa cagaacactg gctcaaagaa
2161 aagcaataaa aataagagtg gcaagaacca gtttaacaga ggtggtggcc atagaggacg
2221 tggaggattc aatatgcgtg gtggaaattt cagaggagga gcccctggga atcgtggcgg
2281 atataatagg aggggcaaca tgccacagag aggtggtggc ggtggaggaa gtggtggaat
2341 cggctatcca taccctcgtg cccctgtttt tcctggccgt ggtagttact caaacagagg
2401 gaactacaac agaggtggaa tgcccaacag agggaactac aaccagaact tcagaggacg
2461 aggaaacaat cgtggctaca aaaatcaatc tcagggctac aaccagtggc agcagggtca
2521 attctggggt cagaagccat ggagtcagca ttatcaccaa ggatattatt gaatacccaa
2581 ataaaacgaa ctgatacata tttctccaaa accttcacaa gaagtcgact gttttcttta
2641 gtaggctaac tttttaaaca ttccacaaga ggaagtgcct gcgggttcct tttttagaag
2701 ctttgtgggt tgattttttt tcttttcttt tttgtacatt tttaattgca gtttaaaagt
2761 gaatcgtaag agaacctcag cattgtgcac gataagagaa tgtgtcagta tttcagggtt
2821 ctacatttta tctgtaaaat gtgacttttt tttttttttt atcacaacag aagtaaaatg
2881 ttgctttgta aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//