LOCUS BC003367 3396 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens heterogeneous nuclear ribonucleoprotein U (scaffold
attachment factor A), mRNA (cDNA clone MGC:5216 IMAGE:2900876),
complete cds.
ACCESSION BC003367
VERSION BC003367.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3396)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3396)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (20-FEB-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 3 Row: m Column: 15
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 14141160.
FEATURES Location/Qualifiers
source 1..3396
/db_xref="H-InvDB:HIT000031521"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:5216 IMAGE:2900876"
/tissue_type="Placenta, choriocarcinoma"
/clone_lib="NIH_MGC_10"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..3396
/gene="HNRPU"
/gene_synonym="HNRNPU"
/gene_synonym="SAF-A"
/gene_synonym="U21.1"
/db_xref="GeneID:3192"
/db_xref="HGNC:HGNC:5048"
/db_xref="MIM:602869"
CDS 150..2570
/gene="HNRPU"
/gene_synonym="HNRNPU"
/gene_synonym="SAF-A"
/gene_synonym="U21.1"
/codon_start=1
/product="heterogeneous nuclear ribonucleoprotein U
(scaffold attachment factor A)"
/protein_id="AAH03367.1"
/db_xref="GeneID:3192"
/db_xref="HGNC:HGNC:5048"
/db_xref="MIM:602869"
/translation="MSSSPVNVKKLKVSELKEELKKRRLSDKGLKAELMERLQAALDD
EEAGGRPAMEPGNGSLDLGGDSAGRSGAGLEQEAAAGGDEEEEEEEEEEEGISALDGD
QMELGEENGAAGAADSGPMEEEEAASEDENGDDQGFQEGEDELGDEEEGAGDENGHGE
QQPQPPATQQQQPQQQRGAAKEAAGKSSGPTSLFAVTVAPPGARQGQQQAGGDGKTEQ
KGGDKKRGVKRPREDHGRGYFEYIEENKYSRAKSPQPPVEEEDEHFDDTVVCLDTYNC
DLHFKISRDRLSASSLTMESFAFLWAGGRASYGVSKGKVCFEMKVTEKIPVRHLYTKD
IDIHEVRIGWSLTTSGMLLGEEEFSYGYSLKGIKTCNCETEDYGEKFDENDVITCFAN
FESDEVELSYAKNGQDLGVAFKISKEVLAGRPLFPHVLCHNCAVEFNFGQKEKPYFPI
PEEYTFIQNVPLEDRVRGPKGPEEKKDCEVVMMIGLPGAGKTTWVTKHAAENPGKYNI
LGTNTIMDKMMVAGFKKQMADTGKLNTLLQRAPQCLGKFIEIAARKKRNFILDQTNVS
AAAQRRKMCLFAGFQRKAVVVCPKDEDYKQRTQKKAEVEGKDLPEHAVLKMKGNFTLP
EVAECFDEITYVELQKEEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFN
RGGGHRGRGGFNMRGGNFRGGAPGNRGGYNRRGNMPQRGGGGGGSGGIGYPYPRAPVF
PGRGSYSNRGNYNRGGMPNRGNYNQNFRGRGNNRGYKNQSQGYNQWQQGQFWGQKPWS
QHYHQGYY"
BASE COUNT 1045 a 654 c 928 g 769 t
ORIGIN
1 ggcggatccg ctctgcagca cgaacccatc tccagccgca gccgcagccg ccgcccgggc
61 cgaggagcag ccgcagcagc cgccaccagt ggccgagtga gcggagccga gtttgaggca
121 gcgcctagcg gtgaatcggg gccctcacca tgagttcctc gcctgttaat gtaaaaaagc
181 tgaaggtgtc ggagctgaaa gaggagctca agaagcgacg cctttctgac aagggtctca
241 aggccgagct catggagcga ctccaggctg cgctggacga cgaggaggcc gggggccgcc
301 ccgccatgga gcccgggaac ggcagcctag acctgggcgg ggattccgct gggcgctcgg
361 gagcaggcct cgagcaggag gccgcggccg gcggcgatga agaggaggag gaagaggaag
421 aggaggagga aggaatctcc gctctggacg gcgaccagat ggagctagga gaggagaacg
481 gggccgcggg ggcggccgac tcgggcccga tggaggagga ggaggccgcc tcggaagacg
541 agaacggcga cgatcagggt ttccaggaag gggaagatga gctcggggac gaagaggaag
601 gcgcgggcga cgagaacggg cacggggagc agcagcctca accgccggcg acgcagcagc
661 aacagcccca acagcagcgc ggggccgcca aggaggccgc ggggaagagc agcggcccca
721 cctcgctgtt cgcggtgacg gtggcgccgc ccggggcgag gcagggccag cagcaggcgg
781 gaggggacgg caaaacagaa cagaaaggcg gagataaaaa gaggggtgtt aaaagaccac
841 gagaagatca tggccgtgga tattttgagt acattgaaga gaacaagtat agcagagcca
901 aatctcctca gccacctgtt gaagaagaag atgaacactt cgatgacaca gtggtttgtc
961 ttgatactta taattgtgat ctacatttta aaatatcaag agatcgtctc agtgcttctt
1021 cccttacaat ggagagtttt gcttttcttt gggctggagg aagagcatcc tatggtgtgt
1081 caaaaggcaa agtgtgtttt gagatgaagg ttacagagaa gatcccagta aggcatttat
1141 atacaaaaga tattgacata catgaagttc gtattggctg gtcactaact acaagtggaa
1201 tgttacttgg tgaagaagaa ttttcttatg ggtattctct aaaaggaata aaaacatgca
1261 actgtgagac tgaagattat ggagaaaagt ttgatgaaaa tgatgtgatt acatgttttg
1321 ctaactttga aagtgatgaa gtagaactct cgtatgctaa gaatggacaa gatcttggcg
1381 ttgccttcaa aatcagtaag gaagttcttg ctggacggcc actgttcccg catgttctct
1441 gccacaactg tgcagttgaa tttaattttg gtcagaagga aaagccatat tttccaatac
1501 ctgaagagta tactttcatc cagaacgtcc ccttagagga tcgagttaga ggaccaaagg
1561 ggcctgaaga gaagaaagat tgtgaagttg tgatgatgat tggcttgcca ggagctggaa
1621 aaactacctg ggttactaaa catgcagcag aaaatccagg gaaatataac attcttggca
1681 caaatactat tatggataag atgatggtgg caggttttaa gaagcaaatg gcagatactg
1741 gaaaactgaa cacactgttg cagagagccc cccagtgtct tgggaaattt attgagattg
1801 ctgcccgaaa gaagcgaaat tttattctgg atcagacaaa tgtgtctgct gctgcccaga
1861 ggagaaaaat gtgcctgttt gcaggcttcc agcgaaaagc tgttgtagtt tgcccaaaag
1921 atgaagacta taagcaaaga acacagaaga aagcagaagt agaggggaaa gacctaccag
1981 aacatgcggt cctcaaaatg aaaggaaact ttaccctccc agaggtagct gagtgctttg
2041 atgaaataac ctatgttgaa cttcagaagg aagaagccca aaaactcttg gagcaatata
2101 aggaagaaag caaaaaggct cttccaccag aaaagaaaca gaacactggc tcaaagaaaa
2161 gcaataaaaa taagagtggc aagaaccagt ttaacagagg tggtggccat agaggacgtg
2221 gaggattcaa tatgcgtggt ggaaatttca gaggaggagc ccctgggaat cgtggcggat
2281 ataataggag gggcaacatg ccacagagag gtggtggcgg tggaggaagt ggtggaatcg
2341 gctatccata ccctcgtgcc cctgtttttc ctggccgtgg tagttactca aacagaggga
2401 actacaacag aggtggaatg cccaacagag ggaactacaa ccagaacttc agaggacgag
2461 gaaacaatcg tggctacaaa aatcaatctc agggctacaa ccagtggcag cagggtcaat
2521 tctggggtca gaagccatgg agtcagcatt atcaccaagg atattattga atacccaaat
2581 aaaacgaact gatacatatt tctccaaaac cttcacaaga agtcgactgt tttctttagt
2641 aggctaactt tttaaacatt ccacaagagg aagtgcctgc gggttccttt tttagaagct
2701 ttgtgggttg attttttttc ttttcttttt tgtacatttt taattgcagt ttaaaagtga
2761 atcgtaagag aacctcagca ttgtgcacga taagagaatg tgtcagtatt tcagggttct
2821 acattttatc tgtaaaatgt gacttttttt tttttttatc acaacagaag taaaatgttg
2881 ctttgtacct ggtgtctttt attaagaatt tactcccccc atttctcaca gagaataaca
2941 gtcgggagtc attgtcacaa tataatagaa atgttagcaa ccagattcat gtaaggacta
3001 agtggtcctc atgaattgca ttaagactct gtactgctca tattacactc catcctctct
3061 gtagtttgct gggtagtgga gggggtaagc taaatcatag tttctgacaa taactgggaa
3121 ggttttttct taaaataaca atggaattgg tataattggg attgaaaact aaaacttgga
3181 actaagatag agaagatgga gtgtatgtag aagggctgtt aaaaatgtaa aacttggttg
3241 cattatttgt ggaggctcaa acttgtgaag gttaatacca taatttttcc atttgttctg
3301 cattttgatt ctgaaaagaa agctggcttt gcccatttct tattaaaaaa acttgttgta
3361 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//