LOCUS       BC003367                3396 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens heterogeneous nuclear ribonucleoprotein U (scaffold
            attachment factor A), mRNA (cDNA clone MGC:5216 IMAGE:2900876),
            complete cds.
ACCESSION   BC003367
VERSION     BC003367.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3396)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3396)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (20-FEB-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 3 Row: m Column: 15
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 14141160.
FEATURES             Location/Qualifiers
     source          1..3396
                     /db_xref="H-InvDB:HIT000031521"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:5216 IMAGE:2900876"
                     /tissue_type="Placenta, choriocarcinoma"
                     /clone_lib="NIH_MGC_10"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3396
                     /gene="HNRPU"
                     /gene_synonym="HNRNPU"
                     /gene_synonym="SAF-A"
                     /gene_synonym="U21.1"
                     /db_xref="GeneID:3192"
                     /db_xref="HGNC:HGNC:5048"
                     /db_xref="MIM:602869"
     CDS             150..2570
                     /gene="HNRPU"
                     /gene_synonym="HNRNPU"
                     /gene_synonym="SAF-A"
                     /gene_synonym="U21.1"
                     /codon_start=1
                     /product="heterogeneous nuclear ribonucleoprotein U
                     (scaffold attachment factor A)"
                     /protein_id="AAH03367.1"
                     /db_xref="GeneID:3192"
                     /db_xref="HGNC:HGNC:5048"
                     /db_xref="MIM:602869"
                     /translation="MSSSPVNVKKLKVSELKEELKKRRLSDKGLKAELMERLQAALDD
                     EEAGGRPAMEPGNGSLDLGGDSAGRSGAGLEQEAAAGGDEEEEEEEEEEEGISALDGD
                     QMELGEENGAAGAADSGPMEEEEAASEDENGDDQGFQEGEDELGDEEEGAGDENGHGE
                     QQPQPPATQQQQPQQQRGAAKEAAGKSSGPTSLFAVTVAPPGARQGQQQAGGDGKTEQ
                     KGGDKKRGVKRPREDHGRGYFEYIEENKYSRAKSPQPPVEEEDEHFDDTVVCLDTYNC
                     DLHFKISRDRLSASSLTMESFAFLWAGGRASYGVSKGKVCFEMKVTEKIPVRHLYTKD
                     IDIHEVRIGWSLTTSGMLLGEEEFSYGYSLKGIKTCNCETEDYGEKFDENDVITCFAN
                     FESDEVELSYAKNGQDLGVAFKISKEVLAGRPLFPHVLCHNCAVEFNFGQKEKPYFPI
                     PEEYTFIQNVPLEDRVRGPKGPEEKKDCEVVMMIGLPGAGKTTWVTKHAAENPGKYNI
                     LGTNTIMDKMMVAGFKKQMADTGKLNTLLQRAPQCLGKFIEIAARKKRNFILDQTNVS
                     AAAQRRKMCLFAGFQRKAVVVCPKDEDYKQRTQKKAEVEGKDLPEHAVLKMKGNFTLP
                     EVAECFDEITYVELQKEEAQKLLEQYKEESKKALPPEKKQNTGSKKSNKNKSGKNQFN
                     RGGGHRGRGGFNMRGGNFRGGAPGNRGGYNRRGNMPQRGGGGGGSGGIGYPYPRAPVF
                     PGRGSYSNRGNYNRGGMPNRGNYNQNFRGRGNNRGYKNQSQGYNQWQQGQFWGQKPWS
                     QHYHQGYY"
BASE COUNT     1045 a    654 c    928 g    769 t
ORIGIN
        1 ggcggatccg ctctgcagca cgaacccatc tccagccgca gccgcagccg ccgcccgggc
       61 cgaggagcag ccgcagcagc cgccaccagt ggccgagtga gcggagccga gtttgaggca
      121 gcgcctagcg gtgaatcggg gccctcacca tgagttcctc gcctgttaat gtaaaaaagc
      181 tgaaggtgtc ggagctgaaa gaggagctca agaagcgacg cctttctgac aagggtctca
      241 aggccgagct catggagcga ctccaggctg cgctggacga cgaggaggcc gggggccgcc
      301 ccgccatgga gcccgggaac ggcagcctag acctgggcgg ggattccgct gggcgctcgg
      361 gagcaggcct cgagcaggag gccgcggccg gcggcgatga agaggaggag gaagaggaag
      421 aggaggagga aggaatctcc gctctggacg gcgaccagat ggagctagga gaggagaacg
      481 gggccgcggg ggcggccgac tcgggcccga tggaggagga ggaggccgcc tcggaagacg
      541 agaacggcga cgatcagggt ttccaggaag gggaagatga gctcggggac gaagaggaag
      601 gcgcgggcga cgagaacggg cacggggagc agcagcctca accgccggcg acgcagcagc
      661 aacagcccca acagcagcgc ggggccgcca aggaggccgc ggggaagagc agcggcccca
      721 cctcgctgtt cgcggtgacg gtggcgccgc ccggggcgag gcagggccag cagcaggcgg
      781 gaggggacgg caaaacagaa cagaaaggcg gagataaaaa gaggggtgtt aaaagaccac
      841 gagaagatca tggccgtgga tattttgagt acattgaaga gaacaagtat agcagagcca
      901 aatctcctca gccacctgtt gaagaagaag atgaacactt cgatgacaca gtggtttgtc
      961 ttgatactta taattgtgat ctacatttta aaatatcaag agatcgtctc agtgcttctt
     1021 cccttacaat ggagagtttt gcttttcttt gggctggagg aagagcatcc tatggtgtgt
     1081 caaaaggcaa agtgtgtttt gagatgaagg ttacagagaa gatcccagta aggcatttat
     1141 atacaaaaga tattgacata catgaagttc gtattggctg gtcactaact acaagtggaa
     1201 tgttacttgg tgaagaagaa ttttcttatg ggtattctct aaaaggaata aaaacatgca
     1261 actgtgagac tgaagattat ggagaaaagt ttgatgaaaa tgatgtgatt acatgttttg
     1321 ctaactttga aagtgatgaa gtagaactct cgtatgctaa gaatggacaa gatcttggcg
     1381 ttgccttcaa aatcagtaag gaagttcttg ctggacggcc actgttcccg catgttctct
     1441 gccacaactg tgcagttgaa tttaattttg gtcagaagga aaagccatat tttccaatac
     1501 ctgaagagta tactttcatc cagaacgtcc ccttagagga tcgagttaga ggaccaaagg
     1561 ggcctgaaga gaagaaagat tgtgaagttg tgatgatgat tggcttgcca ggagctggaa
     1621 aaactacctg ggttactaaa catgcagcag aaaatccagg gaaatataac attcttggca
     1681 caaatactat tatggataag atgatggtgg caggttttaa gaagcaaatg gcagatactg
     1741 gaaaactgaa cacactgttg cagagagccc cccagtgtct tgggaaattt attgagattg
     1801 ctgcccgaaa gaagcgaaat tttattctgg atcagacaaa tgtgtctgct gctgcccaga
     1861 ggagaaaaat gtgcctgttt gcaggcttcc agcgaaaagc tgttgtagtt tgcccaaaag
     1921 atgaagacta taagcaaaga acacagaaga aagcagaagt agaggggaaa gacctaccag
     1981 aacatgcggt cctcaaaatg aaaggaaact ttaccctccc agaggtagct gagtgctttg
     2041 atgaaataac ctatgttgaa cttcagaagg aagaagccca aaaactcttg gagcaatata
     2101 aggaagaaag caaaaaggct cttccaccag aaaagaaaca gaacactggc tcaaagaaaa
     2161 gcaataaaaa taagagtggc aagaaccagt ttaacagagg tggtggccat agaggacgtg
     2221 gaggattcaa tatgcgtggt ggaaatttca gaggaggagc ccctgggaat cgtggcggat
     2281 ataataggag gggcaacatg ccacagagag gtggtggcgg tggaggaagt ggtggaatcg
     2341 gctatccata ccctcgtgcc cctgtttttc ctggccgtgg tagttactca aacagaggga
     2401 actacaacag aggtggaatg cccaacagag ggaactacaa ccagaacttc agaggacgag
     2461 gaaacaatcg tggctacaaa aatcaatctc agggctacaa ccagtggcag cagggtcaat
     2521 tctggggtca gaagccatgg agtcagcatt atcaccaagg atattattga atacccaaat
     2581 aaaacgaact gatacatatt tctccaaaac cttcacaaga agtcgactgt tttctttagt
     2641 aggctaactt tttaaacatt ccacaagagg aagtgcctgc gggttccttt tttagaagct
     2701 ttgtgggttg attttttttc ttttcttttt tgtacatttt taattgcagt ttaaaagtga
     2761 atcgtaagag aacctcagca ttgtgcacga taagagaatg tgtcagtatt tcagggttct
     2821 acattttatc tgtaaaatgt gacttttttt tttttttatc acaacagaag taaaatgttg
     2881 ctttgtacct ggtgtctttt attaagaatt tactcccccc atttctcaca gagaataaca
     2941 gtcgggagtc attgtcacaa tataatagaa atgttagcaa ccagattcat gtaaggacta
     3001 agtggtcctc atgaattgca ttaagactct gtactgctca tattacactc catcctctct
     3061 gtagtttgct gggtagtgga gggggtaagc taaatcatag tttctgacaa taactgggaa
     3121 ggttttttct taaaataaca atggaattgg tataattggg attgaaaact aaaacttgga
     3181 actaagatag agaagatgga gtgtatgtag aagggctgtt aaaaatgtaa aacttggttg
     3241 cattatttgt ggaggctcaa acttgtgaag gttaatacca taatttttcc atttgttctg
     3301 cattttgatt ctgaaaagaa agctggcttt gcccatttct tattaaaaaa acttgttgta
     3361 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa
//