LOCUS BC004534 2270 bp mRNA linear HUM 28-JUL-2005 DEFINITION Homo sapiens splicing factor proline/glutamine-rich (polypyrimidine tract binding protein associated), mRNA (cDNA clone IMAGE:3139130), partial cds. ACCESSION BC004534 VERSION BC004534.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2270) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2270) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (14-MAR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Oct 28, 2003 this sequence version replaced BC004534.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 4 Row: n Column: 21 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis. FEATURES Location/Qualifiers source 1..2270 /db_xref="H-InvDB:HIT000086391" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:3139130" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..2270 /gene="SFPQ" /gene_synonym="POMP100" /gene_synonym="PSF" /db_xref="GeneID:6421" /db_xref="MIM:605199" CDS <1..1905 /gene="SFPQ" /gene_synonym="POMP100" /gene_synonym="PSF" /codon_start=1 /product="SFPQ protein" /protein_id="AAH04534.2" /db_xref="GeneID:6421" /db_xref="MIM:605199" /translation="PQQPPPQQPPPHQPPPHPQPHQQQQPPPPPQDSSKPVVAQGPGP APGVGSAPPASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTPPSSGVP TTPPQAGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPGGHPKP PHRGGGEPRGGRQHHPPYHQQHHQGPPPGGPGGRSEEKISDSEGFKANLSLLRRPGEK TYTQRCRLFVGNLPADITEDEFKRLFAKYGEPGEVFINKGKGFGFIKLESRALAEIAK AELDDTPMRGRQLRVRFATHAAALSVRNLSPYVSNELLEEAFSQFGPIERAVVIVDDR GRSTGKGIVEFASKPAARKAFERCSEGVFLLTTTPRPVIVEPLEQLDDEDGLPEKLAQ KNPMYQKERETPPRFAQHGTFEYEYSQRWKSLDEMEKQQREQVEKNMKDAKDKLESEM EDAYHEHQANLLRQDLMRRQEELRRMEELHNQEMQKRKEMQLRQEEERRRREEEMMIR QREMEEQMRRQREESYSRMGYMDPRERDMRMGGGGAMNMGDPYGSGGQKFPPLGGGGG IGYEANPGVPPATMSGSMMGSDMRTERFGQGGAGPVGGQGPRGMGPGTPAGYGRGREE YEGPNKKPRF" BASE COUNT 615 a 551 c 605 g 499 t ORIGIN 1 ccgcagcagc caccgccgca gcagccgcca ccgcatcagc cgccgccgca tccacagccg 61 catcagcagc agcagccgcc gccaccgccg caggactctt ccaagcccgt cgttgctcag 121 ggacccggcc ccgctcccgg agtaggcagc gcaccaccag cctccagctc ggccccgccc 181 gccactccac caacctcggg ggccccgcca gggtccgggc caggcccgac tccgaccccg 241 ccgcctgcag tcacctcggc ccctcccggg gcgccgccac ccaccccgcc aagcagcggg 301 gtccctacca cacctcctca ggccggaggc ccgccgcctc cgcccgcggc agtcccgggc 361 ccgggtccag ggcctaagca gggcccaggt ccgggtggtc ccaaaggcgg caaaatgcct 421 ggcgggccga agccaggtgg cggcccgggc ctaagtacgc ctggcggcca ccccaagccg 481 ccgcatcgag gcggcgggga gccccgcggg ggccgccagc accacccgcc ctaccaccag 541 cagcatcacc aggggccccc gcccggcggg cccggcggcc gcagcgagga gaagatctcg 601 gactcggagg ggtttaaagc caatttgtct ctcttgagga ggcctggaga gaaaacttac 661 acacagcgat gtcggttgtt tgttgggaat ctacctgctg atatcacgga ggatgaattc 721 aaaagactat ttgctaaata tggagaacca ggagaagttt ttatcaacaa aggcaaagga 781 ttcggattta ttaagcttga atctagagct ttggctgaaa ttgccaaagc cgaactggat 841 gatacaccca tgagaggtag acagcttcga gttcgctttg ccacacatgc tgctgccctt 901 tctgttcgta atctttcacc ttatgtttcc aatgaactgt tggaagaagc ctttagccaa 961 tttggtccta ttgaaagggc tgttgtaata gtggatgatc gtggaagatc tacagggaaa 1021 ggcattgttg aatttgcttc taagccagca gcaagaaagg catttgaacg atgcagtgaa 1081 ggtgttttct tactgacgac aactcctcgt ccagtcattg tggaaccact tgaacaacta 1141 gatgatgaag atggtcttcc tgaaaaactt gcccagaaga atccaatgta tcaaaaggag 1201 agagaaaccc ctcctcgttt tgcccagcat ggcacgtttg agtacgaata ttctcagcga 1261 tggaagtctt tggatgaaat ggaaaaacag caaagggaac aagttgaaaa aaacatgaaa 1321 gatgcaaaag acaaattgga aagtgaaatg gaagatgcct atcatgaaca tcaggcaaat 1381 cttttgcgcc aagatctgat gagacgacag gaagaattaa gacgcatgga agaacttcac 1441 aatcaagaaa tgcagaaacg taaagaaatg caattgaggc aagaggagga acgacgtaga 1501 agagaggaag agatgatgat tcgtcaacgt gagatggaag aacaaatgag gcgccaaaga 1561 gaggaaagtt acagccgaat gggctacatg gatccacggg aaagagacat gcgaatgggt 1621 ggcggaggag caatgaacat gggagatccc tatggttcag gaggccagaa atttccacct 1681 ctaggaggtg gtggtggcat aggttatgaa gctaatcctg gcgttccacc agcaaccatg 1741 agtggttcca tgatgggaag tgacatgcgt actgagcgct ttgggcaggg aggtgcgggg 1801 cctgtgggtg gacagggtcc tagaggaatg gggcctggaa ctccagcagg atatggtaga 1861 gggagagaag agtacgaagg cccaaacaaa aaaccccgat tttagatgtg atatttaggc 1921 tttcattcca gtttgttttg tttttttgtt tagataccaa tcttttaaat tcttgcattt 1981 tagtaagaaa gctatctttt tatggatgtt agcagtttat tgacctaata tttgtaaatg 2041 gtctgtttgg gcaggtaaaa ttatgtaatg cagtgtttgg aacaggagaa tttttttttc 2101 ctttttattt ctttattttt tcttttttac tgtataatgt ccctcaagtt tatggcagtg 2161 taccttgtgc cactgaattt ccaaagtgta ccaatttttt tttttttact gtgcttcaaa 2221 taaatagaaa aatagttata atattgatct tcaaaaaaaa aaaaaaaaaa //