LOCUS BC044568 2006 bp mRNA linear HUM 21-JUL-2005 DEFINITION Homo sapiens PC4 and SFRS1 interacting protein 1, mRNA (cDNA clone MGC:57122 IMAGE:4838839), complete cds. ACCESSION BC044568 VERSION BC044568.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2006) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2006) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (17-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 106 Row: g Column: 15 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 16945969. FEATURES Location/Qualifiers source 1..2006 /db_xref="H-InvDB:HIT000097859" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:57122 IMAGE:4838839" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2006 /gene="PSIP1" /gene_synonym="DFS70" /gene_synonym="LEDGF" /gene_synonym="p52" /gene_synonym="p75" /gene_synonym="PAIP" /gene_synonym="PSIP2" /db_xref="GeneID:11168" /db_xref="MIM:603620" CDS 316..1305 /gene="PSIP1" /gene_synonym="DFS70" /gene_synonym="LEDGF" /gene_synonym="p52" /gene_synonym="p75" /gene_synonym="PAIP" /gene_synonym="PSIP2" /codon_start=1 /product="PSIP1 protein" /protein_id="AAH44568.2" /db_xref="GeneID:11168" /db_xref="MIM:603620" /translation="MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFF FGTHETAFLGPKDIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQS NASSDVEVEEKETSVSKEDTDHEEKASNEDVTKAVDITTPKAARRGRKRKAEKQVETE EAGVVTTATASVNLKVSPKRGRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSK KKGQEEKQPKKQPKKDEEGQKEEDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSE EEGDDQEGEKKRKGGRNFQTAHRRNMLKGQHEKEAADRKRKQEEQMETEHFAL" BASE COUNT 709 a 395 c 463 g 439 t ORIGIN 1 gggccgagcg ggagccgcgc gggagcagcg cagctacggc ggcggcagcg gcggcgcggt 61 tgcgattccg agccgttgag acgcctctgc ggcagctggt ggcgcaggtg gcttgcgtgg 121 acgcgggtag aggcgaccgg ccagcaaccg cagcgtcggc gcccgcggcc ccggcaggcg 181 cgtcgggacg ccccgaggca tcctcccccg cccgcgggcc cggtagctgg gcccgcgtcc 241 gccgcccgca tccccgcgcc gccgcatctc ctcgccgcct cccgggcttc ggacccccgg 301 tctcgccccc gaaacatgac tcgcgatttc aaacctggag acctcatctt cgccaagatg 361 aaaggttatc cccattggcc agctcgagta gacgaagttc ctgatggagc tgtaaagcca 421 cccacaaaca aactacccat tttctttttt ggaactcatg agactgcttt tttaggacca 481 aaggatatat ttccttactc agaaaataag gaaaagtatg gcaaaccaaa taaaagaaaa 541 ggttttaatg aaggtttatg ggagatagat aacaatccaa aagtgaaatt ttcaagtcaa 601 caggcagcaa ctaaacaatc aaatgcatca tctgatgttg aagttgaaga aaaggaaact 661 agtgtttcaa aggaagatac cgaccatgaa gaaaaagcca gcaatgagga tgtgactaaa 721 gcagttgaca taactactcc aaaagctgcc agaaggggga gaaagagaaa ggcagaaaaa 781 caagtagaaa ctgaggaggc aggagtagtg acaacagcaa cagcatctgt taatctaaaa 841 gtgagtccta aaagaggacg acctgcagct acagaagtca agattccaaa accaagaggc 901 agacccaaaa tggtaaaaca gccctgtcct tcagagagtg acatcattac tgaagaggac 961 aaaagtaaga aaaaggggca agaggaaaaa caacctaaaa agcagcctaa gaaggatgaa 1021 gagggccaga aggaagaaga taagccaaga aaagagccgg ataaaaaaga ggggaagaaa 1081 gaagttgaat caaaaaggaa aaatttagct aaaacagggg ttacttcaac ctccgattct 1141 gaagaagaag gagatgatca agaaggtgaa aagaagagaa aaggtgggag gaactttcag 1201 actgctcaca gaaggaatat gctgaaaggc caacatgaga aagaagcagc agatcgaaaa 1261 cgcaagcaag aggaacaaat ggaaactgag cattttgctc tgtaaagcat ttcagatgca 1321 gcttgtgtga aatgcattat gcaaaataaa gtttattgta ttgtattccc agccaaacaa 1381 catgtaatct acagtaataa aaaatatatc tcattttggg ctcaaagcat taatccagtt 1441 actgaaaaga gaatacaagt ggagcaaaca agagatgaag atcttgatac agactcattg 1501 gactgaattt cccccttccc cccattgatg gaagaatgtt ccagattcta aattgaggac 1561 ttcattatta atggcattac tgtgttatga ttaacaaatt tcctgtaagg tacacactac 1621 atactaaggt cggccatcat tcctgttttt tttttttttt ttttttttaa ccaagcttaa 1681 aatgaagctt tgtgtttgaa agtaataaca agctcagacg aagatggtgg ttgtacatta 1741 ttcatctaga aaatataaaa attcattttg ttttgaagct agttattaaa ctggaatagc 1801 agttatatcc ctgagaatgg ggcccttctc ttgacattcc tttgttgttt aattctttag 1861 aatcttaata aatgtttttt taatcctgag agattaaaca gtagtagact tgttaagaat 1921 gaaactgtaa ccaaaatttt aaaataaagt tttttttaaa aaaaaaaaaa aaaaaaaaaa 1981 aaaaaaaaaa gaaaaaaaaa aaaaaa //