LOCUS BC064135 1842 bp mRNA linear HUM 03-JAN-2005
DEFINITION Homo sapiens PC4 and SFRS1 interacting protein 1, mRNA (cDNA clone
MGC:74712 IMAGE:6504372), complete cds.
ACCESSION BC064135
VERSION BC064135.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1842)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1842)
AUTHORS Director MGC Project.
TITLE Direct Submission
JOURNAL Submitted (11-DEC-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Steve Jones, Sarah Barber, Mabel Brown-John, Yaron Butterfield,
Andy Chan, Steve S. Chand, William Chow, Alison Cloutier, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Amara Masson, Mike R. Mayo, Josh Moran, Ryan Morin,
Teika Olson, Diana Palmquist, Anca Petrescu, Anna Liisa Prahbu,
Parvaneh Saeedi, JR Santos, Angelique Schnerch, Ursula Skalska,
Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie
Schein, Asim Siddiqui, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 141 Row: k Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 19923652.
FEATURES Location/Qualifiers
source 1..1842
/db_xref="H-InvDB:HIT000261277"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:74712 IMAGE:6504372"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1842
/gene="PSIP1"
/gene_synonym="DFS70"
/gene_synonym="LEDGF"
/gene_synonym="p52"
/gene_synonym="p75"
/gene_synonym="PAIP"
/gene_synonym="PSIP2"
/db_xref="GeneID:11168"
/db_xref="MIM:603620"
CDS 289..1263
/gene="PSIP1"
/gene_synonym="DFS70"
/gene_synonym="LEDGF"
/gene_synonym="p52"
/gene_synonym="p75"
/gene_synonym="PAIP"
/gene_synonym="PSIP2"
/codon_start=1
/product="PSIP1 protein"
/protein_id="AAH64135.1"
/db_xref="GeneID:11168"
/db_xref="MIM:603620"
/translation="MTRDFKPGDLIFAKMKGYPHWPARVDEVPDGAVKPPTNKLPIFF
FGTHETAFLGPKDIFPYSENKEKYGKPNKRKGFNEGLWEIDNNPKVKFSSQQAATKQS
NASSDVEVEEKETSVSKEDTDHEEKASNEDVTKAVDITTPKAARRGRKRKAGVVTTAT
ASVNLKVSPKRGRPAATEVKIPKPRGRPKMVKQPCPSESDIITEEDKSKKKGQEEKQP
KKQPKKDEEGQKEEDKPRKEPDKKEGKKEVESKRKNLAKTGVTSTSDSEEEGDDQEGE
KKRKGGRNFQTAHRRNMLKGQHEKEAADRKRKQEEQMETEHQTTCNLQ"
BASE COUNT 646 a 374 c 424 g 398 t
ORIGIN
1 gcgcagctac ggcggcggca gcggcggcgc ggttgcgatc ccgagccgtt gagacgcctc
61 tgcggcagct ggtggcgcag gtggcttgcg tggacgcggg tagaggcgac cggccagcaa
121 ccgcagcgtc ggcgcccgcg gccccggcag gcgcgtcggg acgccccgag gcatcctccc
181 ccgcccgcgg gcccggtagc tgggcccgcg tccgccgccc gcatccccgc gccgccgcat
241 ctcctcgccg cctcccgggc ttcggacccc cggtctcgcc cccgaaacat gactcgcgat
301 ttcaaacctg gagacctcat cttcgccaag atgaaaggtt atccccattg gccagctcga
361 gtagacgaag ttcctgatgg agctgtaaag ccacccacaa acaaactacc cattttcttt
421 tttggaactc atgagactgc ttttttagga ccaaaggata tatttcctta ctcagaaaat
481 aaggaaaagt atggcaaacc aaataaaaga aaaggtttta atgaaggttt atgggagata
541 gataacaatc caaaagtgaa attttcaagt caacaggcag caactaaaca atcaaatgca
601 tcatctgatg ttgaagttga agaaaaggaa actagtgttt caaaggaaga taccgaccat
661 gaagaaaaag ccagcaatga ggatgtgact aaagcagttg acataactac tccaaaagct
721 gccagaaggg ggagaaagag aaaggcagga gtagtgacaa cagcaacagc atctgttaat
781 ctaaaagtga gtcctaaaag aggacgacct gcagctacag aagtcaagat tccaaaacca
841 agaggcagac ccaaaatggt aaaacagccc tgtccttcag agagtgacat cattactgaa
901 gaggacaaaa gtaagaaaaa ggggcaagag gaaaaacaac ctaaaaagca gcctaagaag
961 gatgaagagg gccagaagga agaagataag ccaagaaaag agccggataa aaaagagggg
1021 aagaaagaag ttgaatcaaa aaggaaaaat ttagctaaaa caggggttac ttcaacctcc
1081 gattctgaag aagaaggaga tgatcaagaa ggtgaaaaga agagaaaagg tgggaggaac
1141 tttcagactg ctcacagaag gaatatgctg aaaggccaac atgagaaaga agcagcagat
1201 cgaaaacgca agcaagagga acaaatggaa actgagcacc aaacaacatg taatctacag
1261 taataaaaaa tatatctcat tttgggctca aagcattaat ccagttactg aaaagagaat
1321 acaagtggag caaacaagag atgaagatct tgatacagac tcattggact gaatttcccc
1381 cttcccccca ttgatggaag aatgttccag attctaaatt gaggacttca ttattaatgg
1441 cattactgtg ttatgattaa caaatttcct gtaaggtaca cactacatac taaggtcggc
1501 catcattcct gttttttttt tttttttaac caagcttaaa atgaagcttt gtgtttgaaa
1561 gtaataacaa gctcagacga agatggtggt tgtacattat tcatctagaa aatataaaaa
1621 ttcattttgt tttgaagcta gttattaaac tggaatagca gttatatccc tgagaatggg
1681 gcccttctct tgacattcct ttgttgttta attctttaga atcttaataa atgttttttt
1741 aatcctgaga gattaaacag tagtagactt gttaagaatg aaactgtaac caaaatttta
1801 aaataaagtt ttttttaaaa aaaaaaaaaa aaaaaaaaaa aa
//