LOCUS BC038860 4061 bp mRNA linear HUM 28-JUL-2005
DEFINITION Homo sapiens ProSAPiP1 protein, mRNA (cDNA clone MGC:33747
IMAGE:5260571), complete cds.
ACCESSION BC038860
VERSION BC038860.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4061)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4061)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (25-OCT-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 47 Row: f Column: 18
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 35493938.
FEATURES Location/Qualifiers
source 1..4061
/db_xref="H-InvDB:HIT000052085"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:33747 IMAGE:5260571"
/tissue_type="Brain, hippocampus"
/clone_lib="NIH_MGC_95"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..4061
/gene="ProSAPiP1"
/gene_synonym="KIAA0552"
/db_xref="GeneID:9762"
CDS 337..2220
/gene="ProSAPiP1"
/gene_synonym="KIAA0552"
/codon_start=1
/product="ProSAPiP1 protein"
/protein_id="AAH38860.1"
/db_xref="GeneID:9762"
/translation="MAKLETLPVRADPGRDPLLAFAPRPSELGPPDPRLAMGSVGSGV
AHAQEFAMKSVGTRTGGGGSQGSFPGPRGSGSGASRERPGRYPSEDKGLANSLYLNGE
LRGSDHTDVCGNVVGSSGGSSSSGGSDKAPPQYREPSHPPKLLATSGKLDQCSEPLVR
PSAFKPVVPKNFHSMQNLCPPQTNGTPEGRQGPGGLKGGLDKSRTMTPAGGSGSGLSD
SGRNSLTSLPTYSSSYSQHLAPLSASTSHINRIGTASYGSGSGGSSGGGSGYQDLGTS
DSGRASSKSGSSSSMGRPGHLGSGEGGGGGLPFAACSPPSPSALIQELEERLWEKEQE
VAALRRSLEQSEAAVAQQDKKQLQEEAARLMRQREELEDKVAACQKEQADFLPRIEET
KWEVCQKAGEISLLKQQLKDSQADVSQKLSEIVGLRSQLREGRASLREKEEQLLSLRD
SFSSKQASLELGEGELPAACLKPALTPVDPAEPQDALATCESDEAKMRRQAGVAAAAS
LVYVDGEAEAGGESGTRALRREVGRLQAELAAERRARERQGASFAEERRVWLEEKEKV
IEYQKQLQLSYVEMYQRNQQLERRLRERGAAGGASTPTPQHGEEKKAWTPSRLERIES
TEI"
BASE COUNT 802 a 1255 c 1274 g 730 t
ORIGIN
1 cggcgcgccc cgccgggcgt gcttcgggct gcgccagcaa gcggggccgg tggcgcccgt
61 gtcggagacc ccgcgccgga ccctgaggca gcgaaggaga aaactgcagt ccgggatggc
121 tcagtcggcc cctccaaagt cgcgcagctg gttcgggccg agccccgact gcgagagtga
181 ggcacatggc ccctgcagac cgggcctcgg agggtcccag gcttgaggac ccgtcggccc
241 ctcaacccct tggaaaggca aggagagaag cctgcgctgg tagtgagctt ccacttctgc
301 cctgactgca acacttagtg cccccctggc ttagtcatgg cgaagctgga gacgctgcct
361 gtgcgcgctg acccagggcg ggaccctctc ctggcctttg ccccacggcc ctccgagctt
421 ggacccccgg acccccgcct ggccatgggc agcgtgggca gtggggtggc ccatgcccag
481 gagtttgcca tgaagagcgt gggtacccgc acagggggtg ggggcagcca gggcagtttc
541 cctggccccc gaggcagtgg cagtggggcc agcagggaga ggccgggccg ctacccctca
601 gaggacaagg gtctcgccaa ctccctctac ctcaatggtg agctgcgggg cagtgaccac
661 accgatgtct gtggcaatgt ggttggcagc agcggaggca gcagcagcag tggtggcagt
721 gacaaagccc caccgcagta tcgtgagccc agccacccac ccaagctcct ggccacctct
781 ggcaagctag accagtgctc agaaccacta gttcggccgt cggccttcaa gcctgtcgta
841 cccaagaatt tccactccat gcagaatttg tgccccccgc agaccaatgg gactcctgag
901 ggacggcagg gccctggtgg cctcaaaggc ggactggaca agtctcggac catgactcca
961 gcgggtggga gtgggagtgg cctctcagac tcaggccgga actccctcac aagcctgccc
1021 acctacagct ccagctacag ccagcacctg gcacccctca gtgcctccac cagccacatt
1081 aaccgcattg gcactgccag ctatggtagt ggtagtggcg gcagcagcgg tggggggtcg
1141 ggctaccagg acctggggac ctccgatagt ggacgggcct ccagcaagag tgggtcgtcg
1201 tcatctatgg ggcggccagg ccacctgggc tctggggagg gcggaggtgg aggcctgcct
1261 ttcgcggcct gctcaccgcc ctcccccagt gcactcatcc aggagctgga ggagcggctg
1321 tgggagaagg agcaggaggt ggcagctctg cggcgcagcc tggagcagag cgaggcggct
1381 gtggcccagc aggacaagaa gcagctgcag gaggaggcgg cccggctgat gcggcagcgg
1441 gaagagctgg aggacaaggt ggccgcctgc cagaaggagc aggccgactt cctgccccgg
1501 atagaggaaa ctaagtggga ggtgtgccag aaggctggcg agatctccct cctgaagcag
1561 cagctgaagg actcgcaggc ggatgtgtcg cagaagttga gtgagatcgt gggactgcgc
1621 tcgcagctgc gggagggccg ggcttcgctg cgggagaagg aggagcagct gctcagcctg
1681 cgggactcct tcagcagcaa gcaggccagc ctggagctgg gcgaaggcga gctgcctgcc
1741 gcctgcctca agccggcgct gacccccgtg gacccggccg agccacagga tgctctggcc
1801 acctgcgaga gcgacgaggc taagatgcgc cgtcaggccg gggtggccgc tgccgcctcc
1861 ttggtttacg tggacgggga ggcggaggct ggcggggaga gcgggacgcg ggccctgcgg
1921 cgggaggtgg ggcggctgca ggccgagctg gcggctgagc ggcgggcccg ggagcgccag
1981 ggtgccagct tcgccgagga gcgccgcgtg tggctggagg agaaggagaa ggtgatcgag
2041 taccagaagc agctgcagct gagctacgtg gagatgtacc aacgcaacca gcagctggag
2101 cgcaggctgc gggagcgcgg ggccgcaggg ggtgcaagca cgcccactcc ccagcatggc
2161 gaggagaaga aggcctggac cccctcccgc ctcgagcgca ttgagtccac agaaatctga
2221 tcgacctggg cactcggcat tttgacacat gtcctgtcaa aaggccagag tccccagtgt
2281 cccctcccct ccatctctct tccccataga ccccataacc ccagaccaaa gaggttctct
2341 aagcagctgt gaccaggttc ctccctcccc acctgccctc ctagctccag cactgccccc
2401 gtggcagccc acttggaccc ccctaaaagg agggaatagg aggagggcag ggtgagtggg
2461 ggcaatccta ggtggtgggg gagtcatgct ccctttctcg gcaccccctt gttggagatg
2521 gaggcagcag acgtgcagtg ccataaggtg ccccagtcct tctggaggcc tgggctgcta
2581 ctgttggcca ccctgtgtct agtgatgctc tctgtgctca cctcctaggc catggagcct
2641 gagggggcct gcaccaggtt tgctgaaact gacagagcct gggctccaga cctctctccc
2701 tcctacagtg ctctccctcc ctgggcagat tggcaggaca agtgggagca gatggcctgc
2761 ctttggctga gagggctacc tgcccagccc ctcccccaac aagatctctt ggactcaggc
2821 ctcagagcct ggcctggttg tgagtgtgtg tccctgtgtg tgtgttgcgg gaggggagga
2881 ctggggctgg aagtccagca cccagggaag atctgtcctc ctgttcttgg gaagcgttgc
2941 ctgacggctt ctcggctcta ccctcaccct tctggccagg atcccgcagg gcaacagccc
3001 catctgcttg gctgacccca cacccaggac cactgtccgg ctctaacaca gctattaagt
3061 gctacctgcc tctcaggcac tctcctcgcc cagtttctga ggtcagacga gtgtctgcga
3121 tgtcttcccg cactctattc ccccagcctc tttctgcttt catgctcagc acatcatctt
3181 cctaggcagt ctcttcccca aagtctcacc ttttcttcca atagaaaatt ccgcttgacc
3241 tttggtgcac tgcccacttc ccagctccac tggcccaagt ctgagccgga ggcccttgtt
3301 ttgggggcgg ggggagagtt ggatgtgatt gcccttgaag aacaaggctg acctgagagg
3361 ttcctggcgc cctgaggtgg ctcagcacct gcccagggta ggcctggcat gaggggttag
3421 gtcagccaat gtcagctgct tctcttgggg ccctctcaga gtctatctcc ccaagacagg
3481 aagggaaaag caaatttcta attcaccagc aataaaaatt ggaggaggct tggccctcag
3541 cccttatatc tctctctttt tcactctctt cctcccaccc ccaagactga gttttggggg
3601 gcaaggtgga gagagctggc aactactgtg agcaagtccc ctagcccctg accagcctcc
3661 tcccatgact ggtgactgtt taatgagctg tgcatccccc acaaaaacat gagtgcccct
3721 ctgtgtggcc tctaaccctc tgcacagccc ttctgggtgg tcctcaccag gtctctgagc
3781 tgggtgggag gccatcctgg cgaccactgc ccattccatt cacccctcac tgtacctgcc
3841 ctagaacctg ggcctaggcc acaggggcag ggagaagaga aggcattagt aagaaaaaaa
3901 tagaaaaaaa tatgaacaga ctcagctttg ggacgtccaa ccacaaaaga aattatatat
3961 aaatatatat aaatatatat ctctaccata tgtgatggaa agactttttg ttttcctttc
4021 ccaaagaaat aaaacggaaa aagcccaaaa aaaaaaaaaa a
//