LOCUS BC052581 4891 bp mRNA linear HUM 07-OCT-2003
DEFINITION Homo sapiens sperm specific antigen 2, mRNA (cDNA clone MGC:59972
IMAGE:6084501), complete cds.
ACCESSION BC052581
VERSION BC052581.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4891)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4891)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (16-MAY-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 47 Row: f Column: 9
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis, GenomeScan gene prediction.
FEATURES Location/Qualifiers
source 1..4891
/db_xref="H-InvDB:HIT000053799"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:59972 IMAGE:6084501"
/tissue_type="Pancreas, ductal carcinoma"
/clone_lib="NIH_MGC_110"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..4891
/gene="SSFA2"
/gene_synonym="CS-1"
/gene_synonym="CS1"
/gene_synonym="KIAA1927"
/gene_synonym="KRAP"
/gene_synonym="SPAG13"
/db_xref="GeneID:6744"
/db_xref="MIM:118990"
CDS 121..3900
/gene="SSFA2"
/gene_synonym="CS-1"
/gene_synonym="CS1"
/gene_synonym="KIAA1927"
/gene_synonym="KRAP"
/gene_synonym="SPAG13"
/codon_start=1
/product="SSFA2 protein"
/protein_id="AAH52581.1"
/db_xref="GeneID:6744"
/db_xref="MIM:118990"
/translation="MDRPLSSSAEAEEELEWQVASRRRKAWAKCRSSWQASETEDLST
EATTQDEEEDEEEDLPGAQLPAAGGRGNVPNEKIAIWLKDCRTPLGASLDEQSSSTLK
GVLVRNGGSFEDDLSLGAEANHLHESDAQIENCNNILAKERRLQFHQKGRSMNSTGSG
KSSGTVSSVSELLELYEEDPEEILYNLGFGRDEPDIASKIPSRFFNSSSFAKGIDIKV
FLSAQMQRMEVENPNYALTSRFRQIEVLTTVANAFSSLYSQVSGTPLQRIGSMSSVTS
NKETDPPPPLTRSNTANRLMKTLSKLNLCVDKTEKGESSSPSPSAEKGKILNVSVIEE
SGNKNDQKSQKIMKKKESSSMLATVKEEVSGSSAAVTENADSDRISDEANSNFNQGTE
NEQSKETQSHESKLGEESGIVESKLDSDFNISSHSELENSSELKSVHISTPEKEPCAP
LTIPSIRNIMTQQKDSFEMEEVQSTEGEAPHVPATYQLGLTKSKRDHLLRTASQHSDS
SGFAEDSTDCLSLNHLQVQESLQAMGSSADSCDSETTVTSLGEDLATPTAQDQPYFNE
SEEESLVPLQKGLEKAAAVADKRKSGSQDFPQCNTIENTGTKQSTCSPGDHIIEITEV
EEDLFPAETVELLREASAESDVGKSSESEFTQYTTHHILKSLASIEAKCSDMSSENTT
GPPSSMDRVNTALQRAQMKVCSLSNQRMGRSLLKSKDLLKQRYLFAKAGYPLRRSQSL
PTTLLSPVRVVSSVNVRLSPGKETRCSPPSFTYKYTPEEEQELEKRVMEHDGQSLVKS
TIFISPSSVKKEEAPQSEAPRVEECHHGRTPTCSRLAPPPMSQSTCSLHSIHSEWQER
PLCEHTRTLSTHSVPNISGATCSAFASPFGCPYSHRHATYPYRVCSVNPPSAIEMQLR
RVLHDIRNSLQNLSQYPMMRGPDPAAAPYSTQKSSVLPLYENTFQELQVMRRSLNLFR
TQMMDLELAMLRQQTMVYHHMTEEERFEVDQLQGLRNSVRMELQDLELQLEERLLGLE
EQLRAVRMPSPFRSSALMGMCGSRSADNLSCPSPLNVMEPVTELMQEQSYLKSELGLG
LGEMGFEIPPGESSESVFSQATSESSSVCSGPSHANRRTGVPSTASVGKSKTPLVARK
KVFRASVALTPTAPSRTGSVQTPPDLESSEEVDAAEGAPEVVGPKSEVEEGHGKLPSM
PAAEEMHKNVEQDELQQVIREIKESIVGEIRREIVSGLLAAVSSSKASNSKQDYH"
BASE COUNT 1477 a 980 c 1122 g 1312 t
ORIGIN
1 cccgcttcag ctccccgggg ccccctgccc ggccgggcgc tgacagcaag ggcgggggtc
61 cctgccgccg ccttgtctcg cgcagggtcc ggctggggta gcggagcccc cagtgcggcc
121 atggaccggc ccctgtcgtc gtcggcggag gcggaggagg aactggagtg gcaagtggcg
181 agtcgcagga ggaaggcctg ggccaagtgc cgcagctcct ggcaagcgtc ggagacggag
241 gatctgtcca cagaagcgac gacgcaggac gaggaggagg acgaggagga ggacctcccc
301 ggcgcgcagc tgccggcagc ggggggaaga ggaaacgtgc ccaacgagaa gatcgcgata
361 tggctcaagg actgccgtac acctttggga gcctcactgg atgaacaaag cagtagtaca
421 ctcaagggtg tgcttgtgag aaatggagga agttttgaag atgatttgtc attgggagct
481 gaagccaacc acctccatga aagtgatgct caaattgaaa actgcaataa tatcttggcc
541 aaagagagaa gattacagtt tcatcagaaa gggagaagta tgaattccac tggatctggg
601 aaaagtagtg ggacagtttc aagtgtttca gaattgttgg aactttatga ggaagatcct
661 gaagaaattc tttataatct tggatttgga cgtgatgaac cagatattgc ttctaaaatt
721 ccttccagat tttttaattc atcatccttt gccaaaggga tagatattaa agtatttttg
781 agtgctcaga tgcaacggat ggaagtagaa aacccaaatt atgctttaac aagccgtttt
841 cgtcaaattg aagtgcttac tactgtggcc aatgcgtttt cttctttata ttctcaagtc
901 tccgggacgc ccctgcagag aattggaagt atgtcctcag tgacctctaa caaggagaca
961 gacccacctc cacctttaac tcgaagtaac actgcaaatc gtttaatgaa aacactctca
1021 aaactgaatt tatgtgttga taaaacagag aaaggagaaa gtagtagtcc ttctccatca
1081 gctgaaaaag gaaagattct aaatgtttca gtgattgaag aaagtggcaa taaaaacgat
1141 caaaagtctc aaaaaattat gaagaagaaa gagtcatctt ctatgttggc tacagttaaa
1201 gaagaagtct ctggtagttc agcagctgtt acggagaatg ctgatagtga tagaatttct
1261 gatgaagcaa atagtaattt taaccaagga actgaaaatg aacaaagtaa agaaactcaa
1321 agtcatgaga gtaaactggg tgaggaatct ggtattgtag aatccaaatt agatagtgat
1381 ttcaacatat ccagccacag tgagctggaa aatagcagtg agctgaaaag tgtccatata
1441 tccacacctg aaaaagagcc ttgtgcacca ctgacaatac catccataag aaatataatg
1501 acacagcaga aggactcctt cgaaatggaa gaggttcaaa gtacggaggg agaagctcct
1561 catgttccag ccacttacca gctaggtctt acgaagtcga aaagagatca tctgttacgt
1621 actgcaagtc agcattccga tagcagtggt tttgctgaag attctacaga ctgcctatcc
1681 cttaatcatc ttcaggttca ggagtccttg caggctatgg ggagtagtgc tgatagttgt
1741 gacagtgaga caacagttac gtcacttggt gaagaccttg ccacaccaac agcacaagac
1801 cagccttatt ttaatgaatc agaggaggag tctcttgtcc ctcttcagaa gggactagag
1861 aaggcagcag cagttgcaga caaaagaaaa tcaggtagcc aggatttccc tcagtgcaac
1921 accattgaga atacaggaac taaacagtcc acctgtagtc caggggatca tatcattgaa
1981 attactgaag tggaagagga tttgtttcca gcagagacag tagagctact gagggaagca
2041 agtgctgaaa gtgatgtggg taaaagcagt gaaagtgaat ttactcagta taccacacac
2101 catattctga aatcattggc ttctattgaa gctaaatgca gtgatatgag ctctgaaaat
2161 acaactgggc ctccctcttc catggacaga gttaatacag ctttgcaaag agctcaaatg
2221 aaggtttgca gtctgtctaa tcaaaggatg gggcgtagcc tgctaaaatc aaaagatttg
2281 ttaaaacaaa ggtacttatt tgcaaaagct ggctatcctc taagaaggtc tcagtcttta
2341 ccaaccacct tattgagccc agtaagggtt gtgtcctctg tcaatgttcg attatctcca
2401 ggaaaagaga ccagatgcag cccaccttcc ttcacctata agtacacacc tgaagaggag
2461 caggaattgg aaaagcgggt gatggaacat gatggtcagt ctttagttaa atcgaccatt
2521 ttcatctctc catcatctgt gaagaaagaa gaagcccccc agagtgaggc gccgcgggtg
2581 gaggaatgcc atcatggaag gactcctacc tgttcacggc ttgctccacc accaatgtct
2641 cagtctacct gttcccttca ttccatccac tctgagtggc aagaaaggcc cctgtgtgag
2701 cacacaagaa ctctgagcac tcacagtgtt cccaacatat caggggctac ttgtagtgcc
2761 ttcgcttccc ctttcgggtg tccttactca catagacatg ccacctaccc ttaccgagtg
2821 tgctctgtga atcctccttc agccatagaa atgcagttgc gaagagtatt acatgatatt
2881 agaaactcac tgcagaatct ttcacagtac cctatgatga gaggacctga tcctgctgct
2941 gctccatata gtactcagaa atcatctgtt ctacctcttt atgaaaatac ttttcaggag
3001 ctccaggtaa tgaggcggag cctgaatttg tttagaacac aaatgatgga tttagaattg
3061 gcaatgctgc gtcagcaaac catggtttat catcatatga ctgaggagga gaggtttgaa
3121 gttgatcagc tccagggttt gagaaattca gtccgaatgg aacttcagga cctggaactg
3181 cagctggagg agcgcctgct gggcctggag gagcagcttc gtgctgtgcg catgccttca
3241 cccttccgct cctccgcact catgggaatg tgtggcagta gaagcgctga taacttgtca
3301 tgcccttctc cattgaatgt aatggaacca gtcactgaac tgatgcagga gcagtcatac
3361 ctgaagtctg aattgggcct gggacttgga gaaatgggat ttgaaattcc tcctggagaa
3421 agctcagaat ctgttttttc ccaagcaaca tcagaatcat cttctgtatg ttctggtccc
3481 tctcatgcta acagaagaac tggagtacct tctactgcct cagtgggcaa atccaaaacc
3541 ccattagtgg caaggaagaa agtgttccga gcatcggtgg ctctaacgcc aacagctcct
3601 tctagaacag gctctgtgca gacacctcca gatttggaaa gttctgagga agttgatgca
3661 gctgaaggag ccccagaagt tgtaggacct aaatctgaag tggaagaagg gcatggaaaa
3721 ctcccatcaa tgccagctgc tgaggaaatg cataaaaatg tggagcaaga tgagttgcag
3781 caagtcatac gggagattaa agagtctatt gttggggaaa tcagacggga aattgtaagt
3841 ggacttttgg cagcagtatc ttcaagtaaa gcgtctaatt ctaagcaaga ttatcattaa
3901 acagaaatta taggttggca tggatcctat tagctgtgta atactggaat tatcaatgat
3961 atgcactggt ggaggtgtta tttgtgcttt agaagatact tgctgttgag ctgggctact
4021 gtatacagtg tacaatgtgt atttcttcaa ccatatattt taaaaagacg tacatagaaa
4081 cttaggcact ttgctatttc ttttctaaac tatcaaaaac tctagcagtt tgaaaagcct
4141 aatatttatt tgtatgtcag tatttttcat ttgattccct attagaatta attttaaaac
4201 ttgaagactt ccagacttat ccaacttata aataacatat ttcttcagac taacatctta
4261 aaacactgac ctctatgagg tatttactgt gcaataactg attcattttt ttcagagctt
4321 gaagcatcca atgatttttc cctccactgc tgttaattaa tgtcacttcc aagaagaaaa
4381 actgttctgt tgtaaaaaat ataattgctc ttaattcttg gggaggttac taatagcagt
4441 aggatagaat tttatgaggt tacctacaac tacttaatgt acttacactg taagccttgt
4501 tgctttaccc aagacaaatg taattttatc attgcttatg tagtattttt cttttggaaa
4561 tgtgccttat gttaaacact atgtactttt actttttgca ttgtccagac ttctttatta
4621 gatggagatg tttctttttc tgtcttctag actaaataga gtatcatcca aataatgggg
4681 cctatgactt gaatgaatag aaatgaataa gctggtgttt gttttttcaa aatggaagta
4741 atttagattt gttctcctca tacataaaat gattttagtt cagttttaac cagtgaaaac
4801 tttgttttta tgaaaaaaaa aggaaaatgg tttcccattt ggttttatat gtgttaaata
4861 aatgtgtaaa gtaaaaaaaa aaaaaaaaaa a
//