LOCUS BC025255 2640 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens spermatogenesis associated 20, mRNA (cDNA clone
MGC:39131 IMAGE:5019010), complete cds.
ACCESSION BC025255
VERSION BC025255.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2640)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2640)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (05-MAR-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao,
Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 42 Row: m Column: 9
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31542722.
FEATURES Location/Qualifiers
source 1..2640
/db_xref="H-InvDB:HIT000039864"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:39131 IMAGE:5019010"
/tissue_type="Pancreas, epithelioid carcinoma"
/clone_lib="NIH_MGC_42"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2640
/gene="SPATA20"
/gene_synonym="FLJ21347"
/gene_synonym="FLJ21969"
/gene_synonym="SSP411"
/gene_synonym="Tisp78"
/db_xref="GeneID:64847"
/db_xref="HGNC:HGNC:26125"
CDS 9..2417
/gene="SPATA20"
/gene_synonym="FLJ21347"
/gene_synonym="FLJ21969"
/gene_synonym="SSP411"
/gene_synonym="Tisp78"
/codon_start=1
/product="spermatogenesis associated 20"
/protein_id="AAH25255.1"
/db_xref="GeneID:64847"
/db_xref="HGNC:HGNC:26125"
/translation="MLGARAWLGRVLLLPRAGAGLAASRRCPGVWPRTWPHRSPSRGS
SSRDKDRSATVSSSVPMPAGGKGSHPSSTPQRVPNRLIHEKSPYLLQHAYNPVDWYPW
GQEAFDKARKENKPIFLSVGYSTCHWCHMMEEESFQNEEIGRLLSEDFVSVKVDREER
PDVDKVYMTFVQATSSGGGWPMNVWLTPNLQPFVGGTYFPPEDGLTRVGFRTVLLRIR
EQWKQNKNTLLENSQRVTTALLARSEISVGDRQLPPSAATVNNRCFQQLDEGYDEEYG
GFAEAPKFPTPVILSFLFSYWLSHRLTQDGSRAQQMALHTLKMMANGGIRDHVGQGFH
RYSTDRQWHVPHFEKMLYDQAQLAVAYSQAFQLSGDEFYSDVAKGILQYVARSLSHRS
GGFYSAEDADSPPERGQRPKEGAYYVWTVKEVQQLLPEPVLGATEPLTSGQLLMKHYG
LTEAGNISPSQDPKGELQGQNVLTVRYSLELTAARFGLDVEAVRTLLNSGLEKLFQAR
KHRPKPHLDSKMLAAWNGLMVSGYAVTGAVLGQDRLINYATNGAKFLKRHMFDVASGR
LMRTCYTGPGGTVEHSNPPCWGFLEDYAFVVRGLLDLYEASQESAWLEWALRLQDTQD
KLFWDSQGGGYFCSEAELGAGLPLRLKDDQDGAEPSANSVSAHNLLRLHGFTGHKDWM
DKCVCLLTAFSERMRRVPVALPEMVRALSAQQQTLKQIVICGDRQAKDTKALVQCVHS
VYIPNKVLILADGDPSSFLSRQLPFLSTLRRLEDQATAYVCENQACSVPITDPCELRK
LLHP"
BASE COUNT 550 a 808 c 785 g 497 t
ORIGIN
1 gagcagccat gctgggcgcg cgggcctggt tgggccgcgt ccttctgctg ccccgcgccg
61 gtgcaggcct cgccgcgagc cgcaggtgtc ctggagtctg gcccaggacc tggccccaca
121 ggagtcccag caggggtagc tcctcccggg acaaggaccg aagtgcgacg gtcagtagtt
181 cagtgcccat gcctgctgga gggaaaggaa gccatccttc atctacaccc cagagggtcc
241 ccaaccgcct gatccacgag aagtcaccat acctcctaca acatgcctac aatcctgtgg
301 actggtaccc ctggggacag gaagccttcg acaaggccag gaaggaaaac aagccgattt
361 tcctctcagt cgggtactcc acctgccact ggtgccacat gatggaagag gagtccttcc
421 agaatgagga gattggccgc ctgctcagtg aggactttgt gagtgtgaag gtagaccgtg
481 aggagcggcc tgacgtggac aaggtgtaca tgacgttcgt gcaggccacc agcagcggcg
541 ggggctggcc catgaatgtg tggctgactc ccaacctcca gccctttgtc gggggcacct
601 atttccctcc tgaggatggc ttgacccgag tcggcttccg cacagtgttg ctgagaatac
661 gagaacagtg gaaacagaac aagaacaccc tgctagaaaa tagccagcgt gtcaccactg
721 ccctgctggc ccgatcagag atcagcgtgg gtgaccgcca gctgccgccc tctgccgcca
781 ccgtgaacaa tcgctgcttc cagcagctgg atgagggcta tgatgaggaa tacggtggct
841 tcgctgaggc ccccaagttt cccacgccgg tgatcctgag cttcctgttc tcctactggc
901 tcagccatcg actgactcag gatggctctc gggcccagca gatggccttg cataccctga
961 aaatgatggc taacgggggc atccgggacc atgtggggca gggctttcac cgctactcca
1021 cagaccgcca gtggcacgtc cctcactttg agaagatgct ctatgaccag gcacagctcg
1081 ctgtggccta ttcgcaggcc ttccagctct ctggtgatga attctactct gacgtggcca
1141 aaggcatcct gcagtacgtg gctcggagcc tgagccaccg gtccggaggc ttctatagcg
1201 cagaagatgc agactcgccc ccagagcggg gccagcggcc caaagagggc gcctactatg
1261 tgtggacggt caaagaggtt cagcagctcc tcccggagcc tgtgttgggt gccaccgagc
1321 cgctgacctc aggccagctc ctcatgaagc actacggcct cacagaggct ggtaacatca
1381 gccccagtca ggaccccaag ggggagctgc agggccagaa tgtgctgacc gtccggtact
1441 cgctggagct gactgctgcc cgctttggct tggatgtgga ggccgtgcgg accttgctca
1501 attcagggct ggagaagctc ttccaggccc ggaagcatcg gcccaagccg cacctggaca
1561 gcaagatgct ggctgcctgg aatggcttga tggtgtcagg ctatgctgtg actggggctg
1621 tcctgggcca agacaggctg atcaactatg ccaccaatgg tgccaagttc ctgaagcggc
1681 acatgtttga tgtggccagt ggccgcctga tgcggacctg ctacaccggc cctgggggga
1741 ctgtggagca cagcaaccca ccctgctggg gcttcctgga ggactacgcc ttcgtggtgc
1801 ggggcctgct ggacctgtat gaggcctcac aggagagtgc gtggctcgag tgggctctgc
1861 ggctgcagga cacacaggac aagctctttt gggactccca gggtggcggc tacttctgca
1921 gtgaggctga gctgggggct ggcctgcccc tgcgtctgaa ggacgaccag gatggagcag
1981 agcccagcgc caattccgtg tcagcccaca acctgctccg gctgcatggc ttcacgggcc
2041 acaaggactg gatggacaag tgtgtgtgcc tattgaccgc cttttccgag cgcatgcgtc
2101 gtgtcccggt ggcgttgccc gagatggtcc gcgccctctc agcccagcag cagaccctca
2161 agcagatcgt gatctgtgga gaccgtcagg ccaaggacac caaggccctg gtgcagtgcg
2221 tccactctgt ctacattcct aacaaggtgc tgattctggc tgatggggac ccctcgagct
2281 tcctgtcccg ccagctgcct ttcctgagta ccctccgacg gttggaagac caggccactg
2341 catatgtgtg tgagaatcaa gcctgctcag tgcccatcac tgatccctgc gaattacgaa
2401 aactactaca tccatgactg ccccaacccc cttggggtgg ggcagaaggt gaagcatccc
2461 aactgactag agactcaggc cctgcagggc cctatagaac ctgtggccat ccctgagcac
2521 cctgccacca ggtgacctcg gccatactca ctgcccccct tgggcaccca ctcaccctag
2581 aataaactta acagtgtccc gtggtaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//