LOCUS BC033016 1354 bp mRNA linear HUM 17-JUL-2006
DEFINITION Homo sapiens testes-specific protease 50, mRNA (cDNA clone
MGC:26854 IMAGE:4822455), complete cds.
ACCESSION BC033016
VERSION BC033016.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1354)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1354)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (21-JUN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Institute for Systems Biology
http://www.systemsbiology.org
contact: amadan@systemsbiology.org
Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 33 Row: d Column: 10
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 31543829.
FEATURES Location/Qualifiers
source 1..1354
/db_xref="H-InvDB:HIT000041762"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:26854 IMAGE:4822455"
/tissue_type="Testis"
/clone_lib="NIH_MGC_97"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..1354
/gene="TSP50"
/db_xref="GeneID:29122"
/db_xref="MIM:607950"
CDS 51..1208
/gene="TSP50"
/codon_start=1
/product="testes-specific protease 50"
/protein_id="AAH33016.1"
/db_xref="GeneID:29122"
/db_xref="MIM:607950"
/translation="MGRWCQTVARGQRPRTSAPSRAGALLLLLLLLRSAGCWGAGEAP
GALSTADPADQSVQCVPKATCPSSRPRLLWQTPTTQTLPSTTMETQFPVSEGKVDPYR
SCGFSYEQDPTLRDPEAVARRWPWMVSVRANGTHICAGTIIASQWVLTVAHCLIWRDV
IYSVRVGSPWIDQMTQTASDVPVLQVIMHSRYRAQRFWSWVGQANDIGLLKLKQELKY
SNYVRPICLPGTDYVLKDHSRCTVTGWGLSKADGMWPQFRTIQEKEVIILNNKECDNF
YHNFTKIPTLVQIIKSQMMCAEDTHREKFCYELTGEPLVCSMEGTWYLVGLVSWGAGC
QKSEAPPIYLQVSSYQHWIWDCLNGQALALPAPSRTLLLALPLPLSLLAAL"
BASE COUNT 256 a 444 c 388 g 266 t
ORIGIN
1 agcggactgg gagcgccttc cggagagacg cagtcggctg ccaccccggg atgggtcgct
61 ggtgccagac cgtcgcgcgc gggcagcgcc cccggacgtc tgccccctcc cgcgccggtg
121 ccctgctgct gctgcttctg ttgctgaggt ctgcaggttg ctggggcgca ggggaagccc
181 cgggggcgct gtccactgct gatcccgccg accagagcgt ccagtgtgtc cccaaggcca
241 cctgtccttc cagccggcct cgccttctct ggcagacccc gaccacccag acactgccct
301 cgaccaccat ggagacccaa ttcccagttt ctgaaggcaa agtcgaccca taccgctcct
361 gtggcttttc ctacgagcag gaccccaccc tcagggaccc agaagccgtg gctcggcggt
421 ggccctggat ggtcagcgtg cgggccaatg gcacacacat ctgtgccggc accatcattg
481 cctcccagtg ggtgctgact gtggcccact gcctgatctg gcgtgatgtt atctactcag
541 tgagggtggg gagtccgtgg attgaccaga tgacgcagac cgcctccgat gtcccggtgc
601 tccaggtcat catgcatagc aggtaccggg cccagcggtt ctggtcctgg gtgggccagg
661 ccaacgacat cggcctcctc aagctcaagc aggaactcaa gtacagcaat tacgtgcggc
721 ccatctgcct gcctggcacg gactatgtgt tgaaggacca ttcccgctgc actgtgacgg
781 gctggggact ttccaaggct gacggcatgt ggcctcagtt ccggaccatt caggagaagg
841 aagtcatcat cctgaacaac aaagagtgtg acaatttcta ccacaacttc accaaaatcc
901 ccactctggt tcagatcatc aagtcccaga tgatgtgtgc ggaggacacc cacagggaga
961 agttctgcta tgagctaact ggagagccct tggtctgctc catggagggc acgtggtacc
1021 tggtgggatt ggtgagctgg ggtgcaggct gccagaagag cgaggcccca cccatctacc
1081 tacaggtctc ctcctaccaa cactggatct gggactgcct caacgggcag gccctggccc
1141 tgccagcccc atccaggacc ctgctcctgg cactcccact gcccctcagc ctccttgctg
1201 ccctctgact ctgtgtgccc tccctcactt gtgggccccc cttgcctccg tgcccaggtt
1261 gctgtaggtg cagctgtcac agccctgaga gtcagggtgg agatgaggtg ctcaattaaa
1321 cattactgtt ttccatgcaa aaaaaaaaaa aaaa
//