LOCUS BC033016 1354 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens testes-specific protease 50, mRNA (cDNA clone MGC:26854 IMAGE:4822455), complete cds. ACCESSION BC033016 VERSION BC033016.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1354) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1354) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (21-JUN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 33 Row: d Column: 10 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 31543829. FEATURES Location/Qualifiers source 1..1354 /db_xref="H-InvDB:HIT000041762" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:26854 IMAGE:4822455" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..1354 /gene="TSP50" /db_xref="GeneID:29122" /db_xref="MIM:607950" CDS 51..1208 /gene="TSP50" /codon_start=1 /product="testes-specific protease 50" /protein_id="AAH33016.1" /db_xref="GeneID:29122" /db_xref="MIM:607950" /translation="MGRWCQTVARGQRPRTSAPSRAGALLLLLLLLRSAGCWGAGEAP GALSTADPADQSVQCVPKATCPSSRPRLLWQTPTTQTLPSTTMETQFPVSEGKVDPYR SCGFSYEQDPTLRDPEAVARRWPWMVSVRANGTHICAGTIIASQWVLTVAHCLIWRDV IYSVRVGSPWIDQMTQTASDVPVLQVIMHSRYRAQRFWSWVGQANDIGLLKLKQELKY SNYVRPICLPGTDYVLKDHSRCTVTGWGLSKADGMWPQFRTIQEKEVIILNNKECDNF YHNFTKIPTLVQIIKSQMMCAEDTHREKFCYELTGEPLVCSMEGTWYLVGLVSWGAGC QKSEAPPIYLQVSSYQHWIWDCLNGQALALPAPSRTLLLALPLPLSLLAAL" BASE COUNT 256 a 444 c 388 g 266 t ORIGIN 1 agcggactgg gagcgccttc cggagagacg cagtcggctg ccaccccggg atgggtcgct 61 ggtgccagac cgtcgcgcgc gggcagcgcc cccggacgtc tgccccctcc cgcgccggtg 121 ccctgctgct gctgcttctg ttgctgaggt ctgcaggttg ctggggcgca ggggaagccc 181 cgggggcgct gtccactgct gatcccgccg accagagcgt ccagtgtgtc cccaaggcca 241 cctgtccttc cagccggcct cgccttctct ggcagacccc gaccacccag acactgccct 301 cgaccaccat ggagacccaa ttcccagttt ctgaaggcaa agtcgaccca taccgctcct 361 gtggcttttc ctacgagcag gaccccaccc tcagggaccc agaagccgtg gctcggcggt 421 ggccctggat ggtcagcgtg cgggccaatg gcacacacat ctgtgccggc accatcattg 481 cctcccagtg ggtgctgact gtggcccact gcctgatctg gcgtgatgtt atctactcag 541 tgagggtggg gagtccgtgg attgaccaga tgacgcagac cgcctccgat gtcccggtgc 601 tccaggtcat catgcatagc aggtaccggg cccagcggtt ctggtcctgg gtgggccagg 661 ccaacgacat cggcctcctc aagctcaagc aggaactcaa gtacagcaat tacgtgcggc 721 ccatctgcct gcctggcacg gactatgtgt tgaaggacca ttcccgctgc actgtgacgg 781 gctggggact ttccaaggct gacggcatgt ggcctcagtt ccggaccatt caggagaagg 841 aagtcatcat cctgaacaac aaagagtgtg acaatttcta ccacaacttc accaaaatcc 901 ccactctggt tcagatcatc aagtcccaga tgatgtgtgc ggaggacacc cacagggaga 961 agttctgcta tgagctaact ggagagccct tggtctgctc catggagggc acgtggtacc 1021 tggtgggatt ggtgagctgg ggtgcaggct gccagaagag cgaggcccca cccatctacc 1081 tacaggtctc ctcctaccaa cactggatct gggactgcct caacgggcag gccctggccc 1141 tgccagcccc atccaggacc ctgctcctgg cactcccact gcccctcagc ctccttgctg 1201 ccctctgact ctgtgtgccc tccctcactt gtgggccccc cttgcctccg tgcccaggtt 1261 gctgtaggtg cagctgtcac agccctgaga gtcagggtgg agatgaggtg ctcaattaaa 1321 cattactgtt ttccatgcaa aaaaaaaaaa aaaa //