LOCUS       BC025716                1761 bp    mRNA    linear   HUM 18-MAR-2009
DEFINITION  Homo sapiens hepsin, mRNA (cDNA clone MGC:34588 IMAGE:5228525),
            complete cds.
ACCESSION   BC025716
VERSION     BC025716.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1761)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M.,
            Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H.,
            Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T.,
            Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K.,
            Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B.,
            Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J.,
            Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S.,
            Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J.,
            Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A.,
            Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M.,
            Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J.,
            Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A.,
            Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C.,
            Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W.,
            Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J.,
            Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I.,
            Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J.
            and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1761)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (06-MAR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC), Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,
            McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S.,
            Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A.,
            Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 49 Row: o Column: 11
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            33695154.
FEATURES             Location/Qualifiers
     source          1..1761
                     /db_xref="H-InvDB:HIT000039982"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:34588 IMAGE:5228525"
                     /tissue_type="Pancreas, Spleen, adult pooled"
                     /clone_lib="NIH_MGC_120"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1761
                     /gene="HPN"
                     /gene_synonym="TMPRSS1"
                     /db_xref="GeneID:3249"
                     /db_xref="HGNC:HGNC:5155"
                     /db_xref="MIM:142440"
     CDS             198..1451
                     /gene="HPN"
                     /gene_synonym="TMPRSS1"
                     /codon_start=1
                     /product="hepsin"
                     /protein_id="AAH25716.1"
                     /db_xref="GeneID:3249"
                     /db_xref="HGNC:HGNC:5155"
                     /db_xref="MIM:142440"
                     /translation="MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLL
                     RSDQEPLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHS
                     ELDVRTAGANGTSGFFCVDEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPV
                     DRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVF
                     AGAVAQASPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPV
                     CLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQEARVPIISNDVCNGADFYGNQIKP
                     KMFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTGCALAQKPGVYTK
                     VSDFREWIFQAIKTHSEASGMVTQL"
BASE COUNT      321 a    562 c    543 g    335 t
ORIGIN
        1 gcctggccta gcaggcccca cgccaccgcc tctgcctcca ggccgcccgc tgctgcgggg
       61 ccaccatgct cctgcccagg cctggagact gacccgaccc cggcactacc tcgaggctcc
      121 gcccccacct gctggacccc agggtcccac cctggcccag gaggtcagcc agggaatcat
      181 taacaagagg cagtgacatg gcgcagaagg agggtggccg gactgtgcca tgctgctcca
      241 gacccaaggt ggcagctctc actgcgggga ccctgctact tctgacagcc atcggggcgg
      301 catcctgggc cattgtggct gttctcctca ggagtgacca ggagccgctg tacccagtgc
      361 aggtcagctc tgcggacgct cggctcatgg tctttgacaa gacggaaggg acgtggcggc
      421 tgctgtgctc ctcgcgctcc aacgccaggg tagccggact cagctgcgag gagatgggct
      481 tcctcagggc actgacccac tccgagctgg acgtgcgaac ggcgggcgcc aatggcacgt
      541 cgggcttctt ctgtgtggac gaggggaggc tgccccacac ccagaggctg ctggaggtca
      601 tctccgtgtg tgattgcccc agaggccgtt tcttggccgc catctgccaa gactgtggcc
      661 gcaggaagct gcccgtggac cgcatcgtgg gaggccggga caccagcttg ggccggtggc
      721 cgtggcaagt cagccttcgc tatgatggag cacacctctg tgggggatcc ctgctctccg
      781 gggactgggt gctgacagcc gcccattgct tcccggagcg gaaccgggtc ctgtcccgat
      841 ggcgagtgtt tgccggtgcc gtggcccagg cctctcccca cggtctgcag ctgggggtgc
      901 aggctgtggt ctaccacggg ggctatcttc cctttcggga ccccaacagc gaggagaaca
      961 gcaacgatat tgccctggtc cacctctcca gtcccctgcc cctcacagaa tacatccagc
     1021 ctgtgtgcct cccagctgcc ggccaggccc tggtggatgg caagatctgt accgtgacgg
     1081 gctggggcaa cacgcagtac tatggccaac aggccggggt actccaggag gctcgagtcc
     1141 ccataatcag caatgatgtc tgcaatggcg ctgacttcta tggaaaccag atcaagccca
     1201 agatgttctg tgctggctac cccgagggtg gcattgatgc ctgccagggc gacagcggtg
     1261 gtccctttgt gtgtgaggac agcatctctc ggacgccacg ttggcggctg tgtggcattg
     1321 tgagttgggg cactggctgt gccctggccc agaagccagg cgtctacacc aaagtcagtg
     1381 acttccggga gtggatcttc caggccataa agactcactc cgaagccagc ggcatggtga
     1441 cccagctctg accggtggct tctcgctgcg cagcctccag ggcccgaggt gatcccggtg
     1501 gtgggatcca cgctgggccg aggatgggac gtttttcttc ttgggcccgg tccacaggtc
     1561 caaggacacc ctccctccag ggtcctctct tccacagtgg cgggcccact cagccccgag
     1621 accacccaac ctcaccctcc tgacccccat gtaaatattg ttctgctgtc tgggactcct
     1681 gtctaggtgc ccctgatgat gggatgctct ttaaataata aagatggttt tgattaaaaa
     1741 aaaaaaaaaa aaaaaaaaaa a
//