LOCUS BC025716 1761 bp mRNA linear HUM 18-MAR-2009
DEFINITION Homo sapiens hepsin, mRNA (cDNA clone MGC:34588 IMAGE:5228525),
complete cds.
ACCESSION BC025716
VERSION BC025716.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1761)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1761)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (06-MAR-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 49 Row: o Column: 11
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 33695154.
FEATURES Location/Qualifiers
source 1..1761
/db_xref="H-InvDB:HIT000039982"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:34588 IMAGE:5228525"
/tissue_type="Pancreas, Spleen, adult pooled"
/clone_lib="NIH_MGC_120"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1761
/gene="HPN"
/gene_synonym="TMPRSS1"
/db_xref="GeneID:3249"
/db_xref="HGNC:HGNC:5155"
/db_xref="MIM:142440"
CDS 198..1451
/gene="HPN"
/gene_synonym="TMPRSS1"
/codon_start=1
/product="hepsin"
/protein_id="AAH25716.1"
/db_xref="GeneID:3249"
/db_xref="HGNC:HGNC:5155"
/db_xref="MIM:142440"
/translation="MAQKEGGRTVPCCSRPKVAALTAGTLLLLTAIGAASWAIVAVLL
RSDQEPLYPVQVSSADARLMVFDKTEGTWRLLCSSRSNARVAGLSCEEMGFLRALTHS
ELDVRTAGANGTSGFFCVDEGRLPHTQRLLEVISVCDCPRGRFLAAICQDCGRRKLPV
DRIVGGRDTSLGRWPWQVSLRYDGAHLCGGSLLSGDWVLTAAHCFPERNRVLSRWRVF
AGAVAQASPHGLQLGVQAVVYHGGYLPFRDPNSEENSNDIALVHLSSPLPLTEYIQPV
CLPAAGQALVDGKICTVTGWGNTQYYGQQAGVLQEARVPIISNDVCNGADFYGNQIKP
KMFCAGYPEGGIDACQGDSGGPFVCEDSISRTPRWRLCGIVSWGTGCALAQKPGVYTK
VSDFREWIFQAIKTHSEASGMVTQL"
BASE COUNT 321 a 562 c 543 g 335 t
ORIGIN
1 gcctggccta gcaggcccca cgccaccgcc tctgcctcca ggccgcccgc tgctgcgggg
61 ccaccatgct cctgcccagg cctggagact gacccgaccc cggcactacc tcgaggctcc
121 gcccccacct gctggacccc agggtcccac cctggcccag gaggtcagcc agggaatcat
181 taacaagagg cagtgacatg gcgcagaagg agggtggccg gactgtgcca tgctgctcca
241 gacccaaggt ggcagctctc actgcgggga ccctgctact tctgacagcc atcggggcgg
301 catcctgggc cattgtggct gttctcctca ggagtgacca ggagccgctg tacccagtgc
361 aggtcagctc tgcggacgct cggctcatgg tctttgacaa gacggaaggg acgtggcggc
421 tgctgtgctc ctcgcgctcc aacgccaggg tagccggact cagctgcgag gagatgggct
481 tcctcagggc actgacccac tccgagctgg acgtgcgaac ggcgggcgcc aatggcacgt
541 cgggcttctt ctgtgtggac gaggggaggc tgccccacac ccagaggctg ctggaggtca
601 tctccgtgtg tgattgcccc agaggccgtt tcttggccgc catctgccaa gactgtggcc
661 gcaggaagct gcccgtggac cgcatcgtgg gaggccggga caccagcttg ggccggtggc
721 cgtggcaagt cagccttcgc tatgatggag cacacctctg tgggggatcc ctgctctccg
781 gggactgggt gctgacagcc gcccattgct tcccggagcg gaaccgggtc ctgtcccgat
841 ggcgagtgtt tgccggtgcc gtggcccagg cctctcccca cggtctgcag ctgggggtgc
901 aggctgtggt ctaccacggg ggctatcttc cctttcggga ccccaacagc gaggagaaca
961 gcaacgatat tgccctggtc cacctctcca gtcccctgcc cctcacagaa tacatccagc
1021 ctgtgtgcct cccagctgcc ggccaggccc tggtggatgg caagatctgt accgtgacgg
1081 gctggggcaa cacgcagtac tatggccaac aggccggggt actccaggag gctcgagtcc
1141 ccataatcag caatgatgtc tgcaatggcg ctgacttcta tggaaaccag atcaagccca
1201 agatgttctg tgctggctac cccgagggtg gcattgatgc ctgccagggc gacagcggtg
1261 gtccctttgt gtgtgaggac agcatctctc ggacgccacg ttggcggctg tgtggcattg
1321 tgagttgggg cactggctgt gccctggccc agaagccagg cgtctacacc aaagtcagtg
1381 acttccggga gtggatcttc caggccataa agactcactc cgaagccagc ggcatggtga
1441 cccagctctg accggtggct tctcgctgcg cagcctccag ggcccgaggt gatcccggtg
1501 gtgggatcca cgctgggccg aggatgggac gtttttcttc ttgggcccgg tccacaggtc
1561 caaggacacc ctccctccag ggtcctctct tccacagtgg cgggcccact cagccccgag
1621 accacccaac ctcaccctcc tgacccccat gtaaatattg ttctgctgtc tgggactcct
1681 gtctaggtgc ccctgatgat gggatgctct ttaaataata aagatggttt tgattaaaaa
1741 aaaaaaaaaa aaaaaaaaaa a
//