LOCUS X07994 6274 bp mRNA linear HUM 07-OCT-2008 DEFINITION Human mRNA for lactase-phlorizin hydrolase LPH (EC 3.2.1.23-62). ACCESSION X07994 VERSION X07994.1 KEYWORDS glycosylceramidase; hydrolase; phlorizin hydrolase. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 6274) AUTHORS Mantei N. JOURNAL Submitted (28-JUN-1988) to the INSDC. Mantei N., Swiss Federal Institute of Technology Zurich, Universitaetstrasse 16, ETH-Zentrum, CH-8092 Zurich, Switzerland. REFERENCE 2 AUTHORS Mantei N., Villa M., Enzler T., Wacker H., Boll W., James P., Hunziker W., Semenza G. TITLE Complete primary structure of human and rabbit lactase-phlorizin hydrolase: implications for biosynthesis, membrane anchoring and evolution of the enzyme JOURNAL EMBO J. 7(9), 2705-2713(1988). PUBMED 2460343 COMMENT Data kindly reviewed (12-DEC-1988) by Mantei N. FEATURES Location/Qualifiers source 1..6274 /db_xref="H-InvDB:HIT000321293" /organism="Homo sapiens" /mol_type="mRNA" /dev_stage="adult" /clone="pHLac-61, pHLac-5, pHLac-1" /tissue_type="intestinal mucosa" /db_xref="taxon:9606" CDS 12..5795 /note="LPH prepro-polypeptide (AA -19 to 1908)" /db_xref="GOA:P09848" /db_xref="H-InvDB:HIT000321293.15" /db_xref="HGNC:HGNC:6530" /db_xref="InterPro:IPR001360" /db_xref="InterPro:IPR017853" /db_xref="InterPro:IPR018120" /db_xref="InterPro:IPR033132" /db_xref="UniProtKB/Swiss-Prot:P09848" /protein_id="CAA30801.1" /translation="MELSWHVVFIALLSFSCWGSDWESDRNFISTAGPLTNDLLHNLS GLLGDQSSNFVAGDKDMYVCHQPLPTFLPEYFSSLHASQITHYKVFLSWAQLLPAGST QNPDEKTVQCYRRLLKALKTARLQPMVILHHQTLPASTLRRTEAFADLFADYATFAFH SFGDLVGIWFTFSDLEEVIKELPHQESRASQLQTLSDAHRKAYEIYHESYAFQGGKLS VVLRAEDIPELLLEPPISALAQDTVDFLSLDLSYECQNEASLRQKLSKLQTIEPKVKV FIFNLKLPDCPSTMKNPASLLFSLFEAINKDQVLTIGFDINEFLSCSSSSKKSMSCSL TGSLALQPDQQQDHETTDSSPASAYQRVWEAFANQSRAERDAFLQDTFPEGFLWGAST GAFNVEGGWAEGGRGVSIWDPRRPLNTTEGQATLEVASDSYHKVASDVALLCGLRAQV YKFSISWSRIFPMGHGSSPSLPGVAYYNKLIDRLQDAGIEPMATLFHWDLPQALQDHG GWQNESVVDAFLDYAAFCFSTFGDRVKLWVTFHEPWVMSYAGYGTGQHPPGISDPGVA SFKVAHLVLKAHARTWHHYNSHHRPQQQGHVGIVLNSDWAEPLSPERPEDLRASERFL HFMLGWFAHPVFVDGDYPATLRTQIQQMNRQCSHPVAQLPEFTEAEKQLLKGSADFLG LSHYTSRLISNAPQNTCIPSYDTIGGFSQHVNHVWPQTSSSWIRVVPWGIRRLLQFVS LEYTRGKVPIYLAGNGMPIGESENLFDDSLRVDYFNQYINEVLKAIKEDSVDVRSYIA RSLIDGFEGPSGYSQRFGLHHVNFSDSSKSRTPRKSAYFFTSIIEKNGFLTKGAKRLL PPNTVNLPSKVRAFTFPSEVPSKAKVVWEKFSSQPKFERDLFYHGTFRDDFLWGVSSS AYQIEGAWDADGKGPSIWDNFTHTPGSNVKDNATGDIACDSYHQLDADLNMLRALKVK AYRFSISWSRIFPTGRNSSINSHGVDYYNRLINGLVASNIFPMVTLFHWDLPQALQDI GGWENPALIDLFDSYADFCFQTFGDRVKFWMTFNEPMYLAWLGYGSGEFPPGVKDPGW APYRIAHTVIKAHARVYHTYDEKYRQEQKGVISLSLSTHWAEPKSPGVPRDVEAADRM LQFSLGWFAHPIFRNGDYPDTMKWKVGNRSELQHLATSRLPSFTEEEKRFIRATADVF CLNTYYSRIVQHKTPRLNPPSYEDDQEMAEEEDPSWPSTAMNRAAPWGTRRLLNWIKE EYGDIPIYITENGVGLTNPNTEDTDRIFYHKTYINEALKAYRLDGIDLRGYVAWSLMD NFEWLNGYTVKFGLYHVDFNNTNRPRTARASARYYTEVITNNGMPLAREDEFLYGRFP EGFIWSAASAAYQIEGAWRADGKGLSIWDTFSHTPLRVENDAIGDVACDSYHKIAEDL VTLQNLGVSHYRFSISWSRILPDGTTRYINEAGLNYYVRLIDTLLAASIQPQVTIYHW DLPQTLQDVGGWENETIVQRFKEYADVLFQRLGDKVKFWITLNEPFVIAYQGYGYGTA APGVSNRPGTAPYIVGHNLIKAHAEAWHLYNDVYRASQGGVISITISSDWAEPRDPSN QEDVEAARRYVQFMGGWFAHPIFKNGDYNEVMKTRIRDRSLAAGLNKSRLPEFTESEK RRINGTYDFFGFNHYTTVLAYNLNYATAISSFDADRGVASIADRSWPDSGSFWLKMTP FGFRRILNWLKEEYNDPPIYVTENGVSQREETDLNDTARIYYLRTYINEALKAVQDKV DLRGYTVWSAMDNFEWATGFSERFGLHFVNYSDPSLPRIPKASAKFYASVVRCNGFPD PATGPHACLHQPDAGPTISPVRQEEVQFLGLMLGTTEAQTALYVLFSLVLLGVCGLAF LSYKYCKRSKQGKTQRSQQELSPVSSF" sig_peptide 12..68 /note="signal peptide (AA -19 to -1)" misc_feature 69..5792 /note="LPH propeptide (AA 1 to 1908)" mat_peptide 2616..5792 /note="put. mature LPH (AA 850 to 1908)" misc_feature 6250..6255 /note="polyA signal" polyA_site 6274..6274 /note="polyA site" BASE COUNT 1536 a 1692 c 1582 g 1464 t ORIGIN 1 gttcctagaa aatggagctg tcttggcatg tagtctttat tgccctgcta agtttttcat 61 gctgggggtc agactgggag tctgatagaa atttcatttc caccgctggt cctctaacca 121 atgacttgct gcacaacctg agtggtctcc tgggagacca gagttctaac tttgtagcag 181 gggacaaaga catgtatgtt tgtcaccagc cactgcccac tttcctgcca gaatacttca 241 gcagtctcca tgccagtcag atcacccatt ataaggtatt tctgtcatgg gcacagctcc 301 tcccagcagg aagcacccag aatccagacg agaaaacagt gcagtgctac cggcgactcc 361 tcaaggccct caagactgca cggcttcagc ccatggtcat cctgcaccac cagaccctcc 421 ctgccagcac cctccggaga accgaagcct ttgctgacct cttcgccgac tatgccacat 481 tcgccttcca ctccttcggg gacctagttg ggatctggtt caccttcagt gacttggagg 541 aagtgatcaa ggagcttccc caccaggaat caagagcgtc acaactccag accctcagtg 601 atgcccacag aaaagcctat gagatttacc acgaaagcta tgcttttcag ggcggaaaac 661 tctctgttgt cctgcgagct gaagatatcc cggagctcct gctagaacca cccatatctg 721 cgcttgccca ggacacggtc gatttcctct ctcttgattt gtcttatgaa tgccaaaatg 781 aggcaagtct gcggcagaag ctgagtaaat tgcagaccat tgagccaaaa gtgaaagttt 841 tcatcttcaa cctaaaactc ccagactgcc cctccaccat gaagaaccca gccagtctgc 901 tcttcagcct ttttgaagcc ataaataaag accaagtgct caccattggg tttgatatta 961 atgagtttct gagttgttca tcaagttcca agaaaagcat gtcttgttct ctgactggca 1021 gcctggccct tcagcctgac cagcagcagg accacgagac cacggactcc tctcctgcct 1081 ctgcctatca gagagtctgg gaagcatttg ccaatcagtc cagagcggaa agggatgcct 1141 tcctgcagga tactttccct gaaggcttcc tctggggtgc ctccacagga gcctttaacg 1201 tggaaggagg ctgggccgag ggtgggagag gggtgagcat ctgggatcca cgcaggcccc 1261 tgaacaccac tgagggccaa gcgacgctgg aggtggccag cgacagttac cacaaggtag 1321 cctctgacgt cgccctgctt tgcggcctcc gggctcaggt gtacaagttc tccatctcct 1381 ggtcccggat cttccccatg gggcacggga gcagccccag cctcccaggc gttgcctact 1441 acaacaagct gattgacagg ctacaggatg cgggcatcga gcccatggcc acgctgttcc 1501 actgggacct gcctcaggcc ctgcaggatc atggtggatg gcagaatgag agcgtggtgg 1561 atgccttcct ggactatgcg gccttctgct tctccacatt tggggaccgt gtgaagctgt 1621 gggtgacctt ccatgagccg tgggtgatga gctacgcagg ctatggcacc ggccagcacc 1681 ctcccggcat ctctgaccca ggagtggcct cttttaaggt ggctcacttg gtcctcaagg 1741 ctcatgccag aacttggcac cactacaaca gccatcatcg cccacagcag caggggcacg 1801 tgggcattgt gctgaactca gactgggcag aacccctgtc tccagagagg cctgaggacc 1861 tgagagcctc tgagcgcttc ttgcacttca tgctgggctg gtttgcacac cccgtctttg 1921 tggatggaga ctacccagcc accctgagga cccagatcca acagatgaac agacagtgct 1981 cccatcctgt ggctcaactc cccgagttca cagaggcaga gaagcagctc ctgaaaggct 2041 ctgctgattt tctgggtctg tcgcattaca cctcccgcct catcagcaac gccccacaaa 2101 acacctgcat ccctagctat gataccattg gaggcttctc ccaacacgtg aaccatgtgt 2161 ggccccagac ctcatcctct tggattcgtg tggtgccctg ggggataagg aggctgttgc 2221 agtttgtatc cctggaatac acaagaggaa aagttccaat ataccttgcc gggaatggca 2281 tgcccatagg ggaaagtgaa aatctctttg atgattcctt aagagtagac tacttcaatc 2341 aatatatcaa tgaggtgctc aaggctatca aggaagactc tgtggatgtt cgttcctaca 2401 ttgctcgttc cctcattgat ggcttcgaag gcccttctgg ttacagccag cggtttggcc 2461 tgcaccacgt caacttcagc gacagcagca agtcaaggac tcccaggaaa tctgcctact 2521 ttttcactag catcatagaa aagaacggtt tcctcaccaa gggggcaaaa agactgctac 2581 cacctaatac agtaaacctc ccctccaaag tcagagcctt cacttttcca tctgaggtgc 2641 cctccaaggc taaagtcgtt tgggaaaagt tctccagcca acccaagttc gaaagagatt 2701 tgttctacca cgggacgttt cgggatgact ttctgtgggg cgtgtcctct tccgcttatc 2761 agattgaagg cgcgtgggat gccgatggca aaggccccag catctgggat aactttaccc 2821 acacaccagg gagcaatgtg aaagacaatg ccactggaga catcgcctgt gacagctatc 2881 accagctgga tgccgatctg aatatgctcc gagctttgaa ggtgaaggcc taccgcttct 2941 ctatctcctg gtctcggatt ttcccaactg ggagaaacag ctctatcaac agtcatgggg 3001 ttgattatta caacaggctg atcaatggct tggtggcaag caacatcttt cccatggtga 3061 cattgttcca ttgggacctg ccccaggccc tccaggatat cggaggctgg gagaatcctg 3121 ccttgattga cttgtttgac agctacgcag acttttgttt ccagaccttt ggtgatagag 3181 tcaagttttg gatgactttt aatgagccca tgtacctggc atggctaggt tatggctcag 3241 gggaatttcc cccaggggtg aaggacccag gctgggcacc atataggata gcccacaccg 3301 tcatcaaagc ccatgccaga gtctatcaca cgtacgatga gaaatacagg caggagcaga 3361 agggggtcat ctcgctgagc ctcagtacac actgggcaga gcccaagtca ccaggggtcc 3421 ccagagatgt ggaagccgct gaccgaatgc tgcagttctc cctgggctgg tttgctcacc 3481 ccatttttag aaacggagac tatcctgaca ccatgaagtg gaaagtgggg aacaggagtg 3541 aactgcagca cttagccacc tcccgcctgc caagcttcac tgaggaagag aagaggttca 3601 tcagggcgac ggccgacgtc ttctgcctca acacgtacta ctccagaatc gtgcagcaca 3661 aaacacccag gctaaaccca ccctcctacg aagacgacca ggagatggct gaggaggagg 3721 acccttcgtg gccttccacg gcaatgaaca gagctgcgcc ctgggggacg cgaaggctgc 3781 tgaactggat caaggaagag tatggtgaca tccccattta catcaccgaa aacggagtgg 3841 ggctgaccaa tccgaacacg gaggatactg ataggatatt ttaccacaaa acctacatca 3901 atgaggcttt gaaagcctac aggctcgatg gtatagacct tcgagggtat gtcgcctggt 3961 ctctgatgga caactttgag tggctaaatg gctacacggt caagtttgga ctgtaccatg 4021 ttgatttcaa caacacgaac aggcctcgca cagcaagagc ctccgccagg tactacacag 4081 aggtcattac caacaacggc atgccactgg ccagggagga tgagtttctg tacggacggt 4141 ttcctgaggg cttcatctgg agtgcagctt ctgctgcata tcagattgaa ggtgcgtgga 4201 gagcagatgg caaaggactc agcatttggg acacgttttc tcacacacca ctgagggttg 4261 agaacgatgc cattggagac gtggcctgtg acagttatca caagattgct gaggatctgg 4321 tcaccctgca gaacctgggt gtgtcccact accgtttttc catctcctgg tctcgcatcc 4381 tccctgatgg aaccaccagg tacatcaatg aagcgggcct gaactactac gtgaggctca 4441 tcgatacact gctggccgcc agcatccagc cccaggtgac catttaccac tgggacctac 4501 cacagacgct ccaagatgta ggaggctggg agaatgagac catcgtgcag cggtttaagg 4561 agtatgcaga tgtgctcttc cagaggctgg gagacaaggt gaagttttgg atcacgttga 4621 atgagccctt tgtcattgct taccagggct atggctacgg aacagcagct ccaggagtct 4681 ccaataggcc tggcactgcc ccctacattg ttggccacaa tctaataaag gctcatgctg 4741 aggcctggca tctgtacaac gatgtgtacc gcgccagtca aggtggcgtg atttccatca 4801 ccatcagcag tgactgggct gaacccagag atccctctaa ccaggaggat gtggaggcag 4861 ccaggagata tgttcagttc atgggaggct ggtttgcaca tcctattttc aagaatggag 4921 attacaatga ggtgatgaag acgcggatcc gtgacaggag cttggctgca ggcctcaaca 4981 agtctcggct gccagaattt acagagagtg agaagaggag gatcaacggc acctatgact 5041 tttttgggtt caatcactac accactgtcc tcgcctacaa cctcaactat gccactgcca 5101 tctcttcttt tgatgcagac agaggagttg cttccatcgc agatcgctcg tggccagact 5161 ctggctcctt ctggctgaag atgacgcctt ttggcttcag gaggatcctg aactggttaa 5221 aggaggaata caatgaccct ccaatttatg tcacagagaa tggagtgtcc cagcgggaag 5281 aaacagacct caatgacact gcaaggatct actaccttcg gacttacatc aatgaggccc 5341 tcaaagctgt gcaggacaag gtggaccttc gaggatacac agtttggagt gcgatggaca 5401 attttgagtg ggccacaggc ttttcagaga gatttggtct gcattttgtg aactacagtg 5461 acccttctct gccaaggatc cccaaagcat cagcgaagtt ctacgcctct gtggtccgat 5521 gcaatggctt ccctgacccc gctacagggc ctcacgcttg tctccaccag ccagatgctg 5581 gacccaccat cagccccgtg agacaggagg aggtgcagtt cctggggcta atgctcggca 5641 ccacagaagc acagacagct ttgtacgttc tcttttctct tgtgcttctt ggagtctgtg 5701 gcttggcatt tctgtcatac aagtactgca agcgctctaa gcaagggaaa acacaacgaa 5761 gccaacagga attgagcccg gtgtcttcat tctgatgagt taccacctca agttctatga 5821 agcaggccta gtttcttcat ctatctttac cggccaccaa acaccttagg gtcttagact 5881 ctgctgatac tggacttctc cataaagtcc tgctgcaccg ttagagatga ctttaatctt 5941 gaatgatttc gacttgctga gtaaaatgga aatatctcca tcttgctcca gtatcagagt 6001 tcatttgggc atttgagaag caagtagctc ttgcggaaac gtgtagatac tggtctagtg 6061 ggtctgtgaa ccacttaatt gaacttaaca gggctgtttt aagtttcaga gttgttaagg 6121 gttgttaagg gagcaaaaac cgtaaaaatc cttcctataa gaagaaatca actccattgc 6181 atagactgca atatcatctc ctgcccttct gcaagctctc cctagcttca catcttgtgt 6241 tttccagaaa ataaaaacag aagactgtcc tttc //