LOCUS       X02750                  1843 bp    mRNA    linear   HUM 07-OCT-2008
DEFINITION  Human liver mRNA for protein C.
ACCESSION   X02750
VERSION     X02750.1
KEYWORDS    protein C; signal peptide.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1843)
  AUTHORS   Beckmann R.J., Schmidt R.J., Santerre R.F., Plutzky J.,
            Crabtree G.R., Long G.L.
  TITLE     The structure and evolution of a 461 amino acid human protein C
            precursor and its messenger RNA, based upon the DNA sequence of
            cloned human liver cDNAs
  JOURNAL   Nucleic Acids Res. 13(14), 5233-5247(1985).
   PUBMED   2991859
COMMENT     Data kindly reviewed (27-MAR-1986) by G. Long
FEATURES             Location/Qualifiers
     source          1..1843
                     /db_xref="H-InvDB:HIT000320942"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
     CDS             98..1483
                     /note="protein C precursor"
                     /db_xref="GOA:P04070"
                     /db_xref="H-InvDB:HIT000320942.14"
                     /db_xref="HGNC:HGNC:9451"
                     /db_xref="InterPro:IPR000152"
                     /db_xref="InterPro:IPR000294"
                     /db_xref="InterPro:IPR000742"
                     /db_xref="InterPro:IPR001254"
                     /db_xref="InterPro:IPR001314"
                     /db_xref="InterPro:IPR001881"
                     /db_xref="InterPro:IPR009003"
                     /db_xref="InterPro:IPR012224"
                     /db_xref="InterPro:IPR013032"
                     /db_xref="InterPro:IPR017857"
                     /db_xref="InterPro:IPR018097"
                     /db_xref="InterPro:IPR018114"
                     /db_xref="InterPro:IPR033116"
                     /db_xref="InterPro:IPR035972"
                     /db_xref="PDB:1AUT"
                     /db_xref="PDB:1LQV"
                     /db_xref="PDB:1PCU"
                     /db_xref="PDB:2PCT"
                     /db_xref="PDB:3F6U"
                     /db_xref="PDB:3JTC"
                     /db_xref="PDB:4DT7"
                     /db_xref="UniProtKB/Swiss-Prot:P04070"
                     /protein_id="CAA26528.1"
                     /translation="MWQLTSLLLFVATWGISGTPAPLDSVFSSSERAHQVLRIRKRAN
                     SFLEELRHSSLERECIEEICDFEEAKEIFQNVDDTLAFWSKHVDGDQCLVLPLEHPCA
                     SLCCGHGTCIDGIGSFSCDCRSGWEGRFCQREVSFLNCSLDNGGCTHYCLEEVGWRRC
                     SCAPGYKLGDDLLQCHPAVKFPCGRPWKRMEKKRSHLKRDTEDQEDQVDPRLIDGKMT
                     RRGDSPWQVVLLDSKKKLACGAVLIHPSWVLTAAHCMDESKKLLVRLGEYDLRRWEKW
                     ELDLDIKEVFVHPNYSKSTTDNDIALLHLAQPATLSQTIVPICLPDSGLAERELNQAG
                     QETLVTGWGYHSSREKEAKRNRTFVLNFIKIPVVPHNECSEVMSNMVSENMLCAGILG
                     DRQDACEGDSGGPMVASFHGTWFLVGLVSWGEGCGLLHNYGVYTKVSRYLDWIHGHIR
                     DKEAPQKSWAP"
     sig_peptide     98..196
                     /note="signal peptide (aa -42 to -10)"
     misc_feature    197..223
                     /note="propeptide (aa -9 to -1)"
     misc_feature    224..688
                     /note="light chain (aa 1-155)"
     misc_feature    224..358
                     /note="gamma carboxylation domain (aa 1-45)"
     misc_feature    359..496
                     /note="EGF-domain I (aa 46-91)"
     misc_feature    497..634
                     /note="EGF-domain II (aa 92-137)"
     misc_feature    653..730
                     /note="activation peptide region (aa 144-169)"
     misc_feature    695..1531
                     /note="heavy chain (aa 57-419)"
     misc_feature    731..1531
                     /note="serine protease region (aa 170-419)"
     misc_feature    1759..1764
                     /note="polyA signal"
BASE COUNT      417 a    530 c    564 g    332 t
ORIGIN
        1 ctgcaggggg gggggggggg gggggctgtc atggcggcag gacggcgaac ttgcagtatc
       61 tccacgaccc gcccctacag gtgccagtgc ctccagaatg tggcagctca caagcctcct
      121 gctgttcgtg gccacctggg gaatttccgg cacaccagct cctcttgact cagtgttctc
      181 cagcagcgag cgtgcccacc aggtgctgcg gatccgcaaa cgtgccaact ccttcctgga
      241 ggagctccgt cacagcagcc tggagcggga gtgcatagag gagatctgtg acttcgagga
      301 ggccaaggaa attttccaaa atgtggatga cacactggcc ttctggtcca agcacgtcga
      361 cggtgaccag tgcttggtct tgcccttgga gcacccgtgc gccagcctgt gctgcgggca
      421 cggcacgtgc atcgacggca tcggcagctt cagctgcgac tgccgcagcg gctgggaggg
      481 ccgcttctgc cagcgcgagg tgagcttcct caattgctcg ctggacaacg gcggctgcac
      541 gcattactgc ctagaggagg tgggctggcg gcgctgtagc tgtgcgcctg gctacaagct
      601 gggggacgac ctcctgcagt gtcaccccgc agtgaagttc ccttgtggga ggccctggaa
      661 gcggatggag aagaagcgca gtcacctgaa acgagacaca gaagaccaag aagaccaagt
      721 agatccgcgg ctcattgatg ggaagatgac caggcgggga gacagcccct ggcaggtggt
      781 cctgctggac tcaaagaaga agctggcctg cggggcagtg ctcatccacc cctcctgggt
      841 gctgacagcg gcccactgca tggatgagtc caagaagctc cttgtcaggc ttggagagta
      901 tgacctgcgg cgctgggaga agtgggagct ggacctggac atcaaggagg tcttcgtcca
      961 ccccaactac agcaagagca ccaccgacaa tgacatcgca ctgctgcacc tggcccagcc
     1021 cgccaccctc tcgcagacca tagtgcccat ctgcctcccg gacagcggcc ttgcagagcg
     1081 cgagctcaat caggccggcc aggagaccct cgtgacgggc tggggctacc acagcagccg
     1141 agagaaggag gccaagagaa accgcacctt cgtcctcaac ttcatcaaga ttcccgtggt
     1201 cccgcacaat gagtgcagcg aggtcatgag caacatggtg tctgagaaca tgctgtgtgc
     1261 gggcatcctc ggggaccggc aggatgcctg cgagggcgac agtggggggc ccatggtcgc
     1321 ctccttccac ggcacctggt tcctggtggg cctggtgagc tggggtgagg gctgtgggct
     1381 ccttcacaac tacggcgttt acaccaaagt cagccgctac ctcgactgga tccatgggca
     1441 catcagagac aaggaagccc cccagaagag ctgggcacct tagcgaccct ccctgcaggg
     1501 ctgggctttt gcatggcaat ggatgggaca ttaaagggac atgtaacaag cacaccggcc
     1561 tgctgttctg tccttccatc cctcttttgg gctcttctgg agggaagtaa catttactga
     1621 gcacctgttg tatgtcacat gccttatgaa tagaatctta actcctagag caactctgtg
     1681 gggtggggag gagcagatcc aagttttgcg gggtctaaag ctgtgtgtgt tgagggggat
     1741 actctgttta tgaaaaagaa taaaaaacac aaccacgaaa aaaaaaaaaa aaaaaaaaaa
     1801 aaaaaaaaaa aaaaaaaccc ccccccgccc cccccccctg cag
//