LOCUS       BC073740                1374 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens progastricsin (pepsinogen C), mRNA (cDNA clone
            MGC:88742 IMAGE:6296399), complete cds.
ACCESSION   BC073740
VERSION     BC073740.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1374)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1374)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (23-JUN-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CGAP (Stanford)
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 58 Row: l Column: 22
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4505756.
FEATURES             Location/Qualifiers
     source          1..1374
                     /db_xref="H-InvDB:HIT000264971"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:88742 IMAGE:6296399"
                     /tissue_type="Liver, hepatocellular carcinoma"
                     /clone_lib="NIH_MGC_100"
                     /lab_host="DH10B"
                     /note="Vector: pOTB7"
     gene            1..1374
                     /gene="PGC"
                     /db_xref="GeneID:5225"
                     /db_xref="HGNC:HGNC:8890"
                     /db_xref="MIM:169740"
     CDS             50..1216
                     /gene="PGC"
                     /codon_start=1
                     /product="progastricsin (pepsinogen C)"
                     /protein_id="AAH73740.1"
                     /db_xref="GeneID:5225"
                     /db_xref="HGNC:HGNC:8890"
                     /db_xref="MIM:169740"
                     /translation="MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLR
                     THKYDPAWKYRFGDLSVTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSV
                     YCQSQACTSHSRFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEF
                     GLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQGMVQEGALTSPVFSVYLSNQQ
                     GSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVD
                     TGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSS
                     YILSNNGYCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA"
BASE COUNT      273 a    415 c    357 g    329 t
ORIGIN
        1 gcagaactca gagctgctct tcctctgtgg ccagttgggg accagcatca tgaagtggat
       61 ggtggtggtc ttggtctgcc tccagctctt ggaggcagca gtggtcaaag tgcccctgaa
      121 gaaatttaag tctatccgtg agaccatgaa ggagaagggc ttgctggggg agttcctgag
      181 gacccacaag tatgatcctg cttggaagta ccgctttggt gacctcagcg tgacctacga
      241 gcccatggcc tacatggatg ctgcctactt tggtgagatc agcatcggga ctccacccca
      301 gaacttcctg gtcctttttg acaccggctc ctccaacttg tgggtgccct ctgtctactg
      361 ccagagccag gcctgcacca gtcactcccg cttcaacccc agcgagtcgt ccacctactc
      421 caccaatggg cagaccttct ccctgcagta tggcagtggc agcctcaccg gcttctttgg
      481 ctatgacacc ctgactgtcc agagcatcca ggtccccaac caggagttcg gcttgagtga
      541 gaatgagcct ggtaccaact tcgtctatgc gcagtttgat ggcatcatgg gcctggccta
      601 ccctgctctg tccgtggatg aggccaccac agctatgcag ggcatggtgc aggagggcgc
      661 cctcaccagc cccgtcttca gcgtctacct cagcaaccag cagggctcca gcgggggagc
      721 ggttgtcttt gggggtgtgg atagcagcct gtacacgggg cagatctact gggcgcctgt
      781 cacccaggaa ctctactggc agattggcat tgaagagttc ctcatcggcg gccaggcctc
      841 cggctggtgt tctgagggtt gccaggccat cgtggacaca ggcacctctc tgctcactgt
      901 gccccagcag tacatgagtg ctcttctgca ggccacaggg gcccaggagg atgagtatgg
      961 acagtttctc gtgaactgta acagcattca gaatctgccc agcttgacct tcatcatcaa
     1021 tggtgtggag ttccctctgc caccttcctc ctatatcctc agtaacaacg gctactgcac
     1081 cgtgggagtc gagcccacct acctgtcctc ccagaacggc cagcccctgt ggatcctcgg
     1141 ggatgtcttc ctcaggtcct actattccgt ctacgacttg ggcaacaaca gagtaggctt
     1201 tgccactgcc gcctagactt gctgcctcga cacgtgggct cccctcttcc tcttgaccct
     1261 gcaccctcct agggcattgt atctgtcttt ccactctgga ttcagccttc tttttctgga
     1321 ctctggactt tctctaataa taaatagttc ttctttaaaa aaaaaaaaaa aaaa
//