LOCUS BC073740 1374 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens progastricsin (pepsinogen C), mRNA (cDNA clone
MGC:88742 IMAGE:6296399), complete cds.
ACCESSION BC073740
VERSION BC073740.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1374)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1374)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (23-JUN-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: CGAP (Stanford)
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 58 Row: l Column: 22
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 4505756.
FEATURES Location/Qualifiers
source 1..1374
/db_xref="H-InvDB:HIT000264971"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:88742 IMAGE:6296399"
/tissue_type="Liver, hepatocellular carcinoma"
/clone_lib="NIH_MGC_100"
/lab_host="DH10B"
/note="Vector: pOTB7"
gene 1..1374
/gene="PGC"
/db_xref="GeneID:5225"
/db_xref="HGNC:HGNC:8890"
/db_xref="MIM:169740"
CDS 50..1216
/gene="PGC"
/codon_start=1
/product="progastricsin (pepsinogen C)"
/protein_id="AAH73740.1"
/db_xref="GeneID:5225"
/db_xref="HGNC:HGNC:8890"
/db_xref="MIM:169740"
/translation="MKWMVVVLVCLQLLEAAVVKVPLKKFKSIRETMKEKGLLGEFLR
THKYDPAWKYRFGDLSVTYEPMAYMDAAYFGEISIGTPPQNFLVLFDTGSSNLWVPSV
YCQSQACTSHSRFNPSESSTYSTNGQTFSLQYGSGSLTGFFGYDTLTVQSIQVPNQEF
GLSENEPGTNFVYAQFDGIMGLAYPALSVDEATTAMQGMVQEGALTSPVFSVYLSNQQ
GSSGGAVVFGGVDSSLYTGQIYWAPVTQELYWQIGIEEFLIGGQASGWCSEGCQAIVD
TGTSLLTVPQQYMSALLQATGAQEDEYGQFLVNCNSIQNLPSLTFIINGVEFPLPPSS
YILSNNGYCTVGVEPTYLSSQNGQPLWILGDVFLRSYYSVYDLGNNRVGFATAA"
BASE COUNT 273 a 415 c 357 g 329 t
ORIGIN
1 gcagaactca gagctgctct tcctctgtgg ccagttgggg accagcatca tgaagtggat
61 ggtggtggtc ttggtctgcc tccagctctt ggaggcagca gtggtcaaag tgcccctgaa
121 gaaatttaag tctatccgtg agaccatgaa ggagaagggc ttgctggggg agttcctgag
181 gacccacaag tatgatcctg cttggaagta ccgctttggt gacctcagcg tgacctacga
241 gcccatggcc tacatggatg ctgcctactt tggtgagatc agcatcggga ctccacccca
301 gaacttcctg gtcctttttg acaccggctc ctccaacttg tgggtgccct ctgtctactg
361 ccagagccag gcctgcacca gtcactcccg cttcaacccc agcgagtcgt ccacctactc
421 caccaatggg cagaccttct ccctgcagta tggcagtggc agcctcaccg gcttctttgg
481 ctatgacacc ctgactgtcc agagcatcca ggtccccaac caggagttcg gcttgagtga
541 gaatgagcct ggtaccaact tcgtctatgc gcagtttgat ggcatcatgg gcctggccta
601 ccctgctctg tccgtggatg aggccaccac agctatgcag ggcatggtgc aggagggcgc
661 cctcaccagc cccgtcttca gcgtctacct cagcaaccag cagggctcca gcgggggagc
721 ggttgtcttt gggggtgtgg atagcagcct gtacacgggg cagatctact gggcgcctgt
781 cacccaggaa ctctactggc agattggcat tgaagagttc ctcatcggcg gccaggcctc
841 cggctggtgt tctgagggtt gccaggccat cgtggacaca ggcacctctc tgctcactgt
901 gccccagcag tacatgagtg ctcttctgca ggccacaggg gcccaggagg atgagtatgg
961 acagtttctc gtgaactgta acagcattca gaatctgccc agcttgacct tcatcatcaa
1021 tggtgtggag ttccctctgc caccttcctc ctatatcctc agtaacaacg gctactgcac
1081 cgtgggagtc gagcccacct acctgtcctc ccagaacggc cagcccctgt ggatcctcgg
1141 ggatgtcttc ctcaggtcct actattccgt ctacgacttg ggcaacaaca gagtaggctt
1201 tgccactgcc gcctagactt gctgcctcga cacgtgggct cccctcttcc tcttgaccct
1261 gcaccctcct agggcattgt atctgtcttt ccactctgga ttcagccttc tttttctgga
1321 ctctggactt tctctaataa taaatagttc ttctttaaaa aaaaaaaaaa aaaa
//