LOCUS       BC029055                1387 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens pepsinogen 5, group I (pepsinogen A), mRNA (cDNA clone
            MGC:36750 IMAGE:5184316), complete cds.
ACCESSION   BC029055
VERSION     BC029055.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1387)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1387)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-MAY-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 50 Row: e Column: 11
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 23943853.
FEATURES             Location/Qualifiers
     source          1..1387
                     /db_xref="H-InvDB:HIT000040672_03"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:36750 IMAGE:5184316"
                     /tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
                     /clone_lib="NIH_MGC_116"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..1387
                     /gene="PGA5"
                     /db_xref="GeneID:5222"
                     /db_xref="HGNC:HGNC:8887"
                     /db_xref="MIM:169730"
     CDS             31..1197
                     /gene="PGA5"
                     /codon_start=1
                     /product="pepsinogen 5, group I (pepsinogen A)"
                     /protein_id="AAH29055.1"
                     /db_xref="GeneID:5222"
                     /db_xref="HGNC:HGNC:8887"
                     /db_xref="MIM:169730"
                     /translation="MKWLLLLGLVALSECIMYKVPLIRKKSLRRTLSERGLLKDFLKK
                     HNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWV
                     PSVYCSSLACTNHNRFNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTN
                     QIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS
                     ADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGETIACAEGCQAIV
                     DTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTINGVQYPVPPS
                     AYILQSEGSCISGFQGMNVPTESGELWILGDVFIRQYFTVFDRANNQVGLAPVA"
BASE COUNT      315 a    438 c    336 g    298 t
ORIGIN
        1 tctccctcga gttgggaccc gggaagaacc atgaagtggc tgctgctgct gggtctggtg
       61 gcgctctctg agtgcatcat gtacaaggtc cccctcatca gaaagaagtc cttgaggcgc
      121 accctgtccg agcgtggcct gctgaaggac ttcctgaaga agcacaacct caacccagcc
      181 agaaagtact tcccccagtg ggaggctccc accctggtag atgaacagcc cctggagaac
      241 tacctggata tggagtactt cggcactatc ggcatcggaa ctcctgccca ggatttcacc
      301 gtcgtctttg acaccggctc ctccaacctg tgggtgccct cagtctactg ctccagtctt
      361 gcctgcacca accacaaccg cttcaaccct gaggattctt ccacctacca gtccaccagc
      421 gagacagtct ccatcaccta cggcaccggc agcatgacag gcatcctcgg atacgacact
      481 gtccaggttg gaggcatctc tgacaccaat cagatcttcg gcctgagcga gacggaacct
      541 ggctccttcc tgtattatgc tcccttcgat ggcatcctgg ggctggccta ccccagcatt
      601 tcctcctccg gggccacacc cgtctttgac aacatctgga accagggcct ggtttctcag
      661 gacctcttct ctgtctacct cagcgccgat gacaagagtg gcagcgtggt gatctttggt
      721 ggcattgact cttcttacta cactggaagt ctgaactggg tgcctgttac cgtcgagggt
      781 tactggcaga tcaccgtgga cagcatcacc atgaacggag agaccatcgc ctgtgctgag
      841 ggctgccagg ccattgttga caccggcacc tctctgctga ccggcccaac cagccccatt
      901 gccaacatcc agagcgacat cggagccagc gagaactcag atggcgacat ggtggtcagc
      961 tgctcagcca tcagcagcct gcccgacatc gtcttcacca tcaatggagt ccagtacccc
     1021 gtgccaccca gtgcctacat cctgcagagc gaggggagct gcatcagtgg cttccagggc
     1081 atgaacgtcc ccaccgaatc tggagagctt tggatcctgg gtgatgtctt catccgccag
     1141 tactttaccg tcttcgacag ggcaaacaac caggtcggcc tggcccctgt ggcttaagcc
     1201 taagtctctt cagccacctc ccaggaagat ctggcctccg tcctatgccc actttagatg
     1261 tatctaattc tcctgactgt tcttcccagg ggagtgtgaa ggtcttggcc ctgttccctg
     1321 tcctaccaat aacgtagaat aaaaacataa cccactgaaa aaaaaaaaaa aaaaaaaaaa
     1381 aaaaaaa
//