LOCUS BC029055 1387 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens pepsinogen 5, group I (pepsinogen A), mRNA (cDNA clone
MGC:36750 IMAGE:5184316), complete cds.
ACCESSION BC029055
VERSION BC029055.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 1387)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 1387)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-MAY-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 50 Row: e Column: 11
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 23943853.
FEATURES Location/Qualifiers
source 1..1387
/db_xref="H-InvDB:HIT000040672_03"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:36750 IMAGE:5184316"
/tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
/clone_lib="NIH_MGC_116"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..1387
/gene="PGA5"
/db_xref="GeneID:5222"
/db_xref="HGNC:HGNC:8887"
/db_xref="MIM:169730"
CDS 31..1197
/gene="PGA5"
/codon_start=1
/product="pepsinogen 5, group I (pepsinogen A)"
/protein_id="AAH29055.1"
/db_xref="GeneID:5222"
/db_xref="HGNC:HGNC:8887"
/db_xref="MIM:169730"
/translation="MKWLLLLGLVALSECIMYKVPLIRKKSLRRTLSERGLLKDFLKK
HNLNPARKYFPQWEAPTLVDEQPLENYLDMEYFGTIGIGTPAQDFTVVFDTGSSNLWV
PSVYCSSLACTNHNRFNPEDSSTYQSTSETVSITYGTGSMTGILGYDTVQVGGISDTN
QIFGLSETEPGSFLYYAPFDGILGLAYPSISSSGATPVFDNIWNQGLVSQDLFSVYLS
ADDKSGSVVIFGGIDSSYYTGSLNWVPVTVEGYWQITVDSITMNGETIACAEGCQAIV
DTGTSLLTGPTSPIANIQSDIGASENSDGDMVVSCSAISSLPDIVFTINGVQYPVPPS
AYILQSEGSCISGFQGMNVPTESGELWILGDVFIRQYFTVFDRANNQVGLAPVA"
BASE COUNT 315 a 438 c 336 g 298 t
ORIGIN
1 tctccctcga gttgggaccc gggaagaacc atgaagtggc tgctgctgct gggtctggtg
61 gcgctctctg agtgcatcat gtacaaggtc cccctcatca gaaagaagtc cttgaggcgc
121 accctgtccg agcgtggcct gctgaaggac ttcctgaaga agcacaacct caacccagcc
181 agaaagtact tcccccagtg ggaggctccc accctggtag atgaacagcc cctggagaac
241 tacctggata tggagtactt cggcactatc ggcatcggaa ctcctgccca ggatttcacc
301 gtcgtctttg acaccggctc ctccaacctg tgggtgccct cagtctactg ctccagtctt
361 gcctgcacca accacaaccg cttcaaccct gaggattctt ccacctacca gtccaccagc
421 gagacagtct ccatcaccta cggcaccggc agcatgacag gcatcctcgg atacgacact
481 gtccaggttg gaggcatctc tgacaccaat cagatcttcg gcctgagcga gacggaacct
541 ggctccttcc tgtattatgc tcccttcgat ggcatcctgg ggctggccta ccccagcatt
601 tcctcctccg gggccacacc cgtctttgac aacatctgga accagggcct ggtttctcag
661 gacctcttct ctgtctacct cagcgccgat gacaagagtg gcagcgtggt gatctttggt
721 ggcattgact cttcttacta cactggaagt ctgaactggg tgcctgttac cgtcgagggt
781 tactggcaga tcaccgtgga cagcatcacc atgaacggag agaccatcgc ctgtgctgag
841 ggctgccagg ccattgttga caccggcacc tctctgctga ccggcccaac cagccccatt
901 gccaacatcc agagcgacat cggagccagc gagaactcag atggcgacat ggtggtcagc
961 tgctcagcca tcagcagcct gcccgacatc gtcttcacca tcaatggagt ccagtacccc
1021 gtgccaccca gtgcctacat cctgcagagc gaggggagct gcatcagtgg cttccagggc
1081 atgaacgtcc ccaccgaatc tggagagctt tggatcctgg gtgatgtctt catccgccag
1141 tactttaccg tcttcgacag ggcaaacaac caggtcggcc tggcccctgt ggcttaagcc
1201 taagtctctt cagccacctc ccaggaagat ctggcctccg tcctatgccc actttagatg
1261 tatctaattc tcctgactgt tcttcccagg ggagtgtgaa ggtcttggcc ctgttccctg
1321 tcctaccaat aacgtagaat aaaaacataa cccactgaaa aaaaaaaaaa aaaaaaaaaa
1381 aaaaaaa
//