LOCUS BC045624 4326 bp mRNA linear HUM 17-JUL-2006
DEFINITION Homo sapiens carboxypeptidase D, mRNA (cDNA clone MGC:39560
IMAGE:4830195), complete cds.
ACCESSION BC045624
VERSION BC045624.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 4326)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 4326)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (31-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
Toshiyuki and Piero Carninci (RIKEN)
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 63 Row: a Column: 21
This clone was selected for full length sequencing because it
passed the following selection criteria: Hexamer frequency ORF
analysis.
FEATURES Location/Qualifiers
source 1..4326
/db_xref="H-InvDB:HIT000052947"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:39560 IMAGE:4830195"
/tissue_type="Testis"
/clone_lib="NIH_MGC_97"
/lab_host="DH10B"
/note="Vector: pBluescriptR"
gene 1..4326
/gene="CPD"
/db_xref="GeneID:1362"
/db_xref="HGNC:HGNC:2301"
/db_xref="MIM:603102"
CDS 32..4174
/gene="CPD"
/codon_start=1
/product="carboxypeptidase D"
/protein_id="AAH45624.1"
/db_xref="GeneID:1362"
/db_xref="HGNC:HGNC:2301"
/db_xref="MIM:603102"
/translation="MASGRDERPPWRLGRLLLLMCLLLLGSSARAAHIKKAEATTTTT
SAGAEAAEGQFDRYYHEEELESALREAAAAGLPGLARLFSIGRSVEGRPLWVLRLTAG
LGSLIPEGDAGPDAAGPDAAGPLLPGRPQVKLVGNMHGDETVSRQVLIYLARELAAGY
RRGDPRLVRLLNTTDVYLLPSLNPDGFERAREGDCGFGDGGPSGASGRDNSRGRDLNR
SFPDQFSTGEPPALDEVPEVRALIEWIRRNKFVLSGNLHGGSVVASYPFDDSPEHKAT
GIYSKTSDDEVFKYLAKAYASNHPIMKTGEPHCPGDEDETFKDGITNGAHWYDVEGGM
QDYNYVWANCFEITLELSCCKYPPASQLRQEWENNRESLITLIEKVHIGVKGFVKDSI
TGSGLENATISVAGINHNITTGRFGDFYRLLVPGTYNLTVVLTGYMPLTVTNVVVKEG
PATEVDFSLRPTVTSVIPDTTEAVSTASTVAIPNILSGTSSSYQPIQPKDFHHHHFPD
MEIFLRRFANEYPNITRLYSLGKSVESRELYVMEISDNPGVHEPGEPEFKYIGNMHGN
EVVGRELLLNLIEYLCKNFGTDPEVTDLVHNTRIHLMPSMNPDGYEKSQEGDSISVIG
RNNSNNFDLNRNFPDQFVQITDPTQPETIAVMSWMKSYPFVLSANLHGGSLVVNYPFD
DDEQGLATYSKSPDDAVFQQIALSYSKENSQMFQGRPCKNMYPNEYFPHGITNGASWY
NVPGGMQDWNYLQTNCFEVTIELGCVKYPLEKELPNFWEQNRRSLIQFMKQVHQGVRG
FVLDATDGRGILNATISVAEINHPVTTYKTGDYWRLLVPGTYKITASARGYNPVTKNV
TVKSEGAIQVNFTLVRSSTDSNNESKKGKGASSSTNDASDPTTKEFETLIKDLSAENG
LESLMLRSSSNLALALYRYHSYKDLSEFLRGLVMNYPHITNLTNLGQSTEYRHIWSLE
ISNKPNVSEPEEPKIRFVAGIHGNAPVGTELLLALAEFLCLNYKKNPAVTQLVDRTRI
VIVPSLNPDGRERAQEKDCTSKIGQTNARGKDLDTDFTNNASQPETKAIIENLIQKQD
FSLSVALDGGSMLVTYPYDKPVQTVENKETLKHLASLYANNHPSMHMGQPSCPNKSDE
NIPGGVMRGAEWHSHLGSMKDYSVTYGHCPEITVYTSCCYFPSAARLPSLWADNKRSL
LSMLVEVHKGVHGFVKDKTGKPISKAVIVLNEGIKVQTKEGGYFHVLLAPGVHNIIAI
ADGYQQQHSQVFVHHDAASSVVIVFDTDNRIFGLPRELVVTVSGATMSALILTACIIW
CICSIKSNRHKDGFHRLRQHHDEYEDEIRMMSTGSKKSLLSHEFQDETDTEEETLYSS
KH"
BASE COUNT 1224 a 982 c 1042 g 1078 t
ORIGIN
1 agcgagccgg ggttagcggc gctgctggaa gatggcgagc ggccgggacg agcggccgcc
61 ttggcggcta gggcggctcc tgttgctcat gtgcctgctg ctgctgggga gctcggcccg
121 ggcggctcac atcaagaagg cggaggcgac taccacaact acgagcgcgg gcgccgaggc
181 ggccgagggc cagttcgacc gctactacca cgaagaggag ttggagtcgg cgctgaggga
241 ggcggcggcc gcgggcctcc ccggcctggc ccgcctcttt agcatcggcc gctcggtgga
301 aggccggccg ctgtgggtgc ttcgcctcac cgccggcctg gggtcgctaa tccctgaggg
361 cgacgcgggg cctgacgctg ccgggcccga cgctgcgggg ccgctgctgc ccggccggcc
421 ccaggtgaag ctggtgggca acatgcatgg cgacgagacc gtgtcgcgcc aggtgttgat
481 ctacttggcc cgcgagctgg cggccggcta ccgccgcggg gacccgcgcc tggtccgcct
541 gctcaacacc accgacgtgt acctgctgcc cagcctcaac cccgatggct tcgagcgtgc
601 ccgcgagggc gactgtggct tcggcgacgg cggcccgtcc ggggccagcg gccgcgacaa
661 tagtcgcggc cgcgacctca accgaagctt tcccgaccag tttagcaccg gcgaaccccc
721 cgccctggac gaggtgcccg aggtgcgcgc cctcatcgag tggatccgca ggaacaagtt
781 tgtgctttct ggaaatctgc atggtggctc agtggtagca agctatcctt ttgatgattc
841 tccagaacat aaggccactg gaatctatag caaaacctca gatgatgaag tatttaaata
901 cttggcaaaa gcttatgctt caaaccaccc cataatgaaa actggtgagc ctcattgtcc
961 aggagatgaa gacgagactt tcaaagatgg aatcacaaac ggcgcacatt ggtatgatgt
1021 ggaaggtggt atgcaagatt acaattatgt gtgggccaac tgttttgaga tcacattaga
1081 actgtcttgt tgcaagtacc cacctgcttc acagcttcga caggaatggg agaacaatcg
1141 tgagtctttg atcacattga ttgaaaaggt tcacattgga gtgaaaggat ttgttaaaga
1201 ttccataaca ggatctgggt tagagaatgc aaccatctca gtggctggta ttaatcataa
1261 tatcacaaca ggcagatttg gtgatttcta ccgattactt gttcctggaa cttacaacct
1321 tacagtagtt ttaactgggt atatgccatt gactgttact aatgtagtgg tgaaagaagg
1381 accagccaca gaggtggatt tttctcttag gccaactgta acttcagtaa tccctgacac
1441 gacagaggct gtatcaactg ctagcacagt tgctatacct aatattcttt ctggaacatc
1501 atcctcctac cagccaattc agccaaagga ctttcaccac caccatttcc ctgatatgga
1561 aatcttcttg agaaggtttg ccaatgaata tcctaacatt acccggcttt attccttggg
1621 aaaatcagta gagtcaagag aactttatgt gatggagata tctgataatc cgggtgtcca
1681 tgaaccaggt gaaccagaat ttaagtacat tggaaatatg catggaaatg aagtggttgg
1741 aagagaactg ctgttgaacc tcatagaata cctttgtaag aactttggaa cagaccctga
1801 agtcacagat ttggttcata acactagaat tcaccttatg ccatccatga atcctgatgg
1861 gtatgaaaag tcccaggaag gagattcaat aagtgtaatt ggcagaaaca acagcaacaa
1921 ctttgacctg aaccgaaatt tcccagacca gtttgttcag atcacagatc ctacgcaacc
1981 agaaactatt gctgtaatga gctggatgaa gtcctatcca tttgtacttt cagcaaacct
2041 gcatggaggt tctttggtgg ttaactaccc ttttgatgat gatgaacaag gacttgccac
2101 atatagtaaa tcaccagatg atgctgtgtt ccaacaaata gcactttctt attccaagga
2161 aaattcccag atgtttcaag gtagaccttg caagaatatg tatcctaatg aatattttcc
2221 tcatggaata acaaatggag ctagttggta taatgtgcca ggaggaatgc aggactggaa
2281 ctatttacaa acaaattgct ttgaagtgac tattgaacta ggttgtgtga aatatccact
2341 tgagaaagag ctgccaaact tttgggaaca gaatcgaaga tcactaatcc agtttatgaa
2401 acaggttcat cagggcgtca gaggatttgt tctagatgcc acagatggca ggggtatatt
2461 aaatgccacc attagtgttg ctgagattaa tcacccagtg actacttaca aaactggaga
2521 ttactggcgt ctcttggttc caggaactta taaaatcaca gcatctgctc gagggtataa
2581 tccagttacc aagaatgtga ctgtcaagag tgaaggcgct attcaggtca acttcacact
2641 tgttcgatcc tcaacagatt caaacaatga atcaaagaaa ggaaaagggg ctagcagcag
2701 caccaatgat gccagtgatc caactactaa agagtttgaa actttaatta aagacctttc
2761 agcggagaat ggtttggaaa gcctcatgtt acgctcctcc tcaaatctgg ctctggctct
2821 ttatcgatac cattcctaca aagacttatc agagtttctg agaggacttg taatgaacta
2881 tccacatatt acaaatctta ccaatttggg acagagcact gaatatcgtc acatttggtc
2941 ccttgaaatc tccaataagc ccaatgtatc tgagcctgaa gaaccaaaga ttcgttttgt
3001 tgctggtatc catggaaatg cgccagttgg aactgaactg cttttggctc tggcagaatt
3061 tctctgcctg aactacaaaa agaacccagc tgttacccaa ttggttgaca ggactaggat
3121 tgtgattgtc ccttctctaa atccagatgg gcgagagaga gctcaagaga aagactgtac
3181 ttcaaaaata ggacaaacaa atgctcgtgg caaagatttg gatacagact tcacaaataa
3241 tgcctcccaa cctgagacca aagccatcat tgaaaatttg attcaaaaac aggactttag
3301 tctttctgtt gccttagatg gtggttccat gctggtcaca tatccttatg acaagccagt
3361 acagacagtg gaaaataaag agactctgaa gcatttggca tctctttatg caaataatca
3421 tccatccatg cacatgggtc agcccagttg cccaaataaa tcagatgaga atattccagg
3481 aggagtaatg cgtggagcag aatggcatag tcacctgggc agcatgaagg attatagtgt
3541 cacctatggc cattgtccgg aaatcacagt atacacaagc tgctgttact ttcctagtgc
3601 tgcacgactc ccttccttgt gggcagacaa taagagatct cttcttagta tgttagtgga
3661 ggttcacaag ggagttcatg gatttgttaa agataagact ggaaagccaa tctctaaagc
3721 agtcattgta cttaatgaag gaataaaggt acaaacaaaa gagggaggtt atttccatgt
3781 actcttagcg ccaggtgtcc ataacattat tgccatcgct gatgggtacc agcaacaaca
3841 ttcacaggtc tttgtgcatc atgatgcagc tagttctgtg gtgatagtct ttgacacaga
3901 taaccggata tttggtttgc caagggagct tgtggtaact gtatcaggtg ctactatgtc
3961 ggcattgatc ctaacagctt gcattatttg gtgcatctgc tcaatcaagt ctaatagaca
4021 caaggatggc tttcatcggc tcaggcagca tcatgatgag tatgaagatg aaattcgcat
4081 gatgtctacc ggctccaaga agtccctcct aagccatgag ttccaggatg aaacagacac
4141 tgaagaggaa acattatatt ctagcaaaca ttgaaaaaca cattttgcat atctcccagc
4201 ataagtacca agcaaaatta cagttcctct tgggagaaca ctgcattaag aagagagact
4261 ctcttgcttc ttcaaagagc tttgggaaat taaattgcta aatttgtatt caaaaaaaaa
4321 aaaaaa
//