LOCUS       AB451339                2301 bp    mRNA    linear   HUM 02-SEP-2008
DEFINITION  Homo sapiens DPP4 mRNA for dipeptidylpeptidase IV, complete cds,
            clone: FLJ85507SAAN.
ACCESSION   AB451339
VERSION     AB451339.1
KEYWORDS    Gateway cloning system.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2301)
  AUTHORS   Kawamura,Y., Nomura,N. and Goshima,N.
  TITLE     Direct Submission
  JOURNAL   Submitted (31-JUL-2008) to the DDBJ/EMBL/GenBank databases.
            Contact:Naoki Goshima
            Biomedicinal Information Research Center (BIRC), National
            Institute of Advanced Industrial Science and Technology (AIST);
            2-42 Aomi, Koto-ku, Tokyo 135-0064, Japan
REFERENCE   2
  AUTHORS   Goshima,N., Kawamura,Y., Fukumoto,A., Miura,A., Honma,R.,
            Satoh,R., Wakamatsu,A., Yamamoto,J.-I., Kimura,K., Nishikawa,T.,
            Andoh,T., Iida,Y., Ishikawa,K., Ito,E., Kagawa,N., Kaminaga,C.,
            Kanehori,K., Kawakami,B., Kenmochi,K.-I., Kimura,R., Kobayashi,M.,
            Kuroita,T., Kuwayama,H., Maruyama,Y., Matsuo,K., Minami,K.,
            Mitsubori,M., Mori,M., Morishita,R., Murase,A., Nishikawa,A.,
            Nishikawa,S., Okamoto,T., Sakagami,N., Sakamoto,Y., Sasaki,Y.,
            Seki,T., Sono,S., Sugiyama,A., Sumiya,T., Takayama,T.,
            Takayama,Y., Takeda,H., Togashi,T., Yahata,K., Yamada,H.,
            Yanagisawa,Y., Endo,Y., Imamoto,F., Kisu,Y., Tanaka,S., Isogai,T.,
            Imai,J.-I., Watanabe,S. and Nomura,N.
  TITLE     Human Protein Factory: an infrastructure to convert the human
            transcriptome into the in vitro-expressed human proteome of
            versatile utility
  JOURNAL   Unpublished (2008)
COMMENT     This human Gateway entry clone was constructed by recombining an
            attB-attached open reading frame (ORF) fragment with attP
            sequences of the Gateway donor vector pDONR201. The ORF sequence
            in the entry clone is flanked with attL1 sequence (100 nt) -
            spacer nucleotides (TC) - Shine-Dalgano sequence (GAAGGAGATA) -
            spacer nucleotides (GA) - Kozak sequence (ACC) upstream of the
            translational initiation codon (ATG) and is also flanked with a
            spacer nucleotide (G) - attL2 sequence (99 nt) downstream of the
            ORF. This is an N-type clone which has an intrinsic stop codon at
            the end of ORF. DNA sequences of the entire ORF and flanking
            regions were validated by sequencing. Clone structure of the ORF
            and flanking sequences is as follows:
            5'-caaataatgattttattttgactgatagtgacctgttcgttgcaacaaattgatgagcaatgct
            tttttataatgccaactttgtacaaaaaagcaggct TC GAAGGAGATA GA ACC 5'ORF3'
            G acccagctttcttgtacaaagttggcattataagaaagcattgcttatcaatttgttgcaacgaa
            caggtcactatcagtcaaaataaaatcattattg-3' (5'ORF3'; ORF sequence. attL
            sequences are described in lower cases).
            Clone information: http://www.HGPD.jp/
            This clone was produced in the "Functional Analysis of Protein and
            Research Application" project supported by the New Energy and
            Industrial Technology Development Organization (NEDO), Japan
FEATURES             Location/Qualifiers
     source          1..2301
                     /clone="FLJ85507SAAN"
                     /db_xref="H-InvDB:HIT000487552"
                     /db_xref="taxon:9606"
                     /mol_type="mRNA"
                     /note="Vector: pDONR201"
                     /organism="Homo sapiens"
     CDS             1..2301
                     /codon_start=1
                     /gene="DPP4"
                     /product="dipeptidylpeptidase IV"
                     /protein_id="BAG70153.1"
                     /transl_table=1
                     /translation="MKTPWKVLLGLLGAAALVTIITVPVVLLNKGTDDATADSRKTYT
                     LTDYLKNTYRLKLYSLRWISDHEYLYKQENNILVFNAEYGNSSVFLENSTFDEFGHSI
                     NDYSISPDGQFILLEYNYVKQWRHSYTASYDIYDLNKRQLITEERIPNNTQWVTWSPV
                     GHKLAYVWNNDIYVKIEPNLPSYRITWTGKEDIIYNGITDWVYEEEVFSAYSALWWSP
                     NGTFLAYAQFNDTEVPLIEYSFYSDESLQYPKTVRVPYPKAGAVNPTVKFFVVNTDSL
                     SSVTNATSIQITAPASMLIGDHYLCDVTWATQERISLQWLRRIQNYSVMDICDYDESS
                     GRWNCLVARQHIEMSTTGWVGRFRPSEPHFTLDGNSFYKIISNEEGYRHICYFQIDKK
                     DCTFITKGTWEVIGIEALTSDYLYYISNEYKGMPGGRNLYKIQLSDYTKVTCLSCELN
                     PERCQYYSVSFSKEAKYYQLRCSGPGLPLYTLHSSVNDKGLRVLEDNSALDKMLQNVQ
                     MPSKKLDFIILNETKFWYQMILPPHFDKSKKYPLLLDVYAGPCSQKADTVFRLNWATY
                     LASTENIIVASFDGRGSGYQGDKIMHAINRRLGTFEVEDQIEAARQFSKMGFVDNKRI
                     AIWGWSYGGYVTSMVLGSGSGVFKCGIAVAPVSRWEYYDSVYTERYMGLPTPEDNLDH
                     YRNSTVMSRAENFKQVEYLLIHGTADDNVHFQQSAQISKALVDVGVDFQAMWYTDEDH
                     GIASSTAHQHIYTHMSHFIKQCFSLP"
BASE COUNT          734 a          453 c          492 g          622 t
ORIGIN      
        1 atgaagacac cgtggaaggt tcttctggga ctgctgggtg ctgctgcgct tgtcaccatc
       61 atcaccgtgc ccgtggttct gctgaacaaa ggcacagatg atgctacagc tgacagtcgc
      121 aaaacttaca ctctaactga ttacttaaaa aatacttata gactgaagtt atactcctta
      181 agatggattt cagatcatga atatctctac aaacaagaaa ataatatctt ggtattcaat
      241 gctgaatatg gaaacagctc agttttcttg gagaacagta catttgatga gtttggacat
      301 tctatcaatg attattcaat atctcctgat gggcagttta ttctcttaga atacaactac
      361 gtgaagcaat ggaggcattc ctacacagct tcatatgaca tttatgattt aaataaaagg
      421 cagctgatta cagaagagag gattccaaac aacacacagt gggtcacatg gtcaccagtg
      481 ggtcataaat tggcatatgt ttggaacaat gacatttatg ttaaaattga accaaattta
      541 ccaagttaca gaatcacatg gacggggaaa gaagatataa tatataatgg aataactgac
      601 tgggtttatg aagaggaagt cttcagtgcc tactctgctc tgtggtggtc tccaaacggc
      661 acttttttag catatgccca atttaacgac acagaagtcc cacttattga atactccttc
      721 tactctgatg agtcactgca gtacccaaag actgtacggg ttccatatcc aaaggcagga
      781 gctgtgaatc caactgtaaa gttctttgtt gtaaatacag actctctcag ctcagtcacc
      841 aatgcaactt ccatacaaat cactgctcct gcttctatgt tgatagggga tcactacttg
      901 tgtgatgtga catgggcaac acaagaaaga atttctttgc agtggctcag gaggattcag
      961 aactattcgg tcatggatat ttgtgactat gatgaatcca gtggaagatg gaactgctta
     1021 gtggcacggc aacacattga aatgagtact actggctggg ttggaagatt taggccttca
     1081 gaacctcatt ttacccttga tggtaatagc ttctacaaga tcatcagcaa tgaagaaggt
     1141 tacagacaca tttgctattt ccaaatagat aaaaaagact gcacatttat tacaaaaggc
     1201 acctgggaag tcatcgggat agaagctcta accagtgatt atctatacta cattagtaat
     1261 gaatataaag gaatgccagg aggaaggaat ctttataaaa tccaacttag tgactataca
     1321 aaagtgacat gcctcagttg tgagctgaat ccggaaaggt gtcagtacta ttctgtgtca
     1381 ttcagtaaag aggcgaagta ttatcagctg agatgttccg gtcctggtct gcccctctat
     1441 actctacaca gcagcgtgaa tgataaaggg ctgagagtcc tggaagacaa ttcagctttg
     1501 gataaaatgc tgcagaatgt ccagatgccc tccaaaaaac tggacttcat tattttgaat
     1561 gaaacaaaat tttggtatca gatgatcttg cctcctcatt ttgataaatc caagaaatat
     1621 cctctactat tagatgtgta tgcaggccca tgtagtcaaa aagcagacac tgtcttcaga
     1681 ctgaactggg ccacttacct tgcaagcaca gaaaacatta tagtagctag ctttgatggc
     1741 agaggaagtg gttaccaagg agataagatc atgcatgcaa tcaacagaag actgggaaca
     1801 tttgaagttg aagatcaaat tgaagcagcc agacaatttt caaaaatggg atttgtggac
     1861 aacaaacgaa ttgcaatttg gggctggtca tatggagggt acgtaacctc aatggtcctg
     1921 ggatcgggaa gtggcgtgtt caagtgtgga atagccgtgg cgcctgtatc ccggtgggag
     1981 tactatgact cagtgtacac agaacgttac atgggtctcc caactccaga agacaacctt
     2041 gaccattaca gaaattcaac agtcatgagc agagctgaaa attttaaaca agttgagtac
     2101 ctccttattc atggaacagc agatgataac gttcactttc agcagtcagc tcagatctcc
     2161 aaagccctgg tcgatgttgg agtggatttc caggcaatgt ggtatactga tgaagaccat
     2221 ggaatagcta gcagcacagc acaccaacat atatataccc acatgagcca cttcataaaa
     2281 caatgtttct ctttacctta a
//