LOCUS AB451488 2298 bp mRNA linear HUM 02-SEP-2008 DEFINITION Homo sapiens DPP4 mRNA for dipeptidylpeptidase IV, partial cds, clone: FLJ85507SAAF. ACCESSION AB451488 VERSION AB451488.1 KEYWORDS Gateway cloning system. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2298) AUTHORS Kawamura,Y., Nomura,N. and Goshima,N. TITLE Direct Submission JOURNAL Submitted (31-JUL-2008) to the DDBJ/EMBL/GenBank databases. Contact:Naoki Goshima Biomedicinal Information Research Center (BIRC), National Institute of Advanced Industrial Science and Technology (AIST); 2-42 Aomi, Koto-ku, Tokyo 135-0064, Japan REFERENCE 2 AUTHORS Goshima,N., Kawamura,Y., Fukumoto,A., Miura,A., Honma,R., Satoh,R., Wakamatsu,A., Yamamoto,J.-I., Kimura,K., Nishikawa,T., Andoh,T., Iida,Y., Ishikawa,K., Ito,E., Kagawa,N., Kaminaga,C., Kanehori,K., Kawakami,B., Kenmochi,K.-I., Kimura,R., Kobayashi,M., Kuroita,T., Kuwayama,H., Maruyama,Y., Matsuo,K., Minami,K., Mitsubori,M., Mori,M., Morishita,R., Murase,A., Nishikawa,A., Nishikawa,S., Okamoto,T., Sakagami,N., Sakamoto,Y., Sasaki,Y., Seki,T., Sono,S., Sugiyama,A., Sumiya,T., Takayama,T., Takayama,Y., Takeda,H., Togashi,T., Yahata,K., Yamada,H., Yanagisawa,Y., Endo,Y., Imamoto,F., Kisu,Y., Tanaka,S., Isogai,T., Imai,J.-I., Watanabe,S. and Nomura,N. TITLE Human Protein Factory: an infrastructure to convert the human transcriptome into the in vitro-expressed human proteome of versatile utility JOURNAL Unpublished (2008) COMMENT This human Gateway entry clone was constructed by recombining an attB-attached open reading frame (ORF) fragment with attP sequences of the Gateway donor vector pDONR201. The ORF sequence in the entry clone is flanked with attL1 sequence (100 nt) - spacer nucleotides (TC) - Shine-Dalgano sequence (GAAGGAGATA) - spacer nucleotides (GA) - Kozak sequence (ACC) upstream of the translational initiation codon (ATG) and is also flanked with spacer nucleotides (TATG) - attL2 sequence (99 nt) downstream of the ORF. This is an F-type clone that deletes the stop codon for C-terminal tagged proteins. DNA sequences of the entire ORF and flanking regions were validated by sequencing. Clone structure of the ORF and flanking sequences is as follows: 5'-caaataatgattttattttgactgatagtgacctgttcgttgcaacaaattgatgagcaatgct tttttataatgccaactttgtacaaaaaagcaggct TC GAAGGAGATA GA ACC 5'ORF3' TATG acccagctttcttgtacaaagttggcattataagaaagcattgcttatcaatttgttgcaac gaacaggtcactatcagtcaaaataaaatcattattg-3' (5'ORF3'; ORF sequence. attL sequences are described in lower cases). Clone information: http://www.HGPD.jp/ This clone was produced in the "Functional Analysis of Protein and Research Application" project supported by the New Energy and Industrial Technology Development Organization (NEDO), Japan FEATURES Location/Qualifiers source 1..2298 /clone="FLJ85507SAAF" /db_xref="H-InvDB:HIT000487701" /db_xref="taxon:9606" /mol_type="mRNA" /note="Vector: pDONR201" /organism="Homo sapiens" CDS 1..>2298 /codon_start=1 /gene="DPP4" /product="dipeptidylpeptidase IV" /protein_id="BAG70302.1" /transl_table=1 /translation="MKTPWKVLLGLLGAAALVTIITVPVVLLNKGTDDATADSRKTYT LTDYLKNTYRLKLYSLRWISDHEYLYKQENNILVFNAEYGNSSVFLENSTFDEFGHSI NDYSISPDGQFILLEYNYVKQWRHSYTASYDIYDLNKRQLITEERIPNNTQWVTWSPV GHKLAYVWNNDIYVKIEPNLPSYRITWTGKEDIIYNGITDWVYEEEVFSAYSALWWSP NGTFLAYAQFNDTEVPLIEYSFYSDESLQYPKTVRVPYPKAGAVNPTVKFFVVNTDSL SSVTNATSIQITAPASMLIGDHYLCDVTWATQERISLQWLRRIQNYSVMDICDYDESS GRWNCLVARQHIEMSTTGWVGRFRPSEPHFTLDGNSFYKIISNEEGYRHICYFQIDKK DCTFITKGTWEVIGIEALTSDYLYYISNEYKGMPGGRNLYKIQLSDYTKVTCLSCELN PERCQYYSVSFSKEAKYYQLRCSGPGLPLYTLHSSVNDKGLRVLEDNSALDKMLQNVQ MPSKKLDFIILNETKFWYQMILPPHFDKSKKYPLLLDVYAGPCSQKADTVFRLNWATY LASTENIIVASFDGRGSGYQGDKIMHAINRRLGTFEVEDQIEAARQFSKMGFVDNKRI AIWGWSYGGYVTSMVLGSGSGVFKCGIAVAPVSRWEYYDSVYTERYMGLPTPEDNLDH YRNSTVMSRAENFKQVEYLLIHGTADDNVHFQQSAQISKALVDVGVDFQAMWYTDEDH GIASSTAHQHIYTHMSHFIKQCFSLP" BASE COUNT 732 a 453 c 492 g 621 t ORIGIN 1 atgaagacac cgtggaaggt tcttctggga ctgctgggtg ctgctgcgct tgtcaccatc 61 atcaccgtgc ccgtggttct gctgaacaaa ggcacagatg atgctacagc tgacagtcgc 121 aaaacttaca ctctaactga ttacttaaaa aatacttata gactgaagtt atactcctta 181 agatggattt cagatcatga atatctctac aaacaagaaa ataatatctt ggtattcaat 241 gctgaatatg gaaacagctc agttttcttg gagaacagta catttgatga gtttggacat 301 tctatcaatg attattcaat atctcctgat gggcagttta ttctcttaga atacaactac 361 gtgaagcaat ggaggcattc ctacacagct tcatatgaca tttatgattt aaataaaagg 421 cagctgatta cagaagagag gattccaaac aacacacagt gggtcacatg gtcaccagtg 481 ggtcataaat tggcatatgt ttggaacaat gacatttatg ttaaaattga accaaattta 541 ccaagttaca gaatcacatg gacggggaaa gaagatataa tatataatgg aataactgac 601 tgggtttatg aagaggaagt cttcagtgcc tactctgctc tgtggtggtc tccaaacggc 661 acttttttag catatgccca atttaacgac acagaagtcc cacttattga atactccttc 721 tactctgatg agtcactgca gtacccaaag actgtacggg ttccatatcc aaaggcagga 781 gctgtgaatc caactgtaaa gttctttgtt gtaaatacag actctctcag ctcagtcacc 841 aatgcaactt ccatacaaat cactgctcct gcttctatgt tgatagggga tcactacttg 901 tgtgatgtga catgggcaac acaagaaaga atttctttgc agtggctcag gaggattcag 961 aactattcgg tcatggatat ttgtgactat gatgaatcca gtggaagatg gaactgctta 1021 gtggcacggc aacacattga aatgagtact actggctggg ttggaagatt taggccttca 1081 gaacctcatt ttacccttga tggtaatagc ttctacaaga tcatcagcaa tgaagaaggt 1141 tacagacaca tttgctattt ccaaatagat aaaaaagact gcacatttat tacaaaaggc 1201 acctgggaag tcatcgggat agaagctcta accagtgatt atctatacta cattagtaat 1261 gaatataaag gaatgccagg aggaaggaat ctttataaaa tccaacttag tgactataca 1321 aaagtgacat gcctcagttg tgagctgaat ccggaaaggt gtcagtacta ttctgtgtca 1381 ttcagtaaag aggcgaagta ttatcagctg agatgttccg gtcctggtct gcccctctat 1441 actctacaca gcagcgtgaa tgataaaggg ctgagagtcc tggaagacaa ttcagctttg 1501 gataaaatgc tgcagaatgt ccagatgccc tccaaaaaac tggacttcat tattttgaat 1561 gaaacaaaat tttggtatca gatgatcttg cctcctcatt ttgataaatc caagaaatat 1621 cctctactat tagatgtgta tgcaggccca tgtagtcaaa aagcagacac tgtcttcaga 1681 ctgaactggg ccacttacct tgcaagcaca gaaaacatta tagtagctag ctttgatggc 1741 agaggaagtg gttaccaagg agataagatc atgcatgcaa tcaacagaag actgggaaca 1801 tttgaagttg aagatcaaat tgaagcagcc agacaatttt caaaaatggg atttgtggac 1861 aacaaacgaa ttgcaatttg gggctggtca tatggagggt acgtaacctc aatggtcctg 1921 ggatcgggaa gtggcgtgtt caagtgtgga atagccgtgg cgcctgtatc ccggtgggag 1981 tactatgact cagtgtacac agaacgttac atgggtctcc caactccaga agacaacctt 2041 gaccattaca gaaattcaac agtcatgagc agagctgaaa attttaaaca agttgagtac 2101 ctccttattc atggaacagc agatgataac gttcactttc agcagtcagc tcagatctcc 2161 aaagccctgg tcgatgttgg agtggatttc caggcaatgt ggtatactga tgaagaccat 2221 ggaatagcta gcagcacagc acaccaacat atatataccc acatgagcca cttcataaaa 2281 caatgtttct ctttacct //