LOCUS AK156391 3845 bp mRNA linear HTC 06-OCT-2010 DEFINITION Mus musculus activated spleen cDNA, RIKEN full-length enriched library, clone:F830019M02 product:diaphanous homolog 1 (Drosophila), full insert sequence. ACCESSION AK156391 VERSION AK156391.1 KEYWORDS HTC_FLI; HTC; CAP trapper. SOURCE Mus musculus (house mouse) ORGANISM Mus musculus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus. REFERENCE 1 (bases 1 to 3845) AUTHORS Arakawa,T., Carninci,P., Fukuda,S., Hashizume,W., Hayashida,K., Hori,F., Iida,J., Imamura,K., Imotani,K., Itoh,M., Kanagawa,S., Kawai,J., Kojima,M., Konno,H., Murata,M., Nakamura,M., Ninomiya,N., Nishiyori,H., Nomura,K., Ohno,M., Sakazume,N., Sano,H., Sasaki,D., Shibata,K., Shiraki,T., Tagami,M., Tagami,Y., Waki,K., Watahiki,A., Muramatsu,M. and Hayashizaki,Y. TITLE Direct Submission JOURNAL Submitted (30-MAR-2004) Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL :http://www.osc.riken.jp/ REFERENCE 2 AUTHORS CONSRTM The FANTOM Consortium, Riken Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) TITLE The Transcriptional Landscape of the Mammalian Genome JOURNAL Science 309, 1559-1563 (2005) REFERENCE 3 AUTHORS CONSRTM RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) and the FANTOM Consortium TITLE Antisense Transcription in the Mammalian Transcriptome JOURNAL Science 309, 1564-1566 (2005) REFERENCE 4 AUTHORS CONSRTM The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and II Team TITLE Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs JOURNAL Nature 420, 563-573 (2002) REFERENCE 5 AUTHORS CONSRTM The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium TITLE Functional annotation of a full-length mouse cDNA collection JOURNAL Nature 409, 685-690 (2001) REFERENCE 6 AUTHORS Carninci,P. and Hayashizaki,Y. TITLE High-efficiency full-length cDNA cloning JOURNAL Meth. Enzymol. 303, 19-44 (1999) REFERENCE 7 AUTHORS Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y. TITLE Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes JOURNAL Genome Res. 10, 1617-1630 (2000) REFERENCE 8 AUTHORS Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P., Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A., Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K., Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y. TITLE RIKEN integrated sequence analysis (RISA) system-384-format sequencing pipeline with 384 multicapillary sequencer JOURNAL Genome Res. 10, 1757-1771 (2000) COMMENT cDNA library was prepared and sequenced in Mouse Genome Encyclopedia Project of Genome Exploration Research Group in Riken Genomic Sciences Center and Genome Science Laboratory in RIKEN. Division of Experimental Animal Research in Riken contributed to prepare mouse tissues. Tissues were provided by Dr. John Todd (Dept. of Medical Genetics Wellcome Trust Centre for Molecular Mechanisms in Disease Wellcome Trust/MRC building Addenbrookes Hospital Cambridge) whose assistance we gratefully acknowledge. Please visit our web site for further details. URL:http://www.osc.riken.jp/ URL:http://fantom.gsc.riken.jp/ clone information is available at: http://fantom.gsc.riken.jp/3/db/annotate/ main.cgi?masterid=F830019M02 FEATURES Location/Qualifiers source 1..3845 /clone="F830019M02" /db_xref="FANTOM_DB:F830019M02" /db_xref="MGI:3557021" /db_xref="taxon:10090" /mol_type="mRNA" /note="clone_lib: RIKEN full-length enriched mouse cDNA library" /organism="Mus musculus" /strain="NOD" /tissue_type="activated spleen" CDS 70..2109 /codon_start=1 /note="diaphanous homolog 1 (Drosophila) (MGD|MGI:1194490 GB|NM_007858, evidence: BLASTN, 99%, match=2383)" /note="putative" /protein_id="BAE33697.1" /transl_table=1 /translation="MASLSAVVVAPSVSSSAAVPPAPPLPGDSGTVIPPPPPPPPLPG GVVPPSPSLPPGTCIPPPPPLPGGACIPPPPQLPGSAAIPPPPPLPGVASIPPPPPLP GGTGIPPPPPPLPGSVGVPPPPPLPGGPGLPPPPPPFPGAPGIPPPPPGMGVPPPPPF GFGVPAAPVLPFGLTPKKVYKPEVQLRRPNWSKFVAEDLSQDCFWTKVKEDRFENNEL FAKLTLAFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDSKTAQNLSIFLGSFRM PYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKEEYDDLAESEQFGVVM GTVPRLRPRLNAILFKLQFSEQVENIKPEIVSVTAACEELRKSENFSSLLELTLLVGN YMNAGSRNAGAFGFNISFLCKLRDTKSADQKMTLLHFLAELCENDHPEVLKFPDELAH VEKASRVSAENLQKSLDQMKKQIADVERDVQNFPAATDEKDKFVEKMTSFVKDAQEQY NKLRMMHSNMETLYKELGDYFVFDPKKLSVEEFFMDLHNFRNMFLQAVKENQKRRETE EKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLEALQSGAAFRRKRG PRQVNRKAGCAVTSLLASELTKDDAMAPGPVKVPKKSEGVPTILEEAKELVGRAS" regulatory 3830..3835 /note="putative" /regulatory_class="polyA_signal_sequence" polyA_site 3845 /note="putative" BASE COUNT 895 a 1041 c 980 g 929 t ORIGIN 1 gggtaggtgt ccaagctgac aggagaggtt gccaagctgt caaaagaact agaagatgcc 61 aagaatgaaa tggcttctct ctctgctgtg gttgttgcac cttctgtttc tagcagtgct 121 gctgttcccc ctgcccctcc tctgcctggt gactctggca ctgttattcc acctccccca 181 cccccacctc ctcttcctgg aggtgtggtc ccaccatccc cttctctgcc tccaggtact 241 tgtatccctc cacctcctcc tttacctgga ggtgcttgta taccccctcc cccccagttg 301 cctggcagtg ctgccatccc tccacctcct cctctacctg gagttgcttc catcccccca 361 cctccccctt tgcctggagg tacaggtata ccaccaccac ctcctccttt gcctggaagt 421 gttggcgttc ccccaccgcc tcccttgcct ggaggaccag gactgcctcc tccccctccc 481 ccttttcctg gagcacctgg cattcctcca cctccacctg gtatgggcgt gcctccacct 541 cccccctttg gatttggggt tcctgcggcc ccagttctgc catttggatt aacccccaaa 601 aaagtttata agccagaggt gcagctccgg aggccaaact ggtccaagtt tgtggctgag 661 gacctttccc aggactgctt ctggacaaag gtgaaggagg accgctttga gaacaatgaa 721 ctttttgcca aacttaccct tgccttctcc gcccagacca agacttctaa agccaagaag 781 gatcaagaag gtggagaaga aaagaaatct gttcaaaaga agaaagtaaa agagctgaaa 841 gtgctggatt caaagacagc gcagaatctc tcaatctttt tgggttcatt ccgcatgccc 901 tatcaagaga taaagaacgt tatcctggag gtgaatgagg ctgttctcac agagtctatg 961 atccagaacc tcattaaaca gatgccagag ccagagcagc taaagatgct ctctgaactg 1021 aaggaggagt acgatgatct ggctgagtca gagcagtttg gtgtggtgat gggcacagtg 1081 ccccgccttc ggcctcgcct caacgccatc ctcttcaagc tacagttcag tgagcaagtt 1141 gagaacatca agccagagat cgtgtctgtc accgccgcat gcgaagagct gcgtaagagt 1201 gagaacttct ccagcctcct ggagctcaca ctgctggtcg gaaactatat gaatgcgggc 1261 tccaggaatg ctggtgcttt cggcttcaat atcagcttcc tttgtaagct tcgagacacc 1321 aagtctgcag atcagaagat gactctgttg catttcttgg ctgagttatg tgagaatgac 1381 caccccgaag tcctcaagtt tcctgatgag cttgcccatg tagagaaagc cagcagagtc 1441 tctgctgaga acctgcagaa gagcttagat cagatgaaga agcagattgc ggacgtggag 1501 cgcgatgttc agaatttccc agctgccact gacgagaagg acaagtttgt tgagaagatg 1561 accagctttg tgaaggatgc acaggaacag tataacaaac tacggatgat gcactccaac 1621 atggagaccc tctataagga gctaggtgac tacttcgtct ttgaccctaa gaagttgtct 1681 gtagaggaat tctttatgga tctgcacaac tttaggaata tgtttttgca agcagtcaag 1741 gaaaaccaga agcgccggga aacagaagaa aagatgcgga gagcaaaatt agccaaggag 1801 aaggcagaaa aagagcgact ggagaagcag cagaagcgcg agcagctcat cgacatgaac 1861 gcagaggggg atgagacagg tgtgatggac agtcttctag aagctctgca gtcaggggca 1921 gcattccgac ggaagagagg gccccggcag gtcaacagga aggctgggtg tgcagtcaca 1981 tctctgctag cctcggagct gaccaaggat gatgccatgg ctcctggtcc tgttaaggta 2041 cccaagaaaa gtgaaggagt ccccacaatc ctggaagaag ccaaggagct ggttggccgt 2101 gcaagctaag ctgggcttta tggccattgc tgctcctagg cgaagcccag actgtcgacc 2161 tgcagcatgg gcctaaatgg tcaaggagat agtggccact ccaccacctg accctgtctt 2221 tctgtctggc ctgctgctct ctgaacacca catacagctt cagctgcctg gaggccaaaa 2281 ggaaggggca gtgtaggagt ggcctgagcc cagcccagcc agccctggct gttgtattac 2341 caaagcaggg tccgtgtttg ctgccttaac cctgtctcct ctatgttacc cagaggtcct 2401 ggtctcagac agaacccagc ctgctttctc agccccactc tctagtgggc cttccctagg 2461 tcaatcttgc tgcatttgtg cttttctttt gtggtttctc tggccctgag aatagcatgg 2521 gacttgtgaa cctttgggct aggtcttttc actgctgtca cctctgcttt tcctcctggc 2581 aattatttat tactagtgct gtggcattgg gagctgcttc tgcaaaagca ggaagcaaat 2641 cccaccctta ccccaccttc ctgggaaagg tctcccgggc aaagggtctc aaccactctc 2701 cccattcgtt ccactgtggc ccaggtctga gccataggca ggtagctagg gcatagtgac 2761 agcgtggggt ccgccccctg cccgctgcca tgctttatgc ccttgcctcc ctggtccctc 2821 tgcaccagcc ccctcacctc caccctgcaa ctgagccctg ctcccccatg ccttccccag 2881 agctttggtg catccatttg gagtgtgggc tgtagtgtga ccgtcccaac acctcacccc 2941 ctccatccaa gggattggga ggtggagcct cctggctgag aaattgtttt gcaaatggat 3001 ctatttttat atgaaaaaaa attttttaag aaaaataact ttggtcccct tccccctcca 3061 taataaaaga ggctttgatg gtagggtcaa gcattcctgg ggtcggtcag tcctgaacat 3121 ggcctaaata ttgatcccat tcctcaagac taaatcgtga ggccaaggca gacccttccc 3181 tagtcagttc tgaggcaaga gagatttctc tgggtgggct gggccacacc atagggtagc 3241 aagatgctgt gttttcatga atgtattaaa ttatcaagcc aaacgtgcag atgctgggcg 3301 gacgtggaga agtggcccag ggtgcctgct ccccatatga acagtgagcc cgactgggtg 3361 gtccacactg caggaagcac aggttctctg gctgagggtc actgtgtggc ctcctccgag 3421 gcccctttga gcttcccctc caaggcgaca gagaaggaag gacactgtcc gggcttgaca 3481 ctgaattgga gttgccagag catgggggag ttagggcaca caagggccag tacagggggc 3541 cttgtggcca gggatgattg attgggatgc tagaacttgt gtccaaagga acagtggaag 3601 ctcctgggct acgtcctggg aagactgaag taggggttgg ctcttccctg agctgccacc 3661 ttgctcgctg acatctgggg ctttgaccct tttctttttt taatcacttt tgctaaaatg 3721 catttaatta aaaaaatttt taagtagagg ggataaaatg caaacccctt gcctcttggg 3781 cttttgggat gtcatcacct ccagtttgat ggatttgttt ccaactgtca ataaagcatt 3841 gaaac //