LOCUS       AK134378                2948 bp    mRNA    linear   HTC 06-OCT-2010
DEFINITION  Mus musculus 13 days embryo male testis cDNA, RIKEN full-length
            enriched library, clone:6030449F04 product:nuclear receptor
            subfamily 5, group A, member 1, full insert sequence.
ACCESSION   AK134378
VERSION     AK134378.1
KEYWORDS    HTC_FLI; HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha;
            Muroidea; Muridae; Murinae; Mus; Mus.
REFERENCE   1  (bases 1 to 2948)
  AUTHORS   Arakawa,T., Carninci,P., Fukuda,S., Hashizume,W., Hayashida,K.,
            Hori,F., Iida,J., Imamura,K., Imotani,K., Itoh,M., Kanagawa,S.,
            Kawai,J., Kojima,M., Konno,H., Murata,M., Nakamura,M., Ninomiya,N.,
            Nishiyori,H., Nomura,K., Ohno,M., Sakazume,N., Sano,H., Sasaki,D.,
            Shibata,K., Shiraki,T., Tagami,M., Tagami,Y., Waki,K., Watahiki,A.,
            Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-MAR-2004) to the DDBJ/EMBL/GenBank databases.
            Contact:Yoshihide Hayashizaki
            The Institute of Physical and Chemical Research (RIKEN), Omics
            Science Center, RIKEN Yokohama Institute; 1-7-22 Suehiro-cho,
            Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
            URL :http://www.osc.riken.jp/
REFERENCE   2
  AUTHORS
  CONSRTM   The FANTOM Consortium, Riken Genome Exploration Research Group and
            Genome Science Group (Genome Network Project Core Group)
  TITLE     The Transcriptional Landscape of the Mammalian Genome
  JOURNAL   Science 309, 1559-1563 (2005)
REFERENCE   3
  AUTHORS
  CONSRTM   RIKEN Genome Exploration Research Group and Genome Science Group
            (Genome Network Project Core Group) and the FANTOM Consortium
  TITLE     Antisense Transcription in the Mammalian Transcriptome
  JOURNAL   Science 309, 1564-1566 (2005)
REFERENCE   4
  AUTHORS
  CONSRTM   The FANTOM Consortium and the RIKEN Genome Exploration Research
            Group Phase I and II Team
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   5
  AUTHORS
  CONSRTM   The RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   6
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
REFERENCE   7
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10, 1617-1630 (2000)
REFERENCE   8
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P.,
            Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M.,
            Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A.,
            Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K.,
            Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M.,
            Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J.,
            Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system-384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10, 1757-1771 (2000)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            Please visit our web site for further details.
            URL:http://www.osc.riken.jp/
            URL:http://fantom.gsc.riken.jp/
            clone information is available at:
            http://fantom.gsc.riken.jp/3/db/annotate/
            main.cgi?masterid=6030449F04
FEATURES             Location/Qualifiers
     source          1..2948
                     /clone="6030449F04"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /db_xref="FANTOM_DB:6030449F04"
                     /db_xref="MGI:3536103"
                     /db_xref="taxon:10090"
                     /dev_stage="13 days embryo"
                     /mol_type="mRNA"
                     /organism="Mus musculus"
                     /sex="male"
                     /strain="C57BL/6J"
                     /tissue_type="testis"
     CDS             203..1591
                     /codon_start=1
                     /note="nuclear receptor subfamily 5, group A, member 1
                     (MGD|MGI:1346833 GB|NM_139051, evidence: BLASTN, 99%,
                     match=2903)"
                     /note="putative"
                     /protein_id="BAE22121.1"
                     /transl_table=1
                     /translation="MDYSYDEDLDELCPVCGDKVSGYHYGLLTCESCKGFFKRTVQNN
                     KHYTCTESQSCKIDKTQRKRCPFCRFQKCLTVGMRLEAVRADRMRGGRNKFGPMYKRD
                     RALKQQKKAQIRANGFKLETGPPMGVPPPPPPPPDYMLPPSLHAPEPKALVSGPPSGP
                     LGDFGAPSLPMAVPGPHGPLAGYLYPAFSNRTIKSEYPEPYASPPQQPGPPYSYPEPF
                     SGGPNVPELILQLLQLEPEEDQVRARIVGCLQEPAKSRSDQPAPFSLLCRMADQTFIS
                     IVDWARRCMVFKELEVADQMTLLQNCWSELLVLDHIYRQVQYGKEDSILLVTGQEVEL
                     STVAVQAGSLLHSLVLRAQELVLQLHALQLDRQEFVCLKFLILFSLDVKFLNNHSLVK
                     DAQEKANAALLDYTLCHYPHCGDKFQQLLLCLVEVRALSMQAKEYLYHKHLGNEMPRN
                     NLLIEMLQAKQT"
     regulatory      2926..2931
                     /note="putative"
                     /regulatory_class="polyA_signal_sequence"
     polyA_site      2948
                     /note="putative"
BASE COUNT          618 a          886 c          820 g          624 t
ORIGIN
        1 ggaggggagg aggaaaggac gatcggacag ggccagtttc cagtccgccg ctgcccgccc
       61 gctgctgggt gaagaagttt ctgagagccc gctagccact gccctacctg aggcctggga
      121 gcctccccac caggaccctg gtgtccagtg tccaccctta tccggctgag aattctcctt
      181 ccgttcagcg gacgccgcgg gcatggacta ttcgtacgac gaggacctgg acgagctgtg
      241 tccagtgtgt ggtgacaagg tgtcgggcta ccactacggg ctgctcacgt gcgagagctg
      301 caagggcttc ttcaagcgca cagtccagaa caacaagcat tacacgtgca ccgagagtca
      361 gagctgcaaa atcgacaaga cgcagcgtaa gcgctgtccc ttctgccgct tccagaagtg
      421 cctgacggtg ggcatgcgcc tggaagctgt gcgtgctgat cgaatgcggg gtggccggaa
      481 caagtttggg cccatgtaca agagagaccg ggccttgaag cagcagaaga aagcacagat
      541 tcgggccaat ggcttcaagc tggagaccgg accaccgatg ggggtgcccc cgccaccccc
      601 tcccccaccg gactacatgt taccccctag cctgcacgca ccggagccca aggccctggt
      661 ctctggccca cccagtgggc cgctgggtga ctttggagcc ccatctctac ccatggctgt
      721 gcctggtccc cacggacctc tggctggcta cctctatcct gccttctcta accgcaccat
      781 caagtctgag tatccagagc cctatgccag ccccccacaa cagccagggc caccctacag
      841 ctatccagag cccttctcag gagggcccaa tgtaccagag ctcatattgc agctgctgca
      901 actagagcca gaggaggacc aggtgcgcgc tcgcatcgtg ggctgtctgc aggagccagc
      961 caaaagccgc tctgaccagc cagcgccctt cagcctcctc tgcagaatgg ccgaccagac
     1021 ctttatctcc attgtcgact gggcacgaag gtgcatggtc tttaaggagc tggaggtggc
     1081 tgaccagatg acactgctgc agaactgttg gagcgagctg ctggtgttgg accacatcta
     1141 ccgccaggtc cagtacggca aggaagacag catcctgctg gttactggac aggaggtgga
     1201 gctgagcaca gtggctgtgc aggctggctc cctgctgcac agcctggtgc tgcgggccca
     1261 agagttagtg ctccagttgc atgcactgca gctggaccgc caggagttcg tctgtctcaa
     1321 gttcctcatc ctcttcagcc tcgatgtgaa attcctgaac aaccacagcc tcgtaaagga
     1381 cgcccaggaa aaggccaacg ctgccctgtt ggattacacc ttgtgtcact acccacactg
     1441 cggggacaaa ttccagcagt tgctattgtg cctggtggag gtgcgggccc tgagcatgca
     1501 ggccaaggag tacctgtacc acaagcattt gggcaacgag atgccccgca acaaccttct
     1561 cattgagatg ctgcaggcca agcagacttg agcctgggtg ccaggcagcg ggcaataggc
     1621 agggatgcca ctgcctccaa aagactcctt gcattaggtg atccaggagc cctgtcacta
     1681 agcccctgcc cctgagctcc agagctgtgt gtttgggcaa ggatgggcgg ggattggccg
     1741 gggcaggttg cctttactag ccattggcct gtgtccgcca cttggagtgc cccaaagggg
     1801 gcttctaacc attccttcct ccatcagccc ccagcttttt ttcctggtat ctgaggtccc
     1861 aggaggaggc tcaggattcc ctggtgggtc tggatgtccc ttgggtcaga ggtcatcctt
     1921 tccctctctc ctgttatcag aggcaaagga aggtctacag gcatcaatga gggcaaagga
     1981 gggggtctcc agactccact gaagcaggaa gtccactgtt gtaaactgag tttgctaaat
     2041 tgggtcccca gaggatacca tgagagtggg tagggcaaaa agagcccttt ccgccctcta
     2101 cccatctaat tctgatcctc tacctgtagg aggactttgg tgtgatcatc cttctcccag
     2161 ggcccggcta cccagggagg aggagtctgg tgtagccaac attcctgccc taaccctgcc
     2221 catcaccagc tggctgggct ggtatttatc tgcaaggttg aagtcactgg gattcttttc
     2281 ctttcaccta gatagtcctt ggaaagtgtg tgagagagaa gtgggcagga gacagactgg
     2341 ggactgagct gggatatggg gactagcatc aaagctttct cctgacatct ctttccaaga
     2401 gtcggggtgg catctgtacc ccacctcacc cccgagaagt gctattgctt gccctctgcc
     2461 tcagccccac taggggaaca acaggaggcc tgctggggct tagagtccgt gcaggtgggg
     2521 atatgggtaa atctaggaga actcacagat ctttatatga ggacagtgct gaggactttc
     2581 tcatggctcc atccttttgg tccctcgcca ctacccttga agctggcttc agttccctgg
     2641 ctgctgcttt gcctcctgaa agccactctg taggaccaag cactcggggg agaggcctaa
     2701 gccatcctct gttccagact ggacatccac tgtctttcct gctttcgcgt cagatttaca
     2761 gcttatgcta ggcccaccca actggacaag gctgtctcct gtcttctact accctggctc
     2821 agcccccacc tctgcccctg aaatgcgtgc tcccaccaag gccagagacc cacagcccca
     2881 agacaagaag tgcccttata aacccctgca gccctgcagc cctgaaataa attttgcaat
     2941 tagtttcc
//