LOCUS AK132305 3960 bp mRNA linear HTC 06-OCT-2010 DEFINITION Mus musculus 15 days embryo head cDNA, RIKEN full-length enriched library, clone:4021404H12 product:forkhead box G1, full insert sequence. ACCESSION AK132305 VERSION AK132305.1 KEYWORDS HTC_FLI; HTC; CAP trapper. SOURCE Mus musculus (house mouse) ORGANISM Mus musculus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus. REFERENCE 1 (bases 1 to 3960) AUTHORS Arakawa,T., Carninci,P., Fukuda,S., Hashizume,W., Hayashida,K., Hori,F., Iida,J., Imamura,K., Imotani,K., Itoh,M., Kanagawa,S., Kawai,J., Kojima,M., Konno,H., Murata,M., Nakamura,M., Ninomiya,N., Nishiyori,H., Nomura,K., Ohno,M., Sakazume,N., Sano,H., Sasaki,D., Shibata,K., Shiraki,T., Tagami,M., Tagami,Y., Waki,K., Watahiki,A., Muramatsu,M. and Hayashizaki,Y. TITLE Direct Submission JOURNAL Submitted (30-MAR-2004) to the DDBJ/EMBL/GenBank databases. Contact:Yoshihide Hayashizaki The Institute of Physical and Chemical Research (RIKEN), Omics Science Center, RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan URL :http://www.osc.riken.jp/ REFERENCE 2 AUTHORS CONSRTM The FANTOM Consortium, Riken Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) TITLE The Transcriptional Landscape of the Mammalian Genome JOURNAL Science 309, 1559-1563 (2005) REFERENCE 3 AUTHORS CONSRTM RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) and the FANTOM Consortium TITLE Antisense Transcription in the Mammalian Transcriptome JOURNAL Science 309, 1564-1566 (2005) REFERENCE 4 AUTHORS CONSRTM The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and II Team TITLE Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs JOURNAL Nature 420, 563-573 (2002) REFERENCE 5 AUTHORS CONSRTM The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium TITLE Functional annotation of a full-length mouse cDNA collection JOURNAL Nature 409, 685-690 (2001) REFERENCE 6 AUTHORS Carninci,P. and Hayashizaki,Y. TITLE High-efficiency full-length cDNA cloning JOURNAL Meth. Enzymol. 303, 19-44 (1999) REFERENCE 7 AUTHORS Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y. TITLE Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes JOURNAL Genome Res. 10, 1617-1630 (2000) REFERENCE 8 AUTHORS Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P., Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A., Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K., Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y. TITLE RIKEN integrated sequence analysis (RISA) system-384-format sequencing pipeline with 384 multicapillary sequencer JOURNAL Genome Res. 10, 1757-1771 (2000) COMMENT cDNA library was prepared and sequenced in Mouse Genome Encyclopedia Project of Genome Exploration Research Group in Riken Genomic Sciences Center and Genome Science Laboratory in RIKEN. Division of Experimental Animal Research in Riken contributed to prepare mouse tissues. Please visit our web site for further details. URL:http://www.osc.riken.jp/ URL:http://fantom.gsc.riken.jp/ clone information is available at: http://fantom.gsc.riken.jp/3/db/annotate/ main.cgi?masterid=4021404H12 FEATURES Location/Qualifiers source 1..3960 /clone="4021404H12" /clone_lib="RIKEN full-length enriched mouse cDNA library" /db_xref="FANTOM_DB:4021404H12" /db_xref="MGI:3533023" /db_xref="taxon:10090" /dev_stage="15 days embryo" /mol_type="mRNA" /organism="Mus musculus" /strain="C57BL/6J" /tissue_type="head" CDS 1592..3037 /codon_start=1 /note="forkhead box G1 (MGD|MGI:1347464 GB|BC064449, evidence: BLASTN, 99%, match=2877)" /note="putative" /protein_id="BAE21092.1" /transl_table=1 /translation="MLDMGDRKEVKMIPKSSFSINSLVPEAVQNDNHHASHGHHNSHH PQHHHHHHHHHHPPPPAPQPPPPPPQQQQQQPPPAPQPPQARGAPAADDDKGPQPLLL PPSTALDGAKADALGAKGEPGGGPAELAPVGPDEKEKGAGAGGEEKKGAGEGGKDGEG GKEGDKKNGKYEKPPFSYNALIMMAIRQSPEKRLTLNGIYEFIMKNFPYYRENKQGWQ NSIRHNLSLNKCFVKVPRHYDDPGKGNYWMLDPSSDDVFIGGTTGKLRRRSTTSRAKL AFKRGARLTSTGLTFMDRAGSLYWPMSPFLSLHHPRASSTLSYNGTTSAYPSHPMPYS SVLTQNSLGNNHSFSTANGLSVDRLVNGEIPYATHHLTAAALAASVPCGLSVPCSGTY SLNPCSVNLLAGQTSYFFPHVPHPSMTSQTSTSMSARAASSSTSPQAPSTLPCESLRP SLPSFTTGLSGGLSDYFTHQNQGSSSNPLIH" regulatory 3940..3945 /note="putative" /regulatory_class="polyA_signal_sequence" polyA_site 3960 /note="putative" BASE COUNT 831 a 1261 c 1046 g 822 t ORIGIN 1 tttccagcag catccaagaa gggtaagagc tgatttcgcc ctactggtgt ttcaagattg 61 aggttttaaa gagataaagt atttgtgaaa tgggaaaaag gagcaaacat aggatttaaa 121 aaaaaaaaaa acggaagtag acgcccactc atattttctt caatgtcaag caaaatgaaa 181 atatttccag gatgatcgat cgcggctacc ggcttccagt actgcctaga agctgaagag 241 gaggtggagt gcccgaggag acaactgcca ccgggtcacc cgcgcgtgag cgcgactctg 301 ggagtgaagc gagacggagg gagaacacac gtgttacctg ctttatttcg ggactgtttg 361 ggtctgtgct cgcccggggc agctctccgc ccgcccgccc gcgtgcgtgg aaggcctcca 421 cagaacgcac ccaccgctca gccgcccccg ctcgccgccc tcagcccagc ttcacagccg 481 agctcgccgc gggccgcagg aagctgtaag taccggcctg gccgtggccg cccccgcccc 541 accttccagg ccccgttccc ccgatgcccg gctctccgga gcttgctggc ccgctcgctc 601 ccagggcgct gccggggccg ccgcgtcccc agggttcacg ccctctcccg caccctcgca 661 gctcgcccct ccgccgcgcc ccccgccccg ctcttccaga caaaattgga tacaaacttt 721 aatcaataag gataaatatt gacgcgcaaa ggaaaataat cccatacagg gaggaggcca 781 cagtcgagcg gccgcggccc ggggcgcacg ccccacgcta atcctggccg ggtgggcgat 841 ctcccgccgg tccccgcggc tccaggcctg ggccccgcca cccggagggg gcggtgagcg 901 gggggcgggg ccggggcggg gcggggatgg gcgggccctg ggcccccgat tggtcgacgg 961 ctagccagac gctcctgcac gccggcggca ctgattggtt cggcagtagg aaaggttaaa 1021 ccaaaaattt ttttacagcc ctagtgtgcg cctgtagctc ggaaaattaa ttgtggctat 1081 agctgcctcg atcgctgtct cctagcctcg ctgcggccgc tccgggacgc gcccgcccgc 1141 cgcccggctc tccccccctt cgggctgccg ctgctgctgc tgtgactgct gcggcgcgag 1201 gaggaggagg cagcggggga gggggaggcc gggcgcggaa cggagcgggg cgctgcaccc 1261 cgggcgacgg gttgcttctg cctctagcct ccctctctct ctctctctct ctctctctct 1321 ctctctctct ctctctctct ctctctccct ctctctctct ctctccctct ctctttctct 1381 ctctaattct tgaggggtgg ttgcagcttt tgctacatgc cttgccagcg ccggagcctg 1441 cggtccaact gcgctgctgc cggagcgctc agtgccgcct ccgctgcccg ctccccccgc 1501 gccccactcc gaacccgctg gtcgcccgcc gcgctgctgc ccggctcccg tgccgccgcc 1561 gccgccgccg ccgccccccg acgcctgggt gatgctggac atgggagata ggaaagaggt 1621 gaaaatgatt cccaagtcct cgttcagcat caacagcctg gtccccgagg ccgtccagaa 1681 cgacaaccac cacgcgagcc acggccacca caacagccac cacccccagc atcaccatca 1741 tcatcaccac caccaccacc cgccgccgcc cgcgccccag ccgcctccac cgccgcccca 1801 gcagcagcag cagcagccgc ccccggcccc gcagcccccg caggcgcgcg gcgccccagc 1861 agccgacgac gacaagggtc cccagccgct cctgctcccg ccctccaccg ccctggacgg 1921 ggccaaggct gacgcacttg gagccaaagg cgagccgggc ggcgggccgg cggagctggc 1981 gcccgtcggg ccggacgaga aggagaaggg cgcgggcgcc gggggggagg agaagaaagg 2041 ggcgggcgag ggcggcaagg acggggaggg gggcaaggag ggcgacaaga agaacggcaa 2101 gtacgagaag ccgccgttca gctacaacgc gctcatcatg atggccatca ggcagagtcc 2161 cgagaagcgc ctgacgctca atggcatcta tgagttcatc atgaagaact tcccctacta 2221 ccgcgagaac aagcagggct ggcagaactc catccgccac aacctgtccc tcaacaagtg 2281 cttcgtgaag gtaccgcgcc actacgacga cccgggcaag ggcaactact ggatgctcga 2341 cccgtcgagc gacgacgtgt tcatcggcgg cacgaccggc aagctgcggc gccgctccac 2401 cacgtctcgg gccaagctgg cctttaagcg cggggcgcgc ctcacctcca ccggcctcac 2461 cttcatggac cgcgccggct ccctctactg gcccatgtcg cccttcctgt ccctgcacca 2521 cccccgcgcc agcagcactt tgagttacaa cgggaccacg tcggcctacc ccagccaccc 2581 catgccctac agctccgtgt tgactcaaaa ctcgctgggc aacaaccact ccttctccac 2641 cgccaacggg ctgagtgtgg accggctggt caacggggag atcccgtacg ccacgcacca 2701 cctcacggcc gctgcgctcg ccgcctcggt gccctgcggc ctgtcggtgc cctgctccgg 2761 gacctactcc ctcaacccct gctccgtcaa cctgctcgcg ggccagacca gttacttttt 2821 cccccacgtc ccgcacccgt caatgacttc gcagaccagc acgtccatga gcgcccgggc 2881 cgcgtcctcc tctacgtcgc cgcaggcccc ctcgaccctg ccctgtgagt ctttaagacc 2941 ctctttgcca agttttacga caggactgtc cgggggactg tctgattatt tcacacatca 3001 aaatcagggg tcttcttcca accctttaat acattaacat ccgggggacc agactgtaag 3061 tgaacgtttt acacacattt gcattgtaaa tgataattaa aaaaataagt ccaggtattt 3121 tttattaagc cccccctccc catttctgta cgtttgttca gtctttaggg ttgtttacta 3181 ttctaacacg gtgtggagtg tcagcgaggt gcaatgtggg agaatacatt gtagaatata 3241 aggtttggac gtcaaattat agtagaatgt gtatctaaat agtgactgct ttgccatttc 3301 attcaaacct gacaagtcta tctcaacagg ctgccagatt tccatgtgtg cagtattata 3361 agttatcatg gagctatctg gtggacgcag gccttgagaa caacctaaat tatgaagaga 3421 gttttaaaat gttaaattgt aatttgtatt taagaatttg tagtaaaggt gcccaaggaa 3481 ttatattggc catttattgt tttgtccttt ttctttaaag aactgtttct ttccttttgt 3541 ttacttttag accaaagatt ggattctagc aaatgcactt ggtatactaa gtattaaaac 3601 aagtaaacaa acaaacgaaa aaggaaggtt gtttagttgg caacactgcc cattcaattg 3661 aatccgaagg agacaaaatt aaggattgcc ttcagtttgt gttgtgtata tttcgatgta 3721 tgtggtcact aacaggtcac ttttattttt tctaaatgta gtgaaatgtt aatacctatt 3781 gtacttatag gtaaaccttg caaatatgta acctgtgttg cgcaaatgcc gcatcaattt 3841 gagtgattgt taatgttgtc ttaaaatttc ttgattgtga tactgtggtc atatgcccgt 3901 gtttgtcact tacaaaaatg tttactatga acacacagaa ataaaaaata ggctaaattc //