LOCUS       AK004791                2591 bp    mRNA    linear   HTC 06-OCT-2010
DEFINITION  Mus musculus adult male lung cDNA, RIKEN full-length enriched
            library, clone:1200015I03 product:HYPOTHETICAL 42.5 KDA PROTEIN
            homolog [Mus musculus], full insert sequence.
ACCESSION   AK004791
VERSION     AK004791.1
KEYWORDS    HTC_FLI; HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha;
            Muroidea; Muridae; Murinae; Mus; Mus.
REFERENCE   1  (bases 1 to 2591)
  AUTHORS   Adachi,J., Aizawa,K., Akahira,S., Akimura,T., Arai,A., Aono,H.,
            Arakawa,T., Bono,H., Carninci,P., Fukuda,S., Fukunishi,Y.,
            Furuno,M., Hanagaki,T., Hara,A., Hayatsu,N., Hiramoto,K.,
            Hiraoka,T., Hori,F., Imotani,K., Ishii,Y., Itoh,M., Izawa,M.,
            Kasukawa,T., Kato,H., Kawai,J., Kojima,Y., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Nishi,K.,
            Nomura,K., Numazaki,R., Ohno,M., Okazaki,Y., Okido,T., Owa,C.,
            Saito,H., Saito,R., Sakai,C., Sakai,K., Sano,H., Sasaki,D.,
            Shibata,K., Shibata,Y., Shinagawa,A., Shiraki,T., Sogabe,Y.,
            Suzuki,H., Tagami,M., Tagawa,A., Takahashi,F., Tanaka,T.,
            Tejima,Y., Toya,T., Yamamura,T., Yasunishi,A., Yoshida,K.,
            Yoshino,M., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-JUL-2000) to the DDBJ/EMBL/GenBank databases.
            Contact:Yoshihide Hayashizaki
            The Institute of Physical and Chemical Research (RIKEN), Omics
            Science Center, RIKEN Yokohama Institute; 1-7-22 Suehiro-cho,
            Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
            URL :http://www.osc.riken.jp/
REFERENCE   2
  AUTHORS   CONSRTM The FANTOM Consortium, Riken Genome Exploration Research
            Group and Genome Science Group (Genome Network Project Core
            Group)
  TITLE     The Transcriptional Landscape of the Mammalian Genome
  JOURNAL   Science 309, 1559-1563 (2005)
REFERENCE   3
  AUTHORS   CONSRTM RIKEN Genome Exploration Research Group and Genome
            Science Group (Genome Network Project Core Group) and the FANTOM
            Consortium
  TITLE     Antisense Transcription in the Mammalian Transcriptome
  JOURNAL   Science 309, 1564-1566 (2005)
REFERENCE   4
  AUTHORS   CONSRTM The FANTOM Consortium and the RIKEN Genome Exploration
            Research Group Phase I and II Team
  TITLE     Analysis of the mouse transcriptome based on functional
            annotation of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   5
  AUTHORS   CONSRTM The RIKEN Genome Exploration Research Group Phase II
            Team and the FANTOM Consortium
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   6
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
REFERENCE   7
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new
            genes
  JOURNAL   Genome Res.
            10, 1617-1630 (2000)
REFERENCE   8
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N.,
            Carninci,P., Konno,H., Akiyama,J., Nishi,K., Kitsunai,T.,
            Tashiro,H., Itoh,M., Sumi,N., Ishii,Y., Nakamura,S., Hazama,M.,
            Nishine,T., Harada,A., Yamamoto,R., Matsumoto,H., Sakaguchi,S.,
            Ikegami,T., Kashiwagi,K., Fujiwake,S., Inoue,K., Togawa,Y.,
            Izawa,M., Ohara,E., Watahiki,M., Yoneda,Y., Ishikawa,T.,
            Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., Okazaki,Y.,
            Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system-384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10, 1757-1771 (2000)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in
            Riken Genomic Sciences Center and Genome Science Laboratory in
            RIKEN. Division of Experimental Animal Research in Riken
            contributed to prepare mouse tissues.
            Please visit our web site for further details.
            URL:http://www.osc.riken.jp/
            URL:http://fantom.gsc.riken.jp/
            clone information is available at:
            http://fantom.gsc.riken.jp/3/db/annotate/main.cgi?masterid=1200015I03
FEATURES             Location/Qualifiers
     source          1..2591
                     /clone="1200015I03"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /db_xref="FANTOM_DB:1200015I03"
                     /db_xref="MGI:1896848"
                     /db_xref="taxon:10090"
                     /dev_stage="adult"
                     /mol_type="mRNA"
                     /organism="Mus musculus"
                     /sex="male"
                     /strain="C57BL/6J"
                     /tissue_type="lung"
     CDS             100..2037
                     /codon_start=1
                     /note="HYPOTHETICAL 42.5 KDA PROTEIN homolog [Mus
                     musculus] (SPTR|Q9JHR3, evidence: FASTY, 99.5%ID,
                     100%length, match=1107)"
                     /note="putative"
                     /protein_id="BAB23567.2"
                     /transl_table=1
                     /translation="MAAMLGDAIMVAKGLAKLTQAAVETHLQNLGLGGELLLAARALQ
                     STAVEQFSMVFGKVQGQDKHEDSYATENFEDLEAEVQFSTPQAAGTSLDFSAASSLDQ
                     SLSPSHSQGPAPAYASSGPFREAGLPGQATSPMGRVNGRLFVDHRDLFLANGIQRRSF
                     HQDQSSVGGLTAEDIEKARQAKARPESKPHKQMLSERARERKVPVTRIGRLANFGGLA
                     VGLGIGALAEVAKKSLRSENSTGKKAVLDSSPFLSEANAERIVSTLCKVRGAALKLGQ
                     MLSIQDDAFINPHLAKIFERVRQSADFMPLKQMTKTLNSDLGPHWRDKLEYFEERPFA
                     AASIGQVHLARMKGGREVAMKIQYPGVAQSINSDVNNLMAVLNMSNMLPEGLFPEHLI
                     DVLRRELTLECDYQREAAYAKKFRELLKDHPFFYVPEIVDELCSPHVLTTELISGFPL
                     DQAEGLSQEVRNEICYNILVLCLRELFEFHVMQTDPNWSNFFYDPQQHKVALLDFGAT
                     REYDRSFTDLYIQVIRAAADQDREAVLKKSIEMKFLTGYEVKAMEDAHLDAILILGEA
                     FASEEPFDFGTQSTTEKIHNLIPVMLKHRLIPPPEETYSLHRKMGGSFLICSKLKARF
                     PCKAMFEEAYSNYCRMKSGLQ"
     regulatory      2566..2571
                     /note="putative"
                     /regulatory_class="polyA_signal_sequence"
     polyA_site      2591
                     /note="putative"
BASE COUNT      586 a    697 c    757 g    551 t
ORIGIN
        1 tagaagcgga gagtgtgcgg agcgctcgca gcggggcgag cgcgcggagg cgttgcgttg
       61 cggacggagg agctaccttc ctgcagcccg ctctgaagga tggctgctat gttgggggat
      121 gccatcatgg tggccaaagg ccttgccaag ctgacccaag cagcggtgga aacccacttg
      181 cagaacctgg gccttggtgg ggagctcctc ctggcggcca gggccctgca gtctacagct
      241 gtggagcagt tcagcatggt cttcgggaag gtgcagggtc aggataagca tgaagattca
      301 tatgccactg agaactttga agatctggaa gccgaagttc agttctcaac accacaggca
      361 gctggaacct ccctggactt ctctgcggcc tcttccctgg accagtcact gtctccatcc
      421 cacagtcagg gaccggcccc tgcctatgct tccagtgggc ctttcaggga agctgggctc
      481 cctggacagg ccacctctcc tatgggcaga gtcaatggaa ggctctttgt agatcacaga
      541 gacttgttct tggccaacgg catccagcga agatccttcc accaggacca gtcctccgtg
      601 ggaggcctca cggctgaaga cattgaaaag gcccggcagg ccaaggctcg ccctgagagc
      661 aagccacaca agcagatgct cagtgagcgg gctcgggagc ggaaggtgcc agttacccgg
      721 attgggcggt tggccaactt tggaggtctg gctgtaggtc tgggaattgg ggcgttggct
      781 gaagttgcca agaagagtct gcgttctgag aactccacag gaaaaaaagc cgtgctggat
      841 tctagcccct tcctgtcaga ggcaaatgca gagcgcattg tgagtacact gtgcaaggtg
      901 cgtggggccg cactgaagct gggccagatg ctgagcatcc aggatgatgc cttcatcaac
      961 cctcacctgg ccaagatctt tgagcgggtg aggcagagtg ctgacttcat gccactgaaa
     1021 cagatgacga aaaccctcaa cagcgacctg ggcccccact ggagggacaa gctggagtac
     1081 tttgaagagc ggccctttgc agcggcctcc atagggcagg tgcacctggc ccgcatgaag
     1141 ggtggccgtg aggtggccat gaagatccag taccctggtg tggcccagag cattaacagt
     1201 gacgtcaaca acctcatggc tgtgctgaac atgagtaaca tgcttccaga aggcctgttc
     1261 cctgagcacc tgattgatgt gctgagacgg gagcttaccc ttgagtgtga ctaccagcgg
     1321 gaggctgcct atgccaagaa gttcagggaa ctgctgaagg accatccctt cttctatgtg
     1381 cctgagattg tggatgagtt gtgcagtccc catgtgctga ccacagagct catatcaggc
     1441 ttccccttgg accaggcaga agggctgagc caggaagtga ggaatgagat ctgctacaac
     1501 attttggtgc tgtgcctgag agagctgttt gagttccatg tcatgcagac tgaccccaac
     1561 tggtccaatt tcttttatga ccctcagcag cacaaggtgg ctctcctgga cttcggcgca
     1621 actagagaat atgacagatc ctttactgac ctctacattc aggtcatccg ggccgctgct
     1681 gaccaggaca gggaggcagt gctaaagaaa tccatagaga tgaagttcct taccggttat
     1741 gaggtcaagg ccatggagga tgctcacctg gatgctattc tcatcctggg ggaggccttt
     1801 gcctccgaag agcccttcga cttcggtacc cagagcacta ctgagaagat ccacaacctg
     1861 attcccgtca tgctgaagca ccgcctcatc ccacccccag aggagaccta ctctctgcac
     1921 cgcaagatgg gtggctcttt cctcatctgc tctaagctga aggcccgctt cccctgtaag
     1981 gccatgttcg aggaagcgta cagtaactac tgcaggatga agtctgggct gcagtaggtg
     2041 aggtgtggtc caggctaact aggggccatg ccctccttca ggctcagtca agcagaggga
     2101 ggacgttgaa agggagccac ggcctctatg aataggaaga gagcgttgcg tagaccctcc
     2161 tattgcttct gtactgtcca gacaaaaagg caggcttctt ttgccatctc ctccctgctg
     2221 tggatgtaca accaggaaca ggagcaatgc agtactaaac tcaacacagc cacctttggg
     2281 gtctagttgg ccatgggaat gaattgtgtg tttctgggtg agcttatgtc cctgcccttt
     2341 gccaccccag cctgtggcct ggagctagaa gctcaggaca gcagggtagc caccctagct
     2401 cagtctcagc atggacaaca gaccctttta aaagcagggc tgacattgga tgagcagcag
     2461 ggctccagca tgccccagga gagggagcta tcaccccacc ccaggggagc gacacaggct
     2521 ctttgtttgt atgtcttgta attacacatt catacctttg cgttcaataa agaagtgcgt
     2581 tggggactgc c
//