LOCUS       AK004791                2591 bp    mRNA    linear   HTC 06-OCT-2010
DEFINITION  Mus musculus adult male lung cDNA, RIKEN full-length enriched
            library, clone:1200015I03 product:HYPOTHETICAL 42.5 KDA PROTEIN
            homolog [Mus musculus], full insert sequence.
ACCESSION   AK004791
VERSION     AK004791.1
KEYWORDS    HTC_FLI; HTC; CAP trapper.
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha;
            Muroidea; Muridae; Murinae; Mus; Mus.
REFERENCE   1  (bases 1 to 2591)
  AUTHORS   Adachi,J., Aizawa,K., Akahira,S., Akimura,T., Arai,A., Aono,H.,
            Arakawa,T., Bono,H., Carninci,P., Fukuda,S., Fukunishi,Y.,
            Furuno,M., Hanagaki,T., Hara,A., Hayatsu,N., Hiramoto,K.,
            Hiraoka,T., Hori,F., Imotani,K., Ishii,Y., Itoh,M., Izawa,M.,
            Kasukawa,T., Kato,H., Kawai,J., Kojima,Y., Konno,H., Kouda,M.,
            Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Nishi,K.,
            Nomura,K., Numazaki,R., Ohno,M., Okazaki,Y., Okido,T., Owa,C.,
            Saito,H., Saito,R., Sakai,C., Sakai,K., Sano,H., Sasaki,D.,
            Shibata,K., Shibata,Y., Shinagawa,A., Shiraki,T., Sogabe,Y.,
            Suzuki,H., Tagami,M., Tagawa,A., Takahashi,F., Tanaka,T.,
            Tejima,Y., Toya,T., Yamamura,T., Yasunishi,A., Yoshida,K.,
            Yoshino,M., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Direct Submission
  JOURNAL   Submitted (10-JUL-2000) to the DDBJ/EMBL/GenBank databases.
            Contact:Yoshihide Hayashizaki
            The Institute of Physical and Chemical Research (RIKEN), Omics
            Science Center, RIKEN Yokohama Institute; 1-7-22 Suehiro-cho,
            Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
            URL    :http://www.osc.riken.jp/
REFERENCE   2
  AUTHORS   
  CONSRTM   The FANTOM Consortium, Riken Genome Exploration Research Group and
            Genome Science Group (Genome Network Project Core Group)
  TITLE     The Transcriptional Landscape of the Mammalian Genome
  JOURNAL   Science 309, 1559-1563 (2005)
REFERENCE   3
  AUTHORS   
  CONSRTM   RIKEN Genome Exploration Research Group and Genome Science Group
            (Genome Network Project Core Group) and the FANTOM Consortium
  TITLE     Antisense Transcription in the Mammalian Transcriptome
  JOURNAL   Science 309, 1564-1566 (2005)
REFERENCE   4
  AUTHORS   
  CONSRTM   The FANTOM Consortium and the RIKEN Genome Exploration Research
            Group Phase I and II Team
  TITLE     Analysis of the mouse transcriptome based on functional annotation
            of 60,770 full-length cDNAs
  JOURNAL   Nature 420, 563-573 (2002)
REFERENCE   5
  AUTHORS   
  CONSRTM   The RIKEN Genome Exploration Research Group Phase II Team and the
            FANTOM Consortium
  TITLE     Functional annotation of a full-length mouse cDNA collection
  JOURNAL   Nature 409, 685-690 (2001)
REFERENCE   6
  AUTHORS   Carninci,P. and Hayashizaki,Y.
  TITLE     High-efficiency full-length cDNA cloning
  JOURNAL   Meth. Enzymol. 303, 19-44 (1999)
REFERENCE   7
  AUTHORS   Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
            Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
  TITLE     Normalization and subtraction of cap-trapper-selected cDNAs to
            prepare full-length cDNA libraries for rapid discovery of new genes
  JOURNAL   Genome Res. 10, 1617-1630 (2000)
REFERENCE   8
  AUTHORS   Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N.,
            Carninci,P., Konno,H., Akiyama,J., Nishi,K., Kitsunai,T.,
            Tashiro,H., Itoh,M., Sumi,N., Ishii,Y., Nakamura,S., Hazama,M.,
            Nishine,T., Harada,A., Yamamoto,R., Matsumoto,H., Sakaguchi,S.,
            Ikegami,T., Kashiwagi,K., Fujiwake,S., Inoue,K., Togawa,Y.,
            Izawa,M., Ohara,E., Watahiki,M., Yoneda,Y., Ishikawa,T., Ozawa,K.,
            Tanaka,T., Matsuura,S., Kawai,J., Okazaki,Y., Muramatsu,M.,
            Inoue,Y., Kira,A. and Hayashizaki,Y.
  TITLE     RIKEN integrated sequence analysis (RISA) system-384-format
            sequencing pipeline with 384 multicapillary sequencer
  JOURNAL   Genome Res. 10, 1757-1771 (2000)
COMMENT     cDNA library was prepared and sequenced in Mouse Genome
            Encyclopedia Project of Genome Exploration Research Group in Riken
            Genomic Sciences Center and Genome Science Laboratory in RIKEN.
            Division of Experimental Animal Research in Riken contributed to
            prepare mouse tissues.
            
            Please visit our web site for further details.
            URL:http://www.osc.riken.jp/
            URL:http://fantom.gsc.riken.jp/
            
            clone information is available at:
            http://fantom.gsc.riken.jp/3/db/annotate/
            main.cgi?masterid=1200015I03
FEATURES             Location/Qualifiers
     source          1..2591
                     /clone="1200015I03"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /db_xref="FANTOM_DB:1200015I03"
                     /db_xref="MGI:1896848"
                     /db_xref="taxon:10090"
                     /dev_stage="adult"
                     /mol_type="mRNA"
                     /organism="Mus musculus"
                     /sex="male"
                     /strain="C57BL/6J"
                     /tissue_type="lung"
     CDS             100..2037
                     /codon_start=1
                     /note="HYPOTHETICAL 42.5 KDA PROTEIN homolog [Mus
                     musculus] (SPTR|Q9JHR3, evidence: FASTY, 99.5%ID,
                     100%length, match=1107)"
                     /note="putative"
                     /protein_id="BAB23567.2"
                     /transl_table=1
                     /translation="MAAMLGDAIMVAKGLAKLTQAAVETHLQNLGLGGELLLAARALQ
                     STAVEQFSMVFGKVQGQDKHEDSYATENFEDLEAEVQFSTPQAAGTSLDFSAASSLDQ
                     SLSPSHSQGPAPAYASSGPFREAGLPGQATSPMGRVNGRLFVDHRDLFLANGIQRRSF
                     HQDQSSVGGLTAEDIEKARQAKARPESKPHKQMLSERARERKVPVTRIGRLANFGGLA
                     VGLGIGALAEVAKKSLRSENSTGKKAVLDSSPFLSEANAERIVSTLCKVRGAALKLGQ
                     MLSIQDDAFINPHLAKIFERVRQSADFMPLKQMTKTLNSDLGPHWRDKLEYFEERPFA
                     AASIGQVHLARMKGGREVAMKIQYPGVAQSINSDVNNLMAVLNMSNMLPEGLFPEHLI
                     DVLRRELTLECDYQREAAYAKKFRELLKDHPFFYVPEIVDELCSPHVLTTELISGFPL
                     DQAEGLSQEVRNEICYNILVLCLRELFEFHVMQTDPNWSNFFYDPQQHKVALLDFGAT
                     REYDRSFTDLYIQVIRAAADQDREAVLKKSIEMKFLTGYEVKAMEDAHLDAILILGEA
                     FASEEPFDFGTQSTTEKIHNLIPVMLKHRLIPPPEETYSLHRKMGGSFLICSKLKARF
                     PCKAMFEEAYSNYCRMKSGLQ"
     regulatory      2566..2571
                     /note="putative"
                     /regulatory_class="polyA_signal_sequence"
     polyA_site      2591
                     /note="putative"
BASE COUNT          586 a          697 c          757 g          551 t
ORIGIN      
        1 tagaagcgga gagtgtgcgg agcgctcgca gcggggcgag cgcgcggagg cgttgcgttg
       61 cggacggagg agctaccttc ctgcagcccg ctctgaagga tggctgctat gttgggggat
      121 gccatcatgg tggccaaagg ccttgccaag ctgacccaag cagcggtgga aacccacttg
      181 cagaacctgg gccttggtgg ggagctcctc ctggcggcca gggccctgca gtctacagct
      241 gtggagcagt tcagcatggt cttcgggaag gtgcagggtc aggataagca tgaagattca
      301 tatgccactg agaactttga agatctggaa gccgaagttc agttctcaac accacaggca
      361 gctggaacct ccctggactt ctctgcggcc tcttccctgg accagtcact gtctccatcc
      421 cacagtcagg gaccggcccc tgcctatgct tccagtgggc ctttcaggga agctgggctc
      481 cctggacagg ccacctctcc tatgggcaga gtcaatggaa ggctctttgt agatcacaga
      541 gacttgttct tggccaacgg catccagcga agatccttcc accaggacca gtcctccgtg
      601 ggaggcctca cggctgaaga cattgaaaag gcccggcagg ccaaggctcg ccctgagagc
      661 aagccacaca agcagatgct cagtgagcgg gctcgggagc ggaaggtgcc agttacccgg
      721 attgggcggt tggccaactt tggaggtctg gctgtaggtc tgggaattgg ggcgttggct
      781 gaagttgcca agaagagtct gcgttctgag aactccacag gaaaaaaagc cgtgctggat
      841 tctagcccct tcctgtcaga ggcaaatgca gagcgcattg tgagtacact gtgcaaggtg
      901 cgtggggccg cactgaagct gggccagatg ctgagcatcc aggatgatgc cttcatcaac
      961 cctcacctgg ccaagatctt tgagcgggtg aggcagagtg ctgacttcat gccactgaaa
     1021 cagatgacga aaaccctcaa cagcgacctg ggcccccact ggagggacaa gctggagtac
     1081 tttgaagagc ggccctttgc agcggcctcc atagggcagg tgcacctggc ccgcatgaag
     1141 ggtggccgtg aggtggccat gaagatccag taccctggtg tggcccagag cattaacagt
     1201 gacgtcaaca acctcatggc tgtgctgaac atgagtaaca tgcttccaga aggcctgttc
     1261 cctgagcacc tgattgatgt gctgagacgg gagcttaccc ttgagtgtga ctaccagcgg
     1321 gaggctgcct atgccaagaa gttcagggaa ctgctgaagg accatccctt cttctatgtg
     1381 cctgagattg tggatgagtt gtgcagtccc catgtgctga ccacagagct catatcaggc
     1441 ttccccttgg accaggcaga agggctgagc caggaagtga ggaatgagat ctgctacaac
     1501 attttggtgc tgtgcctgag agagctgttt gagttccatg tcatgcagac tgaccccaac
     1561 tggtccaatt tcttttatga ccctcagcag cacaaggtgg ctctcctgga cttcggcgca
     1621 actagagaat atgacagatc ctttactgac ctctacattc aggtcatccg ggccgctgct
     1681 gaccaggaca gggaggcagt gctaaagaaa tccatagaga tgaagttcct taccggttat
     1741 gaggtcaagg ccatggagga tgctcacctg gatgctattc tcatcctggg ggaggccttt
     1801 gcctccgaag agcccttcga cttcggtacc cagagcacta ctgagaagat ccacaacctg
     1861 attcccgtca tgctgaagca ccgcctcatc ccacccccag aggagaccta ctctctgcac
     1921 cgcaagatgg gtggctcttt cctcatctgc tctaagctga aggcccgctt cccctgtaag
     1981 gccatgttcg aggaagcgta cagtaactac tgcaggatga agtctgggct gcagtaggtg
     2041 aggtgtggtc caggctaact aggggccatg ccctccttca ggctcagtca agcagaggga
     2101 ggacgttgaa agggagccac ggcctctatg aataggaaga gagcgttgcg tagaccctcc
     2161 tattgcttct gtactgtcca gacaaaaagg caggcttctt ttgccatctc ctccctgctg
     2221 tggatgtaca accaggaaca ggagcaatgc agtactaaac tcaacacagc cacctttggg
     2281 gtctagttgg ccatgggaat gaattgtgtg tttctgggtg agcttatgtc cctgcccttt
     2341 gccaccccag cctgtggcct ggagctagaa gctcaggaca gcagggtagc caccctagct
     2401 cagtctcagc atggacaaca gaccctttta aaagcagggc tgacattgga tgagcagcag
     2461 ggctccagca tgccccagga gagggagcta tcaccccacc ccaggggagc gacacaggct
     2521 ctttgtttgt atgtcttgta attacacatt catacctttg cgttcaataa agaagtgcgt
     2581 tggggactgc c
//