LOCUS       BC068445                2814 bp    mRNA    linear   HUM 21-JUL-2005
DEFINITION  Homo sapiens chromosome 9 open reading frame 72, mRNA (cDNA clone
            MGC:86985 IMAGE:5298741), complete cds.
ACCESSION   BC068445
VERSION     BC068445.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2814)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2814)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-APR-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 168 Row: f Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 37039614.
FEATURES             Location/Qualifiers
     source          1..2814
                     /db_xref="H-InvDB:HIT000263003"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:86985 IMAGE:5298741"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..2814
                     /gene="C9orf72"
                     /gene_synonym="MGC23980"
                     /db_xref="GeneID:203228"
     CDS             57..1502
                     /gene="C9orf72"
                     /gene_synonym="MGC23980"
                     /codon_start=1
                     /product="C9orf72 protein"
                     /protein_id="AAH68445.1"
                     /db_xref="GeneID:203228"
                     /translation="MSTLCPPPSPAVAKTEIALSGKSPLLAATFAYWDNILGPRVRHI
                     WAPKTEQVLLSDGEITFLANHTLNGEILRNAESGAIDVKFFVLSEKGVIIVSLIFDGN
                     WNGDRSTYGLSIILPQTELSFYLPLHRVCVDRLTHIIRKGRIWMHKERQENVQKIILE
                     GTERMEDQGQSIIPMLTGEVIPVMELLSSMKSHSVPEEIDIADTVLNDDDIGDSCHEG
                     FLLNAISSHLQTCGCSVVVGSSAEKVNKIVRTLCLFLTPAERKCSRLCEAESSFKYES
                     GLFVQGLLKDSTGSFVLPFRQVMYAPYPTTHIDVDVNTVKQMPPCHEHIYNQRRYMRS
                     ELTAFWRATSEEDMAQDTIIYTDESFTPDLNIFQDVLHRDTLVKAFLDQVFQLKPGLS
                     LRSTFLAQFLLVLHRKALTLIKYIEDDTQKGKKPFKSLRNLKIDLDLTAEGDLNIIMA
                     LAEKIKPGLHSFIFGRPFYTSVQERDVLMTF"
BASE COUNT          879 a          493 c          544 g          898 t
ORIGIN      
        1 ggtggcgagt ggatatctcc ggagcattta gataatgtga cagttggaat gcagtgatgt
       61 cgactctttg cccaccgcca tctccagctg ttgccaagac agagattgct ttaagtggca
      121 aatcaccttt attagcagct acttttgctt actgggacaa tattcttggt cctagagtaa
      181 ggcacatttg ggctccaaag acagaacagg tacttctcag tgatggagaa ataacttttc
      241 ttgccaacca cactctaaat ggagaaatcc ttcgaaatgc agagagtggt gctatagatg
      301 taaagttttt tgtcttgtct gaaaagggag tgattattgt ttcattaatc tttgatggaa
      361 actggaatgg ggatcgcagc acatatggac tatcaattat acttccacag acagaactta
      421 gtttctacct cccacttcat agagtgtgtg ttgatagatt aacacatata atccggaaag
      481 gaagaatatg gatgcataag gaaagacaag aaaatgtcca gaagattatc ttagaaggca
      541 cagagagaat ggaagatcag ggtcagagta ttattccaat gcttactgga gaagtgattc
      601 ctgtaatgga actgctttca tctatgaaat cacacagtgt tcctgaagaa atagatatag
      661 ctgatacagt actcaatgat gatgatattg gtgacagctg tcatgaaggc tttcttctca
      721 atgccatcag ctcacacttg caaacctgtg gctgttccgt tgtagtaggt agcagtgcag
      781 agaaagtaaa taagatagtc agaacattat gcctttttct gactccagca gagagaaaat
      841 gctccaggtt atgtgaagca gaatcatcat ttaaatatga gtcagggctc tttgtacaag
      901 gcctgctaaa ggattcaact ggaagctttg tgctgccttt ccggcaagtc atgtatgctc
      961 catatcccac cacacacata gatgtggatg tcaatactgt gaagcagatg ccaccctgtc
     1021 atgaacatat ttataatcag cgtagataca tgagatccga gctgacagcc ttctggagag
     1081 ccacttcaga agaagacatg gctcaggata cgatcatcta cactgacgaa agctttactc
     1141 ctgatttgaa tatttttcaa gatgtcttac acagagacac tctagtgaaa gccttcctgg
     1201 atcaggtctt tcagctgaaa cctggcttat ctctcagaag tactttcctt gcacagtttc
     1261 tacttgtcct tcacagaaaa gccttgacac taataaaata tatagaagac gatacgcaga
     1321 agggaaaaaa gccctttaaa tctcttcgga acctgaagat agaccttgat ttaacagcag
     1381 agggcgatct taacataata atggctctgg ctgagaaaat taaaccaggc ctacactctt
     1441 ttatctttgg aagacctttc tacactagtg tgcaagaacg agatgttcta atgacttttt
     1501 aaatgtgtaa cttaataagc ctattccatc acaatcatga tcgctggtaa agtagctcag
     1561 tggtgtgggg aaacgttccc ctggatcata ctccagaatt ctgctctcag caattgcagt
     1621 taagtaagtt acactacagt tctcacaaga gcctgtgagg ggatgtcagg tgcatcatta
     1681 cattgggtgt ctcttttcct agatttatgc ttttgggata cagacctatg tttacaatat
     1741 aataaatatt attgctatct tttaaagata taataatagg atgtaaactt gaccacaact
     1801 actgtttttt tgaaatacat gattcatggt ttacatgtgt caaggtgaaa tctgagttgg
     1861 cttttacaga tagttgactt tctatctttt ggcattcttt ggtgtgtaga attactgtaa
     1921 tacttctgca atcaactgaa aactagagcc tttaaatgat ttcaattcca cagaaagaaa
     1981 gtgagcttga acataggatg agctttagaa agaaaattga tcaagcagat gtttaattgg
     2041 aattgattat tagatcctac tttgtggatt tagtccctgg gattcagtct gtagaaatgt
     2101 ctaatagttc tctatagtcc ttgttcctgg tgaaccacag ttagggtgtt ttgtttattt
     2161 tattgttctt gctattgttg atattctatg tagttgagct ctgtaaaagg aaattgtatt
     2221 ttatgtttta gtaattgttg ccaacttttt aaattaattt tcattatttt tgagccaaat
     2281 tgaaatgtgc acctcctgtg ccttttttct ccttagaaaa tctaattact tggaacaagt
     2341 tcagatttca ctggtcagtc attttcatct tgttttcttc ttgctaagtc ttaccatgta
     2401 cctgctttgg caatcattgc aactctgaga ttataaaatg ccttagagaa tatactaact
     2461 aataagatct ttttttcaga aacagaaaat agttccttga gtacttcctt cttgcatttc
     2521 tgcctatgtt tttgaagttg ttgctgtttg cctgcaatag gctataagga atagcaggag
     2581 aaattttact gaagtgctgt tttcctaggt gctactttgg cagagctaag ttatcttttg
     2641 ttttcttaat gcgtttggac cattttgctg gctataaaat aactgattaa tataattcta
     2701 acacaatgtt gacattgtag ttacacaaac acaaataaat attttattta aaattcaaaa
     2761 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaacaa aaaaaaaaaa aaaa
//