LOCUS       BC071657                2833 bp    mRNA    linear   HUM 25-JUN-2004
DEFINITION  Homo sapiens TAR DNA binding protein, mRNA (cDNA clone MGC:87845
            IMAGE:5498250), complete cds.
ACCESSION   BC071657
VERSION     BC071657.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2833)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2833)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-JUN-2004) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Lou Staudt
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 166 Row: n Column: 2
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 19743843.
FEATURES             Location/Qualifiers
     source          1..2833
                     /db_xref="H-InvDB:HIT000264482"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:87845 IMAGE:5498250"
                     /tissue_type="Lymph, lymphoma"
                     /clone_lib="NIH_MGC_85"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2833
                     /gene="TARDBP"
                     /gene_synonym="TDP-43"
                     /db_xref="GeneID:23435"
                     /db_xref="MIM:605078"
     CDS             86..1330
                     /gene="TARDBP"
                     /gene_synonym="TDP-43"
                     /codon_start=1
                     /product="TARDBP protein"
                     /protein_id="AAH71657.1"
                     /db_xref="GeneID:23435"
                     /db_xref="MIM:605078"
                     /translation="MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYR
                     NPVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQK
                     TSDLIVLGLPWKTTEQDLKEYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVK
                     VMSQRHMIDGRWCDCKLPNSKQSQDEPLRSRKVFVGRCTEDMTEDELREFFSQYGDVM
                     DVFIPKPFRAFAFVTFADDQIAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRF
                     GGNPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSW
                     GMMGMLASQQNQSGPSGNNQNQGNMQREPNQAFGSGNNSYSGSNSGAAIGWGSASNAG
                     SGSGFNGGFGSSMDSKSSGWGM"
BASE COUNT          783 a          486 c          716 g          848 t
ORIGIN      
        1 cggtggctgg gctgcgcttg ggtccgtcgc tgcttcggtg tccctgtcgg gcttcccagc
       61 agcggcctag cgggaaaagt aaaagatgtc tgaatatatt cgggtaaccg aagatgagaa
      121 cgatgagccc attgaaatac catcggaaga cgatgggacg gtgctgctct ccacggttac
      181 agcccagttt ccaggggcgt gtgggcttcg ctacaggaat ccagtgtctc agtgtatgag
      241 aggtgtccgg ctggtagaag gaattctgca tgccccagat gctggctggg gaaatctggt
      301 gtatgttgtc aactatccaa aagataacaa aagaaaaatg gatgagacag atgcttcatc
      361 agcagtgaaa gtgaaaagag cagtccagaa aacatccgat ttaatagtgt tgggtctccc
      421 atggaaaaca accgaacagg acctgaaaga gtattttagt acctttggag aagttcttat
      481 ggtgcaggtc aagaaagatc ttaagactgg tcattcaaag gggtttggct ttgttcgttt
      541 tacggaatat gaaacacaag tgaaagtaat gtcacagcga catatgatag atggacgatg
      601 gtgtgactgc aaacttccta attctaagca aagccaagat gagcctttga gaagcagaaa
      661 agtgtttgtg gggcgctgta cagaggacat gactgaggat gagctgcggg agttcttctc
      721 tcagtacggg gatgtgatgg atgtcttcat ccccaagcca ttcagggcct ttgcctttgt
      781 tacatttgca gatgatcaga ttgcgcagtc tctttgtgga gaggacttga tcattaaagg
      841 aatcagcgtt catatatcca atgccgaacc taagcacaat agcaatagac agttagaaag
      901 aagtggaaga tttggtggta atccaggtgg ctttgggaat cagggtggat ttggtaatag
      961 cagagggggt ggagctggtt tgggaaacaa tcaaggtagt aatatgggtg gtgggatgaa
     1021 ctttggtgcg ttcagcatta atccagccat gatggctgcc gcccaggcag cactacagag
     1081 cagttggggt atgatgggca tgttagccag ccagcagaac cagtcaggcc catcgggtaa
     1141 taaccaaaac caaggcaaca tgcagaggga gccaaaccag gccttcggtt ctggaaataa
     1201 ctcttatagt ggctctaatt ctggtgcagc aattggttgg ggatcagcat ccaatgcagg
     1261 gtcgggcagt ggttttaatg gaggctttgg ctcaagcatg gattctaagt cttctggctg
     1321 gggaatgtag acagtggggt tgtggttggt tggtatagaa tggtgggaat tcaaattttt
     1381 ctaaactcat ggtaagtata ttgtaaaata catatgtact aagaattttc aaaattggtt
     1441 tgttcagtgt ggagtatatt cagcagtatt tttgacattt ttctttagaa aaaggaagag
     1501 ctaaaggaat tttataagtt ttgttacatg aaaggttgaa atattgagtg gttgaaagtg
     1561 aactgctgtt tgcctgattg gtaaaccaac acactacaat tgatatcaaa aggtttctcc
     1621 tgtaatattt tatccctgga cttgtcaagt gaattctttg catgttcaaa acggaaacca
     1681 ttgattagaa ctacattctt taccccttgt tttaatttga accccaccat atggattttt
     1741 ttccttaaga aaatctcctt ttaggagatc atggtgtcac agtgtttggt tcttttgttt
     1801 tgttttttaa cacttgtctc ccctcataca caaaagtaca atatgaagcc ttcatttaat
     1861 ctctgcagtt catctcattt caaatgttta tggaagaagc acttcattga aagtagtgct
     1921 gtaaatattc tgccatagga atactgtcta catgctttct cattcaagaa ttcgtcatca
     1981 cgcatcacag gccgcgtctt tgacggtggg tgtcccattt ttatccgcta ctctttattt
     2041 catggagtcg tatcaacgct atgaacgcaa ggctgtgata tggaaccaga aggctgtctg
     2101 aacttttgaa accttgtgtg ggattgatgg tggtgccgag gcatgaaagg ctagtatgag
     2161 cgagaaaagg agagagcgcg tgcagagact tggtggtgca taatggatat tttttaactt
     2221 ggcgagatgt gtctctcaat cctgtggctt tggtgagaga gtgtgcagag agcaatgata
     2281 gcaaataatg tacgaatgtt ttttgcattc aaaggacatc cacatctgtt ggaagacttt
     2341 taagtgagtt tttgttctta gataacccac attagatgaa tgtgttaagt gaaatgatac
     2401 ttgtactccc cctacccctt tgtcaactgc tgtgaatgct gtatggtgtg tgttctcttc
     2461 tgttactgat atgtaagtgt ggcaatgtga actgaagctg atgggctgag aacatggact
     2521 gagcttgtgg tgtgctttgc aggaggactt gaagcagagt tcaccagtga gctcaggtgt
     2581 ctcaaagaag ggtggaagtt ctaatgtctg ttagctaccc ataagaatgc tgtttgctgc
     2641 agttctgtgt cctgtgcttg gatgcttttt ataagagttg tcattgttgg aaattcttaa
     2701 ataaaactga tttaaataaa aaaaaaaaaa aagggcggcc gccctttttt tttttttttt
     2761 agagctttta tatttgcgtt tattcttcat ttaacttttt aaaacactac tatagtttat
     2821 taaaaaaaaa aaa
//