LOCUS BC071657 2833 bp mRNA linear HUM 25-JUN-2004
DEFINITION Homo sapiens TAR DNA binding protein, mRNA (cDNA clone MGC:87845
IMAGE:5498250), complete cds.
ACCESSION BC071657
VERSION BC071657.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2833)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2833)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (01-JUN-2004) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Lou Staudt
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 166 Row: n Column: 2
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 19743843.
FEATURES Location/Qualifiers
source 1..2833
/db_xref="H-InvDB:HIT000264482"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:87845 IMAGE:5498250"
/tissue_type="Lymph, lymphoma"
/clone_lib="NIH_MGC_85"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2833
/gene="TARDBP"
/gene_synonym="TDP-43"
/db_xref="GeneID:23435"
/db_xref="MIM:605078"
CDS 86..1330
/gene="TARDBP"
/gene_synonym="TDP-43"
/codon_start=1
/product="TARDBP protein"
/protein_id="AAH71657.1"
/db_xref="GeneID:23435"
/db_xref="MIM:605078"
/translation="MSEYIRVTEDENDEPIEIPSEDDGTVLLSTVTAQFPGACGLRYR
NPVSQCMRGVRLVEGILHAPDAGWGNLVYVVNYPKDNKRKMDETDASSAVKVKRAVQK
TSDLIVLGLPWKTTEQDLKEYFSTFGEVLMVQVKKDLKTGHSKGFGFVRFTEYETQVK
VMSQRHMIDGRWCDCKLPNSKQSQDEPLRSRKVFVGRCTEDMTEDELREFFSQYGDVM
DVFIPKPFRAFAFVTFADDQIAQSLCGEDLIIKGISVHISNAEPKHNSNRQLERSGRF
GGNPGGFGNQGGFGNSRGGGAGLGNNQGSNMGGGMNFGAFSINPAMMAAAQAALQSSW
GMMGMLASQQNQSGPSGNNQNQGNMQREPNQAFGSGNNSYSGSNSGAAIGWGSASNAG
SGSGFNGGFGSSMDSKSSGWGM"
BASE COUNT 783 a 486 c 716 g 848 t
ORIGIN
1 cggtggctgg gctgcgcttg ggtccgtcgc tgcttcggtg tccctgtcgg gcttcccagc
61 agcggcctag cgggaaaagt aaaagatgtc tgaatatatt cgggtaaccg aagatgagaa
121 cgatgagccc attgaaatac catcggaaga cgatgggacg gtgctgctct ccacggttac
181 agcccagttt ccaggggcgt gtgggcttcg ctacaggaat ccagtgtctc agtgtatgag
241 aggtgtccgg ctggtagaag gaattctgca tgccccagat gctggctggg gaaatctggt
301 gtatgttgtc aactatccaa aagataacaa aagaaaaatg gatgagacag atgcttcatc
361 agcagtgaaa gtgaaaagag cagtccagaa aacatccgat ttaatagtgt tgggtctccc
421 atggaaaaca accgaacagg acctgaaaga gtattttagt acctttggag aagttcttat
481 ggtgcaggtc aagaaagatc ttaagactgg tcattcaaag gggtttggct ttgttcgttt
541 tacggaatat gaaacacaag tgaaagtaat gtcacagcga catatgatag atggacgatg
601 gtgtgactgc aaacttccta attctaagca aagccaagat gagcctttga gaagcagaaa
661 agtgtttgtg gggcgctgta cagaggacat gactgaggat gagctgcggg agttcttctc
721 tcagtacggg gatgtgatgg atgtcttcat ccccaagcca ttcagggcct ttgcctttgt
781 tacatttgca gatgatcaga ttgcgcagtc tctttgtgga gaggacttga tcattaaagg
841 aatcagcgtt catatatcca atgccgaacc taagcacaat agcaatagac agttagaaag
901 aagtggaaga tttggtggta atccaggtgg ctttgggaat cagggtggat ttggtaatag
961 cagagggggt ggagctggtt tgggaaacaa tcaaggtagt aatatgggtg gtgggatgaa
1021 ctttggtgcg ttcagcatta atccagccat gatggctgcc gcccaggcag cactacagag
1081 cagttggggt atgatgggca tgttagccag ccagcagaac cagtcaggcc catcgggtaa
1141 taaccaaaac caaggcaaca tgcagaggga gccaaaccag gccttcggtt ctggaaataa
1201 ctcttatagt ggctctaatt ctggtgcagc aattggttgg ggatcagcat ccaatgcagg
1261 gtcgggcagt ggttttaatg gaggctttgg ctcaagcatg gattctaagt cttctggctg
1321 gggaatgtag acagtggggt tgtggttggt tggtatagaa tggtgggaat tcaaattttt
1381 ctaaactcat ggtaagtata ttgtaaaata catatgtact aagaattttc aaaattggtt
1441 tgttcagtgt ggagtatatt cagcagtatt tttgacattt ttctttagaa aaaggaagag
1501 ctaaaggaat tttataagtt ttgttacatg aaaggttgaa atattgagtg gttgaaagtg
1561 aactgctgtt tgcctgattg gtaaaccaac acactacaat tgatatcaaa aggtttctcc
1621 tgtaatattt tatccctgga cttgtcaagt gaattctttg catgttcaaa acggaaacca
1681 ttgattagaa ctacattctt taccccttgt tttaatttga accccaccat atggattttt
1741 ttccttaaga aaatctcctt ttaggagatc atggtgtcac agtgtttggt tcttttgttt
1801 tgttttttaa cacttgtctc ccctcataca caaaagtaca atatgaagcc ttcatttaat
1861 ctctgcagtt catctcattt caaatgttta tggaagaagc acttcattga aagtagtgct
1921 gtaaatattc tgccatagga atactgtcta catgctttct cattcaagaa ttcgtcatca
1981 cgcatcacag gccgcgtctt tgacggtggg tgtcccattt ttatccgcta ctctttattt
2041 catggagtcg tatcaacgct atgaacgcaa ggctgtgata tggaaccaga aggctgtctg
2101 aacttttgaa accttgtgtg ggattgatgg tggtgccgag gcatgaaagg ctagtatgag
2161 cgagaaaagg agagagcgcg tgcagagact tggtggtgca taatggatat tttttaactt
2221 ggcgagatgt gtctctcaat cctgtggctt tggtgagaga gtgtgcagag agcaatgata
2281 gcaaataatg tacgaatgtt ttttgcattc aaaggacatc cacatctgtt ggaagacttt
2341 taagtgagtt tttgttctta gataacccac attagatgaa tgtgttaagt gaaatgatac
2401 ttgtactccc cctacccctt tgtcaactgc tgtgaatgct gtatggtgtg tgttctcttc
2461 tgttactgat atgtaagtgt ggcaatgtga actgaagctg atgggctgag aacatggact
2521 gagcttgtgg tgtgctttgc aggaggactt gaagcagagt tcaccagtga gctcaggtgt
2581 ctcaaagaag ggtggaagtt ctaatgtctg ttagctaccc ataagaatgc tgtttgctgc
2641 agttctgtgt cctgtgcttg gatgcttttt ataagagttg tcattgttgg aaattcttaa
2701 ataaaactga tttaaataaa aaaaaaaaaa aagggcggcc gccctttttt tttttttttt
2761 agagctttta tatttgcgtt tattcttcat ttaacttttt aaaacactac tatagtttat
2821 taaaaaaaaa aaa
//