LOCUS       HSU23851                4168 bp    mRNA    linear   HUM 25-MAR-1997
DEFINITION  Human atrophin-1 mRNA, complete cds.
ACCESSION   U23851
VERSION     U23851.1
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4168)
  AUTHORS   Margolis,R.L., Li,S.H., Young,W.S., Wagster,M.V., Stine,O.C.,
            Kidwai,A.S., Ashworth,R.G. and Ross,C.A.
  TITLE     DRPLA gene (atrophin-1) sequence and mRNA expression in human brain
  JOURNAL   Brain Res. Mol. Brain Res. 36 (2), 219-226 (1996)
   PUBMED   8965642
REFERENCE   2  (bases 1 to 4168)
  AUTHORS   Li,S., McInnis,M.G., Margolis,R.L., Antonarakis,S.E. and Ross,C.A.
  TITLE     Novel triplet repeat containing genes in human brain: cloning,
            expression, and length polymorphisms
  JOURNAL   Unpublished
REFERENCE   3  (bases 1 to 4168)
  AUTHORS   Margolis,R.L.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-MAR-1995) Russell L. Margolis, Psychiatry, Johns
            Hopkins University School of Medicine, 720 Rutland Ave.,
            Baltimore, MD 21205-2196, USA
FEATURES             Location/Qualifiers
     source          1..4168
                     /db_xref="H-InvDB:HIT000218709"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="12"
                     /clone_lib="frontal cortex, cerebellum, and caudate cDNA
                     libraries (Stratagene)"
                     /dev_stage="adult"
     gene            1..4168
                     /gene="DRPLA"
     CDS             74..3628
                     /gene="DRPLA"
                     /codon_start=1
                     /product="atrophin-1"
                     /protein_id="AAB50276.1"
                     /translation="MKTRQNKDSMSMRSGRKKEAPGPREELRSRGRASPGGVSTSSSD
                     GKAEKSRQTAKKARVEEASTPKVNKQGRSEEISESESEETNAPKKTKTEELPRPQSPS
                     DLDSLDGRSLNDDGSSDPRDIDQDNRSTSPSIYSPGSVENDSDSSSGLSQGPARPYHP
                     PPLFPPSPQPPDSTPRQPEASFEPHPSVTPTGYHAPMEPPTSRMFQAPPGAPPPHPQL
                     YPGGTGGVLSGPPMGPKGGGAASSVGGPNGGKQHPPPTTPISVSSSGASGAPPTKPPT
                     TPVGGGNLPSAPPPANFPHVTPNLPPPPALRPLNNASASPPGLGAQPLPGHLPSPHAM
                     GQGIGGLPPGPEKGPTLAPSPHSLPPASSSAPAPPMRFPYSSSSSSSAAASSSSSSSS
                     SSASPFPASQALPSYPHSFPPPTSLSVSNQPPKYTQPSLPSQAVWSQGPPPPPPYGRL
                     LANSNAHPGPFPPSTGAQSTAHPPVSTHHHHHQQQQQQQQQQQQQQHHGNSGPPPPGA
                     FPHPLEGGSSHHAHPYAMSPSLGSLRPYPPGPAHLPPPHSQVSYSQAGPNGPPVSSSS
                     NSSSSTSQGSYPCSHPSPSQGPQGAPYPFPPVPTVTTSSATLSTVIATVASSPAGYKT
                     ASPPGPPPYGKRAPSPGAYKTATPPGYKPGSPPSFRTGTPPGYRGTSPPAGPGTFKPG
                     SPTVGPGPLPPAGPSGLPSLPPPPAAPASGPPLSATQIKQEPAEEYETPESPVPPARS
                     PSPPPKVVDVPSHASQSARFNKHLDRGFNSCARSDLYFVPLEGSKLAKKRADLVEKVR
                     REAEQRAREEKEREREREREKEREREKERELERSVKLAQEGRAPVECPSLGPVPHRPP
                     FEPGSAVATVPPYLGPDTPALRTLSEYARPHVMSPGNRNHPFYVPLGAVDPGLLGYNV
                     PALYSSDPAAREREREARERDLRDRLKPGFEVKPSELEPLHGVPGPGLDPFPRHGGLA
                     LQPGPPGLHPFPFHPSLGPLERERLALAAGPALRPDMSYAERLAAERQHAERVAALGN
                     DPLARLQMLNVTPHHHQHSHIHSHLHLHQQDAIHAASASVHPLIDPLASGSHLTRIPY
                     PAGTLPNPLLPHPLHENEVLRHQLFAAPYRDLPASLSAPMSAAHQLQAMHAQSAELQR
                     LALEQQQWLHAHHPLHSVPLPAQEDYYSHLKKESDKPL"
     repeat_region   1532..1561
                     /gene="DRPLA"
                     /note="polymorphic CAG repeat encoding glutamine, long
                     expansion causes DRPLA"
                     /rpt_type=tandem
                     /rpt_unit_seq="cag"
                     /satellite="microsatellite"
     polyA_site      4168
                     /gene="DRPLA"
                     /note="49 A nucleotides"
BASE COUNT      827 a   1554 c   1046 g    741 t
ORIGIN
        1 ttggggtgga gcagagaagt ttctgtattc agctgcccag gcagaggaga atggggtctc
       61 cacagcctga agaatgaaga cacgacagaa taaagactcg atgtcaatga ggagtggacg
      121 gaagaaagag gcccctgggc cccgggaaga actgagatcg aggggccggg cctcccctgg
      181 aggggtcagc acgtccagca gtgatggcaa agctgagaag tccaggcaga cagccaagaa
      241 ggcccgagta gaggaagcct ccaccccaaa ggtcaacaag cagggtcgga gtgaggagat
      301 ctcagagagt gaaagtgagg agaccaatgc accaaaaaag accaaaactg aggaactccc
      361 tcggccacag tctccctccg atctggatag cttggacggg cggagcctta atgatgatgg
      421 cagcagcgac cctagggata tcgaccagga caaccgaagc acgtccccca gtatctacag
      481 ccctggaagt gtggagaatg actctgactc atcttctggc ctgtcccagg gcccagcccg
      541 cccctaccac ccacctccac tctttcctcc ttcccctcaa ccgccagaca gcacccctcg
      601 acagccagag gctagctttg aaccccatcc ttctgtgaca cccactggat atcatgctcc
      661 catggagccc cccacatctc gaatgttcca ggctcctcct ggggcccctc cccctcaccc
      721 acagctctat cccgggggca ctggtggagt tttgtctgga cccccaatgg gtcccaaggg
      781 gggaggggct gcctcatcag tggggggccc taatgggggt aagcagcacc ccccacccac
      841 tactcccatt tcagtatcaa gctctggggc tagtggtgct cccccaacaa agccgcctac
      901 cactccagtg ggtggtggga acctaccttc tgctccacca ccagccaact tcccccatgt
      961 gacaccgaac ctgcctcccc cacctgccct gagacccctc aacaatgcat cagcctctcc
     1021 ccctggcctg ggggcccaac cactacctgg tcatctgccc tctccccacg ccatgggaca
     1081 gggtatcggt ggacttcctc ctggcccaga gaagggccca actctggctc cttcacccca
     1141 ctctctgcct cctgcttcct cttctgctcc agcgcccccc atgaggtttc cttattcatc
     1201 ctctagtagt agctctgcag cagcctcctc ttccagttct tcctcctctt cctctgcctc
     1261 ccccttccca gcttcccagg cattgcccag ctacccccac tctttccctc ccccaacaag
     1321 cctctctgtc tccaatcagc cccccaagta tactcagcct tctctcccat cccaggctgt
     1381 gtggagccag ggtcccccac cacctcctcc ctatggccgc ctcttagcca acagcaatgc
     1441 ccatccaggc cccttccctc cctctactgg ggcccagtcc accgcccacc caccagtctc
     1501 aacacatcac catcaccacc agcaacagca acagcagcag cagcagcagc agcagcagca
     1561 gcatcacgga aactctgggc cccctcctcc tggagcattt ccccacccac tggagggcgg
     1621 tagctcccac cacgcacacc cttacgccat gtctccctcc ctggggtctc tgaggcccta
     1681 cccaccaggg ccagcacacc tgcccccacc tcacagccag gtgtcctaca gccaagcagg
     1741 ccccaatggc cctccagtct cttcctcttc caactcttcc tcttccactt ctcaagggtc
     1801 ctacccatgt tcacacccct ccccttccca gggccctcaa ggggcgccct accctttccc
     1861 accggtgcct acggtcacca cctcttcggc taccctttcc acggtcattg ccaccgtggc
     1921 ttcctcgcca gcaggctaca aaacggcctc cccacctggg cccccaccgt acggaaagag
     1981 agccccgtcc ccgggggcct acaagacagc caccccaccc ggatacaaac ccgggtcgcc
     2041 tccctccttc cgaacgggga ccccaccggg ctatcgagga acctcgccac ctgcaggccc
     2101 agggaccttc aagccgggct cgcccaccgt gggacctggg cccctgccac ctgcggggcc
     2161 ctcaggcctg ccatcgctgc caccaccacc tgcggcccct gcctcagggc cgcccctgag
     2221 cgccacgcag atcaaacagg agccggctga ggagtatgag acccccgaga gcccggtgcc
     2281 cccagcccgc agcccctcgc cccctcccaa ggtggtagat gtacccagcc atgccagtca
     2341 gtctgccagg ttcaacaaac acctggatcg cggcttcaac tcgtgcgcgc gcagcgacct
     2401 gtacttcgtg ccactggagg gctccaagct ggccaagaag cgggccgacc tggtggagaa
     2461 ggtgcggcgc gaggccgagc agcgcgcgcg cgaagaaaag gagcgcgagc gcgagcggga
     2521 acgcgagaaa gagcgcgagc gcgagaagga gcgcgagctt gaacgcagcg tgaagttggc
     2581 tcaggagggc cgtgctccgg tggaatgccc atctctgggc ccagtgcccc atcgccctcc
     2641 atttgaaccg ggcagtgcgg tggctacagt gcccccctac ctgggtcctg acactccagc
     2701 cttgcgcact ctcagtgaat atgcccggcc tcatgtcatg tctcctggca atcgcaacca
     2761 tccattctac gtgcccctgg gggcagtgga cccggggctc ctgggttaca atgtcccggc
     2821 cctgtacagc agtgatccag ctgcccggga gagggaacgg gaagcccgtg aacgagacct
     2881 ccgtgaccgc ctcaagcctg gctttgaggt gaagcctagt gagctggaac ccctacatgg
     2941 ggtccctggg ccgggcttgg atccctttcc ccgacatggg ggcctggctc tgcagcctgg
     3001 cccacctggc ctgcaccctt tcccctttca tccgagcctg gggcccctgg agcgagaacg
     3061 tctagcgctg gcagctgggc cagccctgcg gcctgacatg tcctatgctg agcggctggc
     3121 agctgagagg cagcacgcag aaagggtggc ggccctgggc aatgacccac tggcccggct
     3181 gcagatgctc aatgtgactc cccatcacca ccagcactcc cacatccact cgcacctgca
     3241 cctgcaccag caagatgcta tccatgcagc ctctgcctcg gtgcaccctc tcattgaccc
     3301 cctggcctca gggtctcacc ttacccggat cccctaccca gctggaactc tccctaaccc
     3361 cctgcttcct caccctctgc acgagaacga agttcttcgt caccagctct ttgctgcccc
     3421 ttaccgggac ctgccggcct ccctttctgc cccgatgtca gcagctcatc agctgcaggc
     3481 catgcacgca cagtcagctg agctgcagcg cttggcgctg gaacagcagc agtggctgca
     3541 tgcccatcac ccgctgcaca gtgtgccgct gcctgcccag gaggactact acagtcacct
     3601 gaagaaggaa agcgacaagc cactgtagaa cctgcgatca agagagcacc atggctccta
     3661 cattggacct tggagcaccc ccaccctccc cccaccgtgc ccttggcctg ccacccagag
     3721 ccaagagggt gctgctcagt tgcagggcct ccgcagctgg acagagagtg ggggagggag
     3781 ggacagacag aaggccaagg cccgatgtgg tgtgcagagg tggggaggtg gcgaggatgg
     3841 ggacagaaag cgcacagaat cttggaccag gtctctcttc cttgtccccc ctgcttttct
     3901 cctcccccat gcccaacccc tgtggccgcc gcccctcccc tgccccgttg gtgtgattat
     3961 ttcatctgtt agatgtggct gttttgcgta gcatcgtgtg ccacccctgc ccctccccga
     4021 tccctgtgtg cgcgccccct ctgcaatgta tgccccttgc cccttcccca cactaataat
     4081 ttatatatat aaatatctat atgacgctct taaaaaaaca tcccaaccaa aaccaaccaa
     4141 acaaaaacat cctcacaact ccccagga
//