LOCUS       AB209159                5065 bp    mRNA    linear   HUM 17-NOV-2007
DEFINITION  Homo sapiens mRNA for Thyroglobulin precursor variant protein.
ACCESSION   AB209159
VERSION     AB209159.1
KEYWORDS    FLI_CDNA.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5065)
  AUTHORS   Totoki,Y., Toyoda,A., Takeda,T., Sakaki,Y., Tanaka,A., Yokoyama,S.,
            Ohara,O., Nagase,T. and Kikuno,R.F.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAR-2005) to the DDBJ/EMBL/GenBank databases.
            Contact:Osamu Ohara
            Kazusa DNA Research Institute, Department of Human Gene Research;
            2-6-7 Kazusa-kamatari, Kisarazu, Chiba 292-0818, Japan
            URL :http://protein.gsc.riken.go.jp/
REFERENCE   2
  AUTHORS   Totoki,Y., Toyoda,A., Takeda,T., Sakaki,Y., Tanaka,A., Yokoyama,S.,
            Ohara,O., Nagase,T. and Kikuno,R.F.
  TITLE     Homo sapiens protein coding cDNA
  JOURNAL   Published Only in Database(2005)
COMMENT     This work was supported in part by the National Project on Protein
            Structural and Functional Analysis, Ministry of Education,
            Culture, Sports, Science and Technology of Japan.
            Totoki Y, Toyoda A, Takeda T, Sakaki Y, Tanaka A, Yokoyama S.
            RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi,
            Yokohama 230-0045, Japan.
            URL: http://protein.gsc.riken.go.jp/
FEATURES             Location/Qualifiers
     source          1..5065
                     /clone="aj00807"
                     /db_xref="H-InvDB:HIT000329094"
                     /db_xref="taxon:9606"
                     /mol_type="mRNA"
                     /note="this clone is also named as hsk003000858"
                     /note="vector:pBluescriptII SK plus"
                     /organism="Homo sapiens"
                     /tissue_type="brain"
     CDS             <238..4962
                     /codon_start=1
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /note="Start codon is not identified."
                     /product="Thyroglobulin precursor variant"
                     /protein_id="BAD92396.1"
                     /translation="IPRKPISKRPVRPSLPRSPRCPLPFNASEVVGGTILCETISGPT
                     GSAMQQCQLLCRQGSWSVFPPGPLICSLESGRWESQLPQPRACQRPQLWQTIQTQGHF
                     QLQLPPGKMCSADYAGLLQTFQVFILDELTARGFCQIQVKTFGTLVSIPVCNNSSVQV
                     GCLTRERLGVNVTWKSRLEDIPVASLPDLHDIERALVGKDLLGRFTDLIQSGSFQLHL
                     DSKTFPAETIRFLQGDHFGTSPRTWFGCSEGFYQVLTSEASQDGLGCVKCPEGSYSQD
                     EECIPCPVGFYQEQAGSLACVPCPVGRTTISAGAFSQTHCVTDCQRNEAGLQCDQNGQ
                     YRASQKDRGSGKAFCVDGEGRRLPWWETEAPLEDSQCLMMQKFEKVPESKVIFDANAP
                     VAVRSKVPDSEFPVMQCLTDCTEDEACSFFTVSTTEPEISCDFYAWTSDNVACMTSDQ
                     KRDALGNSKATSFGSLRCQVKVRSHGQDSPAVYLKKGQGSTTTLQKRFEPTGFQNMLS
                     GLYNPIVFSASGANLTDAHLFCLLACDRDLCCDGFVLTQVQGGAIICGLLSSPSVLLC
                     NVKDWMDPSEAWANATCPGVTYDQESHQVILRLGDQEFIKSLTPLEGTQDTFTNFQQV
                     YLWKDSDMGSRPESMGCRKNTVPRPASPTEAGLTTELFSPVDLNQVIVNGNQSLSSQK
                     HWLFKHLFSAQQANLWCLSRCVQEHSFCQLAEITESASLYFTCTLYPEAQVCDDIMES
                     NAQGCRLILPQMPKALFRKKVILEDKVKNFYTRLPFQKLTGISIRNKVPMSEKSISNG
                     FFECERRCDADPCCTGFGFLNVSQLKGGEVTCLTLNSLGIQMCSEENGGAWRILDCGS
                     PDIEVHTYPFGWYQKPIAQNNAPSFCPLVVLPSLTEKVSLDSWQSLALSSVVVDPSIR
                     HFDVAHVSTAATSNFSAVRDLCLSECSQHEACLITTLQTQPGAVRCMFYADTQSCTHS
                     LQGQNCRLLLREEATHIYRKPGISLLSYEASVPSVPISTHGRLLGRSQAIQVGTSWKQ
                     VDQFLGVPYAAPPLAERRFQAPEPLNWTGSWDASKPRASCWQPGTRTSTSPGVSEDCL
                     YLNVFIPQNVAPNASVLVFFHNTMDREESEGWPAIDGSFLAAVGNLIVVTASYRVGVF
                     GFLSSGSGEVSGNWGLLDQVAALTWVQTHIRGFGGDPRRVSLAADRGGADVASIHLLT
                     ARATNSQLFRRAVLMGGSALSPAAVISHERAQQQAIALAKEVSCPMSSSQEVVSCLRQ
                     KPANVLNDAQTKLLAVSGPFHYWGPVIDGHFLREPPARALKRSLWVEVDLLIGSSQDD
                     GLINRAKAVKQFEESQGRTSSKTAFYQALQNSLGGEDSDARVEAAATWYYSLEHSTDD
                     YASFSRALENATRDYFIICPIIDMASAWAKRARGNVFMYHAPENYGHGSLELLADVQF
                     ALGLPFYPAYEGQFSLEEKSLSLKIMQYFSHFIRSGNPNYPYEFSRKVPTFATPWPDF
                     VPRAGGENYKEFSELLPNRQGLKKADCSFWSKYISSLKTSADGAKGGQSAESEEEELT
                     AGSGLREDLLSLQEPGSKTYSK"
BASE COUNT     1129 a   1389 c   1366 g   1181 t
ORIGIN
        1 ctcggctgga gcatccgccc ttctgcggtc gggcccctac atgccacagt gtgatgcgtt
       61 tggaagttgg gagcctgtgc agtgccacgc tgggactggg cactgctggt gtgtagatga
      121 gaaaggaggg ttcatccctg gctcactgac tgcccgctct ctgcagattc cacagtgccc
      181 gacaacctgc gagaaatctc gaaccagtgg gctgctttcc agttggaaac aggctagatc
      241 ccaagaaaac ccatctccaa aagacctgtt cgtcccagcc tgcctagaag cccgcggtgt
      301 ccgctgccat tcaacgcgtc ggaggtggtt ggtggaacaa tcctgtgtga gacaatctcg
      361 ggccccacag gctctgccat gcagcagtgc caattgctgt gccgccaggg ctcctggagc
      421 gtgtttccac cagggccatt gatatgtagc ctggagagcg gacgctggga gtcacagctg
      481 cctcagcccc gggcctgcca acggccccag ctgtggcaga ccatccagac ccaagggcac
      541 tttcagctcc agctcccgcc gggcaagatg tgcagtgctg actacgcggg tttgctgcag
      601 actttccagg ttttcatatt ggatgagctg acagcccgcg gcttctgcca gatccaggtg
      661 aagacttttg gcaccctggt ttccattcct gtctgcaaca actcctctgt gcaggtgggt
      721 tgtctgacca gggagcgttt aggagtgaat gttacatgga aatcacggct tgaggacatc
      781 ccagtggctt ctcttcctga cttacatgac attgagagag ccttggtggg caaggatctc
      841 cttgggcgct tcacagatct gatccagagt ggctcattcc agcttcatct ggactccaag
      901 acgttcccag cggaaaccat ccgcttcctc caaggggacc actttggcac ctctcccagg
      961 acatggtttg ggtgctcgga aggattctac caagtcttga caagtgaggc cagtcaggac
     1021 ggactgggat gcgttaagtg tcctgaagga agctattccc aagatgagga atgcattcct
     1081 tgtcctgttg gattctacca agaacaggca gggagcttgg cctgtgtccc atgtcctgtg
     1141 ggcagaacga ccatttctgc cggagctttc agccagactc actgtgtcac tgactgtcag
     1201 aggaacgaag caggcctgca atgtgaccag aatggccagt atcgagccag ccagaaggac
     1261 aggggcagtg ggaaggcctt ctgtgtggac ggcgaggggc ggaggctgcc atggtgggaa
     1321 acagaggccc ctcttgagga ctcacagtgt ttgatgatgc agaagtttga gaaggttcca
     1381 gaatcaaagg tgatcttcga cgccaatgct cctgtggctg tcagatccaa agttcctgat
     1441 tctgagttcc ccgtgatgca gtgcttgaca gattgcacag aggacgaggc ctgcagcttc
     1501 ttcaccgtgt ccacgacgga gccagagatt tcctgtgatt tctatgcttg gacaagtgac
     1561 aatgttgcct gcatgacttc tgaccagaaa cgagatgcac tggggaactc aaaggccacc
     1621 agctttggaa gtcttcgctg ccaggtgaaa gtgaggagcc atggtcaaga ttctccagct
     1681 gtgtatttga aaaagggcca aggatccacc acaacacttc agaaacgctt tgaacccact
     1741 ggtttccaaa acatgctttc tggattgtac aaccccattg tgttctcagc ctcaggagcc
     1801 aatctaaccg atgctcacct cttctgtctt cttgcatgcg accgtgatct gtgttgcgat
     1861 ggcttcgtcc tcacacaggt tcaaggaggt gccatcatct gtgggttgct gagctcaccc
     1921 agtgtcctgc tttgtaatgt caaagactgg atggatccct ctgaagcctg ggctaatgct
     1981 acatgtcctg gtgtgacata tgaccaggag agccaccagg tgatattgcg tcttggagac
     2041 caggagttca tcaagagtct gacaccctta gaaggaactc aagacacctt taccaatttt
     2101 cagcaggttt atctctggaa agattctgac atggggtctc ggcctgagtc tatgggatgt
     2161 agaaaaaaca cagtgccaag gccagcatct ccaacagaag caggtttgac aacagaactt
     2221 ttctcccctg tggacctcaa ccaggtcatt gtcaatggaa atcaatcact atccagccag
     2281 aagcactggc ttttcaagca cctgttttca gcccagcagg caaacctatg gtgcctttct
     2341 cgttgtgtgc aggagcactc tttctgtcag ctcgcagaga taacagagag tgcatccttg
     2401 tacttcacct gcaccctcta cccagaggca caggtgtgtg atgacatcat ggagtccaat
     2461 gcccagggct gcagactgat cctgcctcag atgccaaagg ccctgttccg gaagaaagtt
     2521 atactggaag ataaagtgaa gaacttttac actcgcctgc cgttccaaaa actgacgggg
     2581 atatccatta gaaataaagt gcccatgtct gaaaaatcta tttctaatgg gttctttgaa
     2641 tgtgaacgac ggtgcgatgc ggacccatgc tgcactggct ttggatttct aaatgtttcc
     2701 cagttaaaag gaggagaggt gacatgtctc actctgaaca gcttgggaat tcagatgtgc
     2761 agtgaggaga atggaggagc ctggcgcatt ttggactgtg gctctcctga cattgaagtc
     2821 cacacctatc ccttcggatg gtaccagaag cccattgctc aaaataatgc tcccagtttt
     2881 tgccctttgg ttgttctgcc ttccctcaca gagaaagtgt ctctggactc gtggcagtcc
     2941 ctggccctct cttcagtggt tgttgatcca tccattaggc actttgatgt tgcccatgtc
     3001 agcactgctg ccaccagcaa tttctctgct gtccgagacc tctgtttgtc ggaatgttcc
     3061 caacatgagg cctgtctcat caccactctg caaacccaac ctggggctgt gagatgtatg
     3121 ttctatgctg atactcaaag ctgcacacat agtctgcagg gtcagaactg ccgacttctg
     3181 cttcgtgaag aggccaccca catctaccgg aagccaggaa tctctctgct cagctatgag
     3241 gcatctgtac cttctgtgcc catttccacc catggccggc tgctgggcag gtcccaggcc
     3301 atccaggtgg gtacctcatg gaagcaagtg gaccagttcc ttggagttcc atatgctgcc
     3361 ccgcccctgg cagagaggcg cttccaggca ccagagccct tgaactggac aggctcctgg
     3421 gatgccagca agccaagggc cagctgctgg cagccaggca ccagaacatc cacgtctcct
     3481 ggagtcagtg aagattgttt gtatctcaat gtgttcatcc ctcagaatgt ggcccctaac
     3541 gcgtctgtgc tggtgttctt ccacaacacc atggacaggg aggagagtga aggatggccg
     3601 gctatcgacg gctccttctt ggctgctgtt ggcaacctca tcgtggtcac tgccagctac
     3661 cgagtgggtg tcttcggctt cctgagttct gggtccggag aggtgagtgg caactggggg
     3721 ctgctggacc aggtggcggc tctgacctgg gtgcagaccc acatccgagg atttggcggg
     3781 gaccctcggc gcgtgtccct ggcagcagac cgtggcgggg ctgatgtggc cagcatccac
     3841 cttctcacgg ccagggccac caactcccaa cttttccgga gagctgtgct gatgggaggc
     3901 tccgcactct ccccggccgc cgtcatcagc catgagaggg ctcagcagca ggcaattgct
     3961 ttggcaaagg aggtcagttg ccccatgtca tccagccaag aagtggtgtc ctgcctccgc
     4021 cagaagcctg ccaatgtcct caatgatgcc cagaccaagc tcttggccgt gagtggccct
     4081 ttccactact ggggtcctgt gatcgatggc cacttcctcc gtgagcctcc agccagagca
     4141 ctgaagaggt ctttatgggt agaggtcgat ctgctcattg ggagttctca ggacgacggg
     4201 ctcatcaaca gagcaaaggc tgtgaagcaa tttgaggaaa gtcaaggccg gaccagtagc
     4261 aaaacagcct tttaccaggc actgcagaat tctctgggtg gcgaggactc agatgcccgc
     4321 gtcgaggctg ctgctacatg gtattactct ctggagcact ccacggatga ctatgcctcc
     4381 ttctcccggg ctctggagaa tgccacccgg gactacttta tcatctgccc tataatcgac
     4441 atggccagtg cctgggcaaa gagggcccga ggaaacgtct tcatgtacca tgctcctgaa
     4501 aactacggcc atggcagcct ggagctgctg gcggatgttc agtttgcctt ggggcttccc
     4561 ttctacccag cctatgaggg gcagttttct ctggaggaga agagcctgtc gctgaaaatc
     4621 atgcagtact tttcccactt catcagatca ggaaatccca actaccctta tgagttctca
     4681 cggaaagtac ccacatttgc aaccccctgg cctgactttg taccccgtgc tggtggagag
     4741 aactacaagg agttcagtga gctgctcccc aatcgacagg gcctgaagaa agccgactgc
     4801 tccttctggt ccaagtacat ctcgtctctg aagacatctg cagatggagc caagggcggg
     4861 cagtcagcag agagtgaaga ggaggagttg acggctggat ctgggctaag agaagatctc
     4921 ctaagcctcc aggaaccagg ctctaagacc tacagcaagt gaccagccct tgagctcccc
     4981 aaaaacctca cccgaggctg cccactatgg tcatcttttt ctctaaaata gccacttacc
     5041 ttcaataaag tatctacatg cggtg
//