LOCUS       AB209159                5065 bp    mRNA    linear   HUM 17-NOV-2007
DEFINITION  Homo sapiens mRNA for Thyroglobulin precursor variant protein.
ACCESSION   AB209159
VERSION     AB209159.1
KEYWORDS    FLI_CDNA.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 5065)
  AUTHORS   Totoki,Y., Toyoda,A., Takeda,T., Sakaki,Y., Tanaka,A.,
            Yokoyama,S., Ohara,O., Nagase,T. and Kikuno,R.F.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAR-2005) to the DDBJ/EMBL/GenBank databases.
            Contact:Osamu Ohara
            Kazusa DNA Research Institute, Department of Human Gene Research;
            2-6-7   Kazusa-kamatari, Kisarazu, Chiba 292-0818, Japan
            URL    :http://protein.gsc.riken.go.jp/
REFERENCE   2
  AUTHORS   Totoki,Y., Toyoda,A., Takeda,T., Sakaki,Y., Tanaka,A.,
            Yokoyama,S., Ohara,O., Nagase,T. and Kikuno,R.F.
  TITLE     Homo sapiens protein coding cDNA
  JOURNAL   Published Only in Database(2005)
COMMENT     This work was supported in part by the National Project on Protein
            Structural and Functional Analysis , Ministry of Education,
            Culture , Sports, Science and Technology of Japan.
            Totoki Y, Toyoda A, Takeda T, Sakaki Y, Tanaka A, Yokoyama S.
            RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi,
            Yokohama 230-0045, Japan. 
            URL: http://protein.gsc.riken.go.jp/
FEATURES             Location/Qualifiers
     source          1..5065
                     /clone="aj00807"
                     /db_xref="H-InvDB:HIT000329094"
                     /db_xref="taxon:9606"
                     /mol_type="mRNA"
                     /note="this clone is also named as hsk003000858"
                     /note="vector:pBluescriptII SK plus"
                     /organism="Homo sapiens"
                     /tissue_type="brain"
     CDS             <238..4962
                     /codon_start=1
                     /inference="non-experimental evidence, no additional
                     details recorded"
                     /note="Start codon is not identified."
                     /product="Thyroglobulin precursor variant"
                     /protein_id="BAD92396.1"
                     /translation="IPRKPISKRPVRPSLPRSPRCPLPFNASEVVGGTILCETISGPT
                     GSAMQQCQLLCRQGSWSVFPPGPLICSLESGRWESQLPQPRACQRPQLWQTIQTQGHF
                     QLQLPPGKMCSADYAGLLQTFQVFILDELTARGFCQIQVKTFGTLVSIPVCNNSSVQV
                     GCLTRERLGVNVTWKSRLEDIPVASLPDLHDIERALVGKDLLGRFTDLIQSGSFQLHL
                     DSKTFPAETIRFLQGDHFGTSPRTWFGCSEGFYQVLTSEASQDGLGCVKCPEGSYSQD
                     EECIPCPVGFYQEQAGSLACVPCPVGRTTISAGAFSQTHCVTDCQRNEAGLQCDQNGQ
                     YRASQKDRGSGKAFCVDGEGRRLPWWETEAPLEDSQCLMMQKFEKVPESKVIFDANAP
                     VAVRSKVPDSEFPVMQCLTDCTEDEACSFFTVSTTEPEISCDFYAWTSDNVACMTSDQ
                     KRDALGNSKATSFGSLRCQVKVRSHGQDSPAVYLKKGQGSTTTLQKRFEPTGFQNMLS
                     GLYNPIVFSASGANLTDAHLFCLLACDRDLCCDGFVLTQVQGGAIICGLLSSPSVLLC
                     NVKDWMDPSEAWANATCPGVTYDQESHQVILRLGDQEFIKSLTPLEGTQDTFTNFQQV
                     YLWKDSDMGSRPESMGCRKNTVPRPASPTEAGLTTELFSPVDLNQVIVNGNQSLSSQK
                     HWLFKHLFSAQQANLWCLSRCVQEHSFCQLAEITESASLYFTCTLYPEAQVCDDIMES
                     NAQGCRLILPQMPKALFRKKVILEDKVKNFYTRLPFQKLTGISIRNKVPMSEKSISNG
                     FFECERRCDADPCCTGFGFLNVSQLKGGEVTCLTLNSLGIQMCSEENGGAWRILDCGS
                     PDIEVHTYPFGWYQKPIAQNNAPSFCPLVVLPSLTEKVSLDSWQSLALSSVVVDPSIR
                     HFDVAHVSTAATSNFSAVRDLCLSECSQHEACLITTLQTQPGAVRCMFYADTQSCTHS
                     LQGQNCRLLLREEATHIYRKPGISLLSYEASVPSVPISTHGRLLGRSQAIQVGTSWKQ
                     VDQFLGVPYAAPPLAERRFQAPEPLNWTGSWDASKPRASCWQPGTRTSTSPGVSEDCL
                     YLNVFIPQNVAPNASVLVFFHNTMDREESEGWPAIDGSFLAAVGNLIVVTASYRVGVF
                     GFLSSGSGEVSGNWGLLDQVAALTWVQTHIRGFGGDPRRVSLAADRGGADVASIHLLT
                     ARATNSQLFRRAVLMGGSALSPAAVISHERAQQQAIALAKEVSCPMSSSQEVVSCLRQ
                     KPANVLNDAQTKLLAVSGPFHYWGPVIDGHFLREPPARALKRSLWVEVDLLIGSSQDD
                     GLINRAKAVKQFEESQGRTSSKTAFYQALQNSLGGEDSDARVEAAATWYYSLEHSTDD
                     YASFSRALENATRDYFIICPIIDMASAWAKRARGNVFMYHAPENYGHGSLELLADVQF
                     ALGLPFYPAYEGQFSLEEKSLSLKIMQYFSHFIRSGNPNYPYEFSRKVPTFATPWPDF
                     VPRAGGENYKEFSELLPNRQGLKKADCSFWSKYISSLKTSADGAKGGQSAESEEEELT
                     AGSGLREDLLSLQEPGSKTYSK"
BASE COUNT         1129 a         1389 c         1366 g         1181 t
ORIGIN      
        1 ctcggctgga gcatccgccc ttctgcggtc gggcccctac atgccacagt gtgatgcgtt
       61 tggaagttgg gagcctgtgc agtgccacgc tgggactggg cactgctggt gtgtagatga
      121 gaaaggaggg ttcatccctg gctcactgac tgcccgctct ctgcagattc cacagtgccc
      181 gacaacctgc gagaaatctc gaaccagtgg gctgctttcc agttggaaac aggctagatc
      241 ccaagaaaac ccatctccaa aagacctgtt cgtcccagcc tgcctagaag cccgcggtgt
      301 ccgctgccat tcaacgcgtc ggaggtggtt ggtggaacaa tcctgtgtga gacaatctcg
      361 ggccccacag gctctgccat gcagcagtgc caattgctgt gccgccaggg ctcctggagc
      421 gtgtttccac cagggccatt gatatgtagc ctggagagcg gacgctggga gtcacagctg
      481 cctcagcccc gggcctgcca acggccccag ctgtggcaga ccatccagac ccaagggcac
      541 tttcagctcc agctcccgcc gggcaagatg tgcagtgctg actacgcggg tttgctgcag
      601 actttccagg ttttcatatt ggatgagctg acagcccgcg gcttctgcca gatccaggtg
      661 aagacttttg gcaccctggt ttccattcct gtctgcaaca actcctctgt gcaggtgggt
      721 tgtctgacca gggagcgttt aggagtgaat gttacatgga aatcacggct tgaggacatc
      781 ccagtggctt ctcttcctga cttacatgac attgagagag ccttggtggg caaggatctc
      841 cttgggcgct tcacagatct gatccagagt ggctcattcc agcttcatct ggactccaag
      901 acgttcccag cggaaaccat ccgcttcctc caaggggacc actttggcac ctctcccagg
      961 acatggtttg ggtgctcgga aggattctac caagtcttga caagtgaggc cagtcaggac
     1021 ggactgggat gcgttaagtg tcctgaagga agctattccc aagatgagga atgcattcct
     1081 tgtcctgttg gattctacca agaacaggca gggagcttgg cctgtgtccc atgtcctgtg
     1141 ggcagaacga ccatttctgc cggagctttc agccagactc actgtgtcac tgactgtcag
     1201 aggaacgaag caggcctgca atgtgaccag aatggccagt atcgagccag ccagaaggac
     1261 aggggcagtg ggaaggcctt ctgtgtggac ggcgaggggc ggaggctgcc atggtgggaa
     1321 acagaggccc ctcttgagga ctcacagtgt ttgatgatgc agaagtttga gaaggttcca
     1381 gaatcaaagg tgatcttcga cgccaatgct cctgtggctg tcagatccaa agttcctgat
     1441 tctgagttcc ccgtgatgca gtgcttgaca gattgcacag aggacgaggc ctgcagcttc
     1501 ttcaccgtgt ccacgacgga gccagagatt tcctgtgatt tctatgcttg gacaagtgac
     1561 aatgttgcct gcatgacttc tgaccagaaa cgagatgcac tggggaactc aaaggccacc
     1621 agctttggaa gtcttcgctg ccaggtgaaa gtgaggagcc atggtcaaga ttctccagct
     1681 gtgtatttga aaaagggcca aggatccacc acaacacttc agaaacgctt tgaacccact
     1741 ggtttccaaa acatgctttc tggattgtac aaccccattg tgttctcagc ctcaggagcc
     1801 aatctaaccg atgctcacct cttctgtctt cttgcatgcg accgtgatct gtgttgcgat
     1861 ggcttcgtcc tcacacaggt tcaaggaggt gccatcatct gtgggttgct gagctcaccc
     1921 agtgtcctgc tttgtaatgt caaagactgg atggatccct ctgaagcctg ggctaatgct
     1981 acatgtcctg gtgtgacata tgaccaggag agccaccagg tgatattgcg tcttggagac
     2041 caggagttca tcaagagtct gacaccctta gaaggaactc aagacacctt taccaatttt
     2101 cagcaggttt atctctggaa agattctgac atggggtctc ggcctgagtc tatgggatgt
     2161 agaaaaaaca cagtgccaag gccagcatct ccaacagaag caggtttgac aacagaactt
     2221 ttctcccctg tggacctcaa ccaggtcatt gtcaatggaa atcaatcact atccagccag
     2281 aagcactggc ttttcaagca cctgttttca gcccagcagg caaacctatg gtgcctttct
     2341 cgttgtgtgc aggagcactc tttctgtcag ctcgcagaga taacagagag tgcatccttg
     2401 tacttcacct gcaccctcta cccagaggca caggtgtgtg atgacatcat ggagtccaat
     2461 gcccagggct gcagactgat cctgcctcag atgccaaagg ccctgttccg gaagaaagtt
     2521 atactggaag ataaagtgaa gaacttttac actcgcctgc cgttccaaaa actgacgggg
     2581 atatccatta gaaataaagt gcccatgtct gaaaaatcta tttctaatgg gttctttgaa
     2641 tgtgaacgac ggtgcgatgc ggacccatgc tgcactggct ttggatttct aaatgtttcc
     2701 cagttaaaag gaggagaggt gacatgtctc actctgaaca gcttgggaat tcagatgtgc
     2761 agtgaggaga atggaggagc ctggcgcatt ttggactgtg gctctcctga cattgaagtc
     2821 cacacctatc ccttcggatg gtaccagaag cccattgctc aaaataatgc tcccagtttt
     2881 tgccctttgg ttgttctgcc ttccctcaca gagaaagtgt ctctggactc gtggcagtcc
     2941 ctggccctct cttcagtggt tgttgatcca tccattaggc actttgatgt tgcccatgtc
     3001 agcactgctg ccaccagcaa tttctctgct gtccgagacc tctgtttgtc ggaatgttcc
     3061 caacatgagg cctgtctcat caccactctg caaacccaac ctggggctgt gagatgtatg
     3121 ttctatgctg atactcaaag ctgcacacat agtctgcagg gtcagaactg ccgacttctg
     3181 cttcgtgaag aggccaccca catctaccgg aagccaggaa tctctctgct cagctatgag
     3241 gcatctgtac cttctgtgcc catttccacc catggccggc tgctgggcag gtcccaggcc
     3301 atccaggtgg gtacctcatg gaagcaagtg gaccagttcc ttggagttcc atatgctgcc
     3361 ccgcccctgg cagagaggcg cttccaggca ccagagccct tgaactggac aggctcctgg
     3421 gatgccagca agccaagggc cagctgctgg cagccaggca ccagaacatc cacgtctcct
     3481 ggagtcagtg aagattgttt gtatctcaat gtgttcatcc ctcagaatgt ggcccctaac
     3541 gcgtctgtgc tggtgttctt ccacaacacc atggacaggg aggagagtga aggatggccg
     3601 gctatcgacg gctccttctt ggctgctgtt ggcaacctca tcgtggtcac tgccagctac
     3661 cgagtgggtg tcttcggctt cctgagttct gggtccggag aggtgagtgg caactggggg
     3721 ctgctggacc aggtggcggc tctgacctgg gtgcagaccc acatccgagg atttggcggg
     3781 gaccctcggc gcgtgtccct ggcagcagac cgtggcgggg ctgatgtggc cagcatccac
     3841 cttctcacgg ccagggccac caactcccaa cttttccgga gagctgtgct gatgggaggc
     3901 tccgcactct ccccggccgc cgtcatcagc catgagaggg ctcagcagca ggcaattgct
     3961 ttggcaaagg aggtcagttg ccccatgtca tccagccaag aagtggtgtc ctgcctccgc
     4021 cagaagcctg ccaatgtcct caatgatgcc cagaccaagc tcttggccgt gagtggccct
     4081 ttccactact ggggtcctgt gatcgatggc cacttcctcc gtgagcctcc agccagagca
     4141 ctgaagaggt ctttatgggt agaggtcgat ctgctcattg ggagttctca ggacgacggg
     4201 ctcatcaaca gagcaaaggc tgtgaagcaa tttgaggaaa gtcaaggccg gaccagtagc
     4261 aaaacagcct tttaccaggc actgcagaat tctctgggtg gcgaggactc agatgcccgc
     4321 gtcgaggctg ctgctacatg gtattactct ctggagcact ccacggatga ctatgcctcc
     4381 ttctcccggg ctctggagaa tgccacccgg gactacttta tcatctgccc tataatcgac
     4441 atggccagtg cctgggcaaa gagggcccga ggaaacgtct tcatgtacca tgctcctgaa
     4501 aactacggcc atggcagcct ggagctgctg gcggatgttc agtttgcctt ggggcttccc
     4561 ttctacccag cctatgaggg gcagttttct ctggaggaga agagcctgtc gctgaaaatc
     4621 atgcagtact tttcccactt catcagatca ggaaatccca actaccctta tgagttctca
     4681 cggaaagtac ccacatttgc aaccccctgg cctgactttg taccccgtgc tggtggagag
     4741 aactacaagg agttcagtga gctgctcccc aatcgacagg gcctgaagaa agccgactgc
     4801 tccttctggt ccaagtacat ctcgtctctg aagacatctg cagatggagc caagggcggg
     4861 cagtcagcag agagtgaaga ggaggagttg acggctggat ctgggctaag agaagatctc
     4921 ctaagcctcc aggaaccagg ctctaagacc tacagcaagt gaccagccct tgagctcccc
     4981 aaaaacctca cccgaggctg cccactatgg tcatcttttt ctctaaaata gccacttacc
     5041 ttcaataaag tatctacatg cggtg
//