LOCUS       BC020543                3468 bp    mRNA    linear   HUM 09-JUN-2008
DEFINITION  Homo sapiens sedoheptulokinase, mRNA (cDNA clone MGC:21179
            IMAGE:4413491), complete cds.
ACCESSION   BC020543
VERSION     BC020543.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3468)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3468)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (03-JAN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 27 Row: h Column: 7
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 7019340.
FEATURES             Location/Qualifiers
     source          1..3468
                     /db_xref="H-InvDB:HIT000038811"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:21179 IMAGE:4413491"
                     /tissue_type="Liver, adenocarcinoma"
                     /clone_lib="NIH_MGC_90"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3468
                     /gene="SHPK"
                     /gene_synonym="SHK"
                     /db_xref="GeneID:23729"
                     /db_xref="HGNC:HGNC:1492"
                     /db_xref="MIM:605060"
     CDS             60..1496
                     /gene="SHPK"
                     /gene_synonym="SHK"
                     /codon_start=1
                     /product="sedoheptulokinase"
                     /protein_id="AAH20543.1"
                     /db_xref="GeneID:23729"
                     /db_xref="HGNC:HGNC:1492"
                     /db_xref="MIM:605060"
                     /translation="MAARPITLGIDLGTTSVKAALLRAAPDDPSGFAVLASCARAARA
                     EAAVESAVAGPQGREQDVSRILQALHECLAALPRPQLRSVVGIGVSGQMHGVVFWKTG
                     QGCEWTEGGITPVFEPRAVSHLVTWQDGRCSSEFLASLPQPKSHLSVATGFGCATIFW
                     LLKYRPEFLKSYDAAGTIHDYVVAMLCGLPRPLMSDQNAASWGYFNTQSQSWNVETLR
                     SSGFPVHLLPDIAEPGSVAGRTSHMWFEIPKGTQVGVALGDLQASVYSCMAQRTDAVL
                     NISTSVQLAASMPSGFQPAQTPDPTAPVAYFPYFNRTYLGVAASLNGGNVLATFVHML
                     VQWMADLGLEVEESTVYSRMIQAAVQQRDTHLTITPTVLGERHLPDQLASVTRISSSD
                     LSLGHVTRALCRGIVQNLHSMLPIQQLQEWGVERVMGSGSALSRNDVLKQEVQRAFPL
                     PMSFGQDVDAAVGAALVMLRRHLNQKES"
BASE COUNT          767 a          958 c          987 g          756 t
ORIGIN      
        1 agagcgcgga cttgtggggc cgctggctgc agactggagc tgcgcgggtc agggagataa
       61 tggctgcgcg gccgatcacc ctcggcattg acctgggcac cacatctgtg aaggcagctc
      121 tgctgagggc cgcgcccgac gacccatccg ggttcgcagt gctggcgagc tgtgcccgtg
      181 ctgcgcgggc agaggcggcg gtcgagagcg cggtggccgg gccccagggg cgggagcagg
      241 atgtgagtag aatcctccaa gccctacacg agtgccttgc tgcccttccc cgaccccagc
      301 tccggagcgt cgtgggcatc ggggtgtcgg gccagatgca tggagtcgtg ttttggaaaa
      361 caggccaagg ctgtgaatgg acagagggag ggattacccc ggtgttcgag ccccgagctg
      421 ttagccacct ggtcacgtgg caggatggcc gatgtagcag cgaattcctg gcctctctgc
      481 cccagccgaa gtctcatctc agtgtggcca cgggcttcgg ctgtgcaacc atcttctggc
      541 ttttgaaata tcgcccagag ttcctgaagt cctacgacgc agccggtacc atccacgact
      601 atgtggttgc catgctgtgt ggcttgccaa gacctctgat gtccgaccag aatgctgcca
      661 gctggggcta tttcaacacg cagagccaaa gctggaacgt agagacactg aggagctcgg
      721 gttttcctgt ccacctgctc ccagacatcg ccgagcctgg cagtgtggcg ggcagaactt
      781 cccacatgtg gtttgaaatc ccaaagggga cgcaggtggg agtggccttg ggtgatttac
      841 aggcctctgt ctattcctgc atggcccaga ggacagatgc agttctcaac atcagcacct
      901 cggttcagct ggcagcctcc atgccttcag gattccagcc tgcacagact ccagacccta
      961 cggccccagt cgcctacttc ccatacttca acaggaccta cctgggggtg gccgcgtcac
     1021 tcaacggggg caatgtgctg gccacgttcg tccacatgct ggttcagtgg atggcagatc
     1081 taggcctgga ggttgaagaa tccactgtgt attcacgcat gattcaggca gctgtgcagc
     1141 agagagatac ccacctgacc atcaccccga cagtgctggg ggagaggcac ctgccggacc
     1201 agctggcctc agtgaccaga atctcctcct ccgacctctc cctggggcac gtgacccggg
     1261 ctctgtgccg aggcattgtt cagaacctgc actccatgct tccgattcag cagctccagg
     1321 agtggggcgt ggagagggtg atgggcagtg ggagtgcgct gtccaggaat gacgtgctga
     1381 agcaggaggt gcagagggct ttccctttgc ccatgtcctt tgggcaggat gtggatgcag
     1441 ctgtcggggc agctctggtc atgctccgga gacacctcaa ccagaaggaa tcttagacag
     1501 caaactcttt cgccaaacga ctgctgtgaa ttttacctga ttaacattcc tgacaccatc
     1561 tgtgggtcat cctttccctg gaccgttcag tggacagctt tcaagcagtg cttgttgtga
     1621 ggtcccatct tggccaagaa cttaccttca gaacatactc taataatgca gccaggagcc
     1681 gtcagccaga tcccaaatga gtgccttccg aaattgaccc acctgggagc tatttacaaa
     1741 tgtccatgtg ggagagagag agcatgagag cacagtagcc cagcctgctg gtcagcaggc
     1801 tcatctgtgg ttcacctgta gacagagagc agatcaatgt gtacttcaga caccagaaag
     1861 tctggtggct ttggtcccaa gtgggaaaag agaactgccc catgcccagc ttgtgattat
     1921 cgtttttgga gacctgaagc ccacactcgg gtcgtatgga cttctggaaa agttcttgtc
     1981 tcctggactg aaccatgtga ccggaggccc ctttcctagt ctcatcctcc cctggctgca
     2041 gatgcttagc tgggccaggg attgacccaa gcgcgatgca gcaggcaggc tcagaagacg
     2101 atgcggggct gtgtgccggc cttcttgctg catgtactca gcctcaggag agcttgctgc
     2161 acccaggccg cccaggtctt cacagcacaa ctgcctggaa ggcaggttgc gagaaggaga
     2221 ggcggatggc atgagcagca agggggaccg atgctgtgca gctcacacca ctccagaacc
     2281 tgacaaggca ccagcaggac cccttgccag gagcatgtct gtgcagcagt gtttttgccc
     2341 ctgcacattc cagaagccct catgggaagg gatgcagcca ggcagactcc tgccagatgg
     2401 ggcaggtagt ttattcaaag agaactctgt atcccatagg cccaggctct cctttcgctt
     2461 ggcgtgggct ttgctggccc agtgtgtgct cctggctcag cagaaacatc catttgagtt
     2521 ggcatccctg tagggatccc agagcgttgt aagccttctt gtgattggta gggatggctg
     2581 tggggtggct tccaggaggg ggccaccatt gccgcatcta cttctagact cccaaaggag
     2641 cccaggctca ggcaggcctg gcccagagtc acgctggcaa ccacgagttt gggaagcagt
     2701 cgtattctct ctctctctct ctctctctca gtatccatga caggtatgaa acatattgtc
     2761 tctttataaa tgtcatttta caaattatgt gattatctgg aagctctaag atgagagcaa
     2821 atgcctgatc actctggcca aatgtcagat actaaagccc attcttggcc gggcatgttg
     2881 gctcccgcct gtaatcccag cactttggga agcccaagtg ggtgaatcac ctgaggtcag
     2941 gagttcaaga ccagcctgac caacatgggg ataccccgtc tctactaaaa atacaagccg
     3001 ggcgtggtgg cgcatgcctg taatcccagc tactcaggag gctgaggcag gaaaatcact
     3061 tgaactcggg aggcagaggt tgcagtgagc tgagatcgcg ccattgcact ccagcctggg
     3121 tgacagagca agactctgtc tcataaataa atacaaagcc cattcttcca gagtcttgtg
     3181 ccttaaataa aacacacctc tctgctgtgg gaagactgtg caatggcaca gccgcagagc
     3241 ttggtttggg aggttgaaat gctctgggga gaattcgtag atcatcctca gaaaagcctt
     3301 gccctggtgt tctaccagaa aaacgtctcc caatcaccca ggaaagctgt ccacagtagt
     3361 ccccccttat ccacggtgtc actttccatg ggttcagtta tctgcggtca accacggtct
     3421 gacaatatta aatggaaaat tcttcaaaca gttaaaaaaa aaaaaaaa
//