LOCUS BC020543 3468 bp mRNA linear HUM 09-JUN-2008 DEFINITION Homo sapiens sedoheptulokinase, mRNA (cDNA clone MGC:21179 IMAGE:4413491), complete cds. ACCESSION BC020543 VERSION BC020543.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3468) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3468) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (03-JAN-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 27 Row: h Column: 7 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 7019340. FEATURES Location/Qualifiers source 1..3468 /db_xref="H-InvDB:HIT000038811" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:21179 IMAGE:4413491" /tissue_type="Liver, adenocarcinoma" /clone_lib="NIH_MGC_90" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..3468 /gene="SHPK" /gene_synonym="SHK" /db_xref="GeneID:23729" /db_xref="HGNC:HGNC:1492" /db_xref="MIM:605060" CDS 60..1496 /gene="SHPK" /gene_synonym="SHK" /codon_start=1 /product="sedoheptulokinase" /protein_id="AAH20543.1" /db_xref="GeneID:23729" /db_xref="HGNC:HGNC:1492" /db_xref="MIM:605060" /translation="MAARPITLGIDLGTTSVKAALLRAAPDDPSGFAVLASCARAARA EAAVESAVAGPQGREQDVSRILQALHECLAALPRPQLRSVVGIGVSGQMHGVVFWKTG QGCEWTEGGITPVFEPRAVSHLVTWQDGRCSSEFLASLPQPKSHLSVATGFGCATIFW LLKYRPEFLKSYDAAGTIHDYVVAMLCGLPRPLMSDQNAASWGYFNTQSQSWNVETLR SSGFPVHLLPDIAEPGSVAGRTSHMWFEIPKGTQVGVALGDLQASVYSCMAQRTDAVL NISTSVQLAASMPSGFQPAQTPDPTAPVAYFPYFNRTYLGVAASLNGGNVLATFVHML VQWMADLGLEVEESTVYSRMIQAAVQQRDTHLTITPTVLGERHLPDQLASVTRISSSD LSLGHVTRALCRGIVQNLHSMLPIQQLQEWGVERVMGSGSALSRNDVLKQEVQRAFPL PMSFGQDVDAAVGAALVMLRRHLNQKES" BASE COUNT 767 a 958 c 987 g 756 t ORIGIN 1 agagcgcgga cttgtggggc cgctggctgc agactggagc tgcgcgggtc agggagataa 61 tggctgcgcg gccgatcacc ctcggcattg acctgggcac cacatctgtg aaggcagctc 121 tgctgagggc cgcgcccgac gacccatccg ggttcgcagt gctggcgagc tgtgcccgtg 181 ctgcgcgggc agaggcggcg gtcgagagcg cggtggccgg gccccagggg cgggagcagg 241 atgtgagtag aatcctccaa gccctacacg agtgccttgc tgcccttccc cgaccccagc 301 tccggagcgt cgtgggcatc ggggtgtcgg gccagatgca tggagtcgtg ttttggaaaa 361 caggccaagg ctgtgaatgg acagagggag ggattacccc ggtgttcgag ccccgagctg 421 ttagccacct ggtcacgtgg caggatggcc gatgtagcag cgaattcctg gcctctctgc 481 cccagccgaa gtctcatctc agtgtggcca cgggcttcgg ctgtgcaacc atcttctggc 541 ttttgaaata tcgcccagag ttcctgaagt cctacgacgc agccggtacc atccacgact 601 atgtggttgc catgctgtgt ggcttgccaa gacctctgat gtccgaccag aatgctgcca 661 gctggggcta tttcaacacg cagagccaaa gctggaacgt agagacactg aggagctcgg 721 gttttcctgt ccacctgctc ccagacatcg ccgagcctgg cagtgtggcg ggcagaactt 781 cccacatgtg gtttgaaatc ccaaagggga cgcaggtggg agtggccttg ggtgatttac 841 aggcctctgt ctattcctgc atggcccaga ggacagatgc agttctcaac atcagcacct 901 cggttcagct ggcagcctcc atgccttcag gattccagcc tgcacagact ccagacccta 961 cggccccagt cgcctacttc ccatacttca acaggaccta cctgggggtg gccgcgtcac 1021 tcaacggggg caatgtgctg gccacgttcg tccacatgct ggttcagtgg atggcagatc 1081 taggcctgga ggttgaagaa tccactgtgt attcacgcat gattcaggca gctgtgcagc 1141 agagagatac ccacctgacc atcaccccga cagtgctggg ggagaggcac ctgccggacc 1201 agctggcctc agtgaccaga atctcctcct ccgacctctc cctggggcac gtgacccggg 1261 ctctgtgccg aggcattgtt cagaacctgc actccatgct tccgattcag cagctccagg 1321 agtggggcgt ggagagggtg atgggcagtg ggagtgcgct gtccaggaat gacgtgctga 1381 agcaggaggt gcagagggct ttccctttgc ccatgtcctt tgggcaggat gtggatgcag 1441 ctgtcggggc agctctggtc atgctccgga gacacctcaa ccagaaggaa tcttagacag 1501 caaactcttt cgccaaacga ctgctgtgaa ttttacctga ttaacattcc tgacaccatc 1561 tgtgggtcat cctttccctg gaccgttcag tggacagctt tcaagcagtg cttgttgtga 1621 ggtcccatct tggccaagaa cttaccttca gaacatactc taataatgca gccaggagcc 1681 gtcagccaga tcccaaatga gtgccttccg aaattgaccc acctgggagc tatttacaaa 1741 tgtccatgtg ggagagagag agcatgagag cacagtagcc cagcctgctg gtcagcaggc 1801 tcatctgtgg ttcacctgta gacagagagc agatcaatgt gtacttcaga caccagaaag 1861 tctggtggct ttggtcccaa gtgggaaaag agaactgccc catgcccagc ttgtgattat 1921 cgtttttgga gacctgaagc ccacactcgg gtcgtatgga cttctggaaa agttcttgtc 1981 tcctggactg aaccatgtga ccggaggccc ctttcctagt ctcatcctcc cctggctgca 2041 gatgcttagc tgggccaggg attgacccaa gcgcgatgca gcaggcaggc tcagaagacg 2101 atgcggggct gtgtgccggc cttcttgctg catgtactca gcctcaggag agcttgctgc 2161 acccaggccg cccaggtctt cacagcacaa ctgcctggaa ggcaggttgc gagaaggaga 2221 ggcggatggc atgagcagca agggggaccg atgctgtgca gctcacacca ctccagaacc 2281 tgacaaggca ccagcaggac cccttgccag gagcatgtct gtgcagcagt gtttttgccc 2341 ctgcacattc cagaagccct catgggaagg gatgcagcca ggcagactcc tgccagatgg 2401 ggcaggtagt ttattcaaag agaactctgt atcccatagg cccaggctct cctttcgctt 2461 ggcgtgggct ttgctggccc agtgtgtgct cctggctcag cagaaacatc catttgagtt 2521 ggcatccctg tagggatccc agagcgttgt aagccttctt gtgattggta gggatggctg 2581 tggggtggct tccaggaggg ggccaccatt gccgcatcta cttctagact cccaaaggag 2641 cccaggctca ggcaggcctg gcccagagtc acgctggcaa ccacgagttt gggaagcagt 2701 cgtattctct ctctctctct ctctctctca gtatccatga caggtatgaa acatattgtc 2761 tctttataaa tgtcatttta caaattatgt gattatctgg aagctctaag atgagagcaa 2821 atgcctgatc actctggcca aatgtcagat actaaagccc attcttggcc gggcatgttg 2881 gctcccgcct gtaatcccag cactttggga agcccaagtg ggtgaatcac ctgaggtcag 2941 gagttcaaga ccagcctgac caacatgggg ataccccgtc tctactaaaa atacaagccg 3001 ggcgtggtgg cgcatgcctg taatcccagc tactcaggag gctgaggcag gaaaatcact 3061 tgaactcggg aggcagaggt tgcagtgagc tgagatcgcg ccattgcact ccagcctggg 3121 tgacagagca agactctgtc tcataaataa atacaaagcc cattcttcca gagtcttgtg 3181 ccttaaataa aacacacctc tctgctgtgg gaagactgtg caatggcaca gccgcagagc 3241 ttggtttggg aggttgaaat gctctgggga gaattcgtag atcatcctca gaaaagcctt 3301 gccctggtgt tctaccagaa aaacgtctcc caatcaccca ggaaagctgt ccacagtagt 3361 ccccccttat ccacggtgtc actttccatg ggttcagtta tctgcggtca accacggtct 3421 gacaatatta aatggaaaat tcttcaaaca gttaaaaaaa aaaaaaaa //