LOCUS BC020543 3468 bp mRNA linear HUM 09-JUN-2008
DEFINITION Homo sapiens sedoheptulokinase, mRNA (cDNA clone MGC:21179
IMAGE:4413491), complete cds.
ACCESSION BC020543
VERSION BC020543.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 3468)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 3468)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (03-JAN-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 27 Row: h Column: 7
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 7019340.
FEATURES Location/Qualifiers
source 1..3468
/db_xref="H-InvDB:HIT000038811"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:21179 IMAGE:4413491"
/tissue_type="Liver, adenocarcinoma"
/clone_lib="NIH_MGC_90"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..3468
/gene="SHPK"
/gene_synonym="SHK"
/db_xref="GeneID:23729"
/db_xref="HGNC:HGNC:1492"
/db_xref="MIM:605060"
CDS 60..1496
/gene="SHPK"
/gene_synonym="SHK"
/codon_start=1
/product="sedoheptulokinase"
/protein_id="AAH20543.1"
/db_xref="GeneID:23729"
/db_xref="HGNC:HGNC:1492"
/db_xref="MIM:605060"
/translation="MAARPITLGIDLGTTSVKAALLRAAPDDPSGFAVLASCARAARA
EAAVESAVAGPQGREQDVSRILQALHECLAALPRPQLRSVVGIGVSGQMHGVVFWKTG
QGCEWTEGGITPVFEPRAVSHLVTWQDGRCSSEFLASLPQPKSHLSVATGFGCATIFW
LLKYRPEFLKSYDAAGTIHDYVVAMLCGLPRPLMSDQNAASWGYFNTQSQSWNVETLR
SSGFPVHLLPDIAEPGSVAGRTSHMWFEIPKGTQVGVALGDLQASVYSCMAQRTDAVL
NISTSVQLAASMPSGFQPAQTPDPTAPVAYFPYFNRTYLGVAASLNGGNVLATFVHML
VQWMADLGLEVEESTVYSRMIQAAVQQRDTHLTITPTVLGERHLPDQLASVTRISSSD
LSLGHVTRALCRGIVQNLHSMLPIQQLQEWGVERVMGSGSALSRNDVLKQEVQRAFPL
PMSFGQDVDAAVGAALVMLRRHLNQKES"
BASE COUNT 767 a 958 c 987 g 756 t
ORIGIN
1 agagcgcgga cttgtggggc cgctggctgc agactggagc tgcgcgggtc agggagataa
61 tggctgcgcg gccgatcacc ctcggcattg acctgggcac cacatctgtg aaggcagctc
121 tgctgagggc cgcgcccgac gacccatccg ggttcgcagt gctggcgagc tgtgcccgtg
181 ctgcgcgggc agaggcggcg gtcgagagcg cggtggccgg gccccagggg cgggagcagg
241 atgtgagtag aatcctccaa gccctacacg agtgccttgc tgcccttccc cgaccccagc
301 tccggagcgt cgtgggcatc ggggtgtcgg gccagatgca tggagtcgtg ttttggaaaa
361 caggccaagg ctgtgaatgg acagagggag ggattacccc ggtgttcgag ccccgagctg
421 ttagccacct ggtcacgtgg caggatggcc gatgtagcag cgaattcctg gcctctctgc
481 cccagccgaa gtctcatctc agtgtggcca cgggcttcgg ctgtgcaacc atcttctggc
541 ttttgaaata tcgcccagag ttcctgaagt cctacgacgc agccggtacc atccacgact
601 atgtggttgc catgctgtgt ggcttgccaa gacctctgat gtccgaccag aatgctgcca
661 gctggggcta tttcaacacg cagagccaaa gctggaacgt agagacactg aggagctcgg
721 gttttcctgt ccacctgctc ccagacatcg ccgagcctgg cagtgtggcg ggcagaactt
781 cccacatgtg gtttgaaatc ccaaagggga cgcaggtggg agtggccttg ggtgatttac
841 aggcctctgt ctattcctgc atggcccaga ggacagatgc agttctcaac atcagcacct
901 cggttcagct ggcagcctcc atgccttcag gattccagcc tgcacagact ccagacccta
961 cggccccagt cgcctacttc ccatacttca acaggaccta cctgggggtg gccgcgtcac
1021 tcaacggggg caatgtgctg gccacgttcg tccacatgct ggttcagtgg atggcagatc
1081 taggcctgga ggttgaagaa tccactgtgt attcacgcat gattcaggca gctgtgcagc
1141 agagagatac ccacctgacc atcaccccga cagtgctggg ggagaggcac ctgccggacc
1201 agctggcctc agtgaccaga atctcctcct ccgacctctc cctggggcac gtgacccggg
1261 ctctgtgccg aggcattgtt cagaacctgc actccatgct tccgattcag cagctccagg
1321 agtggggcgt ggagagggtg atgggcagtg ggagtgcgct gtccaggaat gacgtgctga
1381 agcaggaggt gcagagggct ttccctttgc ccatgtcctt tgggcaggat gtggatgcag
1441 ctgtcggggc agctctggtc atgctccgga gacacctcaa ccagaaggaa tcttagacag
1501 caaactcttt cgccaaacga ctgctgtgaa ttttacctga ttaacattcc tgacaccatc
1561 tgtgggtcat cctttccctg gaccgttcag tggacagctt tcaagcagtg cttgttgtga
1621 ggtcccatct tggccaagaa cttaccttca gaacatactc taataatgca gccaggagcc
1681 gtcagccaga tcccaaatga gtgccttccg aaattgaccc acctgggagc tatttacaaa
1741 tgtccatgtg ggagagagag agcatgagag cacagtagcc cagcctgctg gtcagcaggc
1801 tcatctgtgg ttcacctgta gacagagagc agatcaatgt gtacttcaga caccagaaag
1861 tctggtggct ttggtcccaa gtgggaaaag agaactgccc catgcccagc ttgtgattat
1921 cgtttttgga gacctgaagc ccacactcgg gtcgtatgga cttctggaaa agttcttgtc
1981 tcctggactg aaccatgtga ccggaggccc ctttcctagt ctcatcctcc cctggctgca
2041 gatgcttagc tgggccaggg attgacccaa gcgcgatgca gcaggcaggc tcagaagacg
2101 atgcggggct gtgtgccggc cttcttgctg catgtactca gcctcaggag agcttgctgc
2161 acccaggccg cccaggtctt cacagcacaa ctgcctggaa ggcaggttgc gagaaggaga
2221 ggcggatggc atgagcagca agggggaccg atgctgtgca gctcacacca ctccagaacc
2281 tgacaaggca ccagcaggac cccttgccag gagcatgtct gtgcagcagt gtttttgccc
2341 ctgcacattc cagaagccct catgggaagg gatgcagcca ggcagactcc tgccagatgg
2401 ggcaggtagt ttattcaaag agaactctgt atcccatagg cccaggctct cctttcgctt
2461 ggcgtgggct ttgctggccc agtgtgtgct cctggctcag cagaaacatc catttgagtt
2521 ggcatccctg tagggatccc agagcgttgt aagccttctt gtgattggta gggatggctg
2581 tggggtggct tccaggaggg ggccaccatt gccgcatcta cttctagact cccaaaggag
2641 cccaggctca ggcaggcctg gcccagagtc acgctggcaa ccacgagttt gggaagcagt
2701 cgtattctct ctctctctct ctctctctca gtatccatga caggtatgaa acatattgtc
2761 tctttataaa tgtcatttta caaattatgt gattatctgg aagctctaag atgagagcaa
2821 atgcctgatc actctggcca aatgtcagat actaaagccc attcttggcc gggcatgttg
2881 gctcccgcct gtaatcccag cactttggga agcccaagtg ggtgaatcac ctgaggtcag
2941 gagttcaaga ccagcctgac caacatgggg ataccccgtc tctactaaaa atacaagccg
3001 ggcgtggtgg cgcatgcctg taatcccagc tactcaggag gctgaggcag gaaaatcact
3061 tgaactcggg aggcagaggt tgcagtgagc tgagatcgcg ccattgcact ccagcctggg
3121 tgacagagca agactctgtc tcataaataa atacaaagcc cattcttcca gagtcttgtg
3181 ccttaaataa aacacacctc tctgctgtgg gaagactgtg caatggcaca gccgcagagc
3241 ttggtttggg aggttgaaat gctctgggga gaattcgtag atcatcctca gaaaagcctt
3301 gccctggtgt tctaccagaa aaacgtctcc caatcaccca ggaaagctgt ccacagtagt
3361 ccccccttat ccacggtgtc actttccatg ggttcagtta tctgcggtca accacggtct
3421 gacaatatta aatggaaaat tcttcaaaca gttaaaaaaa aaaaaaaa
//