LOCUS BC018723 2496 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens carbohydrate (N-acetylgalactosamine 4-0) sulfotransferase 8, mRNA (cDNA clone MGC:31814 IMAGE:4137487), complete cds. ACCESSION BC018723 VERSION BC018723.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2496) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2496) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (07-DEC-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 40 Row: c Column: 19 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 21361615. FEATURES Location/Qualifiers source 1..2496 /db_xref="H-InvDB:HIT000038422" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:31814 IMAGE:4137487" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2496 /gene="CHST8" /db_xref="GeneID:64377" /db_xref="HGNC:HGNC:15993" /db_xref="MIM:610190" CDS 759..2033 /gene="CHST8" /codon_start=1 /product="carbohydrate (N-acetylgalactosamine 4-0) sulfotransferase 8" /protein_id="AAH18723.1" /db_xref="GeneID:64377" /db_xref="HGNC:HGNC:15993" /db_xref="MIM:610190" /translation="MTLRPGTMRLACMFSSILLFGAAGLLLFISLQDPTELAPQQVPG IKFNIRPRQPHHDLPPGGSQDGDLKEPTERVTRDLSSGAPRGRNLPAPDQPQPPLQRG TRLRLRQRRRRLLIKKMPAAATIPANSSDAPFIRPGPGTLDGRWVSLHRSQQERKRVM QEACAKYRASSSRRAVTPRHVSRIFVEDRHRVLYCEVPKAGCSNWKRVLMVLAGLASS TADIQHNTVHYGSALKRLDTFDRQGILHRLSTYTKMLFVREPFERLVSAFRDKFEHPN SYYHPVFGKAILARYRANASREALRTGSGVRFPEFVQYLLDVHRPVGMDIHWDHVSRL CSPCLIDYDFVGKFESMEDDANFFLSLIRAPRNLTFPRFKDRHSQEARTTARIAHQYF AQLSALQRQRTYDFYYMDYLMFNYSKPFADLY" BASE COUNT 483 a 830 c 759 g 424 t ORIGIN 1 gggactcgtc ccagagtttg ctgccgccgc cgccgctgcc attagagcgg ttttaattag 61 ctgcaggatt tttaatgagc tgaaatcaag aagagcccca tctcagaagt aagcgagagc 121 cgcgcgcagg gaagaactag tctgacaaaa cagaaggagg tggtgctctc gggggactgg 181 gctgtaggct cccaccgatc gatcggcgac gagcagatga gttcccgagg ccaggaggct 241 gaggagggaa gccagggctc agagaccagc gcccggtgca gagtccccgc tgctgccacc 301 gcggcagaag gcacggtgcc ggattcaagg tgacggcagt gcctggctcc attctcctca 361 agttgccata gcagcctctc tcccaacaag cagcttctcc gggagccctc agccgggagg 421 gacgtcattg ttctctcctg cccgtgtggt gcaaggagac ccaccggaca gagaactttc 481 ccgagcccgg ggtccgctga ccacgtgggt gcacccatgc tgcctggagc caggcacagg 541 gacaggtccc cacggccaca gggaatcctc tgggcagctg aggggagcgt ccaggcccag 601 aagcagctgc agccaagcgt gtccttggag ccgtccatgc gtctgtccgc ctgctgccgg 661 tcgccactgg agatctcggc ctgatggacg cctggtgtgg acgatgaggg aagaacgtgc 721 cccccacacc caagaggtga cccctgagcc agccccggat gaccctgcga cctggaacaa 781 tgcggctggc ctgcatgttc tcttccatcc tgctgttcgg agctgcaggc ctcctcctct 841 tcatcagcct gcaggaccct acggagctcg ccccccagca ggtgccagga ataaagttca 901 acatcaggcc aaggcagccc caccacgacc tcccaccagg cggctcccag gatggtgact 961 tgaaggaacc cacagagagg gtcactcggg acttatccag tggggccccg aggggccgca 1021 acctgccagc gcctgaccag cctcaacccc cgctgcagag gggaacccgt ctgcggctcc 1081 gccagcgccg tcgccgtctg ctcatcaaga aaatgccagc tgcggcgacc atcccggcca 1141 acagctcgga cgcgcccttc atccggccgg gacccgggac gctggatggc cgctgggtca 1201 gcctgcaccg gagccagcag gagcgcaagc gggtgatgca ggaggcctgc gccaagtacc 1261 gggcgagcag cagccgccgg gccgtcacgc cccgccacgt gtcccgtatc ttcgtggagg 1321 accgccaccg cgtgctctac tgcgaggtgc ccaaggccgg ctgctccaat tggaagcggg 1381 tgctcatggt gctggccggc ctggcctcgt ccactgccga catccagcac aacaccgtcc 1441 actatggcag cgctctcaag cgcctggaca ccttcgaccg ccagggtatc ttgcaccgtc 1501 tcagcaccta caccaagatg ctctttgtcc gcgagccctt cgagaggctg gtgtccgcct 1561 tccgcgacaa gtttgagcac cccaacagct actatcaccc ggtcttcggc aaggccatcc 1621 tggcccggta ccgcgccaat gcctctcggg aggccctgcg gaccggctct ggggtgcgtt 1681 ttcccgagtt cgtccagtac ctgctggacg tgcaccggcc cgtggggatg gacattcact 1741 gggaccatgt cagccggctc tgcagcccct gcctcatcga ctacgatttc gtaggcaagt 1801 tcgagagcat ggaggacgat gccaacttct tcctgagcct catccgcgcg ccgcggaacc 1861 tgaccttccc ccggttcaag gaccggcact cgcaggaggc gcggaccaca gcgaggatcg 1921 cccaccagta cttcgcccaa ctctcggccc tgcaaaggca gcgcacctac gacttctact 1981 acatggatta cctgatgttc aactattcca agccctttgc agatctgtac tgaggggcgc 2041 cgcagctggc cggggccgcc ctgccccggt cactcacctg tgctcccggg catcctcctg 2101 tccctggctc ctcatcctgg gagcaacagg gctctgagga cgtgaggagc catcgctgtg 2161 ggaggcagca ggccccgggt ggggggcaga ggcgcccagc cttggatggg gaccccagcc 2221 cctggcctgt acctgtttcc tcattccttg gctgagggag aggctgagaa ctgggcagac 2281 acccctggag ctcagccgac agttttgatg agcagggaag tctgaggccc agaggacggg 2341 gggcccagcg gtaagggatg tcccgcactc ccttagccat tgccttggac caaaccacgt 2401 ggtttgcagc ttttctacga gccagggggg aggttccctt ggattaaggt tccaaataaa 2461 gcacatggtt tccagagcaa aaaaaaaaaa aaaaaa //