LOCUS BC045537 2097 bp mRNA linear HUM 17-JUL-2006 DEFINITION Homo sapiens carbohydrate (N-acetylglucosamine 6-O) sulfotransferase 7, mRNA (cDNA clone MGC:57153 IMAGE:5261787), complete cds. ACCESSION BC045537 VERSION BC045537.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2097) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2097) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (31-JAN-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 106 Row: h Column: 11 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 23238227. FEATURES Location/Qualifiers source 1..2097 /db_xref="H-InvDB:HIT000052928" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:57153 IMAGE:5261787" /tissue_type="Brain, hippocampus" /clone_lib="NIH_MGC_95" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2097 /gene="CHST7" /gene_synonym="C6ST-2" /db_xref="GeneID:56548" /db_xref="HGNC:HGNC:13817" /db_xref="MIM:300375" CDS 149..1609 /gene="CHST7" /gene_synonym="C6ST-2" /codon_start=1 /product="carbohydrate (N-acetylglucosamine 6-O) sulfotransferase 7" /protein_id="AAH45537.1" /db_xref="GeneID:56548" /db_xref="HGNC:HGNC:13817" /db_xref="MIM:300375" /translation="MKGRRRRRREYCKFALLLVLYTLVLLLVPSVLDGGRDGDKGAEH CPGLQRSLGVWSLEAAAAGEREQGAEARAAEEGGANQSPRFPSNLSGAVGEAVSREKQ HIYVHATWRTGSSFLGELFNQHPDVFYLYEPMWHLWQALYPGDAESLQGALRDMLRSL FRCDFSVLRLYAPPGDPAARAPDTANLTTAALFRWRTNKVICSPPLCPGAPRARAEVG LVEDTACERSCPPVAIRALEAECRKYPVVVIKDVRLLDLGVLVPLLRDPGLNLKVVQL FRDPRAVHNSRLKSRQGLLRESIQVLRTRQRGDRFHRVLLAHGVGARPGGQSRALPAA PRADFFLTGALEVICEAWLRDLLFARGAPAWLRRRYLRLRYEDLVRQPRAQLRRLLRF SGLRALAALDAFALNMTRGAAYGADRPFHLSARDAREAVHAWRERLSREQVRQVEAAC APAMRLLAYPRSGEEGDAEQPREGETPLEMDADGAT" BASE COUNT 369 a 665 c 699 g 364 t ORIGIN 1 aagtttcccc ttgtggatgc gcggccccgc ggctctgctc ctcccggcgc agaggggccg 61 ggagaggcca caggagcgga cctggcacgg gatttctgag gaacgggaga agactggcgc 121 ccgacccgct ctggagggtc ggtgaacgat gaagggccgg cggcggcgac gccgagagta 181 ctgcaagttc gcgctgctgt tggtgctgta cacgctggtg ctgttgctcg tcccctccgt 241 attggacggc ggccgcgacg gggacaaggg cgccgagcac tgccccggcc tgcagcgcag 301 cctgggagtg tggagcctgg aggcggcggc ggccggcgaa cgcgagcagg gagcggaggc 361 gcgggccgcc gaggaagggg gcgcgaacca gtctcctcgg ttcccaagca acctcagcgg 421 cgctgtcggg gaggcagtgt ctcgcgagaa gcagcacatc tacgtgcatg ccacctggcg 481 caccggctcg tccttcctgg gcgaactctt taaccagcac ccggacgttt tctacttgta 541 tgagcccatg tggcatctat ggcaggcgct gtatccgggc gacgccgaga gcttgcaggg 601 cgcgctgcgc gacatgctgc gttcgctctt ccgctgcgac ttctccgtgc tgcggctgta 661 cgcgccgccg ggggaccccg ctgcgcgcgc cccggacacg gccaatctta ccacggccgc 721 cctcttccgc tggcggacta acaaggtcat ctgctcgccg ccactgtgtc ctggcgcacc 781 ccgtgcccgg gccgaggtgg gcctcgtcga ggacaccgcc tgcgagcgca gctgcccacc 841 cgtggcgata cgcgccctgg aggccgagtg ccgaaagtac ccggtggtgg tcatcaagga 901 cgtgcgcctg ctcgatctgg gcgtgctggt gcccctgttg cgtgatccag gcctcaacct 961 gaaggtggtg cagcttttcc gcgacccgag ggcggtgcac aactcgcgcc tcaagtctag 1021 gcagggactg ctgcgcgaga gcatccaggt gctgcgcacc cgccagaggg gcgaccgctt 1081 ccaccgtgtg ctgctggcgc acggcgtggg tgctcgcccc gggggccagt ctcgcgcgct 1141 gcccgccgcg ccgcgcgccg atttcttcct gaccggtgcg ctcgaggtga tctgcgaagc 1201 ctggctgcgc gatctgcttt tcgcgcgcgg cgcgcccgcc tggctgcggc gccgctacct 1261 gaggctgcgc tatgaggacc tggtgcggca gccacgcgcc cagctgcgcc gcctgctgcg 1321 cttctccggg ctacgcgcgc tcgcagcgct cgatgccttc gcgctcaaca tgactcgcgg 1381 cgcggcctac ggcgccgacc ggcccttcca cctgtcagcg cgcgacgccc gggaggcggt 1441 gcacgcctgg cgcgagcgcc tgagccgaga gcaggtgcgc caggtggagg ccgcctgcgc 1501 tccagccatg cgtctgctcg cctaccctcg cagcggagag gagggcgacg cggagcagcc 1561 cagggaaggg gagacgccgc tggagatgga tgccgacggc gccacgtagc ctcccatccc 1621 tgtccccggc acggatccgg gtcagtcacc acgaacaggg gcactcggca tgctgcccca 1681 gcactggaga agcagcgctg tgggggcaat ctgtcacact ctcagagtct gggacttgac 1741 ttgctaccaa caactgctgt gcaattctgc tgagcaggaa tatcatgagc tgttcaataa 1801 tgacggacgc attggttgag atgaagtttc cagtaaggaa gtgacagtgc aatgtggata 1861 tttatggctg taaaatagga agagctttag ttcccaggct gaacctgcca ctgctggagc 1921 catttcaaca aggcatcctc acaacaaaga agagatgtga tttggtacca tttcacacca 1981 gcaggtgtct ggacgaaaac atcaatgtga ataagggcca agtgcagtcc tgtcttgatt 2041 aaattactta ataatattat taaataataa taggtctggg caaaaaaaaa aaaaaaa //