LOCUS BC064509 2533 bp mRNA linear HUM 25-JUN-2004 DEFINITION Homo sapiens chondroitin sulfate glucuronyltransferase, mRNA (cDNA clone MGC:71339 IMAGE:6423613), complete cds. ACCESSION BC064509 VERSION BC064509.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2533) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2533) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (22-DEC-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP/Gazdar cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 50 Row: n Column: 23 This clone was selected for full length sequencing because it passed the following selection criteria: Hexamer frequency ORF analysis, GenomeScan gene prediction. FEATURES Location/Qualifiers source 1..2533 /db_xref="H-InvDB:HIT000261428" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:71339 IMAGE:6423613" /tissue_type="Lung, large cell carcinoma" /clone_lib="NIH_MGC_18" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2533 /gene="CSGlcA-T" /gene_synonym="KIAA1402" /db_xref="GeneID:54480" /db_xref="MIM:608037" CDS 483..2483 /gene="CSGlcA-T" /gene_synonym="KIAA1402" /codon_start=1 /product="CSGlcA-T protein" /protein_id="AAH64509.1" /db_xref="GeneID:54480" /db_xref="MIM:608037" /translation="MRLSSLLALLRPALPLILGLSLGCSLSLLRVSWIQGEGEDPCVE AVGERGGPQNPDSRARLDQSDEDFKPRIVPYYRDPNKPYKKVLRTRYIQTELGSRERL LVAVLTSRATLSTLAVAVNRTVAHHFPRLLYFTGQRGARAPAGMQVVSHGDERPAWLM SETLRHLHTHFGADYDWFFIMQDDTYVQAPRLAALAGHLSINQDLYLGRAEEFIGAGE QARYCHGGFGYLLSRSLLLRLRPHLDGCRGDILSARPDEWLGRCLIDSLGVGCVSQHQ GQQYRSFELAKNRDPEKEGSSAFLSAFAVHPVSEGTLMYRLHKRFSALELERAYSEIE QLQAQIRNLTVLTPEGEAGLSWPVGLPAPFTPHSRFEVLGWDYFTEQHTFSCADGAPK CPLQGASRADVGDALETALEQLNRRYQPRLRFQKQRLLNGYRRFDPARGMEYTLDLLL ECVTQRGHRRALARRVSLLRPLSRVEILPMPYVTEATRVQLVLPLLVAEAAAAPAFLE AFAANVLEPREHALLTLLLVYGPREGGRGAPDPFLGVKAAAAELERRYPGTRLAWLAV RAEAPSQVRLMDVVSKKHPVDTLFFLTTVWTRPGPEVLNRCRMNAISGWQAFFPVHFQ EFNPALSPQRSPPNLITFPLSASAPGRARQDGGQIENCCCIF" BASE COUNT 482 a 734 c 736 g 581 t ORIGIN 1 ggcccactga ggcaggctcc ggctcctctg gttggggctg ttgttttgat ggatcgtgtg 61 cttttccctt acctcttatc acttgctgtc atctgttgac ttaggcccag tctgcagatg 121 tgtgtagtgt tcctttttgg gttagctttg gcagtattga gttttacttc ctcctctttt 181 tagtggaaga cagaccataa tcccagtgtg agtgaaattg attgtttcat ttattaccgt 241 tttggctggg ggttagttcc gacaccttca cagttgaaga gcaggcagaa ggagttgtga 301 agacaggaca atcttcttgg ggatgctggt cctggaagcc agcgggcctt gctctgtctt 361 tggcctcatt gaccccaggt tctctggtta aaactgaaag cctactactg gcctggtgcc 421 catcaatcca ttgatccttg aggctgtgcc cctggggcac ccacctggca gggcctacca 481 ccatgcgact gagctccctg ttggctctgc tgcggccagc gcttcccctc atcttagggc 541 tgtctctggg gtgcagcctg agcctcctgc gggtttcctg gatccagggg gagggagaag 601 atccctgtgt cgaggctgta ggggagcgag gagggccaca gaatccagat tcgagagctc 661 ggctagacca aagtgatgaa gacttcaaac cccggattgt cccctactac agggacccca 721 acaagcccta caagaaggtg ctcaggactc ggtacatcca gacagagctg ggctcccgtg 781 agcggttgct ggtggctgtc ctgacctccc gagctacact gtccactttg gccgtggctg 841 tgaaccgtac ggtggcccat cacttccctc ggttactcta cttcactggg cagcgggggg 901 cccgggctcc agcagggatg caggtggtgt ctcatgggga tgagcggccc gcctggctca 961 tgtcagagac cctgcgccac cttcacacac actttggggc cgactacgac tggttcttca 1021 tcatgcagga tgacacatat gtgcaggccc cccgcctggc agcccttgct ggccacctca 1081 gcatcaacca agacctgtac ttaggccggg cagaggagtt cattggcgca ggcgagcagg 1141 cccggtactg tcatgggggc tttggctacc tgttgtcacg gagtctcctg cttcgtctgc 1201 ggccacatct ggatggctgc cgaggagaca ttctcagtgc ccgtcctgac gagtggcttg 1261 gacgctgcct cattgactct ctgggcgtcg gctgtgtctc acagcaccag gggcagcagt 1321 atcgctcatt tgaactggcc aaaaataggg accctgagaa ggaagggagc tcggctttcc 1381 tgagtgcctt cgccgtgcac cctgtctccg aaggtaccct catgtaccgg ctccacaaac 1441 gcttcagcgc tctggagttg gagcgggctt acagtgaaat agaacaactg caggctcaga 1501 tccggaacct gaccgtgctg acccccgaag gggaggcagg gctgagctgg cccgttgggc 1561 tccctgctcc tttcacacca cactctcgct ttgaggtgct gggctgggac tacttcacag 1621 agcagcacac cttctcctgt gcagatgggg ctcccaagtg cccactacag ggggctagca 1681 gggcggacgt gggtgatgcg ttggagactg ccctggagca gctcaatcgg cgctatcagc 1741 cccgcctgcg cttccagaag cagcgactgc tcaacggcta tcggcgcttc gacccagcac 1801 ggggcatgga gtacaccctg gacctgctgt tggaatgtgt gacacagcgt gggcaccggc 1861 gggccctggc tcgcagggtc agcctgctgc ggccactgag ccgggtggaa atcctaccta 1921 tgccctatgt cactgaggcc acccgagtgc agctggtgct gccactcctg gtggctgaag 1981 ctgctgcagc cccggctttc ctcgaggcgt ttgcagccaa tgtcctggag ccacgagaac 2041 atgcattgct caccctgttg ctggtctacg ggccacgaga aggtggccgt ggagctccag 2101 acccatttct tggggtgaag gctgcagcag cggagttaga gcgacggtac cctgggacga 2161 ggctggcctg gctcgctgtg cgagcagagg ccccttccca ggtgcgactc atggacgtgg 2221 tctcgaagaa gcaccctgtg gacactctct tcttccttac caccgtgtgg acaaggcctg 2281 ggcccgaagt cctcaaccgc tgtcgcatga atgccatctc tggctggcag gccttctttc 2341 cagtccattt ccaggagttc aatcctgccc tgtcaccaca gagatcaccc cctaacctca 2401 ttacctttcc tttgtctgcc tcagccccag gaagggcaag gcaagatggt ggacagatag 2461 agaattgttg ctgtattttt taaatatgaa aatgttatta aacatgtctt ctgccaaaaa 2521 aaaaaaaaaa aaa //