LOCUS BC029051 2013 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens arylsulfatase B, mRNA (cDNA clone MGC:34518 IMAGE:5186657), complete cds. ACCESSION BC029051 VERSION BC029051.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2013) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2013) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (01-MAY-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Baylor College of Medicine Human Genome Sequencing Center Center code: BCM-HGSC Web site: http://www.hgsc.bcm.tmc.edu/cdna/ Contact: amg@bcm.tmc.edu Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H., Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati, A.N., Gibbs, R.A. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 50 Row: g Column: 11 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 38569406. FEATURES Location/Qualifiers source 1..2013 /db_xref="H-InvDB:HIT000040669" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:34518 IMAGE:5186657" /tissue_type="Colon, Kidney, Stomach, adult, whole pooled" /clone_lib="NIH_MGC_116" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2013 /gene="ARSB" /gene_synonym="ASB" /gene_synonym="G4S" /gene_synonym="MPS6" /db_xref="GeneID:411" /db_xref="HGNC:HGNC:714" /db_xref="MIM:253200" CDS 356..1597 /gene="ARSB" /gene_synonym="ASB" /gene_synonym="G4S" /gene_synonym="MPS6" /codon_start=1 /product="arylsulfatase B" /protein_id="AAH29051.1" /db_xref="GeneID:411" /db_xref="HGNC:HGNC:714" /db_xref="MIM:253200" /translation="MGPRGAASLPRGPGPRRLLLPVVLPLLLLLLLAPPGSGAGASRP PHLVFLLADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGR YQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRR GFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKR AIALITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEA VGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGNNWPLRGRKWSLWEGGVRGVGFVAS PLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELL HNIDPNFVDSSPYWPECSLLL" BASE COUNT 505 a 558 c 536 g 414 t ORIGIN 1 cttgaaagta accgcacctt ccaaagggca ccgtgcaatc agactgaaac cacggtgcaa 61 atttaattgc cggggaagat aacgggcctt ggtgccctcc aagcgtcagc tgagtttcca 121 agaagccggg cagcgggcgc ccgcgggttc gtctctggct cctcctccgc cacagcagcc 181 gggggcccgg gtcggaggcg gcgggggccg agcgcccggc ctcgcaagcc cacggcccgc 241 tgggggtgcc gtcccgcgcc ggggcggagc aggccccggc agcccagttc ctcattctat 301 cagcggtaca aggggctggt ggcgccacag gcgctgggac cgcgggcgga caaggatggg 361 tccgcgcggc gcggcgagct tgccccgagg ccccggacct cggcggctgc tcctccccgt 421 cgtcctcccg ctgctgctgc tgctgttgtt ggcgccgccg ggctcgggcg ccggggccag 481 ccggccgccc cacctggtct tcttgctggc agacgaccta ggctggaacg acgtcggctt 541 ccacggctcc cgcatccgca cgccgcacct ggacgcgctg gcggccggcg gggtgctcct 601 ggacaactac tacacgcagc cgctgtgcac gccgtcgcgg agccagctgc tcactggccg 661 ctaccagatc cgtacaggtt tacagcacca aataatctgg ccctgtcagc ccagctgtgt 721 tcctctggat gaaaaactcc tgccccagct cctaaaagaa gcaggttata ctacccatat 781 ggtcggaaaa tggcacctgg gaatgtaccg gaaagaatgc cttccaaccc gccgaggatt 841 tgatacctac tttggatatc tcctgggtag tgaagattat tattcccatg aacgctgtac 901 attaattgac gctctgaatg tcacacgatg tgctcttgat tttcgagatg gcgaagaagt 961 tgcaacagga tataaaaata tgtattcaac aaacatattc accaaaaggg ctatagccct 1021 cataactaac catccaccag agaagcctct gtttctctac cttgctctcc agtctgtgca 1081 tgagcccctt caggtccctg aggaatactt gaagccatat gactttatcc aagacaagaa 1141 caggcatcac tatgcaggaa tggtgtccct tatggatgaa gcagtaggaa atgtcactgc 1201 agctttaaaa agcagtgggc tctggaacaa cacggtgttc atcttttcta cagataacgg 1261 agggcagact ttggcagggg gtaataactg gccccttcga ggaagaaaat ggagcctgtg 1321 ggaaggaggc gtccgagggg tgggctttgt ggcaagcccc ttgctgaagc agaagggcgt 1381 gaagaaccgg gagctcatcc acatctctga ctggctgcca acactcgtga agctggccag 1441 gggacacacc aatggcacaa agcctctgga tggcttcgac gtgtggaaaa ccatcagtga 1501 aggaagccca tcccccagaa ttgagctgct gcataatatt gacccgaact tcgtggactc 1561 ttcaccgtac tggcctgagt gctcgctgct gttgtagcta ccaccaactt tctactgaag 1621 atgataaccc agggcataag aaatgacttc agacccaagg ttctgaaagg gcccctcaag 1681 gcctcgggtg gctcctgcag aagtggcaga agaggcggga actaggaacc tggcatcata 1741 ggaaaagtgc cttctccaag aaagaagggg ccccaagagg ctgtcttact tagatcaacc 1801 ataaactacc acagatgggt catttcttat actatttcaa aatatctttg aagatgagaa 1861 ttcatttgtg tccttcatag accaaagttc tttgtgttac cttttcccaa aagtaaattc 1921 ctttcccttt attcattcct tgtggaaata aaatgcaagc cctttaaaaa aaaaaaaaaa 1981 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa //