LOCUS BC029051 2013 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens arylsulfatase B, mRNA (cDNA clone MGC:34518
IMAGE:5186657), complete cds.
ACCESSION BC029051
VERSION BC029051.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2013)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2013)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (01-MAY-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Life Technologies, Inc.
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Baylor College of Medicine Human Genome
Sequencing Center
Center code: BCM-HGSC
Web site: http://www.hgsc.bcm.tmc.edu/cdna/
Contact: amg@bcm.tmc.edu
Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
A.N., Gibbs, R.A.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 50 Row: g Column: 11
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 38569406.
FEATURES Location/Qualifiers
source 1..2013
/db_xref="H-InvDB:HIT000040669"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:34518 IMAGE:5186657"
/tissue_type="Colon, Kidney, Stomach, adult, whole pooled"
/clone_lib="NIH_MGC_116"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2013
/gene="ARSB"
/gene_synonym="ASB"
/gene_synonym="G4S"
/gene_synonym="MPS6"
/db_xref="GeneID:411"
/db_xref="HGNC:HGNC:714"
/db_xref="MIM:253200"
CDS 356..1597
/gene="ARSB"
/gene_synonym="ASB"
/gene_synonym="G4S"
/gene_synonym="MPS6"
/codon_start=1
/product="arylsulfatase B"
/protein_id="AAH29051.1"
/db_xref="GeneID:411"
/db_xref="HGNC:HGNC:714"
/db_xref="MIM:253200"
/translation="MGPRGAASLPRGPGPRRLLLPVVLPLLLLLLLAPPGSGAGASRP
PHLVFLLADDLGWNDVGFHGSRIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGR
YQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRR
GFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKR
AIALITNHPPEKPLFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEA
VGNVTAALKSSGLWNNTVFIFSTDNGGQTLAGGNNWPLRGRKWSLWEGGVRGVGFVAS
PLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELL
HNIDPNFVDSSPYWPECSLLL"
BASE COUNT 505 a 558 c 536 g 414 t
ORIGIN
1 cttgaaagta accgcacctt ccaaagggca ccgtgcaatc agactgaaac cacggtgcaa
61 atttaattgc cggggaagat aacgggcctt ggtgccctcc aagcgtcagc tgagtttcca
121 agaagccggg cagcgggcgc ccgcgggttc gtctctggct cctcctccgc cacagcagcc
181 gggggcccgg gtcggaggcg gcgggggccg agcgcccggc ctcgcaagcc cacggcccgc
241 tgggggtgcc gtcccgcgcc ggggcggagc aggccccggc agcccagttc ctcattctat
301 cagcggtaca aggggctggt ggcgccacag gcgctgggac cgcgggcgga caaggatggg
361 tccgcgcggc gcggcgagct tgccccgagg ccccggacct cggcggctgc tcctccccgt
421 cgtcctcccg ctgctgctgc tgctgttgtt ggcgccgccg ggctcgggcg ccggggccag
481 ccggccgccc cacctggtct tcttgctggc agacgaccta ggctggaacg acgtcggctt
541 ccacggctcc cgcatccgca cgccgcacct ggacgcgctg gcggccggcg gggtgctcct
601 ggacaactac tacacgcagc cgctgtgcac gccgtcgcgg agccagctgc tcactggccg
661 ctaccagatc cgtacaggtt tacagcacca aataatctgg ccctgtcagc ccagctgtgt
721 tcctctggat gaaaaactcc tgccccagct cctaaaagaa gcaggttata ctacccatat
781 ggtcggaaaa tggcacctgg gaatgtaccg gaaagaatgc cttccaaccc gccgaggatt
841 tgatacctac tttggatatc tcctgggtag tgaagattat tattcccatg aacgctgtac
901 attaattgac gctctgaatg tcacacgatg tgctcttgat tttcgagatg gcgaagaagt
961 tgcaacagga tataaaaata tgtattcaac aaacatattc accaaaaggg ctatagccct
1021 cataactaac catccaccag agaagcctct gtttctctac cttgctctcc agtctgtgca
1081 tgagcccctt caggtccctg aggaatactt gaagccatat gactttatcc aagacaagaa
1141 caggcatcac tatgcaggaa tggtgtccct tatggatgaa gcagtaggaa atgtcactgc
1201 agctttaaaa agcagtgggc tctggaacaa cacggtgttc atcttttcta cagataacgg
1261 agggcagact ttggcagggg gtaataactg gccccttcga ggaagaaaat ggagcctgtg
1321 ggaaggaggc gtccgagggg tgggctttgt ggcaagcccc ttgctgaagc agaagggcgt
1381 gaagaaccgg gagctcatcc acatctctga ctggctgcca acactcgtga agctggccag
1441 gggacacacc aatggcacaa agcctctgga tggcttcgac gtgtggaaaa ccatcagtga
1501 aggaagccca tcccccagaa ttgagctgct gcataatatt gacccgaact tcgtggactc
1561 ttcaccgtac tggcctgagt gctcgctgct gttgtagcta ccaccaactt tctactgaag
1621 atgataaccc agggcataag aaatgacttc agacccaagg ttctgaaagg gcccctcaag
1681 gcctcgggtg gctcctgcag aagtggcaga agaggcggga actaggaacc tggcatcata
1741 ggaaaagtgc cttctccaag aaagaagggg ccccaagagg ctgtcttact tagatcaacc
1801 ataaactacc acagatgggt catttcttat actatttcaa aatatctttg aagatgagaa
1861 ttcatttgtg tccttcatag accaaagttc tttgtgttac cttttcccaa aagtaaattc
1921 ctttcccttt attcattcct tgtggaaata aaatgcaagc cctttaaaaa aaaaaaaaaa
1981 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa
//