LOCUS BC014210 2060 bp mRNA linear HUM 15-JUL-2006
DEFINITION Homo sapiens arylsulfatase A, mRNA (cDNA clone MGC:20637
IMAGE:4763974), complete cds.
ACCESSION BC014210
VERSION BC014210.2
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2060)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
CONSRTM Mammalian Gene Collection Program Team
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2060)
CONSRTM NIH MGC Project
TITLE Direct Submission
JOURNAL Submitted (10-SEP-2001) National Institutes of Health, Mammalian
Gene Collection (MGC), Bethesda, MD 20892-2590, USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT On Aug 19, 2003 this sequence version replaced BC014210.1.
Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Louis Staudt
cDNA Library Preparation: Rubin Laboratory
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: National Institutes of Health Intramural
Sequencing Center (NISC),
Gaithersburg, Maryland;
Web site: http://www.nisc.nih.gov/
Contact: nisc_mgc@nhgri.nih.gov
Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
Young,A., Zhang,L.-H. and Green,E.D.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 30 Row: p Column: 15
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 7262293.
FEATURES Location/Qualifiers
source 1..2060
/db_xref="H-InvDB:HIT000036578"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:20637 IMAGE:4763974"
/tissue_type="Primary B-Cells from Tonsils"
/clone_lib="NIH_MGC_48"
/lab_host="DH10B-R"
/note="Vector: pOTB7"
gene 1..2060
/gene="ARSA"
/gene_synonym="MLD"
/db_xref="GeneID:410"
/db_xref="HGNC:HGNC:713"
/db_xref="MIM:607574"
CDS 371..1894
/gene="ARSA"
/gene_synonym="MLD"
/codon_start=1
/product="arylsulfatase A"
/protein_id="AAH14210.2"
/db_xref="GeneID:410"
/db_xref="HGNC:HGNC:713"
/db_xref="MIM:607574"
/translation="MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSS
TTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPL
EEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLT
CFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRP
FFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLV
IFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSL
DLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRSG
KYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATP
EVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA"
BASE COUNT 345 a 708 c 619 g 388 t
ORIGIN
1 gcgcccgcag cccggtaccg gctcctcctg ggctccctct agcgccttcc ccccggcccg
61 actccgctgg tcagcgccaa gtgacttacg cccccgaccc tgagcccgga ccgctaggcg
121 aggaggatca gatctccgct cgagaatctg aaggtgccct ggtcctggag gagttccgtc
181 ccagcccgcg gtctcccggt actgtcgggc cccggccctc tggagcttca ggaggcggcc
241 gtcagggtcg gggagtattt gggtccgggg tctcagggaa gggcggcgcc tgggtctgcg
301 gtatcggaaa gagcctgctg gagccaagta gccctccctc tcttgggaca gacccctcgg
361 tcccatgtcc atgggggcac cgcggtccct cctcctggcc ctggctgctg gcctggccgt
421 tgcccgtccg cccaacatcg tgctgatctt tgccgacgac ctcggctatg gggacctggg
481 ctgctatggg caccccagct ctaccactcc caacctggac cagctggcgg cgggagggct
541 gcggttcaca gacttctacg tgcctgtgtc tctgtgcaca ccctctaggg ccgccctcct
601 gaccggccgg ctcccggttc ggatgggcat gtaccctggc gtcctggtgc ccagctcccg
661 ggggggcctg cccctggagg aggtgaccgt ggccgaagtc ctggctgccc gaggctacct
721 cacaggaatg gccggcaagt ggcaccttgg ggtggggcct gagggggcct tcctgccccc
781 ccatcagggc ttccatcgat ttctaggcat cccgtactcc cacgaccagg gcccctgcca
841 gaacctgacc tgcttcccgc cggccactcc ttgcgacggt ggctgtgacc agggcctggt
901 ccccatccca ctgttggcca acctgtccgt ggaggcgcag cccccctggc tgcccggact
961 agaggcccgc tacatggctt tcgcccatga cctcatggcc gacgcccagc gccaggatcg
1021 ccccttcttc ctgtactatg cctctcacca cacccactac cctcagttca gtgggcagag
1081 ctttgcagag cgttcaggcc gcgggccatt tggggactcc ctgatggagc tggatgcagc
1141 tgtggggacc ctgatgacag ccatagggga cctggggctg cttgaagaga cgctggtcat
1201 cttcactgca gacaatggac ctgagaccat gcgtatgtcc cgaggcggct gctccggtct
1261 cttgcggtgt ggaaagggaa cgacctacga gggcggtgtc cgagagcctg ccttggcctt
1321 ctggccaggt catatcgctc ccggcgtgac ccacgagctg gccagctccc tggacctgct
1381 gcctaccctg gcagccctgg ctggggcccc actgcccaat gtcaccttgg atggctttga
1441 cctcagcccc ctgctgctgg gcacaggcaa gagccctcgg cagtctctct tcttctaccc
1501 gtcctaccca gacgaggtcc gtggggtttt tgctgtgcgg agtggaaagt acaaggctca
1561 cttcttcacc cagggctctg cccacagtga taccactgca gaccctgcct gccacgcctc
1621 cagctctctg actgctcatg agcccccgct gctctatgac ctgtccaagg accctggtga
1681 gaactacaac ctgctggggg gtgtggccgg ggccacccca gaggtgctgc aagccctgaa
1741 acagcttcag ctgctcaagg cccagttaga cgcagctgtg accttcggcc ccagccaggt
1801 ggcccggggc gaggaccccg ccctgcagat ctgctgtcat cctggctgca ccccccgccc
1861 agcttgctgc cattgcccag atccccatgc ctgagggccc ctcggctggc ctgggcatgt
1921 gatggctcct cactgggagc ctgtggggga ggctcaggtg tctggagggg gtttgtgcct
1981 gataacgtaa taacaccagt ggagacttgc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
2041 aaaaaaaaaa aaaaaaaaaa
//