LOCUS BC014210 2060 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens arylsulfatase A, mRNA (cDNA clone MGC:20637 IMAGE:4763974), complete cds. ACCESSION BC014210 VERSION BC014210.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2060) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2060) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (10-SEP-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC014210.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Louis Staudt cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 30 Row: p Column: 15 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 7262293. FEATURES Location/Qualifiers source 1..2060 /db_xref="H-InvDB:HIT000036578" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:20637 IMAGE:4763974" /tissue_type="Primary B-Cells from Tonsils" /clone_lib="NIH_MGC_48" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2060 /gene="ARSA" /gene_synonym="MLD" /db_xref="GeneID:410" /db_xref="HGNC:HGNC:713" /db_xref="MIM:607574" CDS 371..1894 /gene="ARSA" /gene_synonym="MLD" /codon_start=1 /product="arylsulfatase A" /protein_id="AAH14210.2" /db_xref="GeneID:410" /db_xref="HGNC:HGNC:713" /db_xref="MIM:607574" /translation="MGAPRSLLLALAAGLAVARPPNIVLIFADDLGYGDLGCYGHPSS TTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLVPSSRGGLPL EEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLT CFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLEARYMAFAHDLMADAQRQDRP FFLYYASHHTHYPQFSGQSFAERSGRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLV IFTADNGPETMRMSRGGCSGLLRCGKGTTYEGGVREPALAFWPGHIAPGVTHELASSL DLLPTLAALAGAPLPNVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRSG KYKAHFFTQGSAHSDTTADPACHASSSLTAHEPPLLYDLSKDPGENYNLLGGVAGATP EVLQALKQLQLLKAQLDAAVTFGPSQVARGEDPALQICCHPGCTPRPACCHCPDPHA" BASE COUNT 345 a 708 c 619 g 388 t ORIGIN 1 gcgcccgcag cccggtaccg gctcctcctg ggctccctct agcgccttcc ccccggcccg 61 actccgctgg tcagcgccaa gtgacttacg cccccgaccc tgagcccgga ccgctaggcg 121 aggaggatca gatctccgct cgagaatctg aaggtgccct ggtcctggag gagttccgtc 181 ccagcccgcg gtctcccggt actgtcgggc cccggccctc tggagcttca ggaggcggcc 241 gtcagggtcg gggagtattt gggtccgggg tctcagggaa gggcggcgcc tgggtctgcg 301 gtatcggaaa gagcctgctg gagccaagta gccctccctc tcttgggaca gacccctcgg 361 tcccatgtcc atgggggcac cgcggtccct cctcctggcc ctggctgctg gcctggccgt 421 tgcccgtccg cccaacatcg tgctgatctt tgccgacgac ctcggctatg gggacctggg 481 ctgctatggg caccccagct ctaccactcc caacctggac cagctggcgg cgggagggct 541 gcggttcaca gacttctacg tgcctgtgtc tctgtgcaca ccctctaggg ccgccctcct 601 gaccggccgg ctcccggttc ggatgggcat gtaccctggc gtcctggtgc ccagctcccg 661 ggggggcctg cccctggagg aggtgaccgt ggccgaagtc ctggctgccc gaggctacct 721 cacaggaatg gccggcaagt ggcaccttgg ggtggggcct gagggggcct tcctgccccc 781 ccatcagggc ttccatcgat ttctaggcat cccgtactcc cacgaccagg gcccctgcca 841 gaacctgacc tgcttcccgc cggccactcc ttgcgacggt ggctgtgacc agggcctggt 901 ccccatccca ctgttggcca acctgtccgt ggaggcgcag cccccctggc tgcccggact 961 agaggcccgc tacatggctt tcgcccatga cctcatggcc gacgcccagc gccaggatcg 1021 ccccttcttc ctgtactatg cctctcacca cacccactac cctcagttca gtgggcagag 1081 ctttgcagag cgttcaggcc gcgggccatt tggggactcc ctgatggagc tggatgcagc 1141 tgtggggacc ctgatgacag ccatagggga cctggggctg cttgaagaga cgctggtcat 1201 cttcactgca gacaatggac ctgagaccat gcgtatgtcc cgaggcggct gctccggtct 1261 cttgcggtgt ggaaagggaa cgacctacga gggcggtgtc cgagagcctg ccttggcctt 1321 ctggccaggt catatcgctc ccggcgtgac ccacgagctg gccagctccc tggacctgct 1381 gcctaccctg gcagccctgg ctggggcccc actgcccaat gtcaccttgg atggctttga 1441 cctcagcccc ctgctgctgg gcacaggcaa gagccctcgg cagtctctct tcttctaccc 1501 gtcctaccca gacgaggtcc gtggggtttt tgctgtgcgg agtggaaagt acaaggctca 1561 cttcttcacc cagggctctg cccacagtga taccactgca gaccctgcct gccacgcctc 1621 cagctctctg actgctcatg agcccccgct gctctatgac ctgtccaagg accctggtga 1681 gaactacaac ctgctggggg gtgtggccgg ggccacccca gaggtgctgc aagccctgaa 1741 acagcttcag ctgctcaagg cccagttaga cgcagctgtg accttcggcc ccagccaggt 1801 ggcccggggc gaggaccccg ccctgcagat ctgctgtcat cctggctgca ccccccgccc 1861 agcttgctgc cattgcccag atccccatgc ctgagggccc ctcggctggc ctgggcatgt 1921 gatggctcct cactgggagc ctgtggggga ggctcaggtg tctggagggg gtttgtgcct 1981 gataacgtaa taacaccagt ggagacttgc aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2041 aaaaaaaaaa aaaaaaaaaa //