LOCUS       X97868                  1996 bp    mRNA    linear   HUM 07-OCT-2008
DEFINITION  H.sapiens mRNA for arylsulphatase.
ACCESSION   X97868
VERSION     X97868.1
KEYWORDS    arsf gene; arylsulphatase.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Puca A.A., Zollo M., Repetto M., Guffanti A., Simon G.,
            Ballabio A., Franco B.
  TITLE     Identification by shotgun sequencing, genomic organization, and
            functional analysis of a fourth arylsulfatase gene (ARSF) from the
            Xp22.3 region
  JOURNAL   Genomics 42(2), 192-199(1997).
   PUBMED   9192838
REFERENCE   2  (bases 1 to 1996)
  AUTHORS   Franco B.
  JOURNAL   Submitted (15-MAY-1996) to the INSDC. B. Franco, T.I.G.E.M., via
            Olgettina 58, 20132 Milano, ITALY
FEATURES             Location/Qualifiers
     source          1..1996
                     /db_xref="H-InvDB:HIT000325021"
                     /organism="Homo sapiens"
                     /chromosome="X"
                     /map="p22.3"
                     /mol_type="mRNA"
                     /dev_stage="foetus"
                     /tissue_type="brain"
                     /db_xref="taxon:9606"
     CDS             71..1846
                     /gene="arsf"
                     /product="arylsulphatase"
                     /db_xref="GOA:P54793"
                     /db_xref="H-InvDB:HIT000325021.11"
                     /db_xref="HGNC:HGNC:721"
                     /db_xref="InterPro:IPR000917"
                     /db_xref="InterPro:IPR017850"
                     /db_xref="InterPro:IPR024607"
                     /db_xref="UniProtKB/Swiss-Prot:P54793"
                     /protein_id="CAA66462.1"
                     /translation="MRPRRPLVFMSLVCALLNTWPGHTGCMTTRPNIVLIMVDDLGIG
                     DLGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGRYPIRSGMVSSGN
                     RRVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGLNCDSRSDQCHHPYNYGF
                     DYYYGMPFTLVDSCWPDPSRNTELAFESQLWLCVQLVAIAILTLTFGKLSGWVSVPWL
                     LIFSMILFIFLLGYAWFSSHTSPLYWDCLLMRGHEITEQPMKAERAGSIMVKEAISFL
                     ERHSKETFLLFFSFLHVHTPLPTTDDFTGTSKHGLYGDNVEEMDSMVGKILDAIDDFG
                     LRNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMGGWEGGIRVPGIVRWPGKV
                     PAGRLIKEPTSLMDILPTVASVSGGSLPQDRVIDGRDLMPLLQGNVRHSEHEFLFHYC
                     GSYLHAVRWIPKDDSGSVWKAHYVTPVFQPPASGGCYVTSLCRCFGEQVTYHNPPLLF
                     DLSRDPSESTPLTPATEPLYDFVIKKVANALKEHQETIVPVTYQLSELNQGRTWLKPC
                     CGVFPFCLCDKEEEVSQPRGPNEKR"
BASE COUNT      481 a          521 c          508 g          486 t
ORIGIN
        1 gggttctgct cctagacatt agagagataa tacggctgat agacaacaag aaggtattcc
       61 aagctgcaca atgaggccca ggagaccgtt ggtcttcatg tctttggtgt gtgcactctt
      121 gaacacatgg ccagggcaca cagggtgcat gacgacaagg cctaatattg tcctaatcat
      181 ggttgatgac ctgggtattg gagatctggg ctgctacggc aatgacacca tgaggacgcc
      241 tcacatcgac cgccttgcca gggaaggcgt gcgactgact cagcacatct ctgccgcctc
      301 cctctgcagc ccaagccggt ccgcgttctt gacgggaaga taccccatcc gatcaggtat
      361 ggtttctagt ggtaatagac gtgtcatcca aaatcttgca gtccccgcag gcctccctct
      421 taatgagaca acacttgcag ccttgctaaa gaagcaagga tacagcacgg ggcttatagg
      481 caaatggcac caaggcttga actgcgactc ccgaagtgac cagtgccacc atccatataa
      541 ttatgggttt gactactact atggcatgcc gttcactctc gttgacagct gctggccgga
      601 cccctctcgt aacacggaat tagcctttga gagtcagctc tggctctgtg tgcagctagt
      661 tgccattgcc atcctcaccc taacctttgg gaagctgagc ggctgggtct ctgttccctg
      721 gctcctgatc ttctccatga ttctgtttat tttcctcttg ggctatgctt ggttctccag
      781 ccacacgtcc cctttatact gggactgcct cctcatgcgg gggcacgaga tcacggagca
      841 gcccatgaag gctgaacgag ctggatccat tatggtgaag gaagcgattt cctttttaga
      901 aaggcacagt aaggaaactt tccttctctt tttctccttt cttcacgtgc acacacctct
      961 ccccaccacg gacgatttca ctggcaccag caagcatggc ttgtatgggg ataatgtgga
     1021 agagatggac tccatggtgg gcaagattct tgatgctatc gatgattttg gcctaaggaa
     1081 caacaccctt gtctacttta catcagatca cggagggcat ttggaagcta ggcgagggca
     1141 tgcccaactt ggtggatgga atggaatata caaaggtgga aaaggcatgg ggggctggga
     1201 aggtggaatc cgcgtcccag gaattgtccg atggcctgga aaggtaccag ctggacggtt
     1261 gattaaggaa cctacaagtt taatggatat tttaccaact gtcgcatcag tgtcaggagg
     1321 aagtctccct caggacaggg tcattgacgg ccgagacctc atgcccttgc tgcagggcaa
     1381 cgtcaggcac tcggagcatg aatttctttt ccactactgt ggctcctacc tgcacgccgt
     1441 gcggtggatc cccaaggacg acagtgggtc agtttggaag gctcactatg tgaccccggt
     1501 attccagcca ccagcttctg gtggctgcta tgtcacctca ttatgcagat gtttcggaga
     1561 acaggttacc taccacaacc cccctctgct cttcgatctc tccagggacc cctcagagtc
     1621 cacacccctg acacctgcca cagagcccct ctatgatttt gtgattaaaa aggtggccaa
     1681 cgccctgaag gaacaccagg aaaccatcgt gcctgtgacc taccaactct cagaactgaa
     1741 tcagggcagg acgtggctga agccttgctg tggggtgttc ccattttgtc tgtgtgacaa
     1801 ggaagaggaa gtctctcagc ctcggggtcc taacgagaag agataattac aatcaggcta
     1861 ccagaggaag cctttggtcc taacgagaag agataattac aatcaggcta ccaaaggaag
     1921 cactaacttt ggtgctttca agttggcaag gagtgcattt aatagtcaat aaattcatct
     1981 accattccag attatt
//