LOCUS       BC022389                2195 bp    mRNA    linear   HUM 06-OCT-2003
DEFINITION  Homo sapiens arylsulfatase F, mRNA (cDNA clone MGC:24090
            IMAGE:4609825), complete cds.
ACCESSION   BC022389
VERSION     BC022389.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2195)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2195)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (01-FEB-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: CLONTECH
            cDNA Library Preparation: CLONTECH Laboratories, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA 94305
            Web site: http://www-shgc.stanford.edu
            Contact: (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 36 Row: f Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 6552297.
FEATURES             Location/Qualifiers
     source          1..2195
                     /db_xref="H-InvDB:HIT000039515"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:24090 IMAGE:4609825"
                     /tissue_type="Kidney"
                     /clone_lib="NIH_MGC_75"
                     /lab_host="DH10B"
                     /note="Vector: pDNR-LIB"
     gene            1..2195
                     /gene="ARSF"
                     /gene_synonym="ASF"
                     /db_xref="GeneID:416"
                     /db_xref="MIM:300003"
     CDS             222..1994
                     /gene="ARSF"
                     /gene_synonym="ASF"
                     /codon_start=1
                     /product="ARSF protein"
                     /protein_id="AAH22389.1"
                     /db_xref="GeneID:416"
                     /db_xref="MIM:300003"
                     /translation="MRPRRPLVFMSLVCALLNTCQAHRVHDDKPNIVLIMVDDLGIGD
                     LGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGRYPIRSGMVSSGNR
                     RVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGLNCDSRSDQCHHPYNYGFD
                     YYYGMPFTLVDSCWPDPSRNTELAFESQLWLCVQLVAIAILTLTFGKLSGWVSVPWLL
                     IFSMILFIFLLGYAWFSSHTSPLYWDCLLMRGHEITEQPMKAERAGSIMVKEAISFLE
                     RHSKETFLLFFSFLHVHTPLPTTDDFTGTSKHGLYGDNVEEMDSMVGKILDAIDDFGL
                     RNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMGGWEGGIRVPGIVRWPGKVP
                     AGRLIKEPTSLMDILPTVASVSGGSLPQDRVIDGRDLMPLLQGNVRHSEHEFLFHYCG
                     SYLHAVRWIPKDDSGSVWKAHYVTPVFQPPASGGCYVTSLCRCFGEQVTYHNPPLLFD
                     LSRDPSESTPLTPATEPLYDFVIKKVANALKEHQETIVPVTYQLSELNQGRTWLKPCC
                     GVFPFCLCDKEEEVSQPRGPNEKR"
     misc_feature    309..1760
                     /gene="ARSF"
                     /gene_synonym="ASF"
                     /note="Sulfatase; Region: Sulfatase"
                     /db_xref="CDD:pfam00884"
BASE COUNT      542 a    577 c    543 g    533 t
ORIGIN
        1 agtggagaca ggctctcacc atgttgacca ggctggtctc gaactcccga cctcaagtga
       61 tccgccggcc tcggcctccc aaagtgctgg gattacaggc gtgaaccacc actcctggcc
      121 cttctgttac ttttccactg ctgttgagat agaagtcact gttgggatcc aacctttttg
      181 tttgtataaa ctgacaacaa gaaggtattc caagctgcac aatgaggccc aggagaccct
      241 tggtcttcat gtctttggtg tgtgcactct tgaacacatg ccaggcacac agggtgcatg
      301 acgacaagcc taatattgtc ctaatcatgg ttgatgacct gggtattgga gatctgggct
      361 gctacggcaa tgacaccatg aggacgcctc acatcgaccg ccttgccagg gaaggcgtgc
      421 gactgactca gcacatctct gccgcctccc tctgcagccc aagccggtcc gcgttcttga
      481 cgggaagata ccccatccga tcaggtatgg tttctagtgg taatagacgt gtcatccaaa
      541 atcttgcagt ccccgcaggc ctccctctta atgagacaac acttgcagcc ttgctaaaga
      601 agcaaggata cagcacgggg cttataggca aatggcacca aggcttgaac tgcgactccc
      661 gaagtgacca gtgccaccat ccatataatt atgggtttga ctactactat ggcatgccgt
      721 tcactctcgt tgacagctgc tggccggacc cctctcgtaa cacggaatta gcctttgaga
      781 gtcagctctg gctctgtgtg cagctagttg ccattgccat cctcacccta acctttggga
      841 agctgagcgg ctgggtctct gttccctggc tcctgatctt ctccatgatt ctgtttattt
      901 tcctcttggg ctatgcttgg ttctccagcc acacgtcccc tttatactgg gactgcctcc
      961 tcatgcgggg gcacgagatc acggagcagc ccatgaaggc tgaacgagct ggatccatta
     1021 tggtgaagga agcgatttcc tttttagaaa ggcacagtaa ggaaactttc cttctctttt
     1081 tctcctttct tcacgtgcac acacctctcc ccaccacgga cgatttcact ggcaccagca
     1141 agcatggctt gtatggggat aatgtggaag agatggactc catggtgggc aagattcttg
     1201 atgctatcga tgattttggc ctaaggaaca acacccttgt ctactttaca tcagatcacg
     1261 gagggcattt ggaagctagg cgagggcatg cccaacttgg tggatggaat ggaatataca
     1321 aaggtggaaa aggcatgggg ggctgggaag gtggaatccg cgtcccagga attgtccgat
     1381 ggcctggaaa ggtaccagct ggacggttga ttaaggaacc tacaagttta atggatattt
     1441 taccaactgt cgcatcagtg tcaggaggaa gtctccctca ggacagggtc attgacggcc
     1501 gagacctcat gcccttgctg cagggcaacg tcaggcactc ggagcatgaa tttcttttcc
     1561 actactgtgg ctcctacctg cacgccgtgc ggtggatccc caaggacgac agtgggtcag
     1621 tttggaaggc tcactatgtg accccggtat tccagccacc agcttctggt ggctgctatg
     1681 tcacctcatt atgcagatgt ttcggagaac aggttaccta ccacaacccc cctctgctct
     1741 tcgatctctc cagggacccc tcagagtcca cacccctgac acctgccaca gagcccctct
     1801 atgattttgt gattaaaaag gtggccaacg ccctgaagga acaccaggaa accatcgtgc
     1861 ctgtgaccta ccaactctca gaactgaatc agggcaggac gtggctgaag ccttgctgtg
     1921 gggtgttccc attttgtctg tgtgacaagg aagaggaagt ctctcagcct cggggtccta
     1981 acgagaagag ataattacaa tcaggctacc agaggaagcc tttggtccta acgagaagag
     2041 ataattacaa tcaggctacc aaaggaagca ctaactttgg tgctttcaag ttggcaagga
     2101 gtgcatttaa tagtcaataa attcatctac cattccagat tattaaaggc ccactggttg
     2161 ttcctaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa
//