LOCUS BC022389 2195 bp mRNA linear HUM 06-OCT-2003
DEFINITION Homo sapiens arylsulfatase F, mRNA (cDNA clone MGC:24090
IMAGE:4609825), complete cds.
ACCESSION BC022389
VERSION BC022389.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2195)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2195)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (01-FEB-2002) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: CLONTECH
cDNA Library Preparation: CLONTECH Laboratories, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Sequencing Group at the Stanford Human Genome
Center, Stanford University School of Medicine, Stanford, CA 94305
Web site: http://www-shgc.stanford.edu
Contact: (Dickson, Mark) mcd@paxil.stanford.edu
Dickson, M., Schmutz, J., Grimwood, J., Rodriguez, A., and Myers,
R. M.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAL Plate: 36 Row: f Column: 3
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 6552297.
FEATURES Location/Qualifiers
source 1..2195
/db_xref="H-InvDB:HIT000039515"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:24090 IMAGE:4609825"
/tissue_type="Kidney"
/clone_lib="NIH_MGC_75"
/lab_host="DH10B"
/note="Vector: pDNR-LIB"
gene 1..2195
/gene="ARSF"
/gene_synonym="ASF"
/db_xref="GeneID:416"
/db_xref="MIM:300003"
CDS 222..1994
/gene="ARSF"
/gene_synonym="ASF"
/codon_start=1
/product="ARSF protein"
/protein_id="AAH22389.1"
/db_xref="GeneID:416"
/db_xref="MIM:300003"
/translation="MRPRRPLVFMSLVCALLNTCQAHRVHDDKPNIVLIMVDDLGIGD
LGCYGNDTMRTPHIDRLAREGVRLTQHISAASLCSPSRSAFLTGRYPIRSGMVSSGNR
RVIQNLAVPAGLPLNETTLAALLKKQGYSTGLIGKWHQGLNCDSRSDQCHHPYNYGFD
YYYGMPFTLVDSCWPDPSRNTELAFESQLWLCVQLVAIAILTLTFGKLSGWVSVPWLL
IFSMILFIFLLGYAWFSSHTSPLYWDCLLMRGHEITEQPMKAERAGSIMVKEAISFLE
RHSKETFLLFFSFLHVHTPLPTTDDFTGTSKHGLYGDNVEEMDSMVGKILDAIDDFGL
RNNTLVYFTSDHGGHLEARRGHAQLGGWNGIYKGGKGMGGWEGGIRVPGIVRWPGKVP
AGRLIKEPTSLMDILPTVASVSGGSLPQDRVIDGRDLMPLLQGNVRHSEHEFLFHYCG
SYLHAVRWIPKDDSGSVWKAHYVTPVFQPPASGGCYVTSLCRCFGEQVTYHNPPLLFD
LSRDPSESTPLTPATEPLYDFVIKKVANALKEHQETIVPVTYQLSELNQGRTWLKPCC
GVFPFCLCDKEEEVSQPRGPNEKR"
misc_feature 309..1760
/gene="ARSF"
/gene_synonym="ASF"
/note="Sulfatase; Region: Sulfatase"
/db_xref="CDD:pfam00884"
BASE COUNT 542 a 577 c 543 g 533 t
ORIGIN
1 agtggagaca ggctctcacc atgttgacca ggctggtctc gaactcccga cctcaagtga
61 tccgccggcc tcggcctccc aaagtgctgg gattacaggc gtgaaccacc actcctggcc
121 cttctgttac ttttccactg ctgttgagat agaagtcact gttgggatcc aacctttttg
181 tttgtataaa ctgacaacaa gaaggtattc caagctgcac aatgaggccc aggagaccct
241 tggtcttcat gtctttggtg tgtgcactct tgaacacatg ccaggcacac agggtgcatg
301 acgacaagcc taatattgtc ctaatcatgg ttgatgacct gggtattgga gatctgggct
361 gctacggcaa tgacaccatg aggacgcctc acatcgaccg ccttgccagg gaaggcgtgc
421 gactgactca gcacatctct gccgcctccc tctgcagccc aagccggtcc gcgttcttga
481 cgggaagata ccccatccga tcaggtatgg tttctagtgg taatagacgt gtcatccaaa
541 atcttgcagt ccccgcaggc ctccctctta atgagacaac acttgcagcc ttgctaaaga
601 agcaaggata cagcacgggg cttataggca aatggcacca aggcttgaac tgcgactccc
661 gaagtgacca gtgccaccat ccatataatt atgggtttga ctactactat ggcatgccgt
721 tcactctcgt tgacagctgc tggccggacc cctctcgtaa cacggaatta gcctttgaga
781 gtcagctctg gctctgtgtg cagctagttg ccattgccat cctcacccta acctttggga
841 agctgagcgg ctgggtctct gttccctggc tcctgatctt ctccatgatt ctgtttattt
901 tcctcttggg ctatgcttgg ttctccagcc acacgtcccc tttatactgg gactgcctcc
961 tcatgcgggg gcacgagatc acggagcagc ccatgaaggc tgaacgagct ggatccatta
1021 tggtgaagga agcgatttcc tttttagaaa ggcacagtaa ggaaactttc cttctctttt
1081 tctcctttct tcacgtgcac acacctctcc ccaccacgga cgatttcact ggcaccagca
1141 agcatggctt gtatggggat aatgtggaag agatggactc catggtgggc aagattcttg
1201 atgctatcga tgattttggc ctaaggaaca acacccttgt ctactttaca tcagatcacg
1261 gagggcattt ggaagctagg cgagggcatg cccaacttgg tggatggaat ggaatataca
1321 aaggtggaaa aggcatgggg ggctgggaag gtggaatccg cgtcccagga attgtccgat
1381 ggcctggaaa ggtaccagct ggacggttga ttaaggaacc tacaagttta atggatattt
1441 taccaactgt cgcatcagtg tcaggaggaa gtctccctca ggacagggtc attgacggcc
1501 gagacctcat gcccttgctg cagggcaacg tcaggcactc ggagcatgaa tttcttttcc
1561 actactgtgg ctcctacctg cacgccgtgc ggtggatccc caaggacgac agtgggtcag
1621 tttggaaggc tcactatgtg accccggtat tccagccacc agcttctggt ggctgctatg
1681 tcacctcatt atgcagatgt ttcggagaac aggttaccta ccacaacccc cctctgctct
1741 tcgatctctc cagggacccc tcagagtcca cacccctgac acctgccaca gagcccctct
1801 atgattttgt gattaaaaag gtggccaacg ccctgaagga acaccaggaa accatcgtgc
1861 ctgtgaccta ccaactctca gaactgaatc agggcaggac gtggctgaag ccttgctgtg
1921 gggtgttccc attttgtctg tgtgacaagg aagaggaagt ctctcagcct cggggtccta
1981 acgagaagag ataattacaa tcaggctacc agaggaagcc tttggtccta acgagaagag
2041 ataattacaa tcaggctacc aaaggaagca ctaactttgg tgctttcaag ttggcaagga
2101 gtgcatttaa tagtcaataa attcatctac cattccagat tattaaaggc ccactggttg
2161 ttcctaaaaa aaaaaaaaaa aaaaaaaaaa aaaaa
//