LOCUS BC068565 3019 bp mRNA linear HUM 28-JUL-2005 DEFINITION Homo sapiens sulfatase 1, mRNA (cDNA clone MGC:87514 IMAGE:4831138), complete cds. ACCESSION BC068565 VERSION BC068565.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3019) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3019) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-APR-2004) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 167 Row: h Column: 7. FEATURES Location/Qualifiers source 1..3019 /db_xref="H-InvDB:HIT000263121" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:87514 IMAGE:4831138" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..3019 /gene="SULF1" /gene_synonym="HSULF-1" /gene_synonym="KIAA1077" /gene_synonym="SULF-1" /db_xref="GeneID:23213" CDS 64..1998 /gene="SULF1" /gene_synonym="HSULF-1" /gene_synonym="KIAA1077" /gene_synonym="SULF-1" /codon_start=1 /product="SULF1 protein" /protein_id="AAH68565.1" /db_xref="GeneID:23213" /translation="MCYGTPSYNYAPNMDKHWIMQYTGPMLPIHMEFTNILQRKRLQT LMSVDDSVERLYNMLVETGELENTYIIYTADHGYHIGQFGLVKGKSMPYDFDIRVPFF IRGPSVEPGSIVPQIVLNIDLAPTILDIAGLDTPPDVDGKSVLKLLDPEKPGNRFRTN KKAKIWRDTFLVERGKFLRKKEESSKNIQQSNHLPKYERVKELCQQARYQTACEQPGQ KWQCIEDTSGKLRIHKCKGPSDLLTVRQSTRNLYARGFHDKDKECSCRESGYRASRSQ RKSQRQFLRNQGTPKYKPRFVHTRQTRSLPVEFEGEIYDINLEEEEELQVLQPRNIAK RHDEGHKGPRDLQASSGGNRGRMLADSSNAVGPPTTVRVTHKCFILPNDSIHCERELY QSARAWKDHKAYIDKEIEALQDKIKNLREVRGHLKRRKPEECSCSKQSYYNKEKGVKK QEKLKSHLHPFKEAAQEVDSKLQLFKENNRRRKKERKEKRRQRKGEECSLPGLTCFTH DNNHWQTAPFWNLGSFCACTSSNNNTYWCLRTVNETHNFLFCEFATGFLEYFDMNTDP YQLTNTVHTVERGILNQLHVQLMELRSCQGYKQCNPRPKNLDVGNKDGGSYDLHRMLL FLVPCSAPLTSRAMVSDASF" BASE COUNT 968 a 649 c 727 g 675 t ORIGIN 1 attccaatgg aacagacagg gtaaggacca atctggactg tgttatcttt tccaggtgca 61 agtatgtgct atggaactcc tagttataac tatgcaccaa atatggataa acactggatt 121 atgcagtaca caggaccaat gctgcccatc cacatggaat ttacaaacat tctacagcgc 181 aaaaggctcc agactttgat gtcagtggat gattctgtgg agaggctgta taacatgctc 241 gtggagacgg gggagctgga gaatacttac atcatttaca ccgccgacca tggttaccat 301 attgggcagt ttggactggt caaggggaaa tccatgccat atgactttga tattcgtgtg 361 ccttttttta ttcgtggtcc aagtgtagaa ccaggatcaa tagtcccaca gatcgttctc 421 aacattgact tggcccccac gatcctggat attgctgggc tcgacacacc tcctgatgtg 481 gacggcaagt ctgtcctcaa acttctggac ccagaaaagc caggtaacag gtttcgaaca 541 aacaagaagg ccaaaatttg gcgtgataca ttcctagtgg aaagaggcaa atttctacgt 601 aagaaggaag aatccagcaa gaatatccaa cagtcaaatc acttgcccaa atatgaacgg 661 gtcaaagaac tatgccagca ggccaggtac cagacagcct gtgaacaacc ggggcagaag 721 tggcaatgca ttgaggatac atctggcaag cttcgaattc acaagtgtaa aggacccagt 781 gacctgctca cagtccggca gagcacgcgg aacctctacg ctcgcggctt ccatgacaaa 841 gacaaagagt gcagttgtag ggagtctggt taccgtgcca gcagaagcca aagaaagagt 901 caacggcaat tcttgagaaa ccaggggact ccaaagtaca agcccagatt tgtccatact 961 cggcagacac gttccttgcc cgtcgaattt gaaggtgaaa tatatgacat aaatctggaa 1021 gaagaagaag aattgcaagt gttgcaacca agaaacattg ctaagcgtca tgatgaaggc 1081 cacaaggggc caagagatct ccaggcttcc agtggtggca acaggggcag gatgctggca 1141 gatagcagca acgccgtggg cccacctacc actgtccgag tgacacacaa gtgttttatt 1201 cttcccaatg actctatcca ttgtgagaga gaactgtacc aatcggccag agcgtggaag 1261 gaccataagg catacattga caaagagatt gaagctctgc aagataaaat taagaattta 1321 agagaagtga gaggacatct gaagagaagg aagcctgagg aatgtagctg cagtaaacaa 1381 agctattaca ataaagagaa aggtgtaaaa aagcaagaga aattaaagag ccatcttcac 1441 ccattcaagg aggctgctca ggaagtagat agcaaactgc aacttttcaa ggagaacaac 1501 cgtaggagga agaaggagag gaaggagaag agacggcaga ggaaggggga agagtgcagc 1561 ctgcctggcc tcacttgctt cacgcatgac aacaaccact ggcagacagc cccgttctgg 1621 aacctgggat ctttctgtgc ttgcacgagt tctaacaata acacctactg gtgtttgcgt 1681 acagttaatg agacgcataa ttttcttttc tgtgagtttg ctactggctt tttggagtat 1741 tttgatatga atacagatcc ttatcagctc acaaatacag tgcacacggt agaacgaggc 1801 attttgaatc agctacacgt acaactaatg gagctcagaa gctgtcaagg atataagcag 1861 tgcaacccaa gacctaagaa tcttgatgtt ggaaataaag atggaggaag ctatgaccta 1921 cacagaatgt tgctgttcct ggtaccctgt tcggcccctc taacctccag agctatggtc 1981 tcagatgctt ccttttagag agaaggtcat tagtccacca agaagccaaa tgacaacagg 2041 aaaggtgatg ggaagatgaa aacaaaggaa ggtggacttt tgggtatatg ttatagccat 2101 aggacagtta tgggatggat gggaaggtta atcagccccg tctcactgca gacatcaact 2161 ggcaaggcct agaggagcta cacagtgtga atgaaaacat ctatgagtac agacaaaact 2221 acagacttag tctggtggac tggactaatt acttgaagga tttagataga gtatttgcac 2281 tgctgaagag tcactatgag caaaataaaa caaataagac tcaaactgct caaagtgacg 2341 ggttcttggt tgtctctgct gagcacgctg tgtcaatgga gatggcctct gctgactcag 2401 atgaagaccc aaggcataag gttgggaaaa cacctcattt gaccttgcca gctgaccttc 2461 aaaccctgca tttgaaccga ccaacattaa gtccagagag taaacttgaa tggaataacg 2521 acattccaga agttaatcat ttgaattctg aacactggag aaaaaccgaa aaatggacgg 2581 ggcatgaaga gactaatcat ctggaaaccg atttcagtgg cgatggcatg acagagctag 2641 agctcgggcc cagccccagg ctgcagccca ttcgcaggca cccgaaagaa cttccccagt 2701 atggtggtcc tggaaaggac atttttgaag atcaactata tcttcctgtg cattccgatg 2761 gaatttcagt tcatcagatg ttcaccatgg ccaccgcaga acaccgaagt aattccagca 2821 tagcggggaa gatgttgacc aaggtggaga agaatcacga aaaggagaag tcacagcacc 2881 tagaaggcag cgcctcctct tcactctcct ctgattagat gaaactgtta ccttacccta 2941 aacacagtat ttctttttaa cttttttatt tgtaaactaa taaaggtaat cacagccacc 3001 aacaaaaaaa aaaaaaaaa //