LOCUS BC110862 2165 bp mRNA linear HUM 21-JUL-2006 DEFINITION Homo sapiens sulfatase modifying factor 1, mRNA (cDNA clone MGC:131853 IMAGE:5760490), complete cds. ACCESSION BC110862 VERSION BC110862.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2165) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2165) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (07-DEC-2005) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Life Technologies, Inc. cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 255 Row: i Column: 15 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 38202249. FEATURES Location/Qualifiers source 1..2165 /db_xref="H-InvDB:HIT000339753" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:131853 IMAGE:5760490" /tissue_type="Colon, Kidney, Stomach, adult, whole pooled" /clone_lib="NIH_MGC_116" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2165 /gene="SUMF1" /gene_synonym="FGE" /db_xref="GeneID:285362" /db_xref="HGNC:HGNC:20376" /db_xref="MIM:607939" CDS 9..1133 /gene="SUMF1" /gene_synonym="FGE" /codon_start=1 /product="sulfatase modifying factor 1" /protein_id="AAI10863.1" /db_xref="GeneID:285362" /db_xref="HGNC:HGNC:20376" /db_xref="MIM:607939" /translation="MAAPALGLVCGRCPELGLVLLLLLLSLLCGAAGSQEAGTGAGAG SLAGSCGCGTPQRPGAHGSSAAAHRYSREANAPGPVPGERQLAHSKMVPIPAGVFTMG TDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAEKFGDSFVFEG MLSEQVKTNIQQAVAAAPWWLPVKGANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYC TWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQ GTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSY MCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPTMD" BASE COUNT 527 a 503 c 587 g 548 t ORIGIN 1 gggacaacat ggctgcgccc gcactagggc tggtgtgtgg acgttgccct gagctgggtc 61 tcgtcctctt gctgctgctg ctctcgctgc tgtgtggagc ggcagggagc caggaggccg 121 ggaccggtgc gggcgcgggg tcccttgcgg gttcttgcgg ctgcggcacg ccccagcggc 181 ctggcgccca tggcagttcg gcagccgctc accgatactc gcgggaggct aacgctccgg 241 gccccgtacc cggagagcgg caactcgcgc actcaaagat ggtccccatc cctgctggag 301 tatttacaat gggcacagat gatcctcaga taaagcagga tggggaagca cctgcgagga 361 gagttactat tgatgccttt tacatggatg cctatgaagt cagtaatact gaatttgaga 421 agtttgtgaa ctcaactggc tatttgacag aggctgagaa gtttggcgac tcctttgtct 481 ttgaaggcat gttgagtgag caagtgaaga ccaatattca acaggcagtt gcagctgctc 541 cctggtggtt acctgtgaaa ggcgctaact ggagacaccc agaagggcct gactctacta 601 ttctgcacag gccggatcat ccagttctcc atgtgtcctg gaatgatgcg gttgcctact 661 gcacttgggc agggaagcgg ctgcccacgg aagctgagtg ggaatacagc tgtcgaggag 721 gcctgcataa tagacttttc ccctggggca acaaactgca gcccaaaggc cagcattatg 781 ccaacatttg gcagggcgag tttccggtga ccaacactgg tgaggatggc ttccaaggaa 841 ctgcgcctgt tgatgccttc cctcccaatg gttatggctt atacaacata gtggggaacg 901 catgggaatg gacttcagac tggtggactg ttcatcattc tgttgaagaa acgcttaacc 961 caaaaggtcc cccttctggg aaagaccgag tgaagaaagg tggatcctac atgtgccata 1021 ggtcttattg ttacaggtat cgctgtgctg ctcggagcca gaacacacct gatagctctg 1081 cttcgaatct gggattccgc tgtgcagccg accgcctgcc cactatggac tgacaaccaa 1141 ggaaagtctt ccccagtcca aggagcagtc gtgtctgacc tacattgggc ttttctcaga 1201 actttgaacg atcccatgca aagaattccc accctgaggt gggttacata cctgcccaat 1261 ggccaaagga accgccttgt gagaccaaat tgctgacctg ggtcagtgca tgtgctttat 1321 ggtgtggtgc atctttggag atcatcgcca tattttactt ttgagagtct ttaaagagga 1381 aggggagtgg agggaaccct gagctaggct tcaggaggcc cgcgtcctac gcaggctctg 1441 ccacaggggt tagaccccag gtccgacgct tgaccttcct gggcctcaag tgccctcccc 1501 tatcaaatga agggatggac agcatgacct ctgggtgtct ctccaactca ccagttctaa 1561 aaagggtatc agattctatt gtgacttcat agtgagaatt tatgatagat tattttttag 1621 ctattttttc catgtgtgaa ccttgagtga tactaatcat gtaaagtaag agttctctta 1681 tgtattattt tcggaagagg ggtgtggtga ctcctttata ttcgtactgc actttgtttt 1741 tccaaggaaa tcagtgtctt ttacgttgtt atgatgaatc ccacatgggg ccggtgatgg 1801 tatgctgcag ttcagccgtt gaacacatag gaatgtctgt ggggtgactc tactgtgctt 1861 tatcttttaa cattaagtgc ctttggttca gaggggcagt cataagctct gtttccccct 1921 ctccccaaag ccttcagcga acgtgaaatg tgcgctaaac ggggaaacct gtttaattct 1981 agatataggg aaaaaggaac gaggaccttg aatgagctat attcagggta tccggtattt 2041 tgtaataggg aataggaaac cttgttggct gtggaatatc cgatgctttg aatcatgcac 2101 tgtgttgaat aaacgtatct gctaaatcag gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2161 aaaaa //