LOCUS BC121123 1129 bp mRNA linear HUM 04-OCT-2006 DEFINITION Homo sapiens sulfatase modifying factor 1, mRNA (cDNA clone MGC:150436 IMAGE:40120671), complete cds. ACCESSION BC121123 VERSION BC121123.2 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1129) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1129) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (02-AUG-2006) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Oct 4, 2006 this sequence version replaced BC121123.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Baylor Human Genome Sequencing Center cDNA Library Preparation: Baylor Human Genome Sequencing Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAM Plate: 29 Row: g Column: 12. FEATURES Location/Qualifiers source 1..1129 /db_xref="H-InvDB:HIT000388320" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:150436 IMAGE:40120671" /tissue_type="PCR rescued clones" /clone_lib="NIH_MGC_283" /note="Vector: pCR-Blunt II-TOPO with reversed insert; Clone identification sequence tag: GAACTATT sequenced from the forward primer" gene 1..1129 /gene="SUMF1" /gene_synonym="FGE" /db_xref="GeneID:285362" /db_xref="HGNC:HGNC:20376" /db_xref="MIM:607939" CDS 30..1094 /gene="SUMF1" /gene_synonym="FGE" /codon_start=1 /product="SUMF1 protein" /protein_id="AAI21124.1" /db_xref="GeneID:285362" /db_xref="HGNC:HGNC:20376" /db_xref="MIM:607939" /translation="MAAPALGLVCGRCPELGLVLLLLLLSLLCGAAGSQEAGTGAGAG SLAGSCGCGTPQRPGAHGSSAAAHRYSREANAPGPVPGERQLAHSKMVPIPAGVFTMG TDDPQIKQDGEAPARRATIDAFYMDAYEVSNTEFEKFVNSTGYLTEAEKFGDSFVFEG MLSEQVKTNIQQAVAAAPWWLPVKGANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYC TWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQ GTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPSYCYRYRCAARSQNTP DSSASNLGFRCAADRLPTMD" BASE COUNT 242 a 302 c 337 g 248 t ORIGIN 1 aacacgggtc acatggcccg cgggacaaca tggctgcgcc cgcactaggg ctggtgtgtg 61 gacgttgccc tgagctgggt ctcgtcctct tgctgctgct gctctcgctg ctgtgtggag 121 cggcagggag ccaggaggcc gggaccggtg cgggcgcggg gtcccttgcg ggttcttgcg 181 gctgcggcac gccccagcgg cctggcgccc atggcagttc ggcagccgct caccgatact 241 cgcgggaggc taacgctccg ggccccgtac ccggagagcg gcaactcgcg cactcaaaga 301 tggtccccat ccctgctgga gtatttacaa tgggcacaga tgatcctcag ataaagcagg 361 atggggaagc acctgcgagg agagctacta ttgatgcctt ttacatggat gcctatgaag 421 tcagtaatac tgaatttgag aagtttgtga actcaactgg ctatttgaca gaggctgaga 481 agtttggcga ctcctttgtc tttgaaggca tgttgagtga gcaagtgaag accaatattc 541 aacaggcagt tgcagctgct ccctggtggt tacctgtgaa aggcgctaac tggagacacc 601 cagaagggcc tgactctact attctgcaca ggccggatca tccagttctc catgtgtcct 661 ggaatgatgc ggttgcctac tgcacttggg cagggaagcg gctgcccacg gaagctgagt 721 gggaatacag ctgtcgagga ggcctgcata atagactttt cccctggggc aacaaactgc 781 agcccaaagg ccagcattat gccaacattt ggcagggcga gtttccggtg accaacactg 841 gtgaggatgg cttccaagga actgcgcctg ttgatgcctt ccctcccaat ggttatggct 901 tatacaacat agtggggaac gcatgggaat ggacttcaga ctggtggact gttcatcatt 961 ctgttgaaga aacgcttaac ccatcttatt gttacaggta tcgctgtgct gctcggagcc 1021 agaacacacc tgatagctct gcttcgaatc tgggattccg ctgtgcagcc gaccgcctgc 1081 ccaccatgga ctgacaacca agggtagtct tccccagtcc aaggagcag //