LOCUS BC084539 2051 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens sulfatase modifying factor 2, mRNA (cDNA clone MGC:99485 IMAGE:6599080), complete cds. ACCESSION BC084539 VERSION BC084539.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2051) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2051) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (12-OCT-2004) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 59 Row: o Column: 14 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 24308052. FEATURES Location/Qualifiers source 1..2051 /db_xref="H-InvDB:HIT000266048" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:99485 IMAGE:6599080" /tissue_type="Ovary, teratocarcinoma" /clone_lib="NIH_MGC_109" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2051 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /gene_synonym="pFGE" /db_xref="GeneID:25870" /db_xref="HGNC:HGNC:20415" /db_xref="MIM:607940" CDS 19..924 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /gene_synonym="pFGE" /codon_start=1 /product="sulfatase modifying factor 2" /protein_id="AAH84539.1" /db_xref="GeneID:25870" /db_xref="HGNC:HGNC:20415" /db_xref="MIM:607940" /translation="MARHGLPLLPLLSLLVGAWLKLGNGQATSMVQLQGGRFLMGTNS PDSRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAEMFGWSFVFEDFVSD ELRNKATQPMKSVLWWLPVEKAFWRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRG KRLPTEEEWEFAARGGLKGQVYPWGNWFQPNRTNLWQGKFPKGDKAEDGFHGVSPVNA FPAQNNYGLYDLLGNVWEWTASPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMG NTPDSASDNLGFRCAADAGRPPGEL" BASE COUNT 519 a 516 c 538 g 478 t ORIGIN 1 cgcagcgcgg cagtcctgat ggcccggcat gggttaccgc tgctgcccct gctgtcgctc 61 ctggtcggcg cgtggctcaa gctaggaaat ggacaggcta ctagcatggt ccaactgcag 121 ggtgggagat tcctgatggg aacaaattct ccagacagca gagatggtga agggcctgtg 181 cgggaggcga cagtgaaacc ctttgccatc gacatatttc ctgtcaccaa caaagatttc 241 agggattttg tcagggagaa aaagtatcgg acagaagctg agatgtttgg atggagcttt 301 gtctttgagg actttgtctc tgatgagctg agaaacaaag ccacccagcc aatgaagtct 361 gtactctggt ggcttccagt ggaaaaggca ttttggaggc agcctgcagg tcctggctct 421 ggcatccgag agagactgga gcacccagtg ttacacgtga gctggaatga cgcccgtgcc 481 tactgtgctt ggcggggaaa acgactgccc acggaggaag agtgggagtt tgccgcccga 541 gggggcttga agggtcaagt ttacccatgg gggaactggt tccagccaaa ccgcaccaac 601 ctgtggcagg gaaagttccc caagggagac aaagctgagg atggcttcca tggagtctcc 661 ccagtgaatg ctttccccgc ccagaacaac tacgggctct atgacctcct ggggaacgtg 721 tgggagtgga cagcatcacc gtaccaggct gctgagcagg acatgcgcgt cctccggggg 781 gcatcctgga tcgacacagc tgatggctct gccaatcacc gggcccgggt caccaccagg 841 atgggcaaca ctccagattc agcctcagac aacctcggtt tccgctgtgc tgcagacgca 901 ggccggccgc caggggagct gtaagcagcc gggtggtgac aaggagaaaa gccttctagg 961 gtcactgtca ttccctggcc atgttgcaaa cagcgcaatt ccaagctcga gagcttcagc 1021 ctcaggaaag aacttcccct tccctgtctc ccatccctct gtggcaggcg cctctcacca 1081 gggcaggaga ggactcagcc tcctgtgttt tggagaaggg gcccaatgtg tgttgacgat 1141 ggctgggggc caggtgtttc tgttagaggc caagtattat tgacacagga ttgcaaacac 1201 acaaacaatt ggaacagagc actctgaaag gccatttttt aagcatttta aaatctattc 1261 tctccccctt tctccctgga tgattcagga agctgacatt gtttcctcaa ggcagaattt 1321 tcctggttct gttttctcag ccagttgctg tggaaggaga atgctttctt tgtggcctca 1381 tctgtggttt cgtgtccctc tgaaggaaac tagtttccac tgtgtaacag gcagacatgt 1441 aactatttaa agcacagttc agtcctaaaa gggtctggga gaaccagatg atgtactagg 1501 tgaagcattg cattgtggga atcacaaagc aaatagtact ccagaaagac aaatatcaga 1561 agcttcctat tctttttttt tttttttttt ttttttgaga cagggtcttt ctctgttgcc 1621 caggctagag tgcactggtg atcacggctc actctagcct tgaattcctg ggcccaagca 1681 attctcccac ctcagcctcc tgagtagctg ggactacaag tgtgcaccac catgcctggc 1741 taattttttg aatttttgta gtgatgggat ctcgctctgt tgcccagggt ggtctcgaac 1801 tcctggcctc aagcgatcct cccacctcga cctcccaaag tgctgggatt acaggtgtga 1861 gccacctcgc ctgggccccc ttctccatat gcctccaaaa acatgtccct ggagagtagc 1921 ctgctcccac actgtcactg gatgtcatgg ggccaataaa atctcctgca attgtgtatc 1981 tcaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2041 aaaaaaaaaa a //