LOCUS BC015600 1822 bp mRNA linear HUM 23-DEC-2003 DEFINITION Homo sapiens sulfatase modifying factor 2, mRNA (cDNA clone IMAGE:4653574), partial cds. ACCESSION BC015600 VERSION BC015600.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 1822) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 1822) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (04-OCT-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Dec 19, 2003 this sequence version replaced BC015600.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 33 Row: i Column: 7. FEATURES Location/Qualifiers source 1..1822 /db_xref="H-InvDB:HIT000037207" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:4653574" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..1822 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /db_xref="GeneID:25870" /db_xref="MIM:607940" CDS <1..1005 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /codon_start=1 /product="SUMF2 protein" /protein_id="AAH15600.2" /db_xref="GeneID:25870" /db_xref="MIM:607940" /translation="LPLLPLLSLLVGAWLKLGNGQATSMVQLQGGRFLMGTNSPDSRD GEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAEMFGWSFVFEDFVSDELRNK ATQPMKSVLWWLPVEKAFWRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPT EEEWEFAARGGLKGQVYPWGNWFQPNRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQN NYGWATLQIQPQTTSVSAVLQTQAGRQGSCKQPGGDKEKSLLGSLSFPGHVANSAIPS SRASASGKNFPFPVSHPSVAGASHQGRRGLSLLCFGEGAQCVLTMAGGQVFLLEAKYY " misc_feature 61..663 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /note="DUF323; Region: Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown" /db_xref="CDD:pfam03781" BASE COUNT 439 a 459 c 476 g 448 t ORIGIN 1 ttaccgctgc tgcccctgct gtcgctcctg gtcggcgcgt ggctcaagct aggaaatgga 61 caggctacta gcatggtcca actgcagggt gggagattcc tgatgggaac aaattctcca 121 gacagcagag atggtgaagg gcctgtgcgg gaggcgacag tgaaaccctt tgccatcgac 181 atatttcctg tcaccaacaa agatttcagg gattttgtca gggagaaaaa gtatcggaca 241 gaagctgaga tgtttggatg gagctttgtc tttgaggact ttgtctctga tgagctgaga 301 aacaaagcca cccagccaat gaagtctgta ctctggtggc ttccagtgga aaaggcattt 361 tggaggcagc ctgcaggtcc tggctctggc atccgagaga gactggagca cccagtgtta 421 cacgtgagct ggaatgacgc ccgtgcctac tgtgcttggc ggggaaaacg actgcccacg 481 gaggaagagt gggagtttgc cgcccgaggg ggcttgaagg gtcaagttta cccatggggg 541 aactggttcc agccaaaccg caccaacctg tggcagggaa agttccccaa gggagacaaa 601 gctgaggatg gcttccatgg agtctcccca gtgaatgctt tccccgccca gaacaactac 661 ggatgggcaa cactccagat tcagcctcag acaacctcgg tttccgctgt gctgcagacg 721 caggccggcc gccaggggag ctgtaagcag ccgggtggtg acaaggagaa aagccttcta 781 gggtcactgt cattccctgg ccatgttgca aacagcgcaa ttccaagctc gagagcttca 841 gcctcaggaa agaacttccc cttccctgtc tcccatccct ctgtggcagg cgcctctcac 901 cagggcagga gaggactcag cctcctgtgt tttggagaag gggcccaatg tgtgttgacg 961 atggctgggg gccaggtgtt tctgttagag gccaagtatt attgacacag gattgcaaac 1021 acacaaacaa ttggaacaga gcactctgaa aggccatttt ttaagcattt taaaatctat 1081 tctctccccc tttctccctg gatgattcag gaagctgaca ttgtttcctc aaggcagaat 1141 tttcctggtt ctgttttctc agccagttgc tgtggaagga gaatgctttc tttgtggcct 1201 catctgtggt ttcgtgtccc tctgaaggaa actagtttcc actgtgtaac aggcagacat 1261 gtaactattt aaagcacagt tcagtcctaa aagggtctgg gagaaccaga tgatgtacta 1321 ggtgaagcat tgcattgtgg gaatcacaaa gcaaatagta ctccagaaag acaaatatca 1381 gaagcttcct attctttttt tttttttttt ttttttgaga cagggtcttt ctctgttgcc 1441 caggctagag tgcactggtg atcacggctc actctagcct tgaattcctg ggcccaagca 1501 attctcccac ctcagcctcc tgagtagctg ggactacaag tgtgcaccac catgcctggc 1561 taattttttg aatttttgta gtgatgggat ctcgctctgt tgcccagggt ggtctcgaac 1621 tcctggcctc aagcgatcct cccacctcga cctcccaaag tgctgggatt acaggtgtga 1681 gccacctcgc ctgggccccc ttctccatat gcctccaaaa acatgtccct ggagagtagc 1741 ctgctcccac actgtcactg gatgtcatgg ggccaataaa atctcctgca attgtgtatc 1801 tcaaaaaaaa aaaaaaaaaa aa //