LOCUS BC006159 2005 bp mRNA linear HUM 16-SEP-2003 DEFINITION Homo sapiens sulfatase modifying factor 2, mRNA (cDNA clone IMAGE:3635549), partial cds. ACCESSION BC006159 VERSION BC006159.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2005) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2005) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (02-APR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, George Yang, Scott Zuyderduyn, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 18 Row: i Column: 18. FEATURES Location/Qualifiers source 1..2005 /db_xref="H-InvDB:HIT000086605" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:3635549" /tissue_type="Placenta, choriocarcinoma" /clone_lib="NIH_MGC_21" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..2005 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /db_xref="GeneID:25870" /db_xref="MIM:607940" CDS <1..892 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /codon_start=2 /product="SUMF2 protein" /protein_id="AAH06159.1" /db_xref="GeneID:25870" /db_xref="MIM:607940" /translation="LPLLPLLSLLVGAWLKLGNGQATSMVQLQGGRFLMGTNSPDSRD GEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAEMFGWSFVFEDFVSDELRNK ATQPMKSVLWWLPVEKAFWRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPT EEEWEFAARGGLKGQVYPWGNWFQPNRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQN NYGLYDLLGNVWEWTASPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDS ASDNLGFRCAADAGRPPGEL" misc_feature 62..862 /gene="SUMF2" /gene_synonym="DKFZp66I1024" /note="DUF323; Region: Domain of unknown function (DUF323). This presumed domain is found in bacterial proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown" /db_xref="CDD:pfam03781" BASE COUNT 508 a 503 c 525 g 469 t ORIGIN 1 gttaccgctg ctgcccctgc tgtcgctcct ggtcggcgcg tggctcaagc taggaaatgg 61 acaggctact agcatggtcc aactgcaggg tgggagattc ctgatgggaa caaattctcc 121 agacagcaga gatggtgaag ggcctgtgcg ggaggcgaca gtgaaaccct ttgccatcga 181 catatttcct gtcaccaaca aagatttcag ggattttgtc agggagaaaa agtatcggac 241 agaagctgag atgtttggat ggagctttgt ctttgaggac tttgtctctg atgagctgag 301 aaacaaagcc acccagccaa tgaagtctgt actctggtgg cttccagtgg aaaaggcatt 361 ttggaggcag cctgcaggtc ctggctctgg catccgagag agactggagc acccagtgtt 421 acacgtgagc tggaatgacg cccgtgccta ctgtgcttgg cggggaaaac gactgcccac 481 ggaggaagag tgggagtttg ccgcccgagg gggcttgaag ggtcaagttt acccatgggg 541 gaactggttc cagccaaacc gcaccaacct gtggcaggga aagttcccca agggagacaa 601 agctgaggat ggcttccatg gagtctcccc agtgaatgct ttccccgccc agaacaacta 661 cgggctctat gacctcctgg ggaacgtgtg ggagtggaca gcatcaccgt accaggctgc 721 tgagcaggac atgcgcgtcc tccggggggc atcctggatc gacacagctg atggctctgc 781 caatcaccgg gcccgggtca ccaccaggat gggcaacact ccagattcag cctcagacaa 841 cctcggtttc cgctgtgctg cagacgcagg ccggccgcca ggggagctgt aagcagccgg 901 gtggtgacaa ggagaaaagc cttctagggt cactgtcatt ccctggccat gttgcaaaca 961 gcgcaattcc aagctcgaga gcttcagcct caggaaagaa cttccccttc cctgtctccc 1021 atccctctgt ggcaggcgcc tctcaccagg gcaggagagg actcagcctc ctgtgttttg 1081 gagaaggggc ccaatgtgtg ttgacgatgg ctgggggcca ggtgtttctg ttagaggcca 1141 agtattattg acacaggatt gcaaacacac aaacaattgg aacagagcac tctgaaaggc 1201 cattttttaa gcattttaaa atctattctc tccccctttc tccctggatg attcaggaag 1261 ctgacattgt ttcctcaagg cagaattttc ctggttctgt tttctcagcc agttgctgtg 1321 gaaggagaat gctttctttg tggcctcatc tgtggtttcg tgtccctctg aaggaaacta 1381 gtttccactg tgtaacaggc agacatgtaa ctatttaaag cacagttcag tcctaaaagg 1441 gtctgggaga accagatgat gtactaggtg aagcattgca ttgtgggaat cacaaagcaa 1501 atagtactcc agaaagacaa atatcagaag cttcctattc tttttttttt tttttttttt 1561 ttgagacagg gtctttctct gttgcccagg ctagagtgca ctggtgatca cggctcactc 1621 tagccttgaa ttcctgggcc caagcaattc tcccacctca gcctcctgag tagctgggac 1681 tacaagtgtg caccaccatg cctggctaat tttttgaatt tttgtagtga tgggatctcg 1741 ctctgttgcc cagggtggtc tcgaactcct ggcctcaagc gatcctccca cctcgacctc 1801 ccaaagtgct gggattacag gtgtgagcca cctcgcctgg gcccccttct ccatatgcct 1861 ccaaaaacat gtccctggag agtagcctgc tcccacactg tcactggatg tcatggggcc 1921 aataaaatct cctgcaattg tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 1981 aaaaaaaaaa aaaaaaaaaa aaaaa //