LOCUS       BC017005                1443 bp    mRNA    linear   HUM 30-SEP-2003
DEFINITION  Homo sapiens sulfatase modifying factor 1, mRNA (cDNA clone
            IMAGE:4430514), partial cds.
ACCESSION   BC017005
VERSION     BC017005.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1443)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1443)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-NOV-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Sep 16, 2003 this sequence version replaced BC017005.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Sequencing Group at the Stanford Human Genome
            Center, Stanford University School of Medicine, Stanford, CA  94305
            Web site:       http://www-shgc.stanford.edu
            Contact:  (Dickson, Mark) mcd@paxil.stanford.edu
            Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers,
            R. M.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 27 Row: n Column: 6.
FEATURES             Location/Qualifiers
     source          1..1443
                     /db_xref="H-InvDB:HIT000089152"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4430514"
                     /tissue_type="Prostate, adenocarcinoma."
                     /clone_lib="NIH_MGC_91"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            <1..1443
                     /gene="SUMF1"
                     /gene_synonym="FGE"
                     /db_xref="GeneID:285362"
                     /db_xref="MIM:607939"
     CDS             <1..401
                     /gene="SUMF1"
                     /gene_synonym="FGE"
                     /codon_start=3
                     /product="SUMF1 protein"
                     /protein_id="AAH17005.2"
                     /db_xref="GeneID:285362"
                     /db_xref="MIM:607939"
                     /translation="LFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPP
                     NGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRY
                     RCAARSQNTPDSSASNLGFRCAADRLPTMD"
     misc_feature    3..377
                     /gene="SUMF1"
                     /gene_synonym="FGE"
                     /note="DUF323; Region: Domain of unknown function
                     (DUF323). This presumed domain is found in bacterial
                     proteins. In some cases these proteins also contain a
                     protein kinase domain. The function of this domain is
                     unknown"
                     /db_xref="CDD:pfam03781"
BASE COUNT          388 a          313 c          353 g          389 t
ORIGIN      
        1 gacttttccc ctggggcaac aaactgcagc ccaaaggcca gcattatgcc aacatttggc
       61 agggcgagtt tccggtgacc aacactggtg aggatggctt ccaaggaact gcgcctgttg
      121 atgccttccc tcccaatggt tatggcttat acaacatagt ggggaacgca tgggaatgga
      181 cttcagactg gtggactgtt catcattctg ttgaagaaac gcttaaccca aaaggtcccc
      241 cttctgggaa agaccgagtg aagaaaggtg gatcctacat gtgccatagg tcttattgtt
      301 acaggtatcg ctgtgctgct cggagccaga acacacctga tagctctgct tcgaatctgg
      361 gattccgctg tgcagccgac cgcctgccca ctatggactg acaaccaagg aaagtcttcc
      421 ccagtccaag gagcagtcgt gtctgaccta cattgggctt ttctcagaac tttgaacgat
      481 cccatgcaaa gaattcccac cctgaggtgg gttacatacc tgcccaatgg ccaaaggaac
      541 cgccttgtga gaccaaattg ctgacctggg tcagtgcatg tgctttatgg tgtggtgcat
      601 ctttggagat catcgccata ttttactttt gagagtcttt aaagaggaag gggagtggag
      661 ggaaccctga gctaggcttc aggaggcccg cgtcctacgc aggctctgcc acaggggtta
      721 gaccccaggt ccgacgcttg accttcctgg gcctcaagtg ccctccccta tcaaatgaag
      781 ggatggacag catgacctct gggtgtctct ccaactcacc agttctaaaa agggtatcag
      841 attctattgt gacttcatag tgagaattta tgatagatta ttttttagct attttttcca
      901 tgtgtgaacc ttgagtgata ctaatcatgt aaagtaagag ttctcttatg tattattttc
      961 ggaagagggg tgtggtgact cctttatatt cgtactgcac tttgtttttc caaggaaatc
     1021 agtgtctttt acgttgttat gatgaatccc acatggggcc ggtgatggta tgctgcagtt
     1081 cagccgttga acacatagga atgtctgtgg ggtgactcta ctgtgcttta tcttttaaca
     1141 ttaagtgcct ttggttcaga ggggcagtca taagctctgt ttccccctct ccccaaagcc
     1201 ttcagcgaac gtgaaatgtg cgctaaacgg ggaaacctgt ttaattctag atatagggaa
     1261 aaaggaacga ggaccttgaa tgagctatat tcagggtatc cggtattttg taatagggaa
     1321 taggaaacct tgttggctgt ggaatatccg atgctttgaa tcatgcactg tgttgaataa
     1381 acgtatctgc taaatcagga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     1441 aaa
//