LOCUS       BC006247                3364 bp    mRNA    linear   HUM 02-MAR-2004
DEFINITION  Homo sapiens zinc finger protein 589, mRNA (cDNA clone
            IMAGE:3945618), partial cds.
ACCESSION   BC006247
VERSION     BC006247.2
KEYWORDS    .
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3364)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3364)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-APR-2001) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     On Aug 19, 2003 this sequence version replaced BC006247.1.
            Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: DCTD/DTP
            cDNA Library Preparation: Rubin Laboratory
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAL Plate: 15 Row: h Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 7706726.
FEATURES             Location/Qualifiers
     source          1..3364
                     /db_xref="H-InvDB:HIT000086622"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:3945618"
                     /tissue_type="Lung, small cell carcinoma"
                     /clone_lib="NIH_MGC_7"
                     /lab_host="DH10B-R"
                     /note="Vector: pOTB7"
     gene            <1..3364
                     /gene="ZNF589"
                     /gene_synonym="SZF1"
                     /db_xref="GeneID:51385"
     CDS             <1..1141
                     /gene="ZNF589"
                     /gene_synonym="SZF1"
                     /codon_start=2
                     /product="ZNF589 protein"
                     /protein_id="AAH06247.2"
                     /db_xref="GeneID:51385"
                     /translation="THGAATSFASCVRAQMWAPREQLLGWAAEALPAKDSAWPWEEKP
                     RYLGPVTFEDVAVLFTEAEWKRLSLEQRNLYKEVMLENLRNLVSLAESKPEVHTCPSC
                     PLAFGSQQFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQPQSQHPSDKNHRGAEAEDQ
                     RVEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEIQLSPAQNASSEEVDRI
                     SKRAETPGFGAVRFGECALAFNQKSNLFRQKAVTAEKSSDKRQSQVCRECGRGFSRKS
                     QLIIHQRTHTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCSHCGRGFSCKPYL
                     IRHQRTHTREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYVCRD"
     misc_feature    149..268
                     /gene="ZNF589"
                     /gene_synonym="SZF1"
                     /note="KRAB; Region: KRAB box. The KRAB domain (or
                     Kruppel-associated box) is present in about a third of
                     zinc finger proteins containing C2H2 fingers. The KRAB
                     domain is found to be involved in protein-protein
                     interactions. The KRAB domain is generally encoded by two
                     exons. The regions coded by the two exons are known as
                     KRAB-A and KRAB-B"
                     /db_xref="CDD:pfam01352"
     misc_feature    791..856
                     /gene="ZNF589"
                     /gene_synonym="SZF1"
                     /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2
                     zinc finger is the classical zinc finger domain. The two
                     conserved cysteines and histidines co-ordinate a zinc ion.
                     The following pattern describes the zinc finger.
                     #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
                     any amino acid, and numbers in brackets indicate the
                     number of residues. The positions marked # are those that
                     are important for the stable fold of the zinc finger. The
                     final position can be either his or cys. The C2H2 zinc
                     finger is composed of two short beta strands followed by
                     an alpha helix. The amino terminal part of the helix binds
                     the major groove in DNA binding zinc fingers"
                     /db_xref="CDD:pfam00096"
     misc_feature    1040..1108
                     /gene="ZNF589"
                     /gene_synonym="SZF1"
                     /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2
                     zinc finger is the classical zinc finger domain. The two
                     conserved cysteines and histidines co-ordinate a zinc ion.
                     The following pattern describes the zinc finger.
                     #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be
                     any amino acid, and numbers in brackets indicate the
                     number of residues. The positions marked # are those that
                     are important for the stable fold of the zinc finger. The
                     final position can be either his or cys. The C2H2 zinc
                     finger is composed of two short beta strands followed by
                     an alpha helix. The amino terminal part of the helix binds
                     the major groove in DNA binding zinc fingers"
                     /db_xref="CDD:pfam00096"
BASE COUNT          896 a          803 c          884 g          781 t
ORIGIN      
        1 cacacacggt gctgctacct cgtttgcttc gtgcgtgcgt gcgcagatgt gggccccgcg
       61 ggagcagcta ctgggctggg ctgcggaagc tctgcctgcc aaggattctg cctggccctg
      121 ggaagagaag cctagatatc tgggaccagt gactttcgag gatgtggctg tgcttttcac
      181 tgaggcagag tggaagagac tgagccttga gcagaggaac ctatacaaag aagtgatgct
      241 ggaaaatctc aggaatctgg tctcattggc agaatcaaag ccagaagtcc atacctgccc
      301 ttcttgccct ctggcctttg gcagtcagca gttcctcagc caagatgagc tacacaatca
      361 tcctattcca ggtttccatg caggaaatca actccaccca ggaaatccct gcccagagga
      421 tcagccacag tcacaacatc cttctgataa aaatcacagg ggggctgaag cagaagatca
      481 acgagtggaa ggaggcgtca gacccttgtt ttggagtaca aatgaaaggg gggctttagt
      541 gggtttctct agcctgttcc agagaccacc aataagctct tggggaggca acagaatatt
      601 agagatacag ctcagtccag cccagaatgc aagctctgag gaagtagaca gaatttccaa
      661 gagggcagaa accccagggt ttggagcagt caggtttggg gagtgtgcac tagcttttaa
      721 ccagaagtca aacctgttca gacagaaggc agtcacagca gaaaaatctt cagacaaaag
      781 gcagtcacag gtgtgcaggg agtgtgggcg aggctttagc aggaagtcac agctcatcat
      841 acaccagagg acacacacag gagaaaagcc ttatgtctgc ggagagtgtg ggcgaggctt
      901 tatagttgag tcagtcctcc gcaaccacct gagtacacac tccggggaga aaccttatgt
      961 gtgcagccat tgtgggcgag gctttagctg caagccatac ctcatcagac atcagaggac
     1021 acacacaagg gagaaatcgt ttatgtgcac agtgtgtggg cgaggctttc gtgaaaagtc
     1081 agagctcatt aagcaccaga gaattcacac gggggataag ccttatgtgt gcagagattg
     1141 aggccgaggc tttgtaagga gatcatgtct caacacacac cagaggatac attcagatga
     1201 gaagcctttt gtttgcagag agtgtgggcg aggctttcgt gctaaatcaa ctctcctcct
     1261 acaccagtgg acacattcag aggtgaaacc tcacgtgtgt gaggagtgtg ggcatggatt
     1321 tagccagaag tcgtcgctca aatcacatcg gagaacacac tcaggggaga agccttatgt
     1381 gtgtggggaa tgtgggcggg gatttagccg gaggatagtc ctcaatggac actggaggac
     1441 acacacggga gagaagcctt acacgtgctt tgagtgtggg cgaaacttta gcctcaagtc
     1501 cgctcttagt gtacatcaga ggatacactc tggggagaag ccttatgcat gcacggagtg
     1561 tgggcaaggc tttatcacga aatcacagct catcagacac cagaggacac acacaggaga
     1621 aaagccttat gtctgcggag agtgtgggcg aggctttata gctcagtcaa ccctccacta
     1681 ccaccggagt acacactcca aggaaaaacc ttatgtgtgc agccagtgtg ggcgaggctt
     1741 ttgtgataaa tcaactctcc tcgcacacga gcagacacat tcaggggaga agccttatgt
     1801 gtgtggggaa tgtgggcggg gatttggccg gaagatactc ctcaacagac actggaggac
     1861 acacacagga gagaaacctt acgcatgcat cgagtgtggg cgaaacttta gccacaagtc
     1921 cactctcagc ttacatcaga ggatacactc gggggagaag ccttatgcat gcgtggagtg
     1981 tgggcaaagc tttaggagaa agtcacagct catcatacac cagaagatac actcggggaa
     2041 aagctttaga ggtgcaagga gtgaggatgt gattttagca acaagtcagc catcagccac
     2101 accagcggaa atgcttaggg agaagccttg tttgtaaggt aatgtggaca gagctgtacg
     2161 tggacatcat tacttgtcac gtgtcagagg acacactcgg gagaaacctt catggagtga
     2221 gagtaaggtg ttggctggaa gtggcccctt aagagatact tggagtcaaa tctatccact
     2281 gtacgcccac cccactcttg ttctaagagc tttggggaca gtcttttgac cccttacatt
     2341 cctttagatg tgaagatgac agagatctaa cttctgagag cagaggtgtc aagtgacggt
     2401 ccccttggag gaatggtctt tgcatctgac tacttccttc tgcaactgtg ttcttccatt
     2461 agcttccatg acactctcct gctttatttt tttctacatc tctagccttt gctgtttcct
     2521 ctcctacccc acctttagat tttactcaga gttcagtctc cagccctaca atctgaggga
     2581 cacctttacc aggtcccctt cctaaccctc cagtcccaaa tccaagattc tttaaccaca
     2641 ctctaaaagt tcttcagact caggacttaa acatagccac gccaccttgg ccttcaatga
     2701 cagggatcta gcaatgctgc atcatcagcc ttccaatacc aggtttaagg gtattttaaa
     2761 cacagctcct cttaaatcct ccaatctcag tacccagtgt tttagccatg ctcgggtggc
     2821 taaattacat ccaggaatgg tgccagggcc tttagccatt tgtctctcct cacactccag
     2881 ggcccatatg gcccaggttc tgacagtttg ccttactccc ttggcctggg gctagcccta
     2941 cctgataccc tgtgtcaatg agtgtacctt ggagagctat ccactcaggc cccagtgcct
     3001 ctatttgcta agggactctg ccacagaaaa gaaggggaga gatgttcatg taacctcaaa
     3061 atacttaggc ttggttttga tgctagagag gaagaaggac ttggagagag agaaggaatg
     3121 gctggtccag aggcttttgt ccactccctc tcactggaag tggttgatct ccagggaatc
     3181 cccaaggtta gcctgcttag gggaagggct aggggtacct ggaatgtagg atctccccca
     3241 tgcctggcct accaccctaa tgtgtccgga attggtgggt tcttggtctt gctgacttca
     3301 agaatgaagc cgtggaccct cacggtgagt gttacaattc ttaaaaaaaa aaaaaaaaaa
     3361 aaaa
//