LOCUS BC005859 3368 bp mRNA linear HUM 02-MAR-2004 DEFINITION Homo sapiens zinc finger protein 589, mRNA (cDNA clone IMAGE:3533214), partial cds. ACCESSION BC005859 VERSION BC005859.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3368) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3368) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (02-APR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC005859.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 11 Row: d Column: 19 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 7706726. FEATURES Location/Qualifiers source 1..3368 /db_xref="H-InvDB:HIT000086544" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:3533214" /tissue_type="Muscle, rhabdomyosarcoma" /clone_lib="NIH_MGC_17" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..3368 /gene="ZNF589" /gene_synonym="SZF1" /db_xref="GeneID:51385" CDS <1..1120 /gene="ZNF589" /gene_synonym="SZF1" /codon_start=2 /product="ZNF589 protein" /protein_id="AAH05859.2" /db_xref="GeneID:51385" /translation="LRACVRAQMWAPREQLLGWTAEALPAKDSAWPWEEKPRYLGPVT FEDVAVLFTEAEWKRLSLEQRNLYKEVMLENLRNLVSLAESKPEVHTCPSCPLAFGSQ QFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQPQSQHPSDKNHRGAEAEDQRVEGGVR PLFWSTNERGALVGFSSLFQRPPISSWGGNRILEIQLSPAQNASSEEVDRISKRAETP GFGAVTFGECALAFNQKSNLFRQKAVTAEKSSDKRQSQVCRECGRGFSRKSQLIIHQR THTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCSHCGRGFSCKPYLIRHQRTH TREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYVCRD" misc_feature 128..247 /gene="ZNF589" /gene_synonym="SZF1" /note="KRAB; Region: KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B" /db_xref="CDD:pfam01352" misc_feature 770..835 /gene="ZNF589" /gene_synonym="SZF1" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" misc_feature 1019..1087 /gene="ZNF589" /gene_synonym="SZF1" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" BASE COUNT 919 a 795 c 879 g 775 t ORIGIN 1 gcttcgtgcg tgcgtgcgcg cgcagatgtg ggccccgcgg gagcagctac tgggctggac 61 tgcggaagct ctgcctgcca aggattctgc ctggccctgg gaagagaagc ctagatatct 121 gggaccagtg actttcgagg atgtggctgt gcttttcact gaggcagagt ggaagagact 181 gagccttgag cagaggaacc tatacaaaga agtgatgctg gaaaatctca ggaatctggt 241 ctcattggca gaatcaaagc cagaagtcca tacctgccct tcttgccctc tggcctttgg 301 cagtcagcag ttcctcagcc aagatgagct acacaatcat cctattccag gtttccatgc 361 aggaaatcaa ctccacccag gaaatccctg cccagaggat cagccacagt cacaacatcc 421 ttctgataaa aatcacaggg gggctgaagc agaagatcaa cgagtggaag gaggcgtcag 481 acccttgttt tggagtacaa atgaaagggg ggctttagtg ggtttctcta gcctgttcca 541 gagaccacca ataagctctt ggggaggcaa cagaatatta gagatacagc tcagtccagc 601 ccagaatgca agctctgagg aagtagacag aatttccaag agggcagaaa ccccagggtt 661 tggagcagtc acgtttgggg agtgtgcact agcttttaac cagaagtcaa acctgttcag 721 acagaaggca gtcacagcag aaaaatcttc agacaaaagg cagtcacagg tgtgcaggga 781 gtgtgggcga ggctttagca ggaagtcaca gctcatcata caccagagga cacacacagg 841 agaaaagcct tatgtctgcg gagagtgtgg gcgaggcttt atagttgagt cagtcctccg 901 caaccacctg agtacacact ccggggagaa accttatgtg tgcagccatt gtgggcgagg 961 ctttagctgc aagccatacc tcatcagaca tcagaggaca cacacaaggg agaaatcgtt 1021 tatgtgcaca gtgtgtgggc gaggctttcg tgaaaagtca gagctcatta agcaccagag 1081 aattcacacg ggggataagc cttatgtgtg cagagattga ggccgaggct ttgtaaggag 1141 atcatgtctc aacacacacc agaggataca ttcagatgag aagccttttg tttgcagaga 1201 gtgtgggcga ggctttcgtg ctaaatcaac tctcctccta caccagtgga cacattcaga 1261 ggtgaaacct cacgtgtgtg aggagtgtgg gcatggattt agccagaagt cgtcgctcaa 1321 atcacatcgg agaacacact caggggagaa gccttatgtg tgtggggaat gtgggcgggg 1381 atttagccgg aggatagtcc tcaatggaca ctggaggaca cacacgggag agaagcctta 1441 cacgtgcttt gagtgtgggc gaaactttag cctcaagtcc gctcttagtg tacatcagag 1501 gatacactct ggggagaagc cttatgcatg cacggagtgt gggcaaggct ttatcacgaa 1561 atcacagctc atcagacacc agaggacaca cacaggagaa aagccttatg tctgcggaga 1621 gtgtgggcga ggctttatag ctcagtcaac cctccactac caccggagta cacactccaa 1681 ggaaaaacct tatgtgtgca gccagtgtgg gcgaggcttt tgtgataaat caactctcct 1741 cgcacacgag cagacacatt caggggagaa gccttatgtg tgtggggaat gtgggcgggg 1801 atttggccgg aagatactcc tcaacagaca ctggaggaca cacacaggag agaaacctta 1861 cgcatgcatc gagtgtgggc gaaactttag ccacaagtcc actctcagct tacatcagag 1921 gatacactcg ggggagaagc cttatgcatg cgtggagtgt gggcaaagct ttaggagaaa 1981 gtcacagctc atcatacacc agaagataca ctcggggaaa agctttagag gtgcaaggag 2041 tgaggatgtg attttagcaa caagtcagcc atcagccaca ccagcggaaa tgcttaggga 2101 gaagccttgt ttgtaaggta atgtggacag agctgtacgt ggacatcatt acttgtcacg 2161 tgtcagagga cacactcggg agaaaccttc atggagtgag agtaaggtgt tggctggaag 2221 tggcccctta agagatactt ggagtcaaat ctatccactg tacgcccacc ccactcttgt 2281 tctaagagct ttggggacag tcttttgacc ccttacattc ctttagatgt gaagatgaca 2341 gagatctaac ttctgagagc agaggtgtca agtgacggtc cccttggagg aatggtcttt 2401 gcatctgact acttccttct gcaactgtgt tcttccatta gcttccatga cactctcctg 2461 ctttattttt ttctacatct ctagcctttg ctgtttcctc tcctacccca cctttagatt 2521 ttactcagag ttcagtctcc agccctacaa tctgagggac acctttacca ggtccccttc 2581 ctaaccctcc agtcccaaat ccaagattct ttaaccacac tctaaaagtt cttcagactc 2641 aggacttaaa catagccacg ccaccttggc cttcaatgac agggatctag caatgctgca 2701 tcatcagcct tccaatacca ggtttaaggg tattttaaac acagctcctc ttaaatcctc 2761 caatctcagt acccagtgtt ttagccatgc tcgggtggct aaattacatc caggaatggt 2821 gccagggcct ttagccattt gtctctcctc acactccagg gcccatatgg cccaggttct 2881 gacagtttgc cttactccct tgggctgggg ctagccctac ctgataccct gtgtcaatga 2941 gtgtaccttg gagagctatc cactcaggcc ccagtgcctc tatttgctaa gggactctgc 3001 cacagaaaag aaggggagag atgttcatgt aacctcaaaa tacttaggct tggttttgat 3061 gctagagagg aaaaaggact tggagagaga gaaggaatgg ctggtccaga ggcttttgtc 3121 cactccctct cactggaagt ggttgatctc cagggaatcc ccaaggttag cctgcttagg 3181 ggaagggcta ggggtacctg gaatgtagga tctcccccat gcctggccta ccaccctaat 3241 gtgtctggaa ttggtgggtt cttggtcttg ctgacttcaa gaatgaagcc gtggaccctc 3301 acggtgagtg ttacaattct taaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 3361 aaaaaaaa //