LOCUS BC048798 4705 bp mRNA linear HUM 02-MAR-2004 DEFINITION Homo sapiens zinc finger protein 589, mRNA (cDNA clone IMAGE:5547444), partial cds. ACCESSION BC048798 VERSION BC048798.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 4705) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 4705) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (14-MAR-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 93 Row: o Column: 21 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 7706726. FEATURES Location/Qualifiers source 1..4705 /db_xref="H-InvDB:HIT000099013" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:5547444" /tissue_type="Eye, retinoblastoma" /clone_lib="NIH_MGC_67" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene <1..4705 /gene="ZNF589" /gene_synonym="SZF1" /db_xref="GeneID:51385" CDS <1..1138 /gene="ZNF589" /gene_synonym="SZF1" /codon_start=2 /product="ZNF589 protein" /protein_id="AAH48798.1" /db_xref="GeneID:51385" /translation="VLLPRLLRACVRAQMWAPREQLLGWTAEALPAKDSAWPWEEKPR YLGPVTFEDVAVLFTEAEWKRLSLEQRNLYKEVMLENLRNLVSLAESKPEVHTCPSCP LAFGSQQFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQPQSQHPSDKNHRGAEAEDQR VEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEIQLSPAQNASSEEVDRIS KRAETPGFGAVTFGECALAFNQKSNLFRQKAVTAEKSSDKRQSQVCRECGRGFSRKSQ LIIHQRTHTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCSHCGRGFSCKPYLI RHQRTHTREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYVCRD" misc_feature 146..265 /gene="ZNF589" /gene_synonym="SZF1" /note="KRAB; Region: KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B" /db_xref="CDD:pfam01352" misc_feature 788..853 /gene="ZNF589" /gene_synonym="SZF1" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" misc_feature 1037..1105 /gene="ZNF589" /gene_synonym="SZF1" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" BASE COUNT 1215 a 1084 c 1230 g 1176 t ORIGIN 1 ggtgctgcta cctcgtttgc ttcgtgcgtg cgtgcgcgcg cagatgtggg ccccgcggga 61 gcagctactg ggctggactg cggaagctct gcctgccaag gattctgcct ggccctggga 121 agagaagcct agatatctgg gaccagtgac tttcgaggat gtggctgtgc ttttcactga 181 ggcagagtgg aagagactga gccttgagca gaggaaccta tacaaagaag tgatgctgga 241 aaatctcagg aatctggtct cattggcaga atcaaagcca gaagtccata cctgcccttc 301 ttgccctctg gcctttggca gtcagcagtt cctcagccaa gatgagctac acaatcatcc 361 tattccaggt ttccatgcag gaaatcaact ccacccagga aatccctgcc cagaggatca 421 gccacagtca caacatcctt ctgataaaaa tcacaggggg gctgaagcag aagatcaacg 481 agtggaagga ggcgtcagac ccttgttttg gagtacaaat gaaagggggg ctttagtggg 541 tttctctagc ctgttccaga gaccaccaat aagctcttgg ggaggcaaca gaatattaga 601 gatacagctc agtccagccc agaatgcaag ctctgaggaa gtagacagaa tttccaagag 661 ggcagaaacc ccagggtttg gagcagtcac gtttggggag tgtgcactag cttttaacca 721 gaagtcaaac ctgttcagac agaaggcagt cacagcagaa aaatcttcag acaaaaggca 781 gtcacaggtg tgcagggagt gtgggcgagg ctttagcagg aagtcacagc tcatcataca 841 ccagaggaca cacacaggag aaaagcctta tgtctgcgga gagtgtgggc gaggctttat 901 agttgagtca gtcctccgca accacctgag tacacactcc ggggagaaac cttatgtgtg 961 cagccattgt gggcgaggct ttagctgcaa gccatacctc atcagacatc agaggacaca 1021 cacaagggag aaatcgttta tgtgcacagt gtgtgggcga ggctttcgtg aaaagtcaga 1081 gctcattaag caccagagaa ttcacacggg ggataagcct tatgtgtgca gagattgagg 1141 ccgaggcttt gtaaggagat catgtctcaa cacacaccag aggatacatt cagatgagaa 1201 gccttttgtt tgcagagagt gtgggcgagg ctttcgtgct aaatcaactc tcctcctaca 1261 ccagtggaca cattcagagg tgaaacctca cgtgtgtgag gagtgtgggc atggatttag 1321 ccagaagtcg tcgctcaaat cacatcggag aacacactca ggggagaagc cttatgtgtg 1381 tggggaatgt gggcggggat ttagccggag gatagtcctc aatggacact ggaggacaca 1441 cacgggagag aagccttaca cgtgctttga gtgtgggcga aactttagcc tcaagtccgc 1501 tcttagtgta catcagagga tacactctgg ggagaagcct tatgcatgca cggagtgtgg 1561 gcaaggcttt atcacgaaat cacagctcat cagacaccag aggacacaca caggagaaaa 1621 gccttatgtc tgcggagagt gtgggcgagg ctttatagct cagtcaaccc tccactacca 1681 ccggagtaca cactccaagg aaaaacctta tgtgtgcagc cagtgtgggc gaggcttttg 1741 tgataaatca actctcctcg cacacgagca gacacattca ggggagaagc cttatgtgtg 1801 tggggaatgt gggcggggat ttggccggaa gatactcctc aacagacact ggaggacaca 1861 cacaggagag aaaccttacg catgcatcga gtgtgggcga aactttagcc acaagtccac 1921 tctcagctta catcagagga tacactcggg ggagaagcct tatgcatgcg tggagtgtgg 1981 gcaaagcttt aggagaaagt cacagctcat catacaccag aagatacact cggggaaaag 2041 ctttagaggt gcaaggagtg aggatgtgat tttagcaaca agtcagccat cagccacacc 2101 agcggaaatg cttagggaga agccttgttt gtaaggtaat gtggacagag ctgtacgtgg 2161 acatcattac ttgtcacgtg tcagaggaca cactcgggag aaaccttcat ggagtgagag 2221 taaggtgttg gctggaagtg gccccttaag agatacttgg agtcaaatct atccactgta 2281 cgcccacccc actcttgttc taagagcttt ggggacagtc ttttgacccc ttacattcct 2341 ttagatgtga agatgacaga gatctaactt ctgagagcag aggtgtcaag tgacggtccc 2401 cttggaggaa tggtctttgc atctgactac ttccttctgc aactgtgttc ttccattagc 2461 ttccatgaca ctctcctgct ttattttttt ctacatctct agcctttgct gtttcctctc 2521 ctaccccacc tttagatttt actcagagtt cagtctccag ccctacaatc tgagggacac 2581 ctttaccagg tccccttcct aaccctccag tcccaaatcc aagattcttt aaccacactc 2641 taaaagttct tcagactcag gacttaaaca tagccacgcc accttggcct tcaatgacag 2701 ggatctagca atgctgcatc atcagccttc caataccagg tttaagggta ttttaaacac 2761 agctcctctt aaatcctcca atctcagtac ccagtgtttt agccatgctc gggtggctaa 2821 attacatcca ggaatggtgc cagggccttt agccatttgt ctctcctcac actccagggc 2881 ccatatggcc caggttctga cagtttgcct tactcccttg ggctggggct agccctacct 2941 gataccctgt gtcaatgagt gtaccttgga gagctatcca ctcaggcccc agtgcctcta 3001 tttgctaagg gactctgcca cagaaaagaa ggggagagat gttcatgtaa cctcaaaata 3061 cttaggcttg gttttgatgc tagagaggaa aaaggacttg gagagagaga aggaatggct 3121 ggtccagagg cttttgtcca ctccctctca ctggaagtgg ttgatctcca gggaatcccc 3181 aaggttagcc tgcttagggg aagggctagg ggtacctgga atgtaggatc tcccccatgc 3241 ctggcctacc accctaatgt gtctggaatt ggtgggttct tggtcttgct gacttcaaga 3301 atgaagccgt ggaccctcac gttctggaac tgggaagtcc aagatcaggg ctcctgcccc 3361 tggacctgaa cctggatctt tggctttgat gttggacctt gtgtgtggat atgaactttg 3421 aacctgcacc ttggaactaa acctctgact tggatttgtt ctgcattttg gaactcacct 3481 tagacctgaa ccttggtcat gaaatggatc tgaatcattc ttgggtcttg gacatgggac 3541 ttggacctcc atgtgtatct tggacttagg accatgatct ggaccttgta aattgacttg 3601 gactttggac ttggaccttg gactagaatt tgtccctggc tgtggacctc gtctttggat 3661 ctggatctgg acctggtctt ggatgttgga cctagatgca ggccttgatt tgatttggaa 3721 catggacttg gtccttgcac atggcccaac acccgcgtct ttggaactgc atcttggacc 3781 ttggatctga aatttaaaca gggaggtgac cttcatttct tgtatttgac tttataactg 3841 gaaattcaat ctctacctgg atcttggacc tggatctcag accttgcttt ttattgggcc 3901 atggatctta taccttaaac atggaaattg tacctggatt tggacctttg acttggacct 3961 ggatctggac ccagggataa gatttggacc tcacttgcaa attttggtaa tgaaattttc 4021 ctttgctcct gtagtccttg gacctaggct tcagtcatgg atttagacct tagacgtgac 4081 catcagacct tagagttgaa cctaaattcc tggtccccag tcctataact cagaatgatt 4141 atgcataacc ttggacttag aacttagatc tggatcttag agctgatact tgacctggta 4201 ctggatattt aaccttggac ttcagacatg gatttaatct gggccttggg atgatttaag 4261 atctagaagt ggacctagaa ttagaccttg gacttatgtg cctgagcctg gactttggaa 4321 gtggactttg ggccttgtaa ctcaaccttg aatatgagcc tcaaatgtag actgtgggcc 4381 aggcacggtg gctcacatct gtaatcccag cactttgggt ggaaagctga ggcaggtgga 4441 tcacttgagc tcaggaattc gggaccagcc tggggaacat ggtgaaaacc tggctctaca 4501 aaaaataaaa aagttagcca ggcttggtgg cgcacgcctg tggttccggg tactcgggag 4561 gctgaggcag gaggttcact tgggcccggg aggcagaggt tgcagtgagc tgagcttgtg 4621 cagctgcact ccagcctgga ggacagagtg agactctgtc tcaagaaaaa aaaataaaaa 4681 ataaataaat aaaaaaaaaa aaaaa //