LOCUS BC006247 3364 bp mRNA linear HUM 02-MAR-2004 DEFINITION Homo sapiens zinc finger protein 589, mRNA (cDNA clone IMAGE:3945618), partial cds. ACCESSION BC006247 VERSION BC006247.2 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3364) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3364) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (09-APR-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT On Aug 19, 2003 this sequence version replaced BC006247.1. Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 15 Row: h Column: 3 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 7706726. FEATURES Location/Qualifiers source 1..3364 /db_xref="H-InvDB:HIT000086622" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:3945618" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene <1..3364 /gene="ZNF589" /gene_synonym="SZF1" /db_xref="GeneID:51385" CDS <1..1141 /gene="ZNF589" /gene_synonym="SZF1" /codon_start=2 /product="ZNF589 protein" /protein_id="AAH06247.2" /db_xref="GeneID:51385" /translation="THGAATSFASCVRAQMWAPREQLLGWAAEALPAKDSAWPWEEKP RYLGPVTFEDVAVLFTEAEWKRLSLEQRNLYKEVMLENLRNLVSLAESKPEVHTCPSC PLAFGSQQFLSQDELHNHPIPGFHAGNQLHPGNPCPEDQPQSQHPSDKNHRGAEAEDQ RVEGGVRPLFWSTNERGALVGFSSLFQRPPISSWGGNRILEIQLSPAQNASSEEVDRI SKRAETPGFGAVRFGECALAFNQKSNLFRQKAVTAEKSSDKRQSQVCRECGRGFSRKS QLIIHQRTHTGEKPYVCGECGRGFIVESVLRNHLSTHSGEKPYVCSHCGRGFSCKPYL IRHQRTHTREKSFMCTVCGRGFREKSELIKHQRIHTGDKPYVCRD" misc_feature 149..268 /gene="ZNF589" /gene_synonym="SZF1" /note="KRAB; Region: KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B" /db_xref="CDD:pfam01352" misc_feature 791..856 /gene="ZNF589" /gene_synonym="SZF1" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" misc_feature 1040..1108 /gene="ZNF589" /gene_synonym="SZF1" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" BASE COUNT 896 a 803 c 884 g 781 t ORIGIN 1 cacacacggt gctgctacct cgtttgcttc gtgcgtgcgt gcgcagatgt gggccccgcg 61 ggagcagcta ctgggctggg ctgcggaagc tctgcctgcc aaggattctg cctggccctg 121 ggaagagaag cctagatatc tgggaccagt gactttcgag gatgtggctg tgcttttcac 181 tgaggcagag tggaagagac tgagccttga gcagaggaac ctatacaaag aagtgatgct 241 ggaaaatctc aggaatctgg tctcattggc agaatcaaag ccagaagtcc atacctgccc 301 ttcttgccct ctggcctttg gcagtcagca gttcctcagc caagatgagc tacacaatca 361 tcctattcca ggtttccatg caggaaatca actccaccca ggaaatccct gcccagagga 421 tcagccacag tcacaacatc cttctgataa aaatcacagg ggggctgaag cagaagatca 481 acgagtggaa ggaggcgtca gacccttgtt ttggagtaca aatgaaaggg gggctttagt 541 gggtttctct agcctgttcc agagaccacc aataagctct tggggaggca acagaatatt 601 agagatacag ctcagtccag cccagaatgc aagctctgag gaagtagaca gaatttccaa 661 gagggcagaa accccagggt ttggagcagt caggtttggg gagtgtgcac tagcttttaa 721 ccagaagtca aacctgttca gacagaaggc agtcacagca gaaaaatctt cagacaaaag 781 gcagtcacag gtgtgcaggg agtgtgggcg aggctttagc aggaagtcac agctcatcat 841 acaccagagg acacacacag gagaaaagcc ttatgtctgc ggagagtgtg ggcgaggctt 901 tatagttgag tcagtcctcc gcaaccacct gagtacacac tccggggaga aaccttatgt 961 gtgcagccat tgtgggcgag gctttagctg caagccatac ctcatcagac atcagaggac 1021 acacacaagg gagaaatcgt ttatgtgcac agtgtgtggg cgaggctttc gtgaaaagtc 1081 agagctcatt aagcaccaga gaattcacac gggggataag ccttatgtgt gcagagattg 1141 aggccgaggc tttgtaagga gatcatgtct caacacacac cagaggatac attcagatga 1201 gaagcctttt gtttgcagag agtgtgggcg aggctttcgt gctaaatcaa ctctcctcct 1261 acaccagtgg acacattcag aggtgaaacc tcacgtgtgt gaggagtgtg ggcatggatt 1321 tagccagaag tcgtcgctca aatcacatcg gagaacacac tcaggggaga agccttatgt 1381 gtgtggggaa tgtgggcggg gatttagccg gaggatagtc ctcaatggac actggaggac 1441 acacacggga gagaagcctt acacgtgctt tgagtgtggg cgaaacttta gcctcaagtc 1501 cgctcttagt gtacatcaga ggatacactc tggggagaag ccttatgcat gcacggagtg 1561 tgggcaaggc tttatcacga aatcacagct catcagacac cagaggacac acacaggaga 1621 aaagccttat gtctgcggag agtgtgggcg aggctttata gctcagtcaa ccctccacta 1681 ccaccggagt acacactcca aggaaaaacc ttatgtgtgc agccagtgtg ggcgaggctt 1741 ttgtgataaa tcaactctcc tcgcacacga gcagacacat tcaggggaga agccttatgt 1801 gtgtggggaa tgtgggcggg gatttggccg gaagatactc ctcaacagac actggaggac 1861 acacacagga gagaaacctt acgcatgcat cgagtgtggg cgaaacttta gccacaagtc 1921 cactctcagc ttacatcaga ggatacactc gggggagaag ccttatgcat gcgtggagtg 1981 tgggcaaagc tttaggagaa agtcacagct catcatacac cagaagatac actcggggaa 2041 aagctttaga ggtgcaagga gtgaggatgt gattttagca acaagtcagc catcagccac 2101 accagcggaa atgcttaggg agaagccttg tttgtaaggt aatgtggaca gagctgtacg 2161 tggacatcat tacttgtcac gtgtcagagg acacactcgg gagaaacctt catggagtga 2221 gagtaaggtg ttggctggaa gtggcccctt aagagatact tggagtcaaa tctatccact 2281 gtacgcccac cccactcttg ttctaagagc tttggggaca gtcttttgac cccttacatt 2341 cctttagatg tgaagatgac agagatctaa cttctgagag cagaggtgtc aagtgacggt 2401 ccccttggag gaatggtctt tgcatctgac tacttccttc tgcaactgtg ttcttccatt 2461 agcttccatg acactctcct gctttatttt tttctacatc tctagccttt gctgtttcct 2521 ctcctacccc acctttagat tttactcaga gttcagtctc cagccctaca atctgaggga 2581 cacctttacc aggtcccctt cctaaccctc cagtcccaaa tccaagattc tttaaccaca 2641 ctctaaaagt tcttcagact caggacttaa acatagccac gccaccttgg ccttcaatga 2701 cagggatcta gcaatgctgc atcatcagcc ttccaatacc aggtttaagg gtattttaaa 2761 cacagctcct cttaaatcct ccaatctcag tacccagtgt tttagccatg ctcgggtggc 2821 taaattacat ccaggaatgg tgccagggcc tttagccatt tgtctctcct cacactccag 2881 ggcccatatg gcccaggttc tgacagtttg ccttactccc ttggcctggg gctagcccta 2941 cctgataccc tgtgtcaatg agtgtacctt ggagagctat ccactcaggc cccagtgcct 3001 ctatttgcta agggactctg ccacagaaaa gaaggggaga gatgttcatg taacctcaaa 3061 atacttaggc ttggttttga tgctagagag gaagaaggac ttggagagag agaaggaatg 3121 gctggtccag aggcttttgt ccactccctc tcactggaag tggttgatct ccagggaatc 3181 cccaaggtta gcctgcttag gggaagggct aggggtacct ggaatgtagg atctccccca 3241 tgcctggcct accaccctaa tgtgtccgga attggtgggt tcttggtctt gctgacttca 3301 agaatgaagc cgtggaccct cacggtgagt gttacaattc ttaaaaaaaa aaaaaaaaaa 3361 aaaa //