LOCUS BC064604 3298 bp mRNA linear HUM 06-JAN-2004 DEFINITION Homo sapiens zinc finger protein 1 homolog (mouse), mRNA (cDNA clone IMAGE:5784831), partial cds. ACCESSION BC064604 VERSION BC064604.1 KEYWORDS . SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 3298) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 3298) AUTHORS Strausberg,R. TITLE Direct Submission JOURNAL Submitted (22-DEC-2003) National Institutes of Health, Mammalian Gene Collection (MGC), Cancer Genomics Office, National Cancer Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: National Institutes of Health Intramural Sequencing Center (NISC), Gaithersburg, Maryland; Web site: http://www.nisc.nih.gov/ Contact: nisc_mgc@nhgri.nih.gov Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B., Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S., Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P., Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R., Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C., McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W., Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L., Young,A., Zhang,L.-H. and Green,E.D. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 139 Row: p Column: 13 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 24233531. FEATURES Location/Qualifiers source 1..3298 /db_xref="H-InvDB:HIT000261523" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="IMAGE:5784831" /tissue_type="Uterus, leiomyosarcoma" /clone_lib="NIH_MGC_71" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene <1..3298 /gene="ZFP1" /gene_synonym="FLJ34243" /gene_synonym="ZNF475" /db_xref="GeneID:162239" CDS <1..1348 /gene="ZFP1" /gene_synonym="FLJ34243" /gene_synonym="ZNF475" /codon_start=2 /product="ZFP1 protein" /protein_id="AAH64604.1" /db_xref="GeneID:162239" /translation="AALRRDPLWHWATSGGCAPPGTRGASSSAFIVLCLCPKLQKMNK SQGSVSFTDVTVDFTQEEWEQLDPSQRILYMDVMLENYSNLLSVEVWKADDQMERDHR NPDEQARQFLILKNQTPIEERGDLFGKALNLNTDFVSLRQVPYKYDLYEKTLKYNSDL LNSNRSYAGKQTDECNEFGKALLYLKQEKTHSGVEYSEYNKSGKALSHKAAIFKHQKI KNLVQPFICTYCDKAFSFKSLLISHKRIHTGEKPYECNVCKKTFSHKANLIKHQRIHT GEKPFECPECGKAFTHQSNLIVHQRAHMEKKPYECSECGKTFAQKFELTTHQRIHTGE RPYECNECAKTFFKKSNLIIHQKIHTGEKRYECSECGKSFIQNSQLIIHMRTHTGEKP YECTECGKTFSQRSTLRLHLRIHTGEKPYECSECGKAFSRKSRLSVHQRVHIGEKP" misc_feature 146..268 /gene="ZFP1" /gene_synonym="FLJ34243" /gene_synonym="ZNF475" /note="KRAB; Region: KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B" /db_xref="CDD:pfam01352" misc_feature 758..826 /gene="ZFP1" /gene_synonym="FLJ34243" /gene_synonym="ZNF475" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" misc_feature 842..910 /gene="ZFP1" /gene_synonym="FLJ34243" /gene_synonym="ZNF475" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" misc_feature 1094..1162 /gene="ZFP1" /gene_synonym="FLJ34243" /gene_synonym="ZNF475" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" misc_feature 1178..1246 /gene="ZFP1" /gene_synonym="FLJ34243" /gene_synonym="ZNF475" /note="zf-C2H2; Region: Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers" /db_xref="CDD:pfam00096" BASE COUNT 1104 a 629 c 628 g 937 t ORIGIN 1 cgccgccctg cgccgtgacc cgctgtggca ctgggccacg agtggaggct gcgcccctcc 61 aggtacgaga ggggcttcca gttctgcctt catagttctc tgcctttgcc caaaactgca 121 gaaaatgaac aaatcccagg gatcagtttc attcacggat gtgactgtgg actttaccca 181 ggaggaatgg gaacaactgg acccctctca gaggatccta tacatggatg tgatgctgga 241 gaattatagc aacttacttt cagtggaagt gtggaaggct gatgaccaga tggagagaga 301 ccacagaaac ccagacgagc aggcgaggca atttttaatt cttaagaacc aaaccccaat 361 tgaggaaaga ggcgatctct ttggaaaagc acttaatctg aacacagact ttgtttcttt 421 aagacaagta ccttataaat atgacttata tgaaaaaact ttgaaatata attcagactt 481 gcttaatagt aatagaagct atgcaggaaa gcagactgat gagtgtaatg aatttggaaa 541 agcacttctc tacctgaagc aagagaaaac ccacagtgga gtagaatatt ctgaatacaa 601 taaaagtgga aaagccctca gccataaagc agccattttt aaacatcaga aaataaaaaa 661 cttggttcaa cctttcattt gtacttactg tgacaaggct ttctccttta agtcactcct 721 cattagtcat aagagaatac atactggaga aaagccatat gaatgcaatg tatgtaagaa 781 aaccttctcc cataaggcca acctcatcaa acatcagaga attcacactg gggagaaacc 841 tttcgagtgt ccggaatgtg gaaaagcttt cacccaccag tcaaacctca ttgtacacca 901 gagagcacat atggagaaga agccctatga gtgcagtgaa tgtggaaaga catttgccca 961 aaagtttgaa ctcaccacac accagagaat tcatacagga gagcgaccct atgagtgtaa 1021 cgaatgtgca aaaaccttct ttaagaagtc aaaccttatc atacatcaga agattcacac 1081 gggggagaaa cgctatgagt gcagtgaatg tggaaaatcc tttatccaga actcacagct 1141 catcatacac atgagaactc atacaggaga gaaaccctat gaatgtactg agtgcggcaa 1201 aactttcagc cagaggtcaa ctcttagatt acacttgcga atccacacag gagagaaacc 1261 atatgagtgt tccgaatgtg ggaaggcctt tagcaggaag tcccgactca gtgtccatca 1321 gagagttcac atcggggaga aaccctgaaa ctccagccag gtcttactgt ggaaaactcc 1381 tgccagaact cttcaagcgg gtgaaaaacc tcatgacagt attgagggaa catgggaatt 1441 catactgaga tgcaatctct caactcaaaa atgtattaaa aataggatcc catgagaaca 1501 ttatactgga agttacatgt gatacccagc taaagaatac atatcagaat atatccagct 1561 gtaaatagcc atacccagtt gtactacaat gagcttttta aaatcttgaa tgaattgagt 1621 attctagaca gcagcataag aaatatacaa acagtatcct atgaatttag tggttttatg 1681 tggatatgcc acaaaacaca agtggtatgc tttagatctg cacatgtgaa tttagcataa 1741 tcactgggaa tttgaaattc ttgtcctctg tgataattag gacataacaa gattttccat 1801 actaaacatc acatgtgttt ttcagtatct ctgcaatcca gtatgcattc caaatgaaat 1861 gtgtaacatt taaagtagca tggaacctac gtctttccct actgcttgct ggcatatatc 1921 agttggtatg aacattgttc ttgagctgcc tcttaggaaa cagaagacag tgttgaagtt 1981 ttccctgctt ctggtttgct gaagattttc tagactgctt ctggtttgct gaagattttc 2041 tagaagcact atttacacaa atatgcaaat gtgaaataac cgatctataa cgtgaaggag 2101 gcagacatga gctgtactac tcagtatgca ccacagaata agagtttgcc gtgtaaagac 2161 aatatcccca ttcgtcatgc tcttattttc ccgtgggata tttgcataca aatgcatgtc 2221 tgttaccaaa atattgtgta acacagacag aaaccacctg tttttgtctt tccttgtttc 2281 ccttaatatt tcatgaattg tctagcaaaa atggtaggat gcttctgtag ttcacaaatg 2341 ttacatttca gagactttag aggaaaaatt attttaaata actgtcaact gtttcattgc 2401 tttttaaatt tttcacgtgc ataaccccct ttagaagtaa atttttacac tattttgttg 2461 tttaaaggag gcatttctac ttccttgagt tttgctgttg ccaacctaaa acatttccct 2521 ttggaacatg agttataagt tattactttt cctttacatg tttacacttt tataaaaatc 2581 agatttttca gtggtctctc cagatattaa caagaattgt tgtgtaactc aaagatttgc 2641 ttttatagac ttgatttcaa attcttaagt tcagcctttc cctatcaatc acaaatcatg 2701 tattggaaat tataaatggc aaccaaaaac tggcttttaa aaaatatttt tggatatcat 2761 tgatgtgctg tcacactata tattgagtga ctttctgaac aaatttatcc agaacatcca 2821 tagcccaata agcttttgtc taagttgtca tcttacttac tcacaaagga ggggaaaacg 2881 ttattttcat agctgctttt agaaatgtag attgtaacag cctccttgga caacattgtg 2941 agtctgtttt aacattaata tactcttaca acctaggaat ctcctgggaa tctatcccta 3001 aggaataaaa agcgccagca cttaagatta tatgtttact aatgtatatt gtattttttg 3061 gtaaggacca aaaaaacaaa caaaaaaaag accatcagta agggaatgga tgaataaatt 3121 gtggtaaaaa cataccatgg ctgaatggta tttccttgaa aagaatgtgt gggaacaaac 3181 tccaaacttc tctaggtcaa atgaggaaag caagagggga gagagagagt gtatgtgtgt 3241 gtgtagcagt ttttataaaa taaagataaa cccttataaa aaaaaaaaaa aaaaaaaa //