LOCUS       BC043351                2995 bp    mRNA    linear   HUM 07-OCT-2003
DEFINITION  Homo sapiens jerky homolog (mouse), mRNA (cDNA clone MGC:49934
            IMAGE:6156875), complete cds.
ACCESSION   BC043351
VERSION     BC043351.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2995)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2995)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee,
            Soo Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy,
            Steven Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi,
            Jacqueline Schein, Duane Smailus, Michael Smith, Lorraine Spence,
            Jeff Stott, Michael Thorne, Miranada Tsai, Natasja van den Bosch,
            Jill Vardy, George Yang, Scott Zuyderduyn, Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 89 Row: p Column: 13
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 22208998.
FEATURES             Location/Qualifiers
     source          1..2995
                     /db_xref="H-InvDB:HIT000052800"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:49934 IMAGE:6156875"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2995
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /db_xref="GeneID:8629"
                     /db_xref="MIM:603210"
     CDS             339..2045
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /codon_start=1
                     /product="JRK protein"
                     /protein_id="AAH43351.1"
                     /db_xref="GeneID:8629"
                     /db_xref="MIM:603210"
                     /translation="MASKPAAGKSRGEKRKRVVLTLKEKIDICTRLEKGESRKALMQE
                     YNVGMSTLYDIRAHKAQLLRFFASSDSNKALEQRRTLHTPKLEHLDRVLYEWFLGKRS
                     EGVPVSGPMLIEKAKDFYEQMQLTEPCVFSGGWLWRFKARHGIKKLDASSEKQSADHQ
                     AAEQFCAFFRSLAAEHGLSAEQVYNADETGLFWRCLPNPTPEGGAVPGPKQGKDRLTV
                     LMCANATGSHRLKPLAIGKCSGPRAFKGIQHLPVAYKAQGNAWVDKEIFSDWFHHIFV
                     PSVREHFRTIGLPEDSKAVLLLDSSRAHPQEAELVSSNVFTIFLPASVASLVQPMEQG
                     IRRDFMRNFINPPVPLQGPHARYNMNDAIFSVACAWNAVPSHVFRRAWRKLWPSVAFA
                     EGSSSEEELEAECFPVKPHNKSFAHILELVKEGSSCPGQLRQRQAASWGVAGREAEGG
                     RPPAATSPAEVVWSSEKTPKADQDGRGDPGEGEEVAWEQAAVAFDAVLRFAERQPCFS
                     AQEVGQLRALRAVFRSQQQVRRRRGALGAVVKVEALQEGPGGCGATAQSPLPCSSTAG
                     DN"
     misc_feature    378..536
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /note="CENP-B_N; Region: CENP-B N-terminal DNA-binding
                     domain. Centromere Protein B (CENP-B) is a DNA-binding
                     protein localized to the centromere. Within the N-terminal
                     125 residues, there is a DNA-binding region, which binds
                     to a corresponding 17bp CENP-B box sequence. CENP-B dimers
                     either bind two separate DNA molecules or alternatively,
                     they may bind two CENP-B boxes on one DNA molecule, with
                     the intervening stretch of DNA forming a loop structure.
                     The CENP-B DNA-binding domain consists of two repeating
                     domains, RP1 and RP2. This family corresponds to RP1 has
                     been shown to consist of four helices in a
                     helix-turn-helix structure"
                     /db_xref="CDD:pfam04218"
     misc_feature    585..773
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /note="CENPB; Region: Putative DNA-binding domain in
                     centromere protein B, mouse jerky and transposases"
                     /db_xref="CDD:smart00674"
     misc_feature    822..1499
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /note="DDE; Region: DDE superfamily endonuclease. This
                     family of proteins are related to pfam00665 and are
                     probably endonucleases of the DDE superfamily. Transposase
                     proteins are necessary for efficient DNA transposition.
                     This domain is a member of the DDE superfamily, which
                     contain three carboxylate residues that are believed to be
                     responsible for coordinating metal ions needed for
                     catalysis. The catalytic activity of this enzyme involves
                     DNA cleavage at a specific site followed by a strand
                     transfer reaction. Interestingly this family also includes
                     the CENP-B protein. This domain in that protein appears to
                     have lost the metal binding residues and is unlikely to
                     have endonuclease activity.
                     Centromere Protein B (CENP-B) is a DNA-binding protein
                     localised to the centromere"
                     /db_xref="CDD:pfam03184"
BASE COUNT      608 a    845 c    971 g    571 t
ORIGIN
        1 gcaggcccct gaagtgctgt gcctggagat accattgtgg accctggaga ggcacctgct
       61 gcttatgcga gagaattggc agctgactgc actgccagga gccaaggcct aggtgtgtgg
      121 agcctggaag accagggata cctgaggcag gaggtgcagc gtgtggagtg agaaacccga
      181 gagggactgg ggcctgtgct caggatcagc agagcaggca ggcagagtgg agagggagag
      241 acggagcccc agaaggggag caggaggaga gggaagtagt ggcagcagcc caggccaggc
      301 gtgtgtcctg caagtgctag gaccagccac ccctccccat ggcctccaag ccggctgccg
      361 ggaagagcag aggggagaag cggaagaggg tggtgctgac actgaaggag aagattgaca
      421 tctgcacgcg cctggagaag ggcgagagcc ggaaggcact gatgcaggag tacaatgtgg
      481 gcatgtccac cctctacgac atcagggccc acaaggcgca gctgctccgg ttcttcgcca
      541 gctccgactc caacaaggcg ctggagcagc ggcgcacgct gcacacgccc aagctggagc
      601 acctggaccg cgtcctgtac gagtggttcc tggggaagcg ctccgagggc gtccccgtgt
      661 caggccccat gctcatcgag aaggccaagg acttctacga gcagatgcag ctcactgagc
      721 cctgcgtgtt ctccggaggg tggctttggc gctttaaggc cagacacggc attaaaaagc
      781 tagatgcatc cagtgaaaag cagtcagccg accaccaggc cgcggagcag ttctgtgcgt
      841 ttttcaggag cttggctgct gagcacgggc tgtccgccga gcaggtttac aacgctgatg
      901 agaccggcct tttctggcgg tgcctgccaa atcccactcc ggaaggcggg gctgtgcctg
      961 gccccaagca gggcaaggac cggctgaccg tgctgatgtg tgccaacgcc acgggctccc
     1021 acaggctcaa gcccttggcc atcgggaagt gcagcggtcc cagggctttc aaaggcatcc
     1081 agcacctgcc cgtcgcctat aaggcccagg ggaacgcctg ggtggacaag gagatttttt
     1141 ccgattggtt ccatcatatc tttgtgccct cggtgagaga gcacttcaga accataggtt
     1201 tgccggaaga cagcaaagcc gttctcttgc tggacagctc ccgggctcac ccgcaggagg
     1261 ccgagctggt gtccagtaac gttttcacca tcttcctgcc tgccagcgtg gcctcattgg
     1321 tgcagcccat ggagcagggc attcggagag atttcatgag gaacttcatt aaccctccgg
     1381 tccccctgca gggcccccac gcccgctaca acatgaacga tgccatattc agcgtggcct
     1441 gtgcctggaa cgcagtccct agccacgtct tcaggcgggc ctggaggaag ctgtggccgt
     1501 cggttgcgtt tgccgaaggc tcctcctctg aggaggagtt ggaggcagag tgcttcccag
     1561 tgaagcccca caacaagtcc tttgcacaca tcctggagct tgtgaaggaa ggctcctcct
     1621 gcccgggcca gcttcgccag cgccaggccg ccagctgggg ggtagcggga agggaggcag
     1681 aagggggacg gccccctgct gccacgtcgc cagcagaggt tgtgtggagt tcagaaaaga
     1741 ctccgaaagc tgaccaggac ggcagaggag atcctggtga gggcgaggag gtggcctggg
     1801 agcaggcggc cgtggccttt gacgcagtcc tgcgctttgc ggagcggcag ccatgcttca
     1861 gtgcgcagga agtggggcag ctgcgggcgc tgcgtgccgt gttccggagc cagcagcagg
     1921 tgaggaggcg gcgtggtgcc ctcggggctg tggtcaaggt tgaagccctc caggagggcc
     1981 ctggtggctg tggggccaca gctcagtctc ccttgccctg ctcatccaca gcaggtgaca
     2041 actgatggct tctctgccct gccctggcca ctggccctgt ttctccccac accctggagt
     2101 ggcatggtcc tgtgccccga ccccacctga ggcaggaggg catgtgcaga cactcaagag
     2161 cccttccagg agtgggtcgc ccacgggtgt ggctcgggtg cccaggacgg tctgtgcccg
     2221 aggttcctgt caatacaggt tttattttat cacttgccgt gtcatccgaa agtgaggaaa
     2281 tgttttggaa gggtccaccc tagcctagaa caagccagag ccgcaccctg gctggaatgg
     2341 gggccaggct gagccgatct ggtctcgtgt tcgctggctg atcattgcag tatcagaggg
     2401 tggagatgtc agtctgtcca cgtggagaga agttgccctc caggccgaca ggaggccatg
     2461 cccaccgccc ctggacaggc ttcgtctcag aaggctctat ctgctgggct ggcggccatc
     2521 cccgtgttgg gtggaccccg agcacggttg cctgaggtcc gatggcccga gagctgggac
     2581 tcagttcttg gcctgctagc ggctgaacag gccgcacatc tcacttcagt tgtggcctca
     2641 ttcagcagaa tgactctgga accatcctct gttacccgca gatcctgtcc catgggctct
     2701 ggccccaaga tgttgggggc cccacggaga gttgacttgg tagagttcct ttctgggaag
     2761 aaagtaggag tggctgacca ggccctgctc atcacccgga tagaggacac ggaccctgtg
     2821 tggtattttg gcattttggc tcagagtcca atgtaccatg ttgcccagaa tttcatactt
     2881 atggccttta catgaatacg tcttgatcag acattcagag attagtctta ggtttgcacg
     2941 taagtcactg aaaacagtaa aacaggctgc ttagatttct aaaaaaaaaa aaaaa
//