LOCUS       BC043351                2995 bp    mRNA    linear   HUM 07-OCT-2003
DEFINITION  Homo sapiens jerky homolog (mouse), mRNA (cDNA clone MGC:49934
            IMAGE:6156875), complete cds.
ACCESSION   BC043351
VERSION     BC043351.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2995)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2995)
  AUTHORS   Strausberg,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JAN-2003) National Institutes of Health, Mammalian
            Gene Collection (MGC), Cancer Genomics Office, National Cancer
            Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
            USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
            Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
            Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
            Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
            Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
            Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
            Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
            George Yang, Scott Zuyderduyn, Marco Marra.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 89 Row: p Column: 13
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 22208998.
FEATURES             Location/Qualifiers
     source          1..2995
                     /db_xref="H-InvDB:HIT000052800"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:49934 IMAGE:6156875"
                     /tissue_type="Uterus, leiomyosarcoma"
                     /clone_lib="NIH_MGC_71"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2995
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /db_xref="GeneID:8629"
                     /db_xref="MIM:603210"
     CDS             339..2045
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /codon_start=1
                     /product="JRK protein"
                     /protein_id="AAH43351.1"
                     /db_xref="GeneID:8629"
                     /db_xref="MIM:603210"
                     /translation="MASKPAAGKSRGEKRKRVVLTLKEKIDICTRLEKGESRKALMQE
                     YNVGMSTLYDIRAHKAQLLRFFASSDSNKALEQRRTLHTPKLEHLDRVLYEWFLGKRS
                     EGVPVSGPMLIEKAKDFYEQMQLTEPCVFSGGWLWRFKARHGIKKLDASSEKQSADHQ
                     AAEQFCAFFRSLAAEHGLSAEQVYNADETGLFWRCLPNPTPEGGAVPGPKQGKDRLTV
                     LMCANATGSHRLKPLAIGKCSGPRAFKGIQHLPVAYKAQGNAWVDKEIFSDWFHHIFV
                     PSVREHFRTIGLPEDSKAVLLLDSSRAHPQEAELVSSNVFTIFLPASVASLVQPMEQG
                     IRRDFMRNFINPPVPLQGPHARYNMNDAIFSVACAWNAVPSHVFRRAWRKLWPSVAFA
                     EGSSSEEELEAECFPVKPHNKSFAHILELVKEGSSCPGQLRQRQAASWGVAGREAEGG
                     RPPAATSPAEVVWSSEKTPKADQDGRGDPGEGEEVAWEQAAVAFDAVLRFAERQPCFS
                     AQEVGQLRALRAVFRSQQQVRRRRGALGAVVKVEALQEGPGGCGATAQSPLPCSSTAG
                     DN"
     misc_feature    378..536
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /note="CENP-B_N; Region: CENP-B N-terminal DNA-binding
                     domain. Centromere Protein B (CENP-B) is a DNA-binding
                     protein localized to the centromere. Within the N-terminal
                     125 residues, there is a DNA-binding region, which binds
                     to a corresponding 17bp CENP-B box sequence. CENP-B dimers
                     either bind two separate DNA molecules or alternatively,
                     they may bind two CENP-B boxes on one DNA molecule, with
                     the intervening stretch of DNA forming a loop structure.
                     The CENP-B DNA-binding domain consists of two repeating
                     domains, RP1 and RP2. This family corresponds to RP1,
                     which has been shown to consist of four helices in a
                     helix-turn-helix structure"
                     /db_xref="CDD:pfam04218"
     misc_feature    585..773
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /note="CENPB; Region: Putative DNA-binding domain in
                     centromere protein B, mouse jerky and transposases"
                     /db_xref="CDD:smart00674"
     misc_feature    822..1499
                     /gene="JRK"
                     /gene_synonym="JH8"
                     /note="DDE; Region: DDE superfamily endonuclease. This
                     family of proteins is related to pfam00665, and its members
                     are probably endonucleases of the DDE superfamily.
                     Transposase proteins are necessary for efficient DNA
                     transposition. This domain is a member of the DDE
                     superfamily, whose members contain three carboxylate
                     residues that are believed to be responsible for
                     coordinating metal ions needed for
                     catalysis. The catalytic activity of this enzyme involves
                     DNA cleavage at a specific site followed by a strand
                     transfer reaction. Interestingly this family also includes
                     the CENP-B protein. This domain in that protein appears to
                     have lost the metal binding residues and is unlikely to
                     have endonuclease activity. Centromere Protein B (CENP-B)
                     is a DNA-binding protein localised to the centromere"
                     /db_xref="CDD:pfam03184"
BASE COUNT          608 a          845 c          971 g          571 t
ORIGIN      
        1 gcaggcccct gaagtgctgt gcctggagat accattgtgg accctggaga ggcacctgct
       61 gcttatgcga gagaattggc agctgactgc actgccagga gccaaggcct aggtgtgtgg
      121 agcctggaag accagggata cctgaggcag gaggtgcagc gtgtggagtg agaaacccga
      181 gagggactgg ggcctgtgct caggatcagc agagcaggca ggcagagtgg agagggagag
      241 acggagcccc agaaggggag caggaggaga gggaagtagt ggcagcagcc caggccaggc
      301 gtgtgtcctg caagtgctag gaccagccac ccctccccat ggcctccaag ccggctgccg
      361 ggaagagcag aggggagaag cggaagaggg tggtgctgac actgaaggag aagattgaca
      421 tctgcacgcg cctggagaag ggcgagagcc ggaaggcact gatgcaggag tacaatgtgg
      481 gcatgtccac cctctacgac atcagggccc acaaggcgca gctgctccgg ttcttcgcca
      541 gctccgactc caacaaggcg ctggagcagc ggcgcacgct gcacacgccc aagctggagc
      601 acctggaccg cgtcctgtac gagtggttcc tggggaagcg ctccgagggc gtccccgtgt
      661 caggccccat gctcatcgag aaggccaagg acttctacga gcagatgcag ctcactgagc
      721 cctgcgtgtt ctccggaggg tggctttggc gctttaaggc cagacacggc attaaaaagc
      781 tagatgcatc cagtgaaaag cagtcagccg accaccaggc cgcggagcag ttctgtgcgt
      841 ttttcaggag cttggctgct gagcacgggc tgtccgccga gcaggtttac aacgctgatg
      901 agaccggcct tttctggcgg tgcctgccaa atcccactcc ggaaggcggg gctgtgcctg
      961 gccccaagca gggcaaggac cggctgaccg tgctgatgtg tgccaacgcc acgggctccc
     1021 acaggctcaa gcccttggcc atcgggaagt gcagcggtcc cagggctttc aaaggcatcc
     1081 agcacctgcc cgtcgcctat aaggcccagg ggaacgcctg ggtggacaag gagatttttt
     1141 ccgattggtt ccatcatatc tttgtgccct cggtgagaga gcacttcaga accataggtt
     1201 tgccggaaga cagcaaagcc gttctcttgc tggacagctc ccgggctcac ccgcaggagg
     1261 ccgagctggt gtccagtaac gttttcacca tcttcctgcc tgccagcgtg gcctcattgg
     1321 tgcagcccat ggagcagggc attcggagag atttcatgag gaacttcatt aaccctccgg
     1381 tccccctgca gggcccccac gcccgctaca acatgaacga tgccatattc agcgtggcct
     1441 gtgcctggaa cgcagtccct agccacgtct tcaggcgggc ctggaggaag ctgtggccgt
     1501 cggttgcgtt tgccgaaggc tcctcctctg aggaggagtt ggaggcagag tgcttcccag
     1561 tgaagcccca caacaagtcc tttgcacaca tcctggagct tgtgaaggaa ggctcctcct
     1621 gcccgggcca gcttcgccag cgccaggccg ccagctgggg ggtagcggga agggaggcag
     1681 aagggggacg gccccctgct gccacgtcgc cagcagaggt tgtgtggagt tcagaaaaga
     1741 ctccgaaagc tgaccaggac ggcagaggag atcctggtga gggcgaggag gtggcctggg
     1801 agcaggcggc cgtggccttt gacgcagtcc tgcgctttgc ggagcggcag ccatgcttca
     1861 gtgcgcagga agtggggcag ctgcgggcgc tgcgtgccgt gttccggagc cagcagcagg
     1921 tgaggaggcg gcgtggtgcc ctcggggctg tggtcaaggt tgaagccctc caggagggcc
     1981 ctggtggctg tggggccaca gctcagtctc ccttgccctg ctcatccaca gcaggtgaca
     2041 actgatggct tctctgccct gccctggcca ctggccctgt ttctccccac accctggagt
     2101 ggcatggtcc tgtgccccga ccccacctga ggcaggaggg catgtgcaga cactcaagag
     2161 cccttccagg agtgggtcgc ccacgggtgt ggctcgggtg cccaggacgg tctgtgcccg
     2221 aggttcctgt caatacaggt tttattttat cacttgccgt gtcatccgaa agtgaggaaa
     2281 tgttttggaa gggtccaccc tagcctagaa caagccagag ccgcaccctg gctggaatgg
     2341 gggccaggct gagccgatct ggtctcgtgt tcgctggctg atcattgcag tatcagaggg
     2401 tggagatgtc agtctgtcca cgtggagaga agttgccctc caggccgaca ggaggccatg
     2461 cccaccgccc ctggacaggc ttcgtctcag aaggctctat ctgctgggct ggcggccatc
     2521 cccgtgttgg gtggaccccg agcacggttg cctgaggtcc gatggcccga gagctgggac
     2581 tcagttcttg gcctgctagc ggctgaacag gccgcacatc tcacttcagt tgtggcctca
     2641 ttcagcagaa tgactctgga accatcctct gttacccgca gatcctgtcc catgggctct
     2701 ggccccaaga tgttgggggc cccacggaga gttgacttgg tagagttcct ttctgggaag
     2761 aaagtaggag tggctgacca ggccctgctc atcacccgga tagaggacac ggaccctgtg
     2821 tggtattttg gcattttggc tcagagtcca atgtaccatg ttgcccagaa tttcatactt
     2881 atggccttta catgaatacg tcttgatcag acattcagag attagtctta ggtttgcacg
     2941 taagtcactg aaaacagtaa aacaggctgc ttagatttct aaaaaaaaaa aaaaa
//