LOCUS BC043351 2995 bp mRNA linear HUM 07-OCT-2003
DEFINITION Homo sapiens jerky homolog (mouse), mRNA (cDNA clone MGC:49934
IMAGE:6156875), complete cds.
ACCESSION BC043351
VERSION BC043351.1
KEYWORDS MGC.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
Catarrhini; Hominidae; Homo.
REFERENCE 1 (bases 1 to 2995)
AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
TITLE Generation and initial analysis of more than 15,000 full-length
human and mouse cDNA sequences
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
PUBMED 12477932
REFERENCE 2 (bases 1 to 2995)
AUTHORS Strausberg,R.
TITLE Direct Submission
JOURNAL Submitted (09-JAN-2003) National Institutes of Health, Mammalian
Gene Collection (MGC), Cancer Genomics Office, National Cancer
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590,
USA
REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT Contact: MGC help desk
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: ATCC
cDNA Library Preparation: Life Technologies, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Genome Sequence Centre,
BC Cancer Agency, Vancouver, BC, Canada
info@bcgsc.bc.ca
Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield,
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin,
Letticia Hsiao, Martin Krzywinski, Reta Kutsche, Oliver Lee, Soo
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott,
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy,
George Yang, Scott Zuyderduyn, Marco Marra.
Clone distribution: MGC clone distribution information can be found
through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
Series: IRAK Plate: 89 Row: p Column: 13
This clone was selected for full length sequencing because it
passed the following selection criteria: matched mRNA gi: 22208998.
FEATURES Location/Qualifiers
source 1..2995
/db_xref="H-InvDB:HIT000052800"
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="MGC:49934 IMAGE:6156875"
/tissue_type="Uterus, leiomyosarcoma"
/clone_lib="NIH_MGC_71"
/lab_host="DH10B"
/note="Vector: pCMV-SPORT6"
gene 1..2995
/gene="JRK"
/gene_synonym="JH8"
/db_xref="GeneID:8629"
/db_xref="MIM:603210"
CDS 339..2045
/gene="JRK"
/gene_synonym="JH8"
/codon_start=1
/product="JRK protein"
/protein_id="AAH43351.1"
/db_xref="GeneID:8629"
/db_xref="MIM:603210"
/translation="MASKPAAGKSRGEKRKRVVLTLKEKIDICTRLEKGESRKALMQE
YNVGMSTLYDIRAHKAQLLRFFASSDSNKALEQRRTLHTPKLEHLDRVLYEWFLGKRS
EGVPVSGPMLIEKAKDFYEQMQLTEPCVFSGGWLWRFKARHGIKKLDASSEKQSADHQ
AAEQFCAFFRSLAAEHGLSAEQVYNADETGLFWRCLPNPTPEGGAVPGPKQGKDRLTV
LMCANATGSHRLKPLAIGKCSGPRAFKGIQHLPVAYKAQGNAWVDKEIFSDWFHHIFV
PSVREHFRTIGLPEDSKAVLLLDSSRAHPQEAELVSSNVFTIFLPASVASLVQPMEQG
IRRDFMRNFINPPVPLQGPHARYNMNDAIFSVACAWNAVPSHVFRRAWRKLWPSVAFA
EGSSSEEELEAECFPVKPHNKSFAHILELVKEGSSCPGQLRQRQAASWGVAGREAEGG
RPPAATSPAEVVWSSEKTPKADQDGRGDPGEGEEVAWEQAAVAFDAVLRFAERQPCFS
AQEVGQLRALRAVFRSQQQVRRRRGALGAVVKVEALQEGPGGCGATAQSPLPCSSTAG
DN"
misc_feature 378..536
/gene="JRK"
/gene_synonym="JH8"
/note="CENP-B_N; Region: CENP-B N-terminal DNA-binding
domain. Centromere Protein B (CENP-B) is a DNA-binding
protein localized to the centromere. Within the N-terminal
125 residues, there is a DNA-binding region, which binds
to a corresponding 17bp CENP-B box sequence. CENP-B dimers
either bind two separate DNA molecules or alternatively,
they may bind two CENP-B boxes on one DNA molecule, with
the intervening stretch of DNA forming a loop structure.
The CENP-B DNA-binding domain consists of two repeating
domains, RP1 and RP2. This family corresponds to RP1 has
been shown to consist of four helices in a
helix-turn-helix structure"
/db_xref="CDD:pfam04218"
misc_feature 585..773
/gene="JRK"
/gene_synonym="JH8"
/note="CENPB; Region: Putative DNA-binding domain in
centromere protein B, mouse jerky and transposases"
/db_xref="CDD:smart00674"
misc_feature 822..1499
/gene="JRK"
/gene_synonym="JH8"
/note="DDE; Region: DDE superfamily endonuclease. This
family of proteins are related to pfam00665 and are
probably endonucleases of the DDE superfamily. Transposase
proteins are necessary for efficient DNA transposition.
This domain is a member of the DDE superfamily, which
contain three carboxylate residues that are believed to be
responsible for coordinating metal ions needed for
catalysis. The catalytic activity of this enzyme involves
DNA cleavage at a specific site followed by a strand
transfer reaction. Interestingly this family also includes
the CENP-B protein. This domain in that protein appears to
have lost the metal binding residues and is unlikely to
have endonuclease activity. Centromere Protein B (CENP-B)
is a DNA-binding protein localised to the centromere"
/db_xref="CDD:pfam03184"
BASE COUNT 608 a 845 c 971 g 571 t
ORIGIN
1 gcaggcccct gaagtgctgt gcctggagat accattgtgg accctggaga ggcacctgct
61 gcttatgcga gagaattggc agctgactgc actgccagga gccaaggcct aggtgtgtgg
121 agcctggaag accagggata cctgaggcag gaggtgcagc gtgtggagtg agaaacccga
181 gagggactgg ggcctgtgct caggatcagc agagcaggca ggcagagtgg agagggagag
241 acggagcccc agaaggggag caggaggaga gggaagtagt ggcagcagcc caggccaggc
301 gtgtgtcctg caagtgctag gaccagccac ccctccccat ggcctccaag ccggctgccg
361 ggaagagcag aggggagaag cggaagaggg tggtgctgac actgaaggag aagattgaca
421 tctgcacgcg cctggagaag ggcgagagcc ggaaggcact gatgcaggag tacaatgtgg
481 gcatgtccac cctctacgac atcagggccc acaaggcgca gctgctccgg ttcttcgcca
541 gctccgactc caacaaggcg ctggagcagc ggcgcacgct gcacacgccc aagctggagc
601 acctggaccg cgtcctgtac gagtggttcc tggggaagcg ctccgagggc gtccccgtgt
661 caggccccat gctcatcgag aaggccaagg acttctacga gcagatgcag ctcactgagc
721 cctgcgtgtt ctccggaggg tggctttggc gctttaaggc cagacacggc attaaaaagc
781 tagatgcatc cagtgaaaag cagtcagccg accaccaggc cgcggagcag ttctgtgcgt
841 ttttcaggag cttggctgct gagcacgggc tgtccgccga gcaggtttac aacgctgatg
901 agaccggcct tttctggcgg tgcctgccaa atcccactcc ggaaggcggg gctgtgcctg
961 gccccaagca gggcaaggac cggctgaccg tgctgatgtg tgccaacgcc acgggctccc
1021 acaggctcaa gcccttggcc atcgggaagt gcagcggtcc cagggctttc aaaggcatcc
1081 agcacctgcc cgtcgcctat aaggcccagg ggaacgcctg ggtggacaag gagatttttt
1141 ccgattggtt ccatcatatc tttgtgccct cggtgagaga gcacttcaga accataggtt
1201 tgccggaaga cagcaaagcc gttctcttgc tggacagctc ccgggctcac ccgcaggagg
1261 ccgagctggt gtccagtaac gttttcacca tcttcctgcc tgccagcgtg gcctcattgg
1321 tgcagcccat ggagcagggc attcggagag atttcatgag gaacttcatt aaccctccgg
1381 tccccctgca gggcccccac gcccgctaca acatgaacga tgccatattc agcgtggcct
1441 gtgcctggaa cgcagtccct agccacgtct tcaggcgggc ctggaggaag ctgtggccgt
1501 cggttgcgtt tgccgaaggc tcctcctctg aggaggagtt ggaggcagag tgcttcccag
1561 tgaagcccca caacaagtcc tttgcacaca tcctggagct tgtgaaggaa ggctcctcct
1621 gcccgggcca gcttcgccag cgccaggccg ccagctgggg ggtagcggga agggaggcag
1681 aagggggacg gccccctgct gccacgtcgc cagcagaggt tgtgtggagt tcagaaaaga
1741 ctccgaaagc tgaccaggac ggcagaggag atcctggtga gggcgaggag gtggcctggg
1801 agcaggcggc cgtggccttt gacgcagtcc tgcgctttgc ggagcggcag ccatgcttca
1861 gtgcgcagga agtggggcag ctgcgggcgc tgcgtgccgt gttccggagc cagcagcagg
1921 tgaggaggcg gcgtggtgcc ctcggggctg tggtcaaggt tgaagccctc caggagggcc
1981 ctggtggctg tggggccaca gctcagtctc ccttgccctg ctcatccaca gcaggtgaca
2041 actgatggct tctctgccct gccctggcca ctggccctgt ttctccccac accctggagt
2101 ggcatggtcc tgtgccccga ccccacctga ggcaggaggg catgtgcaga cactcaagag
2161 cccttccagg agtgggtcgc ccacgggtgt ggctcgggtg cccaggacgg tctgtgcccg
2221 aggttcctgt caatacaggt tttattttat cacttgccgt gtcatccgaa agtgaggaaa
2281 tgttttggaa gggtccaccc tagcctagaa caagccagag ccgcaccctg gctggaatgg
2341 gggccaggct gagccgatct ggtctcgtgt tcgctggctg atcattgcag tatcagaggg
2401 tggagatgtc agtctgtcca cgtggagaga agttgccctc caggccgaca ggaggccatg
2461 cccaccgccc ctggacaggc ttcgtctcag aaggctctat ctgctgggct ggcggccatc
2521 cccgtgttgg gtggaccccg agcacggttg cctgaggtcc gatggcccga gagctgggac
2581 tcagttcttg gcctgctagc ggctgaacag gccgcacatc tcacttcagt tgtggcctca
2641 ttcagcagaa tgactctgga accatcctct gttacccgca gatcctgtcc catgggctct
2701 ggccccaaga tgttgggggc cccacggaga gttgacttgg tagagttcct ttctgggaag
2761 aaagtaggag tggctgacca ggccctgctc atcacccgga tagaggacac ggaccctgtg
2821 tggtattttg gcattttggc tcagagtcca atgtaccatg ttgcccagaa tttcatactt
2881 atggccttta catgaatacg tcttgatcag acattcagag attagtctta ggtttgcacg
2941 taagtcactg aaaacagtaa aacaggctgc ttagatttct aaaaaaaaaa aaaaa
//