LOCUS BC117204 2816 bp mRNA linear HUM 23-NOV-2007 DEFINITION Homo sapiens Gen homolog 1, endonuclease (Drosophila), mRNA (cDNA clone MGC:150813 IMAGE:40125755), complete cds. ACCESSION BC117204 VERSION BC117204.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2816) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2816) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (26-MAY-2006) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Mike Brownstein, NIMH cDNA Library Preparation: British Columbia Cancer Research Center cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRCB Plate: 6 Row: D Column: 20. FEATURES Location/Qualifiers source 1..2816 /db_xref="H-InvDB:HIT000387494" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:150813 IMAGE:40125755" /tissue_type="Brain, cerebellum, PCR rescued clones" /clone_lib="NIH_MGC_311" /note="Vector: pCR4-TOPO with reversed insert; Clone identification sequence tag: CATTTCAC" gene 1..2816 /gene="GEN1" /gene_synonym="Gen" /db_xref="GeneID:348654" /db_xref="HGNC:HGNC:26881" CDS 24..2750 /gene="GEN1" /gene_synonym="Gen" /codon_start=1 /product="Gen homolog 1, endonuclease (Drosophila)" /protein_id="AAI17205.1" /db_xref="GeneID:348654" /db_xref="HGNC:HGNC:26881" /translation="MGVNDLWQILEPVKQHIPLRNLGGKTIAVDLSLWVCEAQTVKKM MGSVMKPHLRNLFFRISYLTQMDVKLVFVMEGEPPKLKADVISKRNQTRYGSSGKSWS QKTGRSHFKSVLRECLHMLECLGIPWVQAAGEAEAMCAYLNAGGHVDGCLTNDGDTFL YGAQTVYRNFTMNTKDPHVDCYTMSSIKSKLGLDRDALVGLAILLGCDYLPKGVPGVG KEQALKLIQILKGQSLLQRFNRWNETSCNSSPQLLVTKKLAHCSVCSHPGSPKDHERN GCRLCKSDKYCEPHDYEYCCPCEWHRTEHDRQLNEVENNIKKKACCCEGFPFHEVIQE FLLNKDKLVKVIRYQRPDLLLFQRFTLEKMEWPNHYACEKLLVLLTHYDMIERKLGSR NSNQLQPIRIVKTRIRNGVHCFEIEWEKPEHYAMEDKQHGEFALLTIEEESLFEAAYP EIVAVYQKQKLEIKGKKQKRIKPKENNLPEPDEVMSFQSHMTLKPTCEIFHKQNSKLN SGISPDPTLPQESISASLNSLLLPKNTPCLNAQEQFMSSLRPLAIQQIKAVSKSLISE SSQPNTSSHNISVIADLHLSTIDWEGTSFSNSPAIQRNTFSHDLKSEVESELSAIPDG FENIPEQLSCESERYTANIKKVLDEDSDGISPEEHLLSGITDLCLQDLPLKERIFTKL SYPQDNLQPDVNLKTLSILSVKESCIANSGSDCTSHLSKDLPGIPLQNESRDSKILKG DQLLQEDYKVNTSVPYSVSNTVVKTCNVRPPNTALDHSRKVDMQTTRKILMKKSVCLD RHSSDEQSAPVFGKAKYTTQRMKHSSQKHNSSHFKESGHNKLSSPKIHIKETEQCVRS YETAENEESCFPDSTKSSLSSLQCHKKENNSGTCLDSPLPLRQRLKLRFQST" BASE COUNT 942 a 521 c 562 g 791 t ORIGIN 1 cgggggagca gataatcacc agaatgggag tgaatgactt gtggcaaatt ttggagcctg 61 ttaagcaaca catccccttg cgtaatcttg gtgggaaaac cattgcagtt gatctgagtc 121 tctgggtgtg tgaggcacag acagtcaaaa aaatgatggg cagcgtcatg aagccccacc 181 tcaggaactt attttttcgt atctcatatt taacacaaat ggatgtaaaa ctggtatttg 241 ttatggaagg ggaaccacca aagctgaaag ctgatgtcat aagcaagagg aatcagactc 301 ggtatgggtc ttctggaaaa tcgtggtctc agaaaacagg gagatcacat tttaaatcag 361 tcttaagaga gtgcctccat atgctcgaat gcttaggaat cccctgggtt caggctgctg 421 gggaagctga agccatgtgt gcttatctca atgctggtgg tcatgtcgat ggctgcctca 481 ccaatgatgg agatactttc ctttatgggg cccagactgt ttacaggaat ttcactatga 541 atacaaagga cccacatgtt gactgttaca caatgtcatc tatcaagagt aaactaggtt 601 tggatagaga tgctctggtt ggattagcaa tacttcttgg ctgtgattat ctcccaaagg 661 gagtccctgg agttggaaaa gagcaagcat taaaacttat acagattttg aaagggcaaa 721 gtttacttca gaggtttaat cggtggaatg aaacatcttg taactctagt ccacaactgc 781 tagtcactaa aaaactggct cattgttccg tatgttccca tccaggttca cctaaggatc 841 atgaacgtaa tggatgcaga ttatgtaaaa gtgataaata ttgtgagcca catgactatg 901 aatactgctg tccttgtgag tggcaccgta cagaacatga taggcaactc aatgaagtag 961 agaacaatat taagaagaaa gcttgctgtt gtgagggatt cccattccat gaggttattc 1021 aagaattcct tttaaacaag gataaattgg tgaaggttat caggtaccaa agacctgatt 1081 tgttattgtt tcagagattt actcttgaaa aaatggagtg gcccaatcac tatgcatgtg 1141 agaaattgct ggtacttttg acccattatg acatgataga aagaaagctt ggtagcagaa 1201 actctaatca actacagcca attcgaattg ttaagactcg aatcagaaat ggagttcatt 1261 gttttgaaat agaatgggaa aagcctgaac attatgctat ggaagataaa caacatggag 1321 aatttgcttt attaacaatt gaggaagaat cattgtttga agcagcatat cctgagatcg 1381 ttgctgttta ccaaaaacaa aagttagaaa ttaaagggaa gaaacaaaaa cgtattaagc 1441 ctaaagaaaa caatttgcca gaaccagatg aagtaatgag ctttcagtca cacatgactt 1501 taaaacccac atgtgaaatc tttcataagc agaattccaa gttaaattcg gggatttccc 1561 ctgatcctac attaccacag gaatctattt ctgcctcatt gaatagcttg cttttaccta 1621 aaaatactcc atgtttgaat gcacaagaac agttcatgtc ttctctaaga cctttggcta 1681 tacagcaaat taaagctgtc agtaagtctc taatttcaga atctagtcaa cccaatacct 1741 catctcataa tatatccgtg attgctgatc tacacttgag cactattgac tgggaaggta 1801 cttcttttag taattctcca gctattcaaa ggaatacttt ttctcatgat ttaaaatcag 1861 aagttgaatc agagctatca gccatccctg atggctttga aaatatccca gaacaactgt 1921 cctgtgaatc agaaaggtac actgcaaaca taaagaaagt gttggatgag gattctgatg 1981 ggattagtcc tgaagagcat ctactttctg gcattactga tttatgtctt caggatttgc 2041 ctttaaagga acgaatattt acaaaattat catatcctca ggataatcta caaccagatg 2101 tcaacctgaa aactttgtcc atacttagtg taaaagaatc ttgtattgct aacagtggtt 2161 ctgattgtac atcacatctt tcaaaggatc ttccaggaat tcccttgcaa aatgaatcca 2221 gagactctaa aattctaaaa ggagaccagc tgcttcaaga agactataaa gtcaatactt 2281 ctgtccctta ttctgtcagt aacacagtgg taaagacctg caatgttaga ccaccaaata 2341 ctgctttaga tcatagtaga aaagttgata tgcaaaccac tcggaaaatt ttaatgaaga 2401 agagtgtttg ccttgacaga cattcctctg atgaacaaag tgccccagtg tttgggaaag 2461 ctaagtacac aactcaaaga atgaagcaca gttctcaaaa gcataattca tcccatttca 2521 aagaaagtgg ccataacaag ttgagtagcc ctaagataca tattaaagaa actgaacagt 2581 gtgtcagatc ttatgaaaca gctgaaaatg aagaaagctg tttcccagat tcaacaaaaa 2641 gttctctgag ttctctacaa tgtcataaga aagaaaacaa ctctggtact tgtttggata 2701 gccctcttcc tttacgccag agattaaaac taagattcca aagcacttga aatttaaaac 2761 acttaggtat aacttaacta ttttagtact atcagcaata gcagagacag agggaa //