LOCUS BC017096 2081 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens alkB, alkylation repair homolog 4 (E. coli), mRNA (cDNA clone MGC:12618 IMAGE:2820440), complete cds. ACCESSION BC017096 VERSION BC017096.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2081) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2081) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (05-NOV-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: DCTD/DTP cDNA Library Preparation: Rubin Laboratory cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Genome Sequence Centre, BC Cancer Agency, Vancouver, BC, Canada info@bcgsc.bc.ca Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang, Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAL Plate: 18 Row: a Column: 4 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 20070290. FEATURES Location/Qualifiers source 1..2081 /db_xref="H-InvDB:HIT000037835" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:12618 IMAGE:2820440" /tissue_type="Lung, small cell carcinoma" /clone_lib="NIH_MGC_7" /lab_host="DH10B-R" /note="Vector: pOTB7" gene 1..2081 /gene="ALKBH4" /gene_synonym="FLJ20013" /db_xref="GeneID:54784" /db_xref="HGNC:HGNC:21900" CDS 6..914 /gene="ALKBH4" /gene_synonym="FLJ20013" /codon_start=1 /product="alkB, alkylation repair homolog 4 (E. coli)" /protein_id="AAH17096.1" /db_xref="GeneID:54784" /db_xref="HGNC:HGNC:21900" /translation="MAAAAAETPEVLRECGCKGIRTCLICERQRGSDPPWELPPAKTY RFIYCSDTGWAVGTEESDFEGWAFPFPGVMLIEDFVTREEEAELVRLMDRDPWKLSQS GRRKQDYGPKVNFRKQKLKTEGFCGLPSFSREVVRRMGLYPGLEGFRPVEQCNLDYCP ERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMCREAPGSLLLCSAPSAAPEALVDSV IAPSRSVLCQEVEVAIPLPARSLLVLTGAARHQWKHAIHRRHIEARRVCVTFRELSAE FGPGGRQQELGQELLRIALSFQGRPV" BASE COUNT 469 a 577 c 626 g 409 t ORIGIN 1 gcgcgatggc ggcggctgcc gccgagaccc ccgaagtcct tcgggaatgc ggttgcaagg 61 gcatccggac ctgtctgatc tgcgagcggc agcgcggcag tgacccgccc tgggagctgc 121 ccccagcgaa aacataccgt ttcatttact gctccgacac cggctgggcc gtgggcacag 181 aggagtctga ctttgagggc tgggccttcc ccttcccagg agtgatgctg atcgaggact 241 ttgtgacccg ggaggaagaa gccgagttgg tgcggctcat ggaccgtgac ccctggaagc 301 tctcccagtc tggacggagg aagcaggact atggccccaa agtcaacttt cggaaacaga 361 agctaaagac cgagggcttc tgcggcctcc ccagcttcag ccgggaggtg gtgcggagga 421 tgggcctcta cccggggctg gagggcttcc ggcccgtcga gcagtgcaac ctggactact 481 gccccgagcg gggctctgcc attgaccccc acctggacga cgcctggctg tggggggagc 541 ggctggtcag cctcaacctc ctgtccccca ccgtgctgtc catgtgtcgg gaggcgcccg 601 ggagcctgct cctctgctcg gccccgtcgg ctgccccgga ggccttggtg gacagcgtga 661 tagcacccag ccggtcggtg ctatgccagg aggtggaggt ggccatcccc ttacccgccc 721 gctccctgct ggtcctcacc ggggcggcac ggcaccagtg gaagcatgcc atccaccgca 781 gacacatcga ggcccgccgc gtctgcgtca ctttccggga gctgtcggct gagtttggcc 841 ctggagggag gcagcaagag ctgggccagg aactgctgcg gatcgccctc tccttccagg 901 gaagacccgt gtgaaccgcc tccttggctc cagacttgac tgatcccggg attgaaatga 961 ggagcacaga acagggcctc ctgcaactca cggggtttca agagaagatg gctgacccct 1021 gatgctgtga gcagtgtgag ccctgcccag gagcaggttt tgatgggaac gtacctccag 1081 gcagccccct tccacctgga ccgtggccac acttttttgg ttatttagtt tgtcacagtc 1141 ttggggacat gggatcattt gagcttaaaa aatactgggg gccgggcaca gtggctcaca 1201 cctgtaatcc taacactttg ggaggctgag gtgggcggat cacttgatgc caggagttcg 1261 agaccagcct ggccaacacg gtgaaaaccc gtctctacaa aaactacaaa aattagccgg 1321 gtgtggtgac tcacagccgt aatcccagct actcgggagg ctaaggtggg agaattgctt 1381 gaacctggga ggcggaggtt gcagtgagcc aagatcacgc cactgcactc cagcctcggt 1441 gacagagcaa gactgttttg aaaaaaaaaa aaaatgggaa cattttaaat gattttcacc 1501 tttattatgc atctattttc atggggtttc ccgatatctc actgtccagt cccttcattt 1561 ggggaatgtg ttggattagg gaacagggtt gaagatttga agtttagact aaagagctgg 1621 gaacagcttc agagtcaggc tcagcctgac tcatgcttga cacccccacg cccagggagg 1681 gttgggggat gtgaggaggg cagggaaatc tgagagcctc cttccagccc cataacgctg 1741 ttaacaagta ggaaaaatta aagctcccgg ccaggcgcgg tgactcacac ctgtaatccg 1801 agtactttgc ggggctcagg tgggaggatt gcttgaggcc agcctgggca acatagtgag 1861 acccccatct ctacaaaaaa tacaaacatt agctgggcgt ctgggcatgg tggcacacac 1921 ctgtagtccc agctactcga aaggctgagg cgggaggatg gctttaccac catgtcaagg 1981 ctgcagtgag ctcatgatca taccactgca cttaacttgg caacagagca agaccctgtc 2041 cctaaaataa ataaaaggaa aacaaaaaaa aaaaaaaaaa a //