LOCUS       BC143325                1610 bp    mRNA    linear   HUM 18-MAR-2009
DEFINITION  Homo sapiens cytochrome P450, family 2, subfamily A, polypeptide 7,
            mRNA (cDNA clone MGC:176847 IMAGE:9051830), complete cds.
ACCESSION   BC143325
VERSION     BC143325.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1610)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 1610)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JUN-2007) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Mike Brownstein, NIMH
            cDNA Library Preparation: British Columbia Cancer Research Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRCB Plate: 15 Row: M Column: 21.
FEATURES             Location/Qualifiers
     source          1..1610
                     /db_xref="H-InvDB:HIT000501783"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:176847 IMAGE:9051830"
                     /tissue_type="Liver, PCR rescued clones"
                     /clone_lib="NIH_MGC_291"
                     /note="Vector: pCR4-TOPO with reversed insert; Clone
                     identification sequence tag: TACACAAC"
     gene            1..1610
                     /gene="CYP2A7"
                     /gene_synonym="CPA7"
                     /gene_synonym="CPAD"
                     /gene_synonym="CYP2A"
                     /gene_synonym="CYPIIA7"
                     /gene_synonym="P450-IIA4"
                     /db_xref="GeneID:1549"
                     /db_xref="HGNC:HGNC:2611"
                     /db_xref="MIM:608054"
     CDS             46..1530
                     /gene="CYP2A7"
                     /gene_synonym="CPA7"
                     /gene_synonym="CPAD"
                     /gene_synonym="CYP2A"
                     /gene_synonym="CYPIIA7"
                     /gene_synonym="P450-IIA4"
                     /codon_start=1
                     /product="CYP2A7 protein"
                     /protein_id="AAI43326.1"
                     /db_xref="GeneID:1549"
                     /db_xref="HGNC:HGNC:2611"
                     /db_xref="MIM:608054"
                     /translation="MLASGLLLVALLACLTVMVLMSVWQQRKSRGKLPPGPTPLPFIG
                     NYLQLNTEHICDSIMKFSECYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRG
                     EQATFDWVFKGYGVAFSNGERAKQLLRFAIATLRDFGVGKRGIEERIQEESGFLIEAI
                     RSTHGANIDPTFFLSRTVSNVISSIVFGDRFDYEDKEFLSLLSVMLGIFQFTSTSTGQ
                     LYEMFSSVMKHLPGPQQQAFKLLQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQ
                     EEEKNPNTEFYLKNLMMSTLNLFIAGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRV
                     IGKNRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPKGTEVF
                     PMLGSVLRDPSFFSNPQDFNPQHFLDDKGQFKKSDAFVPFSIGKRYCFGEGLARMELF
                     LFFTTVMQNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR"
BASE COUNT      348 a    466 c    463 g    333 t
ORIGIN
        1 aggcaaacca ccccacccat caccatctgt catctcacta ccaccatgct ggcctcaggg
       61 ctgcttctgg tggccttgct ggcctgcctg actgtgatgg tcttgatgtc tgtctggcag
      121 cagaggaaga gcagggggaa gctgcctccg ggacccaccc cactgccctt cattggaaac
      181 tacctccagc tgaacacaga gcacatatgt gactccatca tgaagttcag tgagtgctat
      241 ggccccgtgt tcaccattca cttggggccc cggcgggtcg tggtgctgtg tggacatgat
      301 gccgtcaggg aggctctggt ggaccaggct gaggagttca gcgggcgagg cgagcaagcc
      361 accttcgact gggtcttcaa aggctatggc gtggcgttca gcaacgggga gcgcgccaag
      421 cagctcctgc gctttgccat cgccaccctg agggacttcg gggtgggcaa gcgaggcatc
      481 gaggagcgca tccaggagga gtcgggcttc ctcatcgagg ccatccggag cacgcacggc
      541 gccaatatcg atcccacctt cttcctgagc cgcacagtct ccaatgtcat cagctccatt
      601 gtctttgggg accgctttga ctatgaggac aaagagttcc tgtcactgct gagcgtgatg
      661 ctaggaatct tccagttcac gtcaacctcc acggggcagc tctatgagat gttctcttcg
      721 gtgatgaaac acctgccagg accacagcaa caggccttta agttgctgca agggctggag
      781 gacttcatag ccaagaaggt ggagcacaac cagcgcacgc tggatcccaa ttccccacgg
      841 gacttcatcg actcctttct catccgcatg caggaggagg agaagaaccc caacacggag
      901 ttctacttga agaacctgat gatgagcacg ttgaacctct tcattgcagg caccgagacg
      961 gtcagcacca ccctgcgcta tggcttcttg ctgctcatga agcacccaga ggtggaggcc
     1021 aaggtccatg aggagattga cagagtgatc ggcaagaacc ggcagcccaa gtttgaggac
     1081 cgggccaaga tgccctacat ggaggcagtg atccacgaga tccaaagatt tggagacgtg
     1141 atccccatga gtttggcccg cagggttaaa aaggacacca agtttcggga ttttttcctc
     1201 cctaagggca ccgaagtgtt ccctatgctg ggctccgtgc tgagagaccc cagcttcttc
     1261 tccaaccctc aggacttcaa tccccagcat ttcctggatg acaaggggca gtttaagaag
     1321 agtgatgctt ttgtgccctt ttccatcgga aagcggtact gtttcggaga aggcctggcc
     1381 agaatggagc tctttctctt cttcaccacc gtcatgcaga acttccgcct caagtcctcc
     1441 cagtcaccta aggacattga cgtgtccccc aaacacgtgg gctttgccac gatcccacga
     1501 aactacacca tgagcttcct gccccgctga gcgagggctg tgccggtgca ggtctggtgg
     1561 gcggggccag ggaaaggcgg ggtcagggcg gggttcgcgg aagaggcggg
//