LOCUS       BC033752                2020 bp    mRNA    linear   HUM 19-MAR-2009
DEFINITION  Homo sapiens cytochrome P450, family 20, subfamily A, polypeptide
            1, mRNA (cDNA clone MGC:45025 IMAGE:5211537), complete cds.
ACCESSION   BC033752
VERSION     BC033752.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2020)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2020)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUL-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 68 Row: o Column: 6
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 29171729
            The stop codon of the CDS annotated on this record is located > 55
            bases upstream of a splice junction, and therefore the mRNA is
            predicted to be subject to nonsense-mediated mRNA decay (NMD).
FEATURES             Location/Qualifiers
     source          1..2020
                     /db_xref="H-InvDB:HIT000041937"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:45025 IMAGE:5211537"
                     /tissue_type="Blood, adult leukocytes"
                     /clone_lib="NIH_MGC_118"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2020
                     /gene="CYP20A1"
                     /gene_synonym="CYP-M"
                     /db_xref="GeneID:57404"
                     /db_xref="HGNC:HGNC:20576"
     CDS             55..1443
                     /gene="CYP20A1"
                     /gene_synonym="CYP-M"
                     /codon_start=1
                     /product="CYP20A1 protein"
                     /protein_id="AAH33752.1"
                     /db_xref="GeneID:57404"
                     /db_xref="HGNC:HGNC:20576"
                     /translation="MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGN
                     LPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTLDPFET
                     MLKSLLRYQSGGGSVSENHMRKKLYENGVTDSLKSNFALLLKLSEELLDKWLSYPETQ
                     HVPLSQHMLGFAMKSVTQMVMGSTFEDDQEVIRFQKNHGTVWSEIGKGFLDGSLDKNM
                     TRKKQYEDALMQLESVLRNIIKERKGRNFSQHIFIDSLVQGNLNDQQILEDSMIFSLA
                     SCIITAKLCTWAICFLTTSEEVQKKLYEEINQVFGNGPVTPEKIEQLRYCQHVLCETV
                     RTAKLTPVSAQFQDIEGKIDRFIIPRETLVLYALGVVLQDPNTWPSPHKFDPDRFDDE
                     LVMKTFSSLGFSGTQECPELRFAYMVTTVLLSVLVKRLHLLSVEGQVIETKYELVTSS
                     REEAWITVSKRY"
BASE COUNT          613 a          360 c          420 g          627 t
ORIGIN      
        1 ggcgctgctg ctggagcggc cgatccgaga cgtggctccc tgggcggcag aaccatgttg
       61 gacttcgcga tcttcgccgt taccttcttg ctggcgttgg tgggagccgt gctctacctc
      121 tatccggctt ccagacaagc tgcaggaatt ccagggatta ctccaactga agaaaaagat
      181 ggtaatcttc cagatattgt gaatagtgga agtttgcatg agttcctggt taatttgcat
      241 gagagatatg ggcctgtggt ctccttctgg tttggcaggc gcctcgtggt tagtttgggc
      301 actgttgatg tactgaagca gcatatcaat cccaataaga cattggaccc ttttgaaacc
      361 atgctgaagt cattattaag gtatcaatct ggtggtggca gtgtgagtga aaaccacatg
      421 aggaaaaaat tgtatgaaaa tggtgtgact gattctctga agagtaactt tgccctcctc
      481 ctaaagcttt cagaagaatt attagataaa tggctctcct acccagagac ccagcacgtg
      541 cccctcagcc agcatatgct tggttttgct atgaagtctg ttacacagat ggtaatgggt
      601 agtacatttg aagatgatca ggaagtcatt cgcttccaga agaatcatgg cacagtttgg
      661 tctgagattg gaaaaggctt tctagatggg tcacttgata aaaacatgac tcggaaaaaa
      721 caatatgaag atgccctcat gcaactggag tctgttttaa ggaacatcat aaaagaacga
      781 aaaggaagga acttcagtca acatattttc attgactcct tagtacaagg gaaccttaat
      841 gaccaacaga tcctagaaga cagtatgata ttttctctgg ccagttgcat aataactgca
      901 aaattgtgta cctgggcaat ctgtttttta accacctctg aagaagttca aaaaaaatta
      961 tatgaagaga taaaccaagt ttttggaaat ggtcctgtta ctccagagaa aattgagcag
     1021 ctcagatatt gtcagcatgt gctttgtgaa actgttcgaa ctgccaaact gactccagtt
     1081 tctgcccagt ttcaagatat tgaaggaaaa attgaccgat ttattattcc tagagagacc
     1141 ctcgtccttt atgcccttgg tgtggtactt caggatccta atacttggcc atctccacac
     1201 aagtttgatc cagatcggtt tgatgatgaa ttagtaatga aaactttttc ctcacttgga
     1261 ttctcaggca cacaggagtg tccagagttg aggtttgcat atatggtgac cacagtactt
     1321 cttagtgtat tggtgaagag actgcaccta ctttctgtgg agggacaggt tattgaaaca
     1381 aagtatgaac tggtaacatc atcaagggaa gaagcttgga tcactgtctc aaagagatat
     1441 taaaatttta tacatttaaa atcattgtta aattgattga ggaaaacaac catttaaaaa
     1501 aaatctatgt tgaatccttt tataaaccag tatcactttg taatataaac acctatttgt
     1561 acttaatttt gtaaatttgg atttttatat atcatatttt cttaattcat tgtacacatt
     1621 tgacttactg cacagtatat tgatcatttt aatgggaaac tttagctttc tactttttat
     1681 ttttgttttt ttcactttct atgccattat ttttgtattc tttttcttag tgtgagctct
     1741 aaaatcaatg ttcttgaaaa agaaattatt ttgcagaagt tggggaatca tgtttgttga
     1801 atatgtataa aatagaaaca taggctcggc ctcccagggt gctcggattg cagacgtgag
     1861 ccactgcgcc cggcccgttg gcttgctttc tagatactgg gaatagatct tgttatagta
     1921 tgtgataaat agatcttgtt atagtatgtg ataaaatgcc atgatcaaat aaattactta
     1981 tatcctgaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
//