LOCUS       BC033691                2619 bp    mRNA    linear   HUM 24-JUL-2006
DEFINITION  Homo sapiens cytochrome P450, family 2, subfamily S, polypeptide 1,
            mRNA (cDNA clone MGC:44853 IMAGE:5212609), complete cds.
ACCESSION   BC033691
VERSION     BC033691.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2619)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2619)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (02-JUL-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 68 Row: o Column: 11.
FEATURES             Location/Qualifiers
     source          1..2619
                     /db_xref="H-InvDB:HIT000041907"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:44853 IMAGE:5212609"
                     /tissue_type="Blood, adult leukocytes"
                     /clone_lib="NIH_MGC_118"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..2619
                     /gene="CYP2S1"
                     /db_xref="GeneID:29785"
                     /db_xref="HGNC:HGNC:15654"
     CDS             18..1532
                     /gene="CYP2S1"
                     /codon_start=1
                     /product="cytochrome P450, family 2, subfamily S,
                     polypeptide 1"
                     /protein_id="AAH33691.1"
                     /db_xref="GeneID:29785"
                     /db_xref="HGNC:HGNC:15654"
                     /translation="MEATGTWALLLALALLLLLTLALSGTRARGHLPPGPTPLPLLGN
                     LLQLRPGALYSGLMRLSKKYGPVFTIYLGPWRPVVVLVGQEAVREALGGQAEEFSGRG
                     TVAMLEGTFDGHGVFFSNGERWRQLRKFTMLALRDLGMGKREGEELIQAEARCLVETF
                     QGTEGRPFDPSLLLAQATSNVVCSLLFGLRFSYEDKEFQAVVRAAGGTLLGVSSQGGQ
                     TYEMFSWFLRPLPGPHKQLLHHVSTLAAFTVRQVQQHQGNLDASGPARDLVDAFLLKM
                     AQEEQNPGTEFTNKNMLMTVIYLLFAGTMTVSTTVGYTLLLLMKYPHVQKWVREELNR
                     ELGAGQAPSLGDRTRLPYTDAVLHEAQRLLALVPMGIPRTLMRTTRFRGYTLPQGTEV
                     FPLLGSILHDPNIFKHPEEFNPDRFLDADGRFRKHEAFLPFSLGKRVCLGEGLAKAEL
                     FLFFTTILQAFSLESPCPPDTLSLKPTVSGLFNIPPAFQLQVRPTDLHSTTQTR"
BASE COUNT          564 a          860 c          694 g          501 t
ORIGIN      
        1 ggagccgacc tgccgagatg gaggcgaccg gcacctgggc gctgctgctg gcgctggcgc
       61 tgctcctgct gctgacgctg gcgctgtccg ggaccagggc ccgaggccac ctgccccccg
      121 ggcccacgcc gctaccactg ctgggaaacc tcctgcagct acggcccggg gcgctgtatt
      181 cagggctcat gcggctgagt aagaagtacg gaccggtgtt caccatctac ctgggaccct
      241 ggcggcctgt ggtggtcctg gttgggcagg aggctgtgcg ggaggccctg ggaggtcagg
      301 ctgaggagtt cagcggccgg ggaaccgtag cgatgctgga agggactttt gatggccatg
      361 gggttttctt ctccaacggg gagcggtgga ggcagctgag gaagtttacc atgcttgctc
      421 tgcgggacct gggcatgggg aagcgagaag gcgaggagct gatccaggcg gaggcccggt
      481 gtctggtgga gacattccag gggacagaag gacgcccatt cgatccctcc ctgctgctgg
      541 cccaggccac ctccaacgta gtctgctccc tcctctttgg cctccgcttc tcctatgagg
      601 ataaggagtt ccaggccgtg gtccgggcag ctggtggtac cctgctggga gtcagctccc
      661 aggggggtca gacctacgag atgttctcct ggttcctgcg gcccctgcca ggcccccaca
      721 agcagctcct ccaccacgtc agcaccttgg ctgccttcac agtccggcag gtgcagcagc
      781 accaggggaa cctggatgct tcgggccccg cacgtgacct tgtcgatgcc ttcctgctga
      841 agatggcaca ggaggaacaa aacccaggca cagaattcac caacaagaac atgctgatga
      901 cagtcattta tttgctgttt gctgggacga tgacggtcag caccacggtc ggctataccc
      961 tcctgctcct gatgaaatac cctcatgtcc aaaagtgggt acgtgaggag ctgaatcggg
     1021 agctgggggc tggccaggca ccaagcctag gggaccgtac ccgcctccct tacaccgacg
     1081 cggttctgca tgaggcgcag cggctgctgg cgctggtgcc catgggaata ccccgcaccc
     1141 tcatgcggac cacccgcttc cgagggtaca ccctgcccca gggcacggag gtcttccccc
     1201 tccttggctc catcctgcat gaccccaaca tcttcaagca cccagaagag ttcaacccag
     1261 accgtttcct ggatgcagat ggacggttca ggaagcatga ggcgttcctg cccttctcct
     1321 tagggaagcg tgtctgcctt ggagagggcc tggcaaaagc ggagctcttc ctcttcttca
     1381 ccaccatcct acaagccttc tccctggaga gcccgtgccc gccggacacc ctgagcctca
     1441 agcccaccgt cagtggcctt ttcaacattc ccccagcctt ccagctgcaa gtccgtccca
     1501 ctgaccttca ctccaccacg cagaccagat gaaggaaggc aacttggaag tggtgggtgc
     1561 ccgggacggt gcctccagcc tcaacagtgg gcatggacag ggttaatgtc tccagagtgt
     1621 acactgcagg cagccacatt tacacgcctg cagttgtttt ccggagtctg tcccacggcc
     1681 cacacgctca cttgactcat gctgctaaga tgcacaaccg cacacccata cacaactaca
     1741 agggccacaa agcaactgct gggttagctt tccacagaca taaatatagt ccatctgcaa
     1801 tcacaagcac atagccaggt aacccaccaa ctcccctgga tctgcagccc acacgtggga
     1861 gtctggctgt caccttcaca agccacagaa acggccacac atgttcacag ctcacacgcc
     1921 ctctccattc atcgaacttc tcagtgtccc tgtccctggt gcctggcaca gggaacagca
     1981 tgccccctcc ggggtcatgc cacccagaga ctgtcgctgt ctatggcccc aactcatgct
     2041 ccctctcttg gctacaccac tctcccagcc tgtgaccaca gatgtccaca cacccccaac
     2101 cacttgtcca cacagctacc cacgtacgac atcgtcctgg ctccccagag tatcttccca
     2161 ctgagacacg ccgcccccac agaggcacag tccccagcca cctctgcaac tgcagccctc
     2221 agtcacccct ttttaagcac cctgattcta ccaaatgcaa acacatctgg gtctgcgatt
     2281 atgcacagag actttggaca tacgaggacc ctcagaccgg aggaacacct gcccaacccc
     2341 aacacgtgct tatgtaacca cgtggaaagc ggcccctgct gcccctccac acacacatac
     2401 acactcactg atctacagcc cctgttcggc gtcagagtcc ccactagacc cagtggaagg
     2461 ggttagagac caagtagggg ccagtttcca attcaccctg tcagggagtg agccggatct
     2521 gacgttcctt gtgacttaag ggtccggctt gggaattaaa gtttgtttct ggcctttagc
     2581 ctaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaa
//