LOCUS       BC035575                4336 bp    mRNA    linear   HUM 29-AUG-2008
DEFINITION  Homo sapiens peroxisomal biogenesis factor 1, mRNA (cDNA clone
            MGC:45327 IMAGE:5498274), complete cds.
ACCESSION   BC035575
VERSION     BC035575.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 4336)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 4336)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (31-JUL-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Lou Staudt
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D., McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 69 Row: f Column: 3
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 4505724.
FEATURES             Location/Qualifiers
     source          1..4336
                     /db_xref="H-InvDB:HIT000051418"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:45327 IMAGE:5498274"
                     /tissue_type="Lymph, lymphoma"
                     /clone_lib="NIH_MGC_85"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..4336
                     /gene="PEX1"
                     /gene_synonym="ZWS1"
                     /db_xref="GeneID:5189"
                     /db_xref="HGNC:HGNC:8850"
                     /db_xref="MIM:602136"
     CDS             39..3890
                     /gene="PEX1"
                     /gene_synonym="ZWS1"
                     /codon_start=1
                     /product="peroxisomal biogenesis factor 1"
                     /protein_id="AAH35575.1"
                     /db_xref="GeneID:5189"
                     /db_xref="HGNC:HGNC:8850"
                     /db_xref="MIM:602136"
                     /translation="MWGSDRLAGAGGGGAAVTVAFTNARDCFLHLPRRLVAQLHLLQN
                     QAIEVVWSHQPAFLSWVEGRHFSDQGENVAEINRQVGQKLGLSNGGQVFLKPCSHVVS
                     CQQVEVEPLSADDWEILELHAVSLEQHLLDQIRIVFPKAIFPVWVDQQTYIFIQIVAL
                     IPAASYGRLETDTKLLIQPKTRRAKENTFSKADAEYKKLHSYGRDQKGMMKELQTKQL
                     QSNTVGITESNENESEIPVDSSSVASLWTMIGSIFSFQSEKKQETSWGLTEINAFKNM
                     QSKVVPLDNIFRVCKSQPPSIYNASATSVFHKHCAIHVFPWDQEYFDVEPSFTVTYGK
                     LVKLLSPKQQQSKTKQNVLSPEKEKQMSEPLDQKKIRSDHNEEDEKACVLQVVWNGLE
                     ELNNAIKYTKNVEVLHLGKVWIPDDLRKRLNIEMHAVVRITPVEVTPKIPRSLKLQPR
                     ENLPKDISEEDIKTVFYSWLQQSTTTMLPLVISEEEFIKLETKDGLKEFSLSIVHSWE
                     KEKDKNIFLLSPNLLQKTTIQVLLDPMVKEENSEEIDFILPFLKLSSLGGVNSLGVSS
                     LEHITHSLLGRPLSRQLMSLVAGLRNGALLLTGGKGSGKSTLAKAICKEAFDKLDAHV
                     ERVDCKALRGKRLENIQKTLEVAFSEAVWMQPSVVLLDDLDLIAGLPAVPEHEHSPDA
                     VQSQRLAHALNDMIKEFISMGSLVALIATSQSQQSLHPLLVSAQGVHIFQCVQHIQPP
                     NQEQRCEILCNVIKNKLDCDINKFTDLDLQHVAKETGGFVARDFTVLVDRAIHSRLSR
                     QSISTREKLVLTTLDFQKALRGFLPASLRSVNLHKPRDLGWDKIGGLHEVRQILMDTI
                     QLPAKYPELFANLPIRQRTGILLYGPPGTGKTLLAGVIARESRMNFISVKGPELLSKY
                     IGASEQAVRDIFIRAQAAKPCILFFDEFESIAPRRGHDNTGVTDRVVNQLLTQLDGVE
                     GLQGVYVLAATSRPDLIDPALLRPGRLDKCVYCPPPDQVSRLEILNVLSDSLPLADDV
                     DLQHVASVTDSFTGADLKALLYNAQLEALHGMLLSSGLQDGSSSSDSDLSLSSMVFLN
                     HSSGSDDSAGDGECGLDQSLVSLEMSEILPDESKFNMYRLYFGSSYESELGNGTSSDL
                     SSQCLSAPSSMTQDLPGVPGKDQLFSQPPVLRTASQEGCQELTQEQRDQLRADISIIK
                     GRYRSQSGEDESMNQPGPIKTRLAISQSHLMTALGHTRPSISEDDWKNFAELYESFQN
                     PKRRKNQSGTMFRPGQKVTLA"
BASE COUNT         1345 a          848 c          934 g         1209 t
ORIGIN      
        1 ccacgcgtcc ggcgaaccca gagcgacgct ccgggacgat gtggggcagc gatcgcctgg
       61 cgggtgctgg gggaggcggg gcggcagtga ctgtggcctt caccaacgct cgcgactgct
      121 tcctccacct gccgcggcgt ctcgtggccc agctgcatct gctgcagaat caagctatag
      181 aagtggtctg gagtcaccag cctgcattct tgagctgggt ggaaggcagg cattttagtg
      241 atcaaggtga aaatgtggct gaaattaaca gacaagttgg tcaaaaactt ggactctcaa
      301 atgggggaca ggtatttctc aagccatgtt cccatgtggt atcttgtcaa caagttgagg
      361 tggaacccct ctcagcagat gattgggaga tactggagct gcatgctgtt tcccttgaac
      421 aacatcttct agatcaaatt cgaatagttt ttccaaaagc catttttcct gtttgggttg
      481 atcaacaaac gtacatattt atccaaattg ttgcactaat accagctgcc tcttatggaa
      541 ggctggaaac tgacaccaaa ctccttattc agccaaagac acgccgagcc aaagagaata
      601 cattttcaaa agctgatgct gaatataaaa aacttcatag ttatggaaga gaccagaaag
      661 gaatgatgaa agaacttcaa accaagcaac ttcagtcaaa tactgtggga atcactgaat
      721 ctaatgaaaa cgagtcagag attccagttg actcatcatc agtagcaagt ttatggacta
      781 tgataggaag cattttttcc tttcaatctg agaagaaaca agagacatct tggggtttaa
      841 ctgaaatcaa tgcattcaaa aatatgcagt caaaggttgt tcctctagac aatattttca
      901 gagtatgcaa atctcaacct cctagtatat ataacgcgtc agcaacctct gtttttcata
      961 aacactgtgc cattcatgta tttccatggg accaggaata ttttgatgta gagcccagct
     1021 ttactgtgac atatggaaag ctagttaagc tactttctcc aaagcaacag caaagtaaaa
     1081 caaaacaaaa tgtgttatca cctgaaaaag agaagcagat gtcagagcca ctagatcaaa
     1141 aaaaaattag gtcagatcat aatgaagaag atgagaaggc ctgtgtgcta caagtagtct
     1201 ggaatggact tgaagaattg aacaatgcca tcaaatatac caaaaatgta gaagttctcc
     1261 atcttgggaa agtctggatt ccagatgacc tgaggaagag actaaatata gaaatgcatg
     1321 ccgtagtcag gataactcca gtggaagtta cccctaaaat tccaagatct ctaaagttac
     1381 aacctagaga gaatttacct aaagacataa gtgaagaaga cataaaaact gtattttatt
     1441 catggctaca gcagtctact accaccatgc ttcctttggt aatatcagag gaagaattta
     1501 ttaagctgga aactaaagat ggactgaagg aattttctct gagtatagtt cattcttggg
     1561 aaaaagaaaa agataaaaat atttttctgt tgagtcccaa tttgctgcag aagactacaa
     1621 tacaagtcct tctagatcct atggtaaaag aagaaaacag tgaggaaatt gactttattc
     1681 ttcctttttt aaagctgagc tctttgggag gagtgaattc cttaggcgta tcctccttgg
     1741 agcacatcac tcacagcctc ctgggacgcc ctttgtctcg gcagctgatg tctcttgttg
     1801 caggacttag gaatggagct cttttactca caggaggaaa gggaagtgga aaatcaactt
     1861 tagccaaagc aatctgtaaa gaagcatttg acaaactgga tgcccatgtg gagagagttg
     1921 actgtaaagc tttacgagga aaaaggcttg aaaacataca aaaaacccta gaggtggctt
     1981 tctcagaggc agtgtggatg cagccatctg ttgtcctgct ggatgacctt gacctcattg
     2041 ctggactgcc tgctgtcccg gaacatgagc acagtcctga tgcggtgcag agccagcggc
     2101 ttgctcatgc tttgaatgat atgataaaag agtttatctc catgggaagt ttggttgcac
     2161 tgattgccac aagtcagtct cagcaatctc tacatccttt acttgtttct gctcaaggag
     2221 ttcacatatt tcagtgcgtc caacacattc agcctcctaa tcaggaacaa agatgtgaaa
     2281 ttctgtgtaa tgtaataaaa aataaattgg actgtgatat aaacaagttc accgatcttg
     2341 acctgcagca tgtagctaaa gaaactggag ggtttgtggc tagagatttt acagtacttg
     2401 tggatcgagc catacattct cgactctctc gtcagagtat atccaccaga gaaaaattag
     2461 ttttaacaac attggacttc caaaaggctc tccgcggatt tcttcctgcg tctttgcgaa
     2521 gtgtcaacct gcataaacct agagacctgg gttgggacaa gattggtggg ttacatgaag
     2581 ttaggcagat actcatggat actatccagt tacctgccaa gtatccagaa ttatttgcaa
     2641 acttgcccat acgacaaaga acaggaatac tgttgtatgg tccgcctgga acaggaaaaa
     2701 ccttactagc tggggtaatt gcacgagaga gtagaatgaa ttttataagt gtcaaggggc
     2761 cagagttact cagcaaatac attggagcaa gtgaacaagc tgttcgggat atttttatta
     2821 gagcacaggc tgcaaagccc tgcattcttt tctttgatga atttgaatcc attgctcctc
     2881 ggcggggtca tgataataca ggagttacag accgagtagt taaccagttg ctgactcagt
     2941 tggatggagt agaaggctta cagggtgttt atgtattggc tgctactagt cgccctgact
     3001 tgattgaccc tgccctgctt aggcctggtc gactagataa atgtgtatac tgtcctcctc
     3061 ctgatcaggt gtcacgtctt gaaattttaa atgtcctcag tgactctcta cctctggcag
     3121 atgatgttga ccttcagcat gtagcatcag taactgactc ctttactgga gctgatctga
     3181 aagctttact ttacaatgcc caattggagg ccttacatgg aatgctgctc tcgagtggac
     3241 tccaggatgg aagttccagc tctgatagtg acctaagtct gtcttcaatg gtctttctta
     3301 accatagcag tggctctgac gattcagctg gagatggaga atgtggctta gatcagtccc
     3361 ttgtttcttt agagatgtcc gagatccttc cagatgaatc aaaattcaat atgtaccggc
     3421 tctactttgg aagctcttat gaatcagaac ttggaaatgg aacctcttct gatttgagct
     3481 cacaatgtct ctctgcacca agctccatga ctcaggattt gcctggagtt cctgggaaag
     3541 accagttgtt ttcacagcct ccagtgttaa ggacagcttc acaagagggt tgccaagaac
     3601 ttacacaaga acaaagagat caactgaggg cagatatcag tattatcaaa ggcagatacc
     3661 ggagccaaag tggagaggac gaatccatga accaaccagg accaatcaaa accagactgg
     3721 ctattagtca gtcacattta atgactgcac ttggtcacac aagaccatcc attagtgaag
     3781 atgactggaa gaattttgct gagctatatg aaagctttca aaatccaaag aggagaaaaa
     3841 atcaaagtgg aacaatgttt cgacctggac agaaagtaac tttagcataa aatatacttc
     3901 tttttgattt ggttctgtta agttttttga tggcttttcc atatgttgta acaggaaaaa
     3961 aatggtgtct atgaatttct tcttaattta acaaatttgg ttaatttata aaatcacaga
     4021 ttggtaaatg ctataattat gtaatgatca ggattgagat taatactgta gtataaattg
     4081 ggacattata acagattcca tattttattt cctaaaatct aaattcagtc tttaatgaaa
     4141 taatattagc caaatggtgg aactaattta tttcttttga ggaaaagata ataaagaatg
     4201 taattaaatt taaatttctt ggaattccca gttgtatatt catcaccttt gtagcatttg
     4261 acaaatttta tgcttagcag cttcttcact gttttgaaat aaaatatcct attacctact
     4321 gaaaaaaaaa aaaaaa
//