LOCUS       BC041144                3864 bp    mRNA    linear   HUM 18-MAR-2009
DEFINITION  Homo sapiens aprataxin and PNKP like factor, mRNA (cDNA clone
            MGC:47799 IMAGE:6042653), complete cds.
ACCESSION   BC041144
VERSION     BC041144.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3864)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3864)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (13-DEC-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: National Institutes of Health Intramural
            Sequencing Center (NISC),
            Gaithersburg, Maryland;
            Web site: http://www.nisc.nih.gov/
            Contact: nisc_mgc@nhgri.nih.gov
            Akhter,N., Ayele,K., Beckstrom-Sternberg,S.M., Benjamin,B.,
            Blakesley,R.W., Bouffard,G.G., Breen,K., Brinkley,C., Brooks,S.,
            Dietrich,N.L., Granite,S., Guan,X., Gupta,J., Haghighi,P.,
            Hansen,N., Ho,S.-L., Karlins,E., Kwong,P., Laric,P., Legaspi,R.,
            Maduro,Q.L., Masiello,C., Maskeri,B., Mastrian,S.D.,McCloskey,J.C.,
            McDowell,J., Pearson,R., Stantripop,S., Thomas,P.J., Touchman,J.W.,
            Tsurgeon,C., Vogt,J.L., Walker,M.A., Wetherby,K.D., Wiggins,L.,
            Young,A., Zhang,L.-H. and Green,E.D.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 82 Row: n Column: 18
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 27734904.
FEATURES             Location/Qualifiers
     source          1..3864
                     /db_xref="H-InvDB:HIT000052523"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:47799 IMAGE:6042653"
                     /tissue_type="Testis, embryonal carcinoma"
                     /clone_lib="NIH_MGC_92"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3864
                     /gene="APLF"
                     /gene_synonym="APFL"
                     /gene_synonym="PALF"
                     /gene_synonym="Xip1"
                     /db_xref="GeneID:200558"
                     /db_xref="HGNC:HGNC:28724"
                     /db_xref="MIM:611035"
     CDS             123..1658
                     /gene="APLF"
                     /gene_synonym="APFL"
                     /gene_synonym="PALF"
                     /gene_synonym="Xip1"
                     /codon_start=1
                     /product="aprataxin and PNKP like factor"
                     /protein_id="AAH41144.1"
                     /db_xref="GeneID:200558"
                     /db_xref="HGNC:HGNC:28724"
                     /db_xref="MIM:611035"
                     /translation="MSGGFELQPRDGGPRVALAPGETVIGRGPLLGITDKRVSRRHAI
                     LEVAGGQLRIKPIHTNPCFYQSSEKSQLLPLKPNLWCYLNPGDSFSLLVDKYIFRILS
                     IPSEVEMQCTLRNSQVLDEDNILNETPKSPVINLPHETTGASQLEGSTEIAKTQMTPT
                     NSVSFLGENRDCNKQQPILAERKRILPTWMLAEHLSDQNLSVPAISGGNVIQGSGKEE
                     ICKDKSQLNTTQQGRRQLISSGSSENTSAEQDTGEECKNTDQEESTISSKEMPQSFSA
                     ITLSNTEMNNIKTNAQRNKLPIEELGKVSKHKIATKRTPHKEDEAMSCSENCSSAQGD
                     SLQDESQGSHSESSSNPSNPETLHAKATDSVLQGSEGNKVKRTSCMYGANCYRKNPVH
                     FQHFSHPGDSDYGGVQIVGQDETDDRPECPYGPSCYRKNPQHKIEYRHNTLPVRNVLD
                     EDNDNVGQPNEYDLNDSFLDDEEEDYEPTDEDSDWEPGKEDEEKEDVEELLKEAKRFM
                     KRK"
BASE COUNT         1350 a          677 c          735 g         1102 t
ORIGIN      
        1 gcctttgagg ccctccctcg gtgttttttc ccagggcgtg ggcttgcccc gcgcgtgtct
       61 gtggagggcg gaaacagcgg aggggccagt ctcctggcga aggggcctaa tccttgcccg
      121 ccatgtccgg gggcttcgag ctgcagccgc gggacggcgg tccccgggtg gccctggcgc
      181 ccggggagac ggtgatcggc cgcgggccgc tgctgggaat aacagacaag agagtatcca
      241 gaagacatgc cattcttgag gtggcaggtg gtcagctgcg aatcaaaccg atacacacaa
      301 atccatgttt ttaccagtct tcagagaaga gtcagctctt accattgaag ccaaatctat
      361 ggtgctattt gaatcctgga gacagctttt ctttgttagt tgacaaatac attttccgca
      421 ttctctctat accctctgaa gtggaaatgc aatgtacctt aagaaacagt caagtgcttg
      481 atgaagataa tatattgaat gaaacaccaa aatcccccgt gattaattta cctcatgaga
      541 ctactggtgc ctcacaactg gaaggaagca cagaaatagc caagacccag atgactccca
      601 caaatagtgt gtctttccta ggtgaaaata gagactgcaa taagcagcag ccaatccttg
      661 ccgagaggaa aagaatcctt ccaacttgga tgttagcaga acatttaagt gatcaaaacc
      721 tttcagtacc agcaatcagt ggaggtaatg taatccaggg aagtggaaaa gaagaaatct
      781 gcaaagataa atcccagcta aacacaaccc agcaaggaag aaggcaatta atttcatcag
      841 gaagttcaga aaatacatca gcagaacaag acacaggaga agagtgcaaa aatactgatc
      901 aggaagagtc taccatttca tccaaggaaa tgccacaatc attttctgca atcacattaa
      961 gtaacacaga gatgaataat attaagacta atgcacagag aaacaaactt ccaatagagg
     1021 aacttggtaa agtttctaaa cataaaattg ccactaaaag aacaccacat aaagaagatg
     1081 aggcaatgag ctgttctgaa aattgttcga gtgcccaggg cgactcactt caggatgagt
     1141 ctcaagggtc tcattctgag tccagctcta atccctccaa tcctgaaact ttgcatgcaa
     1201 aggcaactga ttcagttcta caaggttctg aaggaaacaa ggtcaagagg acatcctgca
     1261 tgtatggggc aaactgctat aggaagaatc ctgttcattt tcaacatttt agccatcctg
     1321 gtgatagtga ttatggaggt gtacaaatcg tgggccaaga tgagactgat gaccggcctg
     1381 aatgtcccta tggaccatcc tgttatagga agaatcccca gcacaagata gaatatagac
     1441 ataatacgct tccagtgaga aatgttttag atgaagataa tgataatgtt gggcaaccca
     1501 atgagtatga cctgaacgac agctttctag atgatgagga agaagactat gagccaacag
     1561 atgaagattc tgactgggaa ccaggaaagg aagatgaaga gaaggaagat gtggaagagc
     1621 ttttgaaaga agcaaaaagg tttatgaaaa gaaaatagta actaacttct gtagtcatat
     1681 ctgccttaca tttacttttt ctttgtggga aaacatttat gaactaagga ggtacttaag
     1741 tgacagttat tttccttctt ttgatagtta atgactttta ctactgactc tttacaaatg
     1801 agacattgaa acgtcagcct tcagtataat agatggaatt ttttgttgcc ttgccttttc
     1861 tttcctactt aaagcatctg gaaataggaa gacagagaag gtagagtaaa aatggatatt
     1921 ttttatgtac tttgtatatt ggtaataaaa agtaaaagcc ataggttgca taaaactttg
     1981 gtctttttaa atttttttcc aaagattatg gagtactctg caagtataac caggcaaagt
     2041 acatgaaaac atgcctcttt tcattgttac tgaagtaatt ctgtaagcag aaccaggctt
     2101 taaaaaatgc agccattaca taatggcgtt ttttttctag taccttagat gtgagctttg
     2161 caatttcact tactactttt atccttcaaa acttgggaag attggttcca aatgggtact
     2221 gaaaatagat tcaactaggc tgtagaagag aaatgttcac catgtaggga atgttatttt
     2281 gtgttccaca tctgcaattt actgtatgtt tgaatttaaa agtaaaaaaa ctcaaaaaac
     2341 ttgacctttt ttttcaaact aaagtcctag ctgattattt taactctcag tgtgctgtct
     2401 ttatattaag aatagagaaa cgacataact ttctcaaatg gcattagttt tgccttagtt
     2461 tttttgacct cagatgaaag tagcaagttg tttgccagtt gaaagttcaa gattctggcc
     2521 tcccatatca agaagaaacc acttattctc cagttttaca aagtatcttt ctgtatttct
     2581 gttctaaaac tactttaagc cctataatgc tcttcatatt acgtgacttt gaattgttaa
     2641 aatgtagttt agaatcttta ataacagttg ggttgtgtac acagagcagg agatatttat
     2701 tgaaatcata aatcactaag ccaaaagaat ctttgttaaa tgctattatt ttctactcta
     2761 aaggaaccta aaaatttata tcttaaaaga tcaaaacata tttataataa tattttctca
     2821 ttgctttaaa atatatctcc cagtaattat acaatattta atatttactt aaactggaac
     2881 aagaataaga taaacatttc tctagactat aacctattat tctgattttt atctcttatt
     2941 gttatttggg aactattttt ccccttataa tgtcccttta gccttggtag tataacaaat
     3001 ctacgaatgt aatatttaac ttattctcat ccctgccact aacttaatcc ttgaagtgtt
     3061 acataatcta ctcctaggta tatacccaga ggaattgaaa acatttccac ataaaaactg
     3121 tacacaatgt caatagaagc attattctta acagccaaaa agtataaaca cccaaagtct
     3181 atcaactatc aactgggtga atggataagc aaaacgtggt atatctgtgc aatggaatat
     3241 tatttggtca taaaaaggaa taaagtactg atacatgcta caagatagat gaaccttgaa
     3301 ggcaagtaaa agaagccaga cacaaaaggc cacatactgt atgaactcat ttatatgaaa
     3361 tttccaaaac aggcaaatcc ataaagacag aaagcaaatt tgtggttgcc agggacttgg
     3421 ggaggaagaa tggggagtga ctgctaatga gtataggatt tccttttgag tgataaaaat
     3481 gttctaagat tagatggcag tgatggttgc acaactctgt aaatatatta caaacaatgc
     3541 attgtacact ttcaaagggt agatttcatt gcacgtgaac tatatatttc aataaatctg
     3601 tcacaaaaaa gtaatgcaaa gggtaatacc agtcttttag aaggaaaaaa atatttaata
     3661 tcaaaggata ttcttcgtag taatgtaata gttcatggta gctgctttta acgatacagt
     3721 tttaatgcaa ctttcataat catcctgaag acacttttga gtataactat tgtataaata
     3781 aatgtatagc ttaattgaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3841 aaaaaaaaaa aaaaaaaaaa aaaa
//