LOCUS       BC038406                3375 bp    mRNA    linear   HUM 15-JUL-2006
DEFINITION  Homo sapiens chromosome 3 open reading frame 20, mRNA (cDNA clone
            MGC:35115 IMAGE:5167835), complete cds.
ACCESSION   BC038406
VERSION     BC038406.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3375)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3375)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (01-OCT-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Life Technologies, Inc.
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Baylor College of Medicine Human Genome
            Sequencing Center
            Center code: BCM-HGSC
            Web site: http://www.hgsc.bcm.tmc.edu/cdna/
            Contact: amg@bcm.tmc.edu
            Gunaratne, P.H., Garcia, A.M., Lu, X., Hulyk, S.W., Loulseged, H.,
            Kowis, C.R., Sneed, A.J., Martin, R.G., Muzny, D.M., Nanavati,
            A.N., Gibbs, R.A.
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 51 Row: a Column: 5
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 68163571.
FEATURES             Location/Qualifiers
     source          1..3375
                     /db_xref="H-InvDB:HIT000052026"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:35115 IMAGE:5167835"
                     /tissue_type="Brain, adult medulla"
                     /clone_lib="NIH_MGC_119"
                     /lab_host="DH10B"
                     /note="Vector: pCMV-SPORT6"
     gene            1..3375
                     /gene="C3orf20"
                     /gene_synonym="DKFZP434N1817"
                     /db_xref="GeneID:84077"
                     /db_xref="HGNC:HGNC:25320"
     CDS             373..3087
                     /gene="C3orf20"
                     /gene_synonym="DKFZP434N1817"
                     /codon_start=1
                     /product="chromosome 3 open reading frame 20"
                     /protein_id="AAH38406.2"
                     /db_xref="GeneID:84077"
                     /db_xref="HGNC:HGNC:25320"
                     /translation="MSYIKSNLELYQQYTAMAPKLLARISKLLMICQNAGISVPKGIR
                     NIFEFTWEELISDPSVPTPSDILGLEVSFGAPLVVLMEPTFVQVPTLKKPLPPPPPAP
                     PRPVLLATTGAAKRSTLSPTMARQVRTHQETLNRFQQQSIHLLTELLRLKMKAMVESM
                     SVGANPLDITRRFVEASQLLHLNAKEMAFNCLISTAGRSGYSSGQLWKESLANMSAIG
                     VNSPYQLIYHSSTACLSFSLSAGKEAKKKIGKSRTTEDVSMPPLHRGVGTPANSLEFS
                     DPCPEAREKLQELCRHIEAERATWKGRNISYPMILRNYKAKMPSHLMLARKGDSQTPG
                     LHYPPTAGAQTLSPTSHPSSANHHFSQHCQEGKAPKKAFKFHYTFYDGSSFVYYPSGN
                     VAVCQIPTCCRGRTITCLFNDIPGFSLLALFNTEGQGCVHYNLKTSCPYVLILDEEGG
                     TTNDQQGYVVHKWSWTSRTETLLSLEYKVNEEMKLKVLGQDSITVTFTSLNETVTLTV
                     SANNCPHGMAYDKRLNRRISNMDDKVYKMSRALAEIKKRFQKTVTQFINSILLAAGLF
                     TIEYPTKKEEEEFVRFKMRSRTHPERLPKLSLYSGESLLRSQSGHLESSIAETLKDEP
                     ESAPVSPVRKTTKIHTKAKVTSRGKAREGRSPTRWAALPSDCPLVLRKLMLKEDTRAG
                     CKCLVKAPLVSDVELERFLLAPRDPSQVLVFGIISSQNYTSTGQLQWLLNTLYNHQQR
                     GRGSPCIQCRYDSYRLLQYDLDSPLQEDPPLMVKKNSVVQGMILMFAGGKLIFGGRVL
                     NGYGLSKQNLLKQIFRSQQDYKMGYFLPDDYKFSVPNSVLSLEDSESVKKAESEDIQG
                     SSSSLALEDYVEKELSLEAEKTREPEVELHPLSRDSKITSWKKQASKK"
BASE COUNT          944 a          937 c          801 g          693 t
ORIGIN      
        1 gtccttttaa gtcagtaaat tgaactaagt cggttattcg gcaagcagtt cctataaaaa
       61 actacatggc taagatgtac acgttggata attccatgcc catcttggca ttgcgaccag
      121 gagcttctgc atcccttgct tccaaagaga accagtgaaa catcttttta ttcttgcagg
      181 ttcttaatga ttgaccacaa gcagatcttt caccctcgga tctctagcta caaaaggaac
      241 cactggctca atgacctgta agggccgttt cagcacatcc attctgtcca tctccaagcc
      301 ttcaccgtag ggaagaactt ttgctctcag tcacctctca gagagctctc tttatagctg
      361 aaggtccctc tcatgagtta catcaagagt aacctagaat tatatcagca atacacagcc
      421 atggccccca agctactggc ccgcatctcc aaactcctca tgatctgcca gaatgcaggc
      481 atttctgtac caaaaggcat cagaaacatc tttgagttca cttgggaaga gctcatcagt
      541 gacccttcag tgcctacccc gtccgacatc ttgggcctgg aggtcagctt tggagccccc
      601 ctggtggtgc tcatggaacc cacctttgtg caggtcccca cactgaagaa gccactacct
      661 ccaccaccac cagcaccacc acgtccagtg ctgctggcaa ccactggggc agccaagcgc
      721 tccaccctct ctcccaccat ggcccgtcag gtgcgcaccc accaggagac cctgaacagg
      781 tttcagcagc agtccatcca cctgctgacg gagctcctca gactgaagat gaaggccatg
      841 gtggagtcta tgtcggtggg tgccaacccc ttggacatca ccaggcgctt tgtggaggcc
      901 agccagctcc tccacctcaa tgccaaggag atggccttca actgcctgat cagcacagcc
      961 gggagaagtg gctacagcag cggacagttg tggaaagagt ccctcgcaaa catgtccgcc
     1021 attggggtga actcgcctta ccagctgatc taccactctt ccacagcctg tctgagcttt
     1081 tctctctctg ctggaaaaga agccaagaag aaaataggca aatctagaac tacagaagat
     1141 gtcagcatgc cgcccctgca tcgaggagtg ggaacccctg ccaacagcct ggagttcagc
     1201 gacccctgcc ctgaggcccg ggagaagctg caggagttgt gtcgccacat agaagctgaa
     1261 agggccacat ggaaagggag gaatatctcc taccccatga tcttacgaaa ctacaaggca
     1321 aagatgccct ctcatctaat gttggcccgc aaaggagact ctcagacccc gggtttacat
     1381 taccctccca ctgcaggtgc tcagactctc agccccacct ctcacccatc ttctgccaac
     1441 catcatttca gtcagcattg tcaagagggg aaggcaccca agaaggcctt caagtttcat
     1501 tacaccttct atgatggctc ctccttcgtt tactatccct ctggaaacgt cgctgtatgt
     1561 cagatcccca catgctgcag agggagaacc atcacctgcc tctttaatga catacctgga
     1621 ttctccttgc tggccctatt caatactgaa ggccagggct gtgttcacta caacctaaaa
     1681 accagttgcc catatgtctt aatcttggat gaggaaggtg ggaccaccaa tgaccagcag
     1741 ggctatgtag tccacaagtg gagctggact tccaggacag agaccctgct ttccctggaa
     1801 tacaaggtga atgaggaaat gaaactaaag gtactgggac aggactccat cacagtcacc
     1861 ttcacctccc tgaatgagac agtaacactc actgtgtcgg ccaacaattg tccccatgga
     1921 atggcatatg acaaacggct gaaccgcaga atcagcaaca tggacgacaa ggtgtataag
     1981 atgagccgag ccctggctga gatcaagaag cggtttcaga agacagtgac tcagttcatt
     2041 aattctatct tgctggccgc aggtctgttt accattgaat atcccaccaa aaaggaggag
     2101 gaagaatttg ttcggttcaa gatgagatcc agaactcatc ccgagcggct ccccaagcta
     2161 agtttatact caggagaaag tcttttacga tctcagtcag gccacctgga atcctcaatt
     2221 gcagagactt tgaaggatga gcctgagtct gctcctgtga gcccagttcg gaagaccacc
     2281 aaaatccaca ccaaagccaa ggtcacatcc agagggaagg cccgcgaggg gcgcagcccc
     2341 accaggtggg cggccttgcc ctcagactgc ccgctggtgc tgcggaagct catgctcaag
     2401 gaagacaccc gtgctggctg caagtgcctg gtgaaggcgc ccctggtctc tgacgtggag
     2461 ctggagcgct tcctgttggc gccccgagac cccagccaag tgctggtgtt tgggatcatc
     2521 tcaagccaga actacaccag cactgggcag ctccagtggc tgctgaacac tctctacaac
     2581 caccagcagc ggggccgtgg ctccccctgc atccagtgcc ggtatgactc ctaccgcctg
     2641 ctgcagtatg acctggacag ccccctgcag gaggaccctc ccctgatggt gaagaagaac
     2701 tctgtggtgc aggggatgat tctgatgttt gccgggggga agctcatttt tgggggccgt
     2761 gttttgaatg gatatggcct cagcaagcag aatctgctga aacagatctt ccggtctcaa
     2821 caggattaca agatgggcta cttcctgccg gatgactaca aattcagtgt tcccaactct
     2881 gtcctgagcc tggaggattc tgaatcagtc aagaaagccg agtcagaaga tatccaagga
     2941 agcagctcct cattggccct ggaagactat gtggagaagg agttatctct ggaggctgag
     3001 aagacaagag agcctgaagt ggagctacat cctctcagca gggacagcaa gataactagt
     3061 tggaagaagc aggcctccaa gaagtagcgc catcctggca gcagccaagt gagccaggcc
     3121 ccggcccggg gtgctggggc ttcttgccag cccagccctg cctccccggt ctcccaccct
     3181 gtcctccaag cttctataat aaaccagcgg gcctccagca ttggggtgag gctctgggga
     3241 aggacagaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3301 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa
     3361 aaaaaaaaaa aaaaa
//