LOCUS       BC026115                2133 bp    mRNA    linear   HUM 15-APR-2009
DEFINITION  Homo sapiens chromosome 1 open reading frame 228, mRNA (cDNA clone
            MGC:33556 IMAGE:4822241), complete cds.
ACCESSION   BC026115
VERSION     BC026115.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 2133)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 2133)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (26-MAR-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) &  Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRAK Plate: 46 Row: j Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi: 51972195.
FEATURES             Location/Qualifiers
     source          1..2133
                     /db_xref="H-InvDB:HIT000258990"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:33556 IMAGE:4822241"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..2133
                     /gene="C1orf228"
                     /gene_synonym="p40"
                     /db_xref="GeneID:339541"
                     /db_xref="HGNC:HGNC:34345"
     CDS             179..1102
                     /gene="C1orf228"
                     /gene_synonym="p40"
                     /codon_start=1
                     /product="non-protein coding RNA 82"
                     /protein_id="AAH26115.1"
                     /db_xref="GeneID:339541"
                     /db_xref="HGNC:HGNC:34345"
                     /translation="MTSIKEQAAISRLLSFLQEWDNAGKVARSHILDKFIETNQGKTA
                     PELEQEFSQGASLFLVRLTTSLRITYMTDSCLEKLLRSIGIFLSAVSSNRYLIEFLEV
                     GGVLTLLEILGLEKIKEEAKKESVKLLQVIANSGRTYKELICESYGVRSIAEFLAKSK
                     SEETQEEVQVLLDSLVHGNPKYQNQVYKGLIALLPCESPKAQQLSLQTLRTAQPIIGT
                     THPSIVDCVLKVLGTMHLEVQYEAIELIKDLVGYDVRQALLKGLVALLIPSVKEISKL
                     QAKILSDPSVLQLTPSLPMFLQQAAAAKAIG"
BASE COUNT          490 a          619 c          608 g          416 t
ORIGIN      
        1 agcggaggcg gctgaagagt tgctgtattc tgggaatggg cagggtcact cgtccgaaca
       61 gagtcctatc ctacgcggcg gaagagtgtg ccctttgact tgcatcgtct acctacctct
      121 gccatcctct accagccgca actgcgaggg ctggagccaa cttcaggact gattgatcat
      181 gacttctata aaggagcagg cagcaattag caggctctta agttttttac aggagtggga
      241 caacgctggc aaagtcgcaa ggagtcacat cctcgacaag ttcattgaaa ccaaccaagg
      301 caagactgcc cctgaactgg agcaggagtt ttcccaggga gccagtttgt tcctggtacg
      361 cttgaccacc tcgcttagaa tcacctatat gactgactca tgtttagaaa agcttctcag
      421 gtccattggc atcttcttat cagctgtaag cagtaatcgg taccttatag aatttcttga
      481 ggttggaggt gtcctaaccc tcttggaaat acttgggcta gagaagatca aggaggaggc
      541 caagaaggaa tctgtcaaac tacttcaggt tattgcgaac tctggcagga catacaagga
      601 actcatttgt gaaagctatg gtgtacgatc catagcagaa tttttggcaa agtctaagtc
      661 agaagagacc caggaggaag tgcaggttct gttggattct ttggtccacg gcaatcccaa
      721 gtaccaaaat caagtgtata aaggtctaat agctttgctg ccctgcgagt ccccaaaagc
      781 ccagcagctg tccctgcaga ctctcaggac tgcccagcca atcattggga ccacacaccc
      841 cagcatcgtg gactgcgtgc tgaaggtgct gggcacgatg cacctggaag tccagtatga
      901 agccatcgag ttgatcaaag acctggtcgg ttacgatgtg cgccaggcgc tgctcaaggg
      961 cctcgtggcg ctgctgatac cgtcggtcaa ggagatctcc aaactgcagg ccaagatcct
     1021 cagtgacccc tcggttctcc agctcacccc cagcctgccg atgtttttgc agcaggccgc
     1081 ggccgccaag gccatcgggt aagcgggcag gggttagtgg gtagctgcag caagcctggc
     1141 ttggcgctgc cggcgggccc cgggagcgct ccgtgcgccg ggtgggcggg ggtgtgcgcc
     1201 gggtgagccc cagggcgtcg ccccagcccg aacccccggc ccagggtcct ggcgcgcaac
     1261 gacatgagca tcgccgagga gctgctgtac ctgcgcgtgg tgcgtggcct aatggccgcc
     1321 atgggcaaca cggaccacag caacagccag cggctggcca gcctcacgct ggagatgttc
     1381 cccttggtgg cggagcacgt gcgcaagtgc atgggggagg aactctacca gctcttcctg
     1441 gtaagtgcgc ccttcctgcc ccgccgcaat gagcagatgg cggctcggac agtgtgatgc
     1501 cccttcagac agtccccatc ctggagcgcg ccacatgcag agtggacctg gccaccagct
     1561 gcaggaggga ctgctctggc gtaggctcct ccacccgcca ccttcctgtt ccctttcctg
     1621 ccctttcggt caggctgccg acccgcaccc cacctgcaac atccctctgc caagcccaac
     1681 tccaagtcca gactgccctg gcaccccagc cgggtccccc ttgctcctgt cctcagagca
     1741 acgctgagga cttgtacatg aaaatagaca gcattcaggc ggacatcttg gcggccaaca
     1801 cagtcaatgt taccaaagcc ctgtgcctcc atggcagctc ctacagcatg aacactctct
     1861 atggctcgcg cgattcggct cagatggcct acctcacaca cttcgaggag gatgtagaat
     1921 caaaggagta acagcccctg tggcaaacca ggaaggccaa ggctgcgggg cagggaagcc
     1981 tggcaagagg aaggcgcctg gggtcaagct cagagccact ccacttggct ccagggggga
     2041 gacggggatt aggcatccca gaggggcaga ggaagagccg ctggctgcga agagtcaata
     2101 aacagccttg atacctgaaa aaaaaaaaaa aaa
//