LOCUS BC026115 2133 bp mRNA linear HUM 15-APR-2009 DEFINITION Homo sapiens chromosome 1 open reading frame 228, mRNA (cDNA clone MGC:33556 IMAGE:4822241), complete cds. ACCESSION BC026115 VERSION BC026115.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2133) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2133) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (26-MAR-2002) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: Miklos Palkovits, M.D., Ph.D. cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki Toshiyuki and Piero Carninci (RIKEN) cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Institute for Systems Biology http://www.systemsbiology.org contact: amadan@systemsbiology.org Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 46 Row: j Column: 17 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 51972195. FEATURES Location/Qualifiers source 1..2133 /db_xref="H-InvDB:HIT000258990" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:33556 IMAGE:4822241" /tissue_type="Testis" /clone_lib="NIH_MGC_97" /lab_host="DH10B" /note="Vector: pBluescriptR" gene 1..2133 /gene="C1orf228" /gene_synonym="p40" /db_xref="GeneID:339541" /db_xref="HGNC:HGNC:34345" CDS 179..1102 /gene="C1orf228" /gene_synonym="p40" /codon_start=1 /product="non-protein coding RNA 82" /protein_id="AAH26115.1" /db_xref="GeneID:339541" /db_xref="HGNC:HGNC:34345" /translation="MTSIKEQAAISRLLSFLQEWDNAGKVARSHILDKFIETNQGKTA PELEQEFSQGASLFLVRLTTSLRITYMTDSCLEKLLRSIGIFLSAVSSNRYLIEFLEV GGVLTLLEILGLEKIKEEAKKESVKLLQVIANSGRTYKELICESYGVRSIAEFLAKSK SEETQEEVQVLLDSLVHGNPKYQNQVYKGLIALLPCESPKAQQLSLQTLRTAQPIIGT THPSIVDCVLKVLGTMHLEVQYEAIELIKDLVGYDVRQALLKGLVALLIPSVKEISKL QAKILSDPSVLQLTPSLPMFLQQAAAAKAIG" BASE COUNT 490 a 619 c 608 g 416 t ORIGIN 1 agcggaggcg gctgaagagt tgctgtattc tgggaatggg cagggtcact cgtccgaaca 61 gagtcctatc ctacgcggcg gaagagtgtg ccctttgact tgcatcgtct acctacctct 121 gccatcctct accagccgca actgcgaggg ctggagccaa cttcaggact gattgatcat 181 gacttctata aaggagcagg cagcaattag caggctctta agttttttac aggagtggga 241 caacgctggc aaagtcgcaa ggagtcacat cctcgacaag ttcattgaaa ccaaccaagg 301 caagactgcc cctgaactgg agcaggagtt ttcccaggga gccagtttgt tcctggtacg 361 cttgaccacc tcgcttagaa tcacctatat gactgactca tgtttagaaa agcttctcag 421 gtccattggc atcttcttat cagctgtaag cagtaatcgg taccttatag aatttcttga 481 ggttggaggt gtcctaaccc tcttggaaat acttgggcta gagaagatca aggaggaggc 541 caagaaggaa tctgtcaaac tacttcaggt tattgcgaac tctggcagga catacaagga 601 actcatttgt gaaagctatg gtgtacgatc catagcagaa tttttggcaa agtctaagtc 661 agaagagacc caggaggaag tgcaggttct gttggattct ttggtccacg gcaatcccaa 721 gtaccaaaat caagtgtata aaggtctaat agctttgctg ccctgcgagt ccccaaaagc 781 ccagcagctg tccctgcaga ctctcaggac tgcccagcca atcattggga ccacacaccc 841 cagcatcgtg gactgcgtgc tgaaggtgct gggcacgatg cacctggaag tccagtatga 901 agccatcgag ttgatcaaag acctggtcgg ttacgatgtg cgccaggcgc tgctcaaggg 961 cctcgtggcg ctgctgatac cgtcggtcaa ggagatctcc aaactgcagg ccaagatcct 1021 cagtgacccc tcggttctcc agctcacccc cagcctgccg atgtttttgc agcaggccgc 1081 ggccgccaag gccatcgggt aagcgggcag gggttagtgg gtagctgcag caagcctggc 1141 ttggcgctgc cggcgggccc cgggagcgct ccgtgcgccg ggtgggcggg ggtgtgcgcc 1201 gggtgagccc cagggcgtcg ccccagcccg aacccccggc ccagggtcct ggcgcgcaac 1261 gacatgagca tcgccgagga gctgctgtac ctgcgcgtgg tgcgtggcct aatggccgcc 1321 atgggcaaca cggaccacag caacagccag cggctggcca gcctcacgct ggagatgttc 1381 cccttggtgg cggagcacgt gcgcaagtgc atgggggagg aactctacca gctcttcctg 1441 gtaagtgcgc ccttcctgcc ccgccgcaat gagcagatgg cggctcggac agtgtgatgc 1501 cccttcagac agtccccatc ctggagcgcg ccacatgcag agtggacctg gccaccagct 1561 gcaggaggga ctgctctggc gtaggctcct ccacccgcca ccttcctgtt ccctttcctg 1621 ccctttcggt caggctgccg acccgcaccc cacctgcaac atccctctgc caagcccaac 1681 tccaagtcca gactgccctg gcaccccagc cgggtccccc ttgctcctgt cctcagagca 1741 acgctgagga cttgtacatg aaaatagaca gcattcaggc ggacatcttg gcggccaaca 1801 cagtcaatgt taccaaagcc ctgtgcctcc atggcagctc ctacagcatg aacactctct 1861 atggctcgcg cgattcggct cagatggcct acctcacaca cttcgaggag gatgtagaat 1921 caaaggagta acagcccctg tggcaaacca ggaaggccaa ggctgcgggg cagggaagcc 1981 tggcaagagg aaggcgcctg gggtcaagct cagagccact ccacttggct ccagggggga 2041 gacggggatt aggcatccca gaggggcaga ggaagagccg ctggctgcga agagtcaata 2101 aacagccttg atacctgaaa aaaaaaaaaa aaa //