LOCUS       BC032859                3038 bp    mRNA    linear   HUM 17-JUL-2006
DEFINITION  Homo sapiens chromosome 1 open reading frame 101, mRNA (cDNA clone
            MGC:33370 IMAGE:5269307), complete cds.
ACCESSION   BC032859
VERSION     BC032859.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3038)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3038)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (07-JUN-2002) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Miklos Palkovits, M.D., Ph.D.
            cDNA Library Preparation: Michael J. Brownstein (NHGRI) & Shiraki
            Toshiyuki and Piero Carninci (RIKEN)
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Institute for Systems Biology
            http://www.systemsbiology.org
            contact: amadan@systemsbiology.org
            Anup Madan, Jessica Fahey, Erin Helton, Mark Ketteman, Anuradha
            Madan, Stephanie Rodrigues, Amy Sanchez and Michelle Whiting
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Series: IRAK Plate: 47 Row: o Column: 17
            This clone was selected for full length sequencing because it
            passed the following selection criteria: matched mRNA gi:
            31341128.
FEATURES             Location/Qualifiers
     source          1..3038
                     /db_xref="H-InvDB:HIT000051084"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:33370 IMAGE:5269307"
                     /tissue_type="Testis"
                     /clone_lib="NIH_MGC_97"
                     /lab_host="DH10B"
                     /note="Vector: pBluescriptR"
     gene            1..3038
                     /gene="C1orf101"
                     /gene_synonym="MGC33370"
                     /gene_synonym="RP11-523K4.1"
                     /db_xref="GeneID:257044"
                     /db_xref="HGNC:HGNC:28491"
     CDS             45..2543
                     /gene="C1orf101"
                     /gene_synonym="MGC33370"
                     /gene_synonym="RP11-523K4.1"
                     /codon_start=1
                     /product="chromosome 1 open reading frame 101"
                     /protein_id="AAH32859.1"
                     /db_xref="GeneID:257044"
                     /db_xref="HGNC:HGNC:28491"
                     /translation="MSAREVAVLLLWLSCYGSALWRYSTNSPNYRIFSTRSTIKLEYE
                     GTLFTEWSVPETCFVLNKSSPTTELRCSSPGVHAIKPIVTGPDEEERYLFVESSHTCF
                     LWYYRVRHFFNNFTQLITVWAYDPESADPDELLGNAEEPSINSIVLSTQMATLGQKPV
                     IHTVLKRKVYSSNEKMRRGTWRIVVPMTKDDALKEIRGNQVTFQDCFIADFLILLTFP
                     LLTIPEIPGYLPISSPRGSQLMASWDACVVASAVLVTDMETFHTTDSFKSWTRIRVPP
                     DILSDDERRSVAHVILSRDGIVFLINGVLYIKSFRGFIRLGGIVNLPDGGITGISSRK
                     WCWVNYLLKAKGRRSTFAVWTENEIYLGSILLKFARLVTTTELKNILSLSVTATLTID
                     RVEYTGHPLEIAVFLNYCTVCNVTKKIFLVIYNEDTKQWVSQDFTLDAPIDSVTMPHF
                     TFSALPGLLLWNKHSIYYCYHNFTFTGILQTPAGHGNLSMLSNDSIIHEVFIDYYGDI
                     LVKMENNVIFYSKINTRDAVKLHLWTNYTTRAFIFLSTSGQTYFLYALDDGTIQIQDY
                     PLHLEAQSIAFTTKDKCPYMAFHNNVAHVFYFLDKGEALTVWTQIVYPENTGLYVIVE
                     SYGPKILQESHEISFEAAFGYCTKTLTLTFYQNVDYERISDYFETQDKHTGLVLVQFR
                     PSEYSKACPIAQKVFQIAVGCDDKKFIAIKGFSKKGCHHHDFSYVIEKSYLRHQPSKN
                     LRVRYIWGEYGCPLRLDFTEKFQPVVQLFDDNGYVKDVEANFIVWEIHGRDDYSFNNT
                     MAQSGCLHEAQTWKSMIELNKHLPLEEVWGPEFL"
BASE COUNT      956 a    541 c    615 g    926 t
ORIGIN
        1 agcgtgagtg gccgaggcgg ttgggcggag gcggagcagg cgccatgtca gcccgggaag
       61 tggccgtgct gctgctgtgg ctgagctgct atggctccgc cctttggagg tattccacta
      121 acagcccaaa ctatcgcatt tttagtacca gaagtactat taagttagag tatgaaggaa
      181 cattatttac tgagtggagt gtgccagaaa cttgttttgt gctaaataaa agctcaccca
      241 cgacagaatt gcgttgttcc tcacctggtg ttcacgctat aaaaccaatt gttactggcc
      301 cagatgaaga agaacgctat ttatttgtgg aaagttctca tacttgcttt ctgtggtact
      361 atagagttag acatttcttt aacaacttta cccagcttat cactgtgtgg gcatatgatc
      421 cagaaagtgc agatcctgat gagttgctgg ggaatgcaga agaaccttca ataaattcca
      481 tagtactcag cacacagatg gccacattgg gacagaagcc tgtcatacat acagttctga
      541 agagaaaagt ttattcttca aatgagaaaa tgagaagggg tacctggcgt attgtagtac
      601 caatgacaaa agatgatgca ctaaaggaga ttagaggaaa ccaagttact tttcaggatt
      661 gctttattgc agattttctt attctgttga cttttccttt gttgaccata cctgaaattc
      721 ctggttattt accaatctcc tcaccacgtg gtagtcaatt aatggcttcc tgggatgctt
      781 gtgtagttgc atctgctgtt ttggtgacag atatggagac ctttcacaca actgattcat
      841 tcaaatcttg gaccagaatc agagtgcctc cagacattct gagtgatgat gaaagacgga
      901 gtgtggctca tgtgatctta tcgcgggatg gaatcgtttt tcttataaat ggtgttcttt
      961 acataaagag ttttcgtgga tttataagac tgggaggaat tgtaaatctt cctgatggtg
     1021 gaattactgg catttcatca agaaaatggt gttgggtcaa ttatttatta aaggctaaag
     1081 gaagaagaag cacctttgca gtctggacag aaaatgaaat ttacctcgga tccattcttc
     1141 ttaagtttgc cagattagta actaccacag aactgaaaaa catcctaagt ctatcggtga
     1201 ctgctactct gaccatagac agggttgagt atacaggaca ccctctggag attgctgtgt
     1261 ttttaaatta ttgcactgta tgtaacgtca ccaaaaagat tttcttagtg atatataatg
     1321 aagatacaaa acagtgggtt tcccaagact ttacattaga tgcccctatt gacagtgtta
     1381 ccatgccaca ttttacattt tcagcactgc caggattact gctatggaac aagcatagta
     1441 tctactattg ttaccataat ttcaccttta ctgggatttt acagacacct gcaggacatg
     1501 gaaatctatc aatgctatca aatgacagca ttattcatga agttttcata gattattatg
     1561 gagatatttt ggtaaaaatg gaaaataatg taatatttta ttccaagatt aatactagag
     1621 atgcagtaaa gctgcattta tggacaaatt acacaacaag agcattcatt ttcttaagta
     1681 catctggtca aacatatttc ctgtatgctt tggatgatgg cacaatacaa atacaggact
     1741 atcccttaca tctggaagca caaagtatag ctttcacaac aaaagacaaa tgcccataca
     1801 tggcatttca taacaatgtt gctcatgttt tttacttttt ggacaaggga gaggctctga
     1861 cagtttggac tcagatcgtc tatccagaaa acactggtct gtatgttatt gtggaatctt
     1921 atggcccaaa aatattacaa gagagtcatg agatttcctt tgaagctgcc tttggatact
     1981 gcaccaaaac tctgacacta acattttatc agaatgtaga ttatgagaga atatctgatt
     2041 actttgagac acaagacaag cacacgggtc ttgtgctggt tcagtttcga cctagtgaat
     2101 attcaaaagc atgtccaata gcccaaaagg tgttccaaat agctgttggc tgtgatgata
     2161 aaaaattcat tgcaattaaa ggatttagta aaaaaggatg tcatcaccat gatttttcat
     2221 acgtgattga aaagtcatat ctgaggcatc agccatcgaa aaacttgaga gtaaggtata
     2281 tttggggaga atatggctgc cctctgaggc ttgacttcac agaaaagttt caacctgtgg
     2341 ttcaactatt tgatgataat ggctatgtta aagacgttga agcaaatttc atagtgtggg
     2401 aaatacacgg cagggatgac tatagcttta ataatactat ggcacagagt ggttgtttac
     2461 atgaagcaca gacatggaag tcaatgattg aacttaacaa gcacctccca ctagaagaag
     2521 tctggggacc tgagtttctg taacctaaca gctatgtttg caatagagac atttggactg
     2581 attcccagtc caagtgtcta cctggtagct tctttcctct tcgtcctgat gctgctcttc
     2641 ttcactattc ttgttttgag ctactttcgg tacatgagga tttatagacg atatatttat
     2701 gaaccacttc acaaacctca aagaaaacgt aagaagaatt aggaaaactg aaagtttgtt
     2761 tattacagat atatgcatat agagaaacag tgtattacat agtgatattg agagtgtgtg
     2821 tttgaccaag aaatactaaa tataagctcg tagtagtagg catcaccaaa ttcaagatct
     2881 gaaaaatatt cttgaactat ctccaaaata gaaatgtttt catatatatt gttattaaat
     2941 taatcctttg tttgccttca ttttaaagat actctatgta ctctcacatg gcatgaaaaa
     3001 ataaactaaa tttgactatt acaaaaaaaa aaaaaaaa
//