LOCUS BC016869 2142 bp mRNA linear HUM 15-JUL-2006 DEFINITION Homo sapiens chromosome 20 open reading frame 72, mRNA (cDNA clone MGC:17602 IMAGE:3850853), complete cds. ACCESSION BC016869 VERSION BC016869.1 KEYWORDS MGC. SOURCE Homo sapiens (human) ORGANISM Homo sapiens Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo. REFERENCE 1 (bases 1 to 2142) AUTHORS Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G., Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D., Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K., Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F., Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L., Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L., Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S., Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J., Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J., McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S., Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W., Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A., Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S., Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y., Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D., Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M., Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E., Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A. CONSRTM Mammalian Gene Collection Program Team TITLE Generation and initial analysis of more than 15,000 full-length human and mouse cDNA sequences JOURNAL Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) PUBMED 12477932 REFERENCE 2 (bases 1 to 2142) CONSRTM NIH MGC Project TITLE Direct Submission JOURNAL Submitted (05-NOV-2001) National Institutes of Health, Mammalian Gene Collection (MGC), Bethesda, MD 20892-2590, USA REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov COMMENT Contact: MGC help desk Email: cgapbs-r@mail.nih.gov Tissue Procurement: ATCC cDNA Library Preparation: Life Technologies, Inc. cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) DNA Sequencing by: Sequencing Group at the Stanford Human Genome Center, Stanford University School of Medicine, Stanford, CA 94305 Web site: http://www-shgc.stanford.edu Contact: (Dickson, Mark) mcd@paxil.stanford.edu Dickson, M., Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, R. M. Clone distribution: MGC clone distribution information can be found through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov Series: IRAK Plate: 20 Row: m Column: 16 This clone was selected for full length sequencing because it passed the following selection criteria: matched mRNA gi: 19923657. FEATURES Location/Qualifiers source 1..2142 /db_xref="H-InvDB:HIT000037734" /organism="Homo sapiens" /mol_type="mRNA" /db_xref="taxon:9606" /clone="MGC:17602 IMAGE:3850853" /tissue_type="Colon, adenocarcinoma" /clone_lib="NIH_MGC_65" /lab_host="DH10B" /note="Vector: pCMV-SPORT6" gene 1..2142 /gene="C20orf72" /gene_synonym="bA504H3.4" /gene_synonym="FLJ14597" /db_xref="GeneID:92667" /db_xref="HGNC:HGNC:16205" CDS 83..1117 /gene="C20orf72" /gene_synonym="bA504H3.4" /gene_synonym="FLJ14597" /codon_start=1 /product="chromosome 20 open reading frame 72" /protein_id="AAH16869.1" /db_xref="GeneID:92667" /db_xref="HGNC:HGNC:16205" /translation="MKMKLFQTICRQLRSSKFSVESAALVAFSTSSYSCGRKKKVNPY EEVDQEKYSNLVQSVLSSRGVAQTPGSVEEDALLCGPVSKHKLPNQGEDRRVPQNWFP IFNPERSDKPNASDPSVPLKIPLQRNVIPSVTRVLQQTMTKQQVFLLERWKQRMILEL GEDGFKEYTSNVFLQGKRFHEALESILSPQETLKERDENLLKSGYIESVQHILKDVSG VRALESAVQHETLNYIGLLDCVAEYQGKLCVIDWKTSEKPKPFIQSTFDNPLQVVAYM GAMNHDTNYSFQVQCGLIVVAYKDGSPAHPHFMDAELCSQYWTKWLLRLEEYTEKKKN QNIQKPEYSE" BASE COUNT 664 a 429 c 498 g 551 t ORIGIN 1 agcgaagtgt ggtggcttcc aaggaataca aacataaagg ccttcgaccg ttgcaaatag 61 actaaagtga aaacaaatct gaatgaagat gaagttattt cagaccattt gcaggcagct 121 caggagttca aagttttctg tggaatcagc tgcccttgtg gctttctcta cttcctctta 181 ctcatgtggc cggaagaaaa aagtgaaccc atatgaagaa gtggaccaag aaaaatactc 241 taatttagtt cagtctgtct tgtcatccag aggcgtcgcc cagaccccgg gatcggtgga 301 ggaagatgct ttgctctgtg gacccgtgag caagcataag ctgccaaacc aaggtgagga 361 cagacgagtg ccacaaaact ggtttcctat cttcaatcca gagagaagtg ataaaccaaa 421 tgcaagtgat ccttcagttc ctttgaaaat ccccttgcaa aggaatgtga taccaagtgt 481 gacccgagtc cttcagcaga ccatgacaaa acaacaggtt ttcttgttgg agaggtggaa 541 acagcggatg attctggaac tgggagaaga tggctttaaa gaatacactt caaacgtctt 601 tttacaaggg aaacggttcc acgaagcctt ggaaagcata ctttcacccc aggaaacctt 661 aaaagagaga gatgaaaatc tcctcaagtc tggttacatt gaaagtgtcc agcatattct 721 gaaagatgtc agtggagtgc gagctcttga aagtgctgtt caacatgaaa ccttaaacta 781 tataggtctg ctggactgtg tggctgagta tcagggcaag ctctgtgtga ttgattggaa 841 gacatcagag aaaccaaagc cttttattca aagtacattt gacaacccac tgcaagttgt 901 ggcatacatg ggtgccatga accatgatac caactacagc tttcaggttc aatgtggctt 961 aattgtggtg gcctacaaag atggatcacc tgcccaccca catttcatgg atgcagagct 1021 ctgttcccag tactggacca agtggcttct tcgactagaa gaatatacgg aaaagaaaaa 1081 gaaccagaat attcagaaac cagaatattc agaataggga gcaagttgct atttgggaac 1141 attcagcacc ttctcacagt ttgggaacat atattgctgt ttactccagt gtaaaaatga 1201 ggtgccactg gatctgagtg ctacacgaac acaagtagaa gtattaattt gttgaaatgt 1261 gttgttacca aaaagactga aaagccccaa agtctagata taaagaccta gacttcggca 1321 cgcgaaatcc cagctatgct acctcttatt tacctgaaag gaggacacgc aggatgggca 1381 gtcatgctgg tgactcttgt actcccttga gggacattgg gggggggggg gcgtggtccc 1441 aggcaggatg cccagtcttt gagctgagat tggaaggcag tgaggctgag ggtgccaaga 1501 tttccccagg gttcacccag aggggaaggg gctacatgcc cccagctgtg tgcagggagg 1561 acacatcagc ccactaccgc tgccaacacc aatgcctaaa acttgtttca tacattgggg 1621 ttttctatat atttcagctg ggaaaagctt acatttaacc ttttgaaaaa ataaatacgt 1681 gattagcctc aactaaacat tgctgactat aaagacagta tattcaccat gtcgctggca 1741 atatgtcatt gcgtaacacc aaataacccc ccagaagtag ccagaggcca gtttgaacat 1801 cacaattcta agtgttttag taactatttc tggcgtgagt caacagatca tgtagataga 1861 gtcaattatt gtttgtggag tttttcagct ataggggagg ggaactatta aaatccattt 1921 gtttctattc aataggtaat aaaaattagt tgtccctggg tttgggaaac ttaaatgccc 1981 attacagccc tggggaaggg ttttctgtct tatggagtga gtcttagcat ttaagttata 2041 cagttgctgc cttaaaatag tagcctgcta caatgacttc tttgggtagc cattttcata 2101 agaaataaaa tacaagatat gagtaaaaaa aaaaaaaaaa aa //