LOCUS       BC117257                3859 bp    mRNA    linear   HUM 03-NOV-2008
DEFINITION  Homo sapiens diaphanous homolog 1 (Drosophila), mRNA (cDNA clone
            MGC:150866 IMAGE:40125808), complete cds.
ACCESSION   BC117257
VERSION     BC117257.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3859)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3859)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (26-MAY-2006) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Mike Brownstein, NIMH
            cDNA Library Preparation: British Columbia Cancer Research Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui, Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRCB Plate: 6 Row: G Column: 1.
FEATURES             Location/Qualifiers
     source          1..3859
                     /db_xref="H-InvDB:HIT000387547"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:150866 IMAGE:40125808"
                     /tissue_type="Lung and heart, PCR rescued clones"
                     /clone_lib="NIH_MGC_317"
                     /note="Vector: pCR-XL-TOPO with reversed insert; Clone
                     identification sequence tag: CATGTCCA"
     gene            1..3859
                     /gene="DIAPH1"
                     /gene_synonym="DIA1"
                     /gene_synonym="DRF1"
                     /gene_synonym="hDIA1"
                     /gene_synonym="LFHL1"
                     /db_xref="GeneID:1729"
                     /db_xref="HGNC:HGNC:2876"
                     /db_xref="MIM:602121"
     CDS             19..3807
                     /gene="DIAPH1"
                     /gene_synonym="DIA1"
                     /gene_synonym="DRF1"
                     /gene_synonym="hDIA1"
                     /gene_synonym="LFHL1"
                     /codon_start=1
                     /product="DIAPH1 protein"
                     /protein_id="AAI17258.1"
                     /db_xref="GeneID:1729"
                     /db_xref="HGNC:HGNC:2876"
                     /db_xref="MIM:602121"
                     /translation="MEPPGGSLGPGRGTRDKKKGRSPDELPSAGGDGGKSKKFLERFT
                     SMRIKKEKEKPNSAHRNSSASYGDDPTAQSLQDVSDEQVLVLFEQMLLDMNLNEEKQQ
                     PLREKDIIIKREMVSQYLYTSKAGMSQKESSKSAMMYIQELRSGLRDMPLLSCLESLR
                     VSLNNNPVSWVQTFGAEGLASLLDILKRLHDEKEETAGSYDSRNKHEIIRCLKAFMNN
                     KFGIKTMLETEEGILLLVRAMDPAVPNMMIDAAKLLSALCILPQPEDMNERVLEAMTE
                     RAEMDEVERFQPLLDGLKSGTTIALKVGCLQLINALITPAEELDFRVHIRSELMRLGL
                     HQVLQDLREIENEDMRVQLNVFDEQGEEDSYDLKGRLDDIRMEMDDFNEVFQILLNTV
                     KDSKAEPHFLSILQHLLLVRNDYEARPQYYKLIEECISQIVLHKNGADPDFKCRHLQI
                     EIEGLIDQMIDKTKVEKSEAKAAELEKKLDSELTARHELQVEMKKMESDFEQKLQDLQ
                     GEKDALHSEKQQIATEKQDLEAEVSQLTGEVAKLTKELEDAKKEMASLSAAAITVPPS
                     VPSRAPVPPAPPLPGDSGTIIPPPPAPGDSTTPPPPPPPPPPPPLPGGVCISSPPSLP
                     GGTAISPPPPLSGDATIPPPPPLPEGVGIPSPSSLPGGTAIPPPPPLPGSARIPPPPP
                     PLPGSAGIPPPPPPLPGEAGMPPPPPPLPGGPGIPPPPPFPGGPGIPPPPPGMGMPPP
                     PPFGFGVPAAPVLPFGLTPKKLYKPEVQLRRPNWSKLVAEDLSQDCFWTKVKEDRFEN
                     NELFAKLTLTFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDSKTAQNLSIFLGS
                     FRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKDEYDDLAESEQFG
                     VVMGTVPRLRPRLNAILFKLQFSEQVENIKPEIVSVTAACEELRKSESFSNLLEITLL
                     VGNYMNAGSRNAGAFGFNISFLCKLRDTKSTDQKMTLLHFLAELCENDYPDVLKFPDE
                     LAHVEKASRVSAENLQKNLDQMKKQISDVERDVQNFPAATDEKDKFVEKMTSFVKDAQ
                     EQYNKLRMMHSNMETLYKELGEYFLFDPKKLSVEEFFMDLHNFRNMFLQAVKENQKRR
                     ETEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLEALQSGAAFRR
                     KRGPRQANRKAGCAVTSLLASELTKDDAMAAVPAKVSKNSETFPTILEEAKELVGRAS
                     "
BASE COUNT     1037 a    939 c    986 g    897 t
ORIGIN
        1 gccagcgtga accgggacat ggagccgccc ggcgggagcc tggggcccgg ccgcgggacc
       61 cgggacaaga agaagggccg gagcccagat gagctgccct cggcgggcgg cgacggcggc
      121 aaatctaaga aatttctgga gagatttacc agcatgagaa ttaagaagga gaaggaaaag
      181 cccaattctg ctcatagaaa ttcttctgca tcatatgggg atgatcccac agcacagtca
      241 ttgcaagatg tttcagatga acaagtgctg gttctctttg aacagatgct gctggatatg
      301 aacctgaatg aggagaaaca gcaacctttg agggagaagg acatcatcat caagagggag
      361 atggtgtccc aatacttgta cacctccaag gctggcatga gccagaagga gagctctaag
      421 tctgccatga tgtatattca ggagttgagg tcaggcttgc gggatatgcc tctgctcagc
      481 tgcctggagt cccttcgtgt gtctctcaac aacaaccctg tcagttgggt gcaaacattt
      541 ggtgctgaag gcttggcctc cttattggac attcttaaac gacttcatga tgagaaagaa
      601 gagactgctg ggagttacga tagccggaac aagcatgaga tcattcgctg cttgaaagct
      661 tttatgaaca acaagtttgg aatcaagacc atgttggaga cagaagaagg aatcctactg
      721 ctggtcagag ccatggatcc tgctgttccc aacatgatga ttgatgcagc taagctgctt
      781 tctgctcttt gtattctacc gcagccagag gacatgaatg aaagggtttt ggaggcaatg
      841 acagaaagag ctgagatgga tgaagtggaa cgtttccagc cgctgctgga tggattaaaa
      901 agtggaacca ctattgcact gaaggttgga tgcctacagc tgatcaatgc tctcatcaca
      961 ccagcggagg aacttgactt ccgagttcac atcagaagtg aactgatgcg tttggggcta
     1021 catcaggtgt tgcaggacct tcgagagatt gaaaatgaag atatgagagt gcaactaaat
     1081 gtgtttgatg aacaagggga agaggattcc tatgacctga agggacggct ggatgacatt
     1141 cgcatggaga tggatgactt taatgaagtc tttcagattc tcttaaacac agtgaaggat
     1201 tcaaaggcag agccacactt cctttccatc ctgcagcact tactcttggt ccgaaatgac
     1261 tatgaggcca gacctcagta ctataagttg attgaagaat gtatttccca gatagttctg
     1321 cacaagaacg gggctgatcc tgacttcaag tgccggcacc tccagattga gattgaggga
     1381 ttaattgatc aaatgattga taagacaaag gtggagaaat ctgaagccaa agctgcagag
     1441 ctggaaaaga agttggactc agagttaaca gcccgacatg agctacaggt ggaaatgaaa
     1501 aagatggaaa gtgactttga gcagaagctt caagatcttc agggagaaaa agatgcactg
     1561 cattctgaaa agcagcaaat tgccacagag aaacaggacc tggaagcaga ggtgtcccag
     1621 ctcacaggag aggttgccaa gctgacaaag gaactggaag atgccaagaa agaaatggct
     1681 tccctctctg cggcagctat tactgtacct ccttctgttc ctagtcgtgc tcctgttccc
     1741 cctgcccctc ctttacctgg tgactctggc actattattc caccaccacc tgctcctggg
     1801 gatagtacca ctcctcctcc tcctcctcct cctcctcctc cacctccttt gcctgggggt
     1861 gtttgcatct cctcaccccc ttctttacct ggaggtactg ctatctctcc accccctcct
     1921 ttgtctgggg atgctaccat ccctccaccc cctcctttgc ctgagggtgt tggcatccct
     1981 tcaccctctt ctttgcctgg aggtactgcc atccccccac ctcctccttt gcctgggagt
     2041 gctagaatcc ccccaccacc acctcctttg cctgggagtg ctggaattcc ccccccacct
     2101 cctcccttgc ctggagaagc aggaatgcca cctcctcctc cccctcttcc tggtggtcct
     2161 ggaatccctc cacctcctcc atttcccgga ggccctggca ttcctccacc tccacccgga
     2221 atgggtatgc ctccacctcc cccatttgga tttggagttc ctgcagcccc agttctgcca
     2281 tttggattaa cccccaaaaa gctttataag ccagaggtgc agctccggag gccaaactgg
     2341 tccaagcttg tggctgagga cctctcccag gactgcttct ggacaaaggt gaaggaggac
     2401 cgctttgaga acaatgaact tttcgccaaa cttaccctta ccttctctgc ccagaccaag
     2461 acttccaaag ccaagaagga tcaagaaggt ggagaagaaa agaaatctgt gcaaaagaaa
     2521 aaagtaaaag agttaaaggt gttggattca aagacagccc agaatctctc aatctttttg
     2581 ggttccttcc gcatgcccta tcaagagatt aagaatgtca tcctggaggt gaatgaggct
     2641 gttctgactg agtctatgat ccagaacctc attaagcaaa tgccagagcc agagcagtta
     2701 aaaatgcttt ctgaactgaa ggatgaatat gatgacctgg ctgagtcaga gcagtttggc
     2761 gtggtgatgg gcactgtgcc ccgactgcgg cctcgcctca atgccattct cttcaagcta
     2821 caattcagcg agcaagtgga gaatatcaag ccagagattg tgtctgtcac tgctgcatgt
     2881 gaggagttac gtaagagtga gagcttttcc aatctcctag agattacctt gcttgttgga
     2941 aattacatga atgctggctc cagaaatgct ggtgcttttg gcttcaatat cagcttcctc
     3001 tgtaagcttc gagacaccaa gtccacagat cagaagatga cgttgttaca cttcttggct
     3061 gagttgtgtg agaatgacta tcccgatgtc ctcaagtttc cagacgagct tgcccatgtg
     3121 gagaaagcca gccgagtttc tgctgaaaac ttgcaaaaga acctagatca gatgaagaaa
     3181 caaatttctg atgtggaacg tgatgttcag aatttcccag ctgccacaga tgaaaaagac
     3241 aagtttgttg aaaaaatgac cagctttgtg aaggatgcac aggaacagta taacaagctg
     3301 cggatgatgc attctaacat ggagaccctc tataaggagc tgggcgagta cttcctcttt
     3361 gaccccaaga agttgtctgt tgaagaattt ttcatggatc ttcacaattt tcggaatatg
     3421 tttttgcaag cagtcaagga gaaccagaag cggcgggaga cagaagaaaa gatgaggcga
     3481 gcaaaactag ccaaggagaa ggcagagaag gagcggctag agaagcagca gaagagagag
     3541 caactcatag acatgaatgc agagggcgat gagacaggtg tgatggacag tcttctagaa
     3601 gccctgcagt caggggcagc attccgacgg aagagagggc cccgtcaagc caacaggaag
     3661 gccgggtgtg cagtcacatc tctgctagct tcggagctga ccaaggatga tgccatggct
     3721 gctgttcctg ccaaggtgtc caagaacagt gagacattcc ccacaatcct tgaggaagcc
     3781 aaggagttgg ttggccgtgc aagctaatgt gggtcctgtg accgcggcag ctcctcagcg
     3841 gagccgcaga ctgtcctgc
//