LOCUS       BC143413                3889 bp    mRNA    linear   HUM 08-JAN-2009
DEFINITION  Homo sapiens diaphanous homolog 1 (Drosophila), mRNA (cDNA clone
            MGC:176938 IMAGE:9051921), complete cds.
ACCESSION   BC143413
VERSION     BC143413.1
KEYWORDS    MGC.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 3889)
  AUTHORS   Strausberg,R.L., Feingold,E.A., Grouse,L.H., Derge,J.G.,
            Klausner,R.D., Collins,F.S., Wagner,L., Shenmen,C.M., Schuler,G.D.,
            Altschul,S.F., Zeeberg,B., Buetow,K.H., Schaefer,C.F., Bhat,N.K.,
            Hopkins,R.F., Jordan,H., Moore,T., Max,S.I., Wang,J., Hsieh,F.,
            Diatchenko,L., Marusina,K., Farmer,A.A., Rubin,G.M., Hong,L.,
            Stapleton,M., Soares,M.B., Bonaldo,M.F., Casavant,T.L.,
            Scheetz,T.E., Brownstein,M.J., Usdin,T.B., Toshiyuki,S.,
            Carninci,P., Prange,C., Raha,S.S., Loquellano,N.A., Peters,G.J.,
            Abramson,R.D., Mullahy,S.J., Bosak,S.A., McEwan,P.J.,
            McKernan,K.J., Malek,J.A., Gunaratne,P.H., Richards,S.,
            Worley,K.C., Hale,S., Garcia,A.M., Gay,L.J., Hulyk,S.W.,
            Villalon,D.K., Muzny,D.M., Sodergren,E.J., Lu,X., Gibbs,R.A.,
            Fahey,J., Helton,E., Ketteman,M., Madan,A., Rodrigues,S.,
            Sanchez,A., Whiting,M., Madan,A., Young,A.C., Shevchenko,Y.,
            Bouffard,G.G., Blakesley,R.W., Touchman,J.W., Green,E.D.,
            Dickson,M.C., Rodriguez,A.C., Grimwood,J., Schmutz,J., Myers,R.M.,
            Butterfield,Y.S., Krzywinski,M.I., Skalska,U., Smailus,D.E.,
            Schnerch,A., Schein,J.E., Jones,S.J. and Marra,M.A.
  CONSRTM   Mammalian Gene Collection Program Team
  TITLE     Generation and initial analysis of more than 15,000 full-length
            human and mouse cDNA sequences
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002)
   PUBMED   12477932
REFERENCE   2  (bases 1 to 3889)
  CONSRTM   NIH MGC Project
  TITLE     Direct Submission
  JOURNAL   Submitted (12-JUN-2007) National Institutes of Health, Mammalian
            Gene Collection (MGC), Bethesda, MD 20892-2590, USA
  REMARK    NIH-MGC Project URL: http://mgc.nci.nih.gov
COMMENT     Contact: MGC help desk
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Mike Brownstein, NIMH
            cDNA Library Preparation: British Columbia Cancer Research Center
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Genome Sequence Centre,
            BC Cancer Agency, Vancouver, BC, Canada
            info@bcgsc.bc.ca
            Martin Hirst, Thomas Zeng, Ryan Morin, Michelle Moksa, Johnson
            Pang, Diana Mah, Jing Wang, Kieth Fichter, Eric Chuah, Allen
            Delaney, Rob Kirkpatrick, Agnes Baross, Sarah Barber, Mabel
            Brown-John, Steve S. Chand, William Chow, Ryan Babakaiff, Dave
            Wong, Corey Matsuo, Jaclyn Beland, Susan Gibson, Luis delRio, Ruth
            Featherstone, Malachi Griffith, Obi Griffith, Ran Guin, Nancy
            Liao, Kim MacDonald, Mike R. Mayo, Josh Moran, Diana Palmquist, JR
            Santos, Duane Smailus, Jeff Stott, Miranda Tsai, George Yang,
            Jacquie Schein, Asim Siddiqui,Steven Jones, Rob Holt, Marco Marra.
            Clone distribution: MGC clone distribution information can be found
            through the I.M.A.G.E. Consortium/LLNL at: http://image.llnl.gov
            Series: IRCB Plate: 15 Row: M Column: 10.
FEATURES             Location/Qualifiers
     source          1..3889
                     /db_xref="H-InvDB:HIT000501868"
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="MGC:176938 IMAGE:9051921"
                     /tissue_type="Lung and heart, PCR rescued clones"
                     /clone_lib="NIH_MGC_317"
                     /note="Vector: pCR-XL-TOPO with reversed insert; Clone
                     identification sequence tag: CCCAAACC"
     gene            1..3889
                     /gene="DIAPH1"
                     /gene_synonym="DIA1"
                     /gene_synonym="DRF1"
                     /gene_synonym="hDIA1"
                     /gene_synonym="LFHL1"
                     /db_xref="GeneID:1729"
                     /db_xref="HGNC:HGNC:2876"
                     /db_xref="MIM:602121"
     CDS             19..3837
                     /gene="DIAPH1"
                     /gene_synonym="DIA1"
                     /gene_synonym="DRF1"
                     /gene_synonym="hDIA1"
                     /gene_synonym="LFHL1"
                     /codon_start=1
                     /product="diaphanous homolog 1 (Drosophila)"
                     /protein_id="AAI43414.1"
                     /db_xref="GeneID:1729"
                     /db_xref="HGNC:HGNC:2876"
                     /db_xref="MIM:602121"
                     /translation="MEPPGGSLGPGRGTRDKKKGRSPDELPSAGGDGGKSKKFTLKRL
                     MADELERFTSMRIKKEKEKPNSAHRNSSASYGDDPTAQSLQDVSDEQVLVLFEQMLLD
                     MNLNEEKQQPLREKDIIIKREMVSQYLYTSKAGMSQKESSKSAMMYIQELRSGLRDMP
                     LLSCLESLRVSLNNNPVSWVQTFGAEGLASLLDILKRLHDEKEETAGSYDSRNKHEII
                     RCLKAFMNNKFGIKTMLETEEGILLLVRAMDPAVPNMMIDAAKLLSALCILPQPEDMN
                     ERVLEAMTERAEMDEVERFQPLLDGLKSGTTIALKVGCLQLINALITPAEELDFRVHI
                     RSELMRLGLHQVLQDLREIENEDMRVQLNVFDEQGEEDSYDLKGRLDDIRMEMDDFNE
                     VFQILLNTVKDSKAEPHFLSILQHLLLVRNDYEARPQYYKLIEECISQIVLHKNGADP
                     DFKCRHLQIEIEGLIDQMIDKTKVEKSEAKAAELEKKLDSELTARHELQVEMKKMESD
                     FEQKLQDLQGEKDALHSEKQQIATEKQDLEAEVSQLTGEVAKLTKELEDAKKEMASLS
                     AAAITVPPSVPSRAPVPPAPPLPGDSGTIIPPPPAPGDSTTPPPPPPPPPPPPPLPGG
                     VCISSPPSLPGGTAISPPPPLSGDATIPPPPPLPEGVGIPSPSSLPGGTAIPPPPPLP
                     GSARIPPPPPPLPGSAGIPPPPPPLPGEAGMPPPPPPLPGGPGIPPPPPFPGGPGIPP
                     PPPGMGMPPPPPFGFGVPAAPVLPFGLTPKKLYKPEVQLRRPNWSKLVAEDLSQDCFW
                     TKVKEDRFENNELFAKLTLTFSAQTKTSKAKKDQEGGEEKKSVQKKKVKELKVLDSKT
                     AQNLSIFLGSFRMPYQEIKNVILEVNEAVLTESMIQNLIKQMPEPEQLKMLSELKDEY
                     DDLAESEQFGVVMGTVPRLRPRLNAILFKLQFSEQVENIKPEIVSVTAACEELRKSES
                     FSNLLEITLLVGNYMNAGSRNAGAFGFNISFLCKLRDTKSTDQKMTLLHFLAELCEND
                     YPDVLKFPDELAHVEKASRVSAENLQKNLDQMKKQISDVERDVQNFPAATDEKDKFVE
                     KMTSFVKDAQEQYNKLRMMHSNMETLYKELGEYFLFDPKKLSVEEFFMDLHNFRNMFL
                     QAVKENQKRRETEEKMRRAKLAKEKAEKERLEKQQKREQLIDMNAEGDETGVMDSLLE
                     ALQSGAAFRRKRGPRQANRKAGCAVTSLLASELTKDDAMAAVPAKVSKNSETFPTILE
                     EAKELVGRAS"
BASE COUNT     1047 a    947 c    995 g    900 t
ORIGIN
        1 gccagcgtga accgggacat ggagccgccc ggcgggagcc tggggcccgg ccgcgggacc
       61 cgggacaaga agaagggccg gagcccagat gagctgccct cggcgggcgg cgacggcggc
      121 aaatctaaga aatttactct gaagcggctc atggcagatg agctggagag atttaccagc
      181 atgagaatta agaaggagaa ggaaaagccc aattctgctc atagaaattc ttctgcatca
      241 tatggggatg atcccacagc acagtcattg caagatgttt cagatgaaca agtgctggtt
      301 ctctttgaac agatgctgct ggatatgaac ctgaatgagg agaaacagca acctttgagg
      361 gagaaggaca tcatcatcaa gagggagatg gtgtcccaat acttgtacac ctccaaggct
      421 ggcatgagcc agaaggagag ctctaagtct gccatgatgt atattcagga gttgaggtca
      481 ggcttgcggg atatgcctct gctcagctgc ctggagtccc ttcgtgtgtc tctcaacaac
      541 aaccctgtca gttgggtgca aacatttggt gctgaaggct tggcctcctt attggacatt
      601 cttaaacgac ttcatgatga gaaagaagag actgctggga gttacgatag ccggaacaag
      661 catgagatca ttcgctgctt gaaagctttt atgaacaaca agtttggaat caagaccatg
      721 ttggagacag aagaaggaat cctactgctg gtcagagcca tggatcctgc tgttcccaac
      781 atgatgattg atgcagctaa gctgctttct gctctttgta ttctaccgca gccagaggac
      841 atgaatgaaa gggttttgga ggcaatgaca gaaagagctg agatggatga agtggaacgt
      901 ttccagccgc tgctggatgg attaaaaagt ggaaccacta ttgcactgaa ggttggatgc
      961 ctacagctga tcaatgctct catcacacca gcggaggaac ttgacttccg agttcacatc
     1021 agaagtgaac tgatgcgttt ggggctacat caggtgttgc aggaccttcg agagattgaa
     1081 aatgaagata tgagagtgca actaaatgtg tttgatgaac aaggggaaga ggattcctat
     1141 gacctgaagg gacggctgga tgacattcgc atggagatgg atgactttaa tgaagtcttt
     1201 cagattctct taaacacagt gaaggattca aaggcagagc cacacttcct ttccatcctg
     1261 cagcacttac tcttggtccg aaatgactat gaggccagac ctcagtacta taagttgatt
     1321 gaagaatgta tttcccagat agttctgcac aagaacgggg ctgatcctga cttcaagtgc
     1381 cggcacctcc agattgagat tgagggatta attgatcaaa tgattgataa gacaaaggtg
     1441 gagaaatctg aagccaaagc tgcagagctg gaaaagaagt tggactcaga gttaacagcc
     1501 cgacatgagc tacaggtgga aatgaaaaag atggaaagtg actttgagca gaagcttcaa
     1561 gatcttcagg gagaaaaaga tgcactgcat tctgaaaagc agcaaattgc cacagagaaa
     1621 caggacctgg aagcagaggt gtcccagctc acaggagagg ttgccaagct gacaaaggaa
     1681 ctggaagatg ccaagaaaga aatggcttcc ctctctgcgg cagctattac tgtacctcct
     1741 tctgttccta gtcgtgctcc tgttccccct gcccctcctt tacctggtga ctctggcact
     1801 attattccac caccacctgc tcctggggat agtaccactc ctcctcctcc tcctccacca
     1861 ccacctcctc cacctccttt gcctgggggt gtttgcatct cctcaccccc ttctttacct
     1921 ggaggtactg ctatctctcc accccctcct ttgtctgggg atgctaccat ccctccaccc
     1981 cctcctttgc ctgagggtgt tggcatccct tcaccctctt ctttgcctgg aggtactgcc
     2041 atccccccac ctcctccttt gcctgggagt gctagaatcc ccccaccacc acctcctttg
     2101 cctgggagtg ctggaattcc ccccccacct cctcccttgc ctggagaagc aggaatgcca
     2161 cctcctcctc cccctcttcc tggtggtcct ggaatccctc cacctcctcc atttcccgga
     2221 ggccctggca ttcctccacc tccacccgga atgggtatgc ctccacctcc cccatttgga
     2281 tttggagttc ctgcagcccc agttctgcca tttggattaa cccccaaaaa gctttataag
     2341 ccagaggtgc agctccggag gccaaactgg tccaagcttg tggctgagga cctctcccag
     2401 gactgcttct ggacaaaggt gaaggaggac cgctttgaga acaatgaact tttcgccaaa
     2461 cttaccctta ccttctctgc ccagaccaag acttccaaag ccaagaagga tcaagaaggt
     2521 ggagaagaaa agaaatctgt gcaaaagaaa aaagtaaaag agttaaaggt gttggattca
     2581 aagacagccc agaatctctc aatctttttg ggttccttcc gcatgcccta tcaagagatt
     2641 aagaatgtca tcctggaggt gaatgaggct gttctgactg agtctatgat ccagaacctc
     2701 attaagcaaa tgccagagcc agagcagtta aaaatgcttt ctgaactgaa ggatgaatat
     2761 gatgacctgg ctgagtcaga gcagtttggc gtggtgatgg gcactgtgcc ccgactgcgg
     2821 cctcgcctca atgccattct cttcaagcta caattcagcg agcaagtgga gaatatcaag
     2881 ccagagattg tgtctgtcac tgctgcatgt gaggagttac gtaagagtga gagcttttcc
     2941 aatctcctag agattacctt gcttgttgga aattacatga atgctggctc cagaaatgct
     3001 ggtgcttttg gcttcaatat cagcttcctc tgtaagcttc gagacaccaa gtccacagat
     3061 cagaagatga cgttgttaca cttcttggct gagttgtgtg agaatgacta tcccgatgtc
     3121 ctcaagtttc cagacgagct tgcccatgtg gagaaagcca gccgagtttc tgctgaaaac
     3181 ttgcaaaaga acctagatca gatgaagaaa caaatttctg atgtggaacg tgatgttcag
     3241 aatttcccag ctgccacaga tgaaaaagac aagtttgttg aaaaaatgac cagctttgtg
     3301 aaggatgcac aggaacagta taacaagctg cggatgatgc attctaacat ggagaccctc
     3361 tataaggagc tgggcgagta cttcctcttt gaccccaaga agttgtctgt tgaagaattt
     3421 ttcatggatc ttcacaattt tcggaatatg tttttgcaag cagtcaagga gaaccagaag
     3481 cggcgggaga cagaagaaaa gatgaggcga gcaaaactag ccaaggagaa ggcagagaag
     3541 gagcggctag agaagcagca gaagagagag caactcatag acatgaatgc agagggcgat
     3601 gagacaggtg tgatggacag tcttctagaa gccctgcagt caggggcagc attccgacgg
     3661 aagagagggc cccgtcaagc caacaggaag gccgggtgtg cagtcacatc tctgctagct
     3721 tcggagctga ccaaggatga tgccatggct gctgttcctg ccaaggtgtc caagaacagt
     3781 gagacattcc ccacaatcct tgaggaagcc aaggagttgg ttggccgtgc aagctaatgt
     3841 gggtcctgtg accgcggcag ctcctcagcg gagccgcaga ctgtcctgc
//