LOCUS X12506 2051 bp DNA linear INV 14-NOV-2006 DEFINITION Drosophila melanogaster twist gene. ACCESSION X12506 VERSION X12506.1 KEYWORDS developmental regulation; glycoprotein; nucleoprotein; twist gene. SOURCE Drosophila melanogaster (fruit fly) ORGANISM Drosophila melanogaster Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Holometabola; Diptera; Brachycera; Muscomorpha; Ephydroidea; Drosophilidae; Drosophila; Sophophora. REFERENCE 1 (bases 1 to 2051) AUTHORS Thisse B., Stoetzel C., Gorostiza-Thisse C., Perrin-Schmitt F. TITLE Sequence of the twist gene and nuclear localization of its protein in endomesodermal cells of early Drosophila embryos JOURNAL EMBO J. 7(7), 2175-2183(1988). PUBMED 3416836 COMMENT see x14569 for twist cDNA seq FEATURES Location/Qualifiers source 1..2051 /organism="Drosophila melanogaster" /strain="Oregon R." /mol_type="genomic DNA" /clone_lib="lambda EMBL 18+" /db_xref="taxon:7227" precursor_RNA 37..2051 /note="minor primary transcript" precursor_RNA 41..2051 /note="major primary transcript" exon 41..1423 /number=1 CDS join(200..1423,1544..1792) /product="twist protein" /db_xref="GOA:P10627" /db_xref="InterPro:IPR011598" /db_xref="InterPro:IPR015789" /db_xref="InterPro:IPR036638" /db_xref="UniProtKB/Swiss-Prot:P10627" /protein_id="CAA31024.1" /translation="MMSARSVSPKVLLDISYKPTLPNIMELQNNVIKLIQVEQQAYMH SGYQLQHQQQHLHSHQHHQQHHQQQHAQYAPLPSEYAAYGITELEDTDYNIPSNEVLS TSSNQSAQSASLELNNNNTSSNTNSSGNNPSGFDGQASSGSSWNEHGKRARSSGDYDC QTGGSLVMQPEHKKLIHQQQQQQQQHQQQIYVDYLPTTVDEVASAQSCPGVQSTCTSP QSHFDFPDEELPEHKAQVFLPLYNNQQQQSQQQQQQQPHQQSHAQMHFQNAYRQSFEG YEPANSLNGSAYSSSDRDDMEYARHNALSSVSDLNGGVMSPACLADDGSAGSLLDGSD AGGKAFRKPRRRLKRKPSKTEETDEFSNQRVMANVRERQRTQSLNDAFKSLQQIIPTL PSDKLSKIQTLKLATRYIDFLCRMLSSSDISLLKALEAQGSPSAYGSASSLLSAAANG AEGDLKCLRKANGAPIIPPEKLSYLFGVWRMEGDAQHQKA" misc_feature 515..523 /note="pot. N-linked glycosylation site" misc_feature 557..565 /note="pot. N-linked glycosylation site" misc_feature 575..583 /note="pot. N-linked glycosylation site" misc_feature 590..598 /note="pot. N-linked glycosylation site" intron 1424..1543 /number=1 exon 1544..2051 /number=2 regulatory 2025..2030 /regulatory_class="polyA_signal_sequence" polyA_site 2051..2051 BASE COUNT 574 a 604 c 511 g 362 t ORIGIN 1 aaaatgtcaa tttgagcaat ggccggaagg atcctgcgtc agttgcgttc cgtaagtgcg 61 tgcgagcaga tcgatccagc aaaacgcggg cgtgaagaat atccacggag ttatacagtt 121 ccgaaataag aatatattgt tagataccac aaattctaac gtgaagaagg tgcgctaaaa 181 agccaagcaa gatcaccaaa tgatgagcgc tcgctcggtg tcgcccaaag tgctgctgga 241 cataagctac aagcccacac tgcccaacat catggagctg cagaacaatg tgatcaagct 301 gatacaggtg gagcagcagg cctacatgca ctccggctat cagctgcagc accagcagca 361 gcacctccac tcccaccagc accaccagca gcaccaccag cagcagcatg cccagtacgc 421 cccactgccc tcggagtacg ccgcctatgg tattaccgaa ctggaggaca cagactacaa 481 catacccagc aacgaggtcc tgagcaccag cagcaaccag agtgcccaga gcgccagtct 541 ggagctgaac aacaacaaca ccagcagcaa caccaacagc tccggcaaca atcccagtgg 601 cttcgatggc caggccagca gcggatcctc ttggaacgag cacggcaaga gggccaggag 661 cagtggcgac tacgattgcc aaaccggggg atcactggtc atgcagcccg agcacaagaa 721 gctaatccac cagcaacagc agcagcaaca acagcaccag caacagatct atgtggatta 781 cttgcccacc accgtggacg aggtggcctc ggctcaatct tgtcctggcg tccagagcac 841 atgcacctcc ccgcaatccc acttcgattt tcccgacgag gagctgcccg agcacaaggc 901 ccaggtgttc ctgcccctct acaacaacca gcagcagcag tcgcagcagc agcaacagca 961 gcagccgcac cagcaaagcc acgcccagat gcacttccaa aacgcctaca gacaaagttt 1021 cgagggctac gagccggcca actcgctgaa cggcagtgct tactccagct cggatcggga 1081 tgacatggag tacgcccgcc acaacgccct gagttcagtt agcgatctca acggaggagt 1141 catgtcgccc gcctgcttgg cggatgacgg cagtgccggc agtttgctgg acggatccga 1201 tgccggcgga aaggccttcc gcaagccacg tcgccggctg aagcggaagc ccagcaagac 1261 ggaggagacg gacgagttca gcaaccagcg ggtcatggcc aatgtgaggg agcgccagcg 1321 cacccagagc ctcaacgacg ccttcaagtc cctgcagcag atcatcccca cgctgcccag 1381 cgacaagctc agcaagatcc agaccctcaa actggccaca aggtaatata tctttagttt 1441 acatcttcat aatagttaaa agcattcata gatagataga tagataatag atgaatgaat 1501 cgaaactcgt aattaatcga actcttttta accaacttac agatacatcg acttcctgtg 1561 ccgcatgctc agctcgagtg atatatcttt gctgaaggcc ttggaggccc agggatcgcc 1621 ctcggcgtat ggatcggcca gctccctcct gagtgccgcc gccaatggag ccgaaggtga 1681 tctgaagtgc ctgcgcaagg ccaacggagc acccattatc ccgcccgaga agctgagtta 1741 tctgttcggg gtgtggcgca tggagggcga cgcgcagcac cagaaggcat agcggcggat 1801 caggacacta tagctatagt cctgtttcga gaggggcttc cagcagcaac tatcgtgtga 1861 attcagagcc ctggcgcatc tcactcttca attctctgtc acgctttcca tatatacgtg 1921 tcgactagat gattctaaga tgtctaagcc taaaacctac tgttaatgcc tatttaatgt 1981 catagtctaa actaaattaa ttgtaaaaag ccaacaagcc aagaaacaaa gaaacgcatg 2041 aacaaaacca g //