LOCUS CDG24127.2 2540 aa PRT CON 06-FEB-2024
DEFINITION Caenorhabditis elegans Myosin motor domain-containing protein.
ACCESSION BX284606-2740
PROTEIN_ID CDG24127.2
SOURCE Caenorhabditis elegans
  ORGANISM Caenorhabditis elegans
    Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis.
REFERENCE 1 (bases 1 to 17718942)
  AUTHORS WormBase.
  CONSRTM WormBase Consortium
  JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org
REFERENCE 2 (bases 1 to 17718942)
  AUTHORS Sulston J.E., Waterston R.
  JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA.
REFERENCE 3 (bases 1 to 17718942)
  AUTHORS Sulston J.E., Waterston R.
  CONSRTM Caenorhabditis elegans Sequencing Consortium
  TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology
  JOURNAL Science 282(5396), 2012-2018(1998).
COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature.
FEATURES Qualifiers
source
  /organism="Caenorhabditis elegans"
  /chromosome="X"
  /strain="Bristol N2"
  /mol_type="genomic DNA"
  /db_xref="taxon:6239"
protein
  /transl_table=1
  /gene="hum-4"
  /locus_tag="CELE_F46C3.3"
  /standard_name="F46C3.3f"
  /note="Confirmed by transcript evidence"
  /db_xref="EnsemblGenomes-Gn:WBGene00002037"
  /db_xref="EnsemblGenomes-Tr:F46C3.3f"
  /db_xref="GOA:S6F568"
  /db_xref="InterPro:IPR000048"
  /db_xref="InterPro:IPR000299"
  /db_xref="InterPro:IPR000857"
  /db_xref="InterPro:IPR001452"
  /db_xref="InterPro:IPR001609"
  /db_xref="InterPro:IPR011993"
  /db_xref="InterPro:IPR019748"
  /db_xref="InterPro:IPR019749"
  /db_xref="InterPro:IPR027417"
  /db_xref="InterPro:IPR035963"
  /db_xref="InterPro:IPR036028"
  /db_xref="InterPro:IPR036961"
  /db_xref="InterPro:IPR038185"
  /db_xref="UniProtKB/TrEMBL:S6F568"
  /db_xref="WormBase:WBGene00002037"
intron_pos 8:2 (1/40) intron_pos 30:0 (2/40) intron_pos 85:2 (3/40) intron_pos 150:2 (4/40) intron_pos 404:0 (5/40) intron_pos 459:0 (6/40) intron_pos 506:0 (7/40) intron_pos 542:2 (8/40) intron_pos 599:2 (9/40) intron_pos 856:0 (10/40) intron_pos 937:0 (11/40) intron_pos 990:2 (12/40) intron_pos 1035:2 (13/40) intron_pos 1100:2
(14/40) intron_pos 1142:1 (15/40) intron_pos 1193:2 (16/40) intron_pos 1269:1 (17/40) intron_pos 1428:0 (18/40) intron_pos 1493:0 (19/40) intron_pos 1533:2 (20/40) intron_pos 1550:0 (21/40) intron_pos 1577:2 (22/40) intron_pos 1620:0 (23/40) intron_pos 1678:0 (24/40) intron_pos 1718:1 (25/40) intron_pos 1797:1 (26/40) intron_pos 1837:0 (27/40) intron_pos 1878:2 (28/40) intron_pos 1963:0 (29/40) intron_pos 2013:0 (30/40) intron_pos 2049:1 (31/40) intron_pos 2142:2 (32/40) intron_pos 2187:1 (33/40) intron_pos 2219:0 (34/40) intron_pos 2277:1 (35/40) intron_pos 2331:0 (36/40) intron_pos 2377:2 (37/40) intron_pos 2425:1 (38/40) intron_pos 2481:0 (39/40) intron_pos 2515:0 (40/40) BEGIN 1 MDNLIELPDQ SEAGIAQNLH ERFKKGVTYT KASNVLVFVN DFNDKDSEDQ LSWETSSTSG 61 VNAVAKNALN KIFNMSSNAE SIVFGGESGS GKSYNVFKAF KYLTSQPKSK VSTKHSSSIE 121 FVFKSFGCAK TLKNDEATRF GCSIDLLYKR NVLTGLNLKY TVPLEVPRVI SQKPGERNFN 181 IFYEVYHGLS DEMKAKFGIK GLQKFFYINQ GNSSENIQHD VNRFKHLESA LHVLGFSDDH 241 CMSIYKIIST ILHIGNIYFR TKRNPNVEQD VVEIGNLAEL KWIAFLLEVD FDQLVKFLLP 301 TSEDGSTIEL NAALDNRDSF AMMIYEELFK WVLNRIGLQL KCSLHTGVIS ILDHYGFEKY 361 NNNGVEEFLI NSVNERIENL FVKHCFHDQL IDYAKDGISV DYKVPNSIEN GKTVELLFKK 421 PYGLLSLLTD ECKFPKGTHE TYLEHCNLNH TDRSAYGKAR NKERLEFGVR HCIGTTWYNV 481 TDFFARNKRI ISLSAVQLMR NSKNPIIGLL FESYGGNTSD IIVSQAQFVL RGAQDIADKI 541 NVSHVHFVRC IKSNNERQST KFDIPLVNRQ IKNLLLAELL SFRVKGYPVK ISKTTFARQY 601 RCLLPGDIAQ CQNEKEIIQD ILQGQGVKYE DDFKIGTEYV FLRERLAERY DGLQNKICGD 661 AAIIIQKNMK SFVAQKVYKR MRAAIIKLQS GLRGWKARRD YIIKREEMFK AIGRTMKRNK 721 RLDAYHQALG TENSGQLQQT LVGYIDINED AKKFLERPTS DSDATETLTH YLTVPMKNFL 781 SNMNSITLEQ FAEENFKGHL LEPRREPIMT PFLHKESDYD FRLSVEIFKL ILKYMNDIKL 841 TKKQREDLGR YIVQQGISNP CQRDEILVQT INQINKNQDK TASDNGWKLV HMAISVFPPT 901 ENIIPMLIGF FNKESVPMKE QLFATLQRRL KIYDSEIARE LPPSNLELIA TPNIPNTVAE 961 INCYDGLCCD VHLSPWSTTS EIAERILKQR GVGIALGWTV EVETPNRVFA PTGNHFIHDV 1021 FSQIEGANMD SENQQNIFFN FPPEKLVKKP PVVPKEVVQE QEIVSQNTNG TIPKAARTYP 1081 TPEEWINQPP QRTESRNTPR DHQKPDEESD ESLVREYNEE 
AGYAYTLPRK NKPAPQAVED 1141 NETSESEEDD EDAANRVGYE TVRPEENMKM METIQDHSHI LKSPVLPRKT YSRNEQHEEY 1201 TPMNFAPPPT FTYPQQMPMM QYVPVMMTSS MMTPSMMTPS MMPGQQIAMV PQQMMMQPHF 1261 SYVPQYPQIP QYCPPEPILS PQSVRSEVPP MMAPVMHDGH GSTRSRDYRI MKRGEVPSQY 1321 STIRNMPVPE HGKDVDQFLD AVFDQVLSKD EQRAAHFNSN QLANTIKGGK LRPEGYYEPP 1381 VQTYSPVPPR YPTLRRVDDS PLRSRAKSLP RIISPRHEHF VRRPHSRNSY SNESTSSDDQ 1441 MNYRTRSRER SLPRFHSNNG YNYDPSQPVY MMPVQMNGHG EMILLSPVAS EHKAQSRHRT 1501 HDRSHRHHVP SGVERYMAKR SPSVDILKPP MSRRTPDAMV EQTYVRPHLA QSPVGRQTSR 1561 FEEFSALPRG DSRARERMEN PLLARQIPRA YPYQNGNVVP PPPSSYRAPS PAPTSGDRRG 1621 LNRLPQESYT EPAVKNTKNN SEYLSPHRFN LDEQKAVIRH EKETAKNAIN MLSDRLRSPV 1681 PQPTPPPPPP PVREEWVIRD SVERDPVTQG RIRGSFRKEP LVPQPPRPPV VAEKPAVKFV 1741 KAPWKLTIRK EMFYPGEVLN DIQIIDQVFA QIVEDCKKSY PYRIREQDRK QVETVLRQYE 1801 IPPSDLNNQS NIHPDVKVAV IELARLWPLY FNQVYEVVEK RPDESVSTIF AISEHGIRLI 1861 VHTPHDLENP LKIQDFFPFE TIADVSLEAN DILSVHVRHE DEENAYSAVR IKTNQAPQIK 1921 KTLDRCLSGG VVPKRKFVRA LEDYVTSEVN HLSFKQGDVI ELLQEPEGET PPVGNWLYGK 1981 IENRFGFLLA QYVDSTDGDN VPPIRHETSE DRDERVRFFD DEVPFSSERY TMIDFATKYF 2041 RKPKDKKKQE TWAWEDISQI VRFSEKPISQ SLLADLGNEE SKYAVETFHA IMKFMGDEPL 2101 KKSESMTDVV FKVLLICHRQ PTLRDEVYCQ LIKQTTSNIS QKPNSALRAW RLLTIITAYF 2161 PSSLTLKPYV LQYLGDNADE WQRPFHGTAR ICQTNMIQTF KYGGRKVLLN ALEVQQITDG 2221 CQLRRQAFYI SKDHNVSQTL RPITVAEEMI QELCNLLNVR SLHEQQEFSL CYTVGKDKHL 2281 NYCKNDNYLM DIITESEHKK LPFQFYLKRT VWVHPLRYDN AAYIDSMFDQ VIDDYLRGSL 2341 ISTNSLGQLT AATTEEIIKL AAYLFLLLPD NPKGLNAKTL PQIVPKSVIE PKHRHQEEMV 2401 TRISRQLKMF GGRMRPAEAK SHFLELLSTW PTFGVLHYRL KSVVENGHQL PEVILTINKS 2461 GIQLLQPKSK EVFKERNYDQ IVSVESIRKT AYKIVRLVIN TMQGEETLDI KTDEADEISH 2521 LIGQYMFVTG GAEERGSTEL //