LOCUS CBK19448.2 1375 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Myosin motor domain-containing protein. ACCESSION BX284606-2741 PROTEIN_ID CBK19448.2 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="hum-4" /locus_tag="CELE_F46C3.3" /standard_name="F46C3.3d" /note="Confirmed by transcript evidence" /db_xref="GOA:D3YT13" /db_xref="InterPro:IPR000299" /db_xref="InterPro:IPR000857" /db_xref="InterPro:IPR001452" /db_xref="InterPro:IPR011993" /db_xref="InterPro:IPR019748" /db_xref="InterPro:IPR019749" /db_xref="InterPro:IPR035963" /db_xref="InterPro:IPR036028" /db_xref="InterPro:IPR038185" /db_xref="UniProtKB/TrEMBL:D3YT13" /db_xref="WormBase:WBGene00002037" intron_pos 67:1 (1/25) intron_pos 226:0 (2/25) intron_pos 291:0 (3/25) intron_pos 331:2 (4/25) intron_pos 348:0 (5/25) intron_pos 375:2 (6/25) intron_pos 418:0 (7/25) intron_pos 476:0 (8/25) intron_pos 513:0 (9/25) intron_pos 553:1 (10/25) intron_pos 632:1 (11/25) intron_pos 672:0 (12/25) intron_pos 713:2 (13/25) intron_pos 798:0 (14/25) intron_pos 848:0 (15/25) intron_pos 884:1 (16/25) intron_pos 977:2 (17/25) intron_pos 1022:1 (18/25) intron_pos 1054:0 (19/25) intron_pos 1112:1 (20/25) intron_pos 1166:0 (21/25) intron_pos 1212:2 
(22/25) intron_pos 1260:1 (23/25) intron_pos 1316:0 (24/25) intron_pos 1350:0 (25/25) BEGIN 1 MNFAPPPTFT YPQQMPMMQY VPVMMTSSMM TPSMMTPSMM PGQQIAMVPQ QMMMQPHFSY 61 VPQYPQIPQY CPPEPILSPQ SVRSEVPPMM APVMHDGHGS TRSRDYRIMK RGEVPSQYST 121 IRNMPVPEHG KDVDQFLDAV FDQVLSKDEQ RAAHFNSNQL ANTIKGGKLR PEGYYEPPVQ 181 TYSPVPPRYP TLRRVDDSPL RSRAKSLPRI ISPRHEHFVR RPHSRNSYSN ESTSSDDQMN 241 YRTRSRERSL PRFHSNNGYN YDPSQPVYMM PVQMNGHGEM ILLSPVASEH KAQSRHRTHD 301 RSHRHHVPSG VERYMAKRSP SVDILKPPMS RRTPDAMVEQ TYVRPHLAQS PVGRQTSRFE 361 EFSALPRGDS RARERMENPL LARQIPRAYP YQNGNVVPPP PSSYRAPSPA PTSGDRRGLN 421 RLPQESYTEP AVKNTKNNSE YLSPHRFNLD EQKAVIRHEK ETAKNAINML SDRLRKLPPP 481 VDNVRLIRPI TPAQPPITQS PVPSEPVVVR TPSPVPQPTP PPPPPPVREE WVIRDSVERD 541 PVTQGRIRGS FRKEPLVPQP PRPPVVAEKP AVKFVKAPWK LTIRKEMFYP GEVLNDIQII 601 DQVFAQIVED CKKSYPYRIR EQDRKQVETV LRQYEIPPSD LNNQSNIHPD VKVAVIELAR 661 LWPLYFNQVY EVVEKRPDES VSTIFAISEH GIRLIVHTPH DLENPLKIQD FFPFETIADV 721 SLEANDILSV HVRHEDEENA YSAVRIKTNQ APQIKKTLDR CLSGGVVPKR KFVRALEDYV 781 TSEVNHLSFK QGDVIELLQE PEGETPPVGN WLYGKIENRF GFLLAQYVDS TDGDNVPPIR 841 HETSEDRDER VRFFDDEVPF SSERYTMIDF ATKYFRKPKD KKKQETWAWE DISQIVRFSE 901 KPISQSLLAD LGNEESKYAV ETFHAIMKFM GDEPLKKSES MTDVVFKVLL ICHRQPTLRD 961 EVYCQLIKQT TSNISQKPNS ALRAWRLLTI ITAYFPSSLT LKPYVLQYLG DNADEWQRPF 1021 HGTARICQTN MIQTFKYGGR KVLLNALEVQ QITDGCQLRR QAFYISKDHN VSQTLRPITV 1081 AEEMIQELCN LLNVRSLHEQ QEFSLCYTVG KDKHLNYCKN DNYLMDIITE SEHKKLPFQF 1141 YLKRTVWVHP LRYDNAAYID SMFDQVIDDY LRGSLISTNS LGQLTAATTE EIIKLAAYLF 1201 LLLPDNPKGL NAKTLPQIVP KSVIEPKHRH QEEMVTRISR QLKMFGGRMR PAEAKSHFLE 1261 LLSTWPTFGV LHYRLKSVVE NGHQLPEVIL TINKSGIQLL QPKSKEVFKE RNYDQIVSVE 1321 SIRKTAYKIV RLVINTMQGE ETLDIKTDEA DEISHLIGQY MFVTGGAEER GSTEL //