LOCUS CCD61190.1 2098 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Unconventional myosin heavy chain 6 protein. ACCESSION BX284606-481 PROTEIN_ID CCD61190.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="hum-6" /locus_tag="CELE_T10H10.1" /standard_name="T10H10.1" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00002039" /db_xref="EnsemblGenomes-Tr:T10H10.1" /db_xref="WormBase:WBGene00002039" intron_pos 7:0 (1/26) intron_pos 44:0 (2/26) intron_pos 93:0 (3/26) intron_pos 154:2 (4/26) intron_pos 226:1 (5/26) intron_pos 281:0 (6/26) intron_pos 417:2 (7/26) intron_pos 473:0 (8/26) intron_pos 813:0 (9/26) intron_pos 854:0 (10/26) intron_pos 888:0 (11/26) intron_pos 979:0 (12/26) intron_pos 1080:2 (13/26) intron_pos 1224:0 (14/26) intron_pos 1273:2 (15/26) intron_pos 1300:0 (16/26) intron_pos 1357:0 (17/26) intron_pos 1437:2 (18/26) intron_pos 1494:1 (19/26) intron_pos 1557:0 (20/26) intron_pos 1597:2 (21/26) intron_pos 1640:2 (22/26) intron_pos 1755:2 (23/26) intron_pos 1782:0 (24/26) intron_pos 2030:0 (25/26) intron_pos 2068:0 (26/26) BEGIN 1 MVLVSKGDFI WIEPGKTEGS IPIGARVIDQ DHGRLKVIDD LGNEQWLSAD RRVRLMHPTS 61 VQGVEDMCQL GDFHESAILR NLFIRYREKL IYAYTGSILI AVNPYMDIAI YTADEIRMYK 121 RKRIGELPPH IFAIADNAYT NMRREKKNQS VIISGESGAG KTESTKLVLQ FLATISGQHS 181 WIEQQVLEAN PVLEAFGNAK TIRNDNSSRF GKYIDVHFNE SGSIEGAKIE QYLLEKSRIV 241 TQSENERNYH IFYCLLAGLS REEKSELELG TAADYYYLIQ GKTLTAEGRD DAADLAEIRS 301 AMRVLMINEQ EIGSIFKLLA SLLHIGNIRF RQNTNDNMES VDVADPSTLV RIAKLLQLHE 361 QNLLDAITTK SLVTREERVI SRLNGQQAVD ARDALAKAIY GKLFIHIVRR VNDAIYKPSQ 421 SRRTSIGILD IFGFENFESN SFEQLCINFA NETLQQFFVH HVFKMEQKEY DEEHINWRHI 481 KFVDNQATVD LIAQRPLNIL SLIDEESIFP KGTDKTMLLK LHSTHGRNEL YLQPKSELQR 541 AFGVTHFAGN VFYNTRGFLE KNRDSFSADL SVLISSSKMP FLARLFDDIE YDTSSRKKVT 601 VGNQFRRSLE QLMSQLTQTH PFFIRCIKPN EMKRALVMDR DLVLRQLRYS GMMETIKIRR 661 SGYPIRHDYY PFVFRYRVLV SSIQGPVNRI DLHDAAKKIC HMILGTNADY QLGKTKVFLK 721 DKHDLVLEQE YYRILKDKAI VIQKNVRRWL VRKDFEKQRQ AAVTIQTAWR GFDQRKRYRQ 781 IISGFSRLQA VLRSRQLVSH YQTLRKTIIQ FQAVCRGSLV RRQVGEKRKR GEKAPLTEVS 841 STASVISDSH EELVGHLFDF LPSDGKDSGN ENDSADSSRR GSYSRLHTSP VMPPANIPRV 901 DSYVDEDLSK YQFGKYAATF FQAQATATHV KKPLKTALLT HTEPSAQLAA LTAWTTILRF 961 MGDLADVKPG STNGSEVYDK TPVMIKLYAT LGKKFSAHDL EEAMLSSEYG GAKTLKKGMG 1021 RKLISMTLKR KGKINGSDTS SISSDSVYSS FNAMLENKPM TSLDKLHYII GLGILREDLR 1081 DEIYCQLCKQ LSNNPSKLSA ARGWILLSLC VGCFAPSERF IKYLFCFIRE RGPAGTGYSK 1141 YIEDRLRRTQ VNGTRHQPPS YVELQANKSQ KPVVLAVTFM DGSVKTLCAD SATTAAELCK 1201 QLAEKVGLTN SFGFSLYIAL FDKVSSLGSG TDHVMDAISQ CEQYAKEQGR QERNAPWRLF 1261 FRKEIFSPWH DPRDDPVSTN LIYQQVIRGI KYGEYRCDKD EELAAICAQQ YYIDEGTMDV 1321 NKLENNLPSY LPDFEMSGKE MALEKWTQTI MHQYRKKFTG RLPSQIEVKE NVVSVAKTKW 1381 PLLFSRFYEA LKFAGPPLPK NEVIIAVNWT GVYVVDDREH VMLEFSFPEI STAYYGKGKR 1441 STTDTCTVRT VVGDEYTFQS PNADDITNLI VMFLEGLKKR SRYLVAIKSQ KGDEKNNFLE 1501 FEKGDLLILV NEFTGNTLLT ESVVKGENSR TCLFGLIRAE NVYVLPTLVK PSKNTLQIFP 1561 KDMDLSLDLF NNNKQVTVVD YNAEPYTLEN FAEDNFNSQV KRVGSQISLM TLRKKESQIE 1621 CWRFSREHID QPLLKKLNGR EDACRGAIEI FAAIMKYMGD EPSKRSRLGT HLTDHIFKLP 1681 ISMEALRDEL YCQLVKQLTL NPSIMSEERG WELLWMATGL FAPSAALAKE ISHFLKSRPH 1741 PIALDCQNRM QKLAKGGSRK YPPHLVEVEA IQHKTTQIFH KVFFPDNTDE AIEVDSATRA 1801 RDFCHKIGYR LGLKSSDGFS LFVKIKDKVL AVPESEFFFD YVRSLSDWVH TNHATQKDAT 1861 MIPINYQVYF MRKLWYNFVA GADPQADIIF HYHQESQKYL LGYHKTTKND VIELAALILR 1921 SMTKDGKNAP LAQIPQLLDE IIPKDSLKMY SASEWRKTIS NAYARIEHLK SDQAKIEFLN 1981 YICRWPTFGS AFFPVSQYSD LNLPDRLLLA INQTGVNIYH LDTKNLLVQY PFNVICNWTS 2041 GNTYFNMTVG NMLKGNEGKK LLLDTTVGYK MDDLLTSYIS LLISNQNNHP SKTREVAL //