LOCUS CCD72477.1 1744 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Myotubularin-related protein 5 protein. ACCESSION BX284606-1165 PROTEIN_ID CCD72477.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="mtm-5" /locus_tag="CELE_H28G03.6" /standard_name="H28G03.6" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00003477" /db_xref="EnsemblGenomes-Tr:H28G03.6" /db_xref="GOA:Q9TXP3" /db_xref="InterPro:IPR001194" /db_xref="InterPro:IPR001849" /db_xref="InterPro:IPR002219" /db_xref="InterPro:IPR004182" /db_xref="InterPro:IPR005112" /db_xref="InterPro:IPR005113" /db_xref="InterPro:IPR010569" /db_xref="InterPro:IPR011993" /db_xref="InterPro:IPR022096" /db_xref="InterPro:IPR029021" /db_xref="InterPro:IPR030564" /db_xref="InterPro:IPR037516" /db_xref="UniProtKB/Swiss-Prot:Q9TXP3" /db_xref="WormBase:WBGene00003477" intron_pos 27:1 (1/20) intron_pos 91:0 (2/20) intron_pos 189:1 (3/20) intron_pos 233:2 (4/20) intron_pos 265:0 (5/20) intron_pos 332:0 (6/20) intron_pos 616:0 (7/20) intron_pos 665:0 (8/20) intron_pos 851:1 (9/20) intron_pos 885:0 (10/20) intron_pos 988:1 (11/20) intron_pos 1139:0 (12/20) intron_pos 1179:1 (13/20) intron_pos 1259:0 (14/20) intron_pos 1348:0 (15/20) intron_pos 1460:1 (16/20) intron_pos 1536:2 (17/20) intron_pos 1568:1 (18/20) intron_pos 1618:1 (19/20) intron_pos 1714:0 (20/20) BEGIN 1 MRDPDKVKSG PICDTVAVIV LEESDDENAL PDVLHEVQSP HTSDNIPTSS IKKFARPRGW 61 YNQSVSSPSE FFYQILTTER GTRRIAYVLS TWEEDEKTLN FKAVSIVLIS QNFHPKAFKE 121 ILLEISNDLR TPEFSSSSEL IRFLTYELVE EGSTIEIRTK TLHVELGFEL IPISPVTGKD 181 VAMLFKMLGF QNVIKIIHAL LSDCRIVLAS SSLMRLSRCQ NAILSLLYPF EYVHSCVTIL 241 PDSLAEVLES PTPFLIGVLS EFVTSFGDEN IVVYLDNGEV HVPDHAEIYK SDDYYYNSLH 301 QRLRDVMFTT TSQEDLSIPN EERIEVDDFI LDKKLRACFI YYFAELLYGY QYYILYTRIK 361 GNFEKKLTTS LTFHVGAFRG FRKLTDMMSS SLLKSVYFQT FILTRALPRR KHDLFDEISC 421 FKELDQLIFK QNSTSSESKK IIEHISCELI QKERYMEKCS ARKQEIFTKI HWISGKELAQ 481 NNNSIIHTVK PKMRSNVILQ AMLPVVNTHA EYHANQFEAY AHRIEALRNC LAAIFEGKVA 541 FASKSLDAVK SSMRFAPLRI ELCRLLNQKC SHDKLTDKQF EDIALLMNAA LQAECEEDKD 601 GVVRSLMYLS NVYSRKVAQG MQQYMYTAVQ EHKVWKNQRF WTSCFYYEVH EMLFSEMLQK 661 DRKITESLWC HTLRPCAMEM INTDDTDQEE LVKQENEMIQ AQAKHFANIL ISLQIPLSEE 721 FFEHEDAHRS VLNEKCKWIV NTLDSILGVT GRINGLSLSR IQTYVEAHVE SLRDVYVEMS 781 TGEHLKKGNF DPVLAHGEFL ISDPIDCYLL TSIEESEMSL NRLENLLPAD GSLFLTNYRV 841 IFKGKSVDIN ATNGTIVQTI PLYSMESFKK LTNKKLIPTQ LIEKGVKIEH IISIRSSCAS 901 SIIIAFDEDE INNMAIEKFL EVIETNSHNS FAFYNTRKDM KVVENGSHKF GTLNSAIRGF 961 TKKKTDTRRI RSHSSHRGSI QLSFDKMEEL DYLKKNAHIR YAVIDYPRIG LNSKIVKLRM 1021 SHSNLDYTIC PSYPGNFIVP SETNESELAK VAKGFVEHRL PVVVWMNENG ALLVRASAFT 1081 SIDMVKKLKK VVNYRRNASK LTGSMTGSQQ TLHSKASSNE ESSSNIVAGA EIKSAEVQMN 1141 YIAKLSNSSQ RAVSYALPTQ YADKFSTFND GCTLTQNNAN GFPTTRIHRK ALYVLLEKGH 1201 GVKIPIDSNA EAIMVRSVKE SELRRSLQRA RQICSSEFQV ENRTSFLESW NASNWPQCVS 1261 RMIELSNSIV ALMNLYNSSV AICLEAGRSI TTILSSLSQL LSDPYYRTCD GFQVLVEKEW 1321 LAFGHYFHKD TETSSPSFIC FLDCVYQISQ QYPTAFEFSY FYISFLAYHS TAGYFRTFID 1381 DCEEKRLQSD ANEFYLPDNL ATINVWEFIK LRNRVSAAFY NELYEQIGDI VIPSSSIPQI 1441 HMWPFLAETH LKYGSPYDIE PASHEQQLVD PDYEEEEDWS KLNNTDIDER HLNRRVRSPE 1501 RDPANMDMIR LLQKSYLTEL FDASDRKTTT NGESNGKETI HELTPFTVGA RPVQCCYCTN 1561 ILTRWSKAVH CKKCRIHVHE GCVNRNITIG NITHTWDAKP FEDIKMPSGA IQIGTPQAEK 1621 MLHSPNNTLT RESMSPPTAN TIPPLCTGYL SKRGAKLKLW VPRFFVLYPD SPKVYYYEDF 1681 ENWKTAEKPS GCIDLVDFKS FNLEQTGRRG LIELHMKNKT HRLLSENINE AIRWKECIEQ 1741 VIRD //