LOCUS CCD63270.1 2476 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans E3 ubiquitin-protein ligase RING2 homolog spat-3 protein. ACCESSION BX284606-1021 PROTEIN_ID CCD63270.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P.
2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="spat-3" /locus_tag="CELE_T13H2.5" /standard_name="T13H2.5a" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00020496" /db_xref="EnsemblGenomes-Tr:T13H2.5a" /db_xref="GOA:H2KYH3" /db_xref="InterPro:IPR001841" /db_xref="InterPro:IPR013083" /db_xref="InterPro:IPR017907" /db_xref="UniProtKB/TrEMBL:H2KYH3" /db_xref="WormBase:WBGene00020496" intron_pos 19:0 (1/18) intron_pos 136:0 (2/18) intron_pos 177:0 (3/18) intron_pos 496:1 (4/18) intron_pos 701:1 (5/18) intron_pos 824:1 (6/18) intron_pos 1061:1 (7/18) intron_pos 1143:0 (8/18) intron_pos 1193:0 (9/18) intron_pos 1320:0 (10/18) intron_pos 1656:1 (11/18) intron_pos 1708:1 (12/18) intron_pos 1766:0 (13/18) intron_pos 1784:0 (14/18) intron_pos 1940:2 (15/18) intron_pos 2355:0 (16/18) intron_pos 2391:1 (17/18) intron_pos 2416:0 (18/18) BEGIN 1 MDDSPGPSTS KSARDKAENA EENTSDSSSD SEVSSASEKS EESRPSSEKK KVITRVIPVR 61 PPTRDKGHRV NLLESGNESE TKSLYQRAKE 
GIPSYKGKPE IKLPTTSEQY YDLEEVLMNP 121 ARMEGRELTL NAYDAVRNKY NVLPGKSVCE ADLQKVIGSF SCDVCQELIQ GSIMTKKCGH 181 RFCDQCILVA FMRSGNTCPT CRQNLGSKRE LQQDPRFDQL IYQVVESRSI VGRMMAENRE 241 HEKDVYFGRK GYIEGGSDWN KRYGIDPNSK LKAPRPLKSA GRKKIRWFHE SDEDGSVRKV 301 MESKKGAPKE DDTNYLENDK EGTSVAAEKE VLEEGEMDFP IEIKSSDEEQ TDLDDEEESM 361 LDSDFEISDN EDVSKPSCST SKKTTNRSRD SSESDNDSRD NELQKKKRKM KRKNVPKTDG 421 SDVSNESFDE DASGEVVATK LIKESKKKPC GRPKKKFAPE LIEGDIPTPS EDSLTSSDEE 481 RDDNAADPYA FVFQKEFNRD PRRDGHPEKD KLYNFDFMID MNHQVDRKFE KDGEIHVISD 541 DSNSEHESDE AEDRESSIDS EHEKEISKFL SHRQPLPNPT SVDDDCQVIT VVKKDVKQSA 601 ITSKPGETSP DSSSKIEEKP DKVSEEVSDD EMTPEHITAD KGTDTFLNNI MEHDDEMYGG 661 YLFRPGDTGI SRPKVQRAPG TNRLSMNVCP EAVYVVYPQP VLKEGKKKLV IPPEDYEISS 721 DETVTLSDSE ETSPSAEMEQ SETSEAGPST IIKTSGTERE TQGSSSPSEP STSRDRKMHK 781 RKLDTRRRKL ADDSDLSDFD VFSIDGNELV ATGKPIIKHK VFYDSANRMP SKSNLDFTGR 841 RNAREIPMEE ISRLAEEQVA HEEYKIHRRR QVVLEAVEAA SKKLNVYVDT TEEEEIEEEE 901 TPEEEVVKVA SPTAPIATEN PTTSTAPFEE GVAMKETPIE EIFFDPDEPC SSAQAAQREL 961 IIERVGKEQQ IIEDSLEQNR KPSSKTVKES ESREAQEPRI EKDEMESEQQ KKDADNPTVE 1021 VDKESEASSS ESDKSDFEDE TLDAQSKTVK ISLKHEKTVS DEEIEDFDTK FGEFVATADA 1081 KMIKRTIGEY VSTEFLKLVA QQPAVTDEVL ALGFCVRNTD QEFSTIKETG KRTNKNPDDV 1141 RLESMVKNFR ESFAAKHRPV PRKLPTNIER MYIERAHMVK YKHVVDMEPL HMKILIALQK 1201 QQIAATCANL SQPVTVTPEE HAEQVQLLHN LQNPSILRPL LNNPQFALTL HKAQQQAIQQ 1261 QRAQQKAQTQ KELAARQAEQ ARVEELARKR IAQEDAEKAL RQKGEQMSNV SGIPVSSDQN 1321 AQSSNAQQTG LIENQTTTTN SDSLTRPNTL ADNSHLGESQ QIPVIESIQS STSEALKESE 1381 NYKDMPILTP ASTVSSKSSA PATRRPSRPC SSYDRPSSPS VVIRERLGSD GALINRPPNR 1441 CNIDKSRSRS PISRAPVETV RINDHGQNET ILAGNITHTV ETTILEEGTS IGQDSTIRYD 1501 GECSTTQYID KTIDLDNSKN GTNVDEEQSN VLKLRENDLN REMLRYANRY HPSTMLAMGN 1561 LSINERHNKV QQVLASQELQ DLIARHTSGA VSQTQVEVVG GEGVCAGTSD AIGETDEDDD 1621 VEEEPEFTVD QLELAKKILK QRQGLESSED SDSDEDMVYD NVDGSVIRRA PHKKRETRKK 1681 KNIFVPNIPP KIRRKYVDKK IEMERAKYRA RIKSQKMASI RIAVPQKPTQ QFATPQQPVR 1741 GPGKHSAAAA ARATPKPKKA KMSNVQMSIV QPPLPLHQLQ GHMSAPTKIT 
AVPNVAAGFH 1801 QNQQQLYSDM AQAPQSTPIR TTPQPGTGSA PQAQTPQSHL AQLGQFVNGA NQQQAPQQQG 1861 MYTAAQLQAM QAAVAQTAQA AQAAYAAEAA YQAQVAQQAR AAPPQQLVQR QVPVGHPGQV 1921 NVPMPAQMLN QGNPQMAVNP AQAQMMDERR KMEEVNAVYH LMQSRGQFPP TNQELFQVQF 1981 AQAQADLRSA AAQAAQAAQA QAQMTNMRAQ AEAVARQQAM MKQEQARAQA AAKEAARLKA 2041 ETEAAKAKVQ AEAEARRKAE QEMRVRQAQA AQTQAAQAQA QSQAHAQNQA QTQAIVEIQR 2101 MIQSGQPLSM QQMQQLQQMS QVQMQHAQQV QQMQQMQMQQ LQMQQFAARM QQGTPKPAVS 2161 QQAVQQGMPA GIQGMPTGMG MQQLQGLGMP GMQLPQQAGQ SQQTSQAQQQ QLFMQLQLQQ 2221 QQLMQHQLQQ QLQMQQHQQH QQQQQQIQMQ QQQQLAQQGL VPNVSSAWLQ QQAQLAQQQQ 2281 QVVQQNLLMR VPSAQTPVAP RAPTVAPQSV VQQAPAPATP IAISVATTQV TRPETVEPFR 2341 SISSGTGTGT ERINNNVELE LWPTNGYSEK KRKEGSQASP IFGASAGDST FYHVACLIRS 2401 RSACNADDVG SLWVLRLEDQ TLLQFQLQQT LAEAQRFVGK QHLVIFYDDV MSGEVQQSKE 2461 YVINREFKPS NRQIPN //