LOCUS CAB60351.1 2145 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans U5 small nuclear ribonucleoprotein 200 kDa helicase protein. ACCESSION BX284602-4440 PROTEIN_ID CAB60351.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 15279421) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 15279421) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 15279421) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="II" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="snrp-200" /locus_tag="CELE_Y46G5A.4" /standard_name="Y46G5A.4" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00012896" /db_xref="EnsemblGenomes-Tr:Y46G5A.4" /db_xref="WormBase:WBGene00012896" intron_pos 147:1 (1/11) intron_pos 421:2 (2/11) intron_pos 801:0 (3/11) intron_pos 937:0 (4/11) intron_pos 1329:0 (5/11) intron_pos 1459:0 (6/11) intron_pos 1562:1 (7/11) intron_pos 1699:1 (8/11) intron_pos 1753:0 (9/11) intron_pos 1829:1 (10/11) intron_pos 2083:2 (11/11) BEGIN 1 MADELARIQQ YEYRQNSNLV LSVDYNLTDR RGREEPTGEV LPITDKEMRK MKMGDRAIKG 61 KAPVQDQKKK RKKKDDEKAQ QFGRNVLVDN NELMGAYKPR TQETKQTYEV ILSFILDALG 121 DVPREVLCGA ADEVLLTLKN DKFRDKEKKK EVEALLGPLT DDRIAVLINL SKKISDFSIE 181 EENKPEGDGD IYENEGVNVQ FDSDEEEDDG GMVNEIKGDS EEESEEEEGV DTDYTATLKG 241 DGHLTEDEQK ARGILHPRDI DAHWIQRSLA KYFKDPLIAQ QKQTEVIGIL KNAADDRDAE 301 NQLVLLLGFD QFEFIKCLRQ NRLMILYCTL LRQANEKERL QIEDDMRSRP ELHPILALLQ 361 ETDEGSVVQV EKSKRDAEKS KKAATAANEA ISAGQWQAGR KMLDLNDLTF SQGSHLMSNK 421 RCELPDGSYR RQKKSYEEIH VPALKPRPFA EGEKLVSVSE LPKWAQPAFD GYKSLNRIQS 481 RLCDSALRSK EHLLLCAPTG AGKTNVALLT MLQEIGNHLA EDGSVKLDEF KIVYIAPMKS 541 LVQEMVGSFS KRLAPFGITV GEMTGDAQMS KEQFMATQVI VCTPEKYDVV TRKGGERAYN 601 QMVRLLIIDE IHLLHDDRGP VLESIVVRTI RQMEQNHDEC RLVGLSATLP NYQDVATFLR 661 VKPEHLHFFD NSYRPVPLEQ QYIGVTEKKA LKRFQAMNEV VYDKIMEHAG KSQVLVFVHS 721 RKETAKTAKA IRDACLEKDT LSAFMREGSA STEILRTEAE QAKNLDLKDL LPYGFAIHHA 781 GMNRVDRTLV EDLFADRHIQ VLFSTATLAW GVNLPAHTVI IKGTQIYNPE KGRWTELGAL 841 DIMQMLGRAG RPQYDDRGEG ILITNHSELQ YYLSLMNQQL PVESQMVSRL TDMLNAEVVL 901 GTVSSVSEAT NWLGYTFLFV RMLKNPTLYG ITHEQARADP LLEQRRADLI HTACVLLDKA 961 GLIKYDKRSG IIQATELGRI ASHFYCTYES MQTYNKLLVE TCSDIDLFRI FSMSSEFKLL 1021 SVRDEEKLEL QKMAEHAPIP IKENLDEASA KTNVLLQAYI SQLKLEGFAL QADMVFVAQS 1081 AGRLFRALFE IVLWRGWAGL AQKVLTLCKM VTQRQWGSLN PLHQFKKIPS EVVRSIDKKN 1141 YSFDRLYDLD QHQLGDLIKM PKMGKPLFKF IRQFPKLEMT TLIQPITRTT MRIELTITPD 1201 FKWDEKVHGS AEGFWIFIED TDGEKILHHE FFLLKQKFCS DEHVVKMIVP MFDPMPPLYY 1261 VRIVSDRWIG AETVLPISFR HLILPEKYPP PTELLDLQPL PISAVTNKEF QTVFAESGFK 1321 VFNPIQTQVF RTVFESNENV IVCAPNGSGK TAIAELAVLR HFENTPEAKA VYITPMEDMA 1381 TKVYADWKRR LEPAIGHTIV LLTGEQTMDL KLAQRGQLII STPERWDNIS RRWKQRKSVQ 1441 NVKLFIADDL HMIGASNGAV FEVVCSRTRY ISSQLESAVR VVALSSSLTN ARDLGMWLGC 1501 SASATFNFMP STRPVPLDLE IKSFNLSHNA SRFAAMERPV YQAICRHAGK LEPKPALVFV 1561 PVRRQTRPVA VALLTMALAD GAPKRFLRLA EHDDTFQALL ADIEDESLRE SVSCGVGFLH 1621 EGTAPKDVHI VQQLFESNAI QVCVVPRGMC YQIEMSAYLV VVMDTQFYNG KYHVYEDYPI 1681 ADMLHMVGLA NRPILDSDAK CVVMCQTSKR AYYKKFLCDP LPVESHLDHC LHDHFNAEIV 1741 TKTIENKQDA IDYLTWTLLY RRMTQNPNYY NLQGTTHRHL SDALSELVEL TLKDLENSKC 1801 IAVKDEMDTV SLNLGMIASY YYISYQTIEL FSMSLKEKTK TRALIEIISA SSEFGNVPMR 1861 HKEDVILRQL AERLPGQLKN QKFTDPHVKV NLLIHAHLSR VKLTAELNKD TELIVLRACR 1921 LVQACVDVLS SNGWLSPAIH AMELSQMLTQ AMYSNEPYLK QLPHCSAALL ERAKAKEVTS 1981 VFELLELEND DRSDILQMEG AELADVARFC NHYPSIEVAT ELENDVVTSN DNLMLAVSLE 2041 RDNDIDGLAP PVVAPLFPQK RKEEGWWLVI GDSESNALLT IKRLVINEKS SVQLDFAAPR 2101 PGHHKFKLFF ISDSYLGADQ EFDVAFKVEE PGRSNRKRKH EKEED //