LOCUS VVC12408.1 2288 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Lin-5 protein. ACCESSION BX284606-1111 PROTEIN_ID VVC12408.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="lfi-1" /locus_tag="CELE_ZC8.4" /standard_name="ZC8.4h" /note="Confirmed by transcript evidence" /db_xref="WormBase:WBGene00022500" intron_pos 43:2 (1/16) intron_pos 81:0 (2/16) intron_pos 111:0 (3/16) intron_pos 153:2 (4/16) intron_pos 275:0 (5/16) intron_pos 738:2 (6/16) intron_pos 1249:1 (7/16) intron_pos 1393:0 (8/16) intron_pos 1454:1 (9/16) intron_pos 1602:1 (10/16) intron_pos 1686:1 (11/16) intron_pos 1741:0 (12/16) intron_pos 2001:0 (13/16) intron_pos 2083:0 (14/16) intron_pos 2162:2 (15/16) intron_pos 2213:1 (16/16) BEGIN 1 MSTHDDTRSS GGGDGTIHIP SADAKEPTPE SPGSRSNYGE FYWDTSGENN YHHTSSFDDD 61 LLLEIDSSRG AAVARLDRTQ DDLNKFRQRI DNNVEQQREY SEMMAALQNK VHEYRKHIAE 121 LEGRMVGARN RMLDDPTSNV MIFDNYDPGN TYITNHNVEL WSPARGKRET ILGGGGAPGL 181 TTVNVHAGAG YSGSGVAGYG GGVQAMVGDP NANYEMIARL DEERRRSDEY RMQWENERQK 241 SLSLEDENDR LRREFERYAN DSKDKEKTFI NRERNLAQYL SDEQRKMLDL WTELQRVRKQ 301 FSDLKTHTEE DLDKQKAEFT RAIRNVNNIS RNAAFSAGAG DGLGLYGLED GGDVNRTTNN 361 YEKVFIETIK RMNGTGGAGS ASSADLLEEL RKIRGGGSSE GDAELHKELM TKYEESIERN 421 IELESRGDDS QRKIAELEAE LRRNREKLNE AQGALKKLHE MAQDSEKNVD GTVSIKRTRS 481 LSPGKTPLPP SEALRAVRNT FRNKDNDIQQ LERKLKIAES QVKEFLNKFE NADEARRRLD 541 KQFADAKREI SNLQKSVDEA ERNSRRTDDK LRASEAERVA AEKARKFLED ELAKLQASFQ 601 KSSTDDARKL RDEMDEHTNS IQEEFKTRID ELNRRVENLL RENNRLKSEV NPLKDKYRDL 661 ENEYNSTQRR IEEKETQIRY SDDIRRNIQK DLDDLREKYD RVHTDNEKIL GELEHAQKAA 721 HLAEQQLKEI KIQRDDYQKQ KDEHARHLFD IRHKLETEIK GRQDLEKNGA RNNDELDKLR 781 QTISDYESQI NLLRRHNDEL DTTIKGHQGK ITHLENELHS RSGEIEKLND LNQRLQKEKQ 841 DILNQKLKLD GDVQALKETI RKLENELEKL RNENKELVGK EARARDAANQ QLSRANLLNK 901 ELEDTKQDLK HSTDVNKQLE QDIRDLKERL ANIGKGGRIS RDSTTGTDGG AFGDRSSVAD 961 PSRTRGAAGS TVFVPAAEDI ESRGGGEIDI PSSGDVIHGR DGRDGRDAGN RGTHTITNTK 1021 ERIERIEKNI LDRYHDDELV EHKIREVNDR WKRELERLEN EKDDLERRIR ELEDELSQIG 1081 RGNDKTENDI TELKRKHAAE IDKLKSDISA LHDKHLSDLD DEKEQYGKAV ENLKSVEDDL 1141 RDKLNNLEKQ LADSLNRENE LEREKRDYDE KINSLYGQNQ KIKDEWDDFR NDADKEIQKW 1201 KTDAYTVRSE AKALETTNTA LKAQLQAAND RIDHLTKTVN DHTSKVRDLT SQVRHLEDEL 1261 ADTKGNLVQK EMDLESTQNR LRSLEDQHST LQSDANKWRG ELDAALREND ILKSNNTNME 1321 TDLTRLKNRL KSAEDALKEL KNSLSHAKTE KERLQNAFRE KTKQADHLNQ LASQFDTKLT 1381 KLRNELQDTN DKLITSDTER NALRNELQKL SQELKFGNEQ IQRKSDEYQT TIDDLAHSHR 1441 VSEDSRLNAL QELEARKYEI NDLTSRLDST EQRLATLQQD YIKADSERDI LSDALRRFQS 1501 SANRVINFHT FVDGGAGYVD GVPGGTSVIG GGPSAQRSGA YDPSSGGVIG SGISGGPGGS 1561 DFGREIEIGR GDSDQSDVAY PRSVPFPPSA DFSSGRPGAA SAGGRVINNL DGTTTVNMNG 1621 GFDIANLEGT LQSLLNKIEK LEMERNELRD TLARMKKKTT ETHTTINQKE TRYRNIEDNL 1681 QDAEEERRAL ESRLQSAKTL LRSQEEALKQ RDEERRQMKS KMVAAELQAR GKEAQLRHLN 1741 EQLKNLRTDL DNAHTDIRSL RDKEEQWDSS RFQLETKMRE SDSDTNKYQL QIASFESERQ 1801 ILTEKIKELD GALRLSDSKV QDMKDDTDKL RRDLTKAESV ENELRKTIDI QSKTSHEYQL 1861 LKDQLLNTQN ELNGANNRKQ QLENELLNVR SEVRDYKQRV HDVNNRVSEL QRQLQDANTE 1921 KNRVEDRFLS VEKVVNTMRT TETDLRQQLE TAKNEKRVAT KELEDLKRRL AQLENERRNS 1981 SQLSDGWKKE KITLLKKIEL LENEKRRTDA AIRETALQRE AIEKSLNAME RENKELYKNC 2041 AQLQQQIAQL EMENGNRILE LTNKQREEQE RQLIRMRQEK GQIEKVIENR ERTHRNRIKQ 2101 LEDQIAILRD QLDGERRRRR EYVDRSMVND IGRLGSNVLG IRNSYGDNSI DAIINGGSRS 2161 VGFYPRSTFA SNPLTPPLGS STPTHRPHVT DFRSAVDAGS SYRRPISTIE DSGSVYGGSI 2221 RDRDSVYGGG GRDSSFGTRA GDSISRAGVE PRDPSLVEIP SNEPPMTTST HSQGGSRYDT 2281 LAPNRDDL //