LOCUS       VVC12406.1 2401 aa PRT CON 06-FEB-2024
DEFINITION  Caenorhabditis elegans Lin-5 (Five) Interacting protein.
ACCESSION   BX284606-1108
PROTEIN_ID  VVC12406.1
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
            Caenorhabditis.
REFERENCE   1 (bases 1 to 17718942)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2 (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project:
            Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome
            Institute at Washington University, St. Louis, MO 63110, USA.
REFERENCE   3 (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of integration
            and manual review of the following types of data: ab initio
            predictions by Genefinder (P. Green and L. Hillier, pers. comm.);
            alignments to published proteins and cDNAs; genome sequence
            conservation with other nematodes (e.g. to C. briggsae using WABA:
            Genome Res. 2000. 10:1115-1125); sequence features (such as
            trans-splice and polyA sites). Sources of data: large-scale EST
            projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome
            cloning project (http://worfdb.dfci.harvard.edu); RST large-scale
            sequencing project (Genome Res. 2009. 19:2334-2342); IST library
            (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010
            Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
            Piano F.
            2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing
            data (454 read clusters - Makedonka Mitreva, unpublished; Illumina
            sequence data, Genome Res. 2009. 19:657-66); Numerous data sets
            from the modENCODE project (Science. 2010. 330:1775-87); Individual
            C. elegans Nucleotide Database submissions; Personal communications
            with C. elegans researchers; Non-Coding gene structures below are
            derived using the following methods and data: ab initio prediction
            of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964);
            integration and appraisal of miRNAs from miRBase
            (http://www.mirbase.org); integration and appraisal of RFAM
            predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006.
            127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual
            curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="X"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="lfi-1"
                     /locus_tag="CELE_ZC8.4"
                     /standard_name="ZC8.4f"
                     /note="Confirmed by transcript evidence"
                     /db_xref="WormBase:WBGene00022500"
     intron_pos      105:1 (1/17)
     intron_pos      151:2 (2/17)
     intron_pos      194:0 (3/17)
     intron_pos      224:0 (4/17)
     intron_pos      266:2 (5/17)
     intron_pos      388:0 (6/17)
     intron_pos      851:2 (7/17)
     intron_pos      1362:1 (8/17)
     intron_pos      1506:0 (9/17)
     intron_pos      1567:1 (10/17)
     intron_pos      1715:1 (11/17)
     intron_pos      1799:1 (12/17)
     intron_pos      1854:0 (13/17)
     intron_pos      2114:0 (14/17)
     intron_pos      2196:0 (15/17)
     intron_pos      2275:2 (16/17)
     intron_pos      2326:1 (17/17)
BEGIN
        1 MFFSCGVNSS SPGRQLVVSE SVSEDNLHNP SSHALRTLSP VSPPTANSSS STSRSCSPVV
       61 RAETIVAGKW THISGKVSVR PLTPSETESV LDRQRQKNRR SSRKDKLKMS THDDTRSSGG
      121 GDGTIHIPSA DAKEPTPESP GSRSNYGEFY WDEYSRDTSG ENNYHHTSSF DDDLLLEIDS
      181 SRGAAVARLD RTQDDLNKFR QRIDNNVEQQ REYSEMMAAL QNKVHEYRKH IAELEGRMVG
      241 ARNRMLDDPT SNVMIFDNYD PGNTYITNHN VELWSPARGK RETILGGGGA PGLTTVNVHA
      301 GAGYSGSGVA GYGGGVQAMV GDPNANYEMI ARLDEERRRS DEYRMQWENE RQKSLSLEDE
      361 NDRLRREFER YANDSKDKEK TFINRERNLA QYLSDEQRKM LDLWTELQRV RKQFSDLKTH
      421 TEEDLDKQKA EFTRAIRNVN NISRNAAFSA GAGDGLGLYG LEDGGDVNRT TNNYEKVFIE
      481 TIKRMNGTGG AGSASSADLL EELRKIRGGG SSEGDAELHK ELMTKYEESI ERNIELESRG
      541 DDSQRKIAEL EAELRRNREK LNEAQGALKK LHEMAQDSEK NVDGTVSIKR TRSLSPGKTP
      601 LPPSEALRAV RNTFRNKDND IQQLERKLKI AESQVKEFLN KFENADEARR RLDKQFADAK
      661 REISNLQKSV DEAERNSRRT DDKLRASEAE RVAAEKARKF LEDELAKLQA SFQKSSTDDA
      721 RKLRDEMDEH TNSIQEEFKT RIDELNRRVE NLLRENNRLK SEVNPLKDKY RDLENEYNST
      781 QRRIEEKETQ IRYSDDIRRN IQKDLDDLRE KYDRVHTDNE KILGELEHAQ KAAHLAEQQL
      841 KEIKIQRDDY QKQKDEHARH LFDIRHKLET EIKGRQDLEK NGARNNDELD KLRQTISDYE
      901 SQINLLRRHN DELDTTIKGH QGKITHLENE LHSRSGEIEK LNDLNQRLQK EKQDILNQKL
      961 KLDGDVQALK ETIRKLENEL EKLRNENKEL VGKEARARDA ANQQLSRANL LNKELEDTKQ
     1021 DLKHSTDVNK QLEQDIRDLK ERLANIGKGG RISRDSTTGT DGGAFGDRSS VADPSRTRGA
     1081 AGSTVFVPAA EDIESRGGGE IDIPSSGDVI HGRDGRDGRD AGNRGTHTIT NTKERIERIE
     1141 KNILDRYHDD ELVEHKIREV NDRWKRELER LENEKDDLER RIRELEDELS QIGRGNDKTE
     1201 NDITELKRKH AAEIDKLKSD ISALHDKHLS DLDDEKEQYG KAVENLKSVE DDLRDKLNNL
     1261 EKQLADSLNR ENELEREKRD YDEKINSLYG QNQKIKDEWD DFRNDADKEI QKWKTDAYTV
     1321 RSEAKALETT NTALKAQLQA ANDRIDHLTK TVNDHTSKVR DLTSQVRHLE DELADTKGNL
     1381 VQKEMDLEST QNRLRSLEDQ HSTLQSDANK WRGELDAALR ENDILKSNNT NMETDLTRLK
     1441 NRLKSAEDAL KELKNSLSHA KTEKERLQNA FREKTKQADH LNQLASQFDT KLTKLRNELQ
     1501 DTNDKLITSD TERNALRNEL QKLSQELKFG NEQIQRKSDE YQTTIDDLAH SHRVSEDSRL
     1561 NALQELEARK YEINDLTSRL DSTEQRLATL QQDYIKADSE RDILSDALRR FQSSANRVIN
     1621 FHTFVDGGAG YVDGVPGGTS VIGGGPSAQR SGAYDPSSGG VIGSGISGGP GGSDFGREIE
     1681 IGRGDSDQSD VAYPRSVPFP PSADFSSGRP GAASAGGRVI NNLDGTTTVN MNGGFDIANL
     1741 EGTLQSLLNK IEKLEMERNE LRDTLARMKK KTTETHTTIN QKETRYRNIE DNLQDAEEER
     1801 RALESRLQSA KTLLRSQEEA LKQRDEERRQ MKSKMVAAEL QARGKEAQLR HLNEQLKNLR
     1861 TDLDNAHTDI RSLRDKEEQW DSSRFQLETK MRESDSDTNK YQLQIASFES ERQILTEKIK
     1921 ELDGALRLSD SKVQDMKDDT DKLRRDLTKA ESVENELRKT IDIQSKTSHE YQLLKDQLLN
     1981 TQNELNGANN RKQQLENELL NVRSEVRDYK QRVHDVNNRV SELQRQLQDA NTEKNRVEDR
     2041 FLSVEKVVNT MRTTETDLRQ QLETAKNEKR VATKELEDLK RRLAQLENER RNSSQLSDGW
     2101 KKEKITLLKK IELLENEKRR TDAAIRETAL QREAIEKSLN AMERENKELY KNCAQLQQQI
     2161 AQLEMENGNR ILELTNKQRE EQERQLIRMR QEKGQIEKVI ENRERTHRNR IKQLEDQIAI
     2221 LRDQLDGERR RRREYVDRSM VNDIGRLGSN VLGIRNSYGD NSIDAIINGG SRSVGFYPRS
     2281 TFASNPLTPP LGSSTPTHRP HVTDFRSAVD AGSSYRRPIS TIEDSGSVYG GSIRDRDSVY
     2341 GGGGRDSSFG TRAGDSISRA GVEPRDPSLV EIPSNEPPMT TSTHSQGGSR YDTLAPNRDD
     2401 L
//