LOCUS VVC12407.1 2293 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Lin-5 protein. ACCESSION BX284606-1110 PROTEIN_ID VVC12407.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="lfi-1" /locus_tag="CELE_ZC8.4" /standard_name="ZC8.4g" /note="Confirmed by transcript evidence" /db_xref="WormBase:WBGene00022500" intron_pos 43:2 (1/16) intron_pos 86:0 (2/16) intron_pos 116:0 (3/16) intron_pos 158:2 (4/16) intron_pos 280:0 (5/16) intron_pos 743:2 (6/16) intron_pos 1254:1 (7/16) intron_pos 1398:0 (8/16) intron_pos 1459:1 (9/16) intron_pos 1607:1 (10/16) intron_pos 1691:1 (11/16) intron_pos 1746:0 (12/16) intron_pos 2006:0 (13/16) intron_pos 2088:0 (14/16) intron_pos 2167:2 (15/16) intron_pos 2218:1 (16/16) BEGIN 1 MSTHDDTRSS GGGDGTIHIP SADAKEPTPE SPGSRSNYGE FYWDEYSRDT SGENNYHHTS 61 SFDDDLLLEI DSSRGAAVAR LDRTQDDLNK FRQRIDNNVE QQREYSEMMA ALQNKVHEYR 121 KHIAELEGRM VGARNRMLDD PTSNVMIFDN YDPGNTYITN HNVELWSPAR GKRETILGGG 181 GAPGLTTVNV HAGAGYSGSG VAGYGGGVQA MVGDPNANYE MIARLDEERR RSDEYRMQWE 241 NERQKSLSLE DENDRLRREF ERYANDSKDK EKTFINRERN LAQYLSDEQR KMLDLWTELQ 301 RVRKQFSDLK THTEEDLDKQ KAEFTRAIRN VNNISRNAAF SAGAGDGLGL YGLEDGGDVN 361 RTTNNYEKVF IETIKRMNGT GGAGSASSAD LLEELRKIRG GGSSEGDAEL HKELMTKYEE 421 SIERNIELES RGDDSQRKIA ELEAELRRNR EKLNEAQGAL KKLHEMAQDS EKNVDGTVSI 481 KRTRSLSPGK TPLPPSEALR AVRNTFRNKD NDIQQLERKL KIAESQVKEF LNKFENADEA 541 RRRLDKQFAD AKREISNLQK SVDEAERNSR RTDDKLRASE AERVAAEKAR KFLEDELAKL 601 QASFQKSSTD DARKLRDEMD EHTNSIQEEF KTRIDELNRR VENLLRENNR LKSEVNPLKD 661 KYRDLENEYN STQRRIEEKE TQIRYSDDIR RNIQKDLDDL REKYDRVHTD NEKILGELEH 721 AQKAAHLAEQ QLKEIKIQRD DYQKQKDEHA RHLFDIRHKL ETEIKGRQDL EKNGARNNDE 781 LDKLRQTISD YESQINLLRR HNDELDTTIK GHQGKITHLE NELHSRSGEI EKLNDLNQRL 841 QKEKQDILNQ KLKLDGDVQA LKETIRKLEN ELEKLRNENK ELVGKEARAR DAANQQLSRA 901 NLLNKELEDT KQDLKHSTDV NKQLEQDIRD LKERLANIGK GGRISRDSTT GTDGGAFGDR 961 SSVADPSRTR GAAGSTVFVP AAEDIESRGG GEIDIPSSGD VIHGRDGRDG RDAGNRGTHT 1021 ITNTKERIER IEKNILDRYH DDELVEHKIR EVNDRWKREL ERLENEKDDL ERRIRELEDE 1081 LSQIGRGNDK TENDITELKR KHAAEIDKLK SDISALHDKH LSDLDDEKEQ YGKAVENLKS 1141 VEDDLRDKLN NLEKQLADSL NRENELEREK RDYDEKINSL YGQNQKIKDE WDDFRNDADK 1201 EIQKWKTDAY TVRSEAKALE TTNTALKAQL QAANDRIDHL TKTVNDHTSK VRDLTSQVRH 1261 LEDELADTKG NLVQKEMDLE STQNRLRSLE DQHSTLQSDA NKWRGELDAA LRENDILKSN 1321 NTNMETDLTR LKNRLKSAED ALKELKNSLS HAKTEKERLQ NAFREKTKQA DHLNQLASQF 1381 DTKLTKLRNE LQDTNDKLIT SDTERNALRN ELQKLSQELK FGNEQIQRKS DEYQTTIDDL 1441 AHSHRVSEDS RLNALQELEA RKYEINDLTS RLDSTEQRLA TLQQDYIKAD SERDILSDAL 1501 RRFQSSANRV INFHTFVDGG AGYVDGVPGG TSVIGGGPSA QRSGAYDPSS GGVIGSGISG 1561 GPGGSDFGRE IEIGRGDSDQ SDVAYPRSVP FPPSADFSSG RPGAASAGGR VINNLDGTTT 1621 VNMNGGFDIA NLEGTLQSLL NKIEKLEMER NELRDTLARM KKKTTETHTT INQKETRYRN 1681 IEDNLQDAEE ERRALESRLQ SAKTLLRSQE EALKQRDEER RQMKSKMVAA ELQARGKEAQ 1741 LRHLNEQLKN LRTDLDNAHT DIRSLRDKEE QWDSSRFQLE TKMRESDSDT NKYQLQIASF 1801 ESERQILTEK IKELDGALRL SDSKVQDMKD DTDKLRRDLT KAESVENELR KTIDIQSKTS 1861 HEYQLLKDQL LNTQNELNGA NNRKQQLENE LLNVRSEVRD YKQRVHDVNN RVSELQRQLQ 1921 DANTEKNRVE DRFLSVEKVV NTMRTTETDL RQQLETAKNE KRVATKELED LKRRLAQLEN 1981 ERRNSSQLSD GWKKEKITLL KKIELLENEK RRTDAAIRET ALQREAIEKS LNAMERENKE 2041 LYKNCAQLQQ QIAQLEMENG NRILELTNKQ REEQERQLIR MRQEKGQIEK VIENRERTHR 2101 NRIKQLEDQI AILRDQLDGE RRRRREYVDR SMVNDIGRLG SNVLGIRNSY GDNSIDAIIN 2161 GGSRSVGFYP RSTFASNPLT PPLGSSTPTH RPHVTDFRSA VDAGSSYRRP ISTIEDSGSV 2221 YGGSIRDRDS VYGGGGRDSS FGTRAGDSIS RAGVEPRDPS LVEIPSNEPP MTTSTHSQGG 2281 SRYDTLAPNR DDL //