LOCUS       CCD73132.2 2392 aa PRT CON 06-FEB-2024
DEFINITION  Caenorhabditis elegans Lin-5 protein.
ACCESSION   BX284606-1106
PROTEIN_ID  CCD73132.2
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
            Caenorhabditis.
REFERENCE   1 (bases 1 to 17718942)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2 (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project:
            Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome
            Institute at Washington University, St. Louis, MO 63110, USA.
REFERENCE   3 (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of integration
            and manual review of the following types of data: ab initio
            predictions by Genefinder (P. Green and L. Hillier, pers. comm.);
            alignments to published proteins and cDNAs; genome sequence
            conservation with other nematodes (e.g. to C. briggsae using WABA:
            Genome Res. 2000. 10:1115-1125); sequence features (such as
            trans-splice and polyA sites). Sources of data: large-scale EST
            projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome
            cloning project (http://worfdb.dfci.harvard.edu); RST large-scale
            sequencing project (Genome Res. 2009. 19:2334-2342); IST library
            (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010
            Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
            Piano F.
            2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing
            data (454 read clusters - Makedonka Mitreva, unpublished; Illumina
            sequence data, Genome Res. 2009. 19:657-66); Numerous data sets
            from the modENCODE project (Science. 2010. 330:1775-87); Individual
            C. elegans Nucleotide Database submissions; Personal communications
            with C. elegans researchers; Non-Coding gene structures below are
            derived using the following methods and data: ab initio prediction
            of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964);
            integration and appraisal of miRNAs from miRBase
            (http://www.mirbase.org); integration and appraisal of RFAM
            predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006.
            127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual
            curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="X"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="lfi-1"
                     /locus_tag="CELE_ZC8.4"
                     /standard_name="ZC8.4e"
                     /note="Partially confirmed by transcript evidence"
                     /db_xref="EnsemblGenomes-Gn:WBGene00022500"
                     /db_xref="EnsemblGenomes-Tr:ZC8.4e"
                     /db_xref="GOA:H2L0I4"
                     /db_xref="UniProtKB/TrEMBL:H2L0I4"
                     /db_xref="WormBase:WBGene00022500"
     intron_pos      66:1 (1/18)
     intron_pos      112:2 (2/18)
     intron_pos      142:2 (3/18)
     intron_pos      185:0 (4/18)
     intron_pos      215:0 (5/18)
     intron_pos      257:2 (6/18)
     intron_pos      379:0 (7/18)
     intron_pos      842:2 (8/18)
     intron_pos      1353:1 (9/18)
     intron_pos      1497:0 (10/18)
     intron_pos      1558:1 (11/18)
     intron_pos      1706:1 (12/18)
     intron_pos      1790:1 (13/18)
     intron_pos      1845:0 (14/18)
     intron_pos      2105:0 (15/18)
     intron_pos      2187:0 (16/18)
     intron_pos      2266:2 (17/18)
     intron_pos      2317:1 (18/18)
BEGIN
        1 MSDPQVEQDD RDRAYSFMSS DSGVDTTPHG RVVREKPLKS TTTYEKYPPS GFASESTKSI
       61 SFSSPDKLKM STHDDTRSSG GGDGTIHIPS ADAKEPTPES PGSRSNYGEF YCPTTDSLKT
      121 YSYTIHMKKS EEEIHDFDMG SWDEYSRDTS GENNYHHTSS FDDDLLLEID SSRGAAVARL
      181 DRTQDDLNKF RQRIDNNVEQ QREYSEMMAA LQNKVHEYRK HIAELEGRMV GARNRMLDDP
      241 TSNVMIFDNY DPGNTYITNH NVELWSPARG KRETILGGGG APGLTTVNVH AGAGYSGSGV
      301 AGYGGGVQAM VGDPNANYEM IARLDEERRR SDEYRMQWEN ERQKSLSLED ENDRLRREFE
      361 RYANDSKDKE KTFINRERNL AQYLSDEQRK MLDLWTELQR VRKQFSDLKT HTEEDLDKQK
      421 AEFTRAIRNV NNISRNAAFS AGAGDGLGLY GLEDGGDVNR TTNNYEKVFI ETIKRMNGTG
      481 GAGSASSADL LEELRKIRGG GSSEGDAELH KELMTKYEES IERNIELESR GDDSQRKIAE
      541 LEAELRRNRE KLNEAQGALK KLHEMAQDSE KNVDGTVSIK RTRSLSPGKT PLPPSEALRA
      601 VRNTFRNKDN DIQQLERKLK IAESQVKEFL NKFENADEAR RRLDKQFADA KREISNLQKS
      661 VDEAERNSRR TDDKLRASEA ERVAAEKARK FLEDELAKLQ ASFQKSSTDD ARKLRDEMDE
      721 HTNSIQEEFK TRIDELNRRV ENLLRENNRL KSEVNPLKDK YRDLENEYNS TQRRIEEKET
      781 QIRYSDDIRR NIQKDLDDLR EKYDRVHTDN EKILGELEHA QKAAHLAEQQ LKEIKIQRDD
      841 YQKQKDEHAR HLFDIRHKLE TEIKGRQDLE KNGARNNDEL DKLRQTISDY ESQINLLRRH
      901 NDELDTTIKG HQGKITHLEN ELHSRSGEIE KLNDLNQRLQ KEKQDILNQK LKLDGDVQAL
      961 KETIRKLENE LEKLRNENKE LVGKEARARD AANQQLSRAN LLNKELEDTK QDLKHSTDVN
     1021 KQLEQDIRDL KERLANIGKG GRISRDSTTG TDGGAFGDRS SVADPSRTRG AAGSTVFVPA
     1081 AEDIESRGGG EIDIPSSGDV IHGRDGRDGR DAGNRGTHTI TNTKERIERI EKNILDRYHD
     1141 DELVEHKIRE VNDRWKRELE RLENEKDDLE RRIRELEDEL SQIGRGNDKT ENDITELKRK
     1201 HAAEIDKLKS DISALHDKHL SDLDDEKEQY GKAVENLKSV EDDLRDKLNN LEKQLADSLN
     1261 RENELEREKR DYDEKINSLY GQNQKIKDEW DDFRNDADKE IQKWKTDAYT VRSEAKALET
     1321 TNTALKAQLQ AANDRIDHLT KTVNDHTSKV RDLTSQVRHL EDELADTKGN LVQKEMDLES
     1381 TQNRLRSLED QHSTLQSDAN KWRGELDAAL RENDILKSNN TNMETDLTRL KNRLKSAEDA
     1441 LKELKNSLSH AKTEKERLQN AFREKTKQAD HLNQLASQFD TKLTKLRNEL QDTNDKLITS
     1501 DTERNALRNE LQKLSQELKF GNEQIQRKSD EYQTTIDDLA HSHRVSEDSR LNALQELEAR
     1561 KYEINDLTSR LDSTEQRLAT LQQDYIKADS ERDILSDALR RFQSSANRVI NFHTFVDGGA
     1621 GYVDGVPGGT SVIGGGPSAQ RSGAYDPSSG GVIGSGISGG PGGSDFGREI EIGRGDSDQS
     1681 DVAYPRSVPF PPSADFSSGR PGAASAGGRV INNLDGTTTV NMNGGFDIAN LEGTLQSLLN
     1741 KIEKLEMERN ELRDTLARMK KKTTETHTTI NQKETRYRNI EDNLQDAEEE RRALESRLQS
     1801 AKTLLRSQEE ALKQRDEERR QMKSKMVAAE LQARGKEAQL RHLNEQLKNL RTDLDNAHTD
     1861 IRSLRDKEEQ WDSSRFQLET KMRESDSDTN KYQLQIASFE SERQILTEKI KELDGALRLS
     1921 DSKVQDMKDD TDKLRRDLTK AESVENELRK TIDIQSKTSH EYQLLKDQLL NTQNELNGAN
     1981 NRKQQLENEL LNVRSEVRDY KQRVHDVNNR VSELQRQLQD ANTEKNRVED RFLSVEKVVN
     2041 TMRTTETDLR QQLETAKNEK RVATKELEDL KRRLAQLENE RRNSSQLSDG WKKEKITLLK
     2101 KIELLENEKR RTDAAIRETA LQREAIEKSL NAMERENKEL YKNCAQLQQQ IAQLEMENGN
     2161 RILELTNKQR EEQERQLIRM RQEKGQIEKV IENRERTHRN RIKQLEDQIA ILRDQLDGER
     2221 RRRREYVDRS MVNDIGRLGS NVLGIRNSYG DNSIDAIING GSRSVGFYPR STFASNPLTP
     2281 PLGSSTPTHR PHVTDFRSAV DAGSSYRRPI STIEDSGSVY GGSIRDRDSV YGGGGRDSSF
     2341 GTRAGDSISR AGVEPRDPSL VEIPSNEPPM TTSTHSQGGS RYDTLAPNRD DL
//