LOCUS CCD70137.1 2229 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans C-type LECtin protein. ACCESSION BX284606-687 PROTEIN_ID CCD70137.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulson J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="F28B4.3" /locus_tag="CELE_F28B4.3" /standard_name="F28B4.3" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00017892" /db_xref="EnsemblGenomes-Tr:F28B4.3.1" /db_xref="EnsemblGenomes-Tr:F28B4.3.2" /db_xref="InterPro:IPR000742" /db_xref="InterPro:IPR001304" /db_xref="InterPro:IPR002035" /db_xref="InterPro:IPR006582" /db_xref="InterPro:IPR013032" /db_xref="InterPro:IPR016186" /db_xref="InterPro:IPR016187" /db_xref="InterPro:IPR036465" /db_xref="PDB:1WK1" /db_xref="UniProtKB/TrEMBL:Q19853" /db_xref="WormBase:WBGene00017892" intron_pos 61:0 (1/22) intron_pos 98:1 (2/22) intron_pos 306:0 (3/22) intron_pos 326:1 (4/22) intron_pos 383:1 (5/22) intron_pos 407:1 (6/22) intron_pos 471:1 (7/22) intron_pos 510:2 (8/22) intron_pos 551:0 (9/22) intron_pos 599:0 (10/22) intron_pos 1535:1 (11/22) intron_pos 1598:1 (12/22) intron_pos 1637:2 (13/22) intron_pos 1684:0 (14/22) intron_pos 1728:0 (15/22) intron_pos 1790:1 (16/22) intron_pos 1905:0 (17/22) intron_pos 1957:1 (18/22) intron_pos 2018:0 (19/22) intron_pos 2046:0 (20/22) intron_pos 2151:2 (21/22) intron_pos 2219:2 (22/22) BEGIN 1 MRSWVLIAAL AVICLGAEPE LSHKERIRKV LKSWNPANSN QLFHPVSEQK IQFDEQSDLF 61 VDTHHISKRS IAEPHVFAGM ATRGCNKPGY TGATCQYPLC SARNPYIPDN KDSDDISIDA 121 TNLANCSQTY VVVVDETMRN IKIEVETESP LNPTFYLQSE SGDLIFPDTD RKTVTSYVAT 181 YETLAPGQYL LGPRADSGDE FCTMMMTAHT NIQVTGGFTS GDQPERSDYP TLKFAYFDTE 241 SAVVLHAQGL HFPGQIQAIG FTGAENHISR YIPMATRFNC TYPYILERYT CRKIGNNDIG 301 HNLLQVEGMS DNGYVFRRIL PYQCILPPVS TTTVPAPTTT AAPLTTCQNG GQVLKDSSGS 361 PYCYCFGLYT GRDCSQMLCA NGGFLPTPTS EHCECPEGFT GFHCQNIVCP GASGIDFNAE 421 NPTVTLVIRS RSQLSDVIQQ ATNSVSRIVD ELSAEPGYLT NFIVVLFDNA KLLVNQRYDS 481 WDAAMVDLLK AINSAPSDGG CDDVVFSAVA SALSLYPTNK SPIYVITDAN PNDSTEKQTI 541 VHLESYWRAP VYFIYVQPAI GSGCNTSPDS AGYRDMVDMA AMSSGNTFYF NNRTTISNFF 601 YVHMYNTLFR SQLALSGDYS HCANQNIYKS VAVDVTADQV VVVATGSNLK LQVTTPTGAH 661 PDFFVAFNDG VNYIWTSNQI FAGQWFFNLV SDSPNSACTF KVYQKKYNLG GMSQYNPDYD 721 IFWSFATTLT SAAGVLRQPV AGFDASPVFH VSNYPEFISM DRVHANLQIY AIRDGVQTEV 781 YGSSGMYRDA CEFHFYFPPF ICNVPDEVLY FNFFARDNND MALQRAGTML CSAVHPTPPP 841 QHQCQNGGVM NPTNTTCFCT PQFTGTYCQN IVCYNGGTVS GGQCVCPPGY AGESCEVPRC 901 IETGPNPEFI RYGVDMVFAV EITQQSLASI VMLDNNFQEI LRDVQMQDRG WIRNFVLVGF 961 NSTWGGPIAQ SPSNNLTAII AALHNLATNV PSDNGCSVKL WDALNHAIFA RDLVPGSFIE 1021 IFQTTPEYEL DQRSLGLFYD MSRAMDISLY GFLTAKPTLL PAGFVCNATQ VNYYVLFGMV 1081 TSSTGQTYIL QALEISNAIR LIPIQFSNGQ VTINGNNDCR HEDGLTTYFP VDAYTQTIQL 1141 TVFGYGTTIQ VYNGNGVLAE ALELFYDDYT GQSVYEIRQA CDNGWESFGQ YCVKFLTVND 1201 DILSMPQARN FCASAGGYLA DDLGDDKNNF YSSIAANTQF WIGLFKNSDG QFYWDRGQGI 1261 NPDLLNQPIT YWANGEPSND PTRQCVYFDG RSGDKSKVWT TDTCATPRPF ICQKHRYDSD 1321 HKPNTIGDAD LPAGDWYVKI KTNPTNSNPP YCSLSVRVQS SLQIVSGFAT KIGDDNPQID 1381 PIQDFSSNRL ITYVHSVDNE NRVPIMTDAI LWDFYNGTFY NGLKYQARFG CQYGWVSQDF 1441 PCPNSDNQNN EFGVLHVGED EFGNTFQRLT FGHCSPATIV CGNGGIRQNG QCICTDYWTG 1501 SRCTVPICVN GGTKNSDEAT CTCPDGYAGL NCQFEVCQPK VPQIFTDDTK TLLFVVETTR 1561 QNSDTVNQLI ANLKNIVTSA TNFAPFWFSY FGLVTFDTTG RTFEKYNYTT IDALITDLTA 1621 QSTAISTDGA CSMPYLGVLA HLLEHDNVIS IPNSEIFLVT AAGPSDLNKY GEAMNSLFNT 1681 EAHLHYIVSK SANCPTFEGV NNVQDMTWLG YGSSGNILFT DSSNIVSLMN SYLPSLYGAS 1741 ILQDPTGPAN YSCTDGSLPW FVPVDSNTTF IYVTTSSEFG SLSVKDPLGQ AHNVAPAYNV 1801 NSQKFYKIEV DRLGGIWTLQ LVQPPGLCLA HIYSTGGARV YTKFSLPNPV GGKEDPLGAH 1861 QDGRFVQPVS GFDNVAVFHI AGKPMQRGQL QYVEIFDIGQ VTVTNVLRSE LYRREGCSYE 1921 YYSDLFTCSG DMIAVFIHGV DEYNQKFRRQ QIVVCNGRNP TTGQPMTGTM VPVTGSMAPV 1981 TQATQQTQGP VTQQTQGPIT QATQPPQTVQ TQAPVTPTQN PQTGLQFDIV FLIDGSQAAQ 2041 QNFDSFTKFI QTMMVSFDVG IAGAHVGLVV VAADLNDQAP PVANLNAITS QQMLISYLNG 2101 LKDGYTDFDD AGQVLTYNLQ VVSSTDYMAA TAGYRAGISN HVIVYITSTT SFFTDPTPSA 2161 KTIIAQKKYG IITVGYGGAV DTGKLQTISG GSACSFTATD FTTLNNQIKP IQQLITAAST 2221 NGGNYCKST //