LOCUS       CCD67046.2              1554 aa    PRT              CON 06-FEB-2024
DEFINITION  Caenorhabditis elegans EGF-like domain-containing protein.
ACCESSION   BX284606-1239
PROTEIN_ID  CCD67046.2
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
            Caenorhabditis.
REFERENCE   1  (bases 1 to 17718942)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2  (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project:
            Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome
            Institute at Washington University, St. Louis, MO 63110, USA.
REFERENCE   3  (bases 1 to 17718942)
  AUTHORS   Sulston J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of integration
            and manual review of the following types of data: ab initio
            predictions by Genefinder (P. Green and L. Hillier, pers. comm.);
            alignments to published proteins and cDNAs; genome sequence
            conservation with other nematodes (e.g. to C. briggsae using WABA:
            Genome Res. 2000. 10:1115-1125); sequence features (such as
            trans-splice and polyA sites).
            Sources of data: large-scale EST projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome
            cloning project (http://worfdb.dfci.harvard.edu); RST large-scale
            sequencing project (Genome Res. 2009. 19:2334-2342); IST library
            (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010
            Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
            Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep
            sequencing data (454 read clusters - Makedonka Mitreva,
            unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66);
            Numerous data sets from the modENCODE project (Science. 2010.
            330:1775-87); Individual C. elegans Nucleotide Database
            submissions; Personal communications with C. elegans researchers.
            Non-Coding gene structures below are derived using the following
            methods and data: ab initio prediction of tRNAs by tRNAscan-SE
            (Nucl. Acids. Res., 25, 955-964); integration and appraisal of
            miRNAs from miRBase (http://www.mirbase.org); integration and
            appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell.
            2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87);
            manual curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="X"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="lgx-1"
                     /locus_tag="CELE_C54G7.3"
                     /standard_name="C54G7.3a"
                     /note="Confirmed by transcript evidence"
                     /db_xref="EnsemblGenomes-Gn:WBGene00002983"
                     /db_xref="EnsemblGenomes-Tr:C54G7.3a"
                     /db_xref="GOA:H2KZ94"
                     /db_xref="InterPro:IPR000742"
                     /db_xref="InterPro:IPR002509"
                     /db_xref="InterPro:IPR006149"
                     /db_xref="InterPro:IPR006150"
                     /db_xref="InterPro:IPR011330"
                     /db_xref="UniProtKB/TrEMBL:H2KZ94"
                     /db_xref="WormBase:WBGene00002983"
     intron_pos      16:1 (1/28)
     intron_pos      53:2 (2/28)
     intron_pos      92:1 (3/28)
     intron_pos      161:0 (4/28)
     intron_pos      295:1 (5/28)
     intron_pos      341:1 (6/28)
     intron_pos      377:0 (7/28)
     intron_pos      480:1 (8/28)
     intron_pos      524:1 (9/28)
     intron_pos      551:1 (10/28)
     intron_pos      580:0 (11/28)
     intron_pos      628:1 (12/28)
     intron_pos      678:1 (13/28)
     intron_pos      708:0 (14/28)
     intron_pos      748:2 (15/28)
     intron_pos      851:1 (16/28)
     intron_pos      888:1 (17/28)
     intron_pos      940:0 (18/28)
     intron_pos      967:1 (19/28)
     intron_pos      1056:1 (20/28)
     intron_pos      1105:1 (21/28)
     intron_pos      1150:1 (22/28)
     intron_pos      1196:1 (23/28)
     intron_pos      1238:2 (24/28)
     intron_pos      1277:2 (25/28)
     intron_pos      1329:0 (26/28)
     intron_pos      1417:0 (27/28)
     intron_pos      1493:0 (28/28)
BEGIN
        1 MRVLLAISVI YHITIAQQVG INVPCNKELD GLLTADPDGD ERAFMSCQGV GVGTIGFWER
       61 KLCPNNMVFD FINQHCKEQK KKARKQQNLS IAILNNSCAN GETCIGGSVC DLDTLRCMCP
      121 YGTTPKLDTL SCESSQPPFV SATVDGPAQF FNSFSGGNGN RNNQNNGFMP FGQPSELPDF
      181 QPNNNFKYGN NFGQNQNSNN NNNDFNWNPN FNWNNNQNNN NNNAPNKQAP MNNAGVGATT
      241 TPDFNPFQPN PNQQFVFNFN KNQGGQNNQN NHVNNNNHNQ NIPVLESKPK VATLVGIGAL
      301 CSDLAECDHG STCVMGRCTC VSPLVQHEGK CVLRQQQKIV VQVMTPPPTL PPVQVPIQTL
      361 PPQTRPPQLP PVQITQPPIQ TPVQPIFFQT TRPPMPTYAT TPATTQFIPK QIYTTPPQMP
      421 KNSIKINQMK IGGSKQAGVG VRCSLNTDCM IGAYCNGNTN PPSCQCLSTH VNIEGRCEKV
      481 IYPGQVGCRS DLQCHAAHSG THCIDRICVC PEGQRAVDQT CVSASEPVTS PPGGSCSNLR
      541 ECTGGSVCRE GWCICPDPSM IVNRGICIQS GPKPTLPPRT PIPQVPLPPQ LPISVHVPQV
      601 TITKAQPFIT EAPLAPQGKK IVPGGRCGPI DVCVGGSNCI EGFCLCPAGQ QPSNSGRCEK
      661 FTTTSRQTTL PSTTTTQATT ARLYSKPGES CTQGQTCVGG SACSFRKLCE CPQDKSEISQ
      721 GQCVTPRKLE VVPGASCNAN TVCTKGSTCE SGLCRCQPGY IAVSGNCVAL PMSTTPKMRV
      781 IAKPLESCEN GETCEGGSNC DYDTGICMCP PGQIVFNVQC MPPPTQPQIT TRVTTPVKAA
      841 PVVTPKPIHS TDCEIDANCG ENKICVSGKC KCKPGFVDNS GTCEPLEGIR GRSLRYGQAC
      901 DRYQDKCING AKCIDKICKC APGFLLAPNG WCEGFDLAKA IEINKQSIGT TPPVKANRVI
      961 QTKMQQVYTT SSTISSTTTS PAPTQKPRIV PKDFPTPPSN IFGFPDRRTT KIVPKVAAVG
     1021 SACRPIDICL GESVCTNGFC HCPENYIRQN GQCISKECKH KVAKVGETCK NGEICAGGSI
     1081 CDYDRKRCIC AAQHVAIRGI CKQKSAPAFA APGDTCSMRE KCTGGATCFE GMCTCDDHHF
     1141 AEDGYCRPIE ARSSKVQFVN GAGLRFSSMQ VPNRQRAQAC NEAECKLPNC FCTENGRRAP
     1201 GGLRPDETPQ FVVLTFDDAV NGKTFSDYKK LFENDVLKNP NGCDVKATFF ISHEWTNYDA
     1261 VNWLVQKNME IASNSISHES LENANTNRWL NEMDGQRRIL AKFGGAPEEE IVGIRSPQLA
     1321 LGGDNQFEMM IGAEFLWDNS MSANPGIHGE PFWPQTMDYQ VAWDCNEASC PKSSFPGVWS
     1381 VPLNQFYGSY MSQIDSFRRS SMLRAAVDLN NTVDELEEII TRNFERSYSA NRAPYVLSLN
     1441 ADFLQLGGHN KGMKAVQKFL NRMSAQKDVY IVTIKQLIDW MKRPVPISEM KSSKAVGCPI
     1501 TLSFNRNPSL STCDIPNKCL YSTPSLSSQE HQFLTCLPCP TMYPWLENPA GGIV
//