LOCUS VTW47456.1 5077 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Ig-like domain-containing protein. ACCESSION BX284606-2299 PROTEIN_ID VTW47456.1 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F.
2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="him-4" /locus_tag="CELE_F15G9.4" /standard_name="F15G9.4f" /note="Confirmed by transcript evidence" /db_xref="GOA:A0A4V0IK01" /db_xref="UniProtKB/TrEMBL:A0A4V0IK01" /db_xref="WormBase:WBGene00001863" intron_pos 22:1 (1/60) intron_pos 237:0 (2/60) intron_pos 272:1 (3/60) intron_pos 379:2 (4/60) intron_pos 509:0 (5/60) intron_pos 566:1 (6/60) intron_pos 838:0 (7/60) intron_pos 879:1 (8/60) intron_pos 954:0 (9/60) intron_pos 1087:0 (10/60) intron_pos 1128:0 (11/60) intron_pos 1389:0 (12/60) intron_pos 1419:0 (13/60) intron_pos 1448:1 (14/60) intron_pos 1510:0 (15/60) intron_pos 1544:1 (16/60) intron_pos 1583:0 (17/60) intron_pos 1619:1 (18/60) intron_pos 1686:2 (19/60) intron_pos 1740:0 (20/60) intron_pos 1770:0 (21/60) intron_pos 2090:1 (22/60) intron_pos 2157:2 (23/60) intron_pos 2284:1 (24/60) intron_pos 2414:2 (25/60) intron_pos 2496:0 (26/60) intron_pos 2602:0 (27/60) intron_pos 2661:1 (28/60) intron_pos 2754:1 (29/60) intron_pos 2792:0 (30/60) intron_pos 2816:0 (31/60) intron_pos 2843:1 
(32/60) intron_pos 2870:0 (33/60) intron_pos 2894:0 (34/60) intron_pos 2935:1 (35/60) intron_pos 2986:0 (36/60) intron_pos 3027:1 (37/60) intron_pos 3052:0 (38/60) intron_pos 3095:2 (39/60) intron_pos 3117:1 (40/60) intron_pos 3160:1 (41/60) intron_pos 3567:1 (42/60) intron_pos 3625:0 (43/60) intron_pos 3666:1 (44/60) intron_pos 3712:1 (45/60) intron_pos 3759:1 (46/60) intron_pos 3945:1 (47/60) intron_pos 4089:1 (48/60) intron_pos 4127:1 (49/60) intron_pos 4263:1 (50/60) intron_pos 4310:1 (51/60) intron_pos 4358:1 (52/60) intron_pos 4400:1 (53/60) intron_pos 4550:0 (54/60) intron_pos 4740:1 (55/60) intron_pos 4781:1 (56/60) intron_pos 4843:1 (57/60) intron_pos 4913:1 (58/60) intron_pos 4979:1 (59/60) intron_pos 5012:1 (60/60) BEGIN 1 MRQLSKVYVH GGGDCPEKTL TGILKALQIS LPSSFIYVFT DARSKDYHLE DEVLNTIQEK 61 QSSVVFVMTG DCGNRTHPGF RTYEKIAAAS FGQVFHLEKS DVSTVLEYVR HAVKQKKVHL 121 MYEARERGGT VSRNIPVDKH LSELTISLSG DKDDSDNLDI VLRDPEGRTV DKRLYSKEGG 181 TIDLKNVKLI RLKDPSPGVW TVNTNSRLKH TIRVFGHGAV DFKYGFASRP LDRIELARPR 241 PVLNQDTYLL INMTGLIPPG TVGEIDLVDY HGHSLYKAVA SPHRTNPNMY FAGPFVPPKG 301 LFFVRVQGYD EDNYEFMRIA PTAIGSVIVG GPRAFMSPIH QEFVGRDLNL SCTVESASAY 361 TIYWVKTGED IIGGPLFYHN TDTSVWTIPE LSLKDAGEYE CRVISNNGNY SVKTRVETRE 421 SPPEIFGVRN VSVPLGEAAF LHCSTRSAGE VEIRWTRYGA TVFNGPNTER NPTNGTLKIH 481 HVTRADAGVY ECMARNAGGM STRKMRLDIM EPPSVKVTPQ DVYFNMREGV NLSCEAMGDP 541 KPEVHWYFKG RHLLNDYKYQ VGQDSKFLYI RDATHHDEGT YECRAMSQAG QARDTTDLML 601 ATPPKVEIIQ NKMMVGRGDR VSFECKTIRG KPHPKIRWFK NGKDLIKPDD YIKINEGQLH 661 IMGAKDEDAG AYSCVGENMA GKDVQVANLS VGRVPTIIES PHTVRVNIER QVTLQCLAVG 721 IPPPEIEWQK GNVLLATLNN PRYTQLADGN LLITDAQIED QGQFTCIARN TYGQQSQSTT 781 LMVTGLVSPV LGHVPPEEQL IEGQDLTLSC VVVLGTPKPS IVWIKDDKPV EEGPTIKIEG 841 GGSLLRLRGG NPKDEGKYTC IAVSPAGNST LHINVQLIKK PEFVYKPEGG IVFKPTISGM 901 DEKHVAVVNS THDVLDGEGF AIPCVVSGTP PPIITWYLDG RPITPNSRDF TVTADNTLIV 961 RKADKSYSGV YTCQATNSAG DNEQKTTIRI MNTPMISPGQ SSFNMVVDDL FTIPCDVYGD 1021 PKPVITWLLD DKPFTEGVVN EDGSLTIPNV NEAHRGTFTC HAQNAAGNDT RTVTLTVHTT 
1081 PTINAENQEK IALQNDDIVL ECPAKALPPP VRLWTYEGEK IDSQLIPHTI REDGALVLQN 1141 VKLENTGVFV CQVSNLAGED SLSYTLTVHE KPKIISEVPG VVDVVKGFTI EIPCRATGVP 1201 EVIRTWNKNG IDLKMDEKKF SVDNLGTLRI YEADKNDIGN YNCVVTNEAG TSQMTTHVDV 1261 QEPPIILPST QTNNTAVVGD RVELKCYVEA SPPASVTWFR RGIAIGTDTK GYVVESDGTL 1321 VIQSASVEDA TIYTCKASNP AGKAEANLQV TVIASPDIKD PDVVTQESIK ESHPFSLYCP 1381 VFSNPLPQIS WYLNDKPLID DKTSWKTSDD KRKLHVFKAK ITDSGVYKCV ARNAAGEGSK 1441 SFQVEVIVPL NLDESKYKKK VFAKEGEEVT LGCPVSGFPV PQINWVVDGT VVEPGKKYKG 1501 ATLSNDGLTL HFDSVSVKQE GNYHCVAQSK GNILDIDVEL SVLAVPIVGE DDNLEVFLGK 1561 DISLSCDLQT ESDDKTTFVW SINGSESDRP DNVQIPSDGH RLYITDAKPE NNGKYMCRVT 1621 NSAGKAERTL TLDVLEPPVF VEPVFEANQK LIGNNPIILQ CQVTGNPKPT VIWKIDGNDV 1681 DKSWLFDESL SLLRIEKLTG KSAQISCTAE NKAGTASRDF FIQNIAAPTF KNEGDQETIF 1741 RESETITLDC PVSLGDFQIT WMKQGLPLTE NDAIFTLDNT RLTILNANRD HEDIYTCVAN 1801 NTAGQVSKDF DVVVQVLPKI KNAVVTLEIN EGEEIILTCD AEGNPTPTAK WDFNQGDLPK 1861 EAVFVNNNHT VVVNNVTKYH TGVYKCYATN KVGQAVKTIN VHVRTKPRFE SGLTESELTV 1921 NLTRSITLEC DVDDAIGVGI SWTVNGKPFL AETDGVQTLA GGRFLHIVSA KTDDHGSYAC 1981 TVTNEAGVAT KTFNLFVQVP PTIVNEGGEY TVIENNSLVL PCEVTGKPNP VVTWTKDGRP 2041 VGDLKSVQVL SEGQQFKIVH AEIAHKGSYI CMAKNDVGTA EISFDVDIIT RPMIQKGIKN 2101 IVTAIKGGAL PFKCPIDDDK NFKGQIIWLR NYQPIDLEAE DARITRLSND RRLTILNVTE 2161 NDEGQYSCRV KNDAGENSFD FKATVLVPPT IIMLDKDKNK TAVEHSTVTL SCPATGKPEP 2221 DITWFKDGEA IHIENIADII PNGELNGNQL KITRIKEGDA GKYTCEADNS AGSVEQDVNV 2281 NVITIPKIEK DGIPSDYESQ QNERVVISCP VYARPPAKIT WLKAGKPLQS DKFVKTSANG 2341 QKLYLFKLRE TDSSKYTCIA TNEAGTDKRD FKVSMLVAPS FDEPNIVRRI TVNSGNPSTL 2401 HCPAKGSPSP TITWLKDGNA IEPNDRYVFF DAGRQLQISK TEGSDQGRYT CIATNSVGSD 2461 DLENTLEVII PPVIDGERRE AVAVIEGFSS ELFCDSNSTG VDVEWQKDGL TINQDTLRGD 2521 SFIQIPSSGK KMSFLSARKS DSGRYTCIVR NPAGEARKLF DFAVNDPPSI SDELSSANIQ 2581 TIVPYYPVEI NCVVSGSPHP KVYWLFDDKP LEPDSAAYEL TNNGETLKIV RSQVEHAGTY 2641 TCEAQNNVGK ARKDFLVRVT APPHFEKERE EVVARVGDTM LLTCNAESSV PLSSVYWHAH 2701 DESVQNGVIT SKYAANEKTL NVTNIQLDDE GFYYCTAVNE AGITKKFFKL IVIETPYFLD 2761 
QQKLYPIILG KRLTLDCSAT GTPPPTILFM KDGKRLNESD EVDIIGSTLV IDNPQKEVEG 2821 RYTCIAENKA GRSEKDMMVE VLLPPKLSKE WINVEVQAGD PLTLECPIED TSGVHITWSR 2881 QFGKDGQLDM RAQSSSDKSK LYIMQATPED ADSYSCIAVN DAGGAEAVFQ VTVNTPPKIF 2941 GDSFSTTEIV ADTTLEIPCR TEGIPPPEIS WFLDGKPILE MPGVTYKQGD LSLRIDNIKP 3001 NQEGRYTCVA ENKAGRAEQD TYVEISEPPR VVMASEVMRV VEGRQTTIRC EVFGNPEPVV 3061 NWLKDGEPYT SDLLQFSTKL SYLHLRETTL ADGGTYTCIA TNKAGESQTT TDVEVLVPPR 3121 IEDEERVLQG KEGNTYMVHC QVTGRPVPYV TWKRNGKEIE QFNPVLHIRN ATRADEGKYS 3181 CIASNEAGTA VADFLIDVFT KPTFETHETT FNIVEGESAK IECKIDGHPK PTISWLKGGR 3241 PFNMDNIILS PRGDTLMILK AQRFDGGLYT CVATNSYGDS EQDFKVNVYT KPYIDETIDQ 3301 TPKAVAGGEI ILKCPVLGNP TPTVTWKRGD DAVPNDSRHT IVNNYDLKIN SVTTEDAGQY 3361 SCIAVNEAGN LTTHYAAEVI GKPTFVRKGG NLYEVIENDT ITMDCGVTSR PLPSISWFRG 3421 DKPVYLYDRY SISPDGSHIT INKAKLSDGG KYICRASNEA GTSDIDLILK ILVPPKIDKS 3481 NIIGNPLAIV ARTIYLECPI SGIPQPDVIW TKNGMDINMT DSRVILAQNN ETFGIENVQV 3541 TDQGRYTCTA TNRGGKASHD FSLDVLSPPE FDIHGTQPTI KREGDTITLT CPIKLAEDIA 3601 DQVMDVSWTK DSRALDGDLT DNVDISDDGR KLTISQASLE NAGLYTCIAL NRAGEASLEF 3661 KVEILSPPVI DISRNDVQPQ VAVNQPTIMR CAVTGHPFPS IKWLKNGKEV TDDENIRIVE 3721 QGQVLQILRT DSDHAGKWSC VAENDAGVKE LEMVLDVFTP PVVSVKSDNP IKALGETITL 3781 FCNASGNPYP QLKWAKGGSL IFDSPDGARI SLKGARLDIP HLKKTDVGDY TCQALNAAGT 3841 SEASVSVDVL VPPEINRDGI DMSPRLPAQQ SLTLQCLAQG KPVPQMRWTL NGTALTHSTP 3901 GITVASDSTF IQINNVSLSD KGVYTCYAEN VAGSDNLMYN VDVVQAPVIS NGGTKQVIEG 3961 ELAVIECLVE GYPAPQVSWL RNGNRVETGV QGVRYVTDGR MLTIIEARSL DSGIYLCSAT 4021 NEAGSAQQAY TLEVLVSPKI ITSTPGVLTP SSGSKFSLPC AVRGYPDPII SWTLNGNDIK 4081 DGENGHTIGA DGTLHIEKAE ERHLIYECTA KNDAGADTLE FPVQTIVAPK ISTSGNRYIN 4141 GSEGTETVIK CEIESESSEF SWSKNGVPLL PSNNLIFSED YKLIKILSTR LSDQGEYSCT 4201 AANKAGNATQ KTNLNVGVAP KIMERPRTQV VHKGDQVTLW CEASGVPQPA ITWYKDNELL 4261 TNTGVDETAT TKKKSVIFSS ISPSQAGVYT CKAENWVAST EEDIDLIVMI PPEVVPERMN 4321 VSTNPRQTVF LSCNATGIPE PVISWMRDSN IAIQNNEKYQ ILGTTLAIRN VLPDDDGFYH 4381 CIAKSDAGQK IATRKLIVNK PSDRPAPIWV ECDEKGKPKK TEYMIDRGDT PDDNPQLLPW 4441 KDVEDSSLNG 
SIAYRCMPGP RSSRTVLLHA APQFIVKPKN TTAAIGAIVE LRCSAAGPPH 4501 PTITWAKDGK LIEDSKFEIA YSHLKVTLNS TSDSGEYTCM AQNSVGSSTV SAFINVDNNI 4561 LPTPKPSSNQ KNVAVITCYE RNQAYSRGLT WEYNGVPMPK NLAGIHFMNN GSLVILDTSS 4621 LKEGDLELYT CKVRNRRRHS IPHLTSAFEG VPEVKTIDKV EVNNGDSVVL DCEVTSDPLT 4681 THVVWTKNDQ KMLDDDAIYV LPNNSLVLLN VEKYDEGVYK CVASNSIGKA FDDTQLNVYG 4741 GSSRREAYKK ENEDASTTTI TTTSPTTTTT ETPLTTTIIP ALITLPAKQY PTDDYHEGSA 4801 NDDGFGPTTQ DSLFEFNPPL HPEISVVNTD CAGTINENGD CVDKDGKTHN LKILTGENHC 4861 PEGFAMNPHT RICEDLDECA FYQPCDFECI NYDGGFQCNC PLGYELAEEG CRDVNECESV 4921 RCEDGKACFN QLGGYECIDD PCPANYSLVD DRYCEPECEN CTSTPIQVHM LAIPSGLPIS 4981 HIATLTAYDK SGRVLNDTTY AISDTGAPLA RGRMTSGPFT IKAVKRGHAQ VWTNRVLAAG 5041 DHHKVRVRAH SDHATNELHA PKETNFLVLI NVGQYPF //