LOCUS CAB02662.3 2022 aa PRT CON 06-FEB-2024 DEFINITION Caenorhabditis elegans Serine/threonine-protein kinase TOR protein. ACCESSION BX284606-2335 PROTEIN_ID CAB02662.3 SOURCE Caenorhabditis elegans ORGANISM Caenorhabditis elegans Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida; Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis. REFERENCE 1 (bases 1 to 17718942) AUTHORS WormBase. CONSRTM WormBase Consortium JOURNAL Submitted (04-FEB-2024) to the INSDC. WormBase Group, European Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org REFERENCE 2 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. JOURNAL Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute at Washington University, St. Louis, MO 63110, USA. REFERENCE 3 (bases 1 to 17718942) AUTHORS Sulston J.E., Waterston R. CONSRTM Caenorhabditis elegans Sequencing Consortium TITLE Genome sequence of the nematode C. elegans: a platform for investigating biology JOURNAL Science 282(5396), 2012-2018(1998). COMMENT Annotated features correspond to WormBase release WS292. Protein-coding gene structures below are the result of integration and manual review of the following types of data: ab initio predictions by Genefinder (P. Green and L. Hillier, pers. comm.); alignments to published proteins and cDNAs; genome sequence conservation with other nematodes (e.g. to C. briggsae using WABA: Genome Res. 2000. 10:1115-1125); sequence features (such as trans-splice and polyA sites). Sources of data: large-scale EST projects of Yuji Kohara (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome cloning project (http://worfdb.dfci.harvard.edu); RST large-scale sequencing project (Genome Res. 2009. 19:2334-2342); IST library (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010 Unpublished); UTRome EST data submission (UTRome v1 Mangone M. Piano F.
2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep sequencing data (454 read clusters - Makedonka Mitreva, unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66); Numerous data sets from the modENCODE project (Science. 2010. 330:1775-87); Individual C. elegans Nucleotide Database submissions; Personal communications with C. elegans researchers; Non-Coding gene structures below are derived using the following methods and data: ab initio prediction of tRNAs by tRNAscan-SE (Nucl. Acids. Res., 25, 955-964); integration and appraisal of miRNAs from miRBase (http://www.mirbase.org); integration and appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell. 2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87); manual curation of novel published ncRNAs from the literature. FEATURES Qualifiers source /organism="Caenorhabditis elegans" /chromosome="X" /strain="Bristol N2" /mol_type="genomic DNA" /db_xref="taxon:6239" protein /transl_table=1 /gene="F21G4.6" /locus_tag="CELE_F21G4.6" /standard_name="F21G4.6" /note="Confirmed by transcript evidence" /db_xref="EnsemblGenomes-Gn:WBGene00009027" /db_xref="EnsemblGenomes-Tr:F21G4.6" /db_xref="InterPro:IPR028426" /db_xref="UniProtKB/TrEMBL:Q93547" /db_xref="WormBase:WBGene00009027" intron_pos 49:0 (1/27) intron_pos 85:0 (2/27) intron_pos 130:2 (3/27) intron_pos 183:0 (4/27) intron_pos 221:0 (5/27) intron_pos 266:0 (6/27) intron_pos 323:0 (7/27) intron_pos 425:1 (8/27) intron_pos 465:1 (9/27) intron_pos 558:1 (10/27) intron_pos 631:0 (11/27) intron_pos 746:0 (12/27) intron_pos 791:0 (13/27) intron_pos 948:0 (14/27) intron_pos 989:1 (15/27) intron_pos 1041:2 (16/27) intron_pos 1131:2 (17/27) intron_pos 1236:1 (18/27) intron_pos 1273:0 (19/27) intron_pos 1400:1 (20/27) intron_pos 1580:0 (21/27) intron_pos 1616:0 (22/27) intron_pos 1695:1 (23/27) intron_pos 1793:2 (24/27) intron_pos 1868:1 (25/27) intron_pos 1919:2 (26/27) intron_pos 1977:0 (27/27) BEGIN 1 MTSTVLAQIT DELSTNVSLD RELNLLKSAH KIIDKNLKVN 
VLTDRNDTYY RYVHRLMNLT 61 FISLEKNNAD IRLAAEAYYE HILKAYECYG FPETSLKILA MSIEKMNGAR QATKMIKYLV 121 YYLDILPKQH APEKSGRTFY QEVVKNFILK SLNINQTICH QALEKWCGKL ISSVVWQNAD 181 LKEITEKALQ KLWSTRGHAS RTMAAVIGAV IEESKEMFRV MFNTHLDKVI ETFVSNQVVP 241 VGTISMIRKS IHIYISGISV DSLKGLIEML LLMIRDSSNE VAYDAFATLE ELMKVEIPQM 301 DGFVPGNFLK LKYGESISKP EEHVEDVEFV DPLVNADLYE QLQETKPPPK IDLLLEDFEK 361 MSLNANRSVT NYVVAFLGKT FLLTGSEKLL KTDRQSKDSM KILALRVLNV ICARTSIENY 421 EVRFGDKNQL MIDVICYSTS TDDQLSQQAI KFMFLVLQRG TEIYNGLDLF KSVKKFRYLL 481 DSVSVEHVLH LKNQQLLEYL EIMAFRSLYS THYDFSLAEI TSACAFMSVM RDWIQVNNIK 541 ENKHLEVMER FQNCLKNCVI DCWDSPRFIN TISTMLNYHS SETACEDQYP PSLPLRVDFP 601 HVHRTSSKAN RFKQIVYDDW RTTLLCEARM PTYVKMTAIL RESVNSPEFS HEKVPKLSRM 661 HWNSDTLPAV LQVMTVSLSD SHSKEDCYHG FHVGMSVLNW VYRNVFIVRG AAETKEFPFL 721 QHSRTEMDDT LGNRTEFRAT LDAYYKTVQS SADEKLEQLL TPTINLMLAA MAHDFKTSTE 781 NILEIIVYIT VLFTLSPISA LKLLNVLLRC LLDPKLIESL TSGRVFYFAS SNTNNFETND 841 EFLIEALKCD GDKYFNRWGD DMEVEKIKMG AMKEEILEQL EPLITQSLKC FRYRGKNEKK 901 QVLQIMICLM NHKLKLSDAD PTECLMKFAV STFARPEECA HSELFDTLMH FLATATRFQK 961 DESYQKPMIA ATMLMKSIEK LSNPGIVNAM KAVSFALFNG RYEFTDIEEF LHTSHSTWKL 1021 CMQVAPTETL YTLSLLLEKN EFGMKNQKLF WSVLREWTPH DSLKVEVPFS AIALPVSLIT 1081 NYLTSDKVNI FEIINQWLDD PVPENTLKEC TIIVALYFTL NNGDRAIDWK RYFEHCFSKF 1141 NNTKATHLLS KYADSSLLPN RVEESDLDEE LTGNDSVDID NLLEKHHCGS IQDLCHSIIC 1201 SGETSVLDFL NLIEREYPND DSIWGFLLAE FRRLDDDLEH RLDITFRLQD LVDALSERFN 1261 SNMFLKLLGN TNLAGIHLNN LSIPKIKLMF DQVSFDYTED GYVLRGIRNL LVSPRMLHFI 1321 SEDELETAQM FLNLAEKVAG IMTTDALIAT YSQYRIDFQV KCDMDEAKVK EIKKFALSMF 1381 AFCQEAHLRH SKRFSKTLSA CFRHPILNSM FNIPLVAMKC FNWVPVVEIL RKPTITCLPP 1441 TGHVCDVMVL EDMKRRLSKV GLVTNAQFEV LFTTMQAVIA HTVIGPEKMH HDEKDVLERE 1501 ARSCNALQLY IATILTSLKY PNGGDPSSGF VLKSPYISEL FLQSTEFAHL CNLKSVWKCE 1561 PRTAFTTPLE RHDQKSWTHC DGKTNLYGIC QTPLFSLWQL CGMMPIEFRQ HANYHRIDHS 1621 ASNYFLTSAT NIDTISNVKQ LMNIFEYWYS QGIGELGDTL LHSILHTILY LSDFFDDPDL 1681 HKAVLRTTSI IYRHQYDQNS VLSSFVHAMF FKSIAVLGAD VHGTEFKPGE PESIALKLVS 1741 
SGLSNDIKTV RIYTLAGVLY LVQSDSYESF ISSIDILSAY LEKYLKKLAN GSGRVESDES 1801 QFALALIIKL METPMRLKQD KKTILKLLLA SMRVRRERFI IELIAEGIEQ LLCRSNEFNN 1861 EVINFVLVGV DSGDATPFPA DNEYYCRAVY RILMVAATRE KVANDEVSMT RIYNALQIIG 1921 FDMLSRGETA PAISRTLPFF SICVNGVETT ISKYIERFVI NGNKKDRRFV STLINQIVET 1981 AATSKKWSVE LKLYRERLKS KSVANADDNQ WLLNILCNKI VE //