LOCUS       CCD64612.1 1147 aa PRT CON 06-FEB-2024
DEFINITION  Caenorhabditis elegans Sex-determining transformer protein 2
            protein.
ACCESSION   BX284602-2498
PROTEIN_ID  CCD64612.1
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae;
            Peloderinae; Caenorhabditis.
REFERENCE   1 (bases 1 to 15279421)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  JOURNAL   Submitted (04-FEB-2024) to the INSDC. WormBase Group, European
            Bioinformatics Institute, Cambridge, CB10 1SA, UK. Email:
            help@wormbase.org
REFERENCE   2 (bases 1 to 15279421)
  AUTHORS   Sulson J.E., Waterston R.
  JOURNAL   Submitted (03-MAR-2003) to the INSDC. Nematode Sequencing
            Project: Sanger Institute, Hinxton, Cambridge CB10 1SA, UK and
            The Genome Institute at Washington University, St. Louis, MO
            63110, USA.
REFERENCE   3 (bases 1 to 15279421)
  AUTHORS   Sulson J.E., Waterston R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282(5396), 2012-2018(1998).
COMMENT     Annotated features correspond to WormBase release WS292.
            Protein-coding gene structures below are the result of integration
            and manual review of the following types of data: ab initio
            predictions by Genefinder (P. Green and L. Hillier, pers. comm.);
            alignments to published proteins and cDNAs; genome sequence
            conservation with other nematodes (e.g. to C. briggsae using WABA:
            Genome Res. 2000. 10:1115-1125); sequence features (such as
            trans-splice and polyA sites). Sources of data: large-scale EST
            projects of Yuji Kohara
            (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html); ORFeome
            cloning project (http://worfdb.dfci.harvard.edu); RST large-scale
            sequencing project (Genome Res. 2009. 19:2334-2342); IST library
            (Science. 2004. 303:540-3); RT-PCR EST set (Ewing B. Green P. 2010
            Unpublished); UTRome EST data submission (UTRome v1 Mangone M.
            Piano F. 2009); TEC-RED data (PNAS 2004. 101:1650-1655); RNA Deep
            sequencing data (454 read clusters - Makedonka Mitreva,
            unpublished; Illumina sequence data, Genome Res. 2009. 19:657-66);
            Numerous data sets from the modENCODE project (Science. 2010.
            330:1775-87); Individual C. elegans Nucleotide Database
            submissions; Personal communications with C. elegans researchers;
            Non-Coding gene structures below are derived using the following
            methods and data: ab initio prediction of tRNAs by tRNAscan-SE
            (Nucl. Acids. Res., 25, 955-964); integration and appraisal of
            miRNAs from miRBase (http://www.mirbase.org); integration and
            appraisal of RFAM predictions (rfam.sanger.ac.uk); 21U-RNAs (Cell.
            2006. 127:1193-1207); modENCODE data (Science. 2010. 330:1775-87);
            manual curation of novel published ncRNAs from the literature.
FEATURES             Qualifiers
     source          /organism="Caenorhabditis elegans"
                     /chromosome="II"
                     /strain="Bristol N2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:6239"
     protein         /transl_table=1
                     /gene="tra-2"
                     /locus_tag="CELE_C15F1.3"
                     /standard_name="C15F1.3b"
                     /note="Partially confirmed by transcript evidence"
                     /db_xref="WormBase:WBGene00006605"
     intron_pos      62:0 (1/20)
     intron_pos      99:2 (2/20)
     intron_pos      132:0 (3/20)
     intron_pos      197:1 (4/20)
     intron_pos      235:0 (5/20)
     intron_pos      279:0 (6/20)
     intron_pos      506:0 (7/20)
     intron_pos      551:1 (8/20)
     intron_pos      639:1 (9/20)
     intron_pos      677:2 (10/20)
     intron_pos      710:0 (11/20)
     intron_pos      736:0 (12/20)
     intron_pos      767:2 (13/20)
     intron_pos      801:1 (14/20)
     intron_pos      826:0 (15/20)
     intron_pos      886:0 (16/20)
     intron_pos      917:2 (17/20)
     intron_pos      954:0 (18/20)
     intron_pos      988:1 (19/20)
     intron_pos      1044:1 (20/20)
BEGIN
        1 MKLKYNKLLV SVVIVTFVTF GLLLAECFGK SIDYQEKSIF PSFVSQGFFE TRTNNEEYII
       61 EKIAQTQENG VDMRSTLHFT QHGYLLNNIS NLKIKFRQKT YTLNDVCFKP HITIFQQSSS
      121 SDQNEYPHYI QRLLLEMQRL SPCLIVTPLN CFYDIYRIHG EISNWNKNTD FLNRRLRNSY
      181 IEAIGENDER PYVKSNYGPS LIKSWADHMF DLPSKSFTNS TKDALFQKIK LWLLSIEPRQ
      241 KTCAASIHSC DTPLDSEHYF NICTDMQSVD NFAEKKTKFK LEDVDEEFAM NLDCVDDQEQ
      301 FIEWMQELEI RKMYSHVTEK PDYPNVVNQT CDKIFHDLNS TGIEFFDGSR SFSSTKSQFD
      361 TMQTEIVLLT PEMLLSAMQH SDFVNGFESI WTIEKAEELI HEFRLALKEE TEKFKENRMS
      421 KMIRVTSRVL DNTVTTKLQS FSEKQTIHFV VNVHSLIVIL FTIFVWSGAP LRSAFMFFVR
      481 DALTCLLFCF VCSTDGVIVL DTELIKYIIV LTLANLYFTT RSSFCTERLS RCIQREKRFP
      541 INSNFASLIT VDTMTDSRQI QYFLSTVTKY QAAQDSYSNE LFERFPKNWG CTSILIFPIV
      601 FVYWYFIDSN FDKICVSVLP SFCLAAGEEL FAKNMFWKER EAMQAKQRLE NEEQAESITG
      661 SSLEKLFAGN KPVSNTDKAN IVKKSSIIRN QKPCLQDLSP GTYDVSNFMK YPHQASRIFR
      721 EKIIGLYLRI LKLRTLGVIL CIPAILLIVI SIGLLFIPVK RETLHTDSKQ DDIFIEFEIF
      781 NFSTNWKIVN QNLKQFSEDI ESIGTLYTIS NWQKSFERFE QETNKNASAE WNILFKWIND
      841 EPINSAVTLF SEKSSGNQTI ANPFKFRLRY GFDAKNETTV IEIVQKIDEL LSKCSKNLSP
      901 KAVGVLYEHY HRIAVVWNLF AFNQLTTAGI FIILLSIITF IFAITPTIKA TFLFSLLVVG
      961 TQIEVAALVH LFSLDHHQIY TNLALFAGFL AAWDPFCALL RYRRRILYKS ETRRTPELAS
     1021 KRRVLLPIVA TADIAQFFVL LITEFPKREV TCQNWWPFSS SRSSICSFFF HSHPSSIILA
     1081 TLSIFRQYIY LFSLFSFSHS SSDVSNKKSR GYCRCMSCFS IFFQRVWPQE HWNTANNEKG
     1141 TQKESGC
//