LOCUS AEC05810.1 1253 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana hypothetical protein protein. ACCESSION CP002685-595 PROTEIN_ID AEC05810.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 19698289) AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D., Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V., Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L., Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L., Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H., Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D., Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and Venter,J.C. TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 761-768 (1999) PUBMED 10617197 REFERENCE 2 (bases 1 to 19698289) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 19698289) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="2" /ecotype="Columbia" protein /locus_tag="AT2G04235" /inference="Similar to RNA sequence, EST:INSD:ES029991.1,INSD:ES025441.1,INSD:AU238755.1" /inference="similar to RNA sequence, mRNA:INSD:BT003691.1,INSD:AK117915.1" /note="unknown protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: cellular_component unknown; EXPRESSED IN: cultured cell; Has 713 Blast hits to 345 proteins in 122 species: Archae - 2; Bacteria - 262; Metazoa - 138; Fungi - 55; Plants - 39; Viruses - 0; Other Eukaryotes - 217 (source: NCBI BLink)." /db_xref="Araport:AT2G04235" /db_xref="TAIR:AT2G04235" intron_pos 783:0 (1/13) intron_pos 841:0 (2/13) intron_pos 865:0 (3/13) intron_pos 892:0 (4/13) intron_pos 904:2 (5/13) intron_pos 934:0 (6/13) intron_pos 976:0 (7/13) intron_pos 1042:0 (8/13) intron_pos 1068:2 (9/13) intron_pos 1092:0 (10/13) intron_pos 1127:0 (11/13) intron_pos 1162:1 (12/13) intron_pos 1191:2 (13/13) BEGIN 1 MASEKPEDPM NNTAGIGTDE ESIAQRRKRL RRVSFADREI TSVHIFNRDE DYETPPNTSA 61 AKPQNGGDTS EPDEDNKVIR FFGELSDRED TDGDGDGEYE PILDKSFLRP KYSPSSGGST 121 VGSATSDNGT LQLLCEFRVL FFEFLAESIL FSLPEDNFFG PVSSHFINPG RLLDTPISEE 181 HHEMTMDSTA FSMHFRSLAR SESGDVRTPT SSHLLVEEKT PTEVTSRSDT GSAMVLTEPK 241 KLFPKSPVPV DKGSGGRDSN DMSIVGENSR RYDYGYLSPT LAALMGDESK ELLPEDNTVE 301 ARSPIDDFSS SLPNGCIPIG LQESGSQRYT KEASLSSSTI RRQSAFLVGM LPQSLSCVTP 361 SPTQGGSFMS RETRALVESL STIQKSKSRL GLIPPSPGSA LSQRIEKSKL QLSGHRFLTT 421 PSIGREEIGV LRDKHADIPI TNLEALLSKH DNRTPISEEK SMPDKCISGA LSHAVDTSDD 481 NRTPVPEEKG IPDQCISGAL SHAVDTSDDN KTPVPEEKGI PDQCSSGALN PAVDTSDDNR 541 TPVQEKKGLP DQCSSGALSP AVDTSDDRPP VSEKKGIPDQ HSCGALIPAV DISDVFARRS 601 PEGNTNSEIE GSLLCKQQQR NQAASTPEKF VSSPTNLSNA TTSASENFVP LQDQEQHSKD 661 IEKSETGDGN VTKEYASNCS MNTLSEKVDS LLAESSVLLT DTGFLNGSAQ QREKDSVRNK 721 KQNRTNISAA HILLKDNNPF KVHCETEVIS AEDFTAVAKE NLPSTSGSSS VDRSKNEASH 781 AKGPSRLKRK AEDVDCAARN CSPKVERSTK YISNSVMEHP DGNIDANDCR RVREQVNWVE 841 IPGKVSKEIN QMLAPLADKL NSRLICKLED ILTHMKKVHL CEMLCLQIQS QKVCDHLSGA 901 KTKRRVESRS LLCKLAYDKA KLELLHLKKE IMMKKFQAVS TGVQTSETLR LNCANFLRQH 961 GFRSTGLLNP DQAQEVIITG KRAEITQEIK EIDSKIKNLI QCFTACDTMT GPQPAYADTI 1021 MIAEETLKKR MSCRSLRQDI LIWKVDSLGE WNDCQSIVLN YSGVFNQRLT LKPGHPSCVL 1081 VSNSLSDTFV KHFPEMNVSI AFNSMFNAED SRRYIGGSNT LLEITQKTSL LLHNLLDVAE 1141 EFHLAQMNIP NLVQGNFDSP SAEQLHLQIS FLDCTNLRKL SVILDVTCLI HGKYPSDVVP 1201 CEFRKVSGTK RDGVVSKQLK KEIESTIDDV GVGYPRILRL CRCISKALQS EKR //