LOCUS AED90339.1 1467 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana HEAT repeat-containing protein protein. ACCESSION CP002688-74 PROTEIN_ID AED90339.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 26975502) AUTHORS Tabata,S., Kaneko,T., Nakamura,Y., Kotani,H., Kato,T., Asamizu,E., Miyajima,N., Sasamoto,S., Kimura,T., Hosouchi,T., Kawashima,K., Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S., Nakazaki,N., Naruo,K., Okumura,S., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A., Yamada,M., Yasuda,M., Sato,S., de la Bastide,M., Huang,E., Spiegel,L., Gnoj,L., O'Shaughnessy,A., Preston,R., Habermann,K., Murray,J., Johnson,D., Rohlfing,T., Nelson,J., Stoneking,T., Pepin,K., Spieth,J., Sekhon,M., Armstrong,J., Becker,M., Belter,E., Cordum,H., Cordes,M., Courtney,L., Courtney,W., Dante,M., Du,H., Edwards,J., Fryman,J., Haakensen,B., Lamar,E., Latreille,P., Leonard,S., Meyer,R., Mulvaney,E., Ozersky,P., Riley,A., Strowmatt,C., Wagner-McPherson,C., Wollam,A., Yoakum,M., Bell,M., Dedhia,N., Parnell,L., Shah,R., Rodriguez,M., See,L.H., Vil,D., Baker,J., Kirchoff,K., Toth,K., King,L., Bahret,A., Miller,B., Marra,M., Martienssen,R., McCombie,W.R., Wilson,R.K., Murphy,G., Bancroft,I., Volckaert,G., Wambutt,R., Dusterhoft,A., Stiekema,W., Pohl,T., Entian,K.D., Terryn,N., Hartley,N., Bent,E., Johnson,S., Langham,S.A., McCullagh,B., Robben,J., Grymonprez,B., Zimmermann,W., Ramsperger,U., Wedler,H., Balke,K., Wedler,E., Peters,S., van Staveren,M., Dirkse,W., Mooijman,P., Lankhorst,R.K., Weitzenegger,T., Bothe,G., Rose,M., Hauf,J., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Villarroel,R., Gielen,J., Ardiles,W., Bents,O., Lemcke,K., Kolesov,G., Mayer,K., Rudd,S., Schoof,H., Schueller,C., Zaccaria,P., Mewes,H.W., Bevan,M. and Fransz,P. CONSRTM Kazusa DNA Research Institute; Cold Spring Harbor and Washington University in St Louis Sequencing Consortium; European Union Arabidopsis Genome Sequencing Consortium TITLE Sequence and analysis of chromosome 5 of the plant Arabidopsis thaliana JOURNAL Nature 408 (6814), 823-826 (2000) PUBMED 11130714 REFERENCE 2 (bases 1 to 26975502) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 26975502) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="5" /ecotype="Columbia" protein /gene="ESP4" /locus_tag="AT5G01400" /gene_synonym="ENHANCED SILENCING PHENOTYPE 4" /gene_synonym="T10O8.110" /gene_synonym="T10O8_110" /inference="Similar to RNA sequence, EST:INSD:ES071419.1,INSD:ES072101.1,INSD:EH859230.1, INSD:EH825985.1,INSD:AV545988.1,INSD:AV529152.1, INSD:AV546483.1,INSD:DR383177.1,INSD:DR275772.1, INSD:DR275773.1,INSD:T45463.1,INSD:AA394514.1, INSD:ES017155.1,INSD:AV539512.1" /note="ENHANCED SILENCING PHENOTYPE 4 (ESP4); FUNCTIONS IN: binding; INVOLVED IN: posttranscriptional gene silencing by RNA, RNA processing; LOCATED IN: mRNA cleavage and polyadenylation specificity factor complex; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Symplekin tight junction protein C-terminal (InterPro:IPR022075), Armadillo-type fold (InterPro:IPR016024), Protein of unknown function DUF3453 (InterPro:IPR021850); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G27595.1); Has 1807 Blast hits to 1807 proteins in 277 species: Archae - 0; Bacteria - 0; Metazoa - 736; Fungi - 347; Plants - 385; Viruses - 0; Other Eukaryotes - 339 (source: NCBI BLink)." /db_xref="TAIR:AT5G01400" /db_xref="Araport:AT5G01400" intron_pos 68:2 (1/26) intron_pos 122:0 (2/26) intron_pos 153:0 (3/26) intron_pos 178:1 (4/26) intron_pos 234:2 (5/26) intron_pos 292:0 (6/26) intron_pos 332:0 (7/26) intron_pos 517:0 (8/26) intron_pos 699:0 (9/26) intron_pos 722:0 (10/26) intron_pos 761:0 (11/26) intron_pos 842:0 (12/26) intron_pos 858:0 (13/26) intron_pos 891:0 (14/26) intron_pos 957:2 (15/26) intron_pos 969:0 (16/26) intron_pos 1005:0 (17/26) intron_pos 1033:0 (18/26) intron_pos 1049:0 (19/26) intron_pos 1074:0 (20/26) intron_pos 1106:0 (21/26) intron_pos 1133:0 (22/26) intron_pos 1157:0 (23/26) intron_pos 1172:0 (24/26) intron_pos 1200:0 (25/26) intron_pos 1234:2 (26/26) BEGIN 1 MASYSRARLK DLANSAKSAT ELPPKLQRLR YMRRDLQKDD SVFPTELLPH LFDLLSDQFG 61 AVRKFVAEIL GEIGLKYVEL IPEIVPLLIK SLEDETPAVA RQVIACGADL FRSTLERVAV 121 QGLHSSELND LLESSWTWLI KFKDEICSVA FKQGNSGVKL CAMKFVEALI LLYTPHEGIE 181 ADFNISILRG GHPVLKIGDL SIEASQKLGL LLDQLRHPAA KSLNSSTIIV LINSLSSVAK 241 KRPAYCGRIL PVLLSLDPLS FLKGVYAAAT NLALKTVFLS CLKCTHPAAA PDRLTSALKE 301 IEGGGQAAKA KDLFYKTNGS IQDKDSVEDT KVSVEENPLC ASSDVAESNL SRKRSGSEYN 361 IDLNGDASDG KRARITPSVS EESTDGLNGN DGVSLPRVAS TSTGPSDSRG VSDSGPAQQL 421 VGLFGTLVSQ GEKAIGSLEI LISSISADLL TDVVMANMHN IPPNCSSYAD GTDELVMNMC 481 IVGSDAQIKY PPSFVAGVLS LSTAFPPIAA LINPHNEDEE VYSVHVDQQM FPAEDARTPP 541 GLLATCDTSF PENEESNTVS PQNVHYIGNR ESGIPGLESS AQHDGSGALV TNVLSSTNVE 601 AASKNQNASF SGKLLVDVIP SMSVDKLEEF SPKAVGTVAS ASQFVLPKIS APVVDLSDEE 661 KDSLQKLVFL RIVEAYKQIS MSGGSQLRFS LLAHLGVEFP SELDPWKILQ EHVLSDYLNH 721 EGHELTVRVL YRLYGEAEAE QDFFSSTTAA SAYESFLLTV AEALRDSFPP SDKSLSKLLG 781 DSPHLPKSVL MLLESFCCPG SGEVEKDLQH GDRVTQGLSA VWSLILMRPG IRNDCLNIAL 841 QSAVHHLEEI RMKAIRLVAN KLYSLSFITE QIEEFAKDRL FSVVSDDCDK MDLDLKSPPN 901 KPQHSISGMS METPSEATSS STSVTEAQRC LSLYFALCTK VLRIFTILRL MTNLVFNIYK 961 NASDPVKQAI HLQIPILVRT MGSSSELLKI IADPPSGSDN LLIQVLQTLT EGPTPSSELI 1021 LTIRKLFDTR IKDVEILFPI LPFLPRDDVL RIFPHMVNLP MEKFQVALSR VLQGSSQSGP 1081 VLSPSEALIA IHSIDPARDG IPLKQVTDAC NTCFAQRQTF TQQVLAGVLN QLVQQIPLPM 1141 LFMRTVLQAI GAFPALSDFI LEILSRLVSK QIWKYPKLWV GFLKCTQTTQ PQSYKVLLQL 1201 PPLQLGNALT KIPALRAPLT AHASQPEIQS SLPRSTLAVL GLVPDSQGTQ TSQVQANETQ 1261 TSQEQQQQQA SEPQQTSQSQ QVSVPLSHSQ VDHQEPSQVV ASQSQSSPIG TVQSAMSQSQ 1321 NSPIDTGRSE MSQSQNSPID TGRSEMSQSQ NSPIDTGRSE MSQSQNSPID TGRSEMSESQ 1381 SSPIGQSQSS PIGTGQSDMS QTPQVSDSSA PEPTSHTRTS DPQASSQTLR DDDEKIDDTA 1441 TSENEVTEIE KSKESSEEEE EEEEEEE //