LOCUS AEE85930.1 2730 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana nucleoporin protein. ACCESSION CP002687-5764 PROTEIN_ID AEE85930.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G31570" /gene_synonym="F28M20.240" /gene_synonym="F28M20_240" /inference="Similar to RNA sequence, EST:INSD:ES210621.1,INSD:ES110302.1,INSD:EG492261.1, INSD:ES176369.1,INSD:ES063286.1,INSD:EH812013.1, INSD:EL995340.1,INSD:ES213790.1,INSD:ES212052.1, INSD:EL052156.1,INSD:EL320164.1,INSD:Z17734.1, INSD:EL146659.1,INSD:ES111873.1,INSD:EL029852.1, INSD:EH969723.1,INSD:EG492264.1" /note="CONTAINS InterPro DOMAIN/s: Prefoldin (InterPro:IPR009053); BEST Arabidopsis thaliana protein match is: unknown protein (TAIR:AT1G24460.1); Has 194354 Blast hits to 66887 proteins in 3244 species: Archae - 3688; Bacteria - 38556; Metazoa - 84828; Fungi - 17265; Plants - 10589; Viruses - 805; Other Eukaryotes - 38623 (source: NCBI BLink)." /db_xref="TAIR:AT4G31570" /db_xref="Araport:AT4G31570" intron_pos 18:0 (1/7) intron_pos 240:1 (2/7) intron_pos 2137:0 (3/7) intron_pos 2399:1 (4/7) intron_pos 2618:0 (5/7) intron_pos 2663:1 (6/7) intron_pos 2697:2 (7/7) BEGIN 1 MDKKKNRADP LAAGRQKLQQ FRQKKADKGT DQKKDSKGST SQGKSSKKSN KSEKHERKPD 61 TSAVSDEAQA PSPVTVGGAT SHVNVAEEVV DSPQTSSDTK AHEYVSVHGS SSEPDALQPG 121 HTTSNDGSEA RKEVVNSEND ISKSLSTEEE NVKSINSGVA GTVDSLISDP ADSEKGVTHD 181 DASNVDGIFA ASGNIAEGEG VEVEGGSGNV EKPHQPSSLQ EYIPDVSLIR ARGDQVTDVG 241 EMQEEDMEQF SELSAKAGVD KIATEERQTS YPAVVDSSAS PSHFSEGSSV AFDTVELEGI 301 NGNFRSQQIR EAAELNEEKP ETSIDFPNNR DHVLSAEPEE SSVAEMASQL QLPESVSISG 361 VLSHEETRKI DTLNLSAELT SAHVHEGRSV SFLQLMDIVK GLGQDEYQIL CNAREAASST 421 EPGTSSLERL REELFVSSTM EDILHVQLTE QSHLQIEFDH QHNQFVAEIS QLRASYSAVT 481 ERNDSLAEEL SECQSKLYAA TSSNTNLENQ LLATEAQVED FTAKMNELQL SLEKSLLDLS 541 ETKEKFINLQ VENDTLVAVI SSMNDEKKEL IEEKESKNYE IKHLSSELCN CKNLAAILKA 601 EVEQFENTIG PLTDEKIHLV EEKYSLLGEA EKLQEELANC KTVVTLQEVE NSNMKETLSL 661 LTRQQTMFEE NNIHLREENE KAHLELSAHL ISETYLLSEY SNLKEGYTLL NNKLLKFQGE 721 KEHLVEENDK LTQELLTLQE HMSTVEEERT HLEVELREAI ARLDKLAEEN TSLTSSIMVE 781 KARMVDNGSA DVSGLINQEI SEKLGRSSEI GVSKQSASFL ENTQYTNLEE VREYTSEFSA 841 LMKNLEKGEK MVQNLEEAIK QILTDSSVSK SSDKGATPAV SKLIQAFESK RKPEEPESEN 901 AQLTDDLSEA DQFVSVNVQI RNLRGLLDQL LLNARKAGIQ FNQLNDDRTS TNQRLEELNV 961 EFASHQDHIN VLEADTIESK VSFEALKHYS YELQHKNHDL ELLCDSLKLR NDNISVENTE 1021 LNKKLNYCSL RIDELEIQLE NLQQNLTSFL STMEEQLVAL QDESERAMMV EHELTSLMSE 1081 FGEAVVRLDD CLLRSGTSGA HTGLDMTKRI SGSVDVAVNV IEDLKEKLEA AYVKHESTSN 1141 KYEELKQSFN TLFEKNEFTA SSMQKVYADL TKLITESCGS AEMTSLEVEN VAVFDPFRDG 1201 SFENLLEAVR KILSERLELQ SVIDKLQSDL SSKSNDMEEM TQRSLDSTSL RELVEKVEGL 1261 LELESGVIFE SPSSQVEFLV SQLVQKFIEI EELANLLRKQ LEAKGNELME IEESLLHHKT 1321 KIAGLRESLT QAEESLVAVR SELQDKSNEL EQSEQRLLST REKLSIAVTK GKGLIVQRDN 1381 VKQSLAEASA KLQKCSEELN SKDARLVEVE KKLKTYIEAG ERVEALESEL SYIRNSATAL 1441 RESFLLKDSL LHRIEEILED LDLPEHFHAR DILEKVEWLA RSANGNSSRP SGWDQKSSDG 1501 GAGFVLSEPW REDVQTGTSS EDDLRIKFEE LKGKFYGLAE QNEMLEQSLM ERNTLVQRWE 1561 KLLENIDIPP QLHSMEVENK IEWLASTITE ATHDRDNLQQ KIDNLEVYCQ SVTTDLEVSQ 1621 KQVGDVEGNL QSCVSERVNL SERLESLIGD HESLSARGIH LEVENEKLQN QVKDLHEKLV 1681 EKLGNEEHFQ TIEGDLLSLR YMIDDVIQED GLQDLALASN SENLDGVLRK LIDYYKNLVK 1741 SSLPGETDDN VCETRPSDAD VRSGESLGAH GATSHGQHFE LSDSNVVEAT SRDIAVVETP 1801 DVASLTKDLD QALHVQKLTR EERDLYMAKQ QSLVAENEAL DKKIIELQEF LKQEEQKSAS 1861 VREKLNVAVR KGKALVQQRD SLKQTIEEVN AELGRLKSEI IKRDEKLLEN EKKFRELESY 1921 SVRVESLESE CQLLKIHSQE TEYLLQERSG NLSMTLNALN SIDIGDEGDI NDPVMKLQRI 1981 SQLFQTMSTT VTSAEQESRK SRRAAELLLA ELNEVQETND SLQEDLSKFT YEIQQLSREK 2041 DAAEAAKVEA ISRFENLSAV SNEEKNKLYA QLLSCGTSVN SLRKILAGTN SCLADIFIMD 2101 MEFLHHLKAN MELCAKKTGT DLSGLPQLST ENLVDKEIFA RLSAAWSNIN LHETSSGGNI 2161 AEICGSLSQN LDQFVVGVSH LEEKVSKHLA TWHDQINIVS NSIDTFFKSI GTGTDSEVAA 2221 LGERIALLHG ACSSVLVEIE RRKAELVGND DFNMSLHQVD EDFSSMESVR SMVNRLSSAV 2281 KELVVANAET LERNEKEMKV IIANLQRELH EKDIQNNRTC NELVGQVKEA QAGAKIFAED 2341 LQSASARMRD MQDQLGILVR ERDSMKERVK ELLAGQASHS ELQEKVTSLS DLLAAKDLEI 2401 EALMQALDEE ESQMEDLKLR VTELEQEVQQ KNLDLQKAEA SRGKISKKLS ITVDKFDELH 2461 HLSENLLAEI EKLQQQVQDR DTEVSFLRQE VTRCTNEALA ASQMGTKRDS EEIQTVLSWF 2521 DTIASLLGIE DSLSTDADSH INHYMETFEK RIASMLSEID ELRLVGQSKD VLLEGERSRV 2581 AELRQKEATL EKFLLEKESQ QDISTSSTSE IVEVEPLINK WTKTSIPSQV RSLRKGNMDQ 2641 VAISIDADQT DQSGSLEEDD DKDHSLRQES FLDSQDPSLT WSMVYGQTLF IHGSRSVVSC 2701 DRTLMRQPAL RLGIMLYWAI LHALLAAFVV //