LOCUS AEE82164.1 855 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana U3 ribonucleoprotein (Utp) family protein protein. ACCESSION CP002687-490 PROTEIN_ID AEE82164.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G02400" /gene_synonym="T14P8.20" /gene_synonym="T14P8_20" /inference="Similar to RNA sequence, EST:INSD:ES116689.1,INSD:ES175382.1,INSD:ES072911.1, INSD:EH919198.1,INSD:BP780528.1,INSD:DR373976.1, INSD:ES045361.1,INSD:EG518871.1,INSD:AV805633.1, INSD:EH808016.1,INSD:ES094169.1,INSD:AV799621.1, INSD:EH957853.1,INSD:AV800412.1,INSD:EH812533.1, INSD:BP798110.1,INSD:EG513493.1,INSD:BP786924.1, INSD:EG518870.1,INSD:ES214991.1,INSD:ES124562.1, INSD:AV782454.1,INSD:EG513494.1,INSD:EL062559.1, INSD:EH883795.1" /inference="Similar to RNA sequence, mRNA:INSD:AK230231.1" /note="U3 ribonucleoprotein (Utp) family protein; FUNCTIONS IN: molecular_function unknown; INVOLVED IN: rRNA processing; LOCATED IN: small-subunit processome; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Small-subunit processome, Utp14 (InterPro:IPR006709); BEST Arabidopsis thaliana protein match is: U3 ribonucleoprotein (Utp) family protein (TAIR:AT5G08600.2); Has 7468 Blast hits to 4514 proteins in 366 species: Archae - 14; Bacteria - 554; Metazoa - 2650; Fungi - 816; Plants - 392; Viruses - 146; Other Eukaryotes - 2896 (source: NCBI BLink)." /db_xref="TAIR:AT4G02400" /db_xref="Araport:AT4G02400" intron_pos 90:0 (1/10) intron_pos 133:1 (2/10) intron_pos 294:0 (3/10) intron_pos 367:0 (4/10) intron_pos 472:0 (5/10) intron_pos 575:0 (6/10) intron_pos 602:0 (7/10) intron_pos 628:0 (8/10) intron_pos 755:0 (9/10) intron_pos 799:0 (10/10) BEGIN 1 MGEKRKSTSK TLAKNKKRKG PHLPNSILKT IANEKRPLNS DEDDDEIDSD DENVDLYEYE 61 EGVPEEESKK NNRYDRVDNY DYELPEDFED ENVESDDDED GGNSENEEGE GDDDRHTRML 121 QGLTGMPSAA FQEESKKRPV LYTEAYPESE FNPTRDVLEG KGLISVEDLL APLEGKPGFN 181 DLNKRINRMQ KDTQSVVHAP LPKPERERLE RKAVKGLVEK DFNKWVHLVK RNREAPTVYF 241 NQPVNVGYST VGAIASEFQP RTEFEKKMAS VLKDNELGEA HKEDGAKLLE LNEVSMEDHI 301 KYRDHIAKMR SLLFRHELKS KRIKKIKSKT YHRLKGKDLK KSAMGALMDP EMAKEEAIKQ 361 ETRRVEERMT LKHKNTGKWA KRMLSRGLTE RYDGTRAAIS EQLQINATLS RKMNSTNDGS 421 SSDESDDEEE LSCGSDQDTP SKLIAKAREK TLKTMEDDDV PNSGLLSLPF MARAMKKKNE 481 EANEEAKRAF GEYKELENFG GEDNPKKSAD VSGRRVFGAT SKVEAPKESK KDSDNFYDNS 541 DSDNDMEGIE NNDLGAVGDT ASPARNTGAI TETEKCCGDV ENPASKTTFD VALFASGSWK 601 KMKGSQNAES KKAPKTRVPI SKGQDKKESR DEESEDSESE AEQMVDGILT SASKETYEIP 661 SQAELIQRAF AGDDVVEEFE KDKQEVLNQE VPEPEKPVLV PGWGQWTNVQ KKRGLPSWMV 721 REHEDANKKR KLDLKTRKDY RLRNVIISEK VDKKADKLHT TTLPFPYTSK EVFEHSMRMP 781 IGPEFNPATI VGALNRPEVV KKAGVIIKPV KFEEVNPNEK ADDENPRSHQ KQRPKKGSKT 841 SKGQGKNKSK LKTKA //