LOCUS       AEE82164.1               855 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana U3 ribonucleoprotein (Utp) family
            protein.
ACCESSION   CP002687-490
PROTEIN_ID  AEE82164.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 18585056)
  AUTHORS   Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G.,
            Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N.,
            Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M.,
            Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R.,
            Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M.,
            Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M.,
            Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W.,
            Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L.,
            Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J.,
            Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I.,
            Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U.,
            Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van
            Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M.,
            Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M.,
            Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C.,
            Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J.,
            Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S.,
            Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A.,
            Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D.,
            Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de
            Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M.,
            Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S.,
            Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O.,
            Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S.,
            Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F.,
            Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T.,
            Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A.,
            Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D.,
            Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P.,
            Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W.,
            Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M.,
            Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E.,
            Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M.,
            Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G.,
            Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A.,
            Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N.,
            Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M.,
            Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S.,
            Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K.,
            Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C.,
            Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D.,
            Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A.,
            Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N.,
            Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M.,
            Martienssen,R. and McCombie,W.R.
  TITLE     Sequence and analysis of chromosome 4 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 769-777 (1999)
   PUBMED   10617198
REFERENCE   2  (bases 1 to 18585056)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 18585056)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="4"
                     /ecotype="Columbia"
     protein         /locus_tag="AT4G02400"
                     /gene_synonym="T14P8.20"
                     /gene_synonym="T14P8_20"
                     /inference="Similar to RNA sequence,
                     EST:INSD:ES116689.1,INSD:ES175382.1,INSD:ES072911.1,
                     INSD:EH919198.1,INSD:BP780528.1,INSD:DR373976.1,
                     INSD:ES045361.1,INSD:EG518871.1,INSD:AV805633.1,
                     INSD:EH808016.1,INSD:ES094169.1,INSD:AV799621.1,
                     INSD:EH957853.1,INSD:AV800412.1,INSD:EH812533.1,
                     INSD:BP798110.1,INSD:EG513493.1,INSD:BP786924.1,
                     INSD:EG518870.1,INSD:ES214991.1,INSD:ES124562.1,
                     INSD:AV782454.1,INSD:EG513494.1,INSD:EL062559.1,
                     INSD:EH883795.1"
                     /inference="Similar to RNA sequence, mRNA:INSD:AK230231.1"
                     /note="U3 ribonucleoprotein (Utp) family protein;
                     FUNCTIONS IN: molecular_function unknown; INVOLVED IN:
                     rRNA processing; LOCATED IN: small-subunit processome;
                     EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13
                     growth stages; CONTAINS InterPro DOMAIN/s: Small-subunit
                     processome, Utp14 (InterPro:IPR006709); BEST Arabidopsis
                     thaliana protein match is: U3 ribonucleoprotein (Utp)
                     family protein (TAIR:AT5G08600.2); Has 7468 Blast hits to
                     4514 proteins in 366 species: Archaea - 14; Bacteria - 554;
                     Metazoa - 2650; Fungi - 816; Plants - 392; Viruses - 146;
                     Other Eukaryotes - 2896 (source: NCBI BLink)."
                     /db_xref="TAIR:AT4G02400"
                     /db_xref="Araport:AT4G02400"
     intron_pos      90:0 (1/10)
     intron_pos      133:1 (2/10)
     intron_pos      294:0 (3/10)
     intron_pos      367:0 (4/10)
     intron_pos      472:0 (5/10)
     intron_pos      575:0 (6/10)
     intron_pos      602:0 (7/10)
     intron_pos      628:0 (8/10)
     intron_pos      755:0 (9/10)
     intron_pos      799:0 (10/10)
BEGIN
        1 MGEKRKSTSK TLAKNKKRKG PHLPNSILKT IANEKRPLNS DEDDDEIDSD DENVDLYEYE
       61 EGVPEEESKK NNRYDRVDNY DYELPEDFED ENVESDDDED GGNSENEEGE GDDDRHTRML
      121 QGLTGMPSAA FQEESKKRPV LYTEAYPESE FNPTRDVLEG KGLISVEDLL APLEGKPGFN
      181 DLNKRINRMQ KDTQSVVHAP LPKPERERLE RKAVKGLVEK DFNKWVHLVK RNREAPTVYF
      241 NQPVNVGYST VGAIASEFQP RTEFEKKMAS VLKDNELGEA HKEDGAKLLE LNEVSMEDHI
      301 KYRDHIAKMR SLLFRHELKS KRIKKIKSKT YHRLKGKDLK KSAMGALMDP EMAKEEAIKQ
      361 ETRRVEERMT LKHKNTGKWA KRMLSRGLTE RYDGTRAAIS EQLQINATLS RKMNSTNDGS
      421 SSDESDDEEE LSCGSDQDTP SKLIAKAREK TLKTMEDDDV PNSGLLSLPF MARAMKKKNE
      481 EANEEAKRAF GEYKELENFG GEDNPKKSAD VSGRRVFGAT SKVEAPKESK KDSDNFYDNS
      541 DSDNDMEGIE NNDLGAVGDT ASPARNTGAI TETEKCCGDV ENPASKTTFD VALFASGSWK
      601 KMKGSQNAES KKAPKTRVPI SKGQDKKESR DEESEDSESE AEQMVDGILT SASKETYEIP
      661 SQAELIQRAF AGDDVVEEFE KDKQEVLNQE VPEPEKPVLV PGWGQWTNVQ KKRGLPSWMV
      721 REHEDANKKR KLDLKTRKDY RLRNVIISEK VDKKADKLHT TTLPFPYTSK EVFEHSMRMP
      781 IGPEFNPATI VGALNRPEVV KKAGVIIKPV KFEEVNPNEK ADDENPRSHQ KQRPKKGSKT
      841 SKGQGKNKSK LKTKA
//