LOCUS AEC08963.1 1280 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana ARM repeat superfamily protein protein.
ACCESSION CP002685-4903
PROTEIN_ID AEC08963.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /locus_tag="AT2G34357"
/inference="Similar to RNA sequence,
EST:INSD:AU238655.1,INSD:AA585916.1,INSD:T04701.1,
INSD:EH800324.1,INSD:EH868005.1,INSD:ES157743.1,
INSD:EH919844.1,INSD:EL999819.1,INSD:ES197579.1,
INSD:ES100417.1,INSD:EL311830.1,INSD:ES185332.1,
INSD:ES160955.1,INSD:AU229856.1,INSD:EL024033.1"
/inference="similar to RNA sequence, mRNA:INSD:AK229546.1"
/note="ARM repeat superfamily protein; FUNCTIONS IN:
binding; INVOLVED IN: biological_process unknown; LOCATED
IN: cellular_component unknown; EXPRESSED IN: 20 plant
structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Armadillo-type fold
(InterPro:IPR016024), Domain of unknown function, NUC173
(InterPro:IPR012978); BEST Arabidopsis thaliana protein
match is: ARM repeat superfamily protein
(TAIR:AT4G23540.1); Has 35333 Blast hits to 34131 proteins
in 2444 species: Archae - 798; Bacteria - 22429; Metazoa -
974; Fungi - 991; Plants - 531; Viruses - 0; Other
Eukaryotes - 9610 (source: NCBI BLink)."
/db_xref="Araport:AT2G34357"
/db_xref="TAIR:AT2G34357"
intron_pos 175:0 (1/17)
intron_pos 348:1 (2/17)
intron_pos 434:1 (3/17)
intron_pos 465:0 (4/17)
intron_pos 535:0 (5/17)
intron_pos 669:0 (6/17)
intron_pos 727:2 (7/17)
intron_pos 759:0 (8/17)
intron_pos 778:0 (9/17)
intron_pos 821:0 (10/17)
intron_pos 844:0 (11/17)
intron_pos 882:0 (12/17)
intron_pos 936:0 (13/17)
intron_pos 979:0 (14/17)
intron_pos 1013:0 (15/17)
intron_pos 1039:2 (16/17)
intron_pos 1086:2 (17/17)
BEGIN
1 MELLCDDIGT SMCLTPSEPD LPVSEDFGEY MRSRLSQSKR PDHEHLCAVI EELSKTLAED
61 NHRRTPVAYF ACTCRSLDSL FSAHAEPPVD VVQPHIVILS LVFPKVSAGV LKRDGLALRL
121 VLNVLRLKSA TPECLISGLK CLVHLLTTVE SIMVNEGSDS YNILLNFVTH SDGKVRKLAS
181 SCLRDVLQKS HGTKAWQSVS GAITEMFQNY LDLAHKSEVG STEGARGAKQ VLYILSTLKE
241 CLALMSKKHI ATLIEGFKVL MILRDPYITR PVIDSLNAVC LNPTSEVPVE ALLEVLSLAA
301 GLFSGHETSA DAMTFTARLL KVGMTRSFTL NRDLCVVKLP SVFNGLNDII ASEHEEAIFA
361 ATDALKSLIF SCIDESLIRE GVNEIRNSNL NVRKPSPTVI EKLCATVESL LDYKYHAVWD
421 MAFQVVSAMF DKLGEHSAYF MRNTLQGLSD MQDLPDEGFP YRKQLHECVG SALGAMGPET
481 FLSIVRLNLE ANDLSEVKVW LFPILKQYTV GGRLSFFTEA IFSMVETMSH KAQKLKLQGL
541 PVASRSVDSL VYSLWALLPS FCNYPVDTVE SFADLGRILC GVLQTQAETH GIICASLNIL
601 IQQNKEVVEG KEVPTNDASP AMQRATARYD SQHAAANLKV LRLCAPKLLD VLSRIFHECS
661 KDDGGSLQSA IGNLASIAEK KTVSKLLFKT LQELLEATKT AIAQDESPVS GMDVDNTADK
721 NSSSNLRARL FDLLVSLLPG LDGQEVDTIF SSLKPAMQDS KGLIQKKAYK VLSVILKSSD
781 GFVSKNLEEL LVLMHNICHV SAKRHKLDCL YFLLAHASRT DDLKERKDIV SSFLPEVILA
841 LKEVNKKTRN RAYDVLVQIG HAYADEENGG DNEKLHGYFD MVVGCLAGEK PQMISAAVKG
901 VARLTYEFSD LISSAYNLLP STFLLLQRKN KEITKANLGL LKVLVAKSPV EGLHANLKSM
961 VEGLLKWPEG TKNLFKAKVR LLLEMLIKKC GTEAVKSVMP EEHMKLLTNI RKIKERKEKK
1021 YAAGSDISKS QHSKDTSSKV SRWNDTKIFS DVYADSEDSD GDDMDAESHG RSKASSLLKS
1081 KASALRSKKS RNQSHLEVDE SDDEPLDLMD QHKTRLALRS SELRKRKADS DEEAEFDVEG
1141 RLVIREGERS KRKELSDADS DAKSSKGSRF SGNSSKKNQK RMKTSESGYA YTGKEYASKK
1201 ASGDLKKKDK LEPYAYWPLD RKMMSRRPEQ RAVAVRGMSS VVKMAKKMEG KSAAEALATT
1261 KFKKFKRSGQ KKSAGKKKNK
//