LOCUS AEC09054.1 596 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana AICARFT/IMPCHase bienzyme family
protein.
ACCESSION CP002685-5017
PROTEIN_ID AEC09054.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /locus_tag="AT2G35040"
/gene_synonym="F19I3.27"
/gene_synonym="F19I3_27"
/inference="similar to RNA sequence,
EST:INSD:W43391.1,INSD:EL279913.1,INSD:ES163703.1,
INSD:ES015315.1,INSD:EL318411.1,INSD:AV816466.1,
INSD:ES139601.1,INSD:ES169716.1,INSD:EL200436.1,
INSD:BE038398.1,INSD:BP800025.1,INSD:EL072692.1,
INSD:ES161053.1,INSD:BP784298.1,INSD:AV831546.1,
INSD:DR225460.1,INSD:BE038399.1,INSD:T42266.1,
INSD:EH838771.1,INSD:BE523620.1,INSD:ES193656.1,
INSD:EL297419.1,INSD:DR354345.1,INSD:EH938086.1,
INSD:BP584518.1,INSD:DR355708.1,INSD:AA042482.1,
INSD:EL131099.1,INSD:BP582904.1,INSD:ES023459.1,
INSD:EL328280.1,INSD:DR382224.1,INSD:EL291708.1,
INSD:EL273504.1,INSD:ES007419.1,INSD:EL284969.1,
INSD:BU634985.1,INSD:ES073104.1,INSD:EL253347.1"
/inference="similar to RNA sequence,
mRNA:INSD:AY133727.1,INSD:BX819257.1,INSD:AY091122.1"
/note="AICARFT/IMPCHase bienzyme family protein; FUNCTIONS
IN: phosphoribosylaminoimidazolecarboxamide
formyltransferase activity, IMP cyclohydrolase activity,
catalytic activity; INVOLVED IN: response to cold, purine
nucleotide biosynthetic process; LOCATED IN: stromule,
chloroplast, chloroplast stroma; EXPRESSED IN: 23 plant
structures; EXPRESSED DURING: 14 growth stages; CONTAINS
InterPro DOMAIN/s: AICARFT/IMPCHase bienzyme,
transformylase domain (InterPro:IPR013982),
AICARFT/IMPCHase bienzyme (InterPro:IPR002695), MGS-like
(InterPro:IPR011607); Has 10802 Blast hits to 10325
proteins in 2522 species: Archaea - 69; Bacteria - 5512;
Metazoa - 346; Fungi - 183; Plants - 62; Viruses - 6;
Other Eukaryotes - 4624 (source: NCBI BLink)."
/db_xref="Araport:AT2G35040"
/db_xref="TAIR:AT2G35040"
intron_pos 37:0 (1/11)
intron_pos 70:1 (2/11)
intron_pos 95:2 (3/11)
intron_pos 128:0 (4/11)
intron_pos 162:1 (5/11)
intron_pos 207:0 (6/11)
intron_pos 266:1 (7/11)
intron_pos 319:0 (8/11)
intron_pos 389:0 (9/11)
intron_pos 505:0 (10/11)
intron_pos 547:1 (11/11)
BEGIN
1 MLSSAATATS VSARSGDILC GLFRKKSVAP FRFTQPVYRT SLCPSFVAVR AMAESQTAQR
61 NQPQSSGSSG EKQALISLSD KRDLASLGNG LQELGYTIVS TGGTASTLEN AGVSVTKVEK
121 LTHFPEMLDG RVKTLHPNIH GGILARRDVE HHMEALNEHG IGTFDVVVVN LYPFYEKVTA
181 PGGISFEDGI ENIDIGGPAM IRAAAKNHKD VLIVVDSGDY QAVLEYLKGG QSDQQFRRKL
241 AWKAFQHVAA YDSAVSEWLW KQTEGKEKFP PSFTVPLVLK SSLRYGENPH QKAAFYVDKS
301 LAEVNAGGIA TAIQHHGKEM SYNNYLDADA AWNCVSEFEN PTCVVVKHTN PCGVASRDDI
361 LEAYRLAVKA DPVSAFGGIV AFNVEVDEVL AREIREFRSP TDGETRMFYE IVVAPKYTAK
421 GLEVLKGKSK TLRILEAKKN DQGKLSLRQV GGGWLAQDSD DLTPEDISFN SVSDKTPTES
481 ELADAKFAWL CVKHVKSNAI VIAKNNCMLG MGSGQPNRVE SLRIAFKKAG EEAKGAALAS
541 DAFFPFAWKD AVEEACQMGI GVIAEPGGSI RDQDAIDCCK KYGVSLLFTN VRHFRH
//