LOCUS AEC10206.1 283 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana P4H isoform 1 protein.
ACCESSION CP002685-6596
PROTEIN_ID AEC10206.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /gene="AT-P4H-1"
/locus_tag="AT2G43080"
/gene_synonym="P4H isoform 1"
/gene_synonym="PROLYL 4-HYDROXYLASE"
/inference="Similar to RNA sequence,
EST:INSD:EL014020.1,INSD:DR273534.1,INSD:DR273525.1,
INSD:AV440058.1,INSD:DR273524.1,INSD:BP850238.1,
INSD:DR273521.1,INSD:DR273518.1,INSD:DR273520.1,
INSD:DR273528.1,INSD:EL314237.1,INSD:ES207470.1,
INSD:DR273537.1,INSD:EG483700.1,INSD:AU238533.1,
INSD:EG483702.1,INSD:ES105103.1,INSD:AU229725.1,
INSD:ES067510.1,INSD:EL987241.1,INSD:EL120007.1,
INSD:DR273533.1,INSD:DR273527.1,INSD:ES193875.1,
INSD:BP855440.1,INSD:EL019516.1,INSD:DR273539.1,
INSD:ES035157.1,INSD:AV565844.1,INSD:EH841861.1,
INSD:DR273529.1"
/inference="similar to RNA sequence,
mRNA:INSD:BX821796.1,INSD:AK117688.1,INSD:BX820965.1,
INSD:BT006098.1"
/note="P4H isoform 1 (AT-P4H-1); FUNCTIONS IN:
oxidoreductase activity, acting on paired donors, with
incorporation or reduction of molecular oxygen,
2-oxoglutarate as one donor, and incorporation of one atom
each of oxygen into both donors, procollagen-proline
4-dioxygenase activity; INVOLVED IN: peptidyl-proline
hydroxylation to 4-hydroxy-L-proline; LOCATED IN:
endomembrane system; EXPRESSED IN: 23 plant structures;
EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
DOMAIN/s: Prolyl 4-hydroxylase, alpha subunit
(InterPro:IPR006620), Oxoglutarate/iron-dependent
oxygenase (InterPro:IPR005123); BEST Arabidopsis thaliana
protein match is: 2-oxoglutarate (2OG) and
Fe(II)-dependent oxygenase superfamily protein
(TAIR:AT1G20270.1); Has 2440 Blast hits to 2431 proteins
in 340 species: Archae - 0; Bacteria - 392; Metazoa - 982;
Fungi - 96; Plants - 404; Viruses - 14; Other Eukaryotes -
552 (source: NCBI BLink)."
/db_xref="Araport:AT2G43080"
/db_xref="TAIR:AT2G43080"
intron_pos 22:1 (1/11)
intron_pos 38:1 (2/11)
intron_pos 57:1 (3/11)
intron_pos 75:0 (4/11)
intron_pos 97:0 (5/11)
intron_pos 122:0 (6/11)
intron_pos 147:0 (7/11)
intron_pos 170:2 (8/11)
intron_pos 188:0 (9/11)
intron_pos 218:0 (10/11)
intron_pos 249:0 (11/11)
BEGIN
1 MAPAMKIVFG LLTFVTVGMV IGSLLQLAFI NRLEDSYGTG FPSLRGLRGQ NTRYLRDVSR
61 WANDKDAELL RIGNVKPEVV SWSPRIIVLH DFLSPEECEY LKAIARPRLQ VSTVVDVKTG
121 KGVKSDVRTS SGMFLTHVER SYPIIQAIEK RIAVFSQVPA ENGELIQVLR YEPQQFYKPH
181 HDYFADTFNL KRGGQRVATM LMYLTDDVEG GETYFPLAGD GDCTCGGKIM KGISVKPTKG
241 DAVLFWSMGL DGQSDPRSIH GGCEVLSGEK WSATKWMRQK ATS
//