LOCUS       AEC10206.1               283 aa            PRT       PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana P4H isoform 1 protein.
ACCESSION   CP002685-6596
PROTEIN_ID  AEC10206.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 19698289)
  AUTHORS   Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
            Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
            Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
            Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
            Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
            Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
            Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
            Venter,J.C.
  TITLE     Sequence and analysis of chromosome 2 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 761-768 (1999)
   PUBMED   10617197
REFERENCE   2  (bases 1 to 19698289)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 19698289)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="2"
                     /ecotype="Columbia"
     protein         /gene="AT-P4H-1"
                     /locus_tag="AT2G43080"
                     /gene_synonym="P4H isoform 1"
                     /gene_synonym="PROLYL 4-HYDROXYLASE"
                     /inference="Similar to RNA sequence,
                     EST:INSD:EL014020.1,INSD:DR273534.1,INSD:DR273525.1,
                     INSD:AV440058.1,INSD:DR273524.1,INSD:BP850238.1,
                     INSD:DR273521.1,INSD:DR273518.1,INSD:DR273520.1,
                     INSD:DR273528.1,INSD:EL314237.1,INSD:ES207470.1,
                     INSD:DR273537.1,INSD:EG483700.1,INSD:AU238533.1,
                     INSD:EG483702.1,INSD:ES105103.1,INSD:AU229725.1,
                     INSD:ES067510.1,INSD:EL987241.1,INSD:EL120007.1,
                     INSD:DR273533.1,INSD:DR273527.1,INSD:ES193875.1,
                     INSD:BP855440.1,INSD:EL019516.1,INSD:DR273539.1,
                     INSD:ES035157.1,INSD:AV565844.1,INSD:EH841861.1,
                     INSD:DR273529.1"
                     /inference="similar to RNA sequence,
                     mRNA:INSD:BX821796.1,INSD:AK117688.1,INSD:BX820965.1,
                     INSD:BT006098.1"
                     /note="P4H isoform 1 (AT-P4H-1); FUNCTIONS IN:
                     oxidoreductase activity, acting on paired donors, with
                     incorporation or reduction of molecular oxygen,
                     2-oxoglutarate as one donor, and incorporation of one atom
                     each of oxygen into both donors, procollagen-proline
                     4-dioxygenase activity; INVOLVED IN: peptidyl-proline
                     hydroxylation to 4-hydroxy-L-proline; LOCATED IN:
                     endomembrane system; EXPRESSED IN: 23 plant structures;
                     EXPRESSED DURING: 15 growth stages; CONTAINS InterPro
                     DOMAIN/s: Prolyl 4-hydroxylase, alpha subunit
                     (InterPro:IPR006620), Oxoglutarate/iron-dependent
                     oxygenase (InterPro:IPR005123); BEST Arabidopsis thaliana
                     protein match is: 2-oxoglutarate (2OG) and
                     Fe(II)-dependent oxygenase superfamily protein
                     (TAIR:AT1G20270.1); Has 2440 Blast hits to 2431 proteins
                     in 340 species: Archae - 0; Bacteria - 392; Metazoa - 982;
                     Fungi - 96; Plants - 404; Viruses - 14; Other Eukaryotes -
                     552 (source: NCBI BLink)."
                     /db_xref="Araport:AT2G43080"
                     /db_xref="TAIR:AT2G43080"
     intron_pos      22:1 (1/11)
     intron_pos      38:1 (2/11)
     intron_pos      57:1 (3/11)
     intron_pos      75:0 (4/11)
     intron_pos      97:0 (5/11)
     intron_pos      122:0 (6/11)
     intron_pos      147:0 (7/11)
     intron_pos      170:2 (8/11)
     intron_pos      188:0 (9/11)
     intron_pos      218:0 (10/11)
     intron_pos      249:0 (11/11)
BEGIN
        1 MAPAMKIVFG LLTFVTVGMV IGSLLQLAFI NRLEDSYGTG FPSLRGLRGQ NTRYLRDVSR
       61 WANDKDAELL RIGNVKPEVV SWSPRIIVLH DFLSPEECEY LKAIARPRLQ VSTVVDVKTG
      121 KGVKSDVRTS SGMFLTHVER SYPIIQAIEK RIAVFSQVPA ENGELIQVLR YEPQQFYKPH
      181 HDYFADTFNL KRGGQRVATM LMYLTDDVEG GETYFPLAGD GDCTCGGKIM KGISVKPTKG
      241 DAVLFWSMGL DGQSDPRSIH GGCEVLSGEK WSATKWMRQK ATS
//