LOCUS AEE86105.1 2154 aa PRT PLN 23-MAR-2023 DEFINITION Arabidopsis thaliana MUS308 and mammalian DNA polymerase- like protein protein. ACCESSION CP002687-6001 PROTEIN_ID AEE86105.1 SOURCE Arabidopsis thaliana (thale cress) ORGANISM Arabidopsis thaliana Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. REFERENCE 1 (bases 1 to 18585056) AUTHORS Mayer,K., Schuller,C., Wambutt,R., Murphy,G., Volckaert,G., Pohl,T., Dusterhoft,A., Stiekema,W., Entian,K.D., Terryn,N., Harris,B., Ansorge,W., Brandt,P., Grivell,L., Rieger,M., Weichselgartner,M., de Simone,V., Obermaier,B., Mache,R., Muller,M., Kreis,M., Delseny,M., Puigdomenech,P., Watson,M., Schmidtheini,T., Reichert,B., Portatelle,D., Perez-Alonso,M., Boutry,M., Bancroft,I., Vos,P., Hoheisel,J., Zimmermann,W., Wedler,H., Ridley,P., Langham,S.A., McCullagh,B., Bilham,L., Robben,J., Van der Schueren,J., Grymonprez,B., Chuang,Y.J., Vandenbussche,F., Braeken,M., Weltjens,I., Voet,M., Bastiaens,I., Aert,R., Defoor,E., Weitzenegger,T., Bothe,G., Ramsperger,U., Hilbert,H., Braun,M., Holzer,E., Brandt,A., Peters,S., van Staveren,M., Dirske,W., Mooijman,P., Klein Lankhorst,R., Rose,M., Hauf,J., Kotter,P., Berneiser,S., Hempel,S., Feldpausch,M., Lamberth,S., Van den Daele,H., De Keyser,A., Buysshaert,C., Gielen,J., Villarroel,R., De Clercq,R., Van Montagu,M., Rogers,J., Cronin,A., Quail,M., Bray-Allen,S., Clark,L., Doggett,J., Hall,S., Kay,M., Lennard,N., McLay,K., Mayes,R., Pettett,A., Rajandream,M.A., Lyne,M., Benes,V., Rechmann,S., Borkova,D., Blocker,H., Scharfe,M., Grimm,M., Lohnert,T.H., Dose,S., de Haan,M., Maarse,A., Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Fartmann,B., Granderath,K., Dauner,D., Herzl,A., Neumann,S., Argiriou,A., Vitale,D., Liguori,R., Piravandi,E., Massenet,O., Quigley,F., Clabauld,G., Mundlein,A., Felber,R., Schnabl,S., Hiller,R., Schmidt,W., Lecharny,A., Aubourg,S., Chefdor,F., Cooke,R., Berger,C., Montfort,A., Casacuberta,E., Gibbons,T., Weber,N., Vandenbol,M., Bargues,M., Terol,J., Torres,A., Perez-Perez,A., Purnelle,B., Bent,E., Johnson,S., Tacon,D., Jesse,T., Heijnen,L., Schwarz,S., Scholler,P., Heber,S., Francs,P., Bielke,C., Frishman,D., Haase,D., Lemcke,K., Mewes,H.W., Stocker,S., Zaccaria,P., Bevan,M., Wilson,R.K., de la Bastide,M., Habermann,K., Parnell,L., Dedhia,N., Gnoj,L., Schutz,K., Huang,E., Spiegel,L., Sehkon,M., Murray,J., Sheet,P., Cordes,M., Abu-Threideh,J., Stoneking,T., Kalicki,J., Graves,T., Harmon,G., Edwards,J., Latreille,P., Courtney,L., Cloud,J., Abbott,A., Scott,K., Johnson,D., Minx,P., Bentley,D., Fulton,B., Miller,N., Greco,T., Kemp,K., Kramer,J., Fulton,L., Mardis,E., Dante,M., Pepin,K., Hillier,L., Nelson,J., Spieth,J., Ryan,E., Andrews,S., Geisel,C., Layman,D., Du,H., Ali,J., Berghoff,A., Jones,K., Drone,K., Cotton,M., Joshu,C., Antonoiu,B., Zidanic,M., Strong,C., Sun,H., Lamar,B., Yordan,C., Ma,P., Zhong,J., Preston,R., Vil,D., Shekher,M., Matero,A., Shah,R., Swaby,I.K., O'Shaughnessy,A., Rodriguez,M., Hoffmann,J., Till,S., Granat,S., Shohdy,N., Hasegawa,A., Hameed,A., Lodhi,M., Johnson,A., Chen,E., Marra,M., Martienssen,R. and McCombie,W.R. TITLE Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana JOURNAL Nature 402 (6763), 769-777 (1999) PUBMED 10617198 REFERENCE 2 (bases 1 to 18585056) AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E. CONSRTM TAIR TITLE Direct Submission JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie Institution, 260 Panama Street, Stanford, CA, USA REFERENCE 3 (bases 1 to 18585056) AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M., Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R., Vaughn,M. and Town,C.D. TITLE Direct Submission JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute, 9704 Medical Center Dr, Rockville, MD 20850, USA REMARK Protein update by submitter FEATURES Qualifiers source /organism="Arabidopsis thaliana" /mol_type="genomic DNA" /db_xref="taxon:3702" /chromosome="4" /ecotype="Columbia" protein /locus_tag="AT4G32700" /inference="Similar to RNA sequence, EST:INSD:EG439508.1,INSD:ES118133.1,INSD:ES025298.1, INSD:EH813771.1,INSD:T44382.1,INSD:EG439520.1, INSD:EG439512.1,INSD:EG439522.1,INSD:EG439513.1, INSD:ES062654.1,INSD:EG439509.1,INSD:ES090115.1, INSD:BE530735.1,INSD:EL206206.1,INSD:AV546006.1, INSD:ES125691.1" /inference="Similar to RNA sequence, mRNA:INSD:AB192295.1" /note="helicases;ATP-dependent helicases;nucleic acid binding;ATP binding;DNA-directed DNA polymerases;DNA binding; FUNCTIONS IN: in 6 functions; INVOLVED IN: regulation of gene expression, DNA replication, DNA recombination, photomorphogenesis; EXPRESSED IN: 6 plant structures; EXPRESSED DURING: petal differentiation and expansion stage, E expanded cotyledon stage, D bilateral stage; CONTAINS InterPro DOMAIN/s: DNA/RNA helicase, DEAD/DEAH box type, N-terminal (InterPro:IPR011545), DEAD-like helicase, N-terminal (InterPro:IPR014001), DNA-directed DNA polymerase, family A, palm domain (InterPro:IPR001098), DNA/RNA helicase, C-terminal (InterPro:IPR001650), DNA polymerase A domain (InterPro:IPR002298), Helicase, superfamily 1/2, ATP-binding domain (InterPro:IPR014021); BEST Arabidopsis thaliana protein match is: U5 small nuclear ribonucleoprotein helicase (TAIR:AT2G42270.1); Has 17628 Blast hits to 16579 proteins in 2941 species: Archae - 600; Bacteria - 7507; Metazoa - 1254; Fungi - 1190; Plants - 590; Viruses - 412; Other Eukaryotes - 6075 (source: NCBI BLink)." /db_xref="TAIR:AT4G32700" /db_xref="Araport:AT4G32700" intron_pos 13:0 (1/25) intron_pos 155:2 (2/25) intron_pos 233:0 (3/25) intron_pos 520:0 (4/25) intron_pos 541:2 (5/25) intron_pos 577:0 (6/25) intron_pos 644:0 (7/25) intron_pos 711:0 (8/25) intron_pos 765:0 (9/25) intron_pos 845:0 (10/25) intron_pos 916:0 (11/25) intron_pos 1033:0 (12/25) intron_pos 1084:0 (13/25) intron_pos 1158:0 (14/25) intron_pos 1231:0 (15/25) intron_pos 1270:1 (16/25) intron_pos 1648:0 (17/25) intron_pos 1714:0 (18/25) intron_pos 1800:2 (19/25) intron_pos 1864:0 (20/25) intron_pos 1900:0 (21/25) intron_pos 2017:2 (22/25) intron_pos 2054:0 (23/25) intron_pos 2101:0 (24/25) intron_pos 2133:1 (25/25) BEGIN 1 MDSDSSKSRI DQFYVSKKRK HQSPNLKSGR NEKNVKVTGE RSPGDKGTLD SYLKASLDDK 61 STTNSGLQAR QEAFTRKLDL EVSASSVGQN IHPCLPKPVS FATFKECLGQ NGSQDLHKEG 121 VAAETHATDG LLCANQKDNS ELRDFATSFL SLYCSGVQSV VGSPPHQKEN ELKRRSSSSS 181 LAQDIQISHK RRCESENIPS LDDLTNPLGS KPESLARNGN NRDKPVSDPT KKMPSNESVE 241 IPMGLRKCSK APESSAHLTE FHTPGSAIKS CPVGTPKSGC GSSMFSPGEA FWNEAIQVAD 301 GLTIPIENFG SVEAKVRDQH VTILSCSKKT DKCTEKLERS LDLDEIRVKD KDAIGFSKVV 361 EKHGRDFNKE VYQLPVKNLE LLFQDKNING GIQERCASFD QNNITLGSSR ISESAFVGNK 421 GCENLDIANN AQADKGLIGK MYPEPEGKKV LLCEENRGVR SVSMISNMRK PVGSSESEES 481 HTPSSSHRNY DGLSLSTWLP SEVCSVYNKK GISKLYPWQV ECLQVDGVLQ KRNLVYCAST 541 SAGKSFVAEV LMLRRVIRTG KMALLVLPYV SICAEKAEHL EVLLEPLGKH VRSYYGNQGG 601 GTLPKDTSVA VCTIEKANSL INRLLEEGRL SELGIIVIDE LHMVGDQHRG YLLELMLTKL 661 RYAAGEGSSE SSSGESSGTS SGKADPAHGL QIVGMSATMP NVGAVADWLQ AALYQTEFRP 721 VPLEEYIKVG STIYNKKMEV VRTIPKAADM GGKDPDHIVE LCNEVVQEGN SVLIFCSSRK 781 GCESTARHIS KLIKNVPVNV DGENSEFMDI RSAIDALRRS PSGVDPVLEE TLPSGVAYHH 841 AGLTVEEREI VETCYRKGLV RVLTATSTLA AGVNLPARRV IFRQPMIGRD FIDGTRYKQM 901 SGRAGRTGID TKGDSVLICK PGELKRIMAL LNETCPPLQS CLSEDKNGMT HAILEVVAGG 961 IVQTAKDIHR YVRCTLLNST KPFQDVVKSA QDSLRWLCHR KFLEWNEETK LYTTTPLGRG 1021 SFGSSLCPEE SLIVLDDLLR AREGLVMASD LHLVYLVTPI NVGVEPNWEL YYERFMELSP 1081 LEQSVGNRVG VVEPFLMRMA HGATVRTLNR PQDVKKNLRG EYDSRHGSTS MKMLSDEQML 1141 RVCKRFFVAL ILSKLVQEAS VTEVCEAFKV ARGMVQALQE NAGRFSSMVS VFCERLGWHD 1201 LEGLVAKFQN RVSFGVRAEI VELTSIPYIK GSRARALYKA GLRTSQAIAE ASIPEIVKAL 1261 FESSAWAAEG TGQRRIHLGL AKKIKNGARK IVLEKAEEAR AAAFSAFKSL GLDVNELSKP 1321 LPLAPASSLN GQETTERDIS RGSVGPDGLQ QSIEGHMECE NFDMDNHREK PSEVLGDATL 1381 GVSSEINLTS RLPNFRPIGT AVGTNGPSAV SILSSDTFPI PVYDNREIKP KDNVEQHLTR 1441 NDHIPLSSNK DGTGEKGPVT AGNISGGFDS FLELWGSAGE FFFDLHYNKL QDLNSRISYE 1501 IHGIAICWNC SPVYYVNLNK DLPNLECVEK QKLIEDAVIG KSEVLASHNM LDVIKSRWNK 1561 ISKIMGNVNT RKFTWNLKVQ IQVLKSPAIS IQRCTRLNLP EGIRDELVDG SWLMMPPLHT 1621 SHTIDMSIVI WILWPDEERH SNPNIDKEVK KRLSPEAAEA ANRSGRWRNQ IRRVAHNGCC 1681 RRVAQTRALC SALWKILVSE ELLQALTTIE MPLVNVLADM ELWGIGIDIE GCLRARNILR 1741 DKLRSLEKKA FELAGMTFSL HNPADIANVL FGQLKLPIPE NQSKGKLHPS TDKHCLDLLR 1801 NEHPVVPIIK EHRTLAKLLN CTLGSICSLA KLRLSTQRYT LHGRWLQTST ATGRLSIEEP 1861 NLQSVEHEVE FKLDKNGRDV SSDADRYKIN ARDFFVPTQE NWLLLTADYS QIELRLMAHF 1921 SRDSSLISKL SQPEGDVFTM IAAKWTGKAE DSVSPHDRDQ TKRLIYGILY GMGANRLAEQ 1981 LECTSDEAKE KIRSFKSSFP AVTSWLNETI SFCQEKGYIQ TLKGRRRFLS KIKFGNAKEK 2041 SKAQRQAVNS MCQGSAADII KIAMINIYSA IAEDVDTAAS SSSSETRFHM LKGRCRILLQ 2101 VHDELVLEVD PSYVKLAAML LQTSMENAVS LLVPLHVKLK VGKTWGSLEP FQTD //