LOCUS AEC06955.1 952 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana Gls protein (DUF810) protein.
ACCESSION CP002685-2122
PROTEIN_ID AEC06955.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /locus_tag="AT2G20010"
/gene_synonym="T2G17.19"
/gene_synonym="T2G17_19"
/inference="Similar to RNA sequence,
EST:INSD:AU228272.1,INSD:AA650972.1,INSD:AV545927.1,
INSD:EL177005.1,INSD:BP602039.1,INSD:EL042314.1,
INSD:AV567639.1,INSD:EL323194.1,INSD:ES150479.1,
INSD:EG456079.1,INSD:R29866.1,INSD:EL318136.1,
INSD:CB185655.1,INSD:EL178192.1,INSD:EL338389.1,
INSD:EG456080.1,INSD:AV440017.1,INSD:AV549997.1,
INSD:CD530229.1,INSD:BP602089.1,INSD:AV442065.1,
INSD:AV546305.1,INSD:AV548512.1,INSD:EH881430.1"
/inference="similar to RNA sequence, mRNA:INSD:AK228728.1"
/note="FUNCTIONS IN: molecular_function unknown; INVOLVED
IN: biological_process unknown; LOCATED IN:
cellular_component unknown; EXPRESSED IN: 22 plant
structures; EXPRESSED DURING: 13 growth stages; CONTAINS
InterPro DOMAIN/s: Munc13 homology 1 (InterPro:IPR014770),
Protein of unknown function DUF810 (InterPro:IPR008528),
Mammalian uncoordinated homology 13, domain 2
(InterPro:IPR014772); BEST Arabidopsis thaliana protein
match is: Protein of unknown function (DUF810)
(TAIR:AT2G25800.1); Has 178 Blast hits to 167 proteins in
22 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi -
0; Plants - 172; Viruses - 0; Other Eukaryotes - 6
(source: NCBI BLink)."
/db_xref="Araport:AT2G20010"
/db_xref="TAIR:AT2G20010"
intron_pos 141:0 (1/4)
intron_pos 431:0 (2/4)
intron_pos 584:0 (3/4)
intron_pos 647:1 (4/4)
BEGIN
1 MESLPSPFGD PAPNLSNSEL RETAYEILVA ACRSTGSRPL TYIPQSPKSD RSNGLTTASL
61 SPSPSLHRSL TSTAASKVKK ALGMKKRIGD GDGGAGESSS QPDRSKKSVT VGELVRVQMR
121 ISEQIDSRIR RALLRIASGQ LGRRVEMMVL PLELLQQLKA SDFPDQEEYE SWQRRNLKLL
181 EAGLILYPCV PLSKSDKSVQ QLKQIIRSGL ERPLDTGKIT GETQNLRSLV MSLASRQNNN
241 GIGSETCHWA DGFPLNLRIY QMLLESCFDV NDELLIVEEV DEVLELIKKT WPVLGINQMI
301 HNVCFLWVLV NRYVSTGQVE NDLLVAAHNL ILEIENDAME TNDPEYSKIL SSVLSLVMDW
361 GEKRLLAYHD TFNIDNVETL ETTVSLGILV AKVLGEDISS EYRRKKKHVD SGRDRVDTYI
421 RSSLRMAFQQ TKRMVEHSKK SKSRQSTNNL PALAILAEDI GHLAFNEKAI FSPILKNWHP
481 LAAGVAAATL HSCYGTELKK FVSGITELTP DAIRVLTAAD KLEKDLVQIA VQDAVDSEDG
541 GKSVIREMPP FEAEVVIGNL VKSWIKIRVD RLKEWIDRNL QQEVWNPRSN KLGIAPSAVD
601 VLRMVDETLE AFFLLPILLH PVLLPELTSG LDKCMQHYVS KAKSSCGSRN TFLPVLPALT
661 RCTVGSRLHG VFKKKEKPMV ASHRRKSQLG TGNDSAEILQ FCCRINTLQY IRTEIESSGR
721 KTLNRLPESE VAALDAKGKI FEQSISYCSK GIQQLSEATA YKIVFHDLSN VLWDGLYLGE
781 VPSSRIEPFL QELERCLEII SSSVHDRVRT RVISDIMRAS FDGFLLVLLA GGPSRGFTIQ
841 DSAAVEEDFK FLCDLFWSNG DGLPLDLIEK VSTTVKSILP LLRTDTDSLI ERFKAVCLEN
901 HGSDRGKLPL PPTSGPWSPT EPNTLLRVLC YRYDEPATKF LKKTYNLPRK LT
//