LOCUS AEC05810.1 1253 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana hypothetical protein protein.
ACCESSION CP002685-595
PROTEIN_ID AEC05810.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 19698289)
AUTHORS Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
Venter,J.C.
TITLE Sequence and analysis of chromosome 2 of the plant Arabidopsis
thaliana
JOURNAL Nature 402 (6763), 761-768 (1999)
PUBMED 10617197
REFERENCE 2 (bases 1 to 19698289)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 19698289)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="2"
/ecotype="Columbia"
protein /locus_tag="AT2G04235"
/inference="Similar to RNA sequence,
EST:INSD:ES029991.1,INSD:ES025441.1,INSD:AU238755.1"
/inference="similar to RNA sequence,
mRNA:INSD:BT003691.1,INSD:AK117915.1"
/note="unknown protein; FUNCTIONS IN: molecular_function
unknown; INVOLVED IN: biological_process unknown; LOCATED
IN: cellular_component unknown; EXPRESSED IN: cultured
cell; Has 713 Blast hits to 345 proteins in 122 species:
Archae - 2; Bacteria - 262; Metazoa - 138; Fungi - 55;
Plants - 39; Viruses - 0; Other Eukaryotes - 217 (source:
NCBI BLink)."
/db_xref="Araport:AT2G04235"
/db_xref="TAIR:AT2G04235"
intron_pos 783:0 (1/13)
intron_pos 841:0 (2/13)
intron_pos 865:0 (3/13)
intron_pos 892:0 (4/13)
intron_pos 904:2 (5/13)
intron_pos 934:0 (6/13)
intron_pos 976:0 (7/13)
intron_pos 1042:0 (8/13)
intron_pos 1068:2 (9/13)
intron_pos 1092:0 (10/13)
intron_pos 1127:0 (11/13)
intron_pos 1162:1 (12/13)
intron_pos 1191:2 (13/13)
BEGIN
1 MASEKPEDPM NNTAGIGTDE ESIAQRRKRL RRVSFADREI TSVHIFNRDE DYETPPNTSA
61 AKPQNGGDTS EPDEDNKVIR FFGELSDRED TDGDGDGEYE PILDKSFLRP KYSPSSGGST
121 VGSATSDNGT LQLLCEFRVL FFEFLAESIL FSLPEDNFFG PVSSHFINPG RLLDTPISEE
181 HHEMTMDSTA FSMHFRSLAR SESGDVRTPT SSHLLVEEKT PTEVTSRSDT GSAMVLTEPK
241 KLFPKSPVPV DKGSGGRDSN DMSIVGENSR RYDYGYLSPT LAALMGDESK ELLPEDNTVE
301 ARSPIDDFSS SLPNGCIPIG LQESGSQRYT KEASLSSSTI RRQSAFLVGM LPQSLSCVTP
361 SPTQGGSFMS RETRALVESL STIQKSKSRL GLIPPSPGSA LSQRIEKSKL QLSGHRFLTT
421 PSIGREEIGV LRDKHADIPI TNLEALLSKH DNRTPISEEK SMPDKCISGA LSHAVDTSDD
481 NRTPVPEEKG IPDQCISGAL SHAVDTSDDN KTPVPEEKGI PDQCSSGALN PAVDTSDDNR
541 TPVQEKKGLP DQCSSGALSP AVDTSDDRPP VSEKKGIPDQ HSCGALIPAV DISDVFARRS
601 PEGNTNSEIE GSLLCKQQQR NQAASTPEKF VSSPTNLSNA TTSASENFVP LQDQEQHSKD
661 IEKSETGDGN VTKEYASNCS MNTLSEKVDS LLAESSVLLT DTGFLNGSAQ QREKDSVRNK
721 KQNRTNISAA HILLKDNNPF KVHCETEVIS AEDFTAVAKE NLPSTSGSSS VDRSKNEASH
781 AKGPSRLKRK AEDVDCAARN CSPKVERSTK YISNSVMEHP DGNIDANDCR RVREQVNWVE
841 IPGKVSKEIN QMLAPLADKL NSRLICKLED ILTHMKKVHL CEMLCLQIQS QKVCDHLSGA
901 KTKRRVESRS LLCKLAYDKA KLELLHLKKE IMMKKFQAVS TGVQTSETLR LNCANFLRQH
961 GFRSTGLLNP DQAQEVIITG KRAEITQEIK EIDSKIKNLI QCFTACDTMT GPQPAYADTI
1021 MIAEETLKKR MSCRSLRQDI LIWKVDSLGE WNDCQSIVLN YSGVFNQRLT LKPGHPSCVL
1081 VSNSLSDTFV KHFPEMNVSI AFNSMFNAED SRRYIGGSNT LLEITQKTSL LLHNLLDVAE
1141 EFHLAQMNIP NLVQGNFDSP SAEQLHLQIS FLDCTNLRKL SVILDVTCLI HGKYPSDVVP
1201 CEFRKVSGTK RDGVVSKQLK KEIESTIDDV GVGYPRILRL CRCISKALQS EKR
//