LOCUS AEE75897.1 652 aa PRT PLN 23-MAR-2023
DEFINITION Arabidopsis thaliana high chlorophyll fluorescent 107 protein.
ACCESSION CP002686-3179
PROTEIN_ID AEE75897.1
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
Camelineae; Arabidopsis.
REFERENCE 1 (bases 1 to 23459830)
AUTHORS Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
Yamada,M., Yasuda,M. and Tabata,S.
CONSRTM European Union Chromosome 3 Arabidopsis Sequencing Consortium;
Institute for Genomic Research; Kazusa DNA Research Institute
TITLE Sequence and analysis of chromosome 3 of the plant Arabidopsis
thaliana
JOURNAL Nature 408 (6814), 820-822 (2000)
PUBMED 11130713
REFERENCE 2 (bases 1 to 23459830)
AUTHORS Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
CONSRTM TAIR
TITLE Direct Submission
JOURNAL Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE 3 (bases 1 to 23459830)
AUTHORS Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
Vaughn,M. and Town,C.D.
TITLE Direct Submission
JOURNAL Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
9704 Medical Center Dr, Rockville, MD 20850, USA
REMARK Protein update by submitter
FEATURES Qualifiers
source /organism="Arabidopsis thaliana"
/mol_type="genomic DNA"
/db_xref="taxon:3702"
/chromosome="3"
/ecotype="Columbia"
protein /gene="HCF107"
/locus_tag="AT3G17040"
/gene_synonym="high chlorophyll fluorescent 107"
/inference="Similar to RNA sequence,
EST:INSD:DR240168.1,INSD:DR240172.1,INSD:DR240173.1,
INSD:EL993654.1,INSD:BP828232.1,INSD:EH929117.1,
INSD:DR240166.1,INSD:EH836122.1,INSD:BX839007.1,
INSD:DR379791.1,INSD:DR240165.1,INSD:DR240174.1,
INSD:ES195264.1,INSD:BP778399.1,INSD:ES123569.1,
INSD:EL061089.1,INSD:AV828361.1,INSD:EG493500.1,
INSD:BP587541.1,INSD:ES197078.1,INSD:EH879552.1,
INSD:DR297133.1,INSD:ES013019.1,INSD:ES097580.1,
INSD:BP852197.1,INSD:DR240169.1,INSD:EL323442.1,
INSD:AV816900.1,INSD:EL112492.1,INSD:DR240162.1,
INSD:BP645621.1,INSD:DR383739.1,INSD:DR240160.1,
INSD:EH929721.1,INSD:AV801174.1,INSD:EL146991.1,
INSD:DR240167.1,INSD:BX841003.1,INSD:Z30486.1,
INSD:DR240164.1,INSD:DR297134.1,INSD:DR240170.1,
INSD:BX835387.1,INSD:DR240163.1,INSD:BP783082.1,
INSD:CB258267.1,INSD:N38275.1,INSD:BP788534.1,
INSD:EL022539.1,INSD:DR252705.1,INSD:EL140236.1,
INSD:DR378492.1,INSD:ES114386.1,INSD:EH868052.1,
INSD:AV805227.1,INSD:EL340573.1,INSD:EH957724.1,
INSD:EH866177.1,INSD:BP669387.1,INSD:AV527367.1,
INSD:DR240171.1,INSD:EL098333.1"
/inference="Similar to RNA sequence,
mRNA:INSD:AY093112.1,INSD:BX823785.1,INSD:BT008405.1"
/note="high chlorophyll fluorescent 107 (HCF107);
FUNCTIONS IN: binding; INVOLVED IN: plastid organization,
RNA processing, regulation of translation; LOCATED IN:
chloroplast, plasma membrane, chloroplast envelope;
EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13
growth stages; CONTAINS InterPro DOMAIN/s: RNA-processing
protein, HAT helix (InterPro:IPR003107),
Tetratricopeptide-like helical (InterPro:IPR011990),
Tetratricopeptide repeat-containing (InterPro:IPR013026),
Tetratricopeptide repeat (InterPro:IPR019734); BEST
Arabidopsis thaliana protein match is: pre-mRNA splicing
factor-related (TAIR:AT4G03430.1); Has 8355 Blast hits to
4834 proteins in 935 species: Archae - 387; Bacteria -
2714; Metazoa - 1907; Fungi - 1281; Plants - 751; Viruses
- 0; Other Eukaryotes - 1315 (source: NCBI BLink)."
/db_xref="TAIR:AT3G17040"
/db_xref="Araport:AT3G17040"
intron_pos 194:0 (1/7)
intron_pos 242:0 (2/7)
intron_pos 344:0 (3/7)
intron_pos 364:0 (4/7)
intron_pos 446:0 (5/7)
intron_pos 483:0 (6/7)
intron_pos 542:0 (7/7)
BEGIN
1 MHFFFVPNSS SSSPSPANTS SFSLSFLTPQ IPENLCKSPT KIHIGTHGIS GQSFLSHPTF
61 SSKNTYLYAV VDRSSSGVFS PQKESANGEG EESNTEEGVL VVRRPLLENS DKESSEEEGK
121 KYPARIDAGL SNIAKKMPIF EPERSESSSS SSAAAAARAQ ERPLAVNLDL SLYKAKVLAR
181 NFRYKDAEKI LEKCIAYWPE DGRPYVALGK ILSKQSKLAE ARILYEKGCQ STQGENSYIW
241 QCWAVLENRL GNVRRARELF DAATVADKKH VAAWHGWANL EIKQGNISKA RNLLAKGLKF
301 CGRNEYIYQT LALLEAKAGR YEQARYLFKQ ATICNSRSCA SWLAWAQLEI QQERYPAARK
361 LFEKAVQASP KNRFAWHVWG VFEAGVGNVE RGRKLLKIGH ALNPRDPVLL QSLGLLEYKH
421 SSANLARALL RRASELDPRH QPVWIAWGWM EWKEGNTTTA RELYQRALSI DANTESASRC
481 LQAWGVLEQR AGNLSAARRL FRSSLNINSQ SYVTWMTWAQ LEEDQGDTER AEEIRNLYFQ
541 QRTEVVDDAS WVTGFLDIID PALDTVKRLL NFGQNNDNNR LTTTLRNMNR TKDSQSNQQP
601 ESSAGREDIE TGSGFNLDVF LRSKLSLDPL KLDVNLDSKR LERFTRGRIN GA
//