LOCUS       AEC08519.1               723 aa            PRT       PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana transcription factor bHLH155-like protein.
ACCESSION   CP002685-4289
PROTEIN_ID  AEC08519.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 19698289)
  AUTHORS   Lin,X., Kaul,S., Rounsley,S., Shea,T.P., Benito,M.I., Town,C.D.,
            Fujii,C.Y., Mason,T., Bowman,C.L., Barnstead,M., Feldblyum,T.V.,
            Buell,C.R., Ketchum,K.A., Lee,J., Ronning,C.M., Koo,H.L.,
            Moffat,K.S., Cronin,L.A., Shen,M., Pai,G., Van Aken,S., Umayam,L.,
            Tallon,L.J., Gill,J.E., Adams,M.D., Carrera,A.J., Creasy,T.H.,
            Goodman,H.M., Somerville,C.R., Copenhaver,G.P., Preuss,D.,
            Nierman,W.C., White,O., Eisen,J.A., Salzberg,S.L., Fraser,C.M. and
            Venter,J.C.
  TITLE     Sequence and analysis of chromosome 2 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 402 (6763), 761-768 (1999)
   PUBMED   10617197
REFERENCE   2  (bases 1 to 19698289)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 19698289)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="2"
                     /ecotype="Columbia"
     protein         /gene="CPUORF7"
                     /locus_tag="AT2G31280"
                     /gene_synonym="conserved peptide upstream open reading
                     frame 7"
                     /gene_synonym="F16D14.12"
                     /gene_synonym="F16D14_12"
                     /gene_synonym="LHL2"
                     /gene_synonym="LL2"
                     /gene_synonym="LONESOME HIGHWAY LIKE 2"
                     /inference="Similar to RNA sequence,
                     EST:INSD:BE528282.1,INSD:EG445328.1,INSD:EG443658.1,
                     INSD:BP837397.1,INSD:BP816989.1,INSD:EG435626.1,
                     INSD:EL231518.1,INSD:EG435654.1,INSD:EG435659.1,
                     INSD:EG435668.1,INSD:EG443677.1,INSD:EG443660.1,
                     INSD:EG445236.1,INSD:EG445233.1,INSD:EH945559.1,
                     INSD:EG443662.1,INSD:EG445232.1,INSD:EG435674.1,
                     INSD:EG445234.1,INSD:EG443687.1,INSD:EG435661.1"
                     /inference="similar to RNA sequence,
                     mRNA:INSD:AJ576042.1"
                     /note="conserved peptide upstream open reading frame 7
                     (CPUORF7); FUNCTIONS IN: sequence-specific DNA binding
                     transcription factor activity; INVOLVED IN: regulation of
                     transcription; LOCATED IN: nucleus; CONTAINS InterPro
                     DOMAIN/s: Helix-loop-helix DNA-binding domain
                     (InterPro:IPR001092), Helix-loop-helix DNA-binding
                     (InterPro:IPR011598); BEST Arabidopsis thaliana protein
                     match is: basic helix-loop-helix (bHLH) DNA-binding
                     superfamily protein (TAIR:AT1G06150.1); Has 35333 Blast
                     hits to 34131 proteins in 2444 species: Archae - 798;
                     Bacteria - 22429; Metazoa - 974; Fungi - 991; Plants -
                     531; Viruses - 0; Other Eukaryotes - 9610 (source: NCBI
                     BLink)."
                     /db_xref="Araport:AT2G31280"
                     /db_xref="TAIR:AT2G31280"
     intron_pos      33:1 (1/8)
     intron_pos      59:2 (2/8)
     intron_pos      99:2 (3/8)
     intron_pos      127:0 (4/8)
     intron_pos      147:0 (5/8)
     intron_pos      549:0 (6/8)
     intron_pos      583:0 (7/8)
     intron_pos      623:0 (8/8)
BEGIN
        1 MGKRQISQDE VGPPIKPRAG LRREQAGRGS YREILKSFCF NTDWDYAVFW QLNHRGSRMV
       61 LTLEDAYYDH HGTNMHGAHD PLGLAVAKMS YHVYSLGEGI VGQVAVSGEH QWVFPENYNN
      121 CNSAFETILV VAVGPCGVVQ LGSLCKPKMP SEGLHAEAFP DCSGEVDKAM DVEESNILTQ
      181 YKTRRSDSMP YNTPSSCLVM EKAAQVVGGR EVVQGSTCGS YSGVTFGFPV DLVGAKHENQ
      241 VGTNIIRDAP HVGMTSGCKD SRDLDPNLHL YMKNHVLNDT STSALAIEAE RLITSQSYPR
      301 LDSTFQATSR TDKESSYHNE VFQLSENQGN KYIKETERML GRNCESSQFD ALISSGYTFA
      361 GSELLEALGS AFKQTNTGQE ELLKSEHGST MRPTDDMSHS QLTFDPGPEN LLDAVVANVC
      421 QRDGNARDDM MSSRSVQSLL TNMELAEPSG QKKHNIVNPI NSAMNQPPMA EVDTQQNSSD
      481 ICGAFSSIGF SSTYPSSSSD QFQTSLDIPK KNKKRAKPGE SSRPRPRDRQ LIQDRIKELR
      541 ELVPNGSKCS IDSLLERTIK HMLFLQNVTK HAEKLSKSAN EKMQQKETGM QGSSCAVEVG
      601 GHLQVSSIIV ENLNKQGMVL IEFNLCLNSS PKFCECVLKV FLGIGQMLCE ECGHFLEIAN
      661 VIRSLDLVIL RGFTETQGEK TWICFVTEVG SRITQFMKEI PKQIKVFSLV LIELWISMIY
      721 MTD
//