LOCUS       AEE75005.1              1705 aa    PRT              PLN 23-MAR-2023
DEFINITION  Arabidopsis thaliana Clathrin, heavy chain protein.
ACCESSION   CP002686-1960
PROTEIN_ID  AEE75005.1
SOURCE      Arabidopsis thaliana (thale cress)
  ORGANISM  Arabidopsis thaliana
            Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
            Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae;
            Pentapetalae; rosids; malvids; Brassicales; Brassicaceae;
            Camelineae; Arabidopsis.
REFERENCE   1  (bases 1 to 23459830)
  AUTHORS   Salanoubat,M., Lemcke,K., Rieger,M., Ansorge,W., Unseld,M.,
            Fartmann,B., Valle,G., Blocker,H., Perez-Alonso,M., Obermaier,B.,
            Delseny,M., Boutry,M., Grivell,L.A., Mache,R., Puigdomenech,P., De
            Simone,V., Choisne,N., Artiguenave,F., Robert,C., Brottier,P.,
            Wincker,P., Cattolico,L., Weissenbach,J., Saurin,W., Quetier,F.,
            Schafer,M., Muller-Auer,S., Gabel,C., Fuchs,M., Benes,V.,
            Wurmbach,E., Drzonek,H., Erfle,H., Jordan,N., Bangert,S.,
            Wiedelmann,R., Kranz,H., Voss,H., Holland,R., Brandt,P.,
            Nyakatura,G., Vezzi,A., D'Angelo,M., Pallavicini,A., Toppo,S.,
            Simionati,B., Conrad,A., Hornischer,K., Kauer,G., Lohnert,T.H.,
            Nordsiek,G., Reichelt,J., Scharfe,M., Schon,O., Bargues,M.,
            Terol,J., Climent,J., Navarro,P., Collado,C., Perez-Perez,A.,
            Ottenwalder,B., Duchemin,D., Cooke,R., Laudie,M., Berger-Llauro,C.,
            Purnelle,B., Masuy,D., de Haan,M., Maarse,A.C., Alcaraz,J.P.,
            Cottet,A., Casacuberta,E., Monfort,A., Argiriou,A., flores,M.,
            Liguori,R., Vitale,D., Mannhaupt,G., Haase,D., Schoof,H., Rudd,S.,
            Zaccaria,P., Mewes,H.W., Mayer,K.F., Kaul,S., Town,C.D., Koo,H.L.,
            Tallon,L.J., Jenkins,J., Rooney,T., Rizzo,M., Walts,A.,
            Utterback,T., Fujii,C.Y., Shea,T.P., Creasy,T.H., Haas,B.,
            Maiti,R., Wu,D., Peterson,J., Van Aken,S., Pai,G., Militscher,J.,
            Sellers,P., Gill,J.E., Feldblyum,T.V., Preuss,D., Lin,X.,
            Nierman,W.C., Salzberg,S.L., White,O., Venter,J.C., Fraser,C.M.,
            Kaneko,T., Nakamura,Y., Sato,S., Kato,T., Asamizu,E., Sasamoto,S.,
            Kimura,T., Idesawa,K., Kawashima,K., Kishida,Y., Kiyokawa,C.,
            Kohara,M., Matsumoto,M., Matsuno,A., Muraki,A., Nakayama,S.,
            Nakazaki,N., Shinpo,S., Takeuchi,C., Wada,T., Watanabe,A.,
            Yamada,M., Yasuda,M. and Tabata,S.
  CONSRTM   European Union Chromosome 3 Arabidopsis Sequencing Consortium;
            Institute for Genomic Research; Kazusa DNA Research Institute
  TITLE     Sequence and analysis of chromosome 3 of the plant Arabidopsis
            thaliana
  JOURNAL   Nature 408 (6814), 820-822 (2000)
   PUBMED   11130713
REFERENCE   2  (bases 1 to 23459830)
  AUTHORS   Swarbreck,D., Lamesch,P., Wilks,C. and Huala,E.
  CONSRTM   TAIR
  TITLE     Direct Submission
  JOURNAL   Submitted (18-FEB-2011) Department of Plant Biology, Carnegie
            Institution, 260 Panama Street, Stanford, CA, USA
REFERENCE   3  (bases 1 to 23459830)
  AUTHORS   Krishnakumar,V., Cheng,C.-Y., Chan,A.P., Schobel,S., Kim,M.,
            Ferlanti,E.S., Belyaeva,I., Rosen,B.D., Micklem,G., Miller,J.R.,
            Vaughn,M. and Town,C.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-MAY-2016) Plant Genomics, J. Craig Venter Institute,
            9704 Medical Center Dr, Rockville, MD 20850, USA
  REMARK    Protein update by submitter
FEATURES             Qualifiers
     source          /organism="Arabidopsis thaliana"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:3702"
                     /chromosome="3"
                     /ecotype="Columbia"
     protein         /locus_tag="AT3G11130"
                     /gene_synonym="AtCHC1"
                     /gene_synonym="CHC1"
                     /gene_synonym="clathrin heavy chain 1"
                     /gene_synonym="F11B9.30"
                     /inference="Similar to RNA sequence,
                     EST:INSD:ES143824.1,INSD:EH992183.1,INSD:AV554856.1,
                     INSD:EL125684.1,INSD:EG503879.1,INSD:ES164377.1,
                     INSD:EL124329.1,INSD:EL166859.1,INSD:ES070969.1,
                     INSD:EH807818.1,INSD:ES123793.1,INSD:EL987407.1,
                     INSD:BU635497.1,INSD:ES196248.1,INSD:AV544059.1,
                     INSD:EL225928.1,INSD:AV554841.1,INSD:ES113240.1,
                     INSD:EL040492.1,INSD:AV545812.1,INSD:T43744.1,
                     INSD:ES126424.1,INSD:ES118757.1,INSD:T76474.1,
                     INSD:AV547315.1,INSD:ES027180.1,INSD:AV540231.1,
                     INSD:EL262721.1,INSD:AV554881.1,INSD:AV542920.1,
                     INSD:EH898866.1,INSD:AV541492.1,INSD:EH982099.1,
                     INSD:AV546262.1,INSD:AV548433.1,INSD:EL011946.1,
                     INSD:AV539078.1,INSD:AV547582.1,INSD:AI993667.1,
                     INSD:R65303.1,INSD:EL167006.1,INSD:EL147920.1,
                     INSD:EL025865.1,INSD:EL995896.1,INSD:AV552270.1,
                     INSD:EL314527.1,INSD:ES199317.1,INSD:AV546707.1,
                     INSD:EH988141.1,INSD:EL308203.1,INSD:ES118110.1,
                     INSD:AV523609.1,INSD:AV519978.1,INSD:ES103874.1,
                     INSD:AU229160.1,INSD:EL105442.1,INSD:EL271050.1,
                     INSD:DR358871.1,INSD:AV546055.1,INSD:EL116331.1,
                     INSD:AV545851.1,INSD:ES075842.1,INSD:EH876874.1,
                     INSD:AV542242.1,INSD:ES090066.1,INSD:ES151102.1,
                     INSD:EL331281.1,INSD:BX839468.1,INSD:AV524224.1,
                     INSD:AV547602.1,INSD:EL340905.1,INSD:AV523293.1,
                     INSD:EL049531.1,INSD:AV543842.1,INSD:EL214959.1,
                     INSD:AV554796.1,INSD:AV551426.1,INSD:ES139139.1,
                     INSD:AV528941.1,INSD:EH934478.1,INSD:BX835623.1,
                     INSD:AV547902.1,INSD:EH939054.1,INSD:AV530084.1,
                     INSD:BU636239.1,INSD:ES140750.1,INSD:BE526904.1,
                     INSD:EL028016.1,INSD:EL217723.1,INSD:EH919642.1,
                     INSD:EH993354.1,INSD:BP814560.1,INSD:W43865.1,
                     INSD:ES182973.1,INSD:N96531.1,INSD:AV552525.1,
                     INSD:DR358872.1,INSD:AV548048.1,INSD:EH827404.1,
                     INSD:AV529323.1,INSD:AV519357.1,INSD:AV545882.1,
                     INSD:EL284183.1,INSD:AU238029.1,INSD:BE523905.1,
                     INSD:ES059822.1,INSD:Z29134.1,INSD:N65570.1,
                     INSD:EH831054.1,INSD:DR354393.1,INSD:EH932764.1,
                     INSD:BE528606.1,INSD:EL337710.1,INSD:EL033534.1,
                     INSD:EL330285.1,INSD:AV545876.1"
                     /inference="Similar to RNA sequence,
                     mRNA:INSD:AK229443.1,INSD:AK229949.1"
                     /note="Clathrin, heavy chain; FUNCTIONS IN: structural
                     molecule activity, binding; INVOLVED IN: intracellular
                     protein transport, vesicle-mediated transport; LOCATED IN:
                     plasma membrane, vacuole; EXPRESSED IN: male gametophyte,
                     guard cell, cultured cell, pollen tube; EXPRESSED DURING:
                     L mature pollen stage, M germinated pollen stage; CONTAINS
                     InterPro DOMAIN/s: Clathrin, heavy chain
                     (InterPro:IPR016341), Clathrin, heavy chain,
                     linker/propeller domain (InterPro:IPR016025),
                     Tetratricopeptide-like helical (InterPro:IPR011990),
                     Clathrin, heavy chain, propeller, N-terminal
                     (InterPro:IPR001473), Clathrin, heavy chain, linker, core
                     motif (InterPro:IPR015348), Clathrin, heavy chain,
                     propeller repeat (InterPro:IPR022365), Armadillo-type fold
                     (InterPro:IPR016024), Clathrin, heavy chain/VPS, 7-fold
                     repeat (InterPro:IPR000547); BEST Arabidopsis thaliana
                     protein match is: Clathrin, heavy chain
                     (TAIR:AT3G08530.1); Has 1621 Blast hits to 1503 proteins
                     in 495 species: Archae - 0; Bacteria - 35; Metazoa - 935;
                     Fungi - 178; Plants - 133; Viruses - 0; Other Eukaryotes -
                     340 (source: NCBI BLink)."
                     /db_xref="TAIR:AT3G11130"
                     /db_xref="Araport:AT3G11130"
     intron_pos      16:0 (1/29)
     intron_pos      84:1 (2/29)
     intron_pos      114:0 (3/29)
     intron_pos      140:1 (4/29)
     intron_pos      183:0 (5/29)
     intron_pos      216:0 (6/29)
     intron_pos      252:1 (7/29)
     intron_pos      279:0 (8/29)
     intron_pos      355:0 (9/29)
     intron_pos      377:0 (10/29)
     intron_pos      416:0 (11/29)
     intron_pos      482:0 (12/29)
     intron_pos      521:0 (13/29)
     intron_pos      541:0 (14/29)
     intron_pos      570:0 (15/29)
     intron_pos      598:0 (16/29)
     intron_pos      643:0 (17/29)
     intron_pos      664:0 (18/29)
     intron_pos      699:0 (19/29)
     intron_pos      736:2 (20/29)
     intron_pos      821:0 (21/29)
     intron_pos      866:2 (22/29)
     intron_pos      959:2 (23/29)
     intron_pos      1294:0 (24/29)
     intron_pos      1456:0 (25/29)
     intron_pos      1513:0 (26/29)
     intron_pos      1575:0 (27/29)
     intron_pos      1616:0 (28/29)
     intron_pos      1653:0 (29/29)
BEGIN
        1 MAAANAPIIM KEVLTLPSVG IGQQFITFTN VTMESDKYIC VRETAPQNSV VIIDMNMPMQ
       61 PLRRPITADS ALMNPNSRIL ALKAQVPGTT QDHLQIFNIE AKAKLKSHQM PEQVAFWKWI
      121 TPKMLGLVTQ TSVYHWSIEG DSEPVKMFDR TANLANNQII NYKCSPNEKW LVLIGIAPGS
      181 PERPQLVKGN MQLFSVDQQR SQALEAHAAS FAQFKVPGNE NPSILISFAS KSFNAGQITS
      241 KLHVIELGAQ PGKPSFTKKQ ADLFFPPDFA DDFPVAMQVS HKFNLIYVIT KLGLLFVYDL
      301 ETASAIYRNR ISPDPIFLTS EASSVGGFYA INRRGQVLLA TVNEATIIPF ISGQLNNLEL
      361 AVNLAKRGNL PGAENLVVQR FQELFAQTKY KEAAELAAES PQGILRTPDT VAKFQSVPVQ
      421 AGQTPPLLQY FGTLLTRGKL NSYESLELSR LVVNQNKKNL LENWLAEDKL ECSEELGDLV
      481 KTVDNDLALK IYIKARATPK VVAAFAERRE FDKILIYSKQ VGYTPDYMFL LQTILRTDPQ
      541 GAVNFALMMS QMEGGCPVDY NTITDLFLQR NLIREATAFL LDVLKPNLPE HAFLQTKVLE
      601 INLVTFPNVA DAILANGMFS HYDRPRVAQL CEKAGLYIQS LKHYSELPDI KRVIVNTHAI
      661 EPQALVEFFG TLSSEWAMEC MKDLLLVNLR GNLQIIVQAC KEYCEQLGVD ACIKLFEQFK
      721 SYEGLYFFLG SYLSMSEDPE IHFKYIEAAA KTGQIKEVER VTRESNFYDA EKTKNFLMEA
      781 KLPDARPLIN VCDRFGFVPD LTHYLYTNNM LRYIEGYVQK VNPGNAPLVV GQLLDDECPE
      841 DFIKGLILSV RSLLPVEPLV AECEKRNRLR LLTQFLEHLV SEGSQDVHVH NALGKIIIDS
      901 NNNPEHFLTT NPYYDSKVVG KYCEKRDPTL AVVAYRRGQC DEELINVTNK NSLFKLQARY
      961 VVERMDGDLW EKVLTEENEY RRQLIDQVVS TALPESKSPE QVSAAVKAFM TADLPHELIE
     1021 LLEKIVLQNS AFSGNFNLQN LLILTAIKAD PSRVMDYINR LDNFDGPAVG EVAVDAQLYE
     1081 EAFAIFKKFN LNVQAVNVLL DNVRSIERAV EFAFRVEEDA VWSQVAKAQL REGLVSDAIE
     1141 SFIRADDTTQ FLEVIRASED TNVYDDLVRY LLMVRQKVKE PKVDSELIYA YAKIERLGEI
     1201 EEFILMPNVA NLQHVGDRLY DEALYEAAKI IYAFISNWAK LAVTLVKLQQ FQGAVDAARK
     1261 ANSAKTWKEV CFACVDAEEF RLAQICGLNI IIQVDDLEEV SEYYQNRGCF NELISLMESG
     1321 LGLERAHMGI FTELGVLYAR YRYEKLMEHI KLFSTRLNIP KLIRACDEQQ HWQELTYLYI
     1381 QYDEFDNAAT TVMNHSPEAW EHMQFKDIVA KVANVELYYK AVHFYLQEHP DIINDLLNVL
     1441 ALRLDHTRVV DIMRKAGHLR LIKPYMVAVQ SNNVSAVNEA LNEIYAEEED YDRLRESIDL
     1501 HDSFDQIGLA QKIEKHELVE MRRVAAYIYK KAGRWKQSIA LSKKDNMYKD CMETASQSGD
     1561 HDLAEQLLVY FIEQGKKECF ATCLFVCYDL IRPDVALELA WINNMIDFAF PYLLQFIREY
     1621 SGKVDELIKD KLEAQKEVKA KEQEEKDVMS QQNMYAQLLP LALPAPPMPG MGGGGYGPPP
     1681 QMGGMPGMSG MPPMPPYGMP PMGGY
//