Introduction
Numerous proteins have been shown to fold spontaneously in vitro, confirming Anfinsen's pioneering insight that the linear sequence of the polypeptide chain contains all the necessary information to specify a protein's three-dimensional structure1. Although protein folding has been studied intensely for almost 50 years, how the final fold (and the folding process) is determined by the amino acid sequence remains one of the most important problems in biology. Moreover, in the more recent past it has become clear that, in the cell, a large fraction of newly synthesized proteins require assistance by molecular chaperones to reach their folded states efficiently and on a biologically relevant timescale2. Clearly, proteins in the test tube and in the cell are subject to the same laws of physics, so what is special about folding under cellular conditions, and why are chaperones necessary? The increasing availability of highly sensitive biophysical techniques to study folding in vitro and in cellular systems is now providing new insights into these issues (see the Review by Bartlett and Radford3 in this issue). These studies also shed light on the process of aggregation, a potentially dangerous off-pathway reaction that can cause disease and must be prevented by molecular chaperones.
Folding and aggregation
Folding intermediates are the rule for larger proteins of >100 amino acids (
90% of all proteins in a cell), which have a greater tendency to rapidly collapse in aqueous solution into compact non-native conformations4. As shown recently by a combination of rapid mixing techniques and sensitive spectroscopic measurements, even small proteins that fold on a subsecond timescale may pass through structural intermediates en route to the native state3, 4. Such intermediates either represent on-pathway 'stepping stones' toward the native state or kinetically stable, misfolded conformations that may require substantial reorganization before the native state can be reached. The formation of metastable, non-native interactions during folding is interpreted as a consequence of the ruggedness of the funnel-shaped folding energy landscape5, 6 (Fig. 1), irrespective of whether proteins are thought to fold through multiple downhill routes or through preferred pathways defined by the sequential assembly of elementary folding units, so-called 'foldons'7, 8. Examples of such minimal nucleation motifs are the two-stranded-helix motifs found in
/
domain proteins. Multistate folding behavior with populated intermediates would be observed when multiple foldons are separated and do not act cooperatively8 or when foldons misassemble, resulting in a kinetic block of folding. The propensity to misfold increases with topologically complex fold types that are stabilized by long-range interactions (for example,
/
domain architectures) or when proteins contain multiple domains that are separate in the native state but may interact during folding9, 10.
Figure 1: Energy landscape scheme of protein folding and aggregation.
The purple surface shows the multitude of conformations 'funneling' to the native state via intramolecular contacts and the pink area shows the conformations moving toward amorphous aggregates or amyloid fibrils via intermolecular contacts. Both parts of the energy surface overlap. Aggregate formation can occur from intermediates populated during de novo folding or by destabilization of the native state into partially folded states and is normally prevented by molecular chaperones. Cell-toxic oligomers may occur as off-pathway intermediates of amyloid fibril formation.
Full size image (54 KB)Partially folded or misfolded states often tend to aggregate, particularly when they represent major kinetic traps in the folding pathway. This is due to the fact that these forms typically expose hydrophobic amino acid residues and regions of unstructured polypeptide backbone, features that are mostly buried in the native state. Like intramolecular folding, aggregation—the association of two or more non-native protein molecules—is largely driven by hydrophobic forces and primarily results in the formation of amorphous structures (Fig. 1). Alternatively, aggregation can lead to the formation of highly ordered, fibrillar aggregates called amyloid, in which
-strands run perpendicular to the long fibril axis (cross-
structure) (Fig. 1). Although apparently restricted to a subset of proteins under physiological conditions, these thermodynamically highly stable structures are accessible to many proteins under denaturing conditions, largely independent of sequence, suggesting that their formation is an inherent property of the polypeptide chain11. The formation of amyloid fibrils is usually toxic to cells and may give rise to some of the most debilitating neurodegenerative diseases.
Folding in the cell—the molecular chaperone concept
New, fluorescence-based techniques now allow protein folding and aggregation to be observed in vivo in real time12. These and other studies indicate that the tendency of partially folded proteins to aggregate is greatly enhanced in the highly crowded environment of the cell, largely explaining the requirement of molecular chaperones13. Whereas folding experiments in vitro are typically performed in dilute solution to minimize aggregation, in the cell, folding occurs in the presence of 300–400 g l-1 of protein and other macromolecules. The resulting excluded volume effects substantially enhance the affinities between interacting protein molecules, including folding intermediates. The translation process can potentially further increase the risk of misfolding and aggregation, because incomplete polypeptide chains cannot fold into stable native conformations. Additionally, the exit channel of the large ribosomal subunit, which is
100 Å long but, at most, 20 Å wide14, largely precludes folding beyond the formation of
-helical elements15, 16 and thus prevents the C-terminal 40–60 residues of the chain from participating in long-range interactions (see the Review by Bukau and colleagues17 in this issue). As a consequence, productive folding can occur only after a complete protein or at least a domain (
50–300 amino acids) has emerged from the ribosome, consistent with the general rules of folding established in vitro and supported by recent simulations of nascent chain folding18. Because translation is relatively slow (
15–75 s for a 300-amino-acid protein), nascent chains are exposed in partially folded, aggregation-sensitive states for prolonged periods of time. Moreover, non-native intrachain contacts formed during translation could block folding upon completion of synthesis. Molecular chaperones therefore interact co-translationally with nascent polypeptides and inhibit their premature (mis)folding. For example, the chaperone Trigger factor binds to the small titin I27 chain (
120 amino acids) until its complete
-sandwich domain has emerged from the ribosome19.
The cellular chaperone machinery ensures that folding is efficient for most proteins20. We define a molecular chaperone as any protein which interacts, stabilizes or helps a non-native protein to acquire its native conformation but is not present in the final functional structure. Chaperones are involved in a multitude of cellular functions, including de novo folding, refolding of stress-denatured proteins, oligomeric assembly, intracellular protein transport and assistance in proteolytic degradation. Chaperones that participate broadly in protein biogenesis, such as the heat-shock protein (Hsp)-70s and chaperonins (Hsp60s), primarily recognize hydrophobic amino acid side-chains exposed by non-native proteins and promote their folding through ATP-regulated cycles of binding and release (Fig. 2). Binding to chaperone blocks aggregation and reduces the concentration of folding intermediates, whereas transient release of bound hydrophobic regions is necessary for folding to proceed. It is important to realize that chaperones act not by contributing steric information to the folding process but rather by optimizing the efficiency of folding. Notably, a number of essential proteins have extremely low intrinsic folding efficiencies and essentially do not fold in the absence of chaperones. For example, actins and tubulins seem to have highly energetically frustrated folding pathways and can overcome kinetic folding barriers only through assistance by chaperones. As mutations often disrupt a protein's ability to fold, it follows that the chaperone system is also important in buffering such deleterious mutations21, 22. This buffering function is thought to be crucial in the evolution of new protein functions and phenotypic traits21, 23, 24.
Figure 2: Chaperone mechanism in promoting folding through kinetic partitioning.
Many chaperones, such as the Hsp70s, are switched between high- and low-affinity states for unfolded and partially folded protein by ATP binding and hydrolysis. ATP binding causes substrate release, allowing folding toward the native state to proceed; hydrolysis is necessary for chaperone cycling. Folding is promoted and aggregation prevented when KFold is greater than KOn for chaperone binding (or rebinding) of partially folded states and when KOn is greater than intermolecular association by the higher-order rate constant KAgg (KFold > KOn > KAgg). Multiple cycles of substrate binding and release will slow folding relative to spontaneous folding in the absence of aggregation, assuming that the chaperone does not accelerate the rearrangement of kinetically trapped states. Note that for proteins that populate such misfolded states, KOn may be greater than KFold (KFold
KOn > KAgg). These proteins are stabilized in a non-aggregated state, but fail to fold through chaperone cycling. They may require transfer into the chaperonin cage for folding. Under certain conditions (overproduction of slow folding proteins, conformational stress), KAgg may become faster than KOn and aggregation occurs (KAgg > KOn
KFold), unless chaperone expression is induced via the stress-response pathway.
Numerous classes of structurally unrelated chaperones have been described25, 26. Many of these are known as stress proteins or heat-shock proteins, as they are upregulated by cells under conditions of conformational stress in which the concentration of aggregation-prone folding intermediates increases. Chaperones are usually classified according to their molecular weight (Hsp40, Hsp60, Hsp70, Hsp90, Hsp100 and the so-called small Hsp proteins). Although all these components have the capacity to prevent aggregation, only certain members of the Hsp100 family in bacteria and fungi can actively dissociate aggregates for subsequent protein refolding or degradation (reviewed in ref. 27). The cellular chaperone machinery forms complex networks that are indispensable for protein quality control and maintenance of protein homeostasis28.
Principles of chaperone pathways in de novo folding
The chaperone pathways and networks acting in protein folding in the cytosol follow general organizational rules29, 30. In all three domains of life, we find two major principles of chaperone action, represented by (i) machinery that functions in stabilizing nascent polypeptides on ribosomes and initiating folding (also see ref. 17 in this issue) and (ii) components that act downstream in completing the folding process31, 32 (Fig. 3). Both systems cooperate in coherent pathways. The number of interacting substrates and the degree of functional redundancy among chaperone components decreases from upstream to downstream.
Figure 3: Protein folding in the cytosol.
Models for the chaperone-assisted folding of newly synthesized polypeptides in the cytosol. (a) Bacteria. Nascent chains probably interact generally with Trigger factor (TF), and most small proteins (
70% of total) may fold rapidly upon synthesis without further assistance. Longer chains interact subsequently with DnaK and DnaJ (Hsp70 system) and fold upon one or several cycles of ATP-dependent binding and release (
20% of total). About 10% of chains transit the chaperonin system (GroEL and GroES) for folding. N, native protein. (b) Archaea. PFD, prefoldin; NAC, nascent chain–associated complex. Note that only some archaeal species contain DnaK and DnaJ. (c) Eukarya. Like TF, NAC probably interacts generally with nascent chains, but the role of NAC in folding is not yet clear. About 20% of chains reach their native states in a reaction assisted by RAC (ribosome-associated complex), Hsp70 and Hsp40. A fraction of these must be transferred to Hsp90 for folding. About 10% of chains are co- or post-translationally passed on to the chaperonin TRiC/CCT in a reaction mediated by Hsp70 and PFD, both of which interact directly with TRiC/CCT. PFD recognizes the nascent chains of certain TRiC substrates, including actin and tubulins.
The first category of factors includes chaperones that bind directly to the large ribosomal subunit in close proximity to the polypeptide exit site, such as bacterial Trigger factor (Fig. 3a) and a specialized Hsp70 system called RAC (ribosome-associated complex) in eukaryotes (Fig. 3c). In S. cerevisiae, RAC consists of the Hsp70 homologs Ssb1 (or Ssb2), Ssz1 and the Hsp40 homolog zuotin, whereas mammalian RAC consists of Hsp70L1 and Mpp11, homologs of Ssz1 and zuotin33, 34. Additionally, archaea and eukarya contain the nascent chain–associated complex, NAC17, 25, 26 (Fig. 3b,c). These ribosome-bound chaperones are in a privileged position to interact with the majority of nascent chains destined for the cytosol. They bind linear chain segments enriched with hydrophobic amino acids, thus delaying premature chain compaction and maintaining the elongating polypeptide in a non-aggregated state until sufficient structural information for productive folding is available. Based on a recent analysis by cryo–electron tomography, the three-dimensional organization of polyribosomes additionally seems to minimize the probability of nascent chain contacts between adjacent ribosomes35.
Members of the Hsp70 family (DnaK in bacteria, Hsc70 in higher eukaryotes) function as second-tier chaperones for longer nascent chains. These factors do not bind directly to the ribosome and mediate co- or post-translational folding through ATP-regulated binding cycles. Co-translational domain folding serves to avoid non-native interdomain contacts, thus smoothing the energy landscape for large proteins9, 36. This mechanism is essential for the folding of the large number of eukaryotic multidomain proteins and is facilitated by the slower translation speed in eukaryotes (
4 amino acids per second versus
20 amino acids per second in bacteria). The Hsp70s are also the central organizers of the chaperone network and distribute subsets of proteins to chaperones that act downstream, such as the chaperonins (Hsp60s) (GroEL in bacteria and TRiC/CCT in eukarya)24, 31, 37 and Hsp90 (ref. 38). The chaperonins are large, cylindrical complexes that function by enclosing protein molecules, one at a time, in a cage-like nanocompartment, so that folding can occur unimpaired by aggregation. Interestingly, the eukaryotic chaperonin interacts directly with Hsp70 (ref. 39) and other upstream factors, such as prefoldin40, and thus can be recruited to nascent chains that are unable to fold with Hsp70 alone41. This functional coupling by direct interaction between chaperones facilitates co-translational folding and avoids the partitioning of non-native protein into the bulk cytosol42, 43. Hsp90 is another important chaperone system that functions downstream of Hsp70 in the folding and conformational regulation of many signaling molecules (reviewed in refs. 38,44). Additional machinery may be required for the assembly of folded protein subunits into large oligomeric complexes45, as shown recently for RbcX, a specific assembly chaperone of hexadecameric Rubisco46.
The flux of newly synthesized proteins through the chaperone system is currently best understood for the bacterial cytosol (Fig. 3a). In Eschrichia coli, most of the
2,400 cytosolic proteins are thought to interact first with ribosome-bound Trigger factor, which is restricted to bacteria. Although most small, single-domain proteins probably do not require further chaperone interactions, longer chains (>20–30 kDa) may interact subsequently with the ATP-regulated Hsp70 DnaK and its Hsp40 cochaperone, DnaJ. Co-immunoprecipitation experiments indicate that
20% by mass of newly synthesized polypeptides fold through cycles of Hsp70 binding and release47. Trigger factor and DnaK have partially overlapping functions, but their combined deletion is lethal at temperatures above 30 °C47, 48. The GroEL chaperonin acts downstream of Trigger factor/DnaK and is involved in the folding of
10% of cytosolic proteins24, 49. GroEL and GroES are essential at all growth conditions, because a number of essential proteins are crucially GroEL dependent for folding24. Similar fractions of total protein have been shown to transit the eukaryotic Hsp70 and chaperonin systems42, 50, 51.
The chaperone pathways operating in the endoplasmic reticulum (ER) follow analogous organizational principles, but specialized machinery is used in disulfide bond formation and sugar modification of many secretory proteins (reviewed in ref. 52). Likewise, the biogenesis and folding of membrane proteins uses specialized machinery for insertion and assembly of the membrane-integrated parts, whereas cytosolic and ER luminal chaperones assist in the folding of exposed domains (for review, see the Perspective by Skach53 in this issue).
Chaperone paradigms
Several mechanistic paradigms of chaperone action in protein folding have been defined. The following sections provide a brief discussion of Trigger factor, an ATP-independent chaperone, the ATP-regulated Hsp70 system and the chaperonins, which mediate folding in the cytosol. For each of these systems, there are mechanistic models that are well supported by structural and functional data.
Trigger factor. Trigger factor is an abundant
50-kDa protein, consisting of an N-terminal ribosome-binding domain, a peptidyl-prolyl isomerase (PPIase) domain and a C-terminal domain, the latter being positioned in the structure between the N and PPIase domains54 (Fig. 4a). Trigger factor exists in two forms: a monomer when bound to the ribosome and a dimer when free in the cytosol. The PPIase domain, which can catalyze prolyl cis-trans isomerization in vitro, functions as an auxiliary chaperone site independent of proline residues19, 55, 56. The C domain, containing two arm-like protrusions, is the major binding region for hydrophobic nascent chain segments55, 57. The N domain binds to ribosomal proteins L23 and L29 next to the polypeptide exit site54, with a mean residence time of
10–15 s (ref. 19). Ribosome binding causes a conformational opening of Trigger factors, presumably activating it for nascent chain interaction19, 58, 59 (Fig. 4b). Chain release from Trigger factor is ATP-independent, and is probably driven by the tendency of the bound polypeptide to bury hydrophobic regions. Nascent chains that interact only weakly may begin to fold co-translationally, perhaps with the ribosome-bound Trigger factor providing a protected environment54, 56. However, when the nascent chain exposes strongly hydrophobic segments, Trigger factor leaves the ribosome but remains associated with the elongating chain, explaining how Trigger factor delays the folding of certain multidomain proteins relative to translation36. The eventual dissociation of Trigger factor facilitates folding or polypeptide transfer to downstream chaperones such as DnaK.
Figure 4: Structure and function of Trigger factor (TF).
(a) The domain structure of TF, including the FRK ribosome-binding loop in the N domain (PDB 1W26)54. (b) A model of the TF reaction cycle (modified from ref. 18). (1) Free TF is in rapid equilibrium between monomeric and dimeric states. (2) TF monomer binds to nontranslating ribosomes with a Kd of
1
M and a mean residence time of
10 s. Ribosome binding causes a conformational expansion of TF and may activate TF for interaction with nascent chains. (3) Nascent chains that interact weakly with TF may begin to compact co-translationally in the vicinity of TF. Release of TF from the nascent chain coincides with TF dissociation from the ribosome and allows completion of folding to native state (N). (4) Structurally more complex proteins may interact strongly with TF. TF remains bound to the nascent chain after dissociating from the ribosome and a new TF molecule can enter at the ribosome. Eventual chain dissociation from TF facilitates transfer to DnaK or folding. (5) Released TF enters the monomer-dimer pool.
The Hsp70 system. The Hsp70 proteins are the most versatile chaperones and occur both as constitutively expressed and stress-inducible forms25. Besides broadly assisting in de novo folding, they have various other functions, including protein trafficking and assistance in the proteolytic degradation of terminally misfolded proteins. Hsp70s generally collaborate with chaperones of the Hsp40 (DnaJ) family and nucleotide-exchange factors (NEFs) in the ATP-regulated binding and release of non-native proteins60. Their role in de novo folding begins by binding to nascent chains, but they generally do not interact directly with the ribosome (with the exception of the specialized Hsp70s Ssb1, Ssb2 and Ssz1 in fungi and Hsp70L1 in mammalian cells). Binding and release by Hsp70 is achieved through the allosteric coupling of a conserved N-terminal ATPase domain (
40 kDa) with a C-terminal peptide-binding domain (PBD) (
25 kDa), the latter consisting of a
-sandwich subdomain and an
-helical lid segment61 (Fig. 5a). The
-sandwich recognizes extended,
7-residue segments enriched with hydrophobic amino acids62. Such segments occur on average every 50–100 residues in proteins. The
-helical lid and a conformational change in the
-sandwich domain regulate the affinity state for peptide in an ATP-dependent manner60, 63. In the ATP-bound state, the lid adopts an open conformation, resulting in high on- and off-rates (low affinity) for peptide. Hydrolysis of ATP to ADP is strongly accelerated by Hsp40, leading to lid closure and stable peptide binding (low on- and off-rates; high affinity) (Fig. 5b). Interaction of substrate with Hsp70 is mediated by the so-called J domain, which is present in all Hsp40s60. Hsp40s also interact directly with unfolded polypeptides and can recruit Hsp70 to protein substrates64. Following ATP-hydrolysis, various NEFs (GrpE in bacteria; Bag, HspBP1 or Hsp110 in eukaryotes) bind to the Hsp70 ATPase domain and catalyze ADP-ATP exchange, which results in lid opening and substrate release, thereby completing the reaction cycle.
Figure 5: Hsp70 system.
(a) Structures of the ATPase domain (PDB 1DKG)104 and the peptide-binding domain (PDB 1DKZ)61 of Hsp70 shown representatively for E. coli DnaK. The
-helical lid of the peptide binding domain is shown in yellow and the extended peptide substrate as a ball-and-stick model in pink. ATP indicates the position of the nucleotide-binding site. The amino acid sequence of the peptide is indicated. The interaction of prokaryotic and eukaryotic cofactors with Hsp70 is shown schematically. Residue numbers refer to human Hsp70. Only the Hsp70 proteins of the eukaryotic cytosol have the COOH-terminal sequence EEVD, which is involved in binding of tetratricopeptide repeat (TPR) cofactors103. (b) Hsp70 reaction cycle. NEF, nucleotide-exchange factor (GrpE in case of E. coli DnaK; Bag, HspBP1 and Hsp110 in case of eukaryotic cytosolic Hsp70). (1) Hsp40-mediated delivery of substrate to ATP-bound Hsp70. (2) Hydrolysis of ATP to ADP, accelerated by Hsp40, results in closing of the
-helical lid and tight binding of substrate by Hsp70. Hsp40 dissociates from Hsp70. (3) Dissociation of ADP catalyzed by NEF. (4) Opening of the
-helical lid, induced by ATP binding, results in substrate release. (5) Released substrate either folds to native state (N), is transferred to downstream chaperones or rebinds to Hsp70.
Hsp70-mediated folding and prevention of aggregation may be explained by a process of kinetic partitioning, as shown in Figure 2: binding of Hsp70 to non-native substrate hinders aggregation by transiently shielding exposed hydrophobic segments and at the same time reducing the concentration of aggregation-prone species. Release allows fast-folding molecules (or domains) to bury hydrophobic residues, whereas molecules that need longer than a few seconds to fold will rebind to Hsp70 and avoid aggregation. Binding to Hsp70 may result in conformational remodeling65, perhaps removing kinetic barriers to fast folding. A subset of proteins that are unable to partition to fast-folding trajectories upon Hsp70 cycling remain stabilized in a non-aggregated state and must be transferred into the specialized environment of the chaperonin cage for folding24, 31. Interestingly, the recently described NEFs of the Hsp110 family in eukaryotes are themselves Hsp70 homologs66, 67, 68, 69, 70 and thus may cooperate with conventional Hsp70s in protein folding beyond catalyzing nucleotide exchange69. Such a cooperation could provide functional coordination between multiple bound Hsp70 molecules and facilitate folding for proteins with complex domain topologies.
The chaperonins. Chaperonins are large, double-ringed complexes of
800 kDa. There are two groups of chaperonin26, 71. Members of group I (also called Hsp60s) occur in bacteria (GroEL), mitochondria and chloroplasts. They have seven-membered rings and functionally cooperate with Hsp10 proteins (bacterial GroES), which form the lid of the folding cage. The group II chaperonins in archaea (thermosome) and in the eukaryotic cytosol (TRiC/CCT) consist of eight- or nine-membered rings. They are independent of Hsp10 factors, their lid function being built into the chaperonin ring in the form of specialized
-helical extensions. As in the case of Hsp70, substrate binding by chaperonin is ATP-regulated, but, unlike the Hsp70s, the chaperonins promote folding through cycles of global protein encapsulation.
The group I chaperonin of E. coli (GroEL–GroES) has been studied extensively2, 30, 71 (Fig. 6). GroEL interacts with at least
250 different cytosolic proteins, of which
85 are predicted to be obligate GroEL substrates. Most of these proteins are between 20 kDa and 50 kDa and have complex
/
or
+
domain topologies, with a distinct enrichment of the (
/
)8 TIM barrel fold24. Such proteins are stabilized by many long-range interactions and therefore tend to populate kinetically trapped folding intermediates72. The GroEL double-ring is composed of
57-kDa subunits that consist of an equatorial ATPase domain, an intermediate hinge domain and an apical substrate-binding domain. The apical domains expose hydrophobic amino acid residues for substrate binding toward the ring center. GroES is a heptameric ring of
10-kDa subunits that covers the ends of the GroEL cylinder (Fig. 6a)71, 73. Based on recent analyses by NMR, single-molecule Förster resonance energy transfer (FRET) and EM, GroEL-bound substrates populate an ensemble of compact and locally expanded states that lack stable tertiary interactions (Fig. 6b)74, 75, 76, 77. Whereas unfolding upon binding may help to resolve non-native interactions in trapped intermediates75, folding critically depends on the global encapsulation of the substrate in the chaperonin ring by GroES22, 71, 78. Binding of GroES is preceded by ATP binding to GroEL and causes a pronounced conformational change that leads to the formation of a cage with a highly hydrophilic, net negatively charged inner wall. Encapsulated protein up to
60 kDa is free to fold in this environment for 10–15 s, the time needed for ATP hydrolysis in the GroES-bound ring (cis-ring). Protein substrate leaves the cage upon GroES dissociation, which is induced by ATP binding in the opposite ring (trans-ring). Substrate that has not yet folded rapidly rebinds to GroEL for further folding attempts. Proteins that exceed the size limit of the chaperonin cage either use the Hsp70 system for folding24, 36 or may reach native state through binding and release from GroEL without encapsulation79.
Figure 6: The GroEL-GroES chaperonin.
(a) Crystal structure of the asymmetric GroEL–GroES complex (PDB 1AON)73, showing the GroES-bound chamber of GroEL (cis) and the opposite GroEL ring (trans). (b) Working model summarizing the conformational changes in a substrate protein upon transfer from DnaK–DnaJ (Hsp70 system) to GroEL and during GroEL–GroES-mediated folding. (1) Substrate protein may be delivered to GroEL by DnaK–DnaJ in a non-aggregated, but kinetically trapped, state. Upon binding to GroEL it undergoes local unfolding to an ensemble of expanded and more compact conformations. (2) ATP-dependent domain movement of the apical GroEL domains result in stretching of tightly bound regions of substrate and in release and partial compaction of less stably bound regions. (3) Compaction is completed upon substrate encapsulation by GroES. (4) Folding in the chaperonin cage. (5) Substrate release upon GroES dissociation. (6) Rebinding of incompletely folded states. Note that binding of a second substrate molecule to the open ring of GroEL in steps (4) and (5) as well as the transient formation of a symmetrical GroEL–GroES2 complex is omitted for simplicity. N, native state; I, folding intermediate.
Full size image (59 KB)Enclosing unfolded protein, one molecule at a time, in a specialized folding compartment provides an effective solution to the aggregation problem. However, the physical properties of the compartment are likely to have additional effects on folding beyond prevention of aggregation. Notably, larger substrates fully occupy the limited volume of the GroEL and GroES nanocage, as shown impressively by recent cryo-EM analysis of chaperonin complexes80. The constraints resulting from this high degree of geometric confinement will inevitably affect the folding energy landscape of such proteins. Although it has been suggested that the chaperonin merely functions as a passive aggregation-prevention device81, other studies provided evidence that encapsulation can accelerate folding up to tenfold over the rate of spontaneous folding (measured in the absence of aggregation)22, 78, 82. This rate acceleration may be attributed to steric confinement by the cage, entropically destabilizing misfolded states and promoting the acquisition of more compact, native-like conformations, consistent with confinement theory83, 84. Mutational analysis showed further that the polar residues of the cavity wall are crucial for rapid folding22, 85, 86. According to molecular dynamics simulations, these polar residues are expected to promote folding by accumulating ordered water molecules in their vicinity, thereby generating a local environment in which a substrate protein is forced to bury exposed hydrophobic residues more effectively87. This effect would be significant only with proteins that approach the size limit of the cage, consistent with the finding that folding of smaller substrates of 25–30 kDa is not accelerated78, 88.
Other elements of the chaperonin mechanism may also contribute to accelerating folding. It has been proposed that repeated unfolding events in successive binding-and-release cycles help to reverse kinetically trapped states ('iterative annealing')89, 90. Moreover, the release of substrate from the GroEL apical domains during encapsulation may follow a stepwise mechanism, with less tightly bound hydrophobic regions being released first75 (Fig. 6b). This stepwise release might facilitate the folding of proteins that fail to fold through spontaneous hydrophobic collapse. The contribution of these effects to chaperonin-assisted folding as well as the exact coordination of the two GroEL rings in the folding cycle remain to be clarified91.
Implications in understanding human disease
An increasing number of diseases are now recognized to be associated with aberrant protein folding, with pathomechanistic considerations converging on chaperone functions and related quality-control pathways. Of particular interest in this regard are pathological states in which aberrant folding results in a toxic gain of function. These disorders include, among others, Alzheimer's, Parkinson's and Huntington's disease as well as the prion diseases, all of which are associated with the deposition of fibrillar aggregates (amyloid) either within or around neurons in specific brain regions (also the Perspective by Tessier and Lindquist92 in this issue). Although the toxic principle operating in these disorders is far from being understood, a consensus is emerging that oligomeric, soluble states of the respective disease proteins are the primary cytotoxic species93. These intermediates may interact aberrantly with other proteins or with membrane surfaces, altering their functional properties94. Interestingly, increasing the levels of molecular chaperones, most prominently members of the Hsp70 family, has been shown to inhibit the formation of such oligomers and to prevent the formation of amyloid aggregates for different disease proteins94, 95, 96, 97. In the case of polyglutamine-repeat proteins, which cause Huntington's disease and several other related disorders, Hsp70 cooperates with the chaperonin TRiC/CCT in preventing the accumulation of toxic oligomers98, 99, 100, reminiscent of the functional cooperation between these chaperone systems in de novo protein folding.
These observations raise the question of why, at a certain point in life, the protein quality-control machinery ceases to cope with the formation of potentially toxic, misfolded proteins. Recent advances in understanding the genetic programs underlying the aging process indicate that the functional capacity of molecular chaperones and other aspects of protein quality control decrease during aging101. Although the underlying mechanisms of this decline are not yet understood in detail, this important insight would provide a plausible explanation for the late onset of many neurodegenerative diseases caused by aberrant protein folding. It also suggests that searching for ways to re-establish protein homeostasis, for example by upregulating the expression of chaperones, may offer promising venues for therapeutic intervention28, 102. A more detailed understanding of the complexities of protein folding and cellular chaperone networks will be required for these approaches to ultimately be of clinical benefit.

