Main

The failure of generic chemotherapy as well as targeted therapy, embodied by drug resistance and tumour relapse, demands a broader understanding of the tumour’s response to therapeutic perturbation, which is more complex than the binary outcome – ‘responding’ (e.g., tumour shrinks) vs ‘non-responding’. A more encompassing approach has to take into account the complexity of the regulatory networks that govern cell phenotype changes because the very same cellular machinery that affords a single metazoan genome the ability to produce the multitude of distinct, stable phenotypic cell states also accounts for the broad behavioural repertoire of cancer cells.

The current explanation for the development of therapy resistance and tumour recurrence invokes genomic alterations and selection of mutant cells carrying a relevant mutation acquired by chance before therapy (Nowell, 1976; Yachida et al, 2010; Sottoriva et al, 2013). Such somatic Darwinian selection leads to a shift in cell population composition but does not involve individual cells actively changing their phenotype: the mutant cells are passively selected by the therapy and overtake the cell population. Whole-genome sequencing of tumours with reconstruction of progression history (Yachida et al, 2010; Sottoriva et al, 2013) as well as the finding that ‘driver mutations’ affecting oncogenes within the same pathway are mutually exclusive (Sherr, 1996; DeNicola et al, 2011; Nakasone et al, 2012; Rosenzweig, 2012) offer convincing molecular footprints of Darwinian somatic evolution. The molecular nature of genetic mutations, such as point mutations that alter the target domain of protein kinases where selective inhibitor drugs bind (Vogelstein and Kinzler, 1993; DeNicola et al, 2011; Hanahan and Weinberg, 2011; Nakasone et al, 2012; Rosenzweig, 2012), also provide evidence of Darwinian selection. Accordingly, phenotype innovation during tumour progression, such as development of drug resistance or acquisition of any hallmark of cancer, is now, by default, explained by a genetic mutation (Vogelstein and Kinzler, 1993; Hanahan and Weinberg, 2011). However, a more recent study (Sottoriva et al, 2015) revealed the possibility of neutral evolution, which complicates the interpretation of genomic mutations found in the tumour tissue. In this process, which is well familiar to evolution biologists, most mutations are not selected for but instead accumulate as they are carried along and may by chance become sufficiently predominant to be detectable owing to genetic drift in relatively small cell populations.

In any case, it has long been suspected that driver mutations may only be part of the equation. The scheme of Darwinian evolution relies on the tacit assumption of a rigid one-to-one mapping from genotype to phenotype, such that a phenotype innovation can only result directly from a genomic mutation, which are random in nature but can be selected for. This line of thought ignores the diversity of cell phenotypes that can be generated by a single metazoan genome. In fact, a long line of evidence points to mechanisms that defy the scheme of genetic mutation/selection as the driver of tumour progression. The recent spate of cancer genome sequencing studies has reopened our eyes to the flexibility in the genotype–phenotype mapping, pushing us to think beyond the paradigm of mutation/selection. Such flexibility re-emerges under the term ‘phenotype plasticity’ and entails a departure from the classical mutations/selection scheme as an agent of tumour progression. The case of ependymoma without mutation (Mack et al, 2014; Shlush et al, 2014) or the reappearance of early clones, which if progression were to follow the Darwinian paradigm of selection would have been outcompeted by fitter clones (Nowak, 2006; Hoek and Goding, 2010; Roesch et al, 2010; Sharma et al, 2010; Chaffer and Weinberg, 2011; Shlush et al, 2014; Sun et al, 2014), underscore the need to embrace the non-uniqueness in the genotype–phenotype mapping and limited efficacy of selection.

In this review, we consider non-genetic cell phenotype plasticity as a central process in therapy resistance. We take into account the ability of cells to produce discretely distinct phenotypes, switch between them without genomic alterations and inherit the new phenotype non-genetically across cell generations (Brock et al, 2009). Here, we explain the underlying biology of cell state dynamics and the formal framework of cancer attractors that link cell phenotype dynamics to first principles of dynamical systems theory and offer a conceptual aid for comprehending mutation-independent tumour progression. As illustration, we present examples of therapy-induced drug resistance and discuss implications for new therapies.

Attractors and non-genetic switching on the epigenetic landscape

In the past years, cell fate-switching between a more stem-like, therapy-resistant state and a more differentiated, drug-sensitive state has been observed in a number of tumour cell populations (Hoek and Goding, 2010; Sharma et al, 2010; Chaffer and Weinberg, 2011; DeNicola et al, 2011; Nakasone et al, 2012; Sun et al, 2014). The reversibility of such switching and the clonality of the populations in which both states coexist indicate that this cell phenotype change is not caused by mutations.

Because of their longevity in the tissue, normal stem cells are endowed with increased xenobiotic resistance, in part, due to the expression of efflux pumps (Zhou et al, 2001; Challen and Little, 2006; Kenyon and Gerson, 2007; DeNicola et al, 2011; Nakasone et al, 2012; Rosenzweig, 2012), more efficient DNA repair (Kenyon and Gerson, 2007) and they also orchestrate tissue changes during wound healing. Thus, the stem-like state is deeply linked to resilience and stress response – a relationship that appears to hold for their neoplastic counterpart, the cancer stem cells (CSC) (Dean et al, 2005; Donnenberg and Donnenberg, 2005; Medema, 2013). A fundamental biological fact is then that the axis of cell phenotype with respect to drug tolerance is aligned with that of cellular stemness.

Here, we will rely on the example of a two-state dichotomy that can often be observed by measuring expression status of a single protein marker X (e.g., XHigh and XLow). This may appear simplistic, but serves well for the conceptualisation of cell state dynamics (Figure 1). Phenotype switching between the cancer stem-cell state, here XHigh (where X can be one of the stem-cell markers, such as CD117, MDR1, CD44, ALDH1; Hanahan and Weinberg, 2011; Medema, 2013), and the more differentiated, here XLow, state epitomises non-genetic plasticity of cancer cells. The underlying principles are the same as those that govern cell phenotype changes in development and tissue homeostasis: the acquisition of a stable and distinct cell phenotype governed by a coordinated gene expression pattern that can be passed along to daughter cells upon division. Although trivial, one often forgets that multiple distinct phenotypic states can be produced by the very same genome, which is the foundation for non-genetic switching between functional states during tumour progression.

Figure 1
figure 1

State transition and non-genetic plasticity. The cellular transition from the state with XHigh (red) to XLow (blue) and vice versa can be thought of as a state transition between the two subattractors on the epigenetic landscape (see Box 1). The reversibility of such switching and the clonality of the populations in which both states coexist indicate that such cell phenotype changes are not caused by mutations. As in multistable systems, the transition is noise driven but modulated by external conditions, including the presence of the drug. Note that by monitoring one dimension of the gene expression state space, for example, X=MDR1 expression, we are able to observe cell transition only as a projection (horizontal axis) and do not know what happens in orthogonal (non-observable) dimensions.

PowerPoint slide

The existence of multiple, almost discretely distinct, stable states, each characterised by its own stable gene expression profiles that is produced by the very same genome, can be explained by the concept of multistability. Herein, the stable states are the so-called attractor states – the potential wells in an energy landscape (Figure 1 and Box 1).

With this formal conceptual framework, non-genetic switching between phenotypic states can be represented as switching between distinct attractor states – a conceptualisation well familiar to physical scientists. The key properties that distinguish such non-genetic state switching from a mutation-driven phenotype alteration are that the former (i) occurs much more frequently than mutations, (ii) is in principle reversible and (iii) is linked to regulatory dynamics of the GRN, and thus has features of non-randomness. The last property is a central distinctive feature of non-genetic phenotype transitions: unlike undirected (blinded) mutations of DNA, non-genetic switching between attractor states is likely to turn on existing (evolved) cellular programs and to be influenced by external cues. This framework could explain why tumour cells can so readily acquire complex developmental programs, such as a stem-like state or any of the ‘hallmarks of cancer’ (Hoek and Goding, 2010; Hanahan and Weinberg, 2011). Their gradual Darwinian evolution de novo is not required because these states have equivalents in normal development, homeostasis and regeneration.

For such mutation-less cell phenotype switching in tumour progression, we can distinguish between two scenarios: (1) spontaneous (random) phenotype switching between a stem-like, resistant and a more differentiated (less aggressive) state and (2) externally induced (directed) switching to the stem-like state. The switching between the more mature and the stem-like states has been intensely studied in melanoma (Sharma et al, 2010; Hoek and Goding, 2010), where the bidirectional switching, that is, the reversibility of differentiation of CSC into multiple cell types, long thought to be an one-way process, supports the emerging notion that CSCs do not represent a static mutant clone but rather a functional state that can be induced in some cells under some conditions (Gupta et al, 2009a; Dirks, 2010; Gerlinger et al, 2012).

Spontaneous state switching

The first scenario, spontaneous switching between attractors, is evident in cell population measurements of a marker X in (clonal) tumour cell cultures by flow cytometry. Such measurements can reveal the coexistence of the two states, occupied by two fractions of a clonal population, through the appearance of a bimodal distribution of the flow cytometry histogram (two ‘peaks’) (Figure 1). The presence of distinct peaks suggests (but does not prove) multistability. Such non-genetic heterogeneity must be distinguished from genetic heterogeneity caused by genomic instability (Gerlinger et al, 2012; Pisco et al, 2013).

Using fluorescent-activated cell sorting to isolate cells from one peak followed by reculturing often – but not always – shows that one sub-population (e.g., XHigh) can repopulate the other (e.g., XLow) (Chang et al, 2006; Sharma et al, 2010; Pisco et al, 2013), demonstrating spontaneous, non-genetic transitions across a sufficiently low ‘energy’ barrier. Therefore, individual cells switch in a stochastic manner from one (attractor) state (XHigh) to the other (XLow), revealing characteristic rates for transitions in either direction. The driving force and source of randomness for such state switching is gene expression noise (=fluctuations of protein levels due to the random nature of molecular reactions within small volumes in the cell) (Kaern et al, 2005; Raj and van Oudenaarden, 2008; Zhou et al, 2014). Transitions between the XHigh and XLow state, although bidirectional, are often asymmetric – reflecting the relative propensity of a given cancer cell type to differentiate into a more mature state, or of the latter to dedifferentiate back into the stem-like state. ‘Dedifferentiation’ results in the continuous production of CSCs by the more differentiated cells and has been observed in various cancers posing a challenge to therapy aimed at targeting CSC (Brock et al, 2009; Gupta et al, 2009b).

Cell population dynamics with spontaneous conversions between resistant and sensitive states can be subjected to Darwinian selection in the same manner as phenotype variability caused by mutations (Figure 2A): the stochastic but non-genetic (mutation-independent) transitions between attractor states act in the same way as mutations in Darwinian evolution in producing a phenotype variability as substrate for selection – albeit with much higher frequency and repeatedly generating a particular complex phenotype (as it is a latently ‘preprogrammed’ attractor). Thus, in the presence of cytotoxic drugs the cells in the more stem-like, more drug-tolerant state in a population will have a proliferation advantage and can be selected for if they endure over several cell generations in that stem-like state (Brock et al, 2009). Such ‘non-genetic selection’ can provide temporary resistance and allow the sub-population of these cells to expand. As these non-genetically fitter variants are present in the multimodal population due to a dynamic equilibrium, they exist by default and at much higher frequency than genetic mutants (accounting for 1% or more in a clonal population). Thus, they can jump start classical (mutation-based) somatic evolution because they guarantee that there is always a small population of cells that independently of mutations can survive the treatment. As these cells expand, they can accumulate genetic mutations that are then selected for. This scheme of preselection due to non-genetic variability followed by later genetic fixation is in essence Waddington’s genetic assimilation (Waddington, 1942; Zhou et al, 2001; Kim et al, 2002). An interesting twist that may further accelerate this process of somatic evolution is that since many anticancer drugs are genotoxic, mutation rate is further increased among these initial stem-like survivors.

Figure 2
figure 2

Two schemes for the evolution of drug-resistant state. (A) In the first scenario, according to the Darwinian selection, the drug-resistant phenotype results from selection of resistant clones, which were produced by genetic mutations. The genetic mutation alters the genome, causing a rewiring of the gene regulatory network (GRN), which in turn changes the epigenetic landscape, creating a new phenotype (attractor state, S(XHigh)). (B) In the second scenario, the drug causes an attractor transition from the sensitive (blue) to the resistant (red) state in a multistable system (landscape with two potential wells). The GRN and the landscape remain unchanged.

PowerPoint slide

Therapy-induced state switching: therapy as doubled-edged sword

In the second scenario, an external stimulation that affects signalling pathways can change the gene expression state, thus triggering a state transition between attractor states in a multistable system. In normal tissue development, attractor transitions correspond to cell differentiation and are induced by tightly regulated developmental signals that trigger transitions to lower attractor states. Of particular interest for tumour progression is the stress inflicted onto the (non-killed) cancer cells by treatment and the ensuing state transition. Given the commonalities between stress-response and stemness programmes (Zhou et al, 2001; Kim et al, 2002; Blanpain et al, 2011) and the inherent immaturity of cancer cells explained above, it is possible that cell stress imparted by cytotoxic agents actively induces a state transition specifically into a stem-like state, which actually is a stress-response state.

Such cell response is not unexpected if we consider the theoretical epigenetic landscape (Figure 2B). In this framework, there are many possible (unused) cell states in the neighbourhood of a cancer attractor, which due to the tight relationship between ontogenesis and phylogenesis likely represent evolutionary ancient, stem-like immature states. Therefore, entry into these unoccupied attractors does not require very specific signals but can be triggered by many unrelated, nonspecific cell perturbations. As normal stem cells are specialised to respond to tissue damage by providing a protective response (Blanpain et al, 2011; Huang, 2013), these nearby abnormal immature states occupied by cancer states may express similar properties. Thus, a cytotoxic, non-lethal cell stress applied to an already pathologically immature cell in a cancer attractor has a high probability to push the cell further into more immature (‘dedifferentiated’) regions of the epigenetic landscape (Huang, 2013).

A therapy-induced, directed somatic evolution towards an advantageous phenotype would correspond to a Lamarckian scheme: a new, better ‘adapted’, inheritable state is induced by an environmental input (Mayr, 1972). In view of the conceptual framework of the epigenetic landscape, this event would not be uncommon, but in contrary almost a default reaction of cancer cells to massive, near-lethal perturbations. In fact, the list of cases in which chemotherapy appears to induce either a stem-like or/and drug-resistant state is quite old and growing (Bates et al, 1989; Chin et al, 1990; Ciocca et al, 1992; Chaudhary and Roninson, 1993; Abolhoda et al, 1999; Stein, 2001; Iyer and Lehnert, 2002; Notarbartolo et al, 2002; Bortul et al, 2003; Gréen et al, 2003; Baker et al, 2005; Camphausen et al, 2005; Andarawewa et al, 2007; Xia et al, 2008; Li et al, 2009; Pajic et al, 2009; Taube et al, 2010; Lee et al, 2010; Balachandran et al, 2011; Chang et al, 2011; Ghisolfi et al, 2012; Lagadec et al, 2012; Razandi et al, 2012; Abubaker et al, 2013; Pisco et al, 2013, 2014).

The induction of multidrug resistance (MDR) proteins, the ABC family membrane pumps that mediate the active efflux of drugs and also are markers of stem cells (‘dye exclusion’), is the best documented example of an induced adaptive trait (Bates et al, 1989; Chin et al, 1990; Chaudhary and Roninson, 1993; Abolhoda et al, 1999; Yamada et al, 2000; Stein, 2001; Notarbartolo et al, 2002; Gréen et al, 2003; Baker et al, 2005; Pajic et al, 2009; Kim et al, 2012; Stein et al, 2012; Pisco et al, 2013, 2014). Wnt signalling has a central role in induced stemness and MDR upregulation in various cancer cells (Yamada et al, 2000; Lee et al, 2010; Kim et al, 2012; Stein et al, 2012; Pisco et al, 2013). In irradiated leukaemia cells, resistance to radiotherapy was acquired without ‘clonal predisposition’ (Lee et al, 2010; Lagadec et al, 2012). Similarly, ionising radiation reprogrammed differentiated breast cancer cells into CSCs via Notch (Lagadec et al, 2012) and the CSC fraction increased following cisplatin therapy in head-and-neck cancer (Nör et al, 2014). Several reports suggest chemotherapy-induced epithelial–mesenchymal transition, which has been linked to transition into a stem-like state (Dallas et al, 2009; Polyak and Weinberg, 2009; Singh and Settleman, 2010), including in colon (Dallas et al, 2009; Xue et al, 2011), gastric (Pirozzi et al, 2011; Xue et al, 2011), liver cancer (Gupta et al, 2011; Pirozzi et al, 2011) and breast cancer (Donaldson et al, 1978; Wallner and Li, 1986; Gupta et al, 2011).

Treatment-induced stress response might indeed protect the cancer cell (Iliopoulos et al, 2011; Jackson et al, 2013; Halliday et al, 2014). This protective stress response was explicitly observed by exposing cancer cells to nonlethal heat-shock before chemotherapy and was associated with the upregulation of heat-shock proteins and ABC transporters (Ciocca et al, 1992).

Most studies do not distinguish between Darwinian and Lamarckian scheme in resistance development

Often in the literature terms such as ‘acquisition’, ‘enrichment’ or ‘induction’ of stemness and resistance (Pardoll, 2012; Heams et al, 2014; Sun et al, 2014), as well as expressions such as ‘activation of alternative pathways’ that circumvent the blockade by target-selective drug or ‘adaptive resistance’(Abolhoda et al, 1999; Huang et al, 2012; Pardoll, 2012; Tenbaum et al, 2012; Das Thakur et al, 2013; Sun et al, 2014), are used in a hand-waving manner without specifying the class of mechanisms behind the development of therapy resistance. Three types of scenarios are possible: tumour progression can be driven by (i) selection of genetic mutants (the classical Darwinian ‘adaptation’); (ii) selection of cells in non-genetic, pre-existing states occupied by chance by a sub-population of cells (see Section ‘Spontaneous state switching’); (iii) a true induction of cell state transition by the therapeutic agent in individual surviving cells (see Section ‘Therapy-induced state switching: therapy as doubled-edged sword’). The former two scenarios are cell population-level processes, involving selection of inherently fitter cells. Older analyses, based on the upregulation of resistance/stemness genes at the whole-cell population level (e.g., western blot) and lacking detailed kinetics, obscure the difference between population (selection) and cell-individual (induction) phenomena (Abolhoda et al, 1999; Huang et al, 2012; Tenbaum et al, 2012; Das Thakur et al, 2013; Sun et al, 2014).

The short timing for the appearance of stem-like or MDR+ cells after treatment (hours to few days) and the regularity with which any anticancer treatment triggers a complex stress-response phenotype suggests that the non-genetic process is often behind the observed change (scenario ii or iii). However, the question whether this apparent ‘enrichment’ of stem-like cells is due to selection of cells that occupied the stem-like state or induction (conversion of an individual cell by treatment) is usually not explicitly raised. Experimental demonstration of the latter requires longitudinal monitoring of phenotype change in individual cells (Nowell, 1976; Pisco et al, 2013). The essential difference between a Lamarckian, non-genetic, induced cell response and a classical Darwinian selection (of mutants or of attractor states) is that the induced response is mounted by surviving cells that were not a priori more resistant. This conceptual difference has therapeutic consequences for preventing resistance.

Outlook: therapeutic implications beyond killing

The mechanisms of resistance described here rest on the fact that non-mutant, non-resistant, treatment-naive cancer cells can survive treatment. However, why do they survive in the first place? First, tumour therapy has to balance between killing tumour cells and avoiding host toxicity, thus limiting the dose used. Second, penetration of tumour tissue by drug molecules is not uniform, creating ample opportunity for many tumour cells survive even if they are not a priori resistant. And third, even within a clonal population, the cell-intrinsic sensitivity to drug fluctuates, as do most cellular traits, resulting in a broad statistical distribution (heterogeneity) of drug sensitivity at any time point (Huang, 2009). Therefore, therapy will almost inevitably not kill all tumour cells, independently of pre-existing genetic mutants and even when considering cell-cycle phase. However, the surviving cancer cells are not innocent bystanders: having encountered a sublethal perturbation, which is a massive blow, some of them will be pushed into nearby (unused) attractors, the dead-end sidevalleys of the epigenetic landscape, which likely, as discussed above, encode stem-like, stress-response phenotypes. This explains why non-killed but near-lethally stressed cells are not necessarily weaker but instead might become more resilient (McDunn, 2005; Huang, 2014). Hence, rather than describing tumour progression with the principle of Darwin’s ‘Survival of the fittest’, it may be more suited to think of Nietzsche’s concept that ‘What does not kill me strengthens me’.

Although progression is an intrinsic trait of malignant tumours and debulking of the primary tumour is clinically necessary to alleviate disease burden and lessen the probability of metastatic dissemination, we also need to consider the progression-stimulating effect of therapy in the non-killed cells. Therefore, tumouricidal treatment is inherently a double-edged sword. The question is: how can we maximise the desired effects of therapy while minimising the strengthening of the (unavoidable) surviving cells? We can consider two types of approaches (Figure 3): the proactive solution would be to combine cytotoxic therapy with drugs that block the Lamarckian induction of a stress response. For instance, Wnt signalling has emerged as a central pathway in stress-induced stemness and resistance in tumour and stromal cells (Corrêa et al, 2012; He et al, 2014). Inhibitors of the Wnt-β–catenin axis, currently tested as differentiating agents (Liu et al, 2013), may be used before chemo- or targeted therapy to prevent stress response in surviving cells. The second approach is more passive and seeks ‘gentle containment’: avoid triggering the stress response associated with aggressive attempts to kill. This can be achieved by a departure from the traditional maxim of ‘maximal-tolerated dose’ through a variety of scheduling (Liao et al, 2012), such as metronomic therapy (lower dose, more frequent) (Scharovsky et al, 2009) or drug holidays to allow for reversion to the sensitive state (Sun et al, 2014). Non-cytotoxic differentiation therapy (Sell, 2004) may also belong to this category.

Figure 3
figure 3

Different anticancer therapy schemes can minimise the strengthening of surviving cells. If rather than treating the tumour using conventional, aggressive therapy that may stimulate the stressed, stem-like (=more aggressive) state, cancer is treated using alternative, gentler modification of cell growth we might be able to control the ratio between the stressed cells and the naive, non-stressed cancer cells. The idea of treating cancer as a chronic disease is to prevent the cells from transiting into the aggressive state in response to cytotoxic stress. This can be achieved by blocking transition into that state (right blue arrow) or by containing cells without cytotoxity, for example, by promoting differentiation (green arrow).

PowerPoint slide

The immense non-genetic phenotypic heterogeneity of tumour cells, which entails, by statistical necessity, incompleteness of any tumouricidal treatment, and the phenotype plasticity, which allows perturbed surviving cells to aberrantly access hidden, pre-existing, pathological stem-like states, offers a new set of principles for future exploration of therapeutic options that will reach beyond the current emphasis on killing.