Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Evaluating intrinsic and non-intrinsic cancer risk factors

## Abstract

Discriminating the contribution of unmodifiable random intrinsic DNA replication errors (‘bad luck’) to cancer development from those of other factors is critical for understanding cancer in humans and for directing public resources aimed at reducing the burden of cancer. Here, we review and highlight the evidence that demonstrates cancer causation is multifactorial, and provide several important examples where modification of risk factors has achieved cancer prevention. Furthermore, we stress the need and opportunities to advance understanding of cancer aetiology through integration of interaction effects between risk factors when estimating the contribution of individual and joint factors to cancer burden in a population. We posit that non-intrinsic factors drive most cancer risk, and stress the need for cancer prevention.

## Introduction

The past few decades have seen significant progress in our understanding of cancer aetiology as well as advances in early detection, treatment, and prevention1,2,3, which have led to declining cancer mortality in the industrialized world. Despite this progress, certain cancers continue to increase in different parts of the world due, in part, to longer lifespans and changing patterns of cancer risk factors4. This includes the first evidence of impacts of the obesity epidemic on cancers5. Furthermore, significant gaps in age-adjusted cancer incidence rates for nearly all cancers across different regions of the world suggest that much of cancer risk is due to causes other than unmodifiable intrinsic DNA replication errors common to all humans which we define as the ‘intrinsic risk’6.

Extensive efforts over several decades have been directed at and continue to be expended on identifying risk factors for cancer. For several cancers, aetiology has been convincingly linked to specific environmental factors resulting in effective cancer prevention (https://www.cancer.gov/about-cancer/causes-prevention/risk), e.g., smoking and lung cancer, sun exposure and skin cancer, human papillomavirus (HPV) and cervical cancer, Helicobacter pylori (H. pylori) and gastric cancer, and viral hepatitis and hepatocellular cancer (HCC).

While certain external exposures have been established in cancer causation, the contribution of random errors in DNA replication has been more difficult to estimate. Two recent modelling studies suggested that over 60% of tissue cancer burden may be due to factors that are intrinsic to human cell biology and thus, not modifiable7,8. This conclusion has been highly contested9,10,11,12,13,14,15. Nevertheless, these provocative findings gained media attention as evidence dampening healthy behaviours for cancer risk reduction and renewed old debates on the role of modifiable factors in cancer causation among scientists. They also raised questions about the evidence that scientists use and assumptions that they make to mathematically estimate the contribution of different factors to the burden of cancer in the population setting.

Here, we briefly introduce and refine the definitions of intrinsic and non-intrinsic risk factors that have been employed in these recent works and how evidence for their effects on cancer burden in human has been obtained considering the type of study (observational or experimental). This includes a discussion of the assumptions about cancer aetiology that have been used to estimate the contribution of various factors to the burden of cancer in the human population. Through clarifying the definitions, and analyzing the cumulated models, data, and findings from historical and modern literature, we develop our position that non-intrinsic factors are the major contributors to cancer risk and thus open the door for significant prevention.

## Cancer risk factors and cancer risks

The pursuit of cancer risk factors has been instrumental in the development of both data-driven analytical approaches and theory-driven models for carcinogenesis. The former was initiated by landmark epidemiological studies of lung cancer and tobacco smoking in the 1950’s. The latter began with the modelling of carcinogenicity in animals early in the 20th century16,17,18 and subsequently in humans19,20,21,22,23,24,25,26, culminating in two recent contrasting models that we highlight below8,15.

To facilitate the discussion and relate to recent published model-based estimates, separate categories for cancer risk factors are defined below based on their biologic nature, modifiability and use in the literature (Fig. 1):

1. (1)

Unmodifiable intrinsic risk refers to unavoidable spontaneous mutations that arise as a result of random errors in DNA replication related as a characteristic of being human. These unavoidable DNA replication processing errors occur in different organisms at different rates as a species specific, random replication error rate.

2. (2)

Non-intrinsic risk refers to factors that include: (2a) Modifiable exogenous/external factors (e.g., carcinogens, viruses, xenobiotic) and lifestyle factors (e.g., smoking, hormone therapy, nutrient intake, physical activity) that are exogenous to the host; and (2b) Endogenous factors that are partially modifiable and related to the characteristics of an individual (e.g., immune, metabolism, DNA damage response, hormone levels) and influence key aspects of cell growth control and genome integrity.

We have selected these definitions to assure better dissection of the contribution of ‘intrinsic errors’ as an unmodifiable risk factor for cancer modelled in recent studies27,28. Particularly, this definition of intrinsic risk implies three corollaries of high relevance to additional analyses: (1) The contribution of this unmodifiable risk to cancer incidence should be constant across populations since all humans have the same intrinsic mutation rates; (2) This contribution should also be consistent over time since the underlying mechanism is a property of the human species; and (3) the contribution of intrinsic DNA replication errors to mutational signatures should be constant across tissues and organs.

In addition, it should be noted that cancer displays a complex etiopathogenesis and that these various factors interact in tumour evolution (e.g., gene–gene or gene–environment interactions). For modelling and discussion purposes, cancer risk types have been discretized as the intrinsic risk and the non-intrinsic risk, which refers to all risk minus the intrinsic risk, or likewise, the sum of risks due to non-intrinsic factors, plus their interactions, plus the interactions between intrinsic and non-intrinsic factors. Accordingly, this definition of intrinsic risk accounts for those cancers where intrinsic error is sufficient for tumorigenesis. Thus, in deriving the intrinsic risk (so-called ‘bad luck’), one must subtract risk not only due to individual non-intrinsic factors, but also to their interactions with intrinsic factors. This group would include cases where the intrinsic factor (i.e., basal mutation rate) contributes and may be necessary but not sufficient. As an example, a specific lung cancer may arise from three driver mutations, one of which arises from an intrinsic error and two from mutagens in tobacco smoke. In this case, the intrinsic error is necessary but not sufficient for detectable invasive carcinoma to develop. From an intervention point of view, this is critical as preventing the modifiable component (i.e., smoking exposure) would still be effective in preventing cancer in these settings.

## Intrinsic risk factors

As defined above, intrinsic risk arises from the basal mutation rate operating in all dividing cells, in the absence of any non-intrinsic factors.

We have chosen to define unmodifiable intrinsic risk in this narrow way as it corresponds to a biologically intrinsic factor that causes DNA mutations in humans that is not modifiable. Thus, all humans are ‘stuck’ with this risk, unlike other sources of non-intrinsic factors that may vary between individuals.

Passage or fixation of randomly acquired mutations (e.g., single nucleotide errors, deletions and insertions) in a tissue is dependent on the survival and division of the mutated cell and its progeny. These mutations may yield “driver mutations” required for cancer development, in distinction from “passenger mutations” that do not impact cancer formation but are found commonly in cancers. A requirement for more than one driver mutation to initiate cancer increases the barrier to develop cancer with intrinsic mechanisms alone.

In 2015, Tomasetti and Vogelstein asked why different tissues exhibit dramatically disparate cancer rates. Using estimates of the number and dynamics of tissue-specific stem cells for 31 tissue types, they observed a strong correlation between estimated stem cell divisions and lifetime cancer risk at log10 scale. From their modelling work, they suggested that a significant and underappreciated component of cancer risk, as much as 64%, may be due to unmodifiable random errors in DNA synthesis or bad luck7. This hypothesis sparked debates9,13 on the nature of this correlation and its implications for causality of stem cells in cancer pathogenesis. In our work, we found that the correlation between stem cells and cancer risk does not distinguish the operation of intrinsic from non-intrinsic factors and vice versa, since many non-intrinsic factors (e.g., smoking) induce their own mutations, and the likelihood of induction and propagation of these mutations also correlates with tissue cell divisions15. Thus, tissues with much larger cell divisions are susceptible to higher intrinsic mutations as well as to higher mutations induced by external factors. This conclusion was supported in recent analysis by Nowak et al.29. Furthering the complexity of cancer risk factors, in one study, Klutstein et al.30 found a stronger correlation between tissue levels of DNA methylation and cancer burden. This correlation persisted even after correcting for the contribution of stem cells whereas the reverse did not hold. These authors concluded epigenetic changes, which can be influenced by exogenous and endogenous factors, and not only mutations contribute to cancer risk with a similar dependence on the number of cell divisions in a tissue. Thus, while these correlative studies support total tissue cell division in the observed variation between tissue-specific cancer risks, this association is correlative and only explains a part of that risk.

### Mutational signatures in human cancer reveal past events

The direct estimation of intrinsic error to cancer risk is challenged by the technical inability to truly separate intrinsic errors from the effects of non-intrinsic factors in humans. Evidence for intrinsic risk in cancer has historically relied on modelling studies of cancer development based on experimental/clinical data. The recent advent and rapid development of next-generation DNA sequencing technology has revolutionized the ability to survey genome-wide somatic mutations in cancer. Analyses of these data are providing new insight into the role of intrinsic versus non-intrinsic cancer risk factors, and in some cases, linking specific signatures to specific risk factors. Here we discuss recent work from large-scale tumour sequencing studies applied to estimating the magnitude of intrinsic risk and its contribution to human cancer.

Using genome sequence data, more than 30 distinct mutational signatures were recently uncovered in different cancers31. Of these, 10 can be associated, at least partially, to known mutagens. Interestingly, two signature mutations demonstrated strong positive correlations with age in most cancer types, indicating that they are acquired at a relative constant rate over the lifetime of cancer patients, regardless of tissue of origin. This pattern is most consistent with the action of an intrinsic error process, since errors arising with DNA replication during cell division would accumulate in a monotonic fashion over time. In contrast, the other signature mutations lack a consistent correlation with age, suggesting they may be acquired at different rates in life due to different influences31. Since all known carcinogen-specific signatures demonstrate an age-uncorrelated and tumour-specific pattern, it is reasonable to assume those with unknown causes are also a consequence of external exposures to DNA damaging agents.

Based on this segregation of signatures, the proportion of cancers driven by intrinsic risk can be calculated, as shown in Box 1, to account for no more than 10–30% of all cancer incidence15. Notably, a number of cancers, such as lung and skin cancer, with substantial environmental risk as determined from epidemiologic studies, also contain large percentages of non-intrinsic risk estimated from the mutational signature data (Extended Data Table 3 in ref. 15), supporting the validity of this approach.

### Modelling of contribution of intrinsic mutations

Several studies have attempted to estimate the number of driver mutations required for the development of an invasive carcinoma. The emerging consensus is that at least three hits are necessary for solid tumours and fewer for haematologic malignancies. The historical development of this work is shown in Box 2.

Replication error rate is a critical parameter in modelling intrinsic cancer risk in human cells, and the unrepaired error rate has been estimated at ~5 × 10−10 per nucleotide site per cell division32. This corresponds approximately to three new mutations per genome per cell division. Replication error rates between different cell types in an organism are roughly constant given the fundamental nature of the replication process. For proto-oncogenes, gain-of-function mutations typically occur at specific sites that increase action of the target protein (e.g., JAK2V617F or KRASG12V). In contrast, loss-of-function mutations can occur at multiple sites whereby numerous mutational events promote gene loss or dysfunction (e.g., P53 mutations). Thus, the probability of mutating at least one cancer relevant gene is larger than the somatic mutation rate of one nucleotide. For example, if 20 mutable sites correspond to one cancer relevant hit, the probability of that hit would be 1 × 10−8 per cell division.

Based on these and related data, we developed a discrete stochastic multistage cancer stem cell model, with the model parameters (number of stem cells, intrinsic mutation rate, and the generations of symmetric versus asymmetric divisions) estimated from the most recent literature15. Once the intrinsic risk due to replication errors was computed, the difference between the model estimation and the observed epidemiological cancer incidence provided an estimate of the non-intrinsic risk (residual risk). These results suggested that cancer risk due to intrinsic factors alone is very low for cancers requiring more than two hits, consistent with other independent analyses including observational studies and a mutational signature study. Based on these data, we concluded that intrinsic risk explains at most 10–30% of all cancers15.

More recently, Tomasetti et al.8 published a new estimate of the proportion of cancer driver gene mutations due to intrinsic factors. For 32 cancers examined, they concluded that 66% of mutations were attributable to intrinsic causes. A major cornerstone of this recent work was the calculation of the intrinsic risk as the amount of risk that remains after subtracting effects of known environmental and hereditary factors. That is, the percentage of mutations due to intrinsic factors was computed as:

$$({\mathrm{Percent}}\,{\mathrm{due}}\,{\mathrm{to}})\,{\mathrm{Intrinsic}} = 1 - {\mathrm{known}}\,{\mathrm{environmental}} - {\mathrm{known}}\,{\mathrm{hereditary}}$$

However, this approach inflates the effect of intrinsic factors by assuming there are no other non-intrinsic cancer-causing factors to be identified. Inclusion of a yet to be identified non-intrinsic factor can significantly drive down the contribution of the intrinsic factors as illustrated in Fig. 2. For lung cancer, while Tomasetti et al. estimated the mutation fraction due to intrinsic factors at 33.4%; based on our mutational signature analysis, we identified a much smaller estimate (3.6%) of the intrinsic mutation fraction or a 9-fold difference15. This discrepancy could be due to the exclusion by Tomasetti et al. of known exogenous risk factors for lung cancer including radon, a risk prevalent to the entire population and second only to cigarette smoking33, as well as second hand smoking and air pollution34,35 and yet to be determined environmental and hereditary factors.

## Non-intrinsic risk factors

Mechanisms of non-intrinsic risk factors thought to drive cancers are multifaceted. Some belong to the family of chemicals that induce new mutations (mutagens) while others, such as viruses, induce cancers through activating or repressing key cancer modulating genes (activating oncogenes or inhibiting tumour suppressor genes) in addition to inducing mutations. At least in the case of mutagens, these operate on cells that can divide and persist so as to facilitate tumour development. In defining such ‘at risk’ cell populations, biologic studies have focused on stem cells, progenitor cells, and other dividing cells36. In the definition proposed here ‘non-intrinsic factors’ refer to risk factors other than intrinsic replication error, and includes not only exogenous factors (e.g., tobacco, HPV, ultraviolet (UV) radiation, and drugs) but also endogenous factors, such as inflammation, hormones and growth factors, metabolic effects, reactive oxygen species, immune responses, etc. The evidence for non-intrinsic risk factors is mainly derived from studies in cancer epidemiology and cancer biology.

## Exogenous risk factors

Several landmark epidemiological and biological studies have identified exogeneous cancer risk factors such as tobacco smoke for lung cancer, UV radiation for skin cancer, and viruses for cervical and liver cancer. More recently, several groups have reported rising colorectal cancer incidence and mortality rates in Asia approaching those in western countries. Affluent Eastern Asian countries such as South Korea, Singapore, and Japan have experienced a two-fold to four-fold increase in incidence in recent decades37. In the USA, a recent study confirmed prior estimates that adults born in 1990 could experience twice the risk of colon cancer and four times the risk of rectal cancer at the same age had they been born in 1950. The reasons for the rise in incidence and death rates remain unclear38 but cannot be attributed to change in factors intrinsic to DNA replication machinery in humans and thus, strongly indicate a role for non-intrinsic factors.

### Geographic variation and immigrant studies

Evidence for causes of cancer in human populations has historically been guided by information on cancer incidence and prevalence rates in different populations. According to GLOBOCAN39, incidence rates of different cancers show distinct geographic patterns where estimates in high-incidence regions can be as much as one or two orders of magnitude higher than low-incidence areas. Consistent with this pattern, we recently analysed the World Cancer Registry data6 and found that the age-adjusted incidence rates of most cancers show distinct geographic patterns where estimates in high-incidence regions can be as much as ten folds or more than low-incidence areas6. Some examples, obtained by taking the ratio of the incidence rates at the 90th percentile and the 10th percentile, include: melanoma (40 fold), colorectal cancer (three fold), lung adenoma (seven fold), breast cancer (three fold), and prostate cancer (nine fold). The difference in world cancer incidence rates and wide disparity are shown in Fig. 3 (originally published in ref.6). As shown in this figure, the fold changes will be more dramatic if the ratio is between the regional maximum and minimum.

Favouring environmental risks, seminal work demonstrated that the offspring of immigrants to high incidence regions acquire the incidence patterns of the host country in one or two generations40. This adoption of the host country incidence pattern is consistent with changes in factors present in each geographic region. Indeed, higher incidences of lifestyle-related cancers (e.g., breast, prostate, colorectal and lung cancers) have been observed in the early industrialized countries. In contrast, higher incidences of infection-related cancers (e.g., cervical, stomach and liver cancers) have been observed in less developed regions and in areas with endemic infectious agents.

### Retrospective case-control studies

Numerous hypotheses about the role of environmental exposures and cancer have been generated using retrospective case-control studies, in which the association of exposures (e.g., smoking) with cancer can be quantified. Suspecting tarmac or motor car fumes as the major causes for the increased incidence in lung cancer, Doll and Hill41 undertook a historical case-control study in 1950. Comparing lung cancer patients with matched controls, they discovered tobacco smoking was strongly overrepresented in the cases. Their findings were subsequently confirmed in a prospective cohort of more than 30,000 British physicians42. Over the past several decades, numerous groups have employed the case-control design and odds ratio (OR), under certain assumptions, to estimate the preventable proportion of cancer risk that is attributable to a given exposure: For melanoma, risk ascribed to sun exposure is estimated at 65–86%, and for non-melanoma basal and squamous skin cancers, ~90% is attributable to UV43. Here, the attributable risk refers to cancer risk that is theoretically preventable. Additionally, >75% of oesophageal cancer, and >75% of head and neck cancer are attributable to tobacco and alcohol44,45 and for the latter, a large fraction of residual risk now suspected to be attributable to HPV46. Using this approach, several pathogens (HPV, hepatitis B virus (HBV), hepatitis C virus (HCV) and H. pylori) have been identified as cancer causing explaining a majority of cancer of the cervix, liver, stomach and others47,48,49,50.

### Prospective studies

Supporting links between risk factors and cancer identified in case-control studies, numerous prospective studies have been conducted and proven highly informative. For example, prospective studies on lung42, oesophagus and gastric51, bladder52 and other cancers53,54, have confirmed the association of smoking in human carcinogenesis, particularly in the aerodigestive tract. The robustness of the associations has yielded reliable estimates of cancer risk among smokers. Using these estimates and knowledge of smoking rates, the prevalence of smoking-associated cancers has been approximated to be as much as 25–30% of all human cancers55.

In the case of cancers associated with infectious agents56, prospective studies of H. pylori and gastric cancer57, HPV and cervical cancer58 and recently head and neck cancer59, as well as study of HBV and HCV in HCC60,61 have yielded evidence linking these agents to tissue-specific cancers. It is currently estimated that infectious agents contribute upwards of 15–20% of all human cancers56.

Other physical factors such as ionizing62 or UV radiation63 contribute causally to cancer incidence, and their linkage to cancer has led to effective preventive measures. High-dose mantle field radiation for the treatment of Hodgkin’s lymphoma was demonstrated unequivocally through prospective studies to increase breast cancer especially those exposed at younger ages64. Other sources of ionizing (e.g., environmental radon) and non-ionizing (e.g., UV) radiations have also been linked to lung cancer65, leukaemias and lymphomas66, melanoma and other skin cancers63. These preventable exposures have been estimated to contribute to ~20% of cancers67.

In addition to these defined exposures, more complex lifestyle and behaviour factors such as diet, physical activity, alcohol consumption and reproductive patterns have also been intensively studied in cancer risk using the prospective design. For example, physical activity and dietary patterns, particularly nutrient deficient and calorie-dense diets (i.e., high dietary fat, refined sugar, red and processed meats), have been positively associated with high-incidence cancers of modern society including colorectal68, breast69, prostate70 and lung cancer among non-smokers71. However, data from prospective studies on specific essential nutrients (i.e., folate, calcium, vitamin D, and others) on cancer risk have been equivocal. The European Prospective Investigation into Cancer and Nutrition (EPIC) study72 supports diet as an important or moderately important factor in risk of colorectal and breast but not prostate cancer.

In efforts to increase the sensitivity and reliability between individual dietary factors and cancer, epidemiologists have developed modern analytical methods73,74. Employing a meta-analysis of 53 retrospective epidemiologic studies comprising of 58,000 women, women who drank >45 g of alcohol per day were found to have a 1.5-fold higher risk of breast cancer than non-drinkers75. This finding was replicated in the Million Women Study in the UK76. Using such tools and data, diet has been estimated to contribute to 20–40% of all cancers77,78.

While epidemiological studies have a number of strengths, certain inherent weakness limit the reliability of findings when present in only one or a few studies. For geographic comparisons of cancer risks, information on routine medical records and death registries tend to be less accurate or complete in less developed countries and less impacted by asymptomatic, screen detected cancers. This impacts the accuracy and interpretability of the rates. These factors may inflate findings of difference between countries. On the other hand, other considerations may obscure effects of environmental factors. For example, if common exposures exist globally, which may happen increasingly with globalization, it will be harder to recognize their contribution to cancer risk. For retrospective (especially) and prospective study design, confounding effects and selection biases affect the accuracy of the risk and the estimation of the effect size. As such, while replication of findings across studies is among the more powerful criteria for establishing an association, gaining knowledge of the biological mechanisms linking an exposure to disease is a necessary component of the evidentiary process in establishing direct causal relationships.

Despite these inherent limitations, population studies have provided convincing evidence for a major contribution of exogenous factors in cancer risk.

### Mutagens and mutational signatures

Exogenous mutagens, such as UV irradiation, have long been recognized to induce specific mutation patterns in genomes79. However, it was not until recently that strong signatures were identified for tobacco80 and UV light81 in lung cancer and melanoma genomes, respectively. These also provided the proof of principle in discovering the effect of mutagens without knowing their origins. Particularly, capitalizing on many large consortia studies targeting sequencing of large numbers of genomes, such as The Cancer Genome Atlas (TCGA), several mutational signatures have now been identified and characterized with respect to a wide range of cancer types31,82. As discussed above (Box 1), this analysis suggests that non-intrinsic factors are dominant in imparting cancer risk. More importantly, given the rapid progress of sequencing technologies, new specific signatures are coming into light with new research that is assigning them to specific exposures. For example, aristolochic acid, common in east Asia and parts of Europe, has been shown to predispose to cancers of the renal pelvis, and is associated with a highly specific signature83.

## Endogenous risk factors

Certain cancer risk factors are endogenous to the individual and many have some genetic component. Individual levels of the sex steroid hormones and their role in breast cancer risk are among the best studied examples of an endogenous cancer risk factors84. As endogenous determinants of cancer risk, the steroid sex hormones vary over the life course and between individuals and are influenced by other exogenous factors (e.g., diet, therapeutic hormones, other drugs, physical activity levels) as well as other endogenous determinants such as genetic background. Importantly, endogenous sex steroid hormones and cell responses to hormones are proven targets for cancer prevention supporting the modifiability of endogenous risk factors.

More challenging is integrating information on the non-intrinsic effects of complex endogenous processes like ‘ageing’, inflammation, and obesity on cancer risk that are influenced by exogenous (environment) and hereditary (genetic) as an endogenous determinant. For example, obesity has a genetic basis but most often develops as a phenotype from interaction with exogenous factors (over consumption of food and sedentary behaviour) and is thus, highly modifiable. Obesity-associated changes in metabolism, hormones and inflammation are the suspect proximate biological culprits in cancer risk and they are modifiable (metformin, anti-inflammatory drugs, lipid lowering drugs, hormone therapies). Deregulated sex hormones for example are causally linked to the significant increase risk of uterine cancer in obese women85. And unlike other cancers, endometrial cancer incidence has continued to increase worldwide86 and in parallel with the obesity epidemic87. Notable is the reduction (modifiability) of endometrial cancer risk in the obese with weight loss surgery88, hysterectomy or use of progestins that oppose oestrogen effects on the endometrial lining87,88. In contrast to endometrial cancer, the mechanophysical effects of obesity (i.e., extrinsic gastric compression) in combination with endogenous bile acid reflux into the oesophagus and resultant metaplastic response of the epithelium, explain the rapid rise in oesophageal cancer in the obese—a cancer that was exceedingly rare until the obesity epidemic, and therefore it may be highly preventable.

Here we consider a few such complex endogenous factors and their modifiability. This includes considering ageing as a decline in endogenous anti-cancer processes.

### Inflammation and cancer

From the observations of Virchow and the carcinogenesis studies of Yamagiwa and Ichikawa, an ‘irritation theory’ of cancer was conceived where inflammation was subsequently identified as a major, and in some cancers e.g., asbestos-related mesothelioma and infection-related tumorigenesis, necessary component of malignancy89,90,91. Over the latter half of the 20th century, numerous cellular and molecular mechanism linking inflammation to malignant cell persistence and invasion have been characterized. These range from inflammation-induced reactive oxygen species that act in DNA damage and tumour initiation as well as inflammation-derived cytokine and chemokine effects on tumour growth, angiogenesis and tumour cell migration and invasion91,92. Most recently is the appreciation that immune cells play a significant role in suppressing anti-tumour immunity enabling tumour cell persistence and progression to life-threatening disease93.

Such effects, and the large body of evidence from animal and human studies, have led to the inclusion of inflammation as an enabling factor to carcinogenesis94, where inflammation is accepted to act across the continuum of tumorigenesis in a number of cancer types. The significance of inflammation in cancer development has been demonstrated in the chemoprevention field where randomized clinical trials and population studies of anti-inflammatory agents such as aspirin and other non-steroidal anti-inflammatory drugs have demonstrated the cancer prevention effects of suppressing pro-inflammatory mediators like prostaglandin E2 for several cancers95. Indeed, in 2015 the US Prevention Services Task Force recommended in favour of low dose aspirin use for the prevention of colorectal cancer in individuals at elevated risk that include patients with Hereditary Non-Polyposis Colorectal Cancer (HNPCC) Syndrome who carry germline mutations in mismatch repair genes96.

While it is clear that inflammation is critical for tumour development, incorporation of inflammation in mathematical models of tumour development is lacking. This stems in part from the lack of valid biomarkers of cancer associated inflammation. As with mutational signatures of carcinogens, and more recent efforts to assess ageing, integration of inflammatory signals with the genomic and sequence data may offer insights on the magnitude of cancer burden that can be attributed to inflammation—work that would greatly enhance efforts aimed at modifying inflammation as a prevention strategy for reducing cancer incidence in the population.

### Ageing

Ageing is considered among the most significant risk factors for cancer97. Yet, it is important to recognize that ageing can be defined chronologically or biologically. Chronological ageing contributes toward cancer by providing time for intrinsic risk as well as for exogenous and endogenous factors including mutagens to exert their effects. In contrast, biological ageing processes are more difficult to define or quantify since their full spectrum is not fully understood. The strong positive association of ageing with cancer is widely believed to reflect generalized declines in cellular and molecular system functions as an endogenous risk. Ageing encompasses at least nine recently proposed hallmarks98 for which there are numerous overlaps to the cancer hallmarks94: genomic instability, telomere attrition, epigenetic alterations, loss of proteostasis, deregulated nutrient sensing, mitochondrial dysfunction, cellular senescence, stem cell exhaustion, and altered intercellular communication.

In an effort to assess the impact of ageing, Podolskiy et al.99 reported that accumulation of age-associated CT and GA mutations at CpG sites (common replication errors) appears to accelerate in a monotonic fashion until later in life (50–80 years) when the rate declines. The group reports that that the acceleration in mutation burden is higher in men and initiates earlier in life in men. This parallels higher overall cancer incidence in men and an earlier age (about 10 years) at which cancer incidence begins to rise in men. The authors suggest that the strong representation of age-associated mutations in tumours reflect decreases in organismal fitness with ageing that differ by gender and tissue type.

Not all biological ageing is pro-tumorigenic. For example, mechanistic studies have suggested that cell senescence and stem cell exhaustion that accelerate with ageing may explain the observed decline in incidence of cancer at the extremes of human age100,101. It is worth mentioning that the rate and peak timing of age-related cancer risk varies from cancer to cancer and even within subgroups of specific cancers. This suggests that there is not always a positive relationship between age and cancer risk. For breast cancers, triple negative and HER2 positive breast cancers peak earlier in adulthood and exhibit a decline with advancing age where oestrogen receptor positive breast cancer incidence rises later and continues to increase with age102. These results may also reflect differences in susceptible cell populations in tissues that senescence at different life-stages; life cycle biology not currently considered in modern models of cancer risk.

To tease out ageing effects on cancer from non-intrinsic risk factors is challenging. The effect of ageing on cancer risk is commonly removed by testing the cancer risk in individuals with and without the exposure matched on age. Analysis of age-adjusted incidence rates provides the most common way to address this issue. In the geographic comparisons discussed previously, all the incidence rates are age-adjusted. In these cases, the chronologic age effect is accounted for.

### Heritable factors

Hereditary cancer can also operate through intrinsic and non-intrinsic mechanisms, by modulating the frequency of mutations per se (or their repair) but also by non-intrinsic mechanisms. For example, for cancers of the breast, prostate and colon, upwards of 30–40% of the attributable risk of these cancers is thought to be due to genetic causes. A landmark paper published in 2000 of 11 cancer types in 44,788 twin pairs concluded in favour of the environment as an “overwhelming contributor to the causation of cancer”, a statement that, like current discussions, prompted vigorous debates103. Importantly however, the work provided evidence of significant heritability in the common cancers of prostate (42%), colorectum (35%) and breast (27%). In a recent study of 80,309 monozygotic and 123,382 same-sex dizygotic twins of the Nordic Twin Study of Cancer (NorTwinCan)104, a 33% excess familial risk was observed for all cancer with confirmation that the magnitude of excess heritable risk was cancer specific with nearly 60% of prostate cancer estimated to be influenced by genetic factors. While much of the genetic basis of cancer risk remains to be identified, it is notable that a majority of the hereditary cancer mutations as well as the germ line variants identified involve DNA repair genes thought to act by increasing mutation rates often in tissue-specific fashion. However, it is also important to note that the increased risk may also derive from genetic mechanisms resulting in increased susceptibility to non-intrinsic factors and exposures to other DNA damaging processes.

### Tumour epigenetics

Our ability to understand and to model cancer aetiology and the impacts of exogenous and endogenous factors in risk has in recent years been extended to include consideration of effects of numerous factors on the epigenome. As with replication errors, epigenetic changes (e.g., DNA methylation) are passed on to daughter cells as non-sequence based chemical changes to the DNA. There is convincing evidence that epigenetic changes not only occur during tumour development, but they also play a direct causal role. This includes reproducible evidence that specific epigenetic silencing events, such as silencing of MLH-1 in a subset of human colon cancers, are essential alterations in human tumorigenesis105. Key epigenetics mechanisms in human carcinogenesis are beyond the scope of this perspective but have recently been reviewed for the major cancer types106. Noteworthy here for future models aimed at (1) identifying cancer risk factors and (2) for estimating contribution of factors (endogenous or exogenous) that impact the epigenome in cancer risk is consideration of the recent elegant work from the Baylin laboratory107. In their studies, they provide evidence that cigarette smoke (as a chronic exposure) induces time dependent alterations in the human bronchial epithelial cell epigenome that enhances their sensitivity to transform with a single KRAS mutation107. These data strongly suggest that a chronic exposure like smoking (or obesity, nutrient deprivation, ageing epigenetic effects on immune cell function, inflammation) may act by lowering the threshold of a cell to intrinsic errors for cancer development; an important interaction of the effects of the intrinsic and non-intrinsic risk factors not adequately considered in previous models. Similar effects of other exogenous and endogenous factors to the epigenome including inflammation, obesity and ageing may similarly alter the thresholds to transformation via effects on the epigenome108,109. Importantly, whether epigenetic changes represent reversible processes is currently debated and a subject of investigation. Studies in smokers, however, demonstrate smoking-specific changes to the epigenome persist for many years after smoking cessation, which may explain the long-lasting nature of elevated risk in former smokers.

### Other endogenous factors

In addition, less well defined ‘endogenous’ factors such as complex gene × gene interactions and gene × environment trait interactions are increasingly recognized as ‘cancer risks’. These include height and telomere length as examples along with emerging interest in the human microbiome as a modifier of cancer risk. Given progress toward understanding the significance of complex interactions in cancer, estimating their contribution to cancer burden will be important. While beyond the scope of this review, two recent lines of work on telomere genetics and cancer risk and human height and cancer risk are worth mentioning110.

The Telomeres Mendelian Randomization Collaboration Group110 recently demonstrated an association between genetic polymorphisms, telomere length and cancer. Longer telomere associated gene variants were associated with rare cancers and strikingly, with cancers in tissues with low stem cell divisions. As noted by the authors, the positive association with telomere length is consistent with evidence that telomere shortening with aging may act as an intrinsic protective mechanism against cancer by limiting cell division, explaining the lower rates of cancer in extreme age. While telomere length is a heritable trait, recent evidence from experimental models suggests that telomere length is malleable and influenced by numerous external stimuli111. Such findings provide new biological rationale for positive associations between environmental and psychosocial factors and telomere length observed in human studies that may impact cancer risk112.

Similarly, the repeated observation between adult height and cancer risk113 including breast114, prostate and colon115 is intriguing given the average height of humans continues to increase worldwide. Height is a heritable trait with estimates from twin studies suggesting that as much as 80% of height, especially in adolescence, can be attributed to parental height116. As such, the positive association between height and cancers has been hypothesized to reflect genetic traits that influence gestational, childhood and adolescent growth processes that also act on cancer progenitor cells. Indeed, 168 genetic variants associated with height and Mendelian randomization analysis were reported associated with genetically predicted height and risk of oestrogen receptor positive breast cancer114. Confounding the interpretation of these associations though is the strong influence of maternal nutrition as an equally strong non-genetic determinant of height117. Such important gene × environment interactions may partly explain geographical differences in risk of certain cancers like prostate cancer. For example, prostate cancer has been shown to be positively associated with height at 13 years of age118; a time when early life nutrition is most important in determining stature. This association was independent of adult height, suggesting nutrition in early life may be a modifying factor in prostate cancer risk. Like emerging evidence that obesity and other growth factor affect cancer risk via expansion on tissue stem cells119, it is plausible that nutrition and height genes interact with effects on stem cells affecting an intrinsic risk factor for cancer at the tissue level. Understanding such effects will be essential for modelling the contribution of each to cancer risk. Unfortunately, integration of early life exposures including nutritional status in human studies are challenging and make it difficult to tease out the effects of early life nutrition on adult cancer120. Studies in animals and in birth to death cohorts, where detailed early life exposures are collected, will be critical to advancing our understanding of such factors in risk of cancer in adults121.

## Conclusions and perspective

Multiple approaches have been applied over the past few decades to understand and determine cancer exposures and risks (Boxes 12). These have aided in mathematical modelling approaches aimed at estimating the contributions of non-intrinsic and intrinsic factors to cancer risk and cancer burden in the population. Overall, except for a few isolated studies7,8, for most cancers, estimates from various approaches attribute a sizable fraction of cancer risk (60–90%) to non-intrinsic risk (Fig. 4). These estimates of non-intrinsic risk are consistent across studies and support a substantial contribution of potentially modifiable or actionable risk to cancer77,78,122. Evolving theories in cancer molecular pathogenesis and technological innovations (for example the deeper understanding of the cancer epigenetics mechanisms) are resulting in finer estimates of the impact of intrinsic and non-intrinsic processes based on biological principles. The rapid advances in the molecular biology of human cancers, including emergent role of stem cells in cancer evolution and expansion of long lived clones with multiple mutations and epigenetic changes, favour a much more complex picture of cancer aetiology with heterogeneity among the cancers and within cancers of the same tissue type. These pave the way for development of new analytic approaches that better integrate new knowledge including considering contributions of individual factors as well as their joint effects on cancer burden.

Much discussion has been made recently of the role of ‘bad luck’ in cancer risk, where the contribution of intrinsic factors to cancer is considered unmodifiable bad luck7,8. Thus, someone who never smoked may still have a lifetime risk of lung cancer of 0.2% to 1%. However, it is important to realize that (1) Non-intrinsic factors themselves only impart an increase in risk in developing cancer, and therefore there is still an element of luck for non-intrinsic factors. For example, whether a smoker develops lung cancer or not, an event that has a lifetime probability of 10–25%, depends on other factors including their sex and degree of smoking. Smoking increases the probability by 10 to 25 folds. Thus, exposure to risk factors does not necessitate the development of cancer; nor does the absence of exposure (with a few exceptions e.g., HPV) provide a 100% guarantee to prevent cancer. (2) Non-intrinsic and intrinsic risk factors often do not act independently as we have highlighted, and the most likely scenario is that they cooperate to cause cancer. In this regard, cancer risk can still be modified even when intrinsic factors contribute to some of the risk. As such, for some cancers there is evidence that there is an ‘unmodifiable’ variation arising from the built-in randomness of intrinsic and non-intrinsic mechanisms and this is likely greater in tissues with a higher level of cell division.

As such, it is detrimental to prevention and cancer control measures if the risk, especially for clinically significant cancers, is over interpreted to be due solely to bad luck. This underestimates the potential impact of prevention and control measures aimed at reducing or delaying incidence and death due to cancer. Similarly, under-estimating the fraction of preventable cancer risk impedes progress to identify modifiable exposures for cancer prevention and control measures when possible (Fig. 2).

Indeed, the proportion of currently preventable cancers is mostly a subset of cancers with known non-intrinsic risk factors (as shown in Fig. 2). Per the Cancer Research UK, ~40% of cancer burden is currently preventable. For example, at present, several cancers (e.g., prostate, thyroid, brain and testicular cancers) show no benefit from the modification of 14 lifestyle and environmental risk factors123 even though epidemiologic and other studies suggest strong effects of the environment. Therefore, this does not negate the significant contribution of currently unidentified risk factors or that they would become modifiable. Moreover, other known non-intrinsic risk factors such as radon for lung cancer and geographic variations for breast, colorectal and prostate cancers are not currently considered in the Cancer Research UK estimates. While plausible, challenges remain in ascertaining exposure of humans to putative non-intrinsic risks with hypothesized but equivocal evidence for several suspects, including heavy metals, endocrine disruptors, cadmium, sleep deprivation, chemical mixtures especially at low doses and nutrient deficiencies (folate, selenium) identified from experimental systems as pro-tumorigenic.

Potential interactions among various risk factors further complicates measurement issues, though the identification of additional modifiable risk factor(s) will likely open new venues for prevention (or at least intervention). This has been amply illustrated with the successive discovery of risk factors such as smoking, HPV, inflammation in colon cancer, and many others. With modern knowledge, there are also prevention successes in several hereditary cancers. For example, knowledge of the effects of BRCA1 mutation (an endogenous risk) on biological process has facilitated primary prevention including removal of the ovaries to reduce risk of breast cancer and benefits of tamoxifen; findings that support hormone modifying effects on endogenous risk that are modifiable. Similar concepts for the effects of aspirin in families with Hereditary Non-Polyposis Syndrome, a mismatch repair gene deficiency that increases mutation rates are likely to advance prevention efforts aimed at modifying intrinsic and other endogenous processes like ageing.

From our perspective, critical challenges going forward in understanding cancer risks include the need to continue to advance the biological understanding of cancer causation. Importantly, this includes the modern challenge of defining and distinguishing significant cancers (i.e., those that pose risk to life and significantly impact patient well-being) from those that do not and determining to what degree the attributable risk is preventable. Moving forward we need to establish comprehensive sequencing databases on both high and low-incidence regions for major cancers, and link biological theories with observed/experimental data through enhanced modelling and analysis efforts with more concerted efforts to advance models that deal with the complexity of cancer aetiology including simulating the joint effects of intrinsic and non-intrinsic risk factors.

## References

1. 1.

Verellen, D. et al. Innovations in image-guided radiotherapy. Nat. Rev. Cancer 7, 949–960 (2007).

2. 2.

Chabner, B. A. & Roberts, T. G. Jr. Timeline: chemotherapy and the war on cancer. Nat. Rev. Cancer 5, 65–72 (2005).

3. 3.

Palucka, K. & Banchereau, J. Cancer immunotherapy via dendritic cells. Nat. Rev. Cancer 12, 265–277 (2012).

4. 4.

Global Burden of Disease Cancer, C. et al. The Global Burden of Cancer 2013. JAMA Oncol. 1, 505–527 (2015).

5. 5.

Steele, C. B. et al. Vital signs: trends in incidence of cancers associated with overweight and obesity - United States, 2005-2014. Morb. Mortal. Wkly. Rep. 66, 1052–1058 (2017).

6. 6.

Zhu, W., Wu, S. & Hannun, Y. A. Contributions of the Intrinsic Mutation Process to Cancer Mutation and Risk Burdens. EBioMedicine 24, 5–6 (2017). In this work, through analyzing the World Cancer Registry data, the same data used in reference [8] above, we showed that the large global geo-graphical variation in cancer incidence rates for major cancer types would refute the claims that over 60% of cancer risk and driver gene mutations are due solely to the ‘bad luck’ common to all humans.

7. 7.

Tomasetti, C. & Vogelstein, B. Variation in cancer risk among tissues can be explained by the number of stem cell divisions. Science 347, 78–81 (2015). This work suggests that the intrinsic risk factor due to random errors in DNA replication that is common for all humans (i.e. the ‘bad luck’) alone would accunt for over 60% of cancer risk.

8. 8.

Tomasetti, C., Li, L. & Vogelstein, B. Stem cell divisions, somatic mutations, cancer etiology, and cancer prevention. Science 355, 1330–1334 (2017). This work suggests that the intrinsic risk factor due to random errors in DNA replication that is common for all humans (i.e. the ‘bad luck’) alone would accunt for over 60% of cancer driver gene mutations.

9. 9.

Wild, C. et al. Cancer risk: role of chance overstated. Science 347, 728–728 (2015).

10. 10.

Song, M. Y. & Giovannucci, E. L. Cancer risk: many factors contribute. Science 347, 728–729 (2015).

11. 11.

Potter, J. D. & Prentice, R. L. Cancer risk: tumors excluded. Science 347, 727–727 (2015).

12. 12.

O’Callaghan, M. Cancer risk: accuracy of literature. Science 347, 729–729 (2015).

13. 13.

Gotay, C., Dummer, T. & Spinelli, J. Cancer risk: prevention is crucial. Science 347, 728–728 (2015).

14. 14.

Altenberg, L. Statistical problems in a paper on variation in cancer risk among tissues, and new discoveries. Preprint at http://arxiv.org/pdf/1501.04605v1.pdf (2015).

15. 15.

Wu, S., Powers, S., Zhu, W. & Hannun, Y. A. Substantial contribution of extrinsic risk factors to cancer development. Nature 529, 43–47 (2016). In this work, we analyzed data, and developed models and arguments showing that contrary to the claims of references [7, 8] above, the ‘bad luck’ would only account for a modest amount of cancer risk in general.

16. 16.

Twort, C. C. & Twort, J. M. Observations on the reaction of the skin to oils and tars. J. Hyg.-Camb. 28, 219–227 (1928).

17. 17.

Twort, J. M. & Twort, C. C. Comparative activity of some carcinogenic hydrocarbons. Am. J. Cancer 35, 80–85 (1939).

18. 18.

Yamagiwa, K. & Ichikawa, K. Experimental study of the pathogenesis of carcinoma. CA Cancer J. Clin. 27, 174–181 (1977).

19. 19.

Nordling, C. O. A new theory on the cancer-inducing mechanism. Br. J. Cancer 7, 68–72 (1953).

20. 20.

Armitage, P. & Doll, R. The age distribution of cancer and a multi-stage theory of carcinogenesis. Br. J. Cancer 8, 1–12 (1954).

21. 21.

Muller, H. J. Radiation damage to the genetic material. Am. Sci. 38, 33–59 (1950).

22. 22.

Knudson, A. G. Mutation and cancer - Statistical Study of Retinoblastoma. Proc. Natl Acad. Sci. USA 68, 820 (1971).

23. 23.

Cavenee, W. K. et al. Expression of recessive alleles by chromosomal mechanisms in retinoblastoma. Nature 305, 779–784 (1983).

24. 24.

Little, M. P. Are two mutations sufficient to cause cancer? Some generalizations of the two-mutation model of carcinogenesis of Moolgavkar, Venzon, and Knudson, and of the multistage model of Armitage and Doll. Biometrics 51, 1278–1291 (1995).

25. 25.

Tan, W. Y. & Yan, X. W. A new stochastic and state space model of human colon cancer incorporating multiple pathways. Biol. Direct. 5, 26 (2010).

26. 26.

Heidenreich, W. F., Luebeck, E. G., Hazelton, W. D., Paretzke, H. G. & Moolgavkar, S. H. Multistage models and the incidence of cancer in the cohort of atomic bomb survivors. Radiat. Res. 158, 607–614 (2002).

27. 27.

Ames, B. N. Mutagenesis and carcinogenesis - endogenous and exogenous factors. Environ. Mol. Mutagen. 14, 66–77 (1989).

28. 28.

Hussain, S. P. & Harris, C. C. Molecular epidemiology and carcinogenesis: endogenous and exogenous carcinogens. Mutat. Res-Rev. Mutat. 462, 311–322 (2000).

29. 29.

Nowak, M. A. & Waclaw, B. CANCER Genes, environment, and “bad luck”. Science 355, 1266–1267 (2017).

30. 30.

Klutstein, M., Moss, J., Kaplan, T. & Cedar, H. Contribution of epigenetic mechanisms to variation in cancer risk among tissues. Proc. Natl Acad. Sci. USA 114, 2230–2234 (2017). Demonstrating a stronger correlation between tissue levels of DNA methylation and cancer burden, these authors argued that epigenetic changes, which can be influenced by exogenous and endogenous factors, could account for even more cancer risk than those argued for due to pure ‘bad luck’.

31. 31.

Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415 (2013).

32. 32.

Bozic, I. & Nowak, M. A. Unwanted evolution. Science 342, 938–939 (2013).

33. 33.

Sethi, T. K., El-Ghamry, M. N. & Kloecker, G. H. Radon and lung cancer. Clin. Adv. Hematol. & Oncol.: Hamp;O 10, 157–164 (2012).

34. 34.

Besaratinia, A. & Pfeifer, G. P. Second-hand smoke and human lung cancer. Lancet Oncol. 9, 657–666 (2008).

35. 35.

Raaschou-Nielsen, O. et al. Air pollution and lung cancer incidence in 17 European cohorts: prospective analyses from the European Study of Cohorts for Air Pollution Effects (ESCAPE). Lancet Oncol. 14, 813–822, (2013).

36. 36.

Visvader, J. E. Cells of origin in cancer. Nature 469, 314–322 (2011).

37. 37.

Sung, J. J., Lau, J. Y., Goh, K. L. & Leung, W. K., Asia Pacific Working Group on Colorectal. C. Increasing incidence of colorectal cancer in Asia: implications for screening. Lancet Oncol. 6, 871–876 (2005). This is one representative study demonstrating the longitudinal increases in cancer incidence and mortality rates due to changes in lifestyles and other factors .

38. 38.

Siegel, R. L., Miller, K. D. & Jemal, A. Colorectal cancer mortality rates in adults aged 20 to 54 years in the United States, 1970-2014. JAMA 318, 572–574 (2017).

39. 39.

Ferlay, J. et al. Estimates of worldwide burden of cancer in 2008: GLOBOCAN 2008. Int. J. Cancer J. Int. du Cancer 127, 2893–2917 (2010).

40. 40.

Hemminki, K., Forsti, A., Khyatti, M., Anwar, W. A. & Mousavi, M. Cancer in immigrants as a pointer to the causes of cancer. Eur. J. Public Health 24, 64–71 (2014). Favouring environmental/lifestyle risks, this seminal work demonstrated that the offspring of immigrants to high incidence regions acquire the incidence patterns of the host country in one or two generations .

41. 41.

Doll, R. & Hill, A. B. Smoking and carcinoma of the lung - preliminary report. Br. Med. J. 2, 739–748 (1950).

42. 42.

Doll, R. & Hill, A. B. The mortality of doctors in relation to their smoking habits - a preliminary report. Br. Med. J. 1, 1451–1455 (1954).

43. 43.

Koh, H. K., Geller, A. C., Miller, D. R., Grossbart, T. A. & Lew, R. A. Prevention and early detection strategies for melanoma and skin cancer. Current status. Arch. Dermatol. 132, 436–443 (1996).

44. 44.

Blot, W. J. et al. Smoking and drinking in relation to oral and pharyngeal cancer. Cancer Res. 48, 3282–3287 (1988).

45. 45.

Kamangar, F., Chow, W. H., Abnet, C. C. & Dawsey, S. M. Environmental causes of esophageal cancer. Gastroenterol. Clin. North Am. 38, 27–57, vii (2009).

46. 46.

Koch Wayne, M., Lango, M., Sewell, D., Zahurak, M. & Sidransky, D. Head and neck cancer in nonsmokers: a distinct clinical and molecular entity. Laryngoscope 109, 1544–1551 (2009).

47. 47.

Bosch, F. X. et al. Prevalence of human papillomavirus in cervical-cancer - a worldwide perspective. J. Natl. Cancer Inst. 87, 796–802 (1995).

48. 48.

Frisch, M. et al. Sexually transmitted infection as a cause of anal cancer. New Engl. J. Med. 337, 1350–1358 (1997).

49. 49.

Chaturvedi, A. K. et al. Human papillomavirus and rising oropharyngeal cancer incidence in the United States. J. Clin. Oncol. 29, 4294–4301 (2011).

50. 50.

Webb, P. M. et al. Gastric cancer and Helicobacter pylori: a combined analysis of 12 case control studies nested within prospective cohorts. Gut 49, 347–353 (2001).

51. 51.

Freedman, N. D. et al. A prospective study of tobacco, alcohol, and the risk of esophageal and gastric cancer subtypes. Am. J. Epidemiol. 165, 1424–1433 (2007).

52. 52.

Alberg, A. J. et al. A prospective cohort study of bladder cancer risk in relation to active cigarette smoking and household exposure to secondhand cigarette smoke. Am. J. Epidemiol. 165, 660–666 (2007).

53. 53.

Fuchs, C. S. et al. A prospective study of cigarette smoking and the risk of pancreatic cancer. Arch. Intern. Med. 156, 2255–2260 (1996).

54. 54.

Terry, P., Ekbom, A., Lichtenstein, P., Feychting, M. & Wolk, A. Long-term tobacco smoking and colorectal cancer in a prospective cohort study. Int. J. Cancer J. Int. du Cancer 91, 585–587 (2001).

55. 55.

Irigaray, P. et al. Lifestyle-related factors and environmental agents causing cancer: an overview. Biomed. Pharmacother. 61, 640–658 (2007).

56. 56.

Plummer, M. et al. Global burden of cancers attributable to infections in 2012: a synthetic analysis. Lancet. Global Health, 4, e609–e616 (2016).

57. 57.

Helicobacter & Cancer Collaborative. Gastric cancer and Helicobacter pylori: a combined analysis of 12 case control studies nested within prospective cohorts. Gut 49, 347–353 (2001).

58. 58.

Arnheim Dahlstrom, L. et al. Prospective seroepidemiologic study of human papillomavirus and other risk factors in cervical cancer. Cancer Epidemiol., Biomark. Prev. 20, 2541–2550 (2011).

59. 59.

Agalliu, I. et al. Associations of Oral α-, β-, and γ-human papillomavirus types with risk of incident head and neck cancer. JAMA Oncol., https://doi.org/10.1001/jamaoncol.2015.5504 (2016).

60. 60.

Ringelhan, M., O’Connor, T., Protzer, U. & Heikenwalder, M. The direct and indirect roles of HBV in liver cancer: prospective markers for HCC screening and potential therapeutic targets. J. Pathol. 235, 355–367 (2015).

61. 61.

Sun, C. A. et al. Incidence and cofactors of hepatitis C virus-related hepatocellular carcinoma: a prospective study of 12,008 men in Taiwan. Am. J. Epidemiol. 157, 674–682 (2003).

62. 62.

Belpomme, D. et al. The multitude and diversity of environmental carcinogens. Environ. Res. 105, 414–429 (2007).

63. 63.

Veierod, M. B. et al. A prospective study of pigmentation, sun exposure, and risk of cutaneous malignant melanoma in women. J. Natl Cancer Inst. 95, 1530–1538 (2003).

64. 64.

Goss, P. E. & Sierra, S. Current perspectives on radiation-induced breast cancer. J. Clin. Oncol. 16, 338–347 (1998).

65. 65.

Turner, M. C. et al. Radon and Lung Cancer in the American Cancer Society Cohort. Cancer Epidem. Biomar. 20, 438–448 (2011).

66. 66.

Bizzozero, O. J. Jr., Johnson, K. G. & Ciocco, A. Radiation-related leukemia in Hiroshima and Nagasaki, 1946-1964. I. Distribution, incidence and appearance time. N. Eng. J. Med. 274, 1095–1101 (1966).

67. 67.

Anand, P. et al. Cancer is a preventable disease that requires major lifestyle changes. Pharm. Res. 25, 2097–2116 (2008).

68. 68.

Haggar, F. A. & Boushey, R. P. Colorectal cancer epidemiology: incidence, mortality, survival, and risk factors. Clin. Colon Rectal Surg. 22, 191–197 (2009).

69. 69.

Gray, J., Evans, N., Taylor, B., Rizzo, J. & Walker, M. State of the evidence the connection between breast cancer and the environment. Int. J. Occup. Env. Heal. 15, 43–78 (2009).

70. 70.

Shimizu, H. et al. Cancers of the prostate and breast among Japanese and White Immigrants in Los-Angeles-County. Brit. J. Cancer 63, 963–966 (1991).

71. 71.

Kabat, G. C., Matthews, C. E., Kamensky, V., Hollenbeck, A. R. & Rohan, T. E. Adherence to cancer prevention guidelines and cancer incidence, cancer mortality, and total mortality: a prospective cohort study. Am. J. Clin. Nutr. 101, 558–569 (2015).

72. 72.

Gonzalez, C. A. & Riboli, E. Diet and cancer prevention: Contributions from the European Prospective Investigation into Cancer and Nutrition (EPIC) study. Eur. J. Cancer 46, 2555–2562 (2010).

73. 73.

Hastie, T., Tibshirani, R. & Wainwright, M. Statistical Learning with Sparsity: The Lasso and Generalizations. (Chapman and Hall/CRC, 2015).

74. 74.

Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate - a practical and powerful approach to multiple testing. J. Roy. Stat. Soc. B Met. 57, 289–300 (1995).

75. 75.

Beral, V. et al. Alcohol, tobacco and breast cancer - collaborative reanalysis of individual data from 53 epidemiological studies, including 58515 women with breast cancer and 95067 women without the disease. Br. J. Cancer 87, 1234–1245 (2002).

76. 76.

Allen, N. E. et al. Moderate alcohol intake and cancer incidence in women. J. Natl. Cancer Inst. 101, 296–305 (2009). This study shows that through large scale epidemiologial studies such as the UK Million Women Study documented here, increasingly feasible thanks to the advancement of technologies and collaborations, we can increase the sensitivity and specificity of the anlsyes enough, to detect risk factors due to lifestyles

77. 77.

Willett, W. C. Diet, nutrition, and avoidable cancer. Environ. Health Perspect. 103(Suppl 8), 165–170 (1995).

78. 78.

Wiseman, M. The second World Cancer Research Fund/American Institute for Cancer Research expert report. Food, nutrition, physical activity, and the prevention of cancer: a global perspective. Proc. Nutr. Soc. 67, 253–256 (2008).

79. 79.

Drake, J. W. Mutagenic Mechanisms. Annu. Rev. Genet. 3, 247 (1969).

80. 80.

Pleasance, E. D. et al. A small-cell lung cancer genome with complex signatures of tobacco exposure. Nature 463, 184–190 (2010).

81. 81.

Pleasance, E. D. et al. A comprehensive catalogue of somatic mutations from a human cancer genome. Nature 463, 191–196 (2010). This ground breaking study revealed the diversity and complexity of somatic mutational processes underlying carcinogenesis in humans through mutational patterns buried within cancer genomes.

82. 82.

Alexandrov, L. B. & Stratton, M. R. Mutational signatures: the patterns of somatic mutations hidden in cancer genomes. Curr. Opin. Genet. Dev. 24, 52–60 (2014).

83. 83.

Hoang, M. L. et al. Mutational signature of aristolochic acid exposure as revealed by whole-exome sequencing. Sci. Transl. Med 5, 3006200 (2013).

84. 84.

Brown, S. B. & Hankinson, S. E. Endogenous estrogens and the risk of breast, endometrial, and ovarian cancers. Steroids 99, 8–10 (2015).

85. 85.

Kaaks, R., Lukanova, A. & Kurzer, M. S. Obesity, endogenous hormones, and endometrial cancer risk. A Synth. Rev. 11, 1531–1543 (2002).

86. 86.

Lortet-Tieulent, J., Ferlay, J., Bray, F. & Jemal, A. International patterns and trends in endometrial cancer incidence, 1978–2013. J. Natl Cancer Inst., 110, 354–361 (2017).

87. 87.

Onstad, M. A., Schmandt, R. E. & Lu, K. H. Addressing the role of obesity in endometrial cancer risk, prevention, and treatment. J. Clin. Oncol. 34, 4225–4230 (2016).

88. 88.

Upala, S. & Anawin, S. Bariatric surgery and risk of postoperative endometrial cancer: a systematic review and meta-analysis. Surgery Obes. Related Dis. 11, 949–955 (2015).

89. 89.

Carbone, M. & Bedrossian, C. W. The pathogenesis of mesothelioma. Semin. Diagn. Pathol. 23, 56–60 (2006).

90. 90.

Elinav, E. et al. Inflammation-induced cancer: crosstalk between tumours, immune cells and microorganisms. Nat. Rev. Cancer 13, 759–771 (2013).

91. 91.

Mantovani, A., Allavena, P., Sica, A. & Balkwill, F. Cancer-related inflammation. Nature 454, 436–444 (2008).

92. 92.

Yu, H., Pardoll, D. & Jove, R. STATs in cancer inflammation and immunity: a leading role for STAT3. Nat. Rev. Cancer 9, 798–809 (2009).

93. 93.

Parker, K. H., Beury, D. W. & Ostrand-Rosenberg, S. in Advances in Cancer Research Vol. 128, (eds Xiang-Yang, W. & Paul, B. F.) 95–139 (Academic Press, 2015).

94. 94.

Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: the next generation. Cell 144, 646–674 (2011).

95. 95.

Todoric, J., Antonucci, L. & Karin, M. Targeting inflammation in cancer prevention and therapy. Cancer Prev. Res. 9, 895–905 (2016).

96. 96.

Chubak, J., Kamineni, A., Buist, D. S., Anderson, M. L. & Whitlock, E. P. Aspirin Use for the Prevention of Colorectal Cancer: An Updated Systematic Evidence Review for the U.S. Preventive Services Task Force. (Agency for Healthcare Research and Quality, 2015).

97. 97.

de Magalhaes, J. P. How ageing processes influence cancer. Nat. Rev. Cancer 13, 357–365 (2013).

98. 98.

Lopez-Otin, C., Blasco, M. A., Partridge, L., Serrano, M. & Kroemer, G. The hallmarks of aging. Cell 153, 1194–1217 (2013).

99. 99.

Podolskiy, D. I. & Gladyshev, V. N. Intrinsic versus extrinsic cancer risk factors and aging. Trends Mol. Med. 22, 833–834 (2016).

100. 100.

Ruzankina, Y. & Brown, E. J. Relationships between stem cell exhaustion, tumour suppression and ageing. Br. J. Cancer 97, 1189–1193 (2007).

101. 101.

Burton, D. G. A. Cellular senescence, ageing and disease. Age 31, 1–9 (2009).

102. 102.

Clarke, C. A. et al. Age-specific incidence of breast cancer subtypes: understanding the black-white crossover. J. Natl Cancer Inst. 104, 1094–1101 (2012).

103. 103.

Lichtenstein, P. et al. Environmental and heritable factors in the causation of cancer–analyses of cohorts of twins from Sweden, Denmark, and Finland. N. Eng. J. Med. 343, 78–85 (2000).

104. 104.

Mucci, L. A. et al. Familial risk and heritability of cancer among twins in Nordic Countries. Jama-J. Am. Med. Assoc. 315, 68–76 (2016).

105. 105.

Donehower, L. A. et al. MLH1-silenced and non-silenced subgroups of hypermutated colorectal carcinomas have distinct mutational landscapes. J. Pathol. 229, 99–110 (2013).

106. 106.

Flavahan, W. A., Gaskell, E. & Bernstein, B. E. Epigenetic plasticity and the hallmarks of cancer. Science 357, 6348 (2017). While the above work [30] serves mainly as a counter example to the ‘bad luck’ theory [7] , this seminal work demonstrates that genetic, environmental, or metabolic insults can induce overly restrictive or permissive epigenetic landscapes that subsequently contribute to pathogenesis of cancer and other diseases – thus delineating the epigenetic mechanisms behind various risk factors, endogenous and exogenous, to cancer

107. 107.

Guida, F. et al. Dynamics of smoking-induced genome-wide methylation changes with time since smoking cessation. Hum. Mol. Genet. 24, 2349–2359 (2015).

108. 108.

Kundu, T. K. Epigenetics: development and disease. (Springer, 2013).

109. 109.

Sen, P., Shah, P. P., Nativio, R. & Berger, S. L. Epigenetic mechanisms of longevity and aging. Cell 166, 822–839 (2016).

110. 110.

The Telomeres Mendelian Randomization, Collaboration. Association between telomere length and risk of cancer and non-neoplastic diseases: A mendelian randomization study. JAMA Oncol., 3, 636–651 (2017).

111. 111.

Romano, G. H. et al. Environmental stresses disrupt telomere length homeostasis. PLoS Genet. 9, e1003721 (2013).

112. 112.

Starkweather, A. R. et al. An integrative review of factors associated with Telomere length and implications for biobehavioral research. Nurs. Res. 63, 36–50 (2014).

113. 113.

Jiang, Y. et al. An international ecological study of adult height in relation to cancer incidence for 24 anatomical sites. Cancer Causes Control 26, 493–499 (2015).

114. 114.

Zhang, B. et al. Height and breast cancer risk: evidence from prospective studies and Mendelian randomization. J Natl Cancer I 107, djv219 (2015).

115. 115.

Khankari, N. K. et al. Association between adult height and risk of colorectal, lung, and prostate cancer: results from meta-analyses of prospective studies and Mendelian randomization analyses. PLoS Med. 13, e1002118 (2016).

116. 116.

Jelenkovic, A. et al. Genetic and environmental influences on height from infancy to early adulthood: An individual-based pooled analysis of 45 twin cohorts. Sci. Rep. 6, 28496 (2016).

117. 117.

Addo, O. Y. et al. Maternal height and child growth patterns. J. Pediatr. 163, 549–554 (2013).

118. 118.

Bjerregaard, L. G. et al. Childhood height, adult height, and the risk of prostate cancer. Cancer Causes Control 27, 561–567 (2016).

119. 119.

DeClercq, V., McMurray, D. N. & Chapkin, R. S. Obesity promotes colonic stem cell expansion during cancer initiation. Cancer Lett. 369, 336–343 (2015).

120. 120.

Sutcliffe, S. & Colditz, G. A. Prostate cancer: is it time to expand the research focus to early-life exposures? Nat. Rev. Cancer 13, 208–518 (2013).

121. 121.

Riley, A. W. & Duncan, G. J. Completing a national birth cohort in the united states. JAMA Pediatr. 170, 829–830 (2016).

122. 122.

Doll, R. & Peto, R. The causes of cancer: quantitative estimates of avoidable risks of cancer in the United States today. J. Natl Cancer Inst. 66, 1191–1308 (1981). Re-analyzing the landmark 1981 paper from Doll and Peto (see citation below) using modern data, the authors confirmed that a substantial proportion of cancer risk is due to modifiabe exogeneous risk factors then and now, and echoed our observations [6] that the large geographic variation in cancer rates observed at most organs indicates that the percentage of cancers arising entirely due to bad luck is relatively low.

123. 123.

Parkin, D. M., Boyd, L. & Walker, L. C. 16. The fraction of cancer attributable to lifestyle and environmental factors in the UK in 2010. Br. J. Cancer 105(Suppl 2), S77–S81 (2011).

124. 124.

IARC Advisory Group. IARC Monographs on the Evaluation of Carcinogenic Risks to Humans Report No. 14/001 (World Health Organization, 2008).

125. 125.

Charles, D. R. & Luce-Clauses, E. M. The kinetics of papilloma formation in benzpyrene-treated mice. Cancer Res. 2, 261–263 (1942).

126. 126.

Moolgavkar, S. H. & Knudson, A. G. Mutation and cancer - a model for human carcinogenesis. J. Natl. Cancer Inst. 66, 1037–1052 (1981).

127. 127.

Moolgavkar, S. H. Model for human carcinogenesis - action of environmental agents. Environ. Health Perspect. 50, 285–291 (1983).

128. 128.

Vogelstein, B. & Kinzler, K. W. The Path to Cancer–Three Strikes and You’re Out. N. Eng. J. Med. 373, 1895–1898, (2015).

129. 129.

Ren, R. Mechanisms of BCR-ABL in the pathogenesis of chronic myelogenous leukaemia. Nat. Rev. Cancer 5, 172–183 (2005).

130. 130.

Faderl, S. et al. The biology of chronic myeloid leukemia. N. Eng. J. Med. 341, 164–172 (1999).

## Author information

Authors

### Contributions

S.W., W.Z. and Y.A.H. conceived the study. S.W., W.Z., P.T. and Y.A.H. performed the literature review and wrote the manuscript. Y.A.H. and W.Z. directed the study.

### Corresponding author

Correspondence to Yusuf A Hannun.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Wu, S., Zhu, W., Thompson, P. et al. Evaluating intrinsic and non-intrinsic cancer risk factors. Nat Commun 9, 3490 (2018). https://doi.org/10.1038/s41467-018-05467-z

• Accepted:

• Published:

• ### E3 ubiquitin ligase HECTD2 mediates melanoma progression and immune evasion

• Eleonora Ottina
• , Veera Panova
• , Laura Doglio
• , Anastasiya Kazachenka
• , Georgina Cornish
• , Joanna Kirkpatrick
• , Jan Attig
• , George R. Young
• , Kevin Litchfield
• , Tom Lesluyes
• , Peter Van Loo
• , Charles Swanton
• , James MacRae
• , Thomas Tüting
•  & George Kassiotis

Oncogene (2021)

• ### Mutational landscape of multiple primary lung cancers and its correlation with non-intrinsic risk factors

• Motohiro Izumi
• , Jun Oyanagi
• , Kenji Sawa
• , Mitsuru Fukui
• , Koichi Ogawa
• , Yoshiya Matsumoto
• , Yoko Tani
• , Tomohiro Suzumura
• , Tetsuya Watanabe
• , Hiroyasu Kaneda
• , Shigeki Mitsuoka
• , Kazuhisa Asai
• , Masahiko Ohsawa
• , Yasuhiro Koh
•  & Tomoya Kawaguchi

Scientific Reports (2021)

• ### NMR metabolomic profiles associated with long-term risk of prostate cancer

• Lucie Lécuyer
• , Agnès Victor Bala
• , Aicha Demidem
• , Mohamed Nawfal Triba
• , Pilar Galan
• , Serge Hercberg
• , Valentin Partula
• , Bernard Srour
• , Paule Latino-Martel
• , Emmanuelle Kesse-Guyot
• , Nathalie Druesne-Pecollo
• , Marie-Paule Vasson
• , Mélanie Deschasaux-Tanguy
• , Philippe Savarin
•  & Mathilde Touvier

Metabolomics (2021)

• ### Approaching precision medicine by tailoring the microbiota

• Gaeun Ryu
• , Hyojin Kim
•  & Ara Koh

Mammalian Genome (2021)

• ### To Study the Anti-cancer Effects of Shigella Flexneri in AspC-1 Pancreatic Cancer Cell Line in Approach to Bax and bcl-2 Genes

• Amir Khodavirdipour
• , Fahimeh Jamshidi
• , Masoud Zandi
•  & Rana Zarean

Journal of Gastrointestinal Cancer (2021)