Pro-inflammatory cytokine polymorphisms and interactions with dietary alcohol and estrogen, risk factors for invasive breast cancer using a post genome-wide analysis for gene–gene and gene–lifestyle interaction

Jung, Su Yon; Papp, Jeanette C.; Sobel, Eric M.; Pellegrini, Matteo; Yu, Herbert; Zhang, Zuo-Feng

doi:10.1038/s41598-020-80197-1

Download PDF

Article
Open access
Published: 13 January 2021

Pro-inflammatory cytokine polymorphisms and interactions with dietary alcohol and estrogen, risk factors for invasive breast cancer using a post genome-wide analysis for gene–gene and gene–lifestyle interaction

Su Yon Jung¹,
Jeanette C. Papp²,
Eric M. Sobel^2,3,
Matteo Pellegrini⁴,
Herbert Yu⁵ &
…
Zuo-Feng Zhang^6,7

Scientific Reports volume 11, Article number: 1058 (2021) Cite this article

2371 Accesses
5 Citations
Metrics details

Subjects

Abstract

Molecular and genetic immune-related pathways connected to breast cancer and lifestyles in postmenopausal women are not fully characterized. In this study, we explored the role of pro-inflammatory cytokines such as C-reactive protein (CRP) and interleukin-6 (IL-6) in those pathways at the genome-wide level. With single-nucleotide polymorphisms (SNPs) in the biomarkers and lifestyles together, we further constructed risk profiles to improve predictability for breast cancer. Our earlier genome-wide association gene-environment interaction study used large cohort data from the Women’s Health Initiative Database for Genotypes and Phenotypes Study and identified 88 SNPs associated with CRP and IL-6. For this study, we added an additional 68 SNPs from previous GWA studies, and together with 48 selected lifestyles, evaluated for the association with breast cancer risk via a 2-stage multimodal random survival forest and generalized multifactor dimensionality reduction methods. Overall and in obesity strata (by body mass index, waist, waist-to-hip ratio, exercise, and dietary fat intake), we identified the most predictive genetic and lifestyle variables. Two SNPs (SALL1 rs10521222 and HLA-DQA1 rs9271608) and lifestyles, including alcohol intake, lifetime cumulative exposure to estrogen, and overall and visceral obesity, are the most common and strongest predictive markers for breast cancer across the analyses. The risk profile that combined those variables presented their synergistic effect on the increased breast cancer risk in a gene–lifestyle dose-dependent manner. Our study may contribute to improved predictability for breast cancer and suggest potential interventions for the women with the risk genotypes and lifestyles to reduce their breast cancer risk.

Genome-wide association studies

Article 26 August 2021

Associations of dietary patterns with brain health from behavioral, neuroimaging, biochemical and genetic analyses

Article Open access 01 April 2024

Utility of polygenic scores across diverse diseases in a hospital cohort for predictive modeling

Article Open access 12 April 2024

Introduction

Chronic inflammation may play an important role in the pathogenesis of non-inflammatory diseases, such as breast cancer, from tumor initiation through progression^1,2. Activation of innate immunity creates a tissue microenvironment high in reactive oxygen and nitrogen species, leading to potential DNA damage and alterations in nearby cells^3,4,5. The inflammatory response also elevates the circulating levels of cancer-promoting inflammatory cytokines such as C-reactive protein (CRP) and interleukin-6 (IL-6)². These key pro-inflammatory biomarkers reflect different molecular pathways in the immune cascade in acute and chronic immune responses but may be interrelated in carcinogenesis, yielding a congruent association with breast cancer risk. For example, IL-6, upregulated by macrophages and adipose tissue, promotes breast tumor initiation and progression^6,7. CRP, a major acute-phase reactant and a biomarker of chronic low-grade inflammation, partially induced by IL-6, has been associated with increased risk of breast cancer^8,9. The carcinogenetic mechanisms of these markers are partially understood. IL-6 regulates aromatase activity responsible for estrogen production in adipose tissue, which is important in developing postmenopausal breast cancer^10,11. CRP levels are attenuated by prolonged inhibition of cyclooxygenase-2 action (promoting estrogen formation in adipose tissue)^11,12. Thus, IL-6 and CRP may be involved in inflammatory pathways connected to breast cancer tumorigenesis.

Given the relationships between those inflammatory markers and breast cancer risk, genetic variants involved in the biomarkers’ functional and structural regulation may have potential implication in the causal pathway, affecting the risk of breast cancer. Previous genomic epidemiology studies for the associations between CRP/IL-6-related genome-wide genetic variants and breast cancer risk are limited and mostly showed null results^{13,14,15,16,17}, while only a few reported a marginal effect on breast cancer risk⁶. The gene–phenotype pathway may not be connected to CRP and IL-6 alone, but also modulated by lifestyle pathways linked to obesity (overall and visceral)^{15,18,19,20,21,22,23,24,25}, lipid metabolism^25,26, high-fat diet, exercise, smoking, and alcohol^{18,27,28,29,30,31,32,33,34}. Further, the inflammatory cytokines and the genetic markers have demonstrated different associations with breast cancer according to obesity^16,35 and related lifestyle factors such as physical activity and dyslipidemia^36,37,38. Thus, studying how those lifestyle factors modify and interact with gene and phenotype, leading to increased breast cancer susceptibility, may contribute to the understanding of the complex genotype–phenotype pathway and is important to develop a genetically targeted intervention tool for use in primary breast cancer prevention efforts.

In addition, immune-related etiologic pathways in breast cancer development may differ by menopausal status, probably due to the role of sex hormones in mediating the innate and adaptive immune systems. Our current study has focused on postmenopausal women who are vulnerable to a high incidence of inflammation³⁹, obesity, and breast cancer (e.g., 80% of new cases occur in women age 50 years and older^40,41). Using a large-scale postmenopausal women cohort from the Women’s Health Initiative Database for Genotypes and Phenotypes (WHI dbGaP) Study, we previously performed a genome-wide association (GWA) gene–environment (G × E) interaction study for CRP and IL-6 by addressing the pleiotropic effect of those biomarkers on the gene–phenotype relationship; we identified 88 top GWA single-nucleotide polymorphisms (SNPs)⁴². We have now extended the scope of modeled genetic factors by including 68 additional SNPs in relation to CRP and IL-6 from previous GWA studies that focused on European ancestry with independent replications^20,21,43,44. We examined the association of those top GWA-based SNPs with primary invasive breast cancer risk overall and in obesity-related strata in which the SNPs were associated with CRP and IL-6 at genome-wide significance in our earlier GWA study⁴². This approach may allow us to elucidate an empirical pathway through which a substantial proportion of the susceptibility of GWA SNPs in CRP and IL-6 influences breast cancer risk through interactions with specific lifestyles (Figure S1).

In this study, we hoped to improve the predictability of breast cancer by better characterizing the genetic architecture of the inflammatory biomarkers that interact with lifestyle factors. We evaluated the GWA SNPs and 48 selected lifestyle factors together by conducting a two-stage multimodal random survival forest (RSF) analysis and ranked them according to their predictive value and accuracy for breast cancer. In addition, we applied a generalized multifactor dimensionality reduction (GMDR) model to characterize high-order gene–gene interactions and selected the best genetic prediction model^45,46,47,48. Finally, with the most predictive SNPs and lifestyle factors selected via the RSF and GMDR, we constructed prediction models for breast cancer risk and estimated the combined and joint interaction effects of genotypes and lifestyles on the development of breast cancer. Ultimately, we tested the empirical hypothesis that the most-predictive genetic and lifestyle factors in combination increase the predictability of breast cancer risk in a synergistic manner.

Material and methods

Study population

Our study included healthy postmenopausal women enrolled in the WHI Harmonized and Imputed GWA Studies (GWASs) which was coordinated by dbGaP to contribute to a joint imputation and harmonization effort for GWASs within the 2 representative study arms, Clinical Trials and Observational Studies. The detailed study designs and rationale are described elsewhere^49,50. Briefly, healthy women were enrolled in the WHI study between 1993 and 1998 at 40 clinical centers across the United States if they were 50–79 years old, postmenopausal, expected to stay near the clinical centers for at least 3 years after enrollment, and able to provide written informed consent. Participants were eligible for the WHI dbGaP study if they had met eligibility requirements for submission to dbGaP and provided DNA samples. The Harmonization and Imputation GWASs under the dbGaP study accession (phs000200.v12.p3) consist of 6 sub-studies (Table S1). Of the 16,088 women who reported their race or ethnicity as non-Hispanic white (Figure S2), in our earlier GWA GxE study, we applied the exclusion criteria (diabetes history; genetic data duplications; first- and second-degree relatives; and genetic quality control [QC] based on principal components), leaving 10,798 women. In the current study, we additionally excluded 619 with < 1 year follow-up period and/or a diagnosis of any type of cancer at enrollment, leaving a total of 10,179 women (94% of the eligible 10,798 GWA participants). These women had been followed up through August 29, 2014, with a mean of 16 years follow-up, and 537 of them had developed primary invasive breast cancer. The Institutional Review Boards of each WHI participating clinical center and the University of California, Los Angeles, approved this study. all methods were performed in accordance with the relevant guidelines and regulations.

Data collection and breast cancer outcome

The coordinating clinical centers conducted data quality assurance periodically and collected participant information through self-administered questionnaires. In this study, we initially selected 48 variables measured at screening for our analysis on the basis of (1) their association with inflammation and breast cancer through the literature review^{36,51,52,53,54} and (2) preliminary analyses including univariate and stepwise multiple regression analyses and a multicollinearity test. Those variables include demographic and socioeconomic factors (age, education, marital status, family income, and employment); family histories of breast and colorectal cancers and diabetes; medical histories (depressive symptoms, hypertension, high cholesterol, and cardiovascular disease); lifestyles (cigarette smoking and exercise); dietary factors (dietary energy, alcohol intake, total sugar, fiber, fruit, and vegetable consumption; % calories from protein, carbohydrates, saturated fatty acids [SFA], monounsaturated FA [MFA], and polyunsaturated FA [PFA]); and reproductive histories (history of hysterectomy, removal of one or both ovaries, ages at menarche and menopause, pregnancy, breast feeding, oral contraceptive (OC) use, and use of exogenous estrogen [E] only and E plus progestin [E + P]). We also included anthropometric variables, including height, weight, and waist and hip circumferences, which had been measured by trained staff.

The breast cancer outcomes were determined via a centralized review of medical charts by a committee of physicians on the basis of pathology or cytology reports. The time from enrollment to breast cancer development, censoring, or study end point was calculated and represented in years. Cancer cases were coded using the National Cancer Institute’s Surveillance, Epidemiology, and End-Results guidelines⁵⁵.

Genotyping

We extracted genotyped data from the WHI dbGaP Harmonized and Imputed GWASs. Details of the data-cleaning process have been previously discussed^42,56. Briefly, the genotypes were normalized to the reference panel GRCh37, and imputation was conducted via 1000 Genomes reference panels⁵⁷. SNPs for harmonization were checked for pairwise concordance among all samples across the GWASs. The initial data QC included SNP filtering with a missing-call rate of < 2% and a Hardy–Weinberg equilibrium of p ≥ 1E–04. The second QC step included SNPs with \({\widehat{R}}^{2}\ge 0.6\) imputation quality⁵⁸ but excluded individuals with a KING kinship estimate > 0.088⁵⁹.

Statistical analysis

Differences in participants’ baseline characteristics and allele frequencies by breast cancer development were examined with unpaired 2-sample t tests (for continuous variables) and chi-squared tests (for categorical variables). If continuous variables were skewed or had outliers, Wilcoxon’s rank-sum test was conducted. Our previous GWA analysis evaluated the gene–lifestyle interactions via stratifications defined by body mass index (BMI; cutoff, 30 kg/m²), waist circumstance (WST; cutoff, 88 cm), waist-to-hip ratio (WHR; cutoff, 0.85), metabolic equivalents (METs; cutoff, 10 h/week), and % calories from SFA (cut-off, 9%). The results (G × E formal test and stratified analysis) from the sub-GWASs were combined in a meta-analysis assuming a fixed-effect model. In this study, we performed an association study of the 88 SNPs identified in subgroups by obesity and obesity-related lifestyle variables with breast cancer risk in the identical subgroups. The additional 68 SNPs from other GWA studies were pulled together overall and in subgroups for the purpose of analysis.

In the current study, we conducted the RSF analysis. The RSF initially generates bootstrap samples using approximately 63% of the original data and grows a tree from each sample via a splitting rule to maximize survival differences across daughter nodes. This tree-building process is repeated numerous times (n = 5000 in this study), creating a forest of trees^60,61. An ensemble cumulative hazard estimate was calculated from each tree and averaged over all trees for each individual and used to compute a predicted cumulative breast cancer incidence rate. Also, using this ensemble estimate and creating the out-of-bag (OOB) data (about 37% of the original data not used for bootstrapping), the OOB concordance index (c-index) was estimated, which is a measure of prediction performance conceptually similar to the area under the receiver operating characteristic (AUROC) curve^60,62. The rank of each variable was determined on the basis of its predictability for breast cancer according to 2 predictive parameters: (1) minimal depth (MD), in which variables that have a small MD and split the tree close to the root are considered highly predictive and (2) variable importance (VIMP), computed as the difference between the OOB c-indexes from the original OOB data and from the permuted OOB data, in which variables that have greater VIMP values are the more predictive⁶³. Because they use different prediction algorithms, we expect the variables’ ranking to differ to some degree. The RSF, a machine-learning and nonparametric tree-based ensemble method, accounts for nonlinearity and high-order interactions among variables, which may not be handled by a traditional regression method^63,64. The RSF may thus provide a more accurate risk estimation.

We performed a 2-stage RSF analysis (Figure S3). In the first stage, we implemented an RSF on SNPs and lifestyle factors separately. Only those SNPs and lifestyle factors with distinctly low MD and high VIMP values were carried over in the second stage. In that second stage, we took a multimodal approach overall and in subgroups (by BMI, WHR, WST, MET, and SFA) by (1) comparing MD and VIMP measures in the plot, (2) computing the OOB c-index from the nested RSF model, and (3) estimating the incremental error rate of each variable in the nested sequence of RSF models from the top variable and calculating a dropping error rate. This RSF multimodal approach enabled us to exclude from the outset the SNPs and lifestyle factors that were not significantly associated with breast cancer, leading to increased statistical power and corrected type I error rate compared with the original RSF model⁶¹.

Further, we applied a GMDR model that is described in detail elsewhere^45,46,47. The GMDR reduces high-dimensional multifactor prediction to a single dimension by the ratio of high vs. low risk, and thus detects the best gene–gene interaction model. It produces key predictability performance measures, including testing balance accuracy (TBA), cross-validation consistency (CVC), and sign p value. The model with the highest TBA, CVC 10/10, and p < 0.05 based on 1000-times permutation testing was considered the best model.

Multiple Cox proportional hazards regressions, with a test of proportional hazards via a Schoenfeld residual plot and ρ evaluation, were conducted to obtain hazard ratios (HRs) and 95% confidence intervals (CIs) for the single and combined effects of SNPs and lifestyle factors on breast cancer, with adjustment for covariates (Table 1). A 2-tailed p value < 0.05 was considered statistically significant, and multiple comparisons were adjusted by the Benjamini–Hochberg method⁶⁵. GMDR v.1.0. and R v.3.5.2. (survival, survivalROC, randomForestSRC, ggRandomForests, gamlss, ggsurvplot, and forestplot packages) were used.

Table 1 Characteristics of participants, stratified by breast cancer.

Full size table

Results

The allele frequencies of 156 GWA CRP/IL-6-related SNPs and baseline characteristics of participants are displayed in Tables S1 and 1, respectively. Breast cancer patients had relatively higher education, greater family income, and family history of diabetes and breast cancer, smoked more cigarettes/day, consumed more dietary alcohol/day, and were more depressed, obese both overall and viscerally, and taller. They also tended to experience early menarche and late menopause and had less history of hysterectomy and shorter duration of OC and E-only use, but longer duration of E + P use.

Two-stage multimodal RSF and GMDR approach

With the 156 GWA SNPs and 48 lifestyle factors, we performed the two-stage RSF and GMDR (Figure S3) to determine the most predictive variables with the highest predictability and lowest prediction error for breast cancer risk. In the first stage, we estimated 2 predictability performance measures, MD and VIMP. For lifestyles and SNPs separately, we created a plot to compare those 2 measures and identified the strongest predictive lifestyle and genetic factors that were in agreement with high ranks (Figure S4) in overall analysis: 12 of 48 lifestyles and 13 of 156 SNPs. We further conducted the first stage of RSF for SNPs in the subgroups, which yielded the following results: 8 and 13 of 117 SNPs (BMI < 30 and ≥ 30, respectively); 14 and 7 of 70 SNPs (WHR ≤ 0.85 and > 0.85, respectively); 10 and 6 of 81 SNPs (WST ≤ 88 and > 88, respectively); 7 and 12 of 82 SNPs (METs ≥ 10 and < 10, respectively); and 19 and 12 of 116 SNPs (SFA < 9 and ≥ 9, respectively). All of the SNPs identified in this first stage of RSF were associated with CRP.

Next, with the 12 lifestyles and selected SNPs together, overall and in subgroups, we conducted the second multimodal RSF to construct risk profiles with the most predictive variables. Particularly, in the overall group, we first computed the 2 measures MD and VIMP (Table 2) and compared them in a plot (Fig. 1A), in which a dashed red line represents agreement of the 2 measures. Both measures with high ranks indicated 5 SNPs (SALL1 rs10521222; HLA-DQA1 rs9271608; DUSP1 rs17658229; APOC1 rs4420638; and TRAIP rs2352975) and 3 lifestyles (duration of OC and E + P use and BMI) as the most influential variables for breast cancer. Second, we estimated the c-index (i.e., the AUROC) from the nested RSF model (Table 2) and plotted (Fig. 1B) where variables ranked by MD, identifying the same set of top variables (5 SNPs and 3 lifestyles). Those top variables substantially improved the c-index prediction accuracy, whereas others did not, suggesting that the c-index has complementary prediction ability. Last, we computed a dropping error rate for each variable in the nested sequence of RSF models (Table 2), and once again identified the same top 8 variables as the strongest contributors to reduce the error rate, thus improving the prediction accuracy. Further, using the GMDR method, we determined the best gene-by-gene interaction models up to 5 orders of interactions (Table 3), of which the one-factor model including TRAIP rs2352975 was the best predictive with the highest TBA of 0.5382 and CVC of 10/10 (p < 0.001).

Table 2 The second stage of random survival forest analysis: predictive value of variable for breast cancer in overall analysis.

Full size table

Table 3 GMDR-based model for high-order gene–gene interactions in relation to breast cancer risk.

Full size table

For each of the obesity strata (BMI, WHR, WST, MET, and SFA), we continuously applied those multimodal (Tables S2.1–10 and Figures S5–9) and GMDR (Table 3) approaches, and determined the strongest predictive markers with the most common 6 SNPs (TRAIP rs2352975, DUSP1 rs17658229, HLA-DQA1 rs9271608, SALL1 rs10521222, HNF1A-AS1 rs2243616, and APOC1 rs4420638) and 5 lifestyle factors (dietary alcohol intake, E + P and OC use, BMI, and hip circumference).

Combined and joint effects of the most influential SNPs and lifestyles on breast cancer risk

By accounting for confounding factors and the nonlinearity of each variable via the RSF method, we estimated the predicted cumulative incidence rate of breast cancer (Fig. 2). The genotypes of each SNP were originally continuous variables and then categorized accordingly for further analysis with the following risk genotypes (Fig. 2A–E): TRAIP rs2352975 CT + TT, DUSP1 rs17658229 CC, HLA-DQA1 rs9271608 GG, SALL1 rs10521222 TT, and APOC1 rs4420638 GG. Also, by using a cutoff value bisecting variables (Fig. 2F–I), high-risk lifestyle groups were defined as ≥ 18 g/day of alcohol consumption, ≥ 10 years of E + P use, < 5 years of past OC use, or ≥ 30 BMI and further analyzed as binary variables. With the best predictive GMDR-modeled SNPs and risk lifestyles overall and in subgroups, we developed multivariate models for breast cancer risk (Table S3). These results suggested a stronger individual effect of some SNPs than the rest of the SNPs and lifestyles on breast cancer risk, even after accounting for confounding factors.

The SNPs and lifestyles, when combined or jointly associated, displayed different patterns of breast cancer risk. In particular, in the overall non-obese (BMI < 30) group (Table 4), the best predictive SNPs and lifestyles were combined separately. When stratified by alcohol intake, high alcohol consumers (≥ 18 g/day) who had the maximum number of risk genotypes had a 4 times increased risk for breast cancer than low alcohol consumers (< 18 g/day) who had less or null-risk genotypes. Consistently, high alcohol consumers with one or more risk lifestyles had 3 times higher risk than low alcohol consumers with a null-risk lifestyle. When SNPs and lifestyles were combined, compared with the lowest-risk group (null risk for genotypes and lifestyles), the moderate-risk (high risk of either genotypes or lifestyles) and the highest-risk groups (high risk of both genotypes and lifestyles) had about 3 times and 6 times greater risk, respectively, suggesting a gene–lifestyle dose–response relationship. Further, when stratified by alcohol consumption, higher alcohol consumers with high risk of both genotypes and lifestyles had 10 times the excessive risk, compared with low alcohol consumers with low risk of both genotypes and lifestyles. This indicates a significant joint effect of alcohol intake with the SNPs and lifestyles on breast cancer risk in an additive model (G × E: HR = 1.15, p 0.547). Multiple testing was corrected to control the false-discovery rate. The analyses of the non-viscerally obese (WHR ≤ 0.85) group (Table 4) yielded similar results but with stronger combined and joint effects of risk genotypes and lifestyles with alcohol intake on breast cancer risk in both additive and multiplicative models (G × E: HR = 1.37, p 0.253).

Table 4 Stratification analysis by BMI and WHR: joint effect of dietary alcohol intake with combined risk genotypes and behavioral factors on breast cancer risk.

Full size table

We further evaluated the combined effect of SNPs and lifestyle factors and their joint effect with E + P use on breast cancer risk (Table S4) and determined that the risk genotypes and lifestyles, both separately and in combination, had a synergistic effect with longer use of E + P (≥ 10 years) on cancer risk. This pattern appeared more strongly in obesity strata (BMI, WHR, MET, and SFA) than in the overall group (Fig. 3).

Discussion

An increasing number of population-based cancer genomic studies have incorporated environmental factors in the molecular causal pathway. Comprehending how lifestyle factors interact with genes and phenotypes, influencing risk for breast cancer, is important for constructing improved risk profiles, leading to the development of a gene–lifestyle combination intervention for primary cancer prevention efforts. Our 2-stage multimodal RSF and GMDR analyses identified the strongest predictive genetic and lifestyle variables overall and in obesity strata. The genetic effects in this study were associated with the SNPs involved in inflammatory cytokine pathways. The most common markers for breast cancer risk across the strata are 2 SNPs related to CRP (SALL1 rs10521222 and HLA-DQA1 rs9271608) and, consistent with previous studies^66,67,68, 5 lifestyle factors such as alcohol intake, lifetime cumulative exposure to estrogen (post OC and E + P use), and overall and visceral obesity. The risk profiles that combined those influential variables presented a synergistic effect on the increased risk for breast cancer in a gene–lifestyle dose-dependent manner.

One SNP near SALL1, in relation to CRP, both overall and in the obesity strata, is associated with breast cancer risk. SALL1 is a member of the SALL gene family, encoding a multiple zinc-finger transcription repressor that regulates organogenesis and development of embryonic stem cells^69,70,71. The role of the SALL genes (particularly SALL2 and SALL4) in tumorigenesis has recently been investigated as a tumor suppressor for ovarian and Wilms’ tumors^72,73, hepatoblastoma, and gastric carcinoma^74,75. However, the function of SALL1 in cancer development has not been determined. Few recent studies of in vivo RNAi screen and in vivo/in vitro breast cancer cells have implicated SALL1 as a tumor suppressor in breast cancer by inhibiting cancer cell growth, proliferation, and cell-cycle arrest, through the Nucleosome Remodeling and Deacetylase network⁷⁶ or by regulating CDH1, a contributor to epithelial-to-mesenchymal transition⁷⁷. Our finding of the SALL1 SNP’s association with CRP at the GWA level and with breast cancer risk is supported by these previous biologic studies and further suggests the involvement of SALL1 in immune mechanisms of breast cancer tumorigenesis.

HLA-DQA1 belongs to the human leukocyte antigen (HLA) class II alpha chain paralogues, which increase immune system sensitivity by distinguishing its own proteins from foreign invaders^78,79. HLA class II, the human version of the major histocompatibility complex (MHC) class II, regulates the antitumoral cellular immune response by presenting MHC antigen in tumor cells to the immune system, stimulating tumor infiltration of CD4 + T cells^80,81,82. Several previous studies reported that the SNPs of HLA class II have implications in the carcinogenesis of specific cancers (e.g., ovarian⁸³, squamous cell lung⁸⁴, gastric⁸⁵, and esophageal⁸⁶ cancers), but limited studies in association with breast cancer have been conducted and were restricted to subjects other than Caucasians; further, the results were inconsistent^80,81 or null⁸². Our study is the first to report the association of the HLA-DQA1 SNP with breast cancer risk in non-Hispanic white women, suggesting that HLA class II plays a decisive role in the pathogenesis of breast cancer in this population by diminishing the efficacy of the antitumoral immune response. Also, this association would have been missed without the incorporation of obesity factors, which calls for further study of the biologic mechanism.

A number of epidemiologic studies have revealed that alcohol intake, even of a small amount (e.g., ≤ 1 drink [moderate]/day), can increase breast cancer risk in both pre- and post-menopausal women^{66,87,88,89,90}. Notably, in postmenopausal women, few studies have examined the combined and joint effect of alcohol intake with other lifestyles^66,67,68 or relevant genetic variants^91,92 on breast cancer risk; in particular, the gene–lifestyle study results did not support a significantly increased risk among women who carried specific risk genotypes and had higher alcohol intake^91,92. Molecular biologic mechanisms of alcohol-associated tumorigenesis in breast cancer may involve complicated pathways: an elevated level of estrogen by testosterone conversion; an increased level of insulin-like growth factors from the liver due to alcohol consumption^93,94; and disruption of folate metabolism⁹⁵. Also, acetaldehyde, derived from the metabolism of ethanol, is a carcinogenic metabolite that causes formation of DNA adducts and inhibits DNA repair and methylation patterns^90,96. Further, high and regular alcohol intake may lead to a dietary deficiency of essential nutrients, making individuals susceptible to tumorigenesis⁹⁰. Corresponding to this alcohol-response tumorigenic environment, and supported by previous research⁶⁶, our study showed that more than moderate alcohol intake, jointly with the risk SNPs, substantially elevated the risk of breast cancer synergistically; and this synergistic effect occurred more strongly in the non-obese subgroups.

Another influential lifestyle factor in our study is the opposed E + P use that contributes to the lifetime cumulative exposure to estrogen. Synthetic progestin is a well-established risk factor for breast cancer^97,98,99, with an affinity for androgen and mineralocorticoid receptors, leading to cell proliferation and anti-apoptosis^97,100. Further, the joint effect of E + P use with the SNPs was profound in the non-obese subgroups, suggesting complementary pathways of sex hormones and obesity (i.e., the effect of sex hormones maximized in non-obese individuals with relatively lower hormone levels).

The amounts of daily dietary alcohol intake were obtained from self-reported food frequency questionnaires and then validated to be highly correlated with 1 month of food-diary records (r = 0.9)¹⁰¹. In addition, we confined our study population to non-Hispanic white postmenopausal women, limiting the generalizability of our study findings to other populations. Due to insufficient statistical power, we were unable to investigate the molecular subtypes of breast cancer. Despite several benefits from the 2-stage RSF multimodal and GMDR approaches, it can overfit the model owing to complicated analysis tasks, particularly in relatively small subgroups, so our results need to be replicated in an independent study with a large sample size.

Overall, in this study, the SNPs in proinflammatory cytokines previously identified as genome-wide significant had a synergistic effect on breast cancer risk by combining with lifestyle factors, including alcohol intake, lifetime cumulative exposure to estrogen, and obesity. Our findings warrant molecular biologic studies such as gene signature and aberrant cell signaling in relation to breast cancer in postmenopausal women who have a history of alcohol intake and estrogen use by different levels of obesity and related lifestyles. Our study may contribute to improved prediction accuracy and the ability to assess breast cancer risk, and suggest potential interventions for women who carry the risk genotypes, such as partial or absolute abstinence from alcohol intake, shorter duration of hormone therapy, and better weight control, potentially leading to an improved impact on the epigenetic aberrations and thus reducing the risk of breast cancer.

Data availability

The data that support the findings of this study are available in accordance with policies developed by the NHLBI and WHI in order to protect sensitive participant information and approved by the Fred Hutchinson Cancer Research Center, which currently serves as the IRB of record for the WHI. Data requests may be made by emailing helpdesk@WHI.org.

References

Coussens, L. M. & Werb, Z. Inflammation and cancer. Nature 420, 860–867. https://doi.org/10.1038/nature01322 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Disis, M. L. Immune regulation of cancer. J. Clin. Oncol. 28, 4531–4538. https://doi.org/10.1200/JCO.2009.27.2146 (2010).
Article CAS PubMed PubMed Central Google Scholar
Grivennikov, S. I., Greten, F. R. & Karin, M. Immunity, inflammation, and cancer. Cell 140, 883–899. https://doi.org/10.1016/j.cell.2010.01.025 (2010).
Article CAS PubMed PubMed Central Google Scholar
Hanahan, D. & Weinberg, R. A. Hallmarks of cancer: The next generation. Cell 144, 646–674. https://doi.org/10.1016/j.cell.2011.02.013 (2011).
Article CAS PubMed Google Scholar
Ollberding, N. J. et al. Prediagnostic leptin, adiponectin, C-reactive protein, and the risk of postmenopausal breast cancer. Cancer Prev. Res. 6, 188–195. https://doi.org/10.1158/1940-6207.CAPR-12-0374 (2013).
Article CAS Google Scholar
Perks, C. M. & Holly, J. M. Hormonal mechanisms underlying the relationship between obesity and breast cancer. Endocrinol. Metab. Clin. N. Am. 40, 485–507. https://doi.org/10.1016/j.ecl.2011.05.010 (2011).
Article CAS Google Scholar
Roberts, D. L., Dive, C. & Renehan, A. G. Biological mechanisms linking obesity and cancer risk: New perspectives. Annu. Rev. Med. 61, 301–316. https://doi.org/10.1146/annurev.med.080708.082713 (2010).
Article CAS PubMed Google Scholar
Pierce, B. L. et al. Elevated biomarkers of inflammation are associated with reduced survival among breast cancer patients. J. Clin. Oncol 27, 3437–3444. https://doi.org/10.1200/JCO.2008.18.9068 (2009).
Article CAS PubMed PubMed Central Google Scholar
Chan, D. S., Bandera, E. V., Greenwood, D. C. & Norat, T. Circulating C-reactive protein and breast cancer risk-systematic literature review and meta-analysis of prospective cohort studies. Cancer Epidemiol. Biomark. Prev. 24, 1439–1449. https://doi.org/10.1158/1055-9965.EPI-15-0324 (2015).
Article CAS Google Scholar
Purohit, A. & Reed, M. J. Regulation of estrogen synthesis in postmenopausal women. Steroids 67, 979–983. https://doi.org/10.1016/s0039-128x(02)00046-6 (2002).
Article CAS PubMed Google Scholar
Hanna, M. et al. Association between local inflammation and breast tissue age-related lobular involution among premenopausal and postmenopausal breast cancer patients. PLoS ONE 12, e0183579. https://doi.org/10.1371/journal.pone.0183579 (2017).
Article CAS PubMed PubMed Central Google Scholar
Bogaty, P. et al. Impact of prolonged cyclooxygenase-2 inhibition on inflammatory markers and endothelial function in patients with ischemic heart disease and raised C-reactive protein: A randomized placebo-controlled study. Circulation 110, 934–939. https://doi.org/10.1161/01.CIR.0000139338.12464.5F (2004).
Article CAS PubMed Google Scholar
Cherel, M. et al. Molecular screening of interleukin-6 gene promoter and influence of -174G/C polymorphism on breast cancer. Cytokine 47, 214–223. https://doi.org/10.1016/j.cyto.2009.06.011 (2009).
Article CAS PubMed Google Scholar
Yu, K. D. et al. Lack of an association between a functional polymorphism in the interleukin-6 gene promoter and breast cancer risk: A meta-analysis involving 25,703 subjects. Breast Cancer Res. Treat. 122, 483–488. https://doi.org/10.1007/s10549-009-0706-5 (2010).
Article CAS PubMed Google Scholar
Prizment, A. E. et al. Plasma C-reactive protein, genetic risk score, and risk of common cancers in the Atherosclerosis Risk in Communities study. Cancer Causes Control 24, 2077–2087. https://doi.org/10.1007/s10552-013-0285-y (2013).
Article PubMed Google Scholar
Connor, A. E. et al. Associations between ALOX, COX, and CRP polymorphisms and breast cancer among Hispanic and non-Hispanic white women: The breast cancer health disparities study. Mol. Carcinog. 54, 1541–1553. https://doi.org/10.1002/mc.22228 (2015).
Article CAS PubMed Google Scholar
Heikkila, K. et al. C-reactive protein-associated genetic variants and cancer risk: Findings from FINRISK 1992, FINRISK 1997 and Health 2000 studies. Eur. J. Cancer 47, 404–412. https://doi.org/10.1016/j.ejca.2010.07.032 (2011).
Article CAS PubMed Google Scholar
Amaral, W. Z., Krueger, R. F., Ryff, C. D. & Coe, C. L. Genetic and environmental determinants of population variation in interleukin-6, its soluble receptor and C-reactive protein: Insights from identical and fraternal twins. Brain Behav. Immun. 49, 171–181. https://doi.org/10.1016/j.bbi.2015.05.010 (2015).
Article CAS PubMed PubMed Central Google Scholar
Fried, S. K., Bunkin, D. A. & Greenberg, A. S. Omental and subcutaneous adipose tissues of obese subjects release interleukin-6: Depot difference and regulation by glucocorticoid. J. Clin. Endocrinol. Metab. 83, 847–850. https://doi.org/10.1210/jcem.83.3.4660 (1998).
Article CAS PubMed Google Scholar
Ligthart, S. et al. Genome analyses of >200,000 individuals identify 58 loci for chronic inflammation and highlight pathways that link inflammation and complex disorders. Am. J. Hum. Genet. 103, 691–706. https://doi.org/10.1016/j.ajhg.2018.09.009 (2018).
Article CAS PubMed PubMed Central Google Scholar
Dehghan, A. et al. Meta-analysis of genome-wide association studies in >80 000 subjects identifies multiple loci for C-reactive protein levels. Circulation 123, 731–738. https://doi.org/10.1161/CIRCULATIONAHA.110.948570 (2011).
Article CAS PubMed PubMed Central Google Scholar
Doumatey, A. P. et al. C-reactive protein (CRP) promoter polymorphisms influence circulating CRP levels in a genome-wide association study of African Americans. Hum. Mol. Genet. 21, 3063–3072. https://doi.org/10.1093/hmg/dds133 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ridker, P. M. et al. Loci related to metabolic-syndrome pathways including LEPR, HNF1A, IL6R, and GCKR associate with plasma C-reactive protein: The Women’s Genome Health Study. Am. J. Hum. Genet. 82, 1185–1192. https://doi.org/10.1016/j.ajhg.2008.03.015 (2008).
Article CAS PubMed PubMed Central Google Scholar
Reiner, A. P. et al. Genome-wide association and population genetic analysis of C-reactive protein in African American and Hispanic American women. Am. J. Hum. Genet. 91, 502–512. https://doi.org/10.1016/j.ajhg.2012.07.023 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hu, M., Lee, M. H., Mak, V. W. & Tomlinson, B. Effect of central obesity, low high-density lipoprotein cholesterol and C-reactive protein polymorphisms on C-reactive protein levels during treatment with Rosuvastatin (10 mg Daily). Am. J. Cardiol. 106, 1588–1593. https://doi.org/10.1016/j.amjcard.2010.07.044 (2010).
Article CAS PubMed Google Scholar
Wu, S. et al. Interactive effects of C-reactive protein levels on the association between APOE variants and triglyceride levels in a Taiwanese population. Lipids Health Dis. 15, 94. https://doi.org/10.1186/s12944-016-0262-z (2016).
Article CAS PubMed PubMed Central Google Scholar
Fraser, A. et al. Interleukin-6 and incident coronary heart disease: Results from the British Women’s Heart and Health Study. Atherosclerosis 202, 567–572. https://doi.org/10.1016/j.atherosclerosis.2008.04.048 (2009).
Article CAS PubMed Google Scholar
Winters-Stone, K. M., Wood, L. J., Stoyles, S. & Dieckmann, N. F. The effects of resistance exercise on biomarkers of breast cancer prognosis: A pooled analysis of three randomized trials. Cancer Epidemiol. Biomark. Prev. 27, 146–153. https://doi.org/10.1158/1055-9965.EPI-17-0766 (2018).
Article CAS Google Scholar
Lynch, B. M. et al. Associations of objectively assessed physical activity and sedentary time with biomarkers of breast cancer risk in postmenopausal women: Findings from NHANES (2003–2006). Breast Cancer Res. Treat. 130, 183–194. https://doi.org/10.1007/s10549-011-1559-2 (2011).
Article PubMed Google Scholar
van Gemert, W. A. et al. Effect of weight loss with or without exercise on inflammatory markers and adipokines in postmenopausal women: The SHAPE-2 trial, a randomized controlled trial. Cancer Epidemiol. Biomark. Prev. 25, 799–806. https://doi.org/10.1158/1055-9965.EPI-15-1065 (2016).
Article CAS Google Scholar
Rojo-Martinez, G. et al. Factors determining high-sensitivity C-reactive protein values in the Spanish population Diabetes study. Eur. J. Clin. Investig. 43, 1–10. https://doi.org/10.1111/eci.12002 (2013).
Article CAS Google Scholar
Dias, J. A. et al. A high quality diet is associated with reduced systemic inflammation in middle-aged individuals. Atherosclerosis 238, 38–44. https://doi.org/10.1016/j.atherosclerosis.2014.11.006 (2015).
Article CAS PubMed Google Scholar
Bermudez, E. A., Rifai, N., Buring, J. E., Manson, J. E. & Ridker, P. M. Relation between markers of systemic vascular inflammation and smoking in women. Am. J. Cardiol. 89, 1117–1119. https://doi.org/10.1016/s0002-9149(02)02284-1 (2002).
Article PubMed Google Scholar
Stewart, S. H., Mainous, A. G. 3rd. & Gilbert, G. Relation between alcohol consumption and C-reactive protein levels in the adult US population. J. Am. Board Fam. Pract. 15, 437–442 (2002).
PubMed Google Scholar
Dossus, L. et al. C-reactive protein and postmenopausal breast cancer risk: Results from the E3N cohort study. Cancer Causes Control 25, 533–539. https://doi.org/10.1007/s10552-014-0355-9 (2014).
Article PubMed Google Scholar
Fairey, A. S. et al. Effect of exercise training on C-reactive protein in postmenopausal breast cancer survivors: A randomized controlled trial. Brain Behav. Immun. 19, 381–388. https://doi.org/10.1016/j.bbi.2005.04.001 (2005).
Article CAS PubMed Google Scholar
Healy, L. A. et al. Metabolic syndrome, central obesity and insulin resistance are associated with adverse pathological features in postmenopausal breast cancer. Clin. Oncol. 22, 281–288. https://doi.org/10.1016/j.clon.2010.02.001 (2010).
Article CAS Google Scholar
Agresti, R. et al. Association of adiposity, dysmetabolisms, and inflammation with aggressive breast cancer subtypes: A cross-sectional study. Breast Cancer Res. Treat. 157, 179–189. https://doi.org/10.1007/s10549-016-3802-3 (2016).
Article CAS PubMed Google Scholar
Ellis, J. et al. Large multiethnic Candidate Gene Study for C-reactive protein levels: Identification of a novel association at CD36 in African Americans. Hum. Genet. 133, 985–995. https://doi.org/10.1007/s00439-014-1439-z (2014).
Article CAS PubMed PubMed Central Google Scholar
American Cancer Society. Breast Cancer Facts & Figures 2017–2018 (American Cancer Society, Inc., Atlanta, 2017). https://www.cancer.org/content/dam/cancer-org/research/cancer-facts-and-statistics/breast-cancer-facts-and-figures/breast-cancer-facts-and-figures-2017-2018.pdf.
American Cancer Society. Cancer Fact and Figures, 2019 (American Cancer Society, Inc., Atlanta). https://www.cancer.org/content/dam/cancerorg/research/cancer-facts-and-statistics/annual-cancer-facts-and-figures/2019/cancer-facts-and-figures-2019.pdf.
Jung, S. Y. et al. Genome-wide association analysis of pro-inflammatory cytokines and gene–lifestyle interaction for invasive breast cancer risk: The WHI dbGaP Study. Cancer Prev. Res. https://doi.org/10.1158/1940-6207.CAPR-20-0256 (2020).
Article Google Scholar
Schick, U. M. et al. Association of exome sequences with plasma C-reactive protein levels in >9000 participants. Hum. Mol. Genet. 24, 559–571. https://doi.org/10.1093/hmg/ddu450 (2015).
Article CAS PubMed Google Scholar
Prasad, G., Giri, A. K., Basu, A., Tandon, N. & Bharadwaj, D. Genomewide association study for C-reactive protein in Indians replicates known associations of common variants. J. Genet. 98, 20 (2019).
Article PubMed Google Scholar
Lou, X. Y. et al. A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence. Am. J. Hum. Genet. 80, 1125–1137. https://doi.org/10.1086/518312 (2007).
Article CAS PubMed PubMed Central Google Scholar
Hou, T. T. et al. Generalized multifactor dimensionality reduction approaches to identification of genetic interactions underlying ordinal traits. Genet. Epidemiol. 43, 24–36. https://doi.org/10.1002/gepi.22169 (2019).
Article PubMed Google Scholar
Xu, H. M. et al. GMDR: Versatile software for detecting gene–gene and gene–environment interactions underlying complex traits. Curr. Genomics 17, 396–402. https://doi.org/10.2174/1389202917666160513102612 (2016).
Article CAS PubMed PubMed Central Google Scholar
HM, X. et al. GMDR: Versatile software for detecting gene-gene and gene-environment interactions underlying complex traits, http://ibi.zju.edu.cn/software/GMDR/download.html (2019).
The Women’s Health Initiative Study Group. Design of the Women’s Health Initiative clinical trial and observational study. Control Clin. Trials 19, 61–109 (1998).
Article Google Scholar
NCBI: WHI Harmonized and Imputed GWAS Data. A sub-study of Women's Health Initiative (2019). https://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000746.v3.p3.
Orchard, T. S., Andridge, R. R., Yee, L. D. & Lustberg, M. B. Diet quality, inflammation, and quality of life in breast cancer survivors: A cross-sectional analysis of pilot study data. J. Acad. Nutr. Diet. 118, 578-588.e571. https://doi.org/10.1016/j.jand.2017.09.024 (2018).
Article PubMed Google Scholar
Simone, V. et al. Obesity and breast cancer: Molecular interconnections and potential clinical applications. Oncologist 21, 404–417. https://doi.org/10.1634/theoncologist.2015-0351 (2016).
Article CAS PubMed PubMed Central Google Scholar
Gunter, M. J. et al. Circulating adipokines and inflammatory markers and postmenopausal breast cancer risk. J. Natl. Cancer Inst. https://doi.org/10.1093/jnci/djv169 (2015).
Article PubMed PubMed Central Google Scholar
Pfeiffer, R. M., Webb-Vargas, Y., Wheeler, W. & Gail, M. H. Proportion of U.S. trends in breast cancer incidence attributable to long-term changes in risk factor distributions. Cancer Epidemiol. Biomark. Prev. 27, 1214–1222. https://doi.org/10.1158/1055-9965.EPI-18-0098 (2018).
Article Google Scholar
National Cancer Institute. (1993).
Jung, S. Y. et al. Genome-wide meta-analysis of gene-environmental interaction for insulin resistance phenotypes and breast cancer risk in postmenopausal women. Cancer Prev. Res. 12, 31–42. https://doi.org/10.1158/1940-6207.CAPR-18-0180 (2019).
Article CAS Google Scholar
NCBI: WHI Harmonized and Imputed GWAS Data. A sub-study of Women's Health Initiative. http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000746.v1.p3.
Schumacher, F. R. et al. Association analyses of more than 140,000 men identify 63 new prostate cancer susceptibility loci. Nat. Genet. 50, 928–936. https://doi.org/10.1038/s41588-018-0142-8 (2018).
Article CAS PubMed PubMed Central Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873. https://doi.org/10.1093/bioinformatics/btq559 (2010).
Article CAS PubMed PubMed Central Google Scholar
Ishwaran, H. & Kogalur, U. B. Random Survival Forests for R. (2007). https://pdfs.semanticscholar.org/951a/84f0176076fb6786fdf43320e8b27094dcfa.pdf.
Chung, R. H. & Chen, Y. E. A two-stage random forest-based pathway analysis method. PLoS ONE 7, e36662. https://doi.org/10.1371/journal.pone.0036662 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Harrell, F. E. Jr., Califf, R. M., Pryor, D. B., Lee, K. L. & Rosati, R. A. Evaluating the yield of medical tests. JAMA 247, 2543–2546 (1982).
Article PubMed Google Scholar
Mogensen, U. B., Ishwaran, H. & Gerds, T. A. Evaluating random forests for survival analysis using prediction error curves. J. Stat. Softw. 50, 1–23 (2012).
Article PubMed PubMed Central Google Scholar
Hamidi, O., Poorolajal, J., Farhadian, M. & Tapak, L. Identifying important risk factors for survival in kidney graft failure patients using random survival forests. Iran. J. Public Health 45, 27–33 (2016).
PubMed PubMed Central Google Scholar
Wiens, B. L., Dmitrienko, A. & Marchenko, O. Selection of hypothesis weights and ordering when testing multiple hypotheses in clinical trials. J. Biopharm. Stat. 23, 1403–1419. https://doi.org/10.1080/10543406.2013.834920 (2013).
Article MathSciNet PubMed Google Scholar
Arriaga, M. E. et al. The preventable burden of breast cancers for premenopausal and postmenopausal women in Australia: A pooled cohort study. Int. J. Cancer 145, 2383–2394. https://doi.org/10.1002/ijc.32231 (2019).
Article CAS PubMed Google Scholar
Guinter, M. A., McLain, A. C., Merchant, A. T., Sandler, D. P. & Steck, S. E. An estrogen-related lifestyle score is associated with risk of postmenopausal breast cancer in the PLCO cohort. Breast Cancer Res. Treat. 170, 613–622. https://doi.org/10.1007/s10549-018-4784-0 (2018).
Article PubMed PubMed Central Google Scholar
Tamimi, R. M. et al. Population attributable risk of modifiable and nonmodifiable breast cancer risk factors in postmenopausal breast cancer. Am. J. Epidemiol. 184, 884–893. https://doi.org/10.1093/aje/kww145 (2016).
Article PubMed PubMed Central Google Scholar
Chi, D., Zhang, W., Jia, Y., Cong, D. & Hu, S. Spalt-like transcription factor 1 (SALL1) gene expression inhibits cell proliferation and cell migration of human glioma cells through the Wnt/beta-catenin signaling pathway. Med. Sci. Monit. Basic Res. 25, 128–138. https://doi.org/10.12659/MSMBR.915067 (2019).
Article PubMed PubMed Central Google Scholar
Gene Card: Human Gene Database: SALL1 Gene (Protein Coding), https://www.genecards.org/cgi-bin/carddisp.pl?gene=SALL1 (2020).
Genetic Home Reference—SALL1 Gene, https://ghr.nlm.nih.gov/gene/SALL1 (2020).
Li, D., Tian, Y., Ma, Y. & Benjamin, T. p150(Sal2) is a p53-independent regulator of p21(WAF1/CIP). Mol. Cell. Biol. 24, 3885–3893. https://doi.org/10.1128/mcb.24.9.3885-3893.2004 (2004).
Article CAS PubMed PubMed Central Google Scholar
Ma, Y. et al. Cloning and characterization of two promoters for the human HSAL2 gene and their transcriptional repression by the Wilms tumor suppressor gene product. J. Biol. Chem. 276, 48223–48230. https://doi.org/10.1074/jbc.M106468200 (2001).
Article CAS PubMed Google Scholar
Gnemmi, V. et al. SALL4 is a marker of the embryonal subtype of hepatoblastoma. Histopathology 63, 425–428. https://doi.org/10.1111/his.12187 (2013).
Article PubMed Google Scholar
Ushiku, T. et al. SALL4 represents fetal gut differentiation of gastric cancer, and is diagnostically useful in distinguishing hepatoid gastric carcinoma from hepatocellular carcinoma. Am. J. Surg. Pathol. 34, 533–540. https://doi.org/10.1097/PAS.0b013e3181d1dcdd (2010).
Article PubMed Google Scholar
Ma, C. et al. SALL1 functions as a tumor suppressor in breast cancer by regulating cancer cell senescence and metastasis through the NuRD complex. Mol. Cancer 17, 78. https://doi.org/10.1186/s12943-018-0824-y (2018).
Article CAS PubMed PubMed Central Google Scholar
Wolf, J. et al. An in vivo RNAi screen identifies SALL1 as a tumor suppressor in human breast cancer with a role in CDH1 regulation. Oncogene 33, 4273–4278. https://doi.org/10.1038/onc.2013.515 (2014).
Article CAS PubMed Google Scholar
Gene Card: Human Gene Database: HLA-DQA1 Gene (Protein Coding), https://www.genecards.org/cgi-bin/carddisp.pl?gene=HLA-DQA1 (2020).
Genetic Home Reference - HLA-DQA1 Gene, https://ghr.nlm.nih.gov/gene/HLA-DQA1 (2020).
Mahmoodi, M. et al. HLA-DRB1,-DQA1 and -DQB1 allele and haplotype frequencies in female patients with early onset breast cancer. Pathol. Oncol. Res. 18, 49–55. https://doi.org/10.1007/s12253-011-9415-6 (2012).
Article CAS PubMed Google Scholar
Cantu de Leon, D. et al. High resolution human leukocyte antigen (HLA) class I and class II allele typing in Mexican mestizo women with sporadic breast cancer: Case-control study. BMC Cancer 9, 48. https://doi.org/10.1186/1471-2407-9-48 (2009).
Article CAS PubMed PubMed Central Google Scholar
Chen, P. C., Tsai, E. M., Er, T. K., Chang, S. J. & Chen, B. H. HLA-DQA1 and -DQB1 allele typing in southern Taiwanese women with breast cancer. Clin. Chem. Lab. Med. 45, 611–614. https://doi.org/10.1515/CCLM.2007.132 (2007).
Article CAS PubMed Google Scholar
Kubler, K. et al. HLA-class II haplotype associations with ovarian cancer. Int. J. Cancer 119, 2980–2985. https://doi.org/10.1002/ijc.22266 (2006).
Article CAS PubMed Google Scholar
Kohno, T. et al. Contribution of the TP53, OGG1, CHRNA3, and HLA-DQA1 genes to the risk for lung squamous cell carcinoma. J. Thorac. Oncol. 6, 813–817. https://doi.org/10.1097/JTO.0b013e3181ee80ef (2011).
Article PubMed Google Scholar
Huang, L. M. et al. Association between HLA-DQA1 gene copy number polymorphisms and susceptibility to gastric cancer. Zhonghua zhong liu za zhi Chin. J. Oncol. 34, 269–271. https://doi.org/10.3760/cma.j.issn.0253-3766.2012.04.007 (2012).
Article CAS Google Scholar
Shen, F. F. et al. High expression of HLA-DQA1 predicts poor outcome in patients with esophageal squamous cell carcinoma in Northern China. Medicine 98, e14454. https://doi.org/10.1097/MD.0000000000014454 (2019).
Article CAS PubMed PubMed Central Google Scholar
Baan, R. et al. Carcinogenicity of alcoholic beverages. Lancet Oncol. 8, 292–293. https://doi.org/10.1016/s1470-2045(07)70099-2 (2007).
Article PubMed Google Scholar
Henley, S. J. et al. Alcohol control efforts in comprehensive cancer control plans and alcohol use among adults in the USA. Alcohol Alcohol 49, 661–667. https://doi.org/10.1093/alcalc/agu064 (2014).
Article CAS PubMed Google Scholar
Nomura, S. J., Inoue-Choi, M., Lazovich, D. & Robien, K. WCRF/AICR recommendation adherence and breast cancer incidence among postmenopausal women with and without non-modifiable risk factors. Int. J. Cancer 138, 2602–2615. https://doi.org/10.1002/ijc.29994 (2016).
Article CAS PubMed PubMed Central Google Scholar
(World Cancer Research Fund and American Institute for Cancer Research., Washington DC: AICR, 2007).
Hahn, M., Simons, C., Weijenberg, M. P. & van den Brandt, P. A. Alcohol drinking, ADH1B and ADH1C genotypes and the risk of postmenopausal breast cancer by hormone receptor status: The Netherlands Cohort Study on diet and cancer. Carcinogenesis 39, 1342–1351. https://doi.org/10.1093/carcin/bgy101 (2018).
Article CAS PubMed Google Scholar
Maruti, S. S., Ulrich, C. M., Jupe, E. R. & White, E. MTHFR C677T and postmenopausal breast cancer risk by intakes of one-carbon metabolism nutrients: A nested case-control study. Breast Cancer Res. 11, R91. https://doi.org/10.1186/bcr2462 (2009).
Article CAS PubMed PubMed Central Google Scholar
Singletary, K. W. & Gapstur, S. M. Alcohol and breast cancer: Review of epidemiologic and experimental evidence and potential mechanisms. JAMA 286, 2143–2151. https://doi.org/10.1001/jama.286.17.2143 (2001).
Article CAS PubMed Google Scholar
Gavaler, J. S. & Van Thiel, D. H. The association between moderate alcoholic beverage consumption and serum estradiol and testosterone levels in normal postmenopausal women: Relationship to the literature. Alcohol. Clin. Exp. Res. 16, 87–92. https://doi.org/10.1111/j.1530-0277.1992.tb00642.x (1992).
Article CAS PubMed Google Scholar
Pereira, M. A. et al. Fast-food habits, weight gain, and insulin resistance (the CARDIA study): 15-year prospective analysis. Lancet 365, 36–42. https://doi.org/10.1016/S0140-6736(04)17663-0 (2005).
Article PubMed Google Scholar
Seitz, H. K. & Stickel, F. Acetaldehyde as an underestimated risk factor for cancer development: Role of genetics in ethanol metabolism. Genes Nutr. 5, 121–128. https://doi.org/10.1007/s12263-009-0154-1 (2010).
Article CAS PubMed Google Scholar
Cogliano, V. et al. Carcinogenicity of combined oestrogen-progestagen contraceptives and menopausal treatment. Lancet Oncol. 6, 552–553 (2005).
Article PubMed Google Scholar
Gartlehner, G. et al. Hormone therapy for the primary prevention of chronic conditions in postmenopausal women: Evidence report and systematic review for the US preventive services task force. JAMA 318, 2234–2249. https://doi.org/10.1001/jama.2017.16952 (2017).
Article PubMed Google Scholar
Rossouw, J. E. et al. Risks and benefits of estrogen plus progestin in healthy postmenopausal women: Principal results From the Women’s Health Initiative randomized controlled trial. JAMA 288, 321–333 (2002).
Article CAS PubMed Google Scholar
Asi, N. et al. Progesterone vs. synthetic progestins and the risk of breast cancer: A systematic review and meta-analysis. Syst. Rev. 5, 121. https://doi.org/10.1186/s13643-016-0294-5 (2016).
Article PubMed PubMed Central Google Scholar
Giovannucci, E. et al. The assessment of alcohol consumption by a simple self-administered questionnaire. Am. J. Epidemiol. 133, 810–817. https://doi.org/10.1093/oxfordjournals.aje.a115960 (1991).
Article CAS PubMed Google Scholar
Haskell, W. L. et al. Physical activity and public health: Updated recommendation for adults from the American College of Sports Medicine and the American Heart Association. Med. Sci. Sports Exerc. 39, 1423–1434. https://doi.org/10.1249/mss.0b013e3180616b27 (2007).
Article PubMed Google Scholar
Organization, t. W. H. In Report of a WHO Expert Consultation Geneva, 8–11 December 2008 (The World Health Organization, Geneva, 2011)

Download references

Acknowledgements

Part of the data for this project was provided by the WHI program, which is funded by the National Heart, Lung, and Blood Institute, the National Institutes of Health, and the U.S. Department of Health and Human Services through contracts HHSN268201100046C, HHSN268201100001C, HHSN268201100002C, HHSN268201100003C, HHSN268201100004C, and HHSN271201100004C. The datasets used for the analyses described in this manuscript were obtained from dbGaP at http://www.ncbi.nlm.nih.gov/sites/entrez?db=gap through dbGaP accession (phs000200.v11.p3).

Program Office National Heart, Lung, and Blood Institute, Bethesda, MD: Jacques Rossouw, Shari Ludlam, Dale Burwen, Joan McGowan, Leslie Ford, and Nancy Geller.

Clinical Coordinating Center Fred Hutchinson Cancer Research Center, Seattle, WA: Garnet Anderson, Ross Prentice, Andrea LaCroix, and Charles Kooperberg.

Investigators and Academic Centers JoAnn E. Manson, Brigham and Women's Hospital, Harvard Medical School, Boston, MA; Barbara V. Howard, MedStar Health Research Institute/Howard University, Washington, DC; Marcia L. Stefanick, Stanford Prevention Research Center, Stanford, CA; Rebecca Jackson, The Ohio State University, Columbus, OH; Cynthia A. Thomson, University of Arizona, Tucson/Phoenix, AZ; Jean Wactawski-Wende, University at Buffalo, Buffalo, NY; Marian Limacher, University of Florida, Gainesville/Jacksonville, FL; Robert Wallace, University of Iowa, Iowa City/Davenport, IA; Lewis Kuller, University of Pittsburgh, Pittsburgh, PA; and Sally Shumaker, Wake Forest University School of Medicine, Winston-Salem, NC.

Funding

This study was supported by the National Institute of Nursing Research of the National Institutes of Health under Award Number K01NR017852 and a University of California Cancer Research Coordinating Committee grant (CRN-18-522722).

Author information

Authors and Affiliations

Translational Sciences Section, Jonsson Comprehensive Cancer Center, School of Nursing, University of California, Los Angeles, 700 Tiverton Ave, 3-264 Factor Building, Los Angeles, CA, 90095, USA
Su Yon Jung
Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Jeanette C. Papp & Eric M. Sobel
Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Eric M. Sobel
Department of Molecular, Cell and Developmental Biology, Life Sciences Division, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Matteo Pellegrini
Cancer Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
Herbert Yu
Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Zuo-Feng Zhang
Center for Human Nutrition, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, 90095, USA
Zuo-Feng Zhang

Authors

Su Yon Jung
View author publications
You can also search for this author in PubMed Google Scholar
Jeanette C. Papp
View author publications
You can also search for this author in PubMed Google Scholar
Eric M. Sobel
View author publications
You can also search for this author in PubMed Google Scholar
Matteo Pellegrini
View author publications
You can also search for this author in PubMed Google Scholar
Herbert Yu
View author publications
You can also search for this author in PubMed Google Scholar
Zuo-Feng Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.Y.J., J.P., E.S., M.P., H.Y. and Z.Z. designed the study. S.Y.J. performed the genomic data QC and the statistical analysis, and interpreted the data. J.P. and E.S. supervised the genomic data QC and analysis. M.P. and H.Y. participated in the study coordination and interpreting the data. S.Y.J. secured funding for this project. Z.Z. supervised the project. All participated in the paper writing and editing. All authors have read and approved the submission of the manuscript.

Corresponding author

Correspondence to Su Yon Jung.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Jung, S.Y., Papp, J.C., Sobel, E.M. et al. Pro-inflammatory cytokine polymorphisms and interactions with dietary alcohol and estrogen, risk factors for invasive breast cancer using a post genome-wide analysis for gene–gene and gene–lifestyle interaction. Sci Rep 11, 1058 (2021). https://doi.org/10.1038/s41598-020-80197-1

Download citation

Received: 16 June 2020
Accepted: 17 December 2020
Published: 13 January 2021
DOI: https://doi.org/10.1038/s41598-020-80197-1

This article is cited by

HLA-DQA1 expression is associated with prognosis and predictable with radiomics in breast cancer
- JingYu Zhou
- TingTing Xie
- GuanXun Cheng
Radiation Oncology (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.