Circulating amino acids and amino acid-related metabolites and risk of breast cancer among predominantly premenopausal women

Zeleznik, Oana A.; Balasubramanian, Raji; Zhao, Yibai; Frueh, Lisa; Jeanfavre, Sarah; Avila-Pacheco, Julian; Clish, Clary B.; Tworoger, Shelley S.; Eliassen, A. Heather

doi:10.1038/s41523-021-00262-4

Download PDF

Article
Open access
Published: 18 May 2021

Circulating amino acids and amino acid-related metabolites and risk of breast cancer among predominantly premenopausal women

npj Breast Cancer volume 7, Article number: 54 (2021) Cite this article

3007 Accesses
14 Citations
3 Altmetric
Metrics details

Subjects

Abstract

Known modifiable risk factors account for a small fraction of premenopausal breast cancers. We investigated associations between pre-diagnostic circulating amino acid and amino acid-related metabolites (N = 207) and risk of breast cancer among predominantly premenopausal women of the Nurses’ Health Study II using conditional logistic regression (1057 cases, 1057 controls) and multivariable analyses evaluating all metabolites jointly. Eleven metabolites were associated with breast cancer risk (q-value < 0.2). Seven metabolites remained associated after adjustment for established risk factors (p-value < 0.05) and were selected by at least one multivariable modeling approach: higher levels of 2-aminohippuric acid, kynurenic acid, piperine (all three with q-value < 0.2), DMGV and phenylacetylglutamine were associated with lower breast cancer risk (e.g., piperine: OR_adjusted (95%CI) = 0.84 (0.77–0.92)) while higher levels of creatine and C40:7 phosphatidylethanolamine (PE) plasmalogen were associated with increased breast cancer risk (e.g., C40:7 PE plasmalogen: OR_adjusted (95%CI) = 1.11 (1.01–1.22)). Five amino acids and amino acid-related metabolites (2-aminohippuric acid, DMGV, kynurenic acid, phenylacetylglutamine, and piperine) were inversely associated, while one amino acid and a phospholipid (creatine and C40:7 PE plasmalogen) were positively associated with breast cancer risk among predominately premenopausal women, independent of established breast cancer risk factors.

Associations of dietary patterns with brain health from behavioral, neuroimaging, biochemical and genetic analyses

Article Open access 01 April 2024

Fasting-mimicking diet causes hepatic and blood markers changes indicating reduced biological age and disease risk

Article Open access 20 February 2024

Persistent elevation of plasma vitamin B12 is strongly associated with solid cancer

Article Open access 25 June 2021

Introduction

Breast cancer is the most common malignancy among women in the United States, with more than 250,000 cases diagnosed each year¹. Known modifiable risk factors are estimated to account for only around one-third of postmenopausal breast cancers^2,3,4, and an even smaller fraction of premenopausal cancers^2,5. Thus, new strategies are needed for the identification of modifiable risk factors, especially for premenopausal breast cancers.

Metabolites are small molecules that are produced and consumed by cellular metabolism. The study of the complete collection of metabolites, called metabolomics, provides a direct signature of cellular activity in the body and has emerged as a powerful tool for the diagnosis, characterization, and prediction of disease. Metabolomic methods have uncovered biomarkers for a wide variety of cancers including colorectal, gastric, pancreatic, liver, ovarian, breast, urinary, esophageal, and lung⁶. In breast cancer, metabolomics has proven useful for tumor biology characterization, predicting treatment response, anticipating recurrence, and estimating prognosis⁷.

More recently, prospective epidemiological studies have used metabolomics to identify metabolite risk factors for several cancers including pancreatic^8,9,10, prostate^11,12, liver¹³, colorectal¹⁴, ovarian^15,16, endometrial¹⁷, and breast cancer^{18,19,20,21,22}. For breast cancer, studies have used both targeted^18,21,23 and untargeted^20,22 methods to discover metabolomic risk factors associated with diet^20,23, body mass index (BMI)²¹, microbiota metabolism²⁰, lipid, amino acid, and other metabolic pathways^18,20,22. While several studies stratified results by estrogen receptor (ER) status^18,21,23, no prospective metabolomic breast cancer studies have investigated differential effects by menopausal status.

In this study, we assessed the association of over 200 prospectively measured circulating amino acid and amino acid-related metabolites with risk of breast cancer among the predominantly premenopausal women (1057 cases and 1057 matched controls) of the Nurses’ Health Study II (NHSII).

Results

Study population

1057 cases and 1057 matched controls were included in this study (Table 1). Women were an average 53 years old and predominantly premenopausal (80%) at the time of blood collection. At diagnosis, 42% of the women were premenopausal and 46% were postmenopausal. The mean time between blood collection and diagnosis was 8 years (SD = 4.4), ranging from 10 months to 17.4 years. (1st quartile: 4.25 years; 3rd quartile: 11.6 years).

Table 1 Characteristics of breast cancer cases and matched controls at blood collection in the Nurses’ Health Study II, mean (SD) or %.

Full size table

Conditional logistic regression (CLR)

Eleven metabolites were significantly associated with risk of breast cancer based on the simple model (Fig. 1 and Supplementary Table 1). Six metabolites were associated with lower risk while five metabolites were associated with higher risk of overall breast cancer. Dimethylguanidino valeric acid (DMGV; OR per 1-SD increase (95% CI) = 0.84 (0.77–0.92)), 2-aminohippuric acid (OR (95% CI) = 0.84 (0.76–0.92)), and piperine (OR (95% CI) = 0.85 (0.78–0.93)) had the strongest inverse associations. C40:7 phosphatidylethanolamine (PE) plasmalogen (OR (95% CI) = 1.14 (1.05–1.25)) and asparagine (OR (95% CI) = 1.14 (1.04–1.26)) had the strongest positive associations. Creatine was the only metabolite with q-value > 0.2 in the simple model but is included here as q-value < 0.2 in both Lasso models. Results were similar when we included adjustment for breast cancer risk factors (DMGV: 0.88 (0.79–0.97); 2-aminohippuric acid: 0.85 (0.77–0.93); C40:7 PE: plasmalogen: 1.11 (1.01–1.22); asparagine: 1.10 (1.00–1.22)) and when we restricted to premenopausal women (Fig. 1 and Supplementary Table 2) or ER+ tumors (Fig. 1 and Supplementary Table 3).

Fig. 1: Odds ratios and 95% confidence intervals (CI) per 1 SD increase for metabolites significantly associated with risk of overall breast cancer (p-value < 0.05 and q-value < 0.2) in Nurses’ Health Study II, among premenopausal women only, by BMI category (<25, ≥25)***, and by ER status (ER positive, ER negative)****.

Among the 11 selected metabolites, only DMGV showed effect modification by BMI with stronger associations among women with high BMI (adjusted model, high BMI: OR (95% CI) = 0.82 (0.70–0.97); normal BMI: OR (95% CI) = 0.94 (0.83–1.06); p-interaction = 0.04). Significant interactions with BMI were observed for several additional metabolites that were not significant overall; for example C36:4 DAG/TAG fragment (high BMI: OR (95% CI) = 0.92 (0.80–1.06); normal BMI: OR (95% CI) = 1.21 (1.06–1.37); p-interaction = 0.005), serine (high BMI: OR (95% CI) = 1.19 (1.04–1.38); normal BMI: OR (95% CI) = 0.93 (0.83–1.05); p-interaction = 0.008), and proline betaine (high BMI: OR (95% CI) = 0.82 (0.72–0.94); normal BMI: OR (95% CI) = 1.04 (0.92–1.17); p-interaction = 0.017).

Of the 11 selected metabolites, we only observed stronger associations with risk of ER- tumors in the adjusted model (ER−; Supplementary Table 3) for asparagine (ER− tumors: OR (95% CI) = 1.21 (0.99–1.47); ER+ tumors: OR (95% CI) = 1.06 (0.94–1.18); p-heterogeneity = 0.03). Three additional metabolites were suggestively different by ER status: betaine (ER− tumors: OR (95% CI) = 0.89 (0.73–1.09); ER+ tumors: OR (95% CI) = 1.07 (0.96–1.20); p-heterogeneity = 0.02), 4-acetamidobutanate (ER− tumors: OR (95% CI) = 0.89 (0.72–1.09); ER+ tumors: OR (95% CI) = 0.97 (0.87–1.08); p-heterogeneity = 0.03), and histidine (ER− tumors: OR (95% CI) = 1.06 (0.87–1.29); ER+ tumors: OR (95% CI) = 1.01 (0.91–1.13); p-heterogeneity = 0.05).

In an exploratory analysis including metabolites with >10% missing values (N = 19; four metabolites had >90% missingness), metoprolol (45% missing values) was nominally significantly associated with risk of breast cancer (likelihood-ratio test p-value = 0.04; data not shown). Women with detectable metoprolol levels had a 23% higher risk (presence–absence indicator p-value = 0.07) of breast cancer compared to women with undetectable metoprolol. However, among women with measured metoprolol, higher levels were associated with lower risk (OR per one unit increase in log-transformed and standardized metabolite levels = 0.91, p-value = 0.14). The remaining metabolites with high missingness were not associated with risk of breast cancer.

Multivariable models of the joint association of all metabolites

The inverse association of piperine with risk of breast cancer met the threshold for statistical significance (p-value < 0.05 and q-value < 0.20) in all three simple (without adjustment for risk factors) multivariable models, Lasso, Elastic Net, and Random Forests (Tables 2 and 3). In addition, DMGV and N2,N2-dimethylguanosine were detected in the Random Forests model with minimal adjustment, satisfying a q-value threshold of 0.05. Higher levels of C40:7 PE plasmalogen and creatine were associated with increased breast cancer risk in CLR Lasso models (q-value < 0.20). These associations remained significant after further adjustment for risk factors (nominal p < 0.05) with the exception of N2,N2-dimethylguanosine in Random Forests (p = 0.05) and piperine in Elastic Net (nominal p-value < 0.05, q-value > 0.2).

Table 2 All metabolites that met q-value < 0.2 (marked as √ in the table) in at least one primary model analysis that adjusts for matching factors (*marks p-value < 0.05^a).

Full size table

Table 3 All metabolites that met q-value < 0.2 in at least one primary model analysis that adjusts for matching factors. M1 is the simple model^a (accounting for matching factors) and M2 is the adjusted model^b (adjusting for matching factors and breast cancer risk factors).

Full size table

When all eleven identified metabolites were assessed together in an adjusted lasso CLR model, ten metabolites remained independently associated with risk of breast cancer. The direction of association in the lasso CLR model was consistent with the previous CLR and multivariable models except for N2,N2-dimethylguanosine who’s coefficient was estimated to be equal to zero, reflecting its high correlation with kynurenic acid and 2-aminohippuric acid (Fig. 2).

**Fig. 2: Heatmap of all pairwise correlations among the 11 metabolites associated with breast cancer risk.**

Discussion

We conducted a large-scale study of 207 circulating amino acid and amino acid-related metabolites, and risk of breast cancer in a nested case-control study (1057 cases and 1057 matched controls) within NHSII, a cohort of predominantly premenopausal women. Higher levels of six metabolites, 2-aminohippuric acid, DMGV, kynurenic acid, N2, N2-dimethylguanosine, phenylacetyl glutamine, and piperine, were associated with lower breast cancer risk while higher levels of asparagine, creatine, and three lipids, C20:1 LPC, C34:3 PC plasmalogen, C40:7 PE plasmalogen, were associated with increased breast cancer risk. Inverse associations between 2-aminohippuric acid, DMGV, kynurenic acid, phenylacetyl glutamine and piperine, and the positive associations with creatine and C40:7 PE plasmalogen remained statistically significant after adjusting for established risk factors and were selected by at least one multivariable modeling approach (Lasso, Elastic Net, Random Forests). Notably, associations between 2-aminohippuric acid, piperine, and kynurenic acid remained significant even after multiple testing correction. None of the metabolites showed effect modification by BMI, except DMGV. None of the metabolites showed heterogeneity by ER status, except asparagine.

Piperine is a polyphenol responsible for the pungency of black and long pepper and exhibits a wide range of properties: anti-diabetic, anti-inflammatory, immunomodulatory, reduction of insulin resistance, and enhanced drug bioavailability^24,25,26. Piperine also inhibits tumorigenesis, tumor angiogenesis, cancer cell proliferation, cancer cell migration and invasion, and enhances apoptosis and autophagy²⁷. Experimental and cell line studies identified anti-breast cancer specific mechanisms of action, including decreased matrix metalloproteinase 9 (MMP-9) and MMP-13 expression, induced apoptosis through activation of caspase-3 and inhibition of human epidermal growth factor receptor 2 (HER2) gene expression²⁸. Synergetic effects of piperine and chemotherapy drugs (paclitaxel, doxorubicin), hormone therapy drugs (tamoxifen), radiotherapy, TRAIL-based and nano-delivery-based therapy drugs (paclitaxel, rapamycine) were observed²⁸. Notably, piperine inhibited growth and motility²⁹, and enhanced efficacy of TRAIL-based therapy³⁰ in triple-negative breast cancer cells, the most aggressive breast cancer subtype. In a previous study of the associations of diet-related metabolites and breast cancer risk within the Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer screening trial (n = 1242, 621 cases), piperine was found to be modestly correlated with liquor consumption (correlation = 0.16) and similar to our study, inversely associated with breast cancer risk (OR comparing 90th versus 10th percentile = 0.74 (0.56–0.99, p = 0.045)), after adjusting for BMI and other potential confounders²³. Similarly, piperine was associated with lower risk of breast cancer (OR per 1 SD increase = 0.94 (0.89–0.99), p = 0.01) in the SU.VI.MAX cohort (n = 400, 200 cases)²⁰. Notably, PLCO women were postmenopausal while the SU.VI.MAX cohort included premenopausal and postmenopausal women at the time of blood collection suggesting that the association between piperine and breast cancer may be independent of menopausal status.

Dimethylguanadino valeric acid (DMGV), an organic keto acid, is the product of transamination of asymmetric dimethylarginine (ADMA), which inhibits nitric oxide signaling that is crucial to endothelial function—excess ADMA is associated with increased risk of cardiovascular disease^31,32. Plasma DMGV is positively associated with incident coronary artery disease, cardiovascular mortality, nonalcoholic fatty acid liver disease, and type II diabetes^33,34. Circulating DMGV is directly correlated with resistance to the metabolic benefits of exercise³⁵. Consumption of vegetables and red wine are associated with lower circulating DMGV, while sugar-sweetened beverage consumption is associated with higher circulating DMGV^34,36. DMGV was associated with lower breast cancer risk in our study. The association between DMGV and breast cancer risk has not been previously assessed in prospective cohort studies. However, this metabolite was correlated with liver fat in the offspring cohort of the Framingham Heart Study (β = 0.02, 95% CI: 0.018–0.022, p < 10⁻²³)³³. In addition, in the same study, baseline DMGV levels were associated with higher risk of type 2 diabetes, with replication of this association in the Malmo Diet and Cancer study and the Jackson Heart Study³³. Large prospective studies are required to validate the association between DMGV and breast cancer risk. If replicated, experimental studies will be needed to understand the complex relationship between DMGV, diet, physical activity, CVD, type II diabetes, and breast cancer. Of note is that this analysis was in predominantly premenopausal women, among whom adiposity is also inversely associated with risk of breast cancer for reasons that are still not fully understood³⁷.

Although the measurement platform used here was optimized to measure amino acids and related metabolites, not lipids, our analysis included a small number of lipids and identified a few significant associations. Plasmalogens are a subclass of phospholipids (components of the cell membrane and involved in cell signaling and cell cycle regulation³⁸) that constitute 15–20% of all phospholipids in cell membranes³⁹. Plasmalogens have head groups that are usually either phosphatidylcholine (PC plasmalogens) or ethanolamine (PE plasmalogens), and are characterized by an ether bond to an alkenyl group in the sn-1 position while the sn-2 position is usually occupied by polyunsaturated fatty acids³⁹. Certain cancers exhibit altered plasmalogen levels: circulating plasmalogens are depressed in pancreatic cancer patients⁴⁰ and increased in gastric carcinoma patients⁴¹ compared to healthy controls. Our study identified two plasmalogens, C34:3 PC plasmalogen and C40:7 PE plasmalogen, associated with increased risk of breast cancer. The fatty acid component of specific lipids may also reflect dietary or metabolic processes; notably C40:7 PE plasmalogen is highly unsaturated but the position of the double bonds cannot be determined in this metabolomics assay. Among 74 women with breast cancer, the levels of the majority of measured phospholipids (LPC, LPE, PC, and PE) were higher in tumor tissue when compared to normal breast tissue samples³⁸. Similar trends were observed in another study comparing tumor to normal tissue in 257 participants with breast cancer, and these lipids were correlated with cancer progression and patient survival⁴². Contrary to our findings, in the European Prospective Investigation into Cancer (EPIC) cohort, PC plasmalogens, including C34:3 PC plasmalogen, were inversely associated with risk of breast cancer^18,19. However, neither study stratified their results by menopausal status, thus making a direct comparison difficult. Additional studies are needed to evaluate how plasmalogens are associated with risk of breast cancer and if this relationship is modulated by menopausal status.

LPCs are derived from phosphatidylcholines after hydrolysis of one of the fatty acid groups. In the liver, LPCs upregulate genes involved in cholesterol biosynthesis, while circulating LPCs activate many inflammatory and oxidative stress signaling pathways, and are associated with inflammatory diseases such as atherosclerosis and multiple sclerosis⁴³. Circulating LPCs have mixed associations with certain cancers. For instance, circulating LPCs are elevated in ovarian cancer patients but depressed in leukemia patients relative to healthy controls⁴⁴; LPCs showed inverse associations with risk of endometrioid and clear cell ovarian tumors, with stronger inverse associations among premenopausal women, in NHS and NHSII^15,16. While most LPCs measured in EPIC were inversely associated with risk, one of our top hits was LPC C20:1 that was positively associated with risk¹⁸. In an earlier study nested within the EPIC cohort of 774 participants including 362 breast cancer cases, C18:0 LPC was identified as inversely associated with breast cancer risk, after adjusting for potential risk factors¹⁹, though analyses were not stratified by menopausal status.

Our study identified three amino acid derivatives associated with breast cancer risk. High levels of phenylacetylglutamine were associated with decreased breast cancer risk while high levels of asparagine and creatine were associated with increased risk. Phenylacetylglutamine is formed from phenylacetate and glutamine and is found as a normal constituent of human urine⁴⁵. Phenylacetylglutamine is a host microbiome cometabolite associated with bacterial phenylalanine metabolism^46,47,48,49. Clostridium difficile, F. prausnitzii, Bifidobacterium, Subdoligranulum, and Lactobacillus are all positively associated with hippuric acid^48,50, while Bifidobacterium is positively associated with phenylacetylglutamine and microbes of the Christensellaceae, Ruminococcaceae, and Lachnospiracaea families are negatively associated with phenylacetylglutamine^47,51. While not directly linked to breast cancer risk, high serum levels of phenylacetylglutamine is a potential early marker of kidney dysfunction in chronic kidney disease⁵¹. Glutamine, a precursor to phenylacetylglutamine, has been associated with breast cancer risk in a nested case-control study within the French SU.VI.MAX cohort (n = 211 cases). High levels of glutamine were associated with increased risk (OR per SD increase =1.33, 95% CI: 1.07–1.66) and this association persisted among the subgroup of premenopausal women (p for interaction = 0.003)⁵². Notably, elevated expression of asparagine synthetase correlates with lung metastasis in breast cancer patients⁵³. Furthermore, in vitro breast cancer studies show asparagine bioavailability promotes invasion and metastasis and that this effect is prevented by limiting asparagine bioavailability⁵³. In a nested case-control study within the EPIC cohort (n = 1624 cases), asparagine was inversely associated with breast cancer risk (OR = 0.87 per SD increase, 95% CI: 0.80–0.95, FDR p = 0.06), in contrast to the direction of association in our study¹⁸. However, the EPIC study participants were overwhelmingly (> 70%) postmenopausal at the time of blood collection, in contrast to our population with 80% premenopausal women. Differences in the menopausal status may partially explain the observed opposite directions of association between asparagine and breast cancer risk.

Creatine is obtained from meat consumption and synthesized endogenously from arginine, glycine, and methionine. Most creatine is found in skeletal muscle, and a significant amount is also found in the brain. Omnivores obtain roughly 50% of their daily creatine from meat and 50% is biosynthesized, while vegetarians biosynthesize most of their creatine⁵⁴ and have significantly lower muscular creatine levels than meat eaters⁵⁵. Creatine is broken down to creatinine in a first-order reaction, the rate of which decreases with age and decreased muscle mass⁵⁶. Creatine/creatinine metabolism plays an important role in energy metabolism in skeletal muscle tissue, and thus disturbances in this pathway are associated with many muscle diseases, whether as a cause or consequence⁵⁷. To the best of our knowledge, no previous work has reported a link between creatine and breast cancer risk. However, the association we observed in this analysis is consistent with the positive association between red meat and risk of breast cancer among premenopausal women in the NHSII cohort⁵⁸.

Kynurenic acid and 2-aminohippuric acid are benzenoids inversely associated with breast cancer risk in our study. 2-amminohippuric acid is a glycine conjugate of anthranilic acid and can be synthesized in the liver⁵⁹, but little is known about its biological function. Both kynurenic acid and 2-aminohippuric acid are part of the kynurenine branch of the tryptophan pathway, an essential amino acid and a precursor to many biologically active metabolites^60,61. Studies of the function of kynurenic acid suggest pleiotropic roles in disease. On the one hand, kynurenic acid has been shown to have anti-inflammatory and anti-ulcerative properties in animal models, as well as antioxidative properties in vitro in human cells⁶². On the other hand, circulating kynurenic acid was associated with increased risk of insulin resistance⁶³. Kynurenic acid acts as both an anti-inflammatory and immunosuppressive factor which in turn allows tumor proliferation⁶⁴. Tryptophan metabolism plays an important role in tumor progression and malignancy with several cancers expressing tryptophan-degrading enzymes such as IDO1^65,66. However, the direct role of kynurenic acid remains unclear, with both proliferative and antiproliferative effects on human glioblastoma cells, an antiproliferative effect on human colon adenocarcinoma cells, and decreased DNA synthesis and inhibited migration in both cell types⁶². Neither kynurenic acid nor 2-aminohippuric acid have been previously associated with breast cancer risk.

N2,N2-dimethylguanosine (DMGU) is a purine nucleoside and a primary degradation product of transport RNA. Elevated circulating DMGU levels may indicate cellular stress and is associated with several diseases, including pulmonary arterial hypertension⁶⁷, solid tumors⁶⁸, incident type II diabetes⁶⁹, and all-cause mortality⁷⁰. Elevated levels of N2,N2-dimethylguanosine were found in patients with acute leukemia and breast cancer⁷¹. Elevated circulating levels of this metabolite are associated with lower risk of breast cancer in our study.

Importantly, there are several studies suggesting lifestyle and dietary changes may impact circulating levels of the here identified metabolites; however, the relationships are complex and warrant further study. For example, vegetables intake was positively associated with piperine (manuscript in preparation) and inversely associated with DMGV³⁴ but both metabolites are inversely associated with risk. The Mediterranean diet was positively associated with C40:7 PE plasmalogen⁷² though this metabolite was positively associated with risk. Further, physical activity is inversely associated with creatine and positively associated with asparagine⁷³ though we observed positive associations with breast cancer risk for both metabolites. This work supports the fact that these metabolites are potentially modifiable, however the complex relationships with breast cancer risk preclude clear recommendations and more investigation is required.

Our study has several strengths and limitations. Notably, we conducted a prospective analysis of amino acid and amino acid-related metabolomics and risk of breast cancer among a large number of predominantly premenopausal women. We had detailed information on sample collection characteristics and risk factors, which we included in our statistical approaches. Although metabolomics was measured at only one point in time, the identified metabolites are reasonably stable over time⁷⁴ (ICCs or correlation over 1–2 years ≥0.75 for nine metabolites; no data are available for 2-aminohippuric acid and C20:1 LPC). Our cohort consisted of registered nurses, a group that are not representative of the general population (e.g., social economic status), however there is no evidence suggesting that breast carcinogenesis is different in this group of women. Additionally, our cohort also included predominantly Caucasian women. However, while the prevalence of risk factors often differs across population subgroups, many breast cancer risk factors have been documented to operate similarly across ethnic groups, as would be expected from a common underlying biology^{75,76,77,78,79,80,81,82,83,84,85,86,87,88}. Nonetheless, it is crucial that future studies conduct similar analyses in racially and ethnically diverse cohorts. While we had reasonable power in most of our analyses, we had limited power among ER negative tumors. Lastly, the uniqueness of our data measured among predominantly premenopausal women make replication studies challenging. We conducted this analysis in a hypothesis generating framework and hope that other cohorts will follow and analyze metabolomics data stratifying by menopausal status.

In summary, we identified eleven metabolites associated with risk of breast cancer among premenopausal women. Seven metabolites remained associated after adjustment for established risk factors and were selected by at least one multivariable modeling approach: higher levels of 2-aminohippuric acid, kynurenic acid, piperine, DMGV and phenylacetylglutamine were associated with lower breast cancer risk while higher levels of creatine and C40:7 PE plasmalogen were associated with increased breast cancer risk. Additional prospective cohort studies are needed to assess these associations considering menopausal status. If these findings are validated, experimental studies are warranted to understand the underlying biological mechanisms driving changes in metabolite levels.

Methods

Study population

In 1989, 116,429 female registered nurses aged 25–42 y returned a mailed questionnaire and were enrolled in the NHSII. Participants have been followed biennially since 1989 with questionnaires collecting information on reproductive history, lifestyle factors, diet, medication use, and new disease diagnoses.

In 1996–1999, 29,611 NHSII participants aged 32–54 y contributed blood samples, as previously described⁸⁹. Of these, 18,521 women who had not used oral contraceptives, been pregnant or breastfed in the previous 6 months provided samples timed within the menstrual cycle, targeting the early follicular (days 3–5 of the cycle) and mid-luteal (7–9 days prior to expected start of next cycle) phases. The remaining women donated a single untimed sample. Follicular plasma was separated and frozen by the participants and returned with the luteal sample; samples were collected and shipped overnight to our laboratory where we processed and archived aliquots of white blood cell, red blood cell, and plasma in liquid nitrogen freezers (≤−130 °C). Follow-up in the blood subcohort is high (96% in 2011).

The study protocol was approved by the institutional review boards of the Brigham and Women’s Hospital and Harvard T.H. Chan School of Public Health, and those of participating registries as required. The return of the self-administered questionnaire and blood sample was considered to imply consent.

Case and control selection

Cases of breast cancer were identified after blood collection among women who had no reported cancer (other than nonmelanoma skin). Thousand and fifty-seven cases (invasive cases n = 780) were diagnosed between 1999 and 2011. Breast cancer cases were reported by the participant, which were confirmed by medical record reviews (n = 1015) or verbally by the nurse (n = 42). Given the high confirmation rate by medical record for breast cancer in this cohort (99%), all cases are included in this analysis.

One control was matched per case by the following factors: age (+/− 2 y), menopausal status and postmenopausal hormone therapy (HT) use at blood collection and diagnosis (premenopausal, postmenopausal and not taking HT, postmenopausal and taking HT, and unknown), and month (+/− 1 month), time of day (+/− 2 h), fasting status at blood collection (≤8h after a meal or unknown; >8h), race/ethnicity (African–American, Asian, Hispanic, Caucasian, other) and luteal day (+/− 1 day; timed samples only).

Covariate information

Data on breast cancer risk factors, including anthropometric measures, reproductive history, and lifestyle factors, were collected from questionnaires administered biennially and at the time of blood collections. Case characteristics, including invasive vs. in situ, histologic grade, estrogen and progesterone receptor (ER, PR), were extracted from pathology reports. As previously described⁹⁰, immunohistochemical results for ER and PR, read manually by a study pathologist, were included for cases with available tumor tissue included in tissue microarrays.

Laboratory assay

Plasma metabolites were profiled at the Broad Institute of MIT and Harvard (Cambridge, MA) using a liquid chromatography tandem mass spectrometry (LC-MS) method designed to measure polar metabolites such as amino acids, amino acids derivatives, dipeptides, and other cationic metabolites as described previously^33,74,91,92. Pooled plasma reference samples were included every 20 samples and results were standardized using the ratio of the value of the sample to the value of the nearest pooled reference multiplied by the median of all reference values for the metabolite. Samples were run together, with matched case-control pairs (as sets) distributed randomly within the batch, and the order of the case and controls within each pair randomly assigned. Therefore, the case and its control were always directly adjacent to each other in the analytic run, thereby limiting variability in platform performance across matched case–control pairs. In addition, 238 quality control (QC) samples, to which the laboratory was blinded, were also profiled. These were randomly distributed among the participants’ samples.

Hydrophilic interaction liquid chromatography (HILIC) analyses of water soluble metabolites in the positive ionization mode were conducted using an LC-MS system comprised of a Shimadzu Nexera X2 U-HPLC (Shimadzu Corp.; Marlborough, MA) coupled to a Q Exactive mass spectrometer (Thermo Fisher Scientific; Waltham, MA). Metabolites were extracted from plasma (10 µL) using 90 µL of acetonitrile/methanol/formic acid (74.9:24.9:0.2 v/v/v) containing stable isotope-labeled internal standards (valine-d8, Sigma-Aldrich; St. Louis, MO; and phenylalanine-d8, Cambridge Isotope Laboratories; Andover, MA). The samples were centrifuged (10 min, 9000 × g, 4 °C), and the supernatants were injected directly onto a 150 × 2 mm, 3 µm Atlantis HILIC column (Waters; Milford, MA). The column was eluted isocratically at a flow rate of 250 µL/min with 5% mobile phase A (10 mM ammonium formate and 0.1% formic acid in water) for 0.5 min followed by a linear gradient to 40% mobile phase B (acetonitrile with 0.1% formic acid) over 10 min. MS analyses were carried out using electrospray ionization in the positive ion mode using full scan analysis over 70–800 m/z at 70,000 resolution and 3 Hz data acquisition rate. Other MS settings were: sheath gas 40, sweep gas 2, spray voltage 3.5 kV, capillary temperature 350 °C, S-lens RF 40, heater temperature 300 °C, microscans 1, automatic gain control target 1e6, and maximum ion time 250 ms. Metabolite identities were confirmed using authentic reference standards or reference samples.

In total, 259 known metabolites were measured in this study. Metabolites not passing our previously conducted processing delay pilot study⁷⁴ were excluded from this analysis (N = 33). All metabolites (N = 226) included here exhibited good reproducibility within person over 1-2 years⁷⁴. Two hundred and six metabolites had no missing values among participant samples. One metabolite had <10% missing values and 19 metabolites had ≥10% missing values. Most of the metabolites (N = 191) had a coefficient of variation (CV) < 25% and an intraclass correlation coefficient (ICC) > 0.4 among blinded QC samples. Twenty-five metabolites had CV ≥ 25%, five had ICCs ≤ 0.4, and five metabolites had CV ≥ 25% and ICC ≤ 0.4.

Statistical analysis

Metabolite levels were natural logarithm transformed and standardized prior to statistical analysis. Missing values were imputed by one half the lowest observed value per metabolite, for metabolites with <10% missing values (N = 1). Metabolites with >10% missing values (N = 19) were excluded from the main analysis and evaluated in an exploratory analysis. Association of metabolite levels with breast cancer risk was assessed in metabolite-by-metabolite models and in multivariable analyses that included all metabolites simultaneously.

The association of individual metabolites with breast cancer risk was assessed in conditional logistic regression models (1057 cases and 1057 controls). In a simple model, each metabolite was included without adjustment for other factors. In an adjusted model, the following additional factors were included: BMI at age 18, weight change from age 18 to time of blood draw, age at menarche, parity and age at first birth, family history of breast cancer, diagnosis of benign breast disease, physical activity, alcohol consumption, exogenous hormone use and breastfeeding history. Odds ratios (OR) and 95% confidence intervals (95% CI) were estimated for a one-unit (one standard deviation) increase in the log-transformed and standardized metabolites levels.

We performed analyses restricting to premenopausal women at blood collection (838 cases and 838 controls), and analyses stratified by BMI (<25 kg/m² [635 cases and 578 controls]; ≥25 kg/m² [422 cases and 479 controls]) and ER status (ER+ [585 cases and 1057 controls]; ER− [126 cases and 1057 controls]). In a sensitivity analysis, we observed similar results between conditional logistic regression and unconditional logistic regression adjusting for the matching factors. Thus, stratified analyses were conducted using unconditional logistic regression, additionally adjusting for the matching factors. To test for effect modifications by BMI and heterogeneity by ER status, we included cross-product terms of the metabolite with potential effect modifier in conditional logistic models and report the p-value for that interaction. As ER status represents a case characteristic, we assigned each control the ER status of its matched case.

In an exploratory analysis we assessed the association with risk of breast cancer for the 19 metabolites with >10% missing values. We included the continuous metabolite level as well as a presence/absence indicator in the fully adjusted conditional logistic regression model and performed a likelihood-ratio test (full model compared to a model excluding both the metabolite and the presence/absence indicator) to estimate the significance level of the association.

Multivariable analyses evaluating the joint association of the 207 metabolites with breast cancer risk were based on (1) conditional logistic regression with lasso penalty (“Lasso”), (2) conditional logistic regression with an elastic net penalty (“Elastic Net”), and (3) random forests. In the Lasso and Elastic Net analyses, a minimally adjusted model included only the set of 207 metabolites, whereas a fully adjusted model further adjusted for the risk factors noted above. In each analysis, the optimal values of the regularization parameter(s) were estimated as that which minimizes the average deviance in the left-out partitions, in a 10-fold cross validation procedure. A p-value for each metabolite was obtained from a permutation test in which the case/control labels were permuted within each matched stratum. A p-value for each metabolite was calculated as the proportion of permutations (out of 250) in which the magnitude of the coefficient under label permutation was at least as large as the regression coefficient in the observed dataset. Analyses were carried out using the R library clogitL1⁹³.

Random forests analyses included a minimally adjusted model that included the 207 metabolites and matching factors. A fully adjusted model also included the additional risk factors noted above. In all analyses, the parameter mtry corresponding to the number of variables randomly sampled as candidates at each split was set to the square root of the total number of covariates in the model. Each classifier was an aggregate of 5000 trees. A p-value for each metabolite was obtained from a permutation test as described above in which the case/control labels were randomly permuted 100 times. Analyses were carried out using the R library random forests⁹⁴.

To adjust for multiple testing in the conditional logistic regression and the multivariable models we estimated the positive FDR based on the q-value procedure⁹⁵. Metabolites that satisfied a p-value less than 0.05 and corresponding q-value less than 0.20 in the minimally adjusted model were discussed as primary findings.

Criterion for statistical significance: metabolites that met a p-value < 0.05 and q-value < 0.20 in at least one of the four models (Conditional Logistic Regression, Lasso, Elastic Net, and Random Forests) with minimal adjustment were considered as statistically significant. An exception was made for metabolites that did not meet a p-value threshold < 0.05 in the conditional logistic regression: four metabolites that met the threshold for statistical significance in Lasso only but had high raw p-values (>0.3) in the conditional logistic regression were excluded (C12:1 carnitine, C22:5 LPC, C46:2 TAG, glycine).

A metabolite score was estimated for each participant as a linear combination of all metabolites that met the threshold for statistical significance. The coefficients associated with each metabolite were estimated in a conditional logistic regression model with the Lasso penalty that included all metabolites simultaneously and with full adjustment for all potential confounders.

Analyses were caried out using R⁹⁶ version 3.6.3 and SAS version 9.4. All statistical tests were two-sided.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The data generated and analyzed during this study are described in the following data record: https://doi.org/10.6084/m9.figshare.14374349⁹⁷. The demographics and questionnaire data together with the plasma metabolomics profiles are not publicly available for the following reason: data contain information that could compromise research participant privacy. Requests to access these data should be made via http://www.nurseshealthstudy.org/researchers. Analysis results for all metabolites are openly available as part of this figshare metadata record in the files titled “supplementary_tables 1–4.csv”.

Code availability

The code used in this analysis is available upon request.

References

Harbeck, N. et al. Breast cancer. Nat. Rev. Dis. Prim. 5, 1–31 (2019).
Google Scholar
Dartois, L. et al. Proportion of premenopausal and postmenopausal breast cancers attributable to known risk factors: estimates from the E3N-EPIC cohort. Int. J. Cancer 138, 2415–2427 (2016).
Article CAS PubMed Google Scholar
Sprague, B. L. et al. Proportion of invasive breast cancer attributable to risk factors modifiable after menopause. Am. J. Epidemiol. 168, 404–411 (2008).
Article PubMed PubMed Central Google Scholar
Tamimi, R. et al. Population attributable risk of modifiable and nonmodifiable breast cancer risk factors in postmenopausal breast cancer. Am. J. Epidemiol. 184, 884–893 (2016).
Article PubMed PubMed Central Google Scholar
Maas, P. et al. Breast cancer risk from modifiable and nonmodifiable risk factors among white women in the United States. Jama Oncol. 2, 1295–1302 (2016).
Article PubMed PubMed Central Google Scholar
Armitage, E. G. & Barbas, C. Metabolomics in cancer biomarker discovery: current trends and future perspectives. J. Pharm. Biomed. Anal. 87, 1–11 (2014).
Article CAS PubMed Google Scholar
McCartney, A. et al. Metabolomics in breast cancer: a decade in review. Cancer Treat. Rev. 67, 88–96 (2018).
Article PubMed Google Scholar
Jiao, L. et al. A prospective targeted serum metabolomics study of pancreatic cancer in postmenopausal women. Cancer Prev. Res. 12, 237–246 (2019).
Article CAS Google Scholar
Mayers, J. R. et al. Elevation of circulating branched-chain amino acids is an early event in human pancreatic adenocarcinoma development. Nat. Med. 20, 1193 (2014).
Article CAS PubMed PubMed Central Google Scholar
Shu, X. et al. Prospective metabolomics study identifies potential novel blood metabolites associated with pancreatic cancer risk. Int. J. Cancer 143, 2161–2167 (2018).
Article CAS PubMed PubMed Central Google Scholar
Huang, J. et al. Prospective serum metabolomic profiling of lethal prostate cancer. Int. J. Cancer 145, 3231–3243 (2019).
Article CAS PubMed PubMed Central Google Scholar
Wang, Y., Jacobs, E. J., Carter, B. D., Gapstur, S. M. & Stevens, V. L. Plasma metabolomic profiles and risk of advanced and fatal prostate cancer. Eur. Urol. Oncol. 4, 56–65 (2019).
Article PubMed Google Scholar
Loftfield, E. et al. Prospective investigation of serum metabolites, coffee drinking, liver cancer incidence, and liver disease mortality. J. Natl Cancer Inst. 112, 286–294 (2019).
Article PubMed Central CAS Google Scholar
Perttula, K. et al. Untargeted lipidomic features associated with colorectal cancer in a prospective cohort. BMC Cancer 18, 996 (2018).
Article CAS PubMed PubMed Central Google Scholar
Zeleznik, O. A. et al. Circulating lysophosphatidylcholines, phosphatidylcholines, ceramides, and sphingomyelins and ovarian cancer risk: a 23-year prospective study. J. Natl Cancer Inst. 112, 628–636 (2019).
Article PubMed Central CAS Google Scholar
Zeleznik, O. A. et al. A prospective analysis of circulating plasma metabolites associated with ovarian cancer risk. Cancer Res. 80, 1357–1367 (2020).
Article CAS PubMed PubMed Central Google Scholar
Troisi, J. et al. Metabolomic signature of endometrial cancer. J. Proteome Res. 17, 804–812 (2018).
Article CAS PubMed Google Scholar
His, M. et al. Prospective analysis of circulating metabolites and breast cancer in EPIC. BMC Med. 17, 178 (2019).
Article PubMed PubMed Central Google Scholar
Kühn, T. et al. Higher plasma levels of lysophosphatidylcholine 18:0 are related to a lower risk of common cancers in a prospective metabolomics study. BMC Med. 14, 13 (2016).
Article PubMed PubMed Central CAS Google Scholar
Lécuyer, L. et al. Diet-related metabolomic signature of long-term breast cancer risk using penalized regression: an exploratory study in the SU.VI.MAX cohort. Cancer Epidemiol. Biomark. Prev. 29, 396–405 (2019).
Article Google Scholar
Moore, S. C. et al. A metabolomics analysis of body mass index and postmenopausal breast cancer risk. J. Natl Cancer Inst. 110, 588–597 (2018).
CAS PubMed PubMed Central Google Scholar
Yoo, H. J. et al. Analysis of metabolites and metabolic pathways in breast cancer in a Korean prospective cohort: the Korean Cancer Prevention Study-II. Metabolomics 14, 85 (2018).
Article PubMed CAS Google Scholar
Playdon, M. C. et al. Nutritional metabolomics and breast cancer risk in a prospective study. Am. J. Clin. Nutr. 106, 637–649 (2017).
Article CAS PubMed PubMed Central Google Scholar
Derosa, G., Maffioli, P. & Sahebkar, A. In Anti-inflammatory Nutraceuticals and Chronic Diseases 173–184 (Springer, 2016).
Meghwal, M. & Goswami, T. Piper nigrum and piperine: an update. Phytother. Res. 27, 1121–1130 (2013).
Article CAS PubMed Google Scholar
Yadav, V., Krishnan, A. & Vohora, D. A systematic review on Piper longum L.: Bridging traditional knowledge and pharmacological evidence for future translational research. J. Ethnopharmacol. 247, 112255 (2019).
Zadorozhna, M., Tataranni, T. & Mangieri, D. Piperine: role in prevention and progression of cancer. Mol. Biol. Rep. 46, 1–13 (2019).
Aumeeruddy, M. Z. & Mahomoodally, M. F. Combating breast cancer using combination therapy with 3 phytochemicals: piperine, sulforaphane, and thymoquinone. Cancer 125, 1600–1611 (2019).
Article PubMed Google Scholar
Greenshields, A. L. et al. Piperine inhibits the growth and motility of triple-negative breast cancer cells. Cancer Lett. 357, 129–140 (2015).
Article CAS PubMed Google Scholar
Abdelhamed, S. et al. Piperine enhances the efficacy of TRAIL-based therapy for triple-negative breast cancer cells. Anticancer Res. 34, 1893–1899 (2014).
CAS PubMed Google Scholar
Rodionov, R. N., Murry, D. J., Vaulman, S. F., Stevens, J. W. & Lentz, S. R. Human alanine-glyoxylate aminotransferase 2 lowers asymmetric dimethylarginine and protects from inhibition of nitric oxide production. J. Biol. Chem. 285, 5385–5391 (2010).
Article CAS PubMed Google Scholar
Willeit, P. et al. Asymmetric dimethylarginine and cardiovascular risk: systematic review and meta‐analysis of 22 prospective studies. J. Am. Heart Assoc. 4, e001833 (2015).
O’Sullivan, J. F. et al. Dimethylguanidino valeric acid is a marker of liver fat and predicts diabetes. J. Clin. Investig. 127, 4394–4402 (2017).
Article PubMed PubMed Central Google Scholar
Ottosson, F. et al. Dimethylguanidino valerate: a lifestyle‐related metabolite associated with future coronary artery disease and cardiovascular mortality. J. Am. Heart Assoc. 8, e012846 (2019).
Article PubMed PubMed Central Google Scholar
Robbins, J. M. et al. Association of dimethylguanidino valeric acid with partial resistance to metabolic health benefits of regular exercise. JAMA Cardiol. 4, 636–643 (2019).
Article PubMed PubMed Central Google Scholar
Hernández‐Alonso, P. et al. Plasma metabolites associated with frequent red wine consumption: a metabolomics approach within the PREDIMED Study. Mol. Nutr. Food Res. 63, 1900140 (2019).
Article CAS Google Scholar
Schoemaker, M. J. et al. Association of body mass index and age with subsequent breast cancer risk in premenopausal women. JAMA Oncol. 4, e181771–e181771 (2018).
Article PubMed PubMed Central Google Scholar
Yamashita, Y. et al. Differences in elongation of very long chain fatty acids and fatty acid metabolism between triple-negative and hormone receptor-positive breast cancer. BMC Cancer 17, 589 (2017).
Article PubMed PubMed Central CAS Google Scholar
Messias, M. C. F., Mecatti, G. C., Priolli, D. G. & de Oliveira Carvalho, P. Plasmalogen lipids: functional mechanism and their involvement in gastrointestinal cancer. Lipids Health Dis. 17, 41 (2018).
Article PubMed PubMed Central CAS Google Scholar
Ritchie, S. A. et al. Metabolic system alterations in pancreatic cancer patient serum: potential for early detection. BMC Cancer 13, 416 (2013).
Article PubMed PubMed Central CAS Google Scholar
Lv, J., Lv, C.-Q., Xu, L. & Yang, H. Plasma content variation and correlation of plasmalogen and GIS, TC, and TPL in gastric carcinoma patients: a comparative study. Med Sci. Monit. Basic Res. 21, 157–160 (2015).
Article PubMed PubMed Central Google Scholar
Hilvo, M. et al. Novel theranostic opportunities offered by characterization of altered membrane lipid metabolism in breast cancer progression. Cancer Res. 71, 3236–3245 (2011).
Article CAS PubMed Google Scholar
Law, S.-H. et al. An updated review of lysophosphatidylcholine metabolism in human diseases. Int. J. Mol. Sci. 20, 1149 (2019).
Libert, D. M., Nowacki, A. S. & Natowicz, M. R. Metabolomic analysis of obesity, metabolic syndrome, and type 2 diabetes: amino acid and acylcarnitine levels change along a spectrum of metabolic wellness. PeerJ 6, e5410 (2018).
PubChem Database. Vol. Phenylacetylglutamine, CID=92258 (National Center for Biotechnology Information) https://pubchem.ncbi.nlm.nih.gov/compound/Phenylacetylglutamine Accessed 27 March 2020.
Lees, H. J., Swann, J. R., Wilson, I. D., Nicholson, J. K. & Holmes, E. Hippurate: the natural history of a mammalian–microbial cometabolite. J. Proteome Res. 12, 1527–1546 (2013).
Article CAS PubMed Google Scholar
Li, M. et al. Symbiotic gut microbes modulate human metabolic phenotypes. Proc. Natl Acad. Sci. USA 105, 2117–2122 (2008).
Article CAS PubMed PubMed Central Google Scholar
Pallister, T. et al. Hippurate as a metabolomic marker of gut microbiome diversity: Modulation by diet and relationship to metabolic syndrome. Sci. Rep. 7, 1–9 (2017).
Wikoff, W. R. et al. Metabolomics analysis reveals large effects of gut microflora on mammalian blood metabolites. Proc. Natl Acad. Sci. 106, 3698–3703 (2009).
Article CAS PubMed PubMed Central Google Scholar
Behr, C. et al. Gut microbiome-related metabolic changes in plasma of antibiotic-treated rats. Arch. Toxicol. 91, 3439–3454 (2017).
Article CAS PubMed Google Scholar
Barrios, C. et al. Gut-microbiota-metabolite axis in early renal function decline. PLoS ONE 10, e034311 (2015).
Lécuyer, L. et al. Plasma metabolomic signatures associated with long-term breast cancer risk in the SU. VI. MAX prospective cohort. Cancer Epidemiol. Prev. Biomark. 28, 1300–1307 (2019).
Article Google Scholar
Knott, S. R. et al. Asparagine bioavailability governs metastasis in a model of breast cancer. Nature 554, 378–381 (2018).
Article CAS PubMed PubMed Central Google Scholar
Riesberg, L. A., Weed, S. A., McDonald, T. L., Eckerson, J. M. & Drescher, K. M. Beyond muscles: the untapped potential of creatine. Int. Immunopharmacol. 37, 31–42 (2016).
Article CAS PubMed PubMed Central Google Scholar
Delanghe, J. et al. Normal reference values for creatine, creatinine, and carnitine are lower in vegetarians. Clin. Chem. 35, 1802–1803 (1989).
Article CAS PubMed Google Scholar
Brosnan, J. T. & Brosnan, M. E. Creatine metabolism and the urea cycle. Mol. Genet. Metab. 100, S49–S52 (2010).
Article CAS PubMed Google Scholar
Wyss, M. & Kaddurah-Daouk, R. Creatine and creatinine metabolism. Physiol. Rev. 80, 1107–1213 (2000).
Article CAS PubMed Google Scholar
Farvid, M. S., Cho, E., Chen, W. Y., Eliassen, A. H. & Willett, W. C. Dietary protein sources in early adulthood and breast cancer incidence: prospective cohort study. BMJ348, g3437 (2014).
Article PubMed PubMed Central CAS Google Scholar
Naito, J., Sasaki, E., Ohta, Y., Shinohara, R. & Ishiguro, I. Anthranilic acid metabolism in the isolated perfused rat liver: detection and determination of anthranilic acid and its related substances using high-performance liquid chromatography with electrochemical detection. Biochem. Pharmacol. 33, 3195–3200 (1984).
Article CAS PubMed Google Scholar
Badawy, A. A. B. Kynurenine pathway of tryptophan metabolism: regulatory and functional aspects. Int. J. Tryptophan Res. 10, 1178646917691938 (2017).
Richard, D. M. et al. L-Tryptophan: basic metabolic functions, behavioral research and therapeutic indications. Int J. Tryptophan Res. 2, 45–60 (2009).
Article CAS PubMed PubMed Central Google Scholar
Walczak, K., Wnorowski, A., Turski, W. A. & Plech, T. Kynurenic acid and cancer: facts and controversies. Cell. Mol. Life Sci. 77, 1531–1550 (2019).
Yu, E. et al. Association of tryptophan metabolites with incident type 2 diabetes in the PREDIMED trial: a case–cohort study. Clin. Chem. 64, 1211–1220 (2018).
Article CAS PubMed PubMed Central Google Scholar
Wirthgen, E., Hoeflich, A., Rebl, A. & Günther, J. Kynurenic acid: the Janus-faced role of an immunomodulatory tryptophan metabolite and its link to pathological conditions. Front. Immunol. 8, 1957 (2018).
Article PubMed PubMed Central CAS Google Scholar
Platten, M., Nollen, E. A. A., Röhrig, U. F., Fallarino, F. & Opitz, C. A. Tryptophan metabolism as a common therapeutic target in cancer, neurodegeneration and beyond. Nat. Rev. Drug Discov. 18, 379–401 (2019).
Article CAS PubMed Google Scholar
Labadie, B. W., Bao, R. & Luke, J. J. Reimagining IDO pathway inhibition in cancer immunotherapy via downstream focus on the Tryptophan–Kynurenine–Aryl hydrocarbon axis. Clin. Cancer Res. 25, 1462–1471 (2019).
Article CAS PubMed Google Scholar
Rhodes, C. J. et al. Plasma metabolomics implicates modified transfer RNAs and altered bioenergetics in the outcomes of pulmonary arterial hypertension. Circulation 135, 460–475 (2017).
Article CAS PubMed PubMed Central Google Scholar
Seidel, A., Brunner, S., Seidel, P., Fritz, G. I. & Herbarth, O. Modified nucleosides: an accurate tumour marker for clinical diagnosis of cancer, early detection and therapy control. Br. J. Cancer 94, 1726–1733 (2006).
Article CAS PubMed PubMed Central Google Scholar
Ottosson, F., Smith, E., Gallo, W., Fernandez, C. & Melander, O. Purine metabolites and carnitine biosynthesis intermediates are biomarkers for incident type 2 diabetes. J. Clin. Endocrinol. Metab. 104, 4921–4930 (2019).
Article PubMed PubMed Central Google Scholar
Balasubramanian, R. et al. Metabolomic profiles associated with all-cause mortality in the Women’s Health Initiative. Int. J. Epidemiol. 49, dyz211 (2019).
Levine, L., Waalkes, T. P. & Stolbach, L. Brief communication: serum levels of N 2, N 2-dimethylguanosine and pseudouridine as determined by radioimmunoassay for patients with malignancy. J. Natl Cancer Inst. 54, 341–343 (1975).
CAS PubMed Google Scholar
Li, J. et al. The Mediterranean diet, plasma metabolome, and cardiovascular disease risk. Eur. Heart J. 41, 2645–2656 (2020).
Ding, M. et al. Metabolome-Wide Association Study of the relationship between habitual physical activity and plasma metabolite levels. Am. J. Epidemiol. 188, 1932–1943 (2019).
Article PubMed PubMed Central Google Scholar
Townsend, M. K. et al. Reproducibility of metabolomic profiles among men and women in 2 large cohort studies. Clin. Chem. 59, 1657–1667 (2013).
Article CAS PubMed Google Scholar
Eliassen, A. H., Hankinson, S. E., Rosner, B., Holmes, M. D. & Willett, W. C. Physical activity and risk of breast cancer among postmenopausal women. Arch. Intern. Med. 170, 1758–1764 (2010).
Article PubMed PubMed Central Google Scholar
Adams-Campbell, L. L., Rosenberg, L., Rao, R. S. & Palmer, J. R. Strenuous physical activity and breast cancer risk in African-American women. J. Natl Med. Assoc. 93, 267–275 (2001).
CAS PubMed PubMed Central Google Scholar
Colditz, G. A. et al. Family history, age, and risk of breast cancer. J. Am. Med. Assoc. 270, 338–343 (1993).
Article CAS Google Scholar
Palmer, J. R., Boggs, D. A., Adams-Campbell, L. L. & Rosenberg, L. Family history of cancer and risk of breast cancer in the Black Women’s Health Study. Cancer Causes Control 20, 1733–1737 (2009).
Article PubMed PubMed Central Google Scholar
Kilfoy, B. A. et al. Family history of malignancies and risk of breast cancer: prospective data from the Shanghai women’s health study. Cancer Causes Control 19, 1139–1145 (2008).
Article PubMed PubMed Central Google Scholar
Hunter, D. J. et al. Oral contraceptive use and breast cancer: a prospective study of young women. Cancer Epidemiol. Biomark. Prev. 19, 2496–2502 (2010).
Article Google Scholar
Rosenberg, L., Palmer, J. R., Wise, L. A. & Adams-Campbell, L. L. A prospective study of female hormone use and breast cancer among black women. Arch. Intern. Med. 166, 760–765 (2006).
Article PubMed Google Scholar
Colditz, G. A. et al. The use of estrogens and progestins and the risk of breast cancer in postmenopausal women. N. Engl. J. Med. 332, 1589–1593 (1995).
Article CAS PubMed Google Scholar
Rosenberg, L., Boggs, D. A., Wise, L. A., Adams-Campbell, L. L. & Palmer, J. R. Oral contraceptive use and estrogen/progesterone receptor-negative breast cancer among African American women. Cancer Epidemiol. Biomark. Prev. 19, 2073–2079 (2010).
Article CAS Google Scholar
Fung, T. T. et al. Dietary patterns and the risk of postmenopausal breast cancer. Int. J. Cancer 116, 116–121 (2005).
Boggs, D. A. et al. Fruit and vegetable intake in relation to risk of breast cancer in the Black Women’s Health Study. Am. J. Epidemiol. 172, 1268–1279 (2010).
Article PubMed PubMed Central Google Scholar
Agurs-Collins, T., Rosenberg, L., Makambi, K., Palmer, J. R. & Adams-Campbell, L. Dietary patterns and breast cancer risk in women participating in the Black Women’s Health Study. Am. J. Clin. Nutr. 90, 621–628 (2009).
Article CAS PubMed PubMed Central Google Scholar
Boggs, D. A. et al. Tea and coffee intake in relation to risk of breast cancer in the Black Women’s Health Study. Cancer Causes Control 21, 1941–1948 (2010).
Article PubMed PubMed Central Google Scholar
Ganmaa, D. et al. Coffee, tea, caffeine and risk of breast cancer: a 22-year follow-up. Int J. Cancer 122, 2071–2076 (2008).
Article CAS PubMed PubMed Central Google Scholar
Eliassen, A. H. et al. Endogenous steroid hormone concentrations and risk of breast cancer among premenopausal women. J. Natl Cancer Inst. 98, 1406–1415 (2006).
Article CAS PubMed Google Scholar
Fortner, R. T. et al. Parity, breastfeeding, and breast cancer risk by hormone receptor status and molecular phenotype: results from the Nurses’ Health Studies. Breast Cancer Res. 21, 40 (2019).
Article PubMed PubMed Central Google Scholar
Mascanfroni, I. D. et al. Metabolic control of type 1 regulatory T cell differentiation by AHR and HIF1-α. Nat. Med. 21, 638 (2015).
Article CAS PubMed PubMed Central Google Scholar
Paynter, N. P. et al. Metabolic predictors of incident coronary heart disease in women. Circulation 137, 841–853 (2018).
Article PubMed PubMed Central Google Scholar
Avalos, M., Pouyes, H., Grandvalet, Y., Orriols, L. & Lagarde, E. Sparse conditional logistic regression for analyzing large-scale matched data from epidemiological studies: a simple algorithm. BMC Bioinform. 16, S1 (2015).
Article Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Storey, J. D. The positive false discovery rate: a Bayesian interpretation and the q-value. Ann. Stat. 31, 2013–2035 (2003).
Article Google Scholar
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. (2020) https://www.R-project.org/.
Zeleznik, O. A. et al. Metadata record for the article: Circulating amino acids and amino acid-related metabolites and risk of breast cancer among predominantly premenopausal women. (figshare https://doi.org/10.6084/m9.figshare.14374349, 2021).

Download references

Acknowledgements

This study was funded by the National Cancer Institute (R01 CA050385, UM1 CA186107, P01 CA087969, R01 CA49449, U01 CA176726, and R01 CA67262). We would like to thank the participants and staff of the Nurses’ Health Studies for their valuable contributions as well as the following state cancer registries for their help: A.L., A.Z., A.R., C.A., C.O., C.T., D.E., F.L., G.A., I.D., I.L., I.N., I.A., K.Y., L.A., M.E., M.D., M.A., M.I., N.E., N.H., N.J., N.Y., N.C., N.D., O.H., O.K., O.R., P.A., R.I., S.C., T.N., T.X., V.A., W.A. and W.Y. We would also like to thank Ryan Sheehan for his help with Fig. 2. The authors assume full responsibility for analyses and interpretation of these data.

Author information

These authors contributed equally: Oana A. Zeleznik, Raji Balasubramanian

Authors and Affiliations

Channing Division of Network Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
Oana A. Zeleznik, Lisa Frueh & A. Heather Eliassen
Department of Biostatistics & Epidemiology, University of Massachusetts – Amherst, Amherst, MA, USA
Raji Balasubramanian & Yibai Zhao
Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, USA
Sarah Jeanfavre, Julian Avila-Pacheco & Clary B. Clish
Department of Cancer Epidemiology, Moffitt Cancer Center, Tampa, FL, USA
Shelley S. Tworoger
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
A. Heather Eliassen

Authors

Oana A. Zeleznik
View author publications
You can also search for this author in PubMed Google Scholar
Raji Balasubramanian
View author publications
You can also search for this author in PubMed Google Scholar
Yibai Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Lisa Frueh
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Jeanfavre
View author publications
You can also search for this author in PubMed Google Scholar
Julian Avila-Pacheco
View author publications
You can also search for this author in PubMed Google Scholar
Clary B. Clish
View author publications
You can also search for this author in PubMed Google Scholar
Shelley S. Tworoger
View author publications
You can also search for this author in PubMed Google Scholar
A. Heather Eliassen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

O.A.Z. and R.B. contributed equally to this study. O.A.Z. and R.B. contributed to data curation, formal analysis, investigation, methodology, manuscript writing, and manuscript review. Y.Z. contributed to formal analysis and manuscript review. L.F. contributed to resources and manuscript review. S.J., J.A.P. and C.B.C. contributed to data curation, resources and manuscript review. S.S.T. contributed to conceptualization, investigation, methodology and manuscript review. A.H.E. contributed to conceptualization, funding acquisition, investigation, methodology, project administration, resources, supervision, manuscript writing, and manuscript review.

Corresponding author

Correspondence to Oana A. Zeleznik.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zeleznik, O.A., Balasubramanian, R., Zhao, Y. et al. Circulating amino acids and amino acid-related metabolites and risk of breast cancer among predominantly premenopausal women. npj Breast Cancer 7, 54 (2021). https://doi.org/10.1038/s41523-021-00262-4

Download citation

Received: 02 October 2020
Accepted: 15 April 2021
Published: 18 May 2021
DOI: https://doi.org/10.1038/s41523-021-00262-4