Introduction

Chagas disease (CD) is a parasitic disease caused by Trypanosoma cruzi, a protozoan endemic to the American continent. CD is considered a neglected tropical disease affecting the global health. According to the World Health Organization (WHO) it affects 6–7 million people1. After an acute phase, often asymptomatic, infected people with CD move to a chronic phase. The indeterminate CD form (ICD) involves a group of infected people who remain asymptomatic with no organ involvement. However, around 30% of the chronically infected individuals will develop an organic impairement, mainly affecting the heart and the digestive tract2. Chronic chagasic cardiomyopathy (CCC) represents the greatest burden of morbidity and mortality among these subjects, accounting to around 12.000 deaths annually3.

CD treatment in the chronic phase is recommended mainly in indeterminate cases and women in childbearing age. In addition, parasitological treatment shows a high rate of adverse events, and there are no reliable surrogate markers for disease progression or cure4,5. Thus, the identification of specific biomarkers for each clinical form is currently needed, as it may have a significant impact on patient management. On one hand, they will allow identifying individuals at a higher risk of progression. On the other hand, they can help monitor treatment response and evaluate drug effectiveness in clinical trials since the only definite criterion of cure is negativization of the serology, which can last more than 20 years6.

Untargeted metabolomics is a powerful tool to search for metabolic variations among pre-established groups. Metabolomic studies have been used for the discovery of novel biomarkers in disease states and report metabolic signatures that can be used for diagnosis, monitoring, and treatment response7. It also has the potential to provide insights into the pathophysiological mechanisms of a disease and, therefore, describe potential targets for new treatments. Previous studies have been conducted with the purpose of identifying metabolic conditions in CD9. However, the only study performed on human samples was performed in heart tissue of end-stage CCC individuals and healthy controls10. Furthermore, neither animal nor human CD metabolome analysis have compared the different stages of the disease.

Therefore, we designed a cross-sectional study with the main objective of describing the metabolomic profile associated with individuals with and without CD, depending on the presence or absence of cardiac involvement, to identify potential prognostic and cure biomarkers.

Subjects and methods

We conducted a cross-sectional study from January 2016 to August 2018. Participants were recruited from three international health centers: 1) Vall d’Hebrón—Drassanes International Health Unit (Barcelona, Spain), 2) National Reference Unit for Tropical Diseases. Infectious Diseases Department. Ramón y Cajal University Hospital (Madrid, Spain) and 3) Institute of Tropical Medicine of São Paulo—USP (São Paulo, Brazil). Biomarker determinations were performed by the enterprise Metabolon, Inc. (617 Davis Drive, Suite 400, Durham, NC 277,133 www.metabolon.com) and analyzed at the Centro de Biología Molecular Severo Ochoa (CBMSO) (Madrid, Spain).

Study population and definitions

The included participants were all from Latin American countries. We defined 4 groups of participants: Group 1 indeterminate Chagas disease (ICD), Group 2 chronic chagasic cardiomyopathy (CCC), Group 3 of healthy volunteers, and Group 4, which comprises individuals with cardiomyopathy other than CCC.

All participants were over 18 years old. Group 1 and 2 included people diagnosed with CD by two different validated serological tests. The organic involvement was studied with a chest radiography, an electrocardiogram (EKG) and heart ultrasound. Those showing abnormalities in at least one of the tests underwent echocardiography to evaluate CCC. We defined CCC as the presence of a conduction alteration related to CD, a left ventricle telediastolic volume (LVEDV) > 55mm3, ejection fraction (EF) < 45% or altered segmental contractility. Participants who had received anti-parasitic treatment before inclusion were also excluded. Group 3 were recruited at international health facilities after a negative ELISA test for CD and normal EKG and echocardiogram. Those participants with arterial hypertension, diabetes mellitus or other comorbidities were excluded from groups 1 to 3. Group 4 were selected from cardiology units and included patients with a LVEDV > 55mm3, EF < 45% or an altered segmental contractility and a negative serologic test.

All included participants underwent a clinical history and physical examination. Chest radiography and 12-lead electrocardiogram were performed at the inclusion visit and echocardiography was performed during the following month. Serum was extracted and aliquots of 100ul were conserved at − 80ºC till the analysis procedure.

Metabolomic analysis

Metabolomic analysis was performed using the Metabolon Global Discovery Panel. Samples were prepared using the automated MicroLab STAR® system from Hamilton Company. A quality assurance (QA)/ quality control (QC) process was performed according to Metabolon procedures. Sample extracts were analyzed by UPLC-MS/MS. Waters ACQUITY ultra-performance liquid chromatography (UPLC) and a Thermo Scientific Q-Exactive high resolution/accurate mass spectrometer interfaced with a heated electrospray ionization (HESI-II) source and Orbitrap mass analyzer operated at 35,000 mass resolution were utilized. Raw data was extracted, peak-identified and QC processed using Metabolon’s hardware and software. Metabolites were identified by comparison to library entries of purified standards.

A variety of curation procedures were carried out to ensure that a high-quality data set was made available for statistical analysis and data interpretation. The QC and curation processes were designed to ensure accurate and consistent identification of true chemical entities, and to remove those representing system artifacts, mis-assignments, and background noise. Proprietary visualization and interpretation software of Metabolon were used to confirm the consistency of peak identification among the samples.

Adobe Illustrator CS5 was used for figure presentation. Volcano plots were done with ggplot2 and ggrepel packages of R programming language (version 4.2.2). Heat map and PCA were computed with the online tool ClustVis, a web tool for visualizing clustering of multivariate data, using a predefined correlation clustering distance method (Pearson correlation subtracted from 1) based on the average distance of all possible pairs37. Bar plots of the most remarkable metabolites were performed in GraphPad Prism (version 8).

Statistics

Statistical analysis of metabolomic study was done according to Metabolon procedures. ANOVA contrasts were used to identify biochemicals that differed significantly between experimental groups, and an estimate of the false discovery rate (FDR) was calculated to consider the multiple comparisons that normally occur in metabolomic-based studies. Statistical analyses were conducted to detect differences in ratios of the mean scaled intensities for a metabolite between two experimental groups. In bar plots representation values are scaled intensity means ± SEMs; and statistical significance respect to healthy controls is displayed by asterisks (*) being **** p < 0.0001, *** p < 0.001, ** p < 0.01 and * p < 0.05; and statistical significance respect to ICD individuals is displayed by hashes (#) being #### p < 0.0001, ### p < 0.001, ## p < 0.01 and # p < 0.05.

Ethics declarations

The study was approved by the institutional review board of the coordinating center located at Vall d'Hebron Hospital in Barcelona, Spain (PR(AG)93/2015), and by the ethics committee of each participant center. There was no commercial sponsorship of the study. Written informed consent for participation was obtained from all the participants. Data on human subjects were analyzed anonymously and clinical investigations have been conducted according to the Declaration of Helsinki.

Results

From January 2016 until August 2018, a total of 120 participants were included in the study: 31 individuals with ICD, 41 individuals with CCC, 30 healthy controls and 18 individuals without CD suffering other cardiomyopathies (referred to as heart disease, HD). The demographics of the participants are presented in Table 1. Sixty-eight (56.7%) participants were male, and the mean age was 50.6 years. Additional information regarding the cardiac involvement of participants in group 2 and 4 can be found in Supplementary material Table S1.

Table 1 Demographics of the included participants.

A total of 986 metabolites with known identity from the Metabolon platform were analyzed. Initially, we performed a Principal Component Analysis using the scaled intensities of each metabolite (Fig. 1A). The results showed that the profile of CCC was highly similar to that of HD, while ICD displayed a distinct profile different from both the cardiac individuals and the healthy controls.

Figure 1
figure 1

Metabolite profile of the different groups. Metabolite profile was performed by Principal Component Analysis (A) and heat map analysis (B). Different groups: individuals with indeterminate Chagas disease (ICD), chronic Chagas cardiomyopathy (CCC) and cardiomyopathy without Chagas disease (HD). Mean scaled intensities of each metabolite were used. A color code was given from high (Red) to low (Blue) values in heat map representation. Cluster analysis for columns and rows was done according to the Pearson correlation.

Furthermore, in a heatmap analysis HD and CCC individuals clustered together again, whereas ICD participants formed a separate group (Fig. 1B). Based on these findings, we carried out a deeper analysis to identify potential biomarkers establishing 3 different comparisons: ICD-Control and CCC-Control for the identification of metabolites that changed from a control to a CD individual, and CCC-ICD to identify metabolites that varied with disease progression. Supplementary tables 24 compiles all the detected metabolites in each comparison.

In the ICD-control comparison, 96 metabolites were downregulated and 92 were upregulated (Fig. S1). The CCC-control comparison revealed fewer significant changes, with only 7 downregulated and 59 upregulated metabolites (Fig. S2). When comparing CCC and ICD individuals, 68 metabolites were downregulated, while 244 were upregulated (Fig. S3). Supplementary tables 57 provides a list of all the downregulated and upregulated metabolites in each comparison.

To further narrow down the potential biomarker candidates, a more stringent analysis was performed by increasing the thresholds of the ratios to > 1 of log2ratio and FDR < 0.01 for significance. The following summary presents the most relevant results based on the subpathways to which the metabolites belong.

Peptides and carbohydrate metabolism

Levels of dipeptide metabolites showed the most significant alterations in all the comparisons. Their expression decreased in almost all individuals with ICD but increased back to control levels in individuals of CCC. However, CCC individuals exhibited higher levels of phenylalanylalanine compared to controls (Fig. 2A,B). Gamma-glutamyl amino acids followed a similar pattern, except for gamma-glutamyl methionine, which showed an opposite behavior (increased in ICD and decreased in CCC). Notably, phenylacetylglutamine (PAG), an acetylated peptide, and fibrinopeptide B (1–13) increasingly changed from controls to ICD and then to CCC individuals. Supplementary table 8 provides a comprehensive list of all the peptide metabolites in each comparison classified by subpathways. Regarding carbohydrate metabolites, our analysis revealed only a few significant changes in the CCC-ICD and ICD-Control studies (Fig. S4 and Supplementary table 9).

Figure 2
figure 2

Peptide metabolites analysis. (A) Volcano plot of peptide metabolites with log2ratio of mean scaled intensities and -log10FDR of each metabolite. Thresholds of significance are shown by dashed lines: < -1 and > 1 for the log2ratio and > 2 for the -log10FDR. Significant down or upregulated metabolites of ICD-control (green), CCC-ICD (red) and CCC-control (blue) comparisons are displayed. Non-significant metabolites are marked in grey. Subpathways are shown with different symbols. (B) Bar plots of the potential candidates to biomarker of Chagas disease. Values are scaled intensity means ± SEMs. Statistical significance respect to healthy controls is displayed by asterisk (*) being *** p < 0.001, ** p < 0.01 and * p < 0.05. Statistical significance respect to ICD individuals is displayed by hash (#) being #### p < 0.0001, ### p < 0.001, ## p < 0.01 and # p < 0.05.

Lipid metabolism

In ICD participants, we observed reduced levels of primary and secondary bile acids, which returned to control levels in CCC individuals (Fig. S5). However, we did not find many changes in sterols and steroids, although progestin steroids exhibited a substantial increase in ICD individuals (Fig. S6). Eicosanoids and neutral lipids showed only a few significant metabolites, and only some diacylglycerols displayed significance in ICD individuals (Fig. S7). Furthermore, fatty acids and derivatives exhibited minimal changes (Fig. S8).

Regarding glycerolipids, phospholipids and lysophospholipids metabolism, only certain phosphatidylethanolamines and phosphatidylcholines exhibited significant up-regulation in the ICD set (Fig. S9). Supplementary table 10 presents all the lipid metabolites in each comparison, classified by subpathways.

Cofactors and vitamin metabolism

Particularly, within the hemoglobin and porphyrin metabolism subpathway, significant variations were evident. Bilirubin (Z,Z), bilirubin (E,E), and biliverdin exhibited significant increases in ICD participants. Moreover, L-urobilin demonstrated a substantial escalation from ICD to chronic chagasic cardiomyopathy (CCC) participants, indicating a progressive metabolic shift across disease stages. Beta-cryptoxanthin in the vitamin A metabolism showed a similar profile (Fig. 3A,B). Supplementary table 11 lists all the cofactor and vitamin metabolites in each comparison.

Figure 3
figure 3

Cofactors and vitamins analysis. (A) Volcano plot of cofactor and vitamin metabolites with log2ratio of mean scaled intensities and -log10FDR of each metabolite. Thresholds of significance are shown by dashed lines: < -1 and > 1 for the log2ratio and > 2 for the -log10FDR. Significant down or upregulated metabolites of ICD-control (green) and CCC-ICD (red) comparisons are displayed. Non-significant metabolites are marked in grey. Sub-pathways are shown with different symbols. (B) Bar plots of the potential candidates to biomarker of Chagas disease. Values are scaled intensity means ± SEMs. Statistical significance respect to healthy controls is displayed by asterisk (*) being **** p < 0.0001, *** p < 0.001, ** p < 0.01 and * p < 0.05. Statistical significance respect to ICD individuals is displayed by hash (#) being #### p < 0.0001, ### p < 0.001, ## p < 0.01 and # p < 0.05.

Amino acids metabolism

Significant differences were also found in the urea cycle, arginine, and proline metabolism, and the guanidino and acetamido metabolism. Urea, N2,N5-diacetylornithine and guanidinosuccinate increased in CCC participants compared to the other sets, indicating a progression biomarker pattern (Fig. 4A,B). Supplementary Table 12 lists all these metabolites in each comparison.

Figure 4
figure 4

Urea and arginine metabolism analysis. (A) Volcano plot of urea and arginine metabolites with log2ratio of mean scaled intensities and -log10FDR of each metabolite. Thresholds of significance are shown by dashed lines: < -1 and > 1 for the log2ratio and > 2 for the -log10FDR. Significant down or upregulated metabolites of ICD-control (green) and CCC-ICD (red) comparisons are displayed. Non-significant metabolites are marked in grey. Sub-pathways are shown with different symbols. (B) Bar plots of the potential candidates to biomarker of Chagas disease. Values are scaled intensity means ± SEMs. Statistical significance respect to healthy controls is displayed by asterisk (*) being **** p < 0.0001, *** p < 0.001, ** p < 0.01 and * p < 0.05. Statistical significance respect to ICD individuals is displayed by hash (#) being #### p < 0.0001, ### p < 0.001, ## p < 0.01 and # p < 0.05.

ICD individuals showed an increase in methionine metabolism, as well as in leucine, isoleucine, and valine metabolism (Fig. 5). In fact, methionine, N-acetylmethionine, N-formylmethionine, 3-methyl-2-oxobutyrate and 4-methyl-2-oxopentanoate exhibited some of the highest ratios for ICD participants (Fig. S10). Furthermore, tyramine O-sulfate and vanillic alcohol sulfate, metabolites of the tyrosine metabolism, showed a significant increase comparing from ICD to CCC subjects.

Figure 5
figure 5

Amino acids metabolism analysis. (A, B) Volcano plots of amino acids metabolites with log2ratio of mean scaled intensities and -log10FDR of each metabolite. Thresholds of significance are shown by dashed lines: < -1 and > 1 for the log2ratio and > 2 for the -log10FDR. Significant down or upregulated metabolites of ICD-control (green), CCC-control (blue) and CCC-ICD (red) comparisons are displayed. Non-significant metabolites are marked in grey. Sub-pathways are shown with different symbols. (C) Bar plots of the potential candidates to biomarker of Chagas disease. Values are scaled intensity means ± SEMs. Statistical significance respect to healthy controls is displayed by asterisk (*) being *** p < 0.001, ** p < 0.01 and * p < 0.05. Statistical significance respect to ICD individuals is displayed by hash (#) being #### p < 0.0001, ### p < 0.001, ## p < 0.01 and # p < 0.05.

However, the most notable profile as progression biomarkers, with a specific and significant increase in CCC participants, was observed among three metabolites: vanillactate and phenol glucuronide of the tyrosine metabolism, and the cystathionine of the cysteine metabolism. Supplementary Table 13 lists all the amino acids metabolites in each comparison.

Discussion

Despite significant advancements in the study of the immunopathogenesis of CD, there is an urgent need to identify markers for disease progression and cure in clinical practice, particularly for the development of drugs and vaccines. This is the first analysis using a metabolomic approach to provide a more comprehensive description of the various stages of CD, comparing them with healthy individuals and cardiopathies of different etiologies.

One of the major findings from our groundbreaking study is that patients with CD exhibit a distinct metabolic profile depending on the stage of the disease, with notable differences between ICD and the CCC forms. Interestingly, individuals with ICD demonstrated significant variations compared to healthy controls from the same areas11,12.

Another important finding is that CCC subjects share similarities with individuals suffering from other HD. While CCC has a distinctive physiopathology and characteristic presentation, its subsequent clinical manifestations are not specific to the disease13,14. The latter stages of the cardiac form led to heart failure, which likely exhibits common metabolic variations regardless of its origin. This somewhat validates our approach.

Among these findings, we observed variations in peptide metabolites between the different groups, some of which displayed a potential profile as cure biomarkers. Dipeptides were the most significantly altered metabolites in all the comparisons. Almost all of them exhibited decreased expression in ICD individuals, followed by an increase in CCC subjects, eventually returning to control levels15. Of particular significance is phenylalanylalanine, which was detected at higher levels among CCC participants with the lowest levels among ICD participants. Elevated levels of dipeptides typically result from protein degradation and have been identified as pathogenic factors for oxidative stress and cardiovascular disease16.

Interestingly, among the peptide metabolites, two of them, phenylacetylglutamine (PAG) and fibrinopeptide B (1–13), show promising potential as biomarkers for the cure of CD. PAG binds to various platelet receptors leading to platelet aggregation and subsequent atherothrombosis, which have been associated with myocardial infarction, stroke, and death17. PAG has been clinically linked to heart failure and its severity, and it has been proposed as a potential therapeutic agent for cardiovascular diseases18. Furthermore, elevated serum concentrations of PAG have been observed in patients with heart failure or cardiovascular disease and have been associated with increased risk of hospitalization and mortality19,20,21. Moreover, PAG levels increase with the severity of heart failure in subjects with chronic heart failure, suggesting its potential use as an early screening, diagnostic, and treatment indicator for chronic heart failure22. Therefore, our results strongly support the use of PAG as a biomarker not only for the progression of heart disease but also as a cure biomarker for specific CCC disease and ICD individuals, which is a significant finding considering the absence of cardiac symptoms in these individuals.

Fibrinopeptides are products derived from thrombin activation during coagulation23,24. In our study, most fibrinopeptides exhibited a significant increase in CCC patients, but the isomer fibrinopeptide B (1–13) showed the most pronounced increase compared to controls. This peptide displayed the same pattern as PAG, with increasing levels observed from controls to ICD and CCC individuals. The elevated levels of fibrinopeptides in our samples suggest an enhanced coagulation in infected subjects, potentially leading to cardiovascular disease25. Fibrinopeptide B (1–13) isomer has previously been detected in individuals with acute coronary syndrome (ACS) and ischemic heart disease (IHD), while the isomers B (1–11), B (1–12) and B (1–13) have been correlated with hypertension26,27.

Hemoglobin and porphyrin metabolism metabolites were particularly noteworthy. Heme proteins undergo degradation in the liver, resulting in the production of biliverdin, which is further converted to bilirubin through an exclusive pathway in mammals28. In our study, individuals with ICD exhibited significantly increased levels of biliverdin and bilirubin compared to controls and CCC individuals. Interestingly, population studies have demonstrated a negative correlation between serum bilirubin levels and the development of cardiovascular diseases29,30,31. Conversely, urobilin has been found to be increased in incident heart failure, ACS and IHD26,32. In our study, urobilin was also upregulated in CCC individuals. Thus, decreasing levels of bilirubin and biliverdin coupled with an elevated urobilin could potentially serve as biomarkers of disease progression.

In urea and arginine metabolism, three candidates showed promise as biomarkers of progression: urea, N2,N5-diacetylornithine, and guanidinosuccinate. These metabolites were significantly increased in CCC individuals compared to ICD individuals and controls. The enrichment of the urea cycle has been described in the heart tissue of CCC individuals, and its upregulation has also been associated with mutagenesis in tumor cells10,33. However, interpreting plasma levels of urea is challenging as they are influenced by various physiological processes and can be increased by aging34. Among the branched chain amino acids, in the isoleucine, leucine and valine metabolism, 4-methyl-2-oxopentanoate and 3-methyl-2-oxobutyrate showed a significant increase in ICD participants. This, along with the increase in pyruvate, suggests a potential mitochondrial metabolic disorder in ICD individuals.

In the cysteine, histidine and tyrosine metabolism, metabolites displayed remarkable increases in CCC individuals compared to ICD and controls. One of these metabolites was cystathionine, which is transformed into cysteine by the cystathionine-gamma-lyase (CSE). Studies have shown that CSE overexpression protects against heart failure, while its deficiency exacerbates it35. In our study, we observed an accumulation of cystathionine in CCC participants, suggesting lower CSE activity and exacerbation of cardiac pathology. Additionally, two metabolites of tyrosine metabolism, vanillactate and phenol glucuronide, exhibited significant increases in CCC individuals compared to the other groups. Increased levels of vanillactate have been reported in metabolomics analyses of hypertrophic and dilated cardiomyopathies in plasma as well as in individuals with heart failure, ACS, IHD and chronic IHD26,36. Based on our findings, these molecules hold potential as biomarkers of disease progression in CCC subjects, with cystathionine and phenol glucuronide being associated with an infectious or cardiovascular disease for the first time.

In conclusion, our novel exploratory study using metabolomics in human plasma identified potential biomarker candidates, either alone or in combination. Peptide metabolites such as PAG or fibrinopeptide B (1–13), decreasing levels of bilirubin and biliverdin along with elevated urobilin; cystathionine, phenol glucuronide and vanillactate have shown promise patterns. These candidates should be further validated in targeted studies, but if confirmed, they can be easily translated into clinical practice.