Introduction

Persistent immune activation (IA) fuels major chronic morbidities, including insulin resistance, metabolic syndrome, diabetes, atherothrombosis, neurocognitive disorders or liver steatosis. A model for IA is HIV-1 infection under efficient combined antiretroviral therapy. In people living with HIV-1, the immune system remains potentially activated, even if viral replication is controled by the treatment1. To better characterize IA in this model, we previously measured a series of cell-surface and soluble markers, in 120 efficiently treated HIV patients. A hierarchical clustering analysis identified 5 different IA profiles in these people2. To test whether the IA profiles were robust rather than specific to the 120 patients we had analyzed, we recruited 20 more HIV patients with divergent bioclinical characteristics, and performed the hierarchical clustering analysis again3. Once more, we observed 5 different IA profiles in these 140 HIV patients. We also analyzed the possibility that these IA profiles were the consequence of different causes. In favour of this model, we found a link between microbial translocation and one of the IA profiles3.

The general population, particularly in old age, shares many of the causes of IA with people living with HIV-1. First, we all harbour infectious agents that trigger our immune system4. Second, there is a low level of microbial translocation in each individual that increases over time5. Third, metabolic disorders that may stimulate the immune system also increase with age4. Fourth, as with any senescent cell, immune cells in aging individuals release factors responsible for IA4. And last, the efficiency of Treg cells, a CD4+ T cell subpopulation whose function is to downregulate IA, decreases over time6. Therefore, we reasoned that a more general population might also present with different IA profiles driven by these various etiologic factors. To test this hypothesis, we looked in the present study for the presence of distinct IA profiles in a general population, and for potential etiologic factors linked to these profiles.

Materials and methods

Study design

We recruited 150 adults over 55 years and below 70 years of age, affiliated to the French Social Security system who volunteered for a free health checkup at a Social Security Center in Nîmes, France. Pregnant women, people under immunomodulatory treatment or with diseases likely to modify their immune system were not included. This study was approved by the French Ethics Committee Sud Est IV. All methods were carried out in accordance with the French guidelines and regulations. All individuals had provided written informed consent. The trial was registered on ClinicalTrials.gov under the reference NCT04028882.

Flow cytometry

Monoclonal antibodies conjugated with fluorescein isothiocyanate (FITC), phycoerythrin (PE), energy-coupled dye (ECD), PE-Cyanine5.5 (PC5.5), PE-Cyanine7 (PC7), allophycocyanine (APC), APCAlexa700, or APCAlexa750 were purchased from Beckman Coulter (Villepinte, France). The antibodies were used in the following combinations; CD57-FITC/CD279-PE/CD45RA-ECD/CD28-PC5.5/CD27-PC7/CD8-APC/CD4-APCAlexa700/CD3-APCAlexa750, CD20-FITC/CD38-PE/HLADR-PC7/CD8-APC/CD4-APCAlexa700/CD3-APCAlexa750, CD57-FITC/CD14-PE/CD56-PC5.5/HLADR-PC7/CD16-APC/CD3-APCAlexa750, CD4-FITC/CD45RA-ECD/CD25-PC7/FoxP3-APC/CD127-APC750. Whole blood collected in EDTA tubes was stained within one hour for 10 mn at room temperature in the dark with the cocktail of antibodies and fixed using Immunoprep Reagent kit (Beckman Coulter) according to the manufacturer’s protocol.

For Treg quantification, cells were first fixed with Reagent 1 of the IntraPrep Permeabilization kit (Beckman Coulter) in the dark, and then stained with the CD4-FITC/CD45RA-ECD/CD25-PC7/CD127-APC750 cocktail of antibodies. Secondly, cells were permeabilized and an anti-FoxP3-APC antibody was added. Finally, red blood cells were lysed using Reagent 2. After one hour, cells were washed with Reagent 3.

Cells were run on a Navios flow cytometer and results were analyzed by using Kaluza® software (Beckman Coulter). A minimum of 20,000 lymphocytes were gated to analyze the subpopulations. We controled the inter-run variability with the same batch of Rainbow 8-peak beads (Beckman Coulter). During the study, no voltage adjustment was necessary to keep the beads into their respective defined targets.

Soluble immunologic markers in peripheral blood

ELISA was used to quantify soluble TNF receptor I (sTNFRI), soluble CD14 and soluble CD163 (sCD163) (Quantikine, R&D systems, Rennes, France), as well as tissue Plasminogen Activator (tPA) and soluble Endothelial Protein C Receptor (sEPCR) (Asserachrom, Stago, Asnières-sur-Seine, France) in plasma collected in EDTA Vacutainer tubes (Becton Dickinson, Le Pont-de-Claix, France) and frozen. C-Reactive Protein (CRP) and immunoglobulins (Ig) were measured by turbidimetry in plasma collected by the same way. 16s ribosomal bacterial DNA was measured in plasma by quantitative PCR as previously described7.

Statistical analysis

All data were standardized before statistical analysis. Next, a visual assessment of the possibility to cluster the data was made using principal component analysis, and also by seeking a cluster structure in the distance matrix. Next, the Hopkins statistic was calculated, with a value of 1 indicating the highest possibility to cluster the data8. Second, we determined the optimal number of clusters using several indexes (e.g., Silhouette9, Gap statistic10). The majority rule was used to determine the optimal number of clusters. Third, we performed two hierarchical clustering analyses. One clustering analysis was carried for volunteers, using the Euclidian distance to measure the distance between individuals and the other one for markers, using 1-abs (correlation) as a distance. For both of them, Ward’s minimum variance method was used as a linkage method. We then generated a heatmap using the classification of volunteers and markers. We evaluated the appropriateness of the classification through an internal validation test. We used two indexes, based on compactness and separation, (i) the silhouette width which varies between − 1 and 1 representing a wrong and perfect classification9, respectively, and (ii) the Dunn index which varies from 0 to infinity and should be maximized11,12. In addition, to analyze the significance of the hierarchical classification, we performed a permutation test on volunteer groups. To this aim, we computed the ratio between between-group and within-group variability in the true groups using the lda function of the MASS package (MASS_7.3-51.6) of the R software (R version 4.0.2 (2020-06-22)). This function computes the singular values, which give the ratio of the between- and within-group standard deviations on the linear discriminant variables. Their squares are the canonical F-statistics. The sum of all singular values provides the global ratio among between-group and within-group variability. In parallel we generated random groups by permutation of group labels. We repeated the permutation 1000 times and visualized their distribution compared to the ratio obtained from true groups.

In order to characterize each immune profile, a V-test was calculated13. The bigger the absolute value of the V-test is, the more characteristic the variable is. All analyses were performed using version 3.6.1 R software (R Development Core Team, A Language and Environment for Statistical Computing, Vienna, Austria, 2016. https://www.R-project.org/).

We used the Mann–Whitney test to compare markers and IA profiles. The links between biomarkers were determined by Spearman rank correlations.

Results

Different immune activation profiles may be distinguished in adults

We recruited 74 (49%) women and 76 (51%) men with a mean ± SD age of 62 ± 4 years. We determined the proportions of CD4+ and CD8+ T cells, of naïve (CD27+ CD45RA+), central memory (CD27+ CD45RA-), effector memory (CD27-CD45RA-), and terminally differentiated T cells (CD27-CD45RA+), activated (HLA-DR+ and/or CD38+), exhausted (PD-1+), and senescent (CD57+, eventually CD27- and CD28-) T cells. The percentages of activated (HLA-DR+), dysfunctional (CD56-), and senescent (CD57+) NK cells were also measured. For monocytes, we quantified the classical (CD14hiCD16lo), intermediate (CD14hiCD16+), and alternative CD14loCD16hi) subpopulations. Examples of our gating strategy are shown in Fig. 1. IgG, IgA, IgM, and sCD163 peripheral blood levels were used as markers of B-cell and monocyte activation, respectively. Inflammation was evaluated via sTNFRI and CRP concentrations, and endothelium activation via sEPCR and tPA concentrations in peripheral blood.

Figure 1
figure 1

Examples of flow cytometry staining of monocytes (A), CD4+ T cells including Treg cells (B), CD8+ T cells (B), and NK cells (C).

For both markers and individuals, a clustering tendency was observed (Hopkins statistics: 0.68 and 0.73, respectively). We performed two independent hierarchical clustering analyses, one for the activation markers and another one for the volunteers. The number of clusters chosen for markers and donors were 2 and 5, respectively, as they corresponded to the results obtained with the majority of the indexes we tested. Thus, the analysis of volunteers identified 5 groups of individuals presenting with different IA profiles (Fig. 2). Concerning the internal validation step, the Dunn index and the silhouette width were 0.26 and 0.06 for individuals, and 0.39 and 0.12 for markers, respectively. To show that the groups identified by hierarchical classification reflect a true structure of the data, we generated random groups of the same size as the true ones. The within-group and between-group variability allow to investigate the quality of clusters, as “good” clusters are compact (individuals in the same group have similar properties, reflected in a low within-group variability) and far from each other (individuals in different groups present distinct profiles, reflected in high between-group variability). Hence, random groups, or data with no defined clusters, would show a lower ratio between between-group and within-group variability. The distribution of the ratio of the between- and within-group standard deviations on the linear discriminant variables is represented in the Fig. 3. The histogram shows that the between/within ratio is much better for the real groups as the “best” random group exhibits a much lower ratio. Indeed, the median values for the random groups is 15.22 (minimum, 12.04 and maximum, 18.92), whereas it is 42.85 for the true groups, identified by the hierarchical classification (p < 0.001). Next, we computed a Principal Component Analysis of the volunteers (Fig. 4).

Figure 2
figure 2

Individuals have different immune activation profiles. Heatmap showing the hierarchical clustering of activation markers (vertical) and volunteers according to their immune activation profile (horizontal). Each immune activation profile is indicated.

Figure 3
figure 3

Analyse of the quality of the hierarchical clustering of the volunteers by permutation test. Distribution of the ratio between-group standard deviations: within-group standard deviations for the random groups (grey bars) and the true groups (red line).

Figure 4
figure 4

Subjects' map resulting from principal component analysis.

Characterization of the immune activation profiles observed

The age, sex, and ethnicity of the individuals are given according to their IA profile in Table 1. There are gender differences in the immune activation profiles (p < 0.001). For each profile, we compared each marker mean with the overall mean, using the V-test (Table 2). Each profile may be characterized by specific markers, comparatively to other profiles. Thus, individuals with Profile 1 had a high mean (± SD) percentage of naïve CD4+ T cells (55 ± 12 versus 37 ± 14, p < 10–4, Fig. 5A) and naïve CD8+ T cells (53 ± 12 versus 31 ± 12, p < 10–4, Fig. 5B) contrasting with a low frequency of terminally differentiated (1 ± 1 versus 3 ± 6, p < 10–4, Fig. 5C) and CD57+ senescent (2 ± 2 versus 7 ± 9, p < 10–4, Fig. 5D) CD4+ T cells, as well as of terminally differentiated (12 ± 8 versus 27 ± 18, p < 10–4, Fig. 5E) and senescent (17 ± 10 versus 34 ± 16, p < 10–4, Fig. 5F) CD8+ T cells. Profiles 2 and 3 were at the opposite end of the spectrum. Profile 2 was characterized by an elevated percentage of terminally differentiated (9 ± 11 versus 2 ± 3, p < 10–4, Fig. 5C) and CD57+ senescent (19 ± 12 versus 4 ± 4, p < 10–4, Fig. 5D) CD4+ T cells. In Profile 3, it is the CD8+ T cells that were more terminally differentiated (41 ± 12 versus 16 ± 14, p < 10–4, Fig. 5E), and senescent (45 ± 12 versus 24 ± 14, p < 10–4, Fig. 5F). Profile 4 presented a high proportion of activated NK cells (23 ± 13 versus 14 ± 11, p < 10–4, Fig. 5G). Finally, a high proportion of monocytes (41 ± 33 versus 9 ± 4, p = 0.018, Fig. 5H) was noteworthy in Profile 5.

Table 1 Volunteers characteristics.
Table 2 Description of the immune activation profiles using the V-test.
Figure 5
figure 5

IA profiles are characterized by specific markers. Differences in the level of various activation markers between volunteers with Profile 1 (A,B), 2 (C,D), 3 (E,F), 4 (G), and 5 (H), and the other profiles.

Links between immune activation profiles and etiologic factors

In Humans, various factors may be responsible for chronic immune activation. Thus, for instance, a deficiency in the mechanisms responsible for downregulating IA may be responsible for an overactivity of the immune system. Microbial translocation, the entry into our organism of microbial products originating from our microbiota, is another potential cause of IA. In a given individual, only some of these etiologic factors may be at work, e.g., immune dysregulation, but not microbial translocation. Therefore, we tested the hypothesis that the IA profiles that we unraveled might be fueled by specific causes. To this aim, we searched for correlations between potential etiologic factors and each profile. Treg is a CD4+ T cell subpopulation playing a major role in IA downregulation 14. Therefore, we measured the proportions and numbers of total Treg (CD4+ CD25+ FOXP3+ CD127lo), naïve Treg (CD4+ CD25+ FOXP3+ CD45RA+CD127lo), and memory Treg, activated (CD4+CD25hiFOXP3hiCD45RA-CD127lo) or not (CD4+CD25+FOXP3+CD45RA-CD127lo) in each individual. As compared with the other profiles, Profile 1, characterized by a low level of differentiated and senescent T cells, had a high percentage of naïve Treg (6.6 ± 4.4 versus 5.3 ± 5.0, p = 0.017, Fig. 6A), a reservoir of cells able to inhibit IA. Moreover, this percentage was linked to the proportions of naïve CD4+ T cells (r = 0.190, p = 0.022, Fig. 6B) and of naïve CD8+ T cells (r = 0.248, p = 0.003, Fig. 6C) in the whole population. By contrast, Profiles 2 and 3, characterized by a high level of differentiated and senescent T cells, tended to have a low percentage of Treg (5.4 ± 1.8 versus 5.9 ± 1.9, p = 0.080, Fig. 6D), in comparison with Profiles 1, 4, and 5. In addition, this percentage was negatively correlated with the proportions of terminally differentiated (r = -0.242, p = 0.003, Fig. 6E) and senescent (r = -0.197, p = 0.017, Fig. 6F) CD4+ T cells in the whole population.

Figure 6
figure 6

Immune activation Profiles 1, 2, 3, and 5 may be characterized by specific etiologic factors. Naïve Treg frequencies in Profile 1 as compared with the other profiles (A), and correlations with naïve CD4+ T cell (B) and naïve CD8+ T cell (C) subpopulations. Treg frequencies in Profiles 2 and 3 comparatively to Profiles 1, 4, and 5 (D), and correlations with terminally differentiated (E) and senescent (F) CD4+ T cells. Circulating bacterial DNA levels in Profile 5 in comparison with the other profiles (G).

Another potential cause of IA is the intensity of microbial translocation which increases with age5. A direct marker of microbial translocation is the presence of bacterial DNA in the circulation quantified by PCR targeting conserved sequences of the 16s ribosomal gene (rDNA). Strikingly, we observed a high, although not significant, level of rDNA in Profile 5 people (45 ± 59 versus 14 ± 13 copies/mL, p = 0.242, Fig. 6G), comparatively to the other volunteers.

Discussion

In this study, we have shown that various IA profiles may be distinguished using an unsupervised learning method in a population of adults volunteering for a health checkup. We revealed 5 distinct IA profiles that may be characterized according to their levels of CD4+ T cell, CD8+ T cell, NK cell and monocyte frequency, activation, and/or differentiation. One profile, Profile 1, has the lowest level of differentiated and senescent T cells, of activated NK cells, and the lowest percentage of monocytes. It may therefore be considered as the less activated profile. Of note, it is the group with the highest percentage of women. Profiles 2 and 3 are remarkable by their reduced percentages of naïve T cells and their elevated percentages of differentiated and senescent T cells. These profiles may therefore be considered as the T cell activated profiles. In Profile 4 it is the NK cells that are activated, and in Profile 5 it is the frequency of circulating monocytes that is noteworthy.

A second finding of this study is that some of these IA profiles are linked to potential causes of IA. Thus, Profile 5 individuals, characterized by a high frequency of peripheral blood monocytes, have a high circulating bacterial DNA load. Yet, the difference in rDNA level between Profile 5 and the other profiles was not significant, probably due to the small number of participants with this Profile (n = 5). Yet, this link between microbial translocation and the frequency of circulationg monocytes is in line with the observation that bacterial products may boost monopoiesis via TLR signaling15. Profiles 2 and 3 which are particular because of their high degree of CD4+ T cell and CD8+ T cell differentiation and senescence, tend to have low percentages of Treg. As Treg are known to interrupt the process of T cell activation16, these low Treg levels might participate in the increased T cell differentiation and senescence specific to these profiles. This hypothesis is supported by our observation of an inverse correlation between the percentage of Treg cells on one hand, and of differentiated and senescent CD4+ T cells on the other hand. Conversely, the high percentage of naïve Treg cells in Profile 1 may at least partly explain the low level of T cell differentiation and senescence in that profile.

One of the limitations of our study is that it is cross-sectional, highlighting only correlations. Further analysis is needed to definitively establish causative links between etiologic factors and IA profiles. Also, additional etiologic factors, different from the one we tested, could shape the IA profiles, as for instance the genetic background and the clinical history. Moreover, the stability over time of IA profiles has to be verified. Our study is also limited by the technology we used. Thus, we did not test the functionality of the various immune cell subpopulations we analyzed. The immune phenotyping could also be more precise by using single-cell transcriptomics analysis, high dimensional flow cytometry or CyTOF. On the other hand, our ultimate goal is to identify a simple signature of easily measurable markers characteristic of immune activation profiles that could fuel immune activation-induced morbidities. This goal may be achievable in routine with the tools we used.

Globally, we show that different IA profiles may be distinguished in a general population, and that some of these profiles are linked to potential etiologic factors such as Treg frequency and microbial translocation.

We propose a model where in each individual one or a few specific causes of IA shape a specific IA profile. Of interest, the different IA profiles we describe here may fuel different morbidities among those driven by IA, as insulin resistance, atherothrombosis or liver steatosis for instance, In this hypothesis, immune profiling might help to tailor the prevention and the screening of these IA-induced diseases. Moreover, deciphering soluble immune factors favoring each of these morbidities might open the way to new therapeutic strategies. In the near future, the immune activation profile of each individual might be identified via a simple signature of a reduced number of activation markers easily measurable, and this immune activation profile might predict the chronic morbidiites this individual is at risk of developing.