Main

The brain undergoes dynamic functional changes with age1,2,3. Accurately mapping the trajectory of these changes and how they relate to chronological age is critical for understanding the aging process, multilevel disparities4,5 and brain disorders1 such as the Alzheimer’s disease continuum, which includes mild cognitive impairment (MCI) and related disorders like behavioral variant frontotemporal dementia (bvFTD)6. Brain clocks or brain-age models have emerged as dimensional, transdiagnostic metrics that measure brain health influenced by a range of factors7,8,9, suggesting that they may be able to capture multimodal diversity10. Populations from LAC exhibit higher genetic diversity and distinct physical, social and internal exposomes11,12 that impact brain phenotypes4,13,14. Income and socioeconomic inequality15,16, high levels of air pollution17, limited access to timely and effective healthcare18, rising prevalence of communicable and noncommunicable diseases19,20, and low education attainment21,22 are determinants of brain health in LAC18. Thus, although measuring the brain-age gap could enhance our understanding of disease risk and its impact on accelerated aging23, there is a lack of research on brain-age models in underrepresented populations where they experience large socioeconomic and health disparities18,24,25.

Sex and gender differences emerge as critical factors influencing brain changes. Studies on atrophy in the Alzheimer disease continuum reveal a faster rate of brain atrophy in females than in males26. Moreover, country-level gender inequality is associated with sex differences in cortical thickness27. Structural gender inequality further impacts brain health, with adverse environments affecting dendritic branching and synapse formation28. However, no studies to date have explored the spectrum of brain-age abnormalities, including the effects of demographic heterogeneity across geographical regions, between sexes, and the continuum from brain health to disease. Further, most studies have been conducted with participants from the Global North, resulting in a lack of generalization to underrepresented populations from the Global South including LAC24,29,30,31.

Multimodal machine learning studies show promise in brain aging23; however, most rely on structural magnetic resonance imaging (MRI), overlooking brain network dynamics. Complex spatiotemporal dimensions can be tracked with spatial accuracy through functional magnetic resonance imaging (fMRI) and with millisecond precision using an electroencephalogram (EEG)32. Given the complementary strengths of fMRI and EEG, it is crucial to cross-validate existing brain clock models using these techniques. However, no studies have simultaneously applied EEG and fMRI to replicate brain-age effects. In addition, standard machine learning approaches are less generalizable than deep learning methods33. Brain-age indices have been restricted by the predominant use of MRI or positron emission tomography, which are less accessible and affordable in LAC, leading to selection biases34. EEG offers a solution because of its cost-effectiveness, portability and ease of implementation in aging and dementia35,36. However, few studies have combined accessible techniques with deep learning to develop scalable brain-age markers. The application of EEG is hindered by heterogeneity in recordings, electrode layouts, acquisition systems, processing pipelines and small sample sizes37. These standardization challenges have impeded the integration of fMRI and EEG in extensive, multicenter brain-age research.

We adopted a framework to tackle diversity by including datasets from LAC and non-LAC regions, utilizing graph convolutional networks (GCN) to functional connectivity of fMRI and EEG signals. We hypothesized that, across fMRI and EEG imaging, models would accurately predict brain-age gaps and be sensitive to the impacts of multimodal diversity, including geographical and sociodemographic effects, sex differences, health disparities and exposome influences. By testing this hypothesis, we aimed to assess the effectiveness of high-order interactions and deep learning in predicting brain-age differences across diverse and heterogeneous populations of healthy aging and neurocognitive disorders.

Results

We used resting-state fMRI and EEG signals separately to evaluate whether a deep learning computational pipeline (Fig. 1) captures differences in brain aging across heterogeneous populations from a total of 5,306 datasets. We included fMRI data from 2,953 participants from Argentina, Chile, Colombia, Mexico and Peru (LAC) and the USA, China and Japan (non-LAC). The EEG dataset involved 2,353 participants from Argentina, Brazil, Chile, Colombia and Cuba (LAC), and Greece, Ireland, Italy, Turkey and the UK (non-LAC). Healthy controls, MCI, Alzheimer disease and bvFTD groups were included. We focused on the Alzheimer disease and bvFTD because these conditions represent the most common late-onset and early-onset causes of dementia38,39. We included the Alzheimer’s disease continuum, which encompasses MCI, to capture the prodromal stages of the disease39. Raw fMRI and EEG signals were preprocessed to remove artifacts and then normalized. Based on multivariate information theory, we calculated high-order interactions1. Weighted graphs were used as inputs for a graph convolutional deep learning network trained to predict brain age, using one model for fMRI and another for EEG.

Fig. 1: Dataset characterization and analysis pipeline.
figure 1

Datasets included LAC and non-LAC healthy controls (HC, total n = 3,509) and participants with Alzheimer disease (AD, total n = 828), bvFTD (total n = 463) and MCI (total n = 517). The fMRI dataset included 2,953 participants from LAC (Argentina, Chile, Colombia, Mexico and Peru) as well as non-LAC (the USA, China and Japan). The EEG dataset involved 2,353 participants from Argentina, Brazil, Chile, Colombia and Cuba (LAC) as well as Greece, Ireland, Italy, Turkey and the UK (non-LAC). The raw fMRI and EEG signals were preprocessed by filtering and artifact removal and the EEG signals were normalized to project them into source space. A parcellation using the automated anatomical labeling (AAL) atlas for both the fMRI and EEG signals was performed to build the nodes from which we calculated the high-order interactions using the Ω-information metric. A connectivity matrix was obtained for both modalities, which was later represented by graphs. Data augmentation was performed only in the testing dataset. The graphs were used as input for a graph convolutional deep learning network (architecture shown in the last row), with separate models for EEG and fMRI. Finally, age prediction was obtained, and the performance was measured by comparing the predicted versus the chronological ages. This figure was partially created with BioRender.com (fMRI and EEG devices).

Brain-age gap across LAC and non-LAC datasets

We used the fMRI and EEG signals from the control’s datasets (LAC and non-LAC) to train and test brain-aging models. We used 80% cross-validation with a 20% hold-out testing split. As shown in Figs. 2a and 3a, our models predicting brain age obtained adequate goodness of fit (fMRI: R2 = 0.52, P < 0.001, f2 = 1.07; EEG: R2 = 0.45, P < 0.001, f2 = 0.83). We implemented the r.m.s.e. to evaluate models’ fit, obtaining acceptable brain-age predictions (fMRI-r.m.s.e. = 7.24, EEG-r.m.s.e. = 6.45). For both, fMRI and EEG, the main predictive brain-regional features included hubs in frontoposterior networks (nodes in precentral gyrus, the middle occipital gyrus, and the superior and middle frontal gyri; Figs. 2a and 3a). Additional nodes for the fMRI model included the inferior frontal gyri, and the anterior and median cingulate and paracingulate gyri (Fig. 2a.). For EEG, key nodes also comprised the superior and inferior parietal gyri and the inferior occipital gyrus (Fig. 3a). Thus, for both fMRI and EEG the models showed an adequate fit and predictive performance, with key predictive features involving frontoposterior networks in the brain.

Fig. 2: fMRI training and testing the deep learning model in different datasets.
figure 2

a, Ordinary least squares (OLS) regression comparing chronological age versus predicted age with the feature importance list for training (n = 1,155) and testing (n = 289) in the whole sample (P < 1 × 10−15). b, Regression comparing chronological age versus predicted age with the feature importance list for training (n = 773) and testing (n = 194) in the non-LAC dataset (P < 1 × 10−15). c, Regression comparing chronological age versus predicted age with the feature importance list for training (n = 381) and testing (n = 91) in the LAC dataset (P = 4.91 × 10−7). For a, b and c, the bars show the brain region feature importance list in descending order, with ring plots and glass brain representations of the most important network-edge connections. Feature importance (top 10) data are presented as mean values and 99% CI. The values for the features (mean, left limit, right limit) are: feature 1 = (0.975, 0.952, 0.999), feature 2 = (0.735, 0.715, 0.756), feature 3 = (0.627, 0.597, 0.656), feature 4 = (0.470, 0.449, 0.490), feature 5 = (0.375, 0.353, 0.397), feature 6 = (0.314, 0.285, 0.342), feature 7 = (0.239, 0.217, 0.262), feature 8 = (0.198, 0.169, 0.228), feature 9 = (0.161, 0.128, 0.193), feature 10 = (0.119, 0.093, 0.145) (a); feature 1 = (0.968, 0.937, 0.999), feature 2 = (0.736, 0.707, 0.764), feature 3 = (0.541, 0.518, 0.565), feature 4 = (0.434, 0.403, 0.464), feature 5 = (0.315, 0.290, 0.339), feature 6 = (0.253, 0.220, 0.286), feature 7 = (0.177, 0.156, 0.197), feature 8 = (0.140, 0.114, 0.166), feature 9 = (0.111, 0.078, 0.144), feature 10 = (0.079, 0.053, 0.106) (b); and feature 1 = (0.971, 0.944, 0.999), feature 2 = (0.847, 0.816, 0.878), feature 3 = (0.698, 0.667, 0.730), feature 4 = (0.533, 0.512, 0.555), feature 5 = (0.458, 0.430, 0.487), feature 6 = (0.371, 0.344, 0.399), feature 7 = (0.298, 0.272, 0.325), feature 8 = (0.242, 0.216, 0.269), feature 9 = (0.198, 0.169, 0.227), feature 10 = (0.163, 0.130, 0.196) (c). d, Histogram of the prediction error when training in non-LAC dataset (n = 967) and testing in LAC dataset (n = 477). e, Violin plot of the distribution and statistical comparison of training and testing with different regions using a two-sided permutation test without multiple comparisons (5,000 algorithm iterations) with a result of P < 1 × 10−15. Mean, first quartile (q1), third quartile (q3), whisker low, whisker high, minima and maxima values for violin plots are: LAC/non-LAC (−2.52, −7.74, 3.31, −22.52, 17.33, −22.52, 17.33); non-LAC/LAC (5.60, 0.85, 12.14, −12.82, 27.75, −12.82, 27.75). f, Violin plot of the distribution and statistical comparison of testing the models on females (n = 261) and males (n = 216) in LAC using a permutation test (5,000 iterations) with a result of P = 0.042. Mean, q1, q3, whisker low, whisker high, minima and maxima values for violin plots are: male (3.66, −1.83, 9.45, −12.49, 16.32, −12.49, 16.32); and female (6.93, 2.21, 12.78, −12.82, 27.75, −12.82, 27.75). ROI, region of interest. This figure was partially created with BioRender.com (fMRI device).

Fig. 3: EEG training and testing the deep learning model in different samples.
figure 3

a, OLS regression comparing chronological age versus predicted age with the feature importance list for training (n = 1,644) and testing (n = 411) in the whole sample (P < 1 × 10−15). b, Regression comparing chronological age versus predicted age with the feature importance list for training (n = 471) and testing (n = 118) in the non-LAC dataset (P < 1 × 10−15). c, Regression comparing chronological age versus predicted age with the feature importance list for training (n = 1,188) and testing (n = 298) in the LAC dataset (P = 3.51 × 10−7). For a, b and c, the bars show the brain region feature importance list in descending order, with ring plots and glass brain representations of the most important network-edge connections. Feature importance (top 10) data are presented as mean values and 99% CI. The values for the features (mean, left limit, right limit) are: feature 1 = (0.968, 0.946, 0.991), feature 2 = (0.759, 0.739, 0.779), feature 3 = (0.644, 0.617, 0.670), feature 4 = (0.531, 0.500, 0.561), feature 5 = (0.410, 0.384, 0.436), feature 6 = (0.336, 0.309, 0.363), feature 7 = (0.259, 0.239, 0.279), feature 8 = (0.218, 0.191, 0.245), feature 9 = (0.184, 0.150, 0.217), feature 10 = (0.146, 0.114, 0.177) (a); feature 1 = (0.967, 0.935, 0.999), feature 2 = (0.764, 0.741, 0.786), feature 3 = (0.569, 0.549, 0.590), feature 4 = (0.460, 0.435, 0.485), feature 5 = (0.354, 0.330, 0.377), feature 6 = (0.283, 0.256, 0.311), feature 7 = (0.216, 0.192, 0.241), feature 8 = (0.169, 0.145, 0.193), feature 9 = (0.129, 0.107, 0.150), feature 10 = (0.101, 0.077, 0.124) (b); feature 1 = (0.972, 0.949, 0.995), feature 2 = (0.833, 0.805, 0.860), feature 3 = (0.705, 0.677, 0.733), feature 4 = (0.564, 0.543, 0.584), feature 5 = (0.488, 0.463, 0.514), feature 6 = (0.408, 0.385, 0.431), feature 7 = (0.363, 0.334, 0.393), feature 8 = (0.292, 0.269, 0.314), feature 9 = (0.243, 0.222, 0.264), feature 10 = (0.221, 0.188, 0.254) (c). d, Histogram of the prediction error when training in non-LAC dataset (n = 569) and testing in LAC dataset (n = 1,486). e, Violin plot of the distribution and statistical comparison of training and testing with different regions using a two-sided permutation test without multiple comparisons (5,000 algorithm iterations) with a result of P < 1 × 10−15. Mean, q1, q3, whisker low, whisker high, minima and maxima values for violin plots are: LAC/non-LAC (−2.34, −6.07, 1.26, −13.25, 11.52, −20.08, 17.52); non-LAC/LAC (5.24, 1.95, 8.61, −5.24, 16.18, −12.73, 16.18). f, Violin plot of the distribution and statistical comparison of testing the models on females and males using a permutation test (5,000 iterations) with a result of P = 0.012. Mean, q1, q3, whisker low and whisker high values for violin plots are: male (3.66, 1.87, 7.83, −5.24, 16.18, −12.73, 16.18); female (6.19, 2.67, 9.39, −3.08, 15.52, −3.08, 15.52). This figure was partially created with BioRender.com (EEG device).

Brain-age gap in non-LAC datasets

Using the same data split ratio, we trained and tested the models in non-LAC datasets. As shown in Figs. 2b and 3b, our models predicting brain age yielded considerable goodness of fit (fMRI: R2 = 0.40, P < 0.001, f2 = 0.67; EEG: R2 = 0.43, P < 0.001, f2 = 0.76). The r.m.s.e. values were also adequate (fMRI-r.m.s.e. = 8.66; EEG-r.m.s.e. = 6.54). Mean directional errors (MDE) for fMRI and EEG were 0.69 and 1.07, respectively. For both fMRI and EEG, the main predictive features were hubs in frontoposterior networks including the superior frontal gyrus (dorsolateral), the precentral gyrus and the middle occipital gyrus (Figs. 2b and 3b). Additional critical nodes for the fMRI model included the inferior and middle frontal gyri, and the anterior and median cingulate and paracingulate gyri (Fig. 2b). For EEG, key nodes also comprised the superior and inferior occipital gyri, and the superior parietal gyrus (Fig. 3b). In brief, models trained on non-LAC datasets exhibited strong fit values and predictive features as in the overall dataset analysis.

Brain-age gap in LAC datasets

When trained and tested in the LAC datasets (Figs. 2c and 3c), models demonstrated moderate goodness of fit indexes but were less precise, as indicated by higher r.m.s.e. values (fMRI = 11.91; EEG = 9.82). We observed increased positive biases in the MDE measures compared with the non-LAC models (fMRI = 3.18; EEG = 5.34). Again, the main features involved frontoposterior networks. Common nodes for fMRI and EEG included the superior and middle occipital gyri, the superior and inferior parietal gyri, and the superior and middle frontal gyri (Figs. 2c and 3c). For EEG, the model also highlighted the precentral gyrus and the inferior occipital gyrus (Fig. 3c). Thus, models trained on LAC datasets showed moderate fit and positive biases (older brain age) in frontotemporal nodes (fMRI and EEG), compared with non-LAC models.

Cross-regional effects in model generalization

We investigated the effects of cross-region training and testing with data from non-LAC and LAC. Training with non-LAC data and testing on LAC data led to biases in predicting older brain ages than the respective chronological ages as shown by positive MDE values (Figs. 2d and 3d; fMRI: MDE = 5.60, r.m.s.e. = 9.44; EEG: MDE = 5.24, r.m.s.e. = 7.23). By contrast, training on LAC and testing on non-LAC resulted in negative age biases predicting younger brain age shown by the MDE (Figs. 2d and 3d; LAC/non-LAC fMRI: MDE = −2.52, r.m.s.e. = 8.41; LAC/non-LAC EEG: MDE = −2.34, r.m.s.e. = 5.69). Sex differences were observed in the brain-age gaps when training in the non-LAC and testing in LAC (Fig. 4a,b). Specifically, female participants in LAC exhibited a greater bias towards older brain age than males (fMRI: P = 0.04; EEG: P = 0.03). In conclusion, training with non-LAC data and testing on LAC data resulted in a bias towards predicting older brain ages, especially for female participants in LAC.

Fig. 4: Groups, sex and macrosocial influences in brain-age gaps.
figure 4

a,b, Violin plots for the distribution of prediction gaps for different groups and sex effects using (a) fMRI and (b) EEG datasets. Statistical comparisons were calculated using two-sided subsample permutation testing without multiple comparisons and with 5,000 algorithm iterations. c, Associations between macrosocial and disease disparity factors with brain-age gaps were assessed with a multi-method approach comprising SHAP values, feature importance (MDI) and permutation importance. Plots show the mean importance values for each method, along with their 99% CI, as well as the average R2 and Cohen’s f². *Features whose lower CI boundary does not cross zero. Shaded bars indicate significance across the three methods. We conducted a two-sided F-test to evaluate the overall significance of the regression models. The three models were significant: healthy controls LAC (R² = 0.37 (99% CI ±0.17), F² = 0.59 (99% CI ±0.21), r.m.s.e. = 6.9 (99% CI ±0.92), F = 138.78 (P < 1 × 10−15)); healthy controls non-LAC (R² = 0.41 (99% CI ±0.17), F² = 0.71 (99% CI ±0.21), r.m.s.e. = 6.57 (99% CI ±1.31), F = 135.91 (P < 1 × 10−15)) and total dataset (R² = 0.41 (99% CI ±0.12), F² = 0.71 (99% CI ±0.14), r.m.s.e. = 6.76 (99% CI ±0.89), F = 253.39 (P < 1 × 10−15)). The relevance of the features and their respective CI values are available in Supplementary Table 2. F, females; HC LAC, healthy controls from LAC; HC non-LAC, healthy controls from non-LAC; M, males. This figure was partially created with BioRender.com (fMRI and EEG devices).

Accelerated aging in MCI, Alzheimer disease and bvFTD

We investigated the effects of testing the controls-trained model (80%) on different subsamples across the different neurocognitive health and disease spectrum (controls non-LAC, controls LAC, MCI, Alzheimer disease and bvFTD) (Table 1), matched by age, sex and education. Permutation subsample analyses with 5,000 iterations revealed statistically significant brain-age gaps between the non-LAC and LAC control groups (Figs. 4a,b; fMRI: P < 0.01; EEG: P < 1 × 10−5). This difference was also observed for Alzheimer disease in the fMRI dataset (P < 1 × 10−5). In addition, for fMRI, we found significant differences between controls from non-LAC and all clinical groups from the same region (MCI (P < 1 × 10−5), Alzheimer disease (P < 1 × 10−5) and bvFTD (P < 1 × 10−5)). Similarly, for both fMRI and EEG, we observed significant differences between controls from LAC and all the clinical groups (fMRI: MCI (P < 1 × 10−5), Alzheimer disease (P < 1 × 10−5) and bvFTD (P < 1 × 10−5); EEG: MCI (P < 1 × 10−5), Alzheimer disease (P < 1 × 10−5) and bvFTD (P < 0.01)). Across fMRI and EEG datasetsf, both LAC and non-LAC, we observed a gradient of increasing brain age from controls to MCI to Alzheimer disease. The MCI groups from LAC and non-LAC significantly differed from Alzheimer disease (fMRI and EEG: P < 1 × 10−5) and bvFTD (fMRI: P < 1 × 10−5; EEG: P < 0.01) in the respective regions, with older brain ages for Alzheimer disease and bvFTD. For the fMRI and EEG non-LAC datasets, the Alzheimer disease group also showed an older brain age than the bvFTD group (P < 0.01). Thus, larger brain-age gaps were observed in LAC compared with non-LAC groups and across clinical groups, with ascending brain age from controls to MCI to dementia.

Table 1 Demographics for fMRI and EEG datasets

Sex differences in neurocognitive disorders

For fMRI, we analyzed the differences between male and female participants with the same diagnosis for the non-LAC and LAC datasets. There were no significant differences among groups from non-LAC datasets (Fig. 4a,b). However, females with Alzheimer disease from LAC exhibited significantly greater brain-age gaps compared with the respective males (fMRI: P < 1 × 10−3; EEG: P < 0.001). No other effects were observed. We conducted a supplementary analysis incorporating country-level gender inequality indexes (GII), sex, region (LAC versus non-LAC) and individual neurocognitive status (healthy controls versus MCI, Alzheimer disease or bvFTD) as predictors of brain-age gaps. The model demonstrated good performance (R² = 0.40, F² = 0.66, r.m.s.e. = 6.85, P < 1 × 10−15) and all predictors were influential. We found that female participants with a neurocognitive disorder living in countries with high gender inequality—particularly from LAC—were associated with higher brain-age gaps (Extended Data Fig. 1 and Supplementary Table 1). Overall, females with Alzheimer disease from LAC exhibited significantly greater brain-age gaps compared with males, influenced by high gender inequality in their countries.

Factors associated with brain-age gap

We used gradient-boosting regression models to explore the influence of physical and social factors, as well as factors of disease disparities on the brain-age gap. Predictors included aggregate country-level measures of air pollution (PM2.5), socioeconomic inequality (Gini index) and burdens of communicable, maternal, prenatal and nutritional conditions, and noncommunicable diseases. We also leveraged the individual neurocognitive status (healthy controls versus Alzheimer disease, MCI or bvFTD). We assessed predictors’ importance using a multi-method approach comprising permutation importance, mean decrease in impurity (MDI) and SHapley Additive exPlanations (SHAP) values (Fig. 4c). Across both LAC and non-LAC datasets, the models (R² = 0.41, F² = 0.71, r.m.s.e. = 6.76, F = 304.25, P < 1 × 10−15) identified neurocognitive disorders (MCI, Alzheimer disease or bvFTD) and higher socioeconomic inequality (Gini index) as the most influential and consistent predictors of increased brain-age gaps (Fig. 4c). High levels of pollution and burden of noncommunicable and communicable diseases were also predictive of increased brain-age gaps, albeit less substantial. Stratified models for LAC (R² = 0.37, F² = 0.59, r.m.s.e. = 6.9, F = 138.78, P < 1 × 10−15) and non-LAC (R² = 0.41, F² = 0.71, r.m.s.e. = 6.57, F = 135.91, P < 1 × 10−15) also showed good performance, with neurocognitive disorders being the most influential predictor in both. In LAC, higher socioeconomic inequality was the second most consistent and influential predictor of larger brain-age gaps across the three models. Air pollution and burden of communicable and noncommunicable diseases were also influential. None of these variables were influential predictors in the non-LAC models. Predictors’ estimation coefficients are presented in Supplementary Table 2. In sum, neurocognitive disorders, followed by macrosocial factors linked to socioeconomic inequality, air pollution and health disparities were influential predictors of increased brain-age gaps, especially in LAC.

Sensitivity analyses

We performed multiple tests to assess the validity of the results. First, we investigated whether variations in fMRI or EEG data quality explained the differences in brain age between the non-LAC and LAC. Subsample permutation tests with 5,000 iterations showed no differences between any of the groups for fMRI (Fig. 5a) or EEG (Fig. 5b) data quality metrics. In addition, a linear regression examining scanner-type effects showed that the fMRI data quality metric did not predict the brain-age gaps (R2 = 0.001, P = 0.18, Cohen’s f2 = 0.001, Fig. 5c). To further test for scanner effects, we implemented a harmonization strategy by normalizing the brain-age gap variable within each scanner type. We used the min–max scaler to ensure consistent minimum and maximum values across scanners. Results using this harmonization (Fig. 5d) and our initial approach were very similar. Additional analyses controlling for datasets collected with eyes open versus eyes closed protocols revealed no significant differences in brain-age gaps across any groups (Extended Data Fig. 2).

Fig. 5: Sensitivity analysis.
figure 5

a, Violin plots for the distribution of data quality metrics of fMRI (healthy controls non-LAC, n = 967, MCI non-LAC n = 215, Alzheimer disease non-LAC n = 214, bvFTD non-LAC n = 190, HC LAC n = 477, MCI LAC n = 169, AD LAC n = 505, bvFTD LAC n = 216). b, Violin plots for the distribution of data quality metrics of EEG datasets (HC non-LAC n = 569, HC LAC n = 1486, MCI LAC n = 133, Alzheimer disease LAC n = 108, bvFTD LAC n = 57). Both a and b indicate null results between groups in terms of data quality. c, Linear regression effects of scanner type, evidencing that the fMRI data quality was not significantly associated with fMRI brain-age gaps differences (P = 0.184). d, fMRI brain-age gap differences across groups controlling for scanner differences. The statistical comparisons were calculated using two-sided subsample permutation testing with 5,000 iterations. NS, not significant; ODQ, overall data quality.

We also controlled for the effects of age and years of education on the brain-age gap from fMRI and EEG by including them as covariates in the group comparisons. All reported group differences remained significant after covariate adjustment (Supplementary Table 3). Years of education did not change the results for any analyses. In eight of the nine analyses, age did not have a significant effect. Considering the chronological age differences between the Alzheimer disease and MCI groups, we performed a sensitivity analysis using a subset of participants with MCI (fMRI: n = 254, mean age = 73.287 ± 7.517 years; EEG: n = 52, mean age = 63.231 ± 6.549 years) age matched to participants with Alzheimer disease (fMRI: n = 254, mean age = 72.295 ± 7.530 years, P = 0.13; EEG: n = 52, mean age = 62.769 ± 6.302 years, P = 0.71). These results (Extended Data Fig. 3) confirmed those reported for the overall MCI and Alzheimer disease datasets (Fig. 4a,b). For both fMRI and EEG datasets, we found significantly larger brain-age gaps in Alzheimer disease compared with MCI (fMRI: P < 1 × 10−5; EEG: P < 0.01). For fMRI, these differences were observed in both LAC (P < 1 × 10−5) and non-LAC (P < 1 × 10−5) datasets. We also found differences between participants with MCI from LAC versus non-LAC (P < 1 × 10−5) and participants with Alzheimer disease from LAC versus non-LAC (P < 1 × 10−5). Thus, controlling for data quality, scanner effects, age and education confirmed that the reported effects in brain-age gaps remained the same.

Discussion

Our study used brain clocks to capture the diversity and disparities across LAC and non-LAC datasets using fMRI and source space EEG techniques. Despite heterogeneity in signal acquisition and methods, we captured patterns of brain-age modulations in healthy aging from diverse datasets and participants with MCI, Alzheimer disease and bvFTD. Models trained and tested on non-LAC datasets showed greater convergence with chronological age. Conversely, models applied to LAC datasets indicated larger brain-age gaps, suggesting accelerated aging. We observed ascending brain-age gaps from controls to MCI to Alzheimer disease. Sex differences revealed an increased brain-age gap in females in the control and Alzheimer disease groups. Most brain clock patterns were independently confirmed and replicated across fMRI and EEG. Aggregate-level macrosocial factors, including socioeconomic inequality, pollution and burden of communicable/noncommunicable conditions modulated the brain-age gap, especially in LAC. Variations in signal quality, demographics or acquisition methods did not account for the results. The findings offer a framework that captures the multimodal diversity associated with accelerated aging in global settings.

Our results suggest that being from LAC is associated with accelerated aging. The better fit of the non-LAC compared to the LAC models supports the notion that universal models of brain phenotypes do not generalize well to underrepresented populations24,29,40. Diversity-related factors associated with different exposures and disease outcomes4,10,24,41 may influence the brain-age gaps in LAC and non-LAC. Neurocognitive disorders played a crucial role4,42. However, structural socioeconomic inequality, a distinctive characteristic of LAC15, increased levels air pollution43, and the burden of noncommunicable19,20 and communicable18,44 diseases are also important factors on the brain-age gap. The fact that these effects were larger in LAC suggests that underlying inequalities and adverse environmental and health conditions play a macrosocial, structural driving role11 in the observed regional differences. Immigration may also influence brain age through social determinants of health45 and genetic diversity. In LAC, tricontinental admixtures lead to substantial ancestral diversity within and across countries46, impacting dementia prevalence and brain phenotypes41. Future studies should consider these potential effects in brain-age gaps.

Selective brain networks were associated with larger brain-age gap in the clinical groups. Both fMRI and EEG models of brain-age gaps yielded large-scale frontoposterior high-order interactions1, consistent with models of brain age involving long-range connections between frontal, cingular, parietal, and occipital hubs, which may be more vulnerable to aging effects47,48,49. Also consistent with the cumulative nature of neurobiological changes over time50, brain-age gaps increased from controls through MCI to Alzheimer disease. A previous deep learning study using MRI and positron emission tomography in participants with MCI and dementia also indicated increased brain-age associated with disease progression23. Our results point to the brain age of MCI as being an intermediate stage between healthy aging and dementia39, and suggest that both fMRI and EEG markers of brain age may help identify groups at greater risk of progressing to dementia.

Sex and gender have been linked to poorer brain health outcomes27,51. Larger brain-age gaps in healthy controls and females with Alzheimer disease from LAC may relate to sex-specific conditions such as menopause, which involves brain volume reduction and increased amyloid-beta deposition52,53. Females also exhibit a disproportionate tau brain burden54, pronounced inflammatory dysregulation55 and lower basal autophagy56 compared with males, all of which increase Alzheimer disease risk. Such sex-specific factors are intertwined with environmental factors and gender disparities51. Females in countries with higher gender inequality exhibit greater cortical atrophy27. Our sex effects were specific for Alzheimer disease and LAC, consistent with the impacts of environmental41 versus genetic risks57 in Alzheimer disease and bvFTD, respectively. Despite advances in gender equality, women in LAC still face important obstacles58 including lower education, less income and healthcare access, and greater caregiving burden, potentially exacerbating brain health issues and Alzheimer disease risk59,60. Previous models for brain age have been conducted predominantly in high-income settings, ignoring sex and gender differences triggered by region-specific influences30,31. Thus, the inclusion of diverse samples can help to better understand the biological and environmental interaction with sex and gender disparities.

Our study had different strengths. We used diverse datasets across LAC and non-LAC including 15 countries, featuring large sample sizes, and replicated results across fMRI and EEG. We used an integrative approach to analyze fMRI and EEG data across a large and geographically diverse sample. The convergence of two neuroimaging techniques and population heterogeneity enhanced the generalizability to the computational models that capture diversity10. In particular, incorporating EEG offers affordable and scalable solutions for low-resourced settings, such as those in LAC, compared with traditional neuroimaging techniques1,35. Brain clocks based on high-order interactions capture many risks to brain health, and thus, offer an approach to personalized medicine, particularly for underrepresented populations. Our framework combines multiple dimensions of diversity in brain health, the Alzheimer disease continuum and related disorders within a single measure of brain clocks. Accessible metrics of accelerated aging can offer personalized assessments of diversity, aging, and neurocognitive disorders.

This study has multiple limitations. Our EEG dataset lacks representation from clinical groups in non-LAC, which may limit the generalizability. This issue is partially mitigated by the consistent results from the fMRI data, which included MCI, Alzheimer disease and bvFTD groups from LAC and non-LAC regions. Our approach to measure the brain-age gap is unimodal. Future research should adopt multimodal approaches to deepen our understanding of brain aging across different pathophysiological mechanisms1. We leveraged two independent training and test datasets with fMRI and EEG, with out-of-sample validation yielding consistent results across geographical comparisons, sex effects and clinical conditions. These datasets involve multimodal settings and recording parameters, suggesting that our results are replicable across highly variable conditions. However, future research should include more regions to further validate our findings. In addition, we did not include individual-level data on gender identity, socioeconomic status and ethnic stratification. Future research incorporating these variables could further enrich our understanding of brain age across diverse populations. Lastly, the sex differences observed between controls from LAC and non-LAC exhibited moderate effect sizes. Further research should assess sex differences in other regions.

In conclusion, brain clock models were sensitive to the impact of diversity involving geographical, sex, macrosocial and disease-based factors from diverse populations, despite the heterogeneity in data acquisition and processing. Utilizing a deep learning architecture of the brain’s high-order interactions1 across fMRI and EEG signals, combined with globally accessible and affordable data, our study paves the way for more-inclusive tools to assess disparities and diversity in brain aging. These tools can be vital in identifying MCI, Alzheimer disease and bvFTD risk factors, as well as characterizing and staging disease processes. In the future, personalized medicine approaches could leverage models of brain-age gaps to establish worldwide protocols for aging and neurocognitive disorders.

Methods

The total dataset consisted of 5,306 participants, with 2,953 undergoing fMRI and 2,353 EEG acquisitions. Of these, 3,509 were controls, 517 had MCI, 828 had Alzheimer disease and 463 had bvFTD.

fMRI dataset

The fMRI dataset involved 2,953 participants from both non-LAC (USA, China, Japan) and LAC (Argentina, Chile, Colombia, Mexico, Peru), including 1,444 healthy controls. Two hundred and fifteen participants met the Petersen criteria for MCI with a 24 Mini-Mental State Examination (MMSE) cut-off value, 719 were diagnosed as probable Alzheimer disease61, and 402 fulfilled the diagnostic criteria for bvFTD62. LAC participants were recruited from the Multi-Partner Consortium to Expand Dementia Research in Latin America (ReDLat, with participants from Mexico, Colombia, Peru, Chile and Argentina)63. Non-LAC participants were non-Latino individuals from ReDLat, the Alzheimer’s Disease Neuroimaging Initiative and the Neuroimaging in Frontotemporal Dementia repository. The datasets were matched on sex, age and years of education (Table 1). Sex information was determined by self-report. No information regarding gender was inquired. To ensure data reliability, we excluded subjects who reported a history of alcohol/drug abuse or psychiatric or other neurological illnesses.

EEG dataset

The total dataset involved 2,353 participants. Controls comprised 1,183 participants, including 737 from non-LAC (Turkey, Greece, Italy, UK and Ireland) and 446 from LAC (Cuba, Colombia, Brazil, Argentina and Chile). Participants presenting with clinical conditions were recruited from a multisite study with harmonized assessments25,36,63 in LAC (Argentina, Brazil, Chile and Colombia). This dataset included 133 patients with MCI, 108 with Alzheimer disease, and 57 with bvFTD. The controls datasets were matched on age, sex and years of education concerning the clinical groups (MCI, Alzheimer disease and bvFTD) (Table 1). Sex information was determined by self-report. No information regarding gender was inquired. The diagnostic criteria for MCI, Alzheimer disease and bvFTD were the same as those used for the fMRI dataset. No subject in any of the clinical conditions reported a history of alcohol/drug abuse, psychiatric, or other neurological illnesses.

Ethics approval

The local institutions that contributed EEGs and/or fMRIs to this study approved the acquisitions and protocols (Supplementary Data 1), and all participants signed a consent form following the declaration of Helsinki. The overall study was approved by the consortium under multiple institutional review boards (FWA00028264, FWA00001035, FWA00028864, FWA00001113, FWA00010121, FWAA00014416, FWA00008475, FWA00029236, FWA00029089 and FWA00000068). Data collection and analysis posed no risks concerning stigmatization, incrimination, discrimination, animal welfare, environmental, health, safety, security or personal concerns. No transfer of biological materials, cultural artifacts or traditional knowledge occurred. The authors reviewed pertinent studies from all countries while preparing the manuscript.

fMRI preprocessing

The images were obtained from different scanners and in distinct acquisition settings (Supplementary Table 4). We included closed and open eyes recordings to increase the sample size for resting-state fMRI (rs-fMRI) data. The type of resting-state recording was controlled by a dummy variable (open or closed eyes) when using the functional connectivity metric64. The resting state of fMRI preprocessing was conducted using the fmriprep toolbox (v.22.0.2). Additional preprocessing was performed using the CONN22 (ref. 64) toolbox and including smoothing with a Gaussian kernel of 6 × 6 × 6 mm, the signal denoising through linear regression to account for confounding effects of white matter, cerebrospinal fluid, realignment, and scrubbing. A band-pass filter (0.008–0.09 Hz) was applied. After time series preprocessing, we used region-of-interest analysis based on the brain regions of the Automated Anatomical Labeling (AAL90) atlas to reduce the dimensionality of the fMRI data for machine learning algorithms.

EEG preprocessing

EEGs were processed offline using procedures implemented in a custom, automatic pipeline for computing brain functional connectivity using a mesh model for multiple electrode arrays and source space estimation (see Supplementary Table 5 for acquisition parameters). The pipeline allows for the multicentric assessment of resting-state EEG (rsEEG) connectivity and has been validated in a large-scale evaluation of connectivity in dementia65. Recordings were re-referenced to the average reference and band-pass filtered between 0.5 and 40 Hz using a zero-phase shift Butterworth filter of order 8. Data were downsampled to 512 Hz, referenced using the reference electrode standardization technique, and corrected for cardiac, ocular and muscular artifacts using two methods based on independent component analysis. ICLabel (a tool for classifying EEG independent components into signals and different noise categories)66, and EyeCatch (a tool for identifying eye-related independent component analysis scalp maps) were used67. Data were visually inspected after artifact correction, and malfunctioning channels were identified and replaced using weighted spherical interpolations.

EEG normalization

Following guidelines for multicentric studies37, EEG was rescaled to reduce cross-site variability. The normalization was carried out separately for each dataset and consisted of the Z-score transformation of the EEG time series. The Z-score quantifies the distance of raw data from the mean in standard deviation units. The Z-score transformed EEG connectivity matrices display more prominent interhemispheric asymmetry and reinforced long-distance connections than unweighted connectivity representations65.

EEG source space estimation

The source analysis of the rsEEG was conducted using the standardized low-resolution electromagnetic tomography method (sLORETA). sLORETA allows estimating the standardized current density at each of the predefined virtual sensors located in the cortical gray matter and the hippocampus of a reference brain (MNI 305, Brain Imaging Centre, Montreal Neurologic Institute) based on the linear, weighted sum of a particular scalp voltage distribution or the EEG cross-spectrum at the sensor level. sLORETA is a distributed EEG inverse solution method based on an appropriate standardized version of the minimum norm current density estimation. sLORETA overcomes problems intrinsic to the estimation of deep sources of EEG and provides exact localization to test seeds, albeit with a high correlation between neighboring generators.

The different electrode layouts were registered onto the scalp MNI 152 coordinates. A signal-to-noise ratio of 1 was chosen for the regularization method used to compute the sLORETA transformation matrix (forward operator for the inverse solution problem). The standardized current density maps were obtained using a head model of three concentric spheres in a predefined source space of 6,242 voxels (voxel size = 5 mm3) of the MNI average brain. A brain segmentation of 82 anatomic compartments (subcortical and cortical areas) was implemented using the automated anatomical labeling (AAL90) atlas. Current densities were estimated for the 153,600 voltage distributions comprising the 5 min of rsEEG (sampled at 512 Hz). The voxels belonging to the same AAL region were averaged such that a single (mean) time series was obtained for each cortical region32,68,69.

High-order interactions

After preprocessing 82 time series from the AAL brain parcellation for fMRI and EEG, we calculated the high-order interactions across triplets composed of a region i and region j and a set comprising all the brain regions without i and j. We evaluated high-order interactions using the organizational information (Ω) metric, a multivariate extension of Shannon’s mutual information, which assesses the dominant characteristic of multivariate systems (high-order interactions). To operationalize the Shannon entropy, we used the Gaussian copula approximation, which estimates the differential Shannon’s entropy from the covariance matrix of the Gaussian copula transformed data70. This is a mixture of a parametric and a nonparametric approach, as the copula is preserved in a nonparametric way but is then used to generate Gaussian marginals. The Ω quantifies the balance between redundancy and synergy in high-order interactions among brain regions. By definition, Ω > 0 implies that the interdependencies are better described as shared randomness, indicating redundancy dominance. Conversely, Ω < 0 suggests that the interdependencies are better explained as collective constraints, indicating synergy dominance. After normalization, its magnitude ranges from −1 to 1. Ω can be expressed as:

$$\varOmega \left({\mathbf{{X}}}^{n}\right)=\left(n-2\right)H\left({\mathbf{{X}}}^{n}\right)+\sum_{j=1}^{n}\left[H\left({X}_{j}\right)-H({\mathbf{{X}}}_{-j}^{n})\right]$$
(1)

where Xn is the random vector that describes the system and H is the Shannon’s entropy. When n is reduced to three variables (x, y and z), Ω can be expressed as

$$\varOmega \left(x,y,z\right)=H(x,y,z)-H(x,y)-H(x,z)-H(\;y,z)+H(x)+H(\;y)+H(z)$$
(2)

To analyze brain activity, z can be considered a multivariate time series representing the activity of all brain regions except for x and y. Therefore, O-info measures how synergistic or redundant is the relationship between two brain regions concerning the rest of the regions.

Model input preprocessing

As input to the models, the weighted adjacency matrix corresponding to the Ω metric was converted to a graph. This matrix defines the edges in the graph, where the weight of each edge reflects the Ω value between the corresponding regions. The feature vectors at each graph node are derived from the O-info matrix; specifically, each node’s feature vector is the corresponding row in the Ω matrix. To this end, the connectivity matrices were first converted to tensors using the PyTorch deep learning library v.2.3.0, enabling their efficient manipulation. These tensors were reshaped, organizing the connectivity data into a structure where each tensor represented the features of nodes within a graph. This transformation preserved the relational information from the original matrices, making it accessible for analysis by graph neural networks. To ensure the integrity of the data, graphs containing not a number (NaN) values, either in their features or target values, were filtered out. The remaining graphs were then split into training and validation sets using a stratified split to ensure a balanced representation of age groups in both sets.

Data augmentation

We used augmentation tailored for connectivity matrices to make the model more resilient to heterogeneity and generalizability. Linear interpolation between matrices corresponding to neighboring age values was used, in contrast to traditional image augmentation techniques such as random rotations or crops that are inappropriate for connectivity data.

Given two matrices, M1 and M2, representing fMRI or EEG connectivity at ages ɑ1 and ɑ2, respectively, the interpolation to produce a matrix for a target age where ɑ1 < ɑt < ɑ2 was conducted using the formula:

$${M}_{t}=\left(1-\alpha \right){M}_{1}+\alpha {M}_{2}$$
(3)

Here, \(\alpha =\frac{{a}_{t}-{a}_{1}}{{a}_{2}-{a}_{1}}\) represents the interpolation factor.

This augmentation method enabled the generation of fMRI and EEG connectivity matrices for age values previously absent in the data set. The derived matrices, through interpolation, ensure a smooth transition in the fMRI and EEG patterns from one age value to another, thereby maintaining the inherent physiological significance of the original data—preliminary validation against a hold-out dataset showed improvements in model fit against dataset heterogeneity. We included 500 samples with data augmentation only the training datasets for both modalities, half for the non-LAC and half for the LAC samples.

The architecture of the models

Two GCNs71 were designed, tailored to process graph-structured data. We used the PyTorch Geometric code library v.2.5.3 based on the PyTorch library v.2.3.0 to develop and train the models. Two models were created, one for fMRI data and another for EEG data. Unlike traditional convolutional networks suited for neuroimaging data, functional connectivity demands a specialized approach because neighboring data points are not necessarily close in native space (adjacent brain areas). The GCN uses adjacency matrices of graphs as inputs comprised of node features. Each node in the graph aggregates features from its neighbors through a series of operations, including multiplication by a normalized adjacency matrix, transformation using a weight matrix, and applying an activation function, here the ReLU72. The architecture consisted of two graph convolutional layers. The input features (O-info matrix) were passed through the first convolutional layer, followed by a ReLU activation function and a dropout layer for regularization. The features were then passed through the second convolutional layer. Finally, average pooling was used to aggregate the output features. To train the two models, we combined mean squared error as the loss function and the Adam optimizer. Given the variability in the data and potential model configurations, we implemented a hyperparameter tuning process using a grid search over specified learning rates and epoch numbers. For each model for the controls, the data was initially split into 80% for training and validation, and 20% for hold-out testing. Within the 80% training and validation set, we applied fivefold cross-validation to determine the optimal hyperparameters for the model. After determining the best hyperparameters through this cross-validation process, the final model’s performance was evaluated on the remaining 20% hold-out test set to assess its generalization capability73.

Statistical analyses

Following hyperparameter tuning, each model was retrained using the best hyperparameters on the training set and evaluated on the test set. For a more comprehensive assessment, the predicted age values were compared with the actual age values using Pearson’s correlation coefficient, R2 and Cohen’s f2 effect size for each model74. We used the method outlined below to evaluate if the model was predicting increased or decreased ages concerning the actual chronological age. All statistical analyses were run using Python v.3.9.13.

The MDE is a diagnostic metric used to evaluate the prediction accuracy of the models, specifically focusing on the direction of prediction gaps rather than their magnitude to detect bias. It is calculated as follows:

$${\mathrm{MDE}}=\frac{1}{n}\mathop{\sum }\limits_{i=1}^{n}(\;{y}_{i}-{\hat{y}}_{i})$$
(4)

The function ‘sign’ yields a value of +1 if the prediction is above the actual value, −1 if below, and 0 if they are equal. yi is the real age of subject i and ŷi is the predicted age. An MDE value close to zero suggests a balanced number of overestimations and underestimations. Positive or negative values indicate systematic biases in the prediction method, where a positive MDE means the model generally overpredicts, and a negative MDE indicates underprediction.

We examined potential regional biases in predictive accuracy and possible sex effects or signal acquisition noise. The statistical approach involved conducting permutation tests (5,000 subsample iterations each), a nonparametric statistical test that does not assume a specific distribution of the data. Given the nature of the permutation test, our analysis constituted two-sided tests, assessing the likelihood of observing the obtained difference under the null hypothesis of no difference between the models. Although the permutation test alleviates the need for normality assumptions, making it resilient to deviations from normal distribution, it addresses multiple comparison concerns by evaluating the empirical distribution of the test statistic under the null hypothesis.

We compared the adequacy of the models using the r.m.s.e. This is a metric to quantify the discrepancies between predicted and observed values in modeling, given by the formula:

$${{\mathrm{r.m.s.e.}}}=\sqrt{\frac{1}{N}\mathop{\sum }\limits_{i=1}^{n}{(\;{y}_{i}-{\hat{y}}_{i})}^{2}}$$
(5)

In this equation, yi is the observed value, \(\hat{{y}_{i}}\) is the predicted value and N is the total number of observations. The r.m.s.e. measures the average magnitude of errors between predicted and actual observations. The squaring process results in a higher weight to outliers, making it a useful measure to evaluate if a model is robust to outliers.

To evaluate feature importance, we used bootstrapping to assess the significance of individual nodes (brain areas) and edges (connections between brain nodes/regions) within the graph neural network. With this approach, we executed a two-step process to quantify the node and its edge’s impact on the model’s predictions. Initially, the model’s output was calculated with all nodes and its edges present to establish a baseline performance metric. Subsequently, the analysis was repeated after removing each node and edge at a time, thus simulating network information absence. The difference in the model’s output, with and without each area and edge was quantified, providing a measure of the network node importance. This process was repeated across multiple bootstrap testing dataset samples (n = 5,000) to calculate confidence intervals (CI). Finally, a feature importance list of nodes was generated in descending order of importance for brain-age prediction. This methodological framework allowed for an analysis of network-level contributions to each model’s overall predictive performance.

Gradient-boosting regression models

We used gradient-boosting regression models75 to investigate the impact of factors associated with the physical and social exposomes, and disease disparities, on brain-age gaps between LAC and non-LAC populations. As predictors, we included country-level measures of: (1) air pollution (PM2.5 exposure); (2) socioeconomic inequality (the Gini index)76; (3) the burden of communicable, maternal, prenatal and nutritional conditions; and (4) the burden of noncommunicable diseases. These indicators were sourced from the updated country-specific data provided on the World Bank’s platform (https://databank.worldbank.org/). In addition, individual neurocognitive status (being controls versus having Alzheimer disease, MCI or bvFTD) was included as predictor. Brain-age gaps from fMRI and EEG datasets were the outcomes.

Models were trained using 90% of the dataset and subsequently tested on an independent 10% subset, using a 10-fold cross-validation framework. The cross-validation was repeated 10 times. Within each iteration, estimation coefficients for the predictors, as well as the R2, Cohen’s f² (ref. 74) and r.m.s.e., were computed. We assessed feature importance using a multi-method approach incorporating permutation importance, features importance based on the MDI and SHAP values77. We provided the mean importance values for each method, along with their 99% CI, as well as the average R2 and Cohen’s f² (ref. 74). Features whose lower confidence interval boundary crosses zero are considered nonsignificant. To optimize Ridge’s hyperparameters, Bayesian optimization was used.

Following the same multi-method approach, we conducted gradient-boosting regressions to explore the effect of gender inequality and sex on brain-age gaps. As predictors, we included: (1) the country-level GII, a composite metric measuring reproductive health, empowerment and the labor market; (2) sex; (3) region (LAC versus non-LAC); and (4) individual neurocognitive status (healthy controls versus Alzheimer disease, MCI or bvFTD). Brain-age gaps from fMRI and EEG were the outcomes.

Data quality assessment

For the fMRI overall data quality (ODQ) metric, each time series was segmented in 20 repetition time (TR) length to evaluate the temporal signal-to-noise ratio (tSNR)78, which is calculated as the mean fMRI signal divided by its standard deviation within each segment. Segments with tSNR above a threshold of 50 were classified as high quality78. As additional evaluations, we checked the variability of the tSNR segments of all the time series in the brain to check for spatial consistency. Lastly, we checked for remaining outliers as signal spikes from movement or transient gradient artifacts. Thus, the fMRI ODQ was computed as a percentage of good segments considering its tSNR, low spatial variability and the number of segments not having spikes from movement or transient gradient remaining artifacts.

For the EEG data quality assessment79, signals were divided into 1-s segments, and the quality of each segment was evaluated using four specific metrics. These metrics included the detection of weak or constant signals based on standard deviation, the identification of artifacts through signal amplitude ratios, the presence of high-frequency noise and low correlation between channels. The EEG ODQ was then calculated as the percentage of segments exhibiting good quality. A value of 0 indicated that all segments were of poor quality, whereas a value of 100 indicated that all segments were of high quality.

Sensitivity analyses

We examined whether variations in fMRI or EEG data quality explained the differences in brain age between the non-LAC and LAC, comparing different groups’ fMRI78 and EEG79 data quality metrics, with subsample permutation tests with 5,000 iterations for each comparison. In addition, we conducted a linear regression to examine the association between the fMRI data quality metrics and the brain-age gaps. To further control for scanner effects, we implemented an additional harmonization strategy in the fMRI training dataset. This method involves normalizing the brain-age gap variable within each scanner type by scaling the data to a fixed range using the min–max scaler14. This ensures that the minimum and maximum values of the brain-age gap variable are consistent across different scanners, thereby reducing variability caused by scanner differences. In addition, we accounted for the sign of the brain-age gap after normalization to maintain the interpretability of positive and negative values. This procedure adjusts for location and scale differences (for example, mean and variance) across sites, minimizing scanner-related variability.

We used permutation tests (5,000 subsample iterations each) to compare the brain-age gaps between subsamples of participants undergoing fMRI with open versus closed eyes. We included 124 controls with closed eyes and 86 with open eyes, 269 Alzheimer disease with closed eyes and 164 with open eyes, and 88 bvFTD with closed eyes and 69 with open eyes. Notably, all MCI participants underwent fMRI with open eyes. Our findings revealed no significant differences in brain-age gaps when analyzing data from open versus closed eyes conditions across all group comparisons (permutation test = 5,000 iterations).

Ethics and inclusion statement

This work involved a collaboration between researchers in multiple countries. Contributors from different sites are included as coauthors according to their contributions. Researchers residing in low and middle income countries (LMIC) were involved in study design, study implementation, methodological procedure, writing and reviewing processes. The current research is locally relevant due to the larger disparities observed in LAC and other regions. Roles and responsibilities were agreed among collaborators ahead of the research. Ethics committees approved all research involving participants. To prevent any stigmatization, all identifying information has been removed to preserve the privacy of individuals. We endorse the Nature Portfolio journals’ guidance on LMIC authorship and inclusion. Authorship was based on the intellectual contribution, commitment, and involvement of each researcher in this study. We included authors born in LMICs and other underrepresented countries.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.