Reconstructing lost BOLD signal in individual participants using deep machine learning

Yan, Yuxiang; Dahmani, Louisa; Ren, Jianxun; Shen, Lunhao; Peng, Xiaolong; Wang, Ruiqi; He, Changgeng; Jiang, Changqing; Gong, Chen; Tian, Ye; Zhang, Jianguo; Guo, Yi; Lin, Yuanxiang; Li, Shijun; Wang, Meiyun; Li, Luming; Hong, Bo; Liu, Hesheng

doi:10.1038/s41467-020-18823-9

Download PDF

Article
Open access
Published: 07 October 2020

Reconstructing lost BOLD signal in individual participants using deep machine learning

Yuxiang Yan^1,2^na1,
Louisa Dahmani^1,3^na1,
Jianxun Ren^1,4^na1,
Lunhao Shen^1,4,
Xiaolong Peng ORCID: orcid.org/0000-0002-4488-9628¹,
Ruiqi Wang¹,
Changgeng He^1,4,
Changqing Jiang ORCID: orcid.org/0000-0003-1666-8120⁴,
Chen Gong⁴,
Ye Tian ORCID: orcid.org/0000-0002-0657-1600⁴,
Jianguo Zhang⁵,
Yi Guo⁶,
Yuanxiang Lin⁷,
Shijun Li¹,
Meiyun Wang³^na2,
Luming Li^4,8^na2,
Bo Hong²^na2 &
…
Hesheng Liu ORCID: orcid.org/0000-0002-7233-1509^1,8,9^na2

Nature Communications volume 11, Article number: 5046 (2020) Cite this article

6999 Accesses
16 Citations
25 Altmetric
Metrics details

Subjects

Abstract

Signal loss in blood oxygen level-dependent (BOLD) functional neuroimaging is common and can lead to misinterpretation of findings. Here, we reconstructed compromised fMRI signal using deep machine learning. We trained a model to learn principles governing BOLD activity in one dataset and reconstruct artificially compromised regions in an independent dataset, frame by frame. Intriguingly, BOLD time series extracted from reconstructed frames are correlated with the original time series, even though the frames do not independently carry any temporal information. Moreover, reconstructed functional connectivity maps exhibit good correspondence with the original connectivity maps, indicating that the model recovers functional relationships among brain regions. We replicated this result in two healthy datasets and in patients whose scans suffered signal loss due to intracortical electrodes. Critically, the reconstructions capture individual-specific information. Deep machine learning thus presents a unique opportunity to reconstruct compromised BOLD signal while capturing features of an individual’s own functional brain organization.

Frequency-specific brain network architecture in resting-state fMRI

Article Open access 20 February 2023

Intracranial electrophysiological and structural basis of BOLD functional connectivity in human brain white matter

Article Open access 09 June 2023

An integrated resource for functional and structural connectivity of the marmoset brain

Article Open access 01 December 2022

Introduction

The blood oxygen level-dependent (BOLD) signal, acquired during functional magnetic resonance imaging (fMRI), is subject to a number of artifacts, such as magnetic susceptibility artifacts and interference from metal implants. For example, intracortical electrodes implanted in patients interfere with the BOLD signal, potentially due to their lead connectors, resulting in a significant signal loss in brain regions close to the connection site on the skull¹. This hampers studies that investigate whole-brain activity and functional connectivity and may result in misinterpretation of findings. To date, there are no post-processing MRI methods that can mitigate such interference.

A newly proposed deep machine learning model, called deep convolutional generative adversarial networks (DCGAN), provides a possible solution for reconstructing lost information^2,3,4,5,6. In the DCGAN approach, two networks—a generator and a discriminator—are pitted one against the other and are trained and optimized simultaneously. Remarkably, it does not simply assemble pieces of images it was trained on, but rather generates new images that are internally cohesive. For example, DCGAN models can successfully fill in missing portions of photographs of human faces⁷ and create pictures of human faces, birds, and even art. Like photographs, BOLD images carry internally cohesive information. Embedded within resting-state data, for example, is information about BOLD signal fluctuations in each cortical surface vertex⁸, from which we can extract meaningful information such as functional connectivity and task-evoked brain activity⁹.

Here, we show that DCGAN can be harnessed to learn individual patterns of brain activity and generate BOLD signals in artificially and non-artificially compromised cortical regions. We trained a deep learning model on a sample of the Brain Genomics Superstruct Project (GSP) data set¹⁰, containing intact BOLD frames from healthy young adult participants (Fig. 1a). We used the trained model to reconstruct BOLD images, frame by frame, in an independent test sample from the GSP data set, in which we had artificially removed cortical surface regions of different sizes (Fig. 1b). Although the individual input frames did not carry information about the evolution of the BOLD signal through time, we set out on the ambitious goal of investigating the times series and functional connectivity (FC) maps extracted from the reconstructed frames. We hypothesized that the reconstructed times series and FC maps would bear high similarity to the original ones. Additionally, the large amount of resting-state data that was available enabled us to calculate individual-level functional connectivity^11,12. We thus tested whether machine learning can be used to reconstruct individual-specific information, or whether its ability is limited to generating images based on group-level information. We replicated our analyses using the Human Connectome Project (HCP) data set¹³ and compared our DCGAN reconstructions to those generated through a simple diffusion-based algorithm. Finally, we tested the DCGAN model in a clinical application, where we acquired a unique data set by collecting extensive resting-state fMRI data both before and after electrode implantation surgery in patients with Parkinson’s disease. We sought to reconstruct regions in the post-operative scans that suffered substantial interference from the deep-brain stimulation (DBS) electrodes and connectors. The availability of pre-implantation scans meant we had a reference against which to compare the reconstructed images, assuming functional connectivity is stable and unchanged by the implantation surgery. We hypothesized that the reconstructed BOLD signals from the post-surgical data would be highly similar to the pre-surgical BOLD signals and that they would reflect patterns of activity that are specific to the individual.

**Fig. 1: Method for reconstructing lost signal in blood oxygen level-dependent (BOLD) images using deep convolutional generative adversarial networks (DCGAN).**

Results

Reconstructed BOLD signals are correlated with original signals

The DCGAN model was trained on a resting-state fMRI data set of 80 randomly chosen participants (240 frames for each participant, in vertex space) from the publicly available GSP database¹⁰. In an independent test data set comprised of 20 participants, we artificially removed the BOLD signal in various cortical surface regions and used the trained generator to reconstruct compromised BOLD frames in vertex space.

We found that the reconstructed BOLD frames appear similar to the original intact images (see Fig. 2a for an example). To quantitatively evaluate the reconstructive accuracy of the DCGAN model, we concatenated the reconstructed frames and compared the reconstructed and original time series. This is a particularly challenging endeavor, as images are reconstructed at each time point, and each frame does not independently hold temporal information. To conduct this comparison, we assessed the correlation between the original and reconstructed BOLD time series for each vertex within each of the artificially compromised regions, located in different lobes (see “Methods” for more details), and calculated the overall average of these correlations across all participants. Table 1 shows the correlation r coefficients and statistical values of this model (along with those of other models and data sets, described later). We found significant positive correlations, using multilevel linear models corrected for multiple comparisons using the Bonferroni correction: temporal cortex region: r = 0.33 ± 0.02 (95% confidence intervals (CI) [0.32, 0.34], t(19) = 65.49, bootstrapped p < 0.001), lateral frontal cortex region: r = 0.16 ± 0.07 (CI [0.12, 0.19], t(19) = 9.74, p < 0.001), medial frontal cortex region: r = 0.23 ± 0.04 (CI [0.21, 0.25], t(19) = 23.24, p < 0.001), parietal cortex region: r = 0.25 ± 0.03 (CI [0.24, 0.27], t(19) = 37.03, p < 0.001), and occipital cortex region: r = 0.37 ± 0.06 (CI [0.34, 0.40], t(19) = 26.65, p < 0.001) (Table 1). Although these correlations are low to moderate, the sheer fact that the DCGAN model was able to learn individual-specific features and capture part of the BOLD fluctuations in regions with complete signal loss is impressive. Figure 2b shows an example of a reconstruction of the lateral temporal cortex in one participant, where the correlations between the reconstructed and original time series of two randomly selected vertices are r = 0.67, p < 0.001 for Vertex 1 and r = 0.46, p < 0.001 for Vertex 2. Importantly, within the compromised region, the reconstructed BOLD signals exhibit various patterns of activity that are not necessarily correlated with each other. For example, the correlation between the time series of the two vertices above is r = −0.08, p = 0.22 (Fig. 2b).

**Fig. 2: The reconstructed BOLD signals are highly similar to the original signals.**

Table 1 Time series and functional connectivity map reconstruction accuracy following various models.

Full size table

We also assessed BOLD reconstructive accuracy by generating FC maps for all reconstructed vertices and comparing them to the FC maps of the corresponding vertices in the original intact BOLD frames. As an example, in Fig. 2c, we show the similarity between reconstructed and original FC maps for the same two temporal vertices as in Fig. 2b, in a representative participant. The FC map similarity is very high, with r = 0.96, p < 0.001 for Vertex 1 and r = 0.81, p < 0.001 for Vertex 2. In Fig. 3, we show original (top row) and DCGAN-reconstructed (middle row) FC maps sourced from one randomly selected vertex within each of the artificially compromised regions in the same participant. All reconstructed FC maps yielded high similarity to the original ones: r = 0.96, p < 0.001 for the temporal vertex (also shown in Fig. 2c), r = 0.84, p < 0.001 for the lateral frontal vertex, r = 0.89, p < 0.001 for the medial frontal vertex, r = 0.84, p < 0.001 for the lateral parietal vertex, and r = 0.88, p < 0.001 for the occipital vertex. We then evaluated the average reconstructive accuracy across all participants. When taking into account all the vertices within the compromised regions, multilevel linear models, corrected for multiple comparisons using the Bonferroni correction, revealed that the cortical FC maps of the reconstructed vertices are highly similar to those of the original vertices in all regions. The average correlation between corresponding FC maps is r = 0.62 ± 0.06 (CI [0.60, 0.65], t(19) = 47.88, p < 0.001) for the temporal cortex, r = 0.60 ± 0.04 (CI [0.58, 0.62], t(19) = 65.04, p < 0.001) for the lateral frontal cortex, r = 0.61 ± 0.05 (CI [0.58, 0.63], t(19) = 51.87, p < 0.001) for the medial frontal cortex, r = 0.70 ± 0.03 (CI [0.68, 0.71], t(19) = 110.62, p < 0.001) for the parietal cortex, and r = 0.79 ± 0.05 (CI [0.76, 0.81], t(19) = 68.39, p < 0.001) for the occipital cortex (Table 1).

**Fig. 3: The DCGAN-generated functional connectivity maps are highly similar to the original maps, throughout cortex.**

We also evaluated reconstructive accuracy according to the size of the compromised regions (Supplementary Fig. 1a) by correlating the reconstructed and original FC maps. When there are no compromised regions (0%; Supplementary Fig. 1b), the reconstructive accuracy is very high, r = 0.85 ± 0.00, p < 0.001 (CI [0.85, 0.85]). Once the compromised region covers 10% of the cortical surface, the mean reconstructive accuracy drops to r = 0.51 ± 0.10, p < 0.001 (CI [0.46, 0.56]). From there, there is a steady decrease in reconstructive accuracy as the mask of compromised regions increases in size (F(2.62,23.55) = 93.68, p < 0.001, η_p² = 0.91) (Supplementary Fig. 1b). When the mask size reaches 40% of the cortical surface, the reconstructive accuracy is r = 0.35 ± 0.12, p < 0.001 (CI [0.28, 0.41]). It should be noted that a complete signal loss in 40% of the cortical surface may represent an extreme case; nevertheless, the reconstruction still recovers some important characteristics of a given individual’s functional connectivity.

We replicated our findings by performing the same time series and FC-based analyses in a data set sporting higher spatial and temporal resolutions: the HCP data set¹³. Again, the data of 80 randomly chosen participants were used to train our DCGAN model, and the data of 20 independent participants were used to test reconstruction. Multilevel linear models revealed significant positive correlations between the original and reconstructed time series in all regions: temporal cortex: r = 0.31 ± 0.05 (CI [0.29, 0.33], t(19) = 27.60, p < 0.001), lateral frontal cortex: r = 0.12 ± 0.04 (CI [0.11, 0.14], t(19) = 14.33, p < 0.001), medial frontal cortex: r = 0.18 ± 0.05 (CI [0.16, 0.20], t(19) = 16.73, p < 0.001), parietal cortex: r = 0.24 ± 0.05 (CI [0.22, 0.27], t(19) = 19.71, p < 0.001), and occipital cortex: r = 0.35 ± 0.07 (CI [0.32, 0.39], t(19) = 23.39, p < 0.001) (Table 1). We reconstructed FC maps, and multilevel linear models again revealed significant positive correlations in all regions: temporal cortex: r = 0.60 ± 0.07 (CI [0.57, 0.63], t(19) = 38.04, p < 0.001), lateral frontal cortex: r = 0.44 ± 0.05 (CI [0.42, 0.47], t(19) = 39.47, p < 0.001), medial frontal cortex: r = 0.46 ± 0.06 (CI [0.43, 0.49], t(19) = 32.53, p < 0.001), parietal cortex: r = 0.62 ± 0.03 (CI [0.60, 0.63], t(19) = 81.92, p < 0.001), and occipital cortex: r = 0.65 ± 0.06 (CI [0.62, 0.68], t(19) = 46.90, p < 0.001) (Table 1). Comparing the HCP and GSP reconstructions, the GSP-trained model yielded marginally more accurate time series reconstructions when correcting for multiple comparisons (mean difference = 0.02, CI [0.004, 0.04], t(30.21) = 2.41, p = 0.022, η² = 0.16; significant p threshold = 0.017) and significantly more accurate FC map reconstructions (mean difference = 0.11, CI [0.10, 0.13], t(38) = 12.88, p < 0.001, η² = 0.81) (Fig. 4a), despite the HCP data set having higher spatial and temporal resolutions. We postulated that the HCP data may have a lower temporal signal-to-noise ratio (tSNR) than the GSP data set, and indeed this was the case (t(139.43) = 49.31, bootstrapped p = 0.001, η² = 0.95) (Fig. 4b). To counteract this, we temporally smoothed the HCP data by averaging together every 4 frames before retraining and retesting the model. This significantly increased tSNR (t(99) = −40.68, p < 0.001, η² = 0.94) (Fig. 4b) and yielded more accurate reconstructions than the raw HCP-trained model (time series: mean difference = −0.01, CI [−0.02, −0.01], t(19) = −4.67, p < 0.001, η² = 0.53; FC maps: mean difference = −0.09, CI [−0.10, −0.09], t(19) = −143.21, p < 0.001; η² = 1.00) (Fig. 4a; see Table 1 for reconstruction details on each cortical region). The tSNR of the temporally smoothed HCP data remained lower than the GSP’s (t(198) = 23.26, bootstrapped p = 0.001, η² = 0.73) (Fig. 4b), however, the time series reconstructions are similar in accuracy to the GSP’s (mean difference = 0.01, CI [−0.008, 0.03], t(32.12) = 1.16, p = 0.26, η² = 0.04), while the reconstructed FC maps are marginally less accurate after Bonferroni correction (mean difference = 0.02, CI [0.003, 0.04], t(38) = 2.43, p = 0.020, η² = 0.13; significant p threshold = 0.017) (Fig. 4a). These findings indicate that tSNR has an important effect on machine learning reconstructive accuracy.

**Fig. 4: DCGAN successfully reconstructs images across data sets, and the temporal signal-to-noise ratio (tSNR) of the training data modulates reconstructive accuracy.**

To assess the power of DCGAN, we compared its performance to a simpler diffusion-based method for filling in compromised cortical regions (see “Methods” section). The diffusion model was able to reconstruct both time series and FC maps (Table 1). As predicted, its reconstructions are less accurate than the DCGAN model’s (time series: mean difference = 0.07, CI [0.06, 0.09], t(19) = 9.15, p < 0.001, η² = 0.82; FC maps: mean difference = 0.18, CI [0.16, 0.19], t(19) = 27.62, p < 0.001, η² = 0.98) (Fig. 5). As an example, in Fig. 3 we show FC maps generated from (i) original (top row), (ii) DCGAN-reconstructed (middle row), and (iii) diffusion-reconstructed (bottom row) vertices in all five cortical areas. We replicated these results in the raw HCP data set (reconstructed time series: mean difference = 0.07, CI [0.065, 0.067], t(19) = 176.20, p < 0.001, η² = 1.00; reconstructed FC maps: mean difference = 0.06, CI [0.059, 0.064], t(19) = 53.50, p < 0.001, η² = 0.99) (Fig. 5 and Table 1). This finding suggests that our DCGAN model extrapolates information embedded within nearby as well as distant cortical regions to reproduce patterns of brain activity, while more naive methods can only rely on nearby information.

**Fig. 5: The DCGAN model outperforms a diffusion-based model.**

The resting-state fMRI data used for reconstruction were preprocessed with global signal regression (GSR), which introduces spurious temporal anticorrelations in FC analysis¹⁴. While progress has been made, there is still no consensus about whether GSR should or should not be included in resting-state data preprocessing¹⁵. To ensure the robustness of our results, we retrained our DCGAN model using the same data, albeit preprocessed without GSR. This data yielded lower reconstructive accuracy for time series (mean difference = −0.001, CI [−0.003, −0.0004], t(19) = −2.81, p = 0.01, η² = 0.29) and higher reconstructive accuracy for FC maps (mean difference = 0.003, CI [0.002, 0.005], t(19) = 4.87, p < 0.001, η² = 0.55) compared to GSR-preprocessed data (Supplementary Fig. 2; see Table 1 for reconstruction details on each cortical region). Importantly, however, the effect was negligible in both cases, as evidenced by the near-zero mean differences in r coefficients: −0.001 (CI [−0.003, −0.0004]) for time series and 0.003 (CI [0.002, 0.005]) for FC maps (Supplementary Fig. 2). This finding indicates that GSR does not affect machine learning reconstructive accuracy in any meaningful way and that the information learned by the DCGAN model is stable. The remaining analyses were conducted on GSR-preprocessed GSP data.

Reconstructed BOLD signals are individual-specific

We investigated whether the reconstructed BOLD signals reflect general trends in BOLD activity or whether they capture patterns of activity that are specific to the individual participant. To test this, for each vertex within a compromised region, we calculated the correlation between each individual’s reconstructed FC map and (i) their original intact FC map; (ii) the group-averaged FC map from the training data set; and (iii) the FC map of each participant’s most similar individual (MSI; see “Methods” section), i.e., the individual in the training data set that most resembles their functional connectivity patterns. In Fig. 6a, we show examples of FC maps and correlations for the same two vertices as in the BOLD time series analyses above. The reconstructed FC maps show the highest correlations with the original FC maps. In Fig. 6b, we make the same comparisons but this time across all vertices within the five cortical masks combined using a repeated-measures ANOVA. Reconstructed and original FC maps share many features and exhibit an average correlation of r = 0.66 ± 0.03 (CI [0.65, 0.68]), which is significantly different from 0 (t(19) = 116.77, p < 0.001). The average correlation between the reconstructed FC maps and the training group-averaged FC maps is r = 0.44 ± 0.02 (CI [0.44, 0.46]) and is also significantly different from 0 (t(19) = 85.33, p < 0.001).

**Fig. 6: DCGAN-reconstructed functional connectivity maps capture individual-specific information.**

Finally, we compared the reconstructed FC maps of each individual in the test data set to the FC maps of their MSI. The average correlation between reconstructed and MSI FC maps is r = 0.55 ± 0.02 (CI [0.54, 0.56]), which is significantly different from 0 (t(19) = 116.86, p < 0.001). We statistically compared these correlations and found them to be significantly different from one another (F(2,38) = 927.69, p < 0.001, η_p² = 0.98). Post-hoc tests (bootstrapped paired t-tests) revealed that the correlation between reconstructed and original FC maps is greater than all other correlations (reconstructed vs. training group-averaged maps: mean difference = 0.22, CI [0.21, 0.23], t(19) = 34.53, bootstrapped p = 0.001, η² = 0.98; reconstructed vs. MSI maps: mean difference = 0.11, CI [0.10, 0.12], t(19) = 20.70, bootstrapped p = 0.001, η² = 0.96). The fact that the correlation between reconstructed and original FC maps is higher than the correlation between reconstructed and training group FC maps indicates that during the training process, the generator did not simply learn general trends in BOLD activity but was able to infer the co-activating patterns from the individual-specific BOLD frames used as input in the test phase. Critically, the reconstructed BOLD FC maps are more representative of each individual’s own specific patterns of functional connectivity than of any other individual in the training data set, indicating that the DCGAN model is able to capture individual-specific information about functional brain organization.

Signals are successfully reconstructed in a clinical sample

We tested the DCGAN model in 12 patients with Parkinson’s disease (PD) whose BOLD signals were interfered with by metal implants. Intracortical electrodes were implanted in these patients for DBS treatment^16,17. Wires outside the skull connecting the simulator to the implanted electrodes strongly interfere with the acquisition of the BOLD signal during post-surgical fMRI studies, resulting in a signal loss in temporal, parietal, and occipital regions (see Fig. 7 for examples in two representative patients) and in abnormal measurements of functional connectivity. Extensive resting-state fMRI data were acquired both before and after electrode implantation surgery in these 12 patients. We first identified the compromised region where vertices exhibited a sharp contrast in signal amplitude before and after the electrode implantation surgery. The compromised region covered on average 8.36% of the cortical surface.

**Fig. 7: Implanted electrode interference severely reduces BOLD signal amplitude.**

Once the DCGAN model reconstructed the BOLD signals in the compromised regions, we investigated whether there was any residual loss within these regions. We compared the normalized amplitudes of the pre-operative BOLD signal with that of (i) the post-operative compromised BOLD signal (Supplementary Fig. 3a, left), and (ii) the post-operative reconstructed BOLD signal (Supplementary Fig. 3a, right). While the post-operative signal demonstrated substantial attenuation within the DBS-compromised cortical regions, the reconstruction displayed no residual signal loss. Using the pre-operative average signal amplitudes as a reference, we calculated the normalized BOLD amplitudes (%) of the masked-out vertices in the post-operative images and reconstructed images. A multilevel linear model shows that the BOLD amplitudes are significantly different under pre-operative, post-operative, and reconstructed conditions (F(2,1555.89) = 9695.62, p < 0.001). Examination of the parameter estimates revealed that the post-operative BOLD amplitudes are substantially and significantly attenuated in comparison to the pre-operative BOLD amplitudes (mean difference = −32.54, CI [−33.08, −32.01], t(1554.80) = −119.98, p < 0.001, η² = 0.90). The reconstructed BOLD signals are not significantly different from the pre-operative BOLD signals (mean difference = −0.33, CI [−0.20, 0.86], t(1554.44) = 1.21, p = 0.23, η² = 0.0009). The average pre-operative, post-operative, and reconstructed normalized BOLD amplitudes are shown in Supplementary Fig. 3b.

Next, we sought to evaluate the reconstructive accuracy using functional connectivity analyses, similar to those conducted in the healthy cohorts above. We did not consider the BOLD time series here as the pre- and post-operative fMRI scans were obtained at different time points. However, functional brain organization, as assessed with functional connectivity, is assumed to be relatively stable over time¹⁸. Although the implantation surgery may cause microlesion effects and lead to changes in brain circuits involving the stimulation target (i.e., the subthalamic nucleus), the surgery is unlikely to change functional connectivity in the area of signal loss, which is relatively far from the location of the stimulator and the motor network being modulated. As a proof of concept, we investigated the functional connectivity of the right temporoparietal region in the same two representative patients (note that signal loss was observed only in the left hemisphere). We found that the post-operative FC maps in the right hemisphere are highly and significantly correlated with the pre-operative FC maps, r = 0.75, standard error = 0.02 (CI [0.71, 0.79], t(10.98) = 42.96, p < 0.001, η² = 0.99). However, for the compromised region in the left hemisphere, we found that the post-operative FC maps are only weakly positively correlated with the pre-operative FC maps, r = 0.37, standard error = 0.03 (CI [0.30, 0.44], t(11.02) = 11.24, p < 0.001, η² = 0.92). As an example, we show cortical FC maps using a seed placed in two representative patients’ compromised regions (Supplementary Fig. 4). Unlike the weak and disorganized FC maps obtained from patients’ compromised left temporoparietal region, the FC maps generated from seeds in the right temporoparietal region show high similarity to their pre-operative FC maps (right hemisphere seeds across the two patients in Supplementary Fig. 4: r = 0.83 ± 0.03, p < 0.001; left hemisphere seeds across both patients: r = 0.38 ± 0.13, p < 0.001) (Supplementary Fig. 4).

Having shown that FC maps are relatively stable following electrode implantation, we next assessed the reconstructive accuracy of our DCGAN model for the patients’ compromised region in the left hemisphere. A multilevel linear model that took into account all the vertices inside the compromised regions showed that the reconstructed post-operative FC maps are moderately positively correlated with the pre-operative FC maps, r = 0.56, standard error = 0.02 (CI [0.52, 0.60]), and the correlation is significantly different from 0 (t(10.83) = 30.50, p < 0.001, η² = 0.99). As mentioned above, the post-operative FC maps were also positively correlated with the pre-operative FC maps, but this correlation was weak (r = 0.37, standard error = 0.03; CI [0.30, 0.44], t(11.02) = 11.24, p < 0.001, η² = 0.92). Another multilevel linear model showed that the correlations between reconstructed and pre-operative FC maps are significantly higher than the correlations between post-operative and pre-operative FC maps (mean difference = 0.18, CI [0.15, 0.20], t(667.35) = 13.63, p < 0.001, η² = 0.22). As an example, we show cortical FC maps using a seed placed in two representative patients’ compromised regions (Fig. 8, same patients and seeds as in Fig. 7 and Supplementary Figs. 3 and 4). As expected, the FC maps obtained from the patients’ compromised post-operative BOLD images do not capture the connectivity patterns observed in the pre-surgical data. However, the FC maps generated from the reconstructed BOLD images show high similarity to the pre-operative FC maps (reconstructed vs. pre-operative across the two patients in Fig. 8: r = 0.61 ± 0.01; post-operative vs. pre-operative across both patients: r = 0.38 ± 0.13). These results indicate that the BOLD signals reconstructed in the compromised regions are representative of the patients’ intact functional connectivity patterns.

**Fig. 8: The DCGAN model can reconstruct BOLD signals compromised by the implantation of a deep-brain stimulator in patients with Parkinson’s disease.**

Discussion

The current study aimed to reconstruct fMRI BOLD signal inside cortical regions that suffered a signal loss due to various artifacts. We used deep convoluted generative adversarial networks (DCGAN), a recent advance in machine learning algorithms, to leverage functional information embedded within BOLD frames and reconstruct the signal in compromised regions, frame by frame. We reconstructed BOLD signals in three cohorts: healthy young adults (GSP and HCP data sets) with artificially compromised cortical regions and patients whose fMRI scans suffered from interference due to metal implants. Our results indicate that such a machine learning technique successfully reconstructs individual-specific BOLD signals and can approximate the functional connectivity patterns observed in the intact or unimpaired state.

The missing BOLD signal was reconstructed frame by frame, following which we modeled the time series for all individual vertices whose signal was compromised. We found the reconstructed time series to be similar to the original time series. This indicates that our model was able to recover dynamic brain activity over time, at the level of single vertices within individual participants. The same was found for the reconstructed and original functional connectivity maps. These findings are intriguing, as the images were reconstructed at each time point independently and single input frames did not carry time-varying information. The generator was thus able to learn information beyond what was presented at face value during the test phase, and modeled accurate functional interactions between brain regions and the changing dynamics of the BOLD signal through time. The DCGAN model outperformed a more naive diffusion-based reconstruction method, indicating that machine learning is able to extrapolate information embedded within the whole BOLD image, while simpler filling-in methods are restricted to using nearby information, thereby limiting their ability to capture principles of brain organization. Additional control analyses revealed that global signal regression (GSR), a preprocessing step that amplifies anticorrelations in the brain through its mathematical mandate^14,15, does not meaningfully impact the learning of these principles. Of note, the temporal signal-to-noise ratio (SNR) of the training data is an important factor that modulates reconstructive accuracy.

The reconstruction of the compromised signals is based on learning the functional relationships between different brain regions in intact BOLD frames from a large data set. The trained model deduces possible signal distributions in the compromised region using the remaining intact BOLD amplitude patterns in the individual. Thus, the generator learns functional activity patterns that are specific to each individual and builds a high-dimensional space that is sensitive to variations across individuals. Indeed, we show that the reconstructed BOLD signals capture individual differences in patterns of activity. A given individual’s reconstructed FC maps were more similar to their original FC maps than to the training group-averaged maps or to the ones belonging to the most similar individual from the training data set. Therefore, the deep machine learning model may be useful in recovering lost signal in clinical settings, where the focus is on the individual.

On that note, our method has several potential clinical applications. Revealing individual-based functional activity is critical not only for understanding the functional network organization of the human brain¹⁹ but also for personalized medicine, such as when precise cortical mapping is required for neurosurgery or neuromodulation^20,21. Our individual-specific machine learning method can, as we have shown, reconstruct the BOLD signal that was lost due to intracortical electrode interference. We showed that the model generates FC maps with high reconstructive accuracy, as they exhibit high similarity to maps derived from pre-surgical images. Thus, machine learning-based reconstruction can impact the investigation of various disorders where intracortical electrodes are used for diagnosis or therapy, such as in epilepsy, Parkinson’s disease, depression, obsessive-compulsive disorder, among others. This method could also help mitigate registration problems in fMRI that occur with data from patients with brain lesions due to stroke or tumor resections, by filling in the compromised regions prior to registration. It could also fill in regions that are routinely cut off during acquisition, such as the top of the brain or the lower portion of the cerebellum. Another application would be to reconstruct whole frames in patient fMRI data in which many frames were scrubbed due to excessive head motion, a common pitfall in clinical studies. Finally, in both clinical and non-clinical settings, machine learning could be used to reconstruct the BOLD signal in regions that are susceptible to signal loss and geometric distortion, such as the orbitofrontal cortex and temporal cortex. However, this could only be done once the model can be trained on uncompromised images, thus appropriate data acquisition methods that can counteract these susceptibility artifacts will first have to be developed.

We observed that the reconstructive accuracy differed from region to region. Reconstructive accuracy may be affected by a number of factors. One factor is the size of the compromised region, as smaller regions yield better reconstructions. A second factor is the shape of the region, as closer proximity to uncompromised vertices is likely to result in better reconstruction. A third factor is whether the affected region has important large-scale network connectivity, which would increase accuracy as fMRI activity outside of the compromised region would bear information relevant to the activity within the compromised region. Additionally, the distribution of learned functional activation patterns is constrained by the training data set. More efforts are necessary to evaluate the reconstructive accuracy of the model when the compromised BOLDs are not acquired using the same MRI and scanning parameters as the training data. Another limitation of our study is that the DCGAN model dealt with 2D images. Currently, it is not able to reconstruct whole-brain 3D images as the computational power required is too high. We hope that technological advances will soon enable this type of modeling. In the meantime, it may be possible to reconstruct signal in small 3D volumes.

Besides 3D modeling, we propose one area of future study to improve the current machine learning model, which would be to consider the causal interactions across time frames. Currently, each of the BOLD frames is used separately to supervise the learning process of the adversarial networks. In this way, the connections among cortical areas, which are not included in single BOLD frames, are difficult to detect. Feeding DCGAN a combination of BOLD frames may improve their modeling power.

In summary, we harnessed the learning power of deep convolutional neural networks to generate BOLD signals in regions that experienced signal loss. We have replicated our findings in multiple data sets and have shown that it is possible to reconstruct lost BOLD signal in healthy individuals as well as in a clinical sample with compromised fMRI. In all cases, the reconstructed signals closely resemble the uncompromised signals. Notably, the reconstructed signals are coherent with each individual’s functional brain organization. Such a method could benefit personalized clinical and non-clinical studies where brain regions suffer signal dropout, distortion, or deformation.

Methods

Participants

We used the resting-state fMRI data of 100 healthy young adult participants randomly chosen from the Brain Genomics Superstruct Project (GSP)¹⁰ data set (50 women, 50 men; mean age: 22.0 ± 3.2 years), as well as the “100 Unrelated Subjects” data set²² that was taken from the large publicly available Human Connectome Project¹³ database (54 women, 46 men; age range 22–36). We also analyzed the MRI data of 12 patients with Parkinson’s disease (PD; 5 women, 7 men; mean age = 55.3 ± 7.5) from a previous clinical trial (https://clinicaltrials.gov/ct2/show/NCT02937727), who had intracortical electrodes implanted for deep-brain stimulation (DBS) treatment. The patients underwent stereotactic implantation of quadripolar DBS electrodes (PINS Medical, Model L301C) in the subthalamic nucleus. Microelectrode recording and stimulation guided the electrode implantation, and the electrodes were connected to extension leads (PINS Medical, Model E202C), which were themselves connected to the implanted pulse generator (PINS Medical, Model G106R). All participants provided written informed consent in accordance with guidelines set by the Institutional Review Boards of Harvard University, Partners Healthcare, or Beijing Tiantan Hospital of Capital Medical University.

MRI data acquisition

GSP data set. Each healthy young participant from the GPS data set underwent one structural scan and two resting-state fMRI scans (6 min and 12 s per scan). Data were collected on matched 3 T Tim Trio scanners (Siemens, Erlangen, Germany) using a 12-channel phased-array head coil. Structural data included a high-resolution multi-echo T1-weighted magnetization-prepared gradient-echo image (TR = 2200 ms, TE = 1.54 ms for image 1 to 7.01 ms for image 4, TI = 1100 ms, flip angle = 7°, voxel size: 1.2 × 1.2 × 1.2 mm, FOV = 230, slices = 720). Resting-state fMRI images were acquired using the gradient-echo echo-planar imaging (EPI) pulse sequence (TR = 3000 ms, TE = 30 ms, flip angle = 85°, voxel size: 3 × 3 × 3 mm voxels, FOV = 216, slices = 47 slices collected with the interleaved acquisition with no gap between slices). Whole-brain coverage including the entire cerebellum was achieved with slices aligned to the anterior commissure-posterior commissure plane using an automated alignment procedure, ensuring consistency across participants²³. Participants were instructed to stay awake, keep their eyes open, and minimize head movement; no other task instruction was provided.

HPC data set. HCP participants underwent structural MRI scans (20 min) and two resting-state fMRI scans (30 min each) on a 3 T Siemens Skyra MRI scanner equipped with a 32-channel head coil. Structural images were acquired using a 3D MPRAGE T1-weighted sequence (TR = 2400 ms, TE = 2.14 ms, TI = 1000 ms, flip angle = 8°; voxel size: 0.7 × 0.7 × 0.7 mm, FOV = 224 mm, matrix = 320, 256 sagittal slices in a single slab). Functional images were acquired using a multiplexed EPI pulse sequence (TR = 720 ms, TE = 33.1 ms, flip angle = 52°, voxel size: 2 × 2 × 2 mm, FOV = 208 × 180 mm, 72 slices, multiband factor = 8, echo spacing = 0.58 ms, bandwidth = 2290 Hz/px). To match the duration of the GSP data, we truncated the HCP resting-state data to 8 min.

Clinical data set. Each of the 12 patients with Parkinson’s disease underwent four resting-state fMRI scans (6 min and 8 s per scan) at two time points: at baseline before the DBS electrode implantation surgery and one month after. The patients were instructed to stay awake and keep their eyes open. The deep-brain stimulator was turned off during post-surgical fMRI scanning. The specific absorption rate-estimated values were continuously monitored throughout the scanning sessions. MRI data were collected on a Philips Achieva 3.0 Tesla TX whole-body MR scanner using a 32-channel receive-only head coil. Structural images were acquired using a sagittal magnetization-prepared rapid gradient-echo T1-weighted sequence (TR = 7.6 ms, TE = 3.7 ms, TI = 1000 ms, flip angle = 8°, voxel size = 1 × 1 × 1 mm, FOV = 256, slices = 180). Functional data were collected using an echo-planar imaging sequence (TR = 2000 ms, TE = 30 ms, flip angle = 90°, voxel size = 2.875 × 2.875 × 4 mm, FOV = 230, slices = 37).

Data processing

Structural data were processed using FreeSurfer version 4.5.0. Surface mesh representations of the cortex from each individual participant’s structural images were reconstructed and registered to a common spherical coordinate system²⁴. The structural and functional images were aligned using boundary-based registration using the FsFast software package (http://surfer.nmr.mgh.harvard.edu/fswiki/FsFast)²⁵. The preprocessed resting-state fMRI data were then aligned to the common spherical coordinate system via sampling from the middle of the cortical ribbon in a single interpolation step²⁶. We registered each individual’s fMRI data to the FreeSurfer template which consisted of 40,962 vertices in each hemisphere. A 6-mm full-width half-maximum (FWHM) smoothing kernel was applied to the fMRI data in the surface space. The smoothed data were downsampled to a mesh of 2562 vertices in each hemisphere using the mri_surf2surf function in FreeSurfer.

Resting-state fMRI data were processed using the following procedures: (i) slice timing correction (SPM2; Wellcome Department of Cognitive Neurology, London, UK)²⁷; (ii) rigid body correction for head motion with the FSL package^28,29; (iii) normalization of global mean signal intensity across runs; and (iv) bandpass temporal filtering (0.01–0.08 Hz), head-motion regression, whole-brain global signal regression (GSR), and ventricular and white-matter signal regression in a single step. To test the effects of GSR, we also preprocessed the data without GSR. After preprocessing, each participant’s resting-state fMRI data were normalized to [−1,1] by dividing the BOLD amplitude of each vertex by the maximum absolute BOLD value observed in each session using Matlab R2014b. The normalized 2562-vertex mesh of the BOLD frames was then flattened to 2-dimensional maps using the tksurfer and mris_flatten functions in FreeSurfer²⁴.

Machine learning

We used DCGAN to reconstruct lost or compromised BOLD information. The neural network modeling was conducted in three steps using Python 3.6.1. In the first step, the training phase, two competing models are trained: a generator and a discriminator (Fig. 1a). The generator is trained to encode BOLD information by feeding it intact BOLD frames from the training data set. Using information within these frames, the generator creates new frames, which the discriminator then classifies as being either authentic (real BOLD frame) or artificial (generator-created BOLD frame). Both the generator and discriminator simultaneously continue training with new frames, and through many iterations, each becomes optimized. Training ends when an optimized discriminator classifies the generated frames into one category or the other at chance level. In the second step, we created artificially compromised BOLD frames by removing the BOLD signal within certain predefined regions, using data from the test data set. In the third step, the signal in the compromised region is reconstructed by feeding the compromised BOLD images to the DCGAN generator (Fig. 1b). The generator then produces new complete frames based on these. Using the mask of the compromised region, the region with a reconstructed signal from the newly generated frame replaces the one in the compromised frame to form a complete BOLD frame which includes the original information (BOLD signal outside of the compromised region) and the newly generated information (BOLD signal inside the reconstructed region). Following this, we evaluated the similarity between the reconstructed BOLD information and the original intact BOLD information. Each of these steps is described in more detail below.

Step 1: Training a generative network to encode BOLD information. Eighty participants were randomly selected from the Brain GSP data set¹⁰ to build the training data set, which consisted of 19,200 intact flattened BOLD frames in vertex space. The data from 20 other participants, again selected randomly from the GSP, constituted the test data set, which was independent from the training data set. We used a DCGAN model² to create BOLD frames in vertex space based on each individual participant’s data. The generator’s goal is to create images similar enough to the original images that the discriminator is forced to randomly classify them as authentic or simulated, while the goal of the discriminator is to correctly classify images as either authentic or simulated.

In mathematical terms, the generator (G) samples data x from the true data population p_data and produces parameters. It maps these parameters onto a random vector z, which is sampled from latent space Z, and creates artificial images G(z), which are part of the generated distribution p_g. When the discriminator (D) detects a difference between the distributions p_data and p_g, the generator G tweaks its parameters and generates images that are more similar to the authentic images. This process is repeated until the generator produces a generated distribution p_g that so closely matches the true data distribution p_data that the discriminator D is unable to detect a difference and classifies the authentic or generated images G(z) randomly.

Convolutional neural networks (CNN) were used to build the generator and discriminator⁴. During the adversarial training process, the generator and discriminator were trained simultaneously. They were optimized using a Nash equilibrium of costs two-player minimax game with value function V(G, D):

$$\min _G\max _DV\left( {D,G} \right) = \, {\Bbb E}_{x\sim p_{\mathrm{data}}\left( x \right)}\left[ {\log D\left( x \right)} \right] + \\ \,{\Bbb E}_{z\sim p_z\left( z \right)}\left[ {\log \left( {1 - D\left( {G\left( z \right)} \right)} \right)} \right].$$

(1)

The input z was a sample taken from 100 dimensional uniformly distributed noise; in each dimension, the value varied from −1 to 1. The generator projected the input z to a small convolutional representation and then converted the representation into a 500 × 500 pixel image through four-layer fractionally strided convolutions⁴. The discriminator estimated the input images through four-layer-strided convolutions and fed the layers into single sigmoid outputs. Rectified Linear Unit activation was used in the generator’s layers, except for the output layer, which used the Tanh function³⁰. Leaky rectified activation was used in all of the discriminator’s layers³¹. A 64-size batch normalization was used in the training procedure for stabilization³², and the sigmoid cross entropy was calculated to measure the probability difference between two images. We used an Adam optimizer during the optimizing procedure of the generator and discriminator³³, with a learning rate of 0.0002. The generator’s parameters were adjusted twice during each iteration to balance the learning speed between the generator and the discriminator.

Step 2: Creating the compromised BOLD frames. To allow the DCGAN model to generate BOLD signals in compromised regions, we created frames in which we removed the BOLD signal in predefined regions. We aimed to test the generator on five regions spread across the cortical surface: the lateral frontal cortex, the medial frontal, cortex, the lateral parietal cortex, the lateral temporal cortex, and the occipital cortex. These were delineated using FreeSurfer’s Desikan–Killiany atlas³⁴. The Desikan–Killiany atlas labels of the artificially compromised regions are: 4, 13, 19, 20, 21, 28 for the lateral frontal cortex (329 eliminated vertices, 12.8% of the cortical surface); 3,15, 27, 29 for the medial frontal cortex (223 eliminated vertices, 8.7% of the cortical surface); 9, 30, 32 for the lateral parietal cortex (429 eliminated vertices, 16.7% of the cortical surface); 10 and 16 for the lateral temporal cortex (140 eliminated vertices, 5.5% of the cortical surface); and 6, 12, 14, 22 for the occipital cortex (188 eliminated vertices, 7.3% of the cortical surface). Masks M were created separately to mask out each of the eliminated regions and leave intact the other parts of the flattened BOLD activation map. After preprocessing, BOLD signals within the masks were artificially set to 0 to create the compromised BOLD frames.

We also sought to evaluate the reconstructive accuracy of our model according to the size of the compromised regions. To do this, we created ten sets of incrementally larger masks (Supplementary Fig. 1a). We selected at random 10 vertices among all 2562 cortical surface vertices, located in various cortical regions. Each of these vertices served as the center of its mask set. Each mask set was comprised of six masks, whose coverage went from 10 to 60% of the cortical surface, with incremental steps of 10% (i.e., 10, 20, 30, 40, 50, 60%). The BOLD signal inside these masks was eliminated to create 10 sets of BOLD images with increasingly larger compromised regions.

Finally, for the patients with Parkinson’s disease, we created masks that encompassed the regions in which interference was observed due to the implanted DBS electrodes. We first calculated the absolute values of the surface-based BOLD signal for each vertex, which were then averaged and normalized to [0,1] to achieve a normalized BOLD signal strength for each vertex before and after implantation surgery. Then, the pre- and post-operative BOLD amplitude maps were contrasted and the vertices with strongly reduced amplitude post-operatively were extracted to create a mask. The compromised regions comprised 193 vertices (7.5% of the cortical surface) in one patient and 200 vertices in the other (7.8% of the cortical surface).

Step 3: Reconstructing compromised BOLD signals. The trained generator was used to reconstruct the BOLD frames with artificially compromised regions, by generating an image G(z) with maximal similarity to the original BOLD activation x in the cortical regions outside of the mask. The generator’s loss function serves to calculate the divergence between the real and simulated BOLD frames, and was defined as:

$$L = {\sum} {\left( {x. \ast M - G\left( z \right). \ast M} \right)^2}.$$

(2)

In order to minimize the loss function, an optimized z must be found in the latent space. After random sampling, z was mapped to G(z) by the trained generator. The location of z was rearranged iteratively during the optimization process using the gradient-descent method to find the most similar generated image G(z) with minimal L to x. The number of iterations was set to 500, with a 0.000002 learning rate.

In the reconstruction step, we used 4800 BOLD frames from the test data set of 20 participants. The reconstructed BOLD frames were built from the masked-out flattened images. To do this, we first had to determine the spatial location of each vertex on a flattened BOLD amplitude map. Each vertex was generated and subsequently projected onto a flattened cortical surface map. The coordinates with maximum value in the flattened map were regarded as the location of the generated vertex. Using this method, we determined the correspondence between each of the vertices and their spatial location on the flattened map. This allowed us to then take the flattened map and to project it back into vertices on a BOLD frame. For each participant, the simulated BOLD frames were assembled temporally to reconstruct the BOLD time series.

Diffusion model

We compared our DCGAN model to a more naive reconstruction model based on diffusion, whereby each compromised vertex was assigned a BOLD value based on the average of the BOLD signal in adjacent vertices. The process was started at the perimeter of the compromised region so that those vertices could be filled in using the adjacent uncompromised vertices. Then, reconstruction moved inwards, with new vertices being assigned a BOLD value based on the adjacent vertices that were reconstructed in the previous step. This process was reiterated until all compromised vertices were assigned a value.

Temporal signal-to-noise ratio

Temporal signal-to-noise ratio (tSNR) was measured by dividing 1 by the standard deviation of the BOLD signal as described in previous studies^35,36 and then averaged across all voxels within the brain, across frames, and across participants.

Statistical analysis

The reconstructive accuracy of the DCGAN model was evaluated by calculating correlations between original and reconstructed BOLD signal time series and functional connectivity. The similarity between the original and reconstructed BOLD time series was only investigated in the first part of the study with healthy adults. The reconstructed signal was compared to the original signal. For the patient study, time series obtained at different time points cannot be directly compared, for this reason, we did not compare the post-operative reconstructed time series to the pre-operative time series in the patients with PD. As the brain’s functional connectivity (FC) patterns are relatively stable through time, we compared the FC maps generated from various seeds inside the compromised regions, in both the healthy adult data sets and the PD data set. To show that functional connectivity is relatively stable and unaffected by electrode implantation surgery, we also generated FC maps using seeds in uncompromised regions.

For the time series comparisons at any given vertex, the similarity between reconstructed and original signals was quantified by calculating the Pearson correlation between the two time series. The correlation values within a given masked region were then averaged across participants to represent the similarity between the reconstructed and original BOLD signals in that region.

The cortical FC of the original and reconstructed BOLD signals was also compared. For the original BOLD images, an FC map was created by calculating the z value of the correlation between the BOLD signal at a given vertex and the BOLD signals at all the vertices in the two cerebral hemispheres. For the reconstructed BOLD images, the compromised regions were filled in with the reconstructed BOLD signal to calculate the FC of the whole brain. The vectors storing the FC of the original and reconstructed BOLD signals for a given vertex were then correlated to determine their similarity. For each of the five cortical masks, the statistical significance for the time series and FC map correlations was assessed using multilevel linear models with vertices nested within participants. Differences between various models or data sets were statistically assessed using independent (e.g., GSP vs. HCP) and paired (e.g., raw HCP vs. temporally smoothed HCP, DCGAN vs. diffusion) samples t-tests. Whenever the assumption of normality was violated, bootstrapped p-values were calculated. Multiples comparisons were corrected for using the Bonferroni correction.

To assess whether the reconstructed BOLD signals were representative of individuals’ own BOLD signals or whether they simply reflected general trends in BOLD activity learned from the training data set, we calculated the correlation between the reconstructed BOLD FC map from the test data set and the group-averaged FC map from the training data set in corresponding vertices. The correlations were calculated using all vertices within the five cortical masks combined.

We also compared the reconstructed FC map of each participant in the test data set to the intact FC map of the individual in the training data set that most resembles them, called the most similar individual (MSI). To determine each participant’s MSI, we correlated a given individual’s functional connectivity vectors for the vertices inside all five cortical masks combined with the vectors of corresponding vertices in each of the 80 training data set participants. The training data set individual showing the highest similarity (highest correlation) to a given participant from the test data set was identified as that participant’s MSI. The statistical significance of each of these average correlations (reconstructed vs. original, reconstructed vs. training, reconstructed vs. MSI) was assessed by examining parameter estimates generated from a repeated-measures analysis of variance (ANOVA), which also served to statistically compare whether these three average correlations were different from one another. Finally, we used paired t-tests as post-hoc tests to determine which pairs of correlations demonstrated a significant difference. We used a repeated measures ANOVA to determine whether the size of the compromised region significantly affected the reconstructive accuracy of the generated frames across 10 cortical masks.

Statistical tests were performed using SPSS Statistics 20.0 (IBM, NY). All statistical tests were two-sided, and 95% CI are reported. Effect sizes are reported for group-level analyses: η_p² for F-tests, η_p² for t-tests, and r coefficients for correlations. Means are presented along with their standard deviations (mean ± s.d.) in the results section, except when indicated otherwise.

Data availability

The GSP data set is available at http://neuroinformatics.harvard.edu/gsp/. The HCP data set is available at https://www.humanconnectome.org/study/hcp-young-adult/data-releases. The DBS data set is available from the corresponding authors upon reasonable request. Source data are provided with this paper.

Code availability

The code used in this article is available at http://nmr.mgh.harvard.edu/bid/DownLoad.html.

References

Sammartino, F. et al. 3-Tesla MRI in patients with fully implanted deep brain stimulation devices: a preliminary study in 10 patients. J. Neurosurg. 127, 892–898 (2017).
Article Google Scholar
Goodfellow, I. et al. Generative adversarial nets. Adv. Neural Inform. Process. Syst. 2672–2680 (2014).
Pathak, D., Krahenbuhl, P., Donahue, J., Darrell, T. & Efros, A. A. Context encoders: Feature learning by inpainting. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 2536–2544 (2016).
Radford, A., Metz, L. & Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. Preprint at https://arxiv.org/abs/1511.06434 (2015).
Schlegl, T., Seeböck, P., Waldstein, S. M., Schmidt-Erfurth, U. & Langs, G. Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. In International Conference on Information Processing in Medical Imaging 146–157 (2017).
Yeh, R. A. et al. Semantic image inpainting with deep generative models. In Proc. IEEE Conference on Computer Vision and Pattern Recognition 5485–5493 (2017).
Liu, Z., Luo, P., Wang, X. & Tang, X. Deep learning face attributes in the wild. In Proc. IEEE International Conference on Computer Vision 3730–3738 (2015).
Liu, X. & Duyn, J. H. Time-varying functional network information extracted from brief instances of spontaneous brain activity. Proc. Natl Acad. Sci. USA 110, 4392–4397 (2013).
Article ADS CAS Google Scholar
Tavor, I. et al. Task-free MRI predicts individual differences in brain activity during task performance. Science 352, 216–220 (2016).
Article ADS CAS Google Scholar
Holmes, A. J. et al. Brain Genomics Superstruct Project initial data release with structural, functional, and behavioral measures. Sci. Data 2, 150031 (2015).
Article Google Scholar
Langs, G. et al. Identifying shared brain networks in individuals by decoupling functional and anatomical variability. Cereb. Cortex 26, 4004–4014 (2016).
Article Google Scholar
Poldrack, R. A. et al. Long-term neural and physiological phenotyping of a single human. Nat. Commun. 6, 8885 (2015).
Article ADS CAS Google Scholar
Van Essen, D. C. et al. The WU-Minn human connectome project: an overview. Neuroimage 80, 62–79 (2013).
Article Google Scholar
Murphy, K., Birn, R. M., Handwerker, D. A., Jones, T. B. & Bandettini, P. A. The impact of global signal regression on resting state correlations: are anti-correlated networks introduced? Neuroimage 44, 893–905 (2009).
Article Google Scholar
Murphy, K. & Fox, M. D. Towards a consensus regarding global signal regression for resting state functional connectivity MRI. Neuroimage 154, 169–173 (2017).
Article Google Scholar
Deuschl, G. et al. A randomized trial of deep-brain stimulation for Parkinson’s disease. N. Engl. J. Med. 355, 896–908 (2006).
Article CAS Google Scholar
Deep-Brain Stimulation for Parkinson’s Disease Study Group. Deep-brain stimulation of the subthalamic nucleus or the pars interna of the globus pallidus in Parkinson’s disease. N. Engl. J. Med. 2001, 956–963 (2001).
Google Scholar
Wang, D. et al. Parcellating cortical functional networks in individuals. Nat. Neurosci. 18, 1853–1860 (2015).
Article CAS Google Scholar
Laumann, T. O. et al. Functional system and areal organization of a highly sampled individual human brain. Neuron 87, 657–670 (2015).
Article CAS Google Scholar
Fox, M. D. et al. Combining task-evoked and spontaneous activity to improve pre-operative brain mapping with fMRI. NeuroImage 124, 714–723 (2016).
Article Google Scholar
Qian, T. et al. Fast presurgical functional mapping using task-related intracranial high gamma activity. J. Neurosurg. 119, 26–36 (2013).
Article Google Scholar
Hodge, M. R. et al. ConnectomeDB—sharing human brain connectivity data. Neuroimage 124, 1102–1107 (2016).
Article Google Scholar
van der Kouwe, A. J. W., Benner, T., Salat, D. H. & Fischl, B. Brain morphometry with multiecho MPRAGE. Neuroimage 40, 559–569 (2008).
Article Google Scholar
Fischl, B., Sereno, M. I. & Dale, A. M. Cortical surface-based analysis. II: Inflation, flattening, and a surface-based coordinate system. Neuroimage 9, 195–207 (1999).
Article CAS Google Scholar
Greve, D. N. & Fischl, B. Accurate and robust brain image alignment using boundary-based registration. Neuroimage 48, 63–72 (2009).
Article Google Scholar
Yeo, B. et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J. Neurophysiol. 106, 1125–1165 (2011).
Article Google Scholar
Penny, W. D., Friston, K. J., Ashburner, J. T., Kiebel, S. J. & Nichols, T. E. Statistical Parametric Mapping: The Analysis of Functional Brain Images (Elsevier, 2011).
Jenkinson, M., Bannister, P., Brady, M. & Smith, S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage 17, 825–841 (2002).
Article Google Scholar
Smith, S. M. et al. Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage 23(Suppl 1), S208–219 (2004).
Article Google Scholar
Nair, V. & Hinton, G. E. Rectified linear units improve restricted boltzmann machines. In Proc. 27th International Conference on Machine Learning (ICML-10) 807–814 (2010).
Maas, A. L., Hannun, A. Y. & Ng, A. Y. Rectifier nonlinearities improve neural network acoustic models. Proc. ICML 30, 3 (2013).
Ioffe, S. & Szegedy, C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In International Conference on Machine Learning 448–456 (2015).
Kinga, D. & Adam, J. B. International Conference on Learning Representations Vol. 5 (ICLR, 2015).
Klein, A. & Tourville, J. 101 labeled brain images and a consistent human cortical labeling protocol. Front. Neurosci. 6, 171 (2012).
Marcus, D. S. et al. Human Connectome Project informatics: quality control, database services, and data visualization. Neuroimage 80, 202–219 (2013).
Article Google Scholar
Triantafyllou, C. et al. Comparison of physiological noise at 1.5 T, 3 T and 7 T and optimization of fMRI acquisition parameters. Neuroimage 26, 243–250 (2005).
Article CAS Google Scholar

Download references

Acknowledgements

This work is supported by National Natural Science Foundation of China grants 81790650 (H.L.), 81790652 (H.L.), 81527901 (L.L.), and 81720108021 (M.W.); the National Key Research and Development Program of China grants 2016YFC0105502 (L.L.), 2017YFA0205904 (B.H.), and 2017YFE0103600 (M.W.); Shenzhen International Cooperative Research Project grant GJHZ20180930110402104 (L.L.); Zhongyuan Thousand Talents Plan Project grant ZYQR201810117 (M.W.); NIH grants 1R01NS091604 (H.L.), R01DC017991 (H.L.), R21MH121831 (H.L.), and P50MH106435 (H.L.). L.D. is supported by a Canadian Institutes of Health Research postdoctoral fellowship, FRN: MFE-171291. We thank the National Center for Protein Sciences at Peking University for assistance with fMRI data processing. Data were provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University.

Author information

These authors contributed equally: Yuxiang Yan, Louisa Dahmani, Jianxun Ren.
These authors jointly supervised this work: Meiyun Wang, Luming Li, Bo Hong, Hesheng Liu.

Authors and Affiliations

Athinoula A. Martinos Center for Biomedical Imaging, Department of Radiology, Massachusetts General Hospital, Harvard Medical School, Charlestown, MA, USA
Yuxiang Yan, Louisa Dahmani, Jianxun Ren, Lunhao Shen, Xiaolong Peng, Ruiqi Wang, Changgeng He, Shijun Li & Hesheng Liu
Department of Biomedical Engineering, School of Medicine, Tsinghua University, Beijing, China
Yuxiang Yan & Bo Hong
Department of Radiology, Zhengzhou University People Hospital & Henan Provincial People’s Hospital, Zhengzhou, China
Louisa Dahmani & Meiyun Wang
National Engineering Laboratory for Neuromodulation, School of Aerospace Engineering, Tsinghua University, Beijing, China
Jianxun Ren, Lunhao Shen, Changgeng He, Changqing Jiang, Chen Gong, Ye Tian & Luming Li
Department of Neurosurgery, Tiantan Hospital, Capital Medical University, Beijing, China
Jianguo Zhang
Department of Neurosurgery, Peking Union Medical College Hospital, Beijing, China
Yi Guo
Department of Neurosurgery, First Affiliated Hospital of Fujian Medical University, Fuzhou, China
Yuanxiang Lin
Beijing Institute for Brain Disorders, Capital Medical University, Beijing, China
Luming Li & Hesheng Liu
Department of Neuroscience, Medical University of South Carolina, Charleston, SC, USA
Hesheng Liu

Authors

Yuxiang Yan
View author publications
You can also search for this author in PubMed Google Scholar
Louisa Dahmani
View author publications
You can also search for this author in PubMed Google Scholar
Jianxun Ren
View author publications
You can also search for this author in PubMed Google Scholar
Lunhao Shen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaolong Peng
View author publications
You can also search for this author in PubMed Google Scholar
Ruiqi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Changgeng He
View author publications
You can also search for this author in PubMed Google Scholar
Changqing Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Chen Gong
View author publications
You can also search for this author in PubMed Google Scholar
Ye Tian
View author publications
You can also search for this author in PubMed Google Scholar
Jianguo Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yuanxiang Lin
View author publications
You can also search for this author in PubMed Google Scholar
Shijun Li
View author publications
You can also search for this author in PubMed Google Scholar
Meiyun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Luming Li
View author publications
You can also search for this author in PubMed Google Scholar
Bo Hong
View author publications
You can also search for this author in PubMed Google Scholar
Hesheng Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L., Y.Y., B.H., L.L., and M.W. designed the research; Y.Y., L.D., J.R., L.S., X.P., R.W., C.H., C.J., C.G., Y.T., J.Z., Y.G., Y.L., and S.L. performed the research, Y.Y., L.D., J.R., and H.L. analyzed data, L.D., Y.Y., H.L., M.W., L.L., and B.H. wrote and improved the paper.

Corresponding authors

Correspondence to Meiyun Wang, Luming Li, Bo Hong or Hesheng Liu.

Ethics declarations

Competing interests

Luming Li serves on the chief scientific advisory board for Beijing Pins Medical Co., Ltd, and is listed as an inventor on issued patents and patent applications pertaining to the deep-brain stimulator used in this work. The remaining authors declare no competing interests.

Additional information

Peer review information Nature Communications thanks Alfonso Fasano and other, anonymous, reviewers for their contributions to the peer review of this work. Peer review reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yan, Y., Dahmani, L., Ren, J. et al. Reconstructing lost BOLD signal in individual participants using deep machine learning. Nat Commun 11, 5046 (2020). https://doi.org/10.1038/s41467-020-18823-9

Download citation

Received: 07 May 2019
Accepted: 14 September 2020
Published: 07 October 2020
DOI: https://doi.org/10.1038/s41467-020-18823-9

This article is cited by

Atypical functional connectivity hierarchy in Rolandic epilepsy
- Qirui Zhang
- Jiao Li
- Zhiqiang Zhang
Communications Biology (2023)
Fast cortical surface reconstruction from MRI using deep learning
- Jianxun Ren
- Qingyu Hu
- Hesheng Liu
Brain Informatics (2022)
Common and differential connectivity profiles of deep brain stimulation and capsulotomy in refractory obsessive-compulsive disorder
- Xiaoyu Chen
- Zhen Wang
- Zheng Wang
Molecular Psychiatry (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.