Introduction

Despite the significant morbidity, and mortality associated with bipolar disorder, its pathophysiology remains poorly understood. Moreover, while the available pharmacotherapeutics for bipolar mania have demonstrated efficacy, their mechanisms of action remain to be fully understood. Obtaining neuroimaging data prior to, and during the course of treatment provides an opportunity to more directly evaluate the effects of treatment on brain networks, and potentially to identify pretreatment brain features that predict treatment response [1, 2].

Developments in systems neuroscience have documented that the human brain is comprised of interconnected networks with distinct behavioral relevance. Psychiatric symptoms, such as manic episodes in bipolar illness may represent a disruption of normal integration amongst these cortical and subcortical brain regions [3,4,5,6,7], and these alterations may predict and normalize in relation to treatment outcome [8,9,10].

Graph-based theoretical analysis provides a powerful framework for characterizing topological properties of brain networks [11,12,13,14]. In this approach, the brain is modeled as a network composed of a number of nodes and edges connecting the nodes. Nodes represent individual cortical and subcortical regions, and the edges reflect their connectivity, which is needed for information transfer within and between networks. High-resolution structural MRI has been increasingly used to delineate whole-brain connectivity patterns by evaluating patterns of interregional gray matter (GM) volume measurements [15,16,17,18]. Such measures can help to understand how alterations in network anatomy are related to the neurofunctional and behavioral characteristics associated with bipolar disorder [19, 20], and have been used to detect topological differences in brain networks of individuals across different psychiatric disorders [18, 21, 22]. Moreover, there have been recent report that alteration of connectome structure has value in identifying and predicting treatment response in psychotic disorders [23].

In this study, we employed graph-based analysis to examine anatomic brain networks of acutely manic individuals at baseline, and over 1 and 8 weeks of treatment with either quetiapine or lithium. Analyses utilized a recently developed approach in which interregional similarities of regional brain gray matter volume measurements are used to identify structural brain networks [24, 25]. Based on previous studies in bipolar disorder by Xia and colleagues who found greater randomization in network configuration, represented by lower clustering coefficient (Cp) and characteristic path length (λ) [26], we hypothesized that: (i) untreated individuals with bipolar mania would demonstrate less coherent network configuration than healthy subjects, represented by differences in neuroanatomic network measures; (ii) these topological differences would be reduced with treatment, with changes positively correlating with treatment response; and (iii) structural connectome findings would differ between those individuals with bipolar disorder who responded to 8 weeks of treatment and those who did not respond.

Methods

Participants

This study was approved by the University of Cincinnati Institutional Review Board. All participants were evaluated clinically and using the Structured Clinical Interview for DSM-IV, Patient version (SCID-P) or the Washington University in St. Louis Kiddie Schedule for Affective Disorders and Schizophrenia (KSADS) [27]; manic symptoms were assessed with the Young Mania Rating Scale (YMRS) [28]. We enrolled 121 individuals (age range 16–55) meeting DSM-IV criteria for bipolar I disorder, from inpatient and outpatient services or by referral from community psychiatrists; all had an index YMRS total score ≥20. Of the 121 patients enrolled, data from seven in the lithium treatment group and five in quetiapine treatment group were lost to follow-up due to reasons including: moving out of town, lack of efficacy, withdrawal of consent, medication noncompliance, and adverse events. Statistical analysis of treatment effects only included the 109 patients who complete the clinical trial and provided usable MRI scan data at all time points.

We also recruited 60 healthy individuals from the communities where patients resided. All included healthy individuals had three MRI and clinical evaluations paralleling those of the patients, and had no known history of mood or psychotic disorders personally or in their first- or second-degree relatives. All participants were physically healthy based on a medical interview and a physical exam and were excluded for any substance use disorder with the exception of nicotine. Additional clinical information including lifetime comorbidity, lifetime medication exposure, smoking status, and body mass index are presented in Supplementary Materials (Supplementary Table S1). Written informed consent was obtained from all participants, or their legal guardian prior to their participation; written assent was obtained from all participants under age 18 years of age.

Treatment procedures

Following clinical evaluation and MRI scanning, subjects with bipolar disorder received treatment with quetiapine (n = 43) or lithium (n = 66), and followed clinically over 8 weeks as previously described. Lithium target dose was based on achieving serum levels of 0.8–1.2 mEq/L; the maximum level allowed was 1.5 mEq/L. Quetiapine was flexibly dosed with a target daily dose of 100 mg on day 1, 200 mg by day 2, 300 mg by day 3, and 400 mg by day 4. Quetiapine could be titrated to 600 mg on day 5, and to a maximum of 800 mg on subsequent days, or decreased to a minimum of 200 mg, at the clinician’s discretion. Treatment outcome was assessed using scores from the YMRS; response was defined as a ≥ 50% reduction in YMRS scores from baseline.

Data acquisition

MRI acquisitions were performed on a 4 Tesla (4 T) Varian Unity INOVA scanner with a 12-channel head coil as previously described [10, 29]. Earplugs and headphones were provided to block background noise, and foam padding around the head minimized head motion. Participants were scanned at three time points, at baseline (prior to treatment for patients), week 1, and week 8.

Following a three-plane gradient echo scan for alignment and localization, a shim procedure was performed to generate a homogeneous magnetic field. High resolution T1-weighted three-dimensional images were acquired with a Modified Driven Equilibrium Fourier Transform (MDEFT) protocol, optimized for the 4 T Varian scanner [Tau (magnetization preparation time) = 1.1 s, TR = 13 ms, TE = 5.3 ms, field of view = 192 mm × 256 mm × 256 mm, matrix = 192 × 256 × 256, flip angle = 20 degrees, slice thickness = 1 mm] as previously described [10, 29,30,31]. T1-weighted images were visually inspected to identify scans with excessive motion artifact for exclusion. No scanning artifacts or gross brain abnormalities were observed in any participant included in analyses.

MRI data preprocessing

Structural images were processed using Statistical Parametric Mapping software (SPM 12; http://www.fil.ion.ucl.ac.uk/spm). In brief, individual structural images were first segmented into gray matter (GM), white matter (WM), and cerebral spinal fluid (CSF) using the unified segmentation model [32]. The resulting GM maps were then normalized to Montreal Neurological Institute (MNI) space using a high-dimensional “DARTEL” approach and subjected to nonlinear modulation to compensate for spatial normalization effects. Finally, the GM data were re-sampled to 1.5 mm3 voxels and spatially smoothed (Gaussian kernel with a full width at half maximum of 6 mm).

Construction of structural networks and calculation of network properties

We used the automated anatomical labeling (AAL) algorithm to parcellate GM into 90 non-cerebellar anatomical regions-of-interest (ROIs) representing network “nodes” [33]. Edges represent the morphological connections between nodes. The well-established Kullback-Leibler divergence-based similarity (KLS) index was utilized to characterize the connections or edges between anatomic regions [10, 24]. In short, higher similarity in gray matter density distribution between two anatomical regions is represented by higher KLS scores; suggesting stronger connections and shorter edges between regions. The range of KLS is from 0 to 1, with 1 representing an identical density distribution for two regions. We calculated the KLS values between all possible pairs of 90 brain regions, generating a 90 × 90 similarity matrix for each subject. In this 90 × 90 network matrix, each row and column represent a brain region and each element represents the similarity of morphological distributions between a pair of brain regions.

Thresholding of correlation matrices reveal different topological properties and which may obscure putative group differences [34]. Determining a threshold involves a tradeoff between a sparse network (low threshold) and a highly linked one (high threshold). To avoid the bias coming from choosing single threshold level, we applied a wide range of sparsity (S) thresholds to all correlation matrices. The parameter S was determined ensuring that thresholded networks were estimable for the small-worldness scalar and that the small-world index (σ) was larger than 1.0 [35]. The range of S parameters thus determined was 0.10 < S < 0.34 with an interval of 0.01. For each network metric, the area under the curve (AUC), which provides a summarized scalar for the topological characterization of brain networks independent of a single threshold selection, was calculated. The AUC metric has been proven to be sensitive in detection of topological alterations of brain networks [36, 37].

Based on those thresholded weighted matrices, both global and nodal network properties were calculated for brain networks. The global metrics, including small-world parameters and network efficiency [38, 39], were examined as follows: small-world parameters included clustering coefficient (Cp), characteristic path length (Lp), normalized clustering coefficient (γ), normalized characteristic path length (λ), and small worldness (σ). The network efficiency parameters included local efficiency (Eloc) and global efficiency (Eglob). Node metrics obtained included nodal degree [34], nodal efficiency [38], and betweenness centrality [40]. More details of structural network construction and topological metrics calculation are included in Supplementary Materials.

As we had multiple scans on HC, we computed the intraclass correlation coefficient (ICC) of topological measurements to assess consistency of these measures over time in the healthy control group. Results of this analysis are presented in Supplementary Table S2.

Statistical analysis

Using MATLAB (www.mathworks.com), we applied nonparametric permutation tests to identify significant between-group differences in the AUC of network metrics [37]. Baseline values in global and nodal network properties of structural connectomes were compared between bipolar and healthy individuals. We randomly reallocated all values for each network metric into two groups and recomputed the mean differences between them. This randomization procedure was repeated 10,000 times, and the 95th percentile of each distribution was used as the critical value for a two-tailed test of the null hypothesis with a type I error of 0.05. The Benjamini-Hochberg false discovery rate (FDR) correction was applied to testing for differences in nodal measures.

To determine whether medication treatment had significant effects on topological measurements (all bipolar group vs control group), group-by-time interaction effects were tested using mixed effects models.

To test for changes in patients over the course of treatment, the primary analysis tested for significant normalization of alterations present at baseline, considering data from all three time points simultaneously. Correlations between changes in network measurements and changes in manic symptoms (YMRS) were examined, treating both age and sex as covariates. Similar analyses controlling for baseline YMRS scores were also conducted in the bipolar patients. In order to test for differences between quetiapine and lithium treatment on topological alterations, drug-by-time interaction effects within the bipolar group were examined using mixed effects models for global and nodal measurements that were altered relative to controls at baseline. In supplemental analyses, as smoking status and body mass index may impact changes of topological measurements, we tested the effects of these two variables on treatment-related changes using a general linear model.

Finally, we compared baseline connectome measures of treatment responders and non-responders. As a strategy for examining whether baseline structural connectome measures considered together can predict treatment response at week 8, we applied support vector machine (SVM) to the preprocessed GM images, as well as the whole connectome-wide matrices (90 × 90 Pearson correlation matrix). Statistical significance was estimated using the permutation method (1000 permutations). The methods used for these analyses are described in detail in Supplementary Materials.

Results

Demographic and clinical characteristics

Bipolar and healthy subjects did not significantly differ in either sex or age, nor did they differ between bipolar subjects receiving quetiapine or lithium treatment. Quetiapine and lithium groups did not differ in YMRS score at baseline, week 1, or week 8 (Table 1).

Table 1 Demographics and clinical characteristics of the participants.

There were no significant treatment-by-time interaction effects for YMRS scores between lithium and quetiapine (F = 0.14, p = 0.87). In addition, medication response, defined as a 50% decrease in YMRS score, did not differ between participants treated with quetiapine or lithium (Table 1).

Alterations of brain network properties at baseline

In the defined threshold range, both bipolar and healthy subjects showed small-world topology in brain structural connectomes at baseline. However, the bipolar group showed significantly lower Cp (p = 0.012) and λ (p = 0.004) at baseline (Fig. 1A, B) than controls, but the groups did not significantly differ in Lp (p = 0.27), γ (p = 0.69), σ (p = 0.71), Eloc (p = 0.074), or Eglob (p = 0.43).

Fig. 1: Change in global network measures over the course of treatment and correlation to YMRS change in individuals with bipolar disorder.
figure 1

A Clustering coefficient showed significant group-by-time interaction effects (F = 4.58, p = 0.01). B Normalized characteristic path length (λ) showed significant group-by-time interaction effects (F = 3.93, p = 0.02). C Normalized characteristic path length (λ) significantly correlated with changes in YMRS score (r = −0.22, p = 0.02). *Clustering coefficient (Cp) and normalized characteristic path length (λ) significantly differed between bipolar and healthy subjects at baseline (Cp: p = 0.01, λ: p < 0.01) and week 1 (Cp: p = 0.01, λ: p = 0.03). There were no significant differences in global brain network properties between groups at week 8. Quantitative data were demonstrated as mean ± standard error. HC Healthy individuals, P Patients with bipolar disorder, WK Week, YMRS Young Mania Rating Scale.

Nodal centrality differences between bipolar and healthy subjects at baseline did not survive application of an FDR threshold of q = 0.05, but nodal results are noted with nominal significance thresholds for heuristic purposes. Brain regions that showed these nominal between-group differences in nodal parameters at baseline included bilateral superior frontal gyrus (medial orbital aspect), right inferior frontal gyrus (triangular and orbital part), right middle cingulate and paracingulate gyri, right postcentral gyrus, right superior parietal gyrus, left anterior cingulate and paracingulate gyri, left posterior cingulate gyrus, left parahippocampal gyrus, left caudate, and left temporal pole (Fig. 2 and Table 2).

Fig. 2: Nodal centrality differences between bipolar and healthy subjects across time points.
figure 2

The nodes were mapped onto the cortical surfaces by using the BrainNet Viewer package (http://www.nitrc.org/projects/bnv). The nodal centrality differences between bipolar and healthy subjects at baseline did not survive application of an FDR threshold of q = 0.05, so the nodal results are presented with nominal significance thresholds for heuristic purposes. IFGtriang Inferior frontal gyrus, triangular part, ORBinf Inferior frontal gyrus, orbital part, ORBsupmed Superior frontal gyrus, medial orbital, ACG Anterior cingulate and paracingulate gyri, DCG Median cingulate and paracingulate gyri, PCG Posterior cingulate gyrus, PHG Parahippocampal gyrus, PoCG Postcentral gyrus, SPG Superior parietal gyrus, CAU Caudate, TPOmid Temporal pole: middle temporal gyrus, PUT Putaman, MTG Middle temporal gyrus, AMYG Amygdala. R = right hemisphere; L = left hemisphere.

Table 2 Regions showing differences in nodal centralities between bipolar and health subjects across different time points.

Differences between quetiapine and lithium treatment

There were no significant differences in any network parameters between treatment groups at baseline (Supplementary Table S3), or significant treatment-by-time interaction effects in any global or nodal network parameters (Supplementary Table S4). Therefore, in further analyses of changes with treatment and symptomatic correlations the two drug treatment groups were combined.

Topological alterations after medication treatment

We observed significant changes in Cp (F = 4.58, p = 0.01) and λ (F = 3.93, p = 0.02) in bipolar subjects with treatment, with both measures normalizing toward HC over the course of the study (Fig. 1A, B). Subjects with bipolar disorder continued to show significantly lower Cp (p = 0.01) and λ (p = 0.03) versus healthy subjects at week 1 of treatment, but there were no significant differences between bipolar and healthy subjects in these or any other global brain network property at week 8. Individual trajectories of Cp and λ over time are provided in Supplementary Materials. Further, change in λ over 8 weeks of treatment was significantly correlated with change in YMRS ratings (r = −0.22, p = 0.02) (Fig. 1C). Considering the baseline mania symptoms as an additional covariate did not meaningfully change this finding (r = −0.19, p = 0.05).

Bipolar participants showed progressively fewer brain regions with nominally significant differences from healthy subjects in nodal networks after 1 and 8 weeks of treatment (Fig. 2, Table 2). Nodal efficiency of left anterior cingulate (F = 3.63, p = 0.03) and left posterior cingulate gyri (F = 4.12, p = 0.02) showed nominally significant group-by-time interactions, in both case with bipolar participants demonstrating a relative normalization of nodal efficiency toward values of healthy participants (Fig. 3A, B). Moreover, changes in nodal efficiency of left anterior cingulate and left posterior cingulate gyri were significantly correlated with changes in YMRS scores (Fig. 3C, D). These relationships remained significant when considering baseline YMRS as a covariate (nodal efficiency of left anterior cingulate and paracingulate gyri [ACG]: r = 0.26, p < 0.01; nodal efficiency of left posterior cingulate gyrus [PCG]: r = −0.27, p < 0.01). Our general linear models also showed that changes of topological measurements were not significantly affected by body mass index or smoking status (Supplementary Tables S5S6).

Fig. 3: Change in regional network measures over the course of treatment and correlation to YMRS change in individuals with bipolar disorder.
figure 3

A Nodal efficiency of left anterior cingulate showed significant group-by-time interaction effects (F = 3.36, p = 0.03). B Nodal efficiency of left posterior cingulate gyrus showed significant group-by-time interaction effects (F = 4.12, p = 0.02). C Changes in nodal efficiency of left anterior cingulate significantly correlated with changes in YMRS score (r = 0.35, p < 0.001). D Changes in nodal efficiency of left posterior cingulate significantly correlated with changes in YMRS score (r = −0.22, p = 0.02). ACG Anterior cingulate and paracingulate gyri, HC Healthy individuals, L Left, P Patients with bipolar disorder, PCG Posterior cingulate gyrus, WK Week, YMRS Young Mania Rating Scale.

Prediction of treatment outcome

We applied machine learning to baseline scan data to predict treatment outcome. Using baseline preprocessed smoothed gray matter images (voxel level), the balanced mean accuracy of classification of responders versus non-responders (58%) was only slightly higher than chance (sensitivity 75% and specificity 42%, p = 0.04). However, using connectome-wide matrices at baseline, we achieved a mean accuracy of 74% (sensitivity 57% and specificity 91%, p < 0.05).

Discussion

We applied graph theory-based analysis methods to MRI structural images to identify and track changes in topological properties of GM networks, and the relationship of these changes to treatment outcome in acutely manic patients with bipolar disorder. Our findings indicate that at baseline, acute mania is associated with a more randomized structural brain network, reflected in lower clustering coefficient and lower normalized characteristic path length. Global network disruption also has been reported in other neuropsychiatric disorders including posttraumatic stress disorder [41], obsessive compulsive disorder [39], schizophrenia [42], and major depressive disorder [43], though with distinguishing specific characteristics. In addition, we observed a normalization of the identified neuroanatomic connectome alterations in clustering coefficient and normalized characteristic path length following acute treatment. Neuroanatomic normalization was correlated with reduction in mania symptom severity. These effects did not significantly differ between lithium and quetiapine therapy, suggesting that they represent downstream effects of the two drugs with different pharmacological mechanisms of action.

At the regional level, exploratory analyses of nodal alterations using nominal significance thresholds suggest altered nodal centralities in multiple brain regions. Although significance of these nodal-level findings does not survive correction for multiple comparisons, the large number of nominally significant differences between bipolar and healthy subjects is consistent with findings from our global network level analysis. Moreover, also consistent with our primary findings, the number of regions showing between-group differences decreased with treatment. Of note, both left anterior and posterior cingulate gyri, regions implicated in cognitive and affective regulation, showed normalization of nodal efficiency with treatment that correlated with symptomatic improvement [44, 45].

While most changes after treatment involved narrowing of differences between bipolar and healthy subjects in brain metrics, differences in amygdala nodal centrality were not present at baseline, but were evident after 8 weeks of treatment. The decrease of topological centrality of amygdala in manic patients is of interest given the recognized importance of this region in bipolar illness [46, 47]. The amygdala changes were later emerging (only at week 8) than most other treatment-related changes, and to not differ between drug treatments. These findings may be of clinical interest, but in the timing and nature of this effect, as it suggests a reduced integration of the amygdala with broader brain networks after treatment. The psychological consequence of such a change might be to tune down the bottom-up affective salience of life events on neocortical network function.

Consistent with prior studies [48, 49], we found that treatment with medications commonly used in psychiatric patient care can have a significant acute impact on structural brain network properties in patients with BD. Medications used to treat patients with BD have diverse effects on receptor and neuronal processes, but mechanisms of their effects on widely-distributed brain networks remain less well understood. Treatment mechanisms of lithium include inhibition of GSK3, increased expression of neurotrophins, and decreased expression of AMPA receptors [50]. Effects of quetiapine include antagonism on D2 and 5-HT2 receptors and others [51]. Inducing changes in receptor-mediated effects alters the functional activity of neurons, with downstream effects on other neurons with which they interact in brain networks. These changes in neuronal activity can induce changes in proliferation, differentiation, and myelination to alter neuronal microstructure with effects that in aggregate can impact regional cortical thickness [52]. The lack of significant differences in treatment effects between our two study drugs that have quite different pharmacological effects suggests that our findings may represent aspects of the final common pathway of effective pharmacological treatments for bipolar disorder rather than features specific to either study drug.

The potential clinical utility of identifying neuroimaging biomarkers of treatment response in bipolar disorder has been increasingly recognized [53,54,55]. There are several independent aspects of biomarker development, with most early machine learning studies focusing on case-control differentiation. However, there may be greater clinical utility to the prediction of treatment outcome. In this study, we found that connectome matrices provided a more accurate prediction of treatment response than regional data directly derived from preprocessed smoothed gray matter images. The prediction model using preprocessed smoothed gray matter images showed 75% sensitivity and 42% specificity, while using connectome-wide matrices we observed lower sensitivity (57%) but high specificity (91%). Thus, the method using preprocessed smoothed gray matter images performed better in detecting true positives (i.e., the true responders) while the method using connectome-wide matrices performed better in detecting true negatives (i.e., the true non-responders). While noteworthy and statistically significant, utility of connectome measures for predicting treatment outcome remains less than required for clinical application at this point. And, our findings failed to identify connectome features that differentially predicted response to the two study medications.

While this is one of the first studies to examine the clinical relevance of structural connectome alterations in bipolar disorder, there are several important limitations. As noted, the significance of nodal centrality differences did not survive application of an FDR threshold of q = 0.05, though the aggregate findings and treatment-related changes in anterior and posterior cingulate gyrus are of interest. Second, brain parcellation template selection may affect network analysis results, as different templates may lead to different estimates of graph theory parameters [56]. Third, the machine learning method is exploratory; optimal resolution/reduction of features for machine learning studies remains to be established, and including higher or lower numbers of input features may lead to somewhat different solutions--evaluating the possible impact of alternative feature selection strategies on results awaits future studies. Finally, two major questions remain to be addressed: what are the specific neural changes that occurred that led to the connectome-level changes, and what are the cognitive and behavioral changes related to the connectome changes, and do they persist over the course of long-term treatment. Examining the relation of structural and functional connectome changes may be particularly informative, structural connectomics have modest correspondence to brain functional networks in patients with bipolar [57].

In conclusion, the present study provides evidence of global network alterations in bipolar disorder during acute mania. Our observation of baseline connectome abnormalities provides neuroanatomic evidence for distributed dysconnectivity during acute episodes of mania, and novel evidence that these alterations can be normalized with widely use therapeutics for mania and are associated with symptomatic improvement. The potential clinical relevance of these findings is emphasized by the particularly strong specificity in predicting treatment response, as well as by the similarities in findings between subjects receiving quetiapine and lithium. An increasing number of studies have shown that morphological covariance networks can exhibit adaptive reorganization in various brain disorders, and our findings indicate that such clinically-relevant changes in anatomic measures can occur over the course of acute treatment [58,59,60]. Taken together, these studies suggest that morphological covariance networks are a clinically meaningful way to identify illness-related brain features of bipolar illness. Such structural connectome measures may significantly contribute to the development of improved imaging models to better track and predict treatment outcome in bipolar disorder.