Introduction

GM atrophy is a key determinant of long-term disability in MS1,2,3. However, the intermediate steps linking brain tissue damage to accrual of disability in MS are not well understood and therefore cannot be effectively tackled by treatments or used to build accurate predictive models of clinical progression. In this context, it has been proposed that the disruption of functional and structural brain networks may be related to these intermediate processes leading, eventually, to physical and cognitive disability in MS4.

Among the types of studied brain networks, there are structural covariance networks (SCNs), which are based on the anatomical similarity, at the group level, between different cortical areas5,6,7,8. Cortical areas sharing common functions with evolutionary shared development have greater similarity for macroscopic features than non-functionally related regions9. Thus, the study of the changes in the patterns of morphological similarity in the cortex has emerged as a useful tool to examine the link between structural damage and functional consequences10.

SCNs are constructed from the statistical covariance between cortical regions in terms of any morphological variable, such as cortical thickness, cortical surface or cortical volume. However, SCNs built based on the statistical covariance of thicknesses of different cortical regions are most frequently studied. Thus, the nodes of these networks are the different brain areas, and the edge connecting any two of these nodes is determined as the correlation in thicknesses between cortical areas forming the two nodes. Because SCNs are strongly influenced by congenital and developmental functional relationships between cortical areas, but are also susceptible to acquired damage in the cortical GM, the investigation of SCN can provide information complementary to functional and diffusion-based connectivity analyses6,7,11.

SCNs have also the potential to provide information complementary to more classical univariate analyses of structural damage, such as cortical thinning. Covariance network analysis accounts for inter-regional correlations in cortical thickness data and allows us to estimate the actual impact of such correlations on disease progression. This implies that SCN analysis is able to better evaluate the complexity of human brain cortex structure than classical univariate approaches, where relationships between cortical regions are not taken into consideration. In fact, SCN analysis has been applied to several neurodegenerative disorders, revealing neurodegeneration at the network level that might be missed at regional-level12.

However, despite the potential of SCN analysis to unveil new mechanisms of neurodegeneration, the longitudinal assessment of SCN dynamics and their pathological relevance have not yet been explored.

We hypothesised that longitudinal SCN analyses in patients at the earliest stages of MS may provide complementary information to conventional cortical GM thickness analysis, revealing important –and otherwise hidden– features of GM dynamics. Our aims were to characterise the longitudinal changes in SCN parameters in the early CIS, comparing the behaviour of SCNs of CIS patients and age-matched healthy controls, and importantly, to investigate the clinical relevance of SCNs, through assessing SCN changes in those who converted to clinically definite MS within the first year. Our aims were achieved applying robust, computationally intensive bootstrapping-based statistical methods to obtain data-driven estimations of confidence intervals of our network parameters13, which allowed us to make unbiased statistical inferences. Of note, given the particular group-level nature of SCN metrics, such statistical inferences would not have been possible using more conventional statistical approaches.

Results

Descriptive statistics

For this study, we included 21 patients (18 female, mean age [standard deviation, SD]: 32.24 [6.18] years) and 7 healthy controls (HCs) (5 female; 30.14 [3.44] years). All subjects included in this study belong to a cohort of 28 CIS patients and 10 HCs, whose clinical and demographic features have been previously reported14,15,16,17. We excluded: two HCs because they only had baseline data; one HC and four CIS patients because of the low quality of the cortical thickness segmentation; and three CIS patients because they did not attend the last follow-up and they had also missed at least one more time point, implying that a single imputation method would have not provided reliable enough information.

During follow-up, nine patients presented a second clinical attack suggestive of MS. These patients were diagnosed with clinically definite MS (called ‘clinical MS converters’). Three additional patients presented new (asymptomatic) lesions, fulfilling the criteria for dissemination in space (DIS) and time (DIT). Thus, the nine patients with a second attack plus the three patients fulfilling DIS and DIT criteria through MRI fulfilled the 2010 McDonald criteria for MS18 (Table 1).

Table 1 Demographical, clinical and MRI characteristics of the study cohort.

Cortical thickness and other structural imaging data analyses

At baseline, mean (SD) cortical thickness was 2.713 (0.135) mm in patients, and 2.751 (0.058) mm in controls. At follow-up, mean (SD) cortical thickness was 2.715 (0.087) mm in patients and 2.726 (0.052) mm in HCs. In patients, median lesion loads at baseline and follow-up were 1.525 mL (range: 0 to 16.199 mL) and 2.918 mL (range: 0 to 17.038 mL). The main locations of the MS lesions are shown in Supplementary Table 1. After adjusting for age, gender and T2 lesion load, no significant differences in mean cortical thickness were observed between CIS patients and HCs and baseline or at one-year follow-up (p = 0.748 and 0.367, respectively). Over time, neither CIS patients: −0.0003 mm/month (95% CI: −0.003 to 0.003 mm/month), p = 0.822, nor HCs: −0.002 mm/month (−0.004 to 0.0004 mm/month), p = 0.338, showed a significant change in mean cortical thickness over time. No differences between these rates of change were observed either (p = 0.46).

Similarly, no differences in cortical thickness were observed between patients with and without early conversion to MS at baseline or follow-up (p = 0.5079 and 0.9103, respectively), after adjusting for age, gender and T2 lesion load. Additionally, none of the two patient groups showed a significant change in cortical thickness over one year: CIS with conversion to MS: 0.001 (−0.005 to 0.007), p = 0.655; CIS without conversion to MS: −0.001 (−0.005 to 0.004), p = 0.586. Besides, these rates of change were not significantly different from each other (p = 0.4842).

Structural cortical network analysis

All CIS patients vs. HCs

At baseline, no differences were found in any of the metrics between all CIS patients and HCs (Fig. 1 and Supplementary Figure 1).

Figure 1
figure 1

Network metrics over time in all CIS patients and HCs. This figure shows the evolution of network metrics over time for patients’ and controls’ networks, with the bootstrap standard errors. Although at follow-up only the mean shortest path was significantly different between patients’ and controls’ networks, at six-month follow-up, most of the metrics were significantly different between the two networks; *p < 0.05; **p < 0.01; ***p < 0.001.

When looking at the changes over time, CIS patients’ network tended to show an overall decrease in connectivity and significant differences in several metrics were observed between patients’ and controls’ networks at 3 and, especially, at 6 months of follow-up: mean nodal strength (p < 0.001), mean clustering coefficient (p = 0.01), mean shortest path (p < 0.001), global efficiency (p = 0.01) and local efficiency (p = 0.01) (Fig. 1). Yet none of the 12-month longitudinal changes in either patients’ or controls’ networks reached statistical significance (Table 2).

Table 2 One-year changes in SCN parameters, in CIS patients and HCs.

Supplementary Figure 2 shows the bootstrap distributions used to ascertain the statistical significance of the 12-month changes in network parameters and, as mentioned, no differences between groups were observed. Supplementary Table 2 shows the point estimates and bootstrap-based 95% CIs for baseline and 12-month network parameters: only the mean shortest path was significantly different (i.e. greater, p < 0.001) in patients than controls, at 12-month follow-up.

CIS patients with vs. without clinical conversion to MS

At baseline, the network of CIS patients who later clinically converted to MS during the follow-up showed features of greater nodal connectivity, i.e. significantly greater mean nodal strength (p = 0.02) and clustering coefficient (p = 0.02), and features of better network integration, i.e. lower mean shortest path (p < 0.001) and greater global (p = 0.02) and local (p = 0.02) efficiency, than the network of non-converters, who showed similar values to HCs’ network (Fig. 2).

Figure 2
figure 2

Network metrics over time in MS converters and non-MS converters. This figure shows the evolution of network metrics over time for MS converters and non-MS converters, with the bootstrap standard errors. *p < 0.05; **p < 0.01; ***p < 0.001. Please look at Table 3 for more details.

Over one year, clinical MS converters’ network showed significant changes in all metrics related to nodal connectivity: mean nodal strength decreased by 10.18 units (bootstrap-based 95% CI: −16.22 to −0.87), p = 0.02; mean clustering coefficient decreased by 0.16 units (−0.27 to −0.01), p = 0.02; and all metrics related to nodal distance: both global (−0.12 (−0.20 to −0.02), p = 0.01) and local efficiency (−0.15 (−0.25 to −0.01), p = 0.02) decreased, whereas mean shortest path increased (0.56 (0.19 to 0.80), p = 0.001) over one year. Instead, non-converters network did not show any significant changes over time, except for the mean shortest path, which decreased over 1 year (p = 0.01), mirroring HCs’ network behaviour. The rates of change over time of the two networks were all strongly significantly different from each other (p-values from 0.01 to <0.001), except for the change in modularity coefficient (p = 0.5) (Table 3 and Fig. 2). Supplementary Figures 3 and 4 show the bootstrap distributions used to ascertain the statistical significance of the parameters. Supplementary Table 3 shows the point estimates and bootstrap-based 95% CIs for baseline and 12-month network parameters.

Table 3 One-year changes in SCN parameters, in CIS patients with and without clinical conversion to MS.

When analysing regional changes in those metrics that admit a node-level analysis, i.e. nodal strength, clustering coefficient and local efficiency, the changes in the network of patients with an early conversion to MS showed a very similar pattern to those network changes observed for all patients together. Instead, network changes of those who did not convert were similar to changes in HCs’ network (Fig. 3). Please see Table 4 for a summary of these results.

Figure 3
figure 3

Estimated monthly changes in nodal metrics for all groups. This figure shows the estimated monthly changes in the three node-level metrics: (A) nodal strength, (B) clustering coefficient and (C) local efficiency. In this figure, the size of the nodes indicates the absolute value of the change: bigger nodes have greater changes, in either direction, and smaller nodes have changes close to zero. Instead, the color of the nodes indicates the direction of the change: redder nodes indicate positive changes (increase) and bluer nodes indicate negative changes (decrease). The behavior over time of all-CIS network differed from that of HCs’ network, yet these differences did not reach statistical significance (see Table 2). This different behavior was mirrored by the two patient groups: whereas the non-converters’ network only showed minimal changes over time, the converters’ network showed an overall decrease in nodal strength, clustering coefficient and local efficiency. Of note, the regional distribution of nodes with greatest changes over time (pre- and post-central gyri and paracentral lobule, bilaterally) was similar for the all-CIS and the converters’ networks. Abbreviations: CIS: clinically isolated syndrome; HCs: healthy controls.

Table 4 Summary table of the differences between groups.

A post-hoc analysis comparing the network of patients who were diagnosed with McDonald MS (i.e. either they had a second clinical attack or fulfilled the criteria for DIS and DIT during the follow-up, N = 12) with the network of those not fulfilling the diagnosis of McDonald MS (N = 9) was also performed. This comparison revealed very similar results to those obtained with the comparison of clinical MS converters (N = 9) vs. non-clinical MS converters (N = 12), as shown in Supplementary Tables 4 and 5.

Discussion

In this longitudinal study, we provide evidence, for the first time, of reorganisational changes in SCNs of CIS patients who experience an early conversion to clinically definite MS, in the absence of detectable cortical atrophy. These results suggest that SCN analysis may be more sensitive to those changes occurring in the cortical GM at the earliest stages of the disease than conventional, regression-type (average-based) approaches of structural data. Besides, these results indicate that SCN changes may be clinically relevant.

Although structural covariance networks and cortical thickness data are indirectly related8, structural networks reflect global aspects of the data that cannot be captured by conventional analyses of cortical thickness: SCNs reflect the associations between cortical regions, and their longitudinal evaluation hence reflects how these associations change over time, revealing hidden patterns in GM dynamics not observable with classical (linear regression-type) approaches8. However, despite numerous longitudinal analyses reported in the literature of cortical thickness data to assess disease progression in neurodegenerative conditions19,20,21, this has not yet been assessed through SCNs. In our study, despite the absence of cortical atrophy in patients, we observed features of increased network connectivity and network integration at baseline among those patients who later converted to MS. This was followed by a clear trend towards an overall loss of network connectivity over time and a deterioration of the network’s ability to integrate information, which was statistically significant in those patients who converted to MS during the follow-up. None of these reorganisational changes, either at baseline or during follow-up, were observed in non-converters’ or HCs’ networks. These results suggest that SCN analyses may be more sensitive to pathological processes occurring at these very early stages of MS than conventional analyses of cortical thickness data. Of note, the experimental groups of our study were biased towards females, indicating they were representative of the population of MS, which mainly affects females, especially with the relapse-onset phenotypes. On the other hand, in our analyses, we had initially adjusted our cortical thickness data for age and gender. This step was performed to ensure that the potential confounding effect of gender in our results was minimised or even eliminated.

Importantly, these results are in line with the idea that SCN analysis provides complementary information to more classical analyses of structural damage because they account for aspects of the data that conventionally were considered, at most, as nuisance parameters. In SCN analysis, instead, such correlations are no longer nuisance parameters to be allowed for when computing the uncertainty with which a given parameter of interest is estimated, but they are the main parameters of interest.

Disruption of SCNs was first described in MS in 2009 by He et al.22, and in 2014, Tewarie et al.11, found that SCNs disruption in MS was echoed by changes in functional networks obtained through magnetoencephalography. However, the cross-sectional design of those studies did not allow the characterisation of the dynamics of such disruption, as opposed to our study, where we could observe that the disruption occurred after an initial explosion of SCN connectivity, suggesting the presence of a network reorganisation after an initial perturbation, which would have been overlooked should we not had used a longitudinal design. Importantly, our longitudinal design also allowed us to evaluate the clinical relevance of such reorganisational network changes, which would not have been possible if we had only considered a single time point. Notably, such initial explosion of SCN connectivity observed in patients and especially in patients who later on converted to MS was mainly at the expense of negative correlations across cortical areas, as can be observed in Fig. 5. As time went by, such strong correlations tended to disappear, especially in the MS converters’ network. At the end of the follow-up, both patients’ networks had lower connectivity and global and local efficiency than the HCs’ network (although non-significantly) (Supplementary Tables 69).

Figure 4
figure 4

Study pipeline. This is the study pipeline that we followed to obtain the network parameters at each time point and also the changes over 1 year. For each parameter and time point, and for each parameter change, 1000 bootstrap samples were used to compute the 95% Confidence Intervals. See main text for more details. Abbreviations: CIS: clinically isolated syndrome; HCs: healthy controls; FU: follow-up.

Structural covariance networks are built upon the assumption that, in the healthy brain, two brain cortical areas showing morphological similarity in terms of their cortical thickness are physically and/or functionally connected5,8,23,24. However, in the diseased brain, the interpretation of modified or disrupted covariance networks is not straightforward. The time scale for the shaping of the covariance networks in the healthy brain is believed to span many years7, and it is therefore unlikely that their abnormal configuration or disruption, especially if observed over the short time period of one year, can be attributed to a disease-related damage in the underlying physical or functional connection between cortical areas. Besides, the fact that our cortical networks were built using cortical thickness values adjusted for lesion load (and other confounders) makes it unlikely that the disruption of WM tracts is the cause of the alteration in the correlations between cortical areas. On the other hand, it is also unlikely that the initially increased covariance network connectivity observed in those patients who later on converted to MS and its subsequent disruption over time are unrelated events. This statement would be supported by the presence of strikingly similar values of SCN connectivity at one-year follow-up in converters and non-converters. We speculate that these early reorganisational changes could reflect the presence of a microscopic inflammatory cortical process around the time of the CIS in those who will experience early future conversion, leading to an initially more homogeneous cortex among future MS converters manifesting as greater connectivity. Importantly, all cortical thickness measures used to build SCNs had been adjusted for white matter lesion load at baseline, implying that any presence of cortical inflammation at the time of the CIS must have behaved independently of macroscopic white matter lesion load. However, since we used 1.5 T scans, we could not assess the presence of cortical lesions in our patients25, so we could not explore this hypothesis further. Yet we did explore, in a post-hoc analysis, the effect of the lesion load on the actual network metrics (Supplementary Tables 1013). This analysis showed that the metrics of the HCs’ network were almost unchanged, as expected, when using cortical thickness unadjusted for lesion load, though adjusted for age, gender and mean cortical thickness (see Methods for full details), as compared with the ones obtained with fully adjusted cortical thickness values. Instead, the metrics of the patients’ networks, especially the networks for all CIS patients and clinical MS converters, tended to show, in general, lower connectivity than the corresponding adjusted networks. This was especially evident for the 12-month follow-up networks. Therefore, this implied that, in general, differences between patients’ and controls’ networks or between converters’ and non-converters’ networks increased, as compared to fully adjusted cortical networks. This supports the idea that the explosion of connectivity at onset was indeed probably associated with greater inflammation. Future studies with high-field multi-modal MR images are warranted to understand the pathological substrate underlying cortical network changes in early MS.

When patients were divided into two groups – those who clinically converted and those who did not convert to MS within the one-year follow-up, SCN analysis revealed striking differences between groups: at baseline, converters’ network showed signs of greater connectivity than non-converters’ network. However, during the follow-up, whereas MS converters exhibited longitudinal changes consistent with progressive disruption of SCNs, the non-converters’ network overall showed minimal, non-significant, changes over time, mirroring the behaviour of HCs’ network. This is the first time that clinical relevance of covariance cortical networks in MS has been reported from longitudinal data. Reassuringly, similar results were found when MS converters were compared to non-MS converters according to the 2010 McDonald criteria18 (Supplementary Tables 4 and 5).

Importantly, the similarity between all patients’ and MS converters’ networks was also reflected in the regional distribution of nodal connectivity changes, reinforcing the concept that SCN changes may be clinically relevant: losses of nodal strength, clustering coefficient and local efficiency seemed to be more obvious in pre- and post-central gyri and the paracentral lobule, bilaterally, not only in (all) patients’ network, but also in the MS converters’ network, as shown in Fig. 3. Interestingly, these central regions, which in this context would indicate those with greatest loss of similarity with other cortical regions, have been also identified, in diffusion-based networks, as hubs and rich-club members26. Rich-club members are nodes with a high number of anatomical connections, preferentially involving other highly connected nodes, which are known to play key roles in the hierarchical organisation of the human brain26 and seem to possess specific microstructural features27.

Although SCN analyses cannot elucidate the biological processes underlying the observed changes in covariance patterns, this does not reduce the importance of our findings. Our study may have revealed a crucial aspect of MS pathology detectable in those who convert early to MS, indicating its clinical relevance, and only when an analysis of the covariance across cortical regions’ thickness is performed. Besides, although observed at the group level (because of the nature of SCNs), our results provide valid grounds for the generation of new hypotheses about unexplored mechanisms underlying disability progression. In this case, it is likely that the ultimate cause associated with such cortical network reorganisational changes is also related to conversion to MS, rather than being the presence of such SCN changes per se the factor actually conferring a higher risk for conversion. For instance, a post-hoc analysis comparing patients with supratentorial lesions only and patients with supra- and infratentorial lesions, at any time during the 12-month follow-up period, revealed that the network of those patients with both supra- and infratentorial lesions behaved similarly to that of patients with an early conversion to MS (Supplementary Tables 14 and 15). Whereas this reflects the known relationship between presence of infratentorial lesions and a worse prognosis, it might also reflect a possible mechanism through which the presence of infratentorial lesions entails this worse prognosis. In any case, further studies combining SCN analysis and other connectivity and structural imaging methods, such as resting-state fMRI, high-resolution magnetisation transfer imaging28 and diffusion-weighted imaging29, are required to elucidate this point. Importantly, SCN methodology can also help assess how the presence of certain well-known risk factors, such as smoking habit or low vitamin D levels, which many times can only be assessed at the group level, or how the implementation of population-level measures, such as education programmes about healthier diet or habits, can modify multiple pathological aspects of the disease, at once (i.e. in a multivariate manner) and straightforwardly.

Covariance networks are built based on correlations at the group level. This means that classical statistical methods such as t-test or regression-type approaches cannot assess differences between groups or between time points. Even if for many network metrics, in particular for those implying node-level estimations (i.e. nodal clustering coefficient or nodal local efficiency), mean values are estimated and there is a network-level variability based on the values estimated for the different nodes, this variability could not be used to carry out statistical tests based on the group variance given the strong dependence of such values within the network. In this study, we utilised a bootstrap-based approach to estimate the limits of the 95% confidence intervals for the parameters that define the networks (e.g. mean nodal strength), needed to estimate whether two networks are actually different or whether an observed change over time is statistically significant. Bootstrap-based approaches rely on the variability of the original data to estimate the confidence intervals, without making any distributional assumptions13,30. Thus, they are much more robust and conservative than classical inference methods, implying that the risk of type I error is actually below the nominal 5% and that the changes we observed in the parameters of the converters’ network are likely to be genuine. Of note, the use of robust bootstrapping techniques to assess the uncertainty of estimated parameters is particularly relevant when these parameters are derived from a given network, where it would not be correct to make parametric assumptions based on the distribution of individual nodal metrics within the same network to estimate the uncertainty of the network-defining parameter30.

Although SCN analysis challenges the possibility of drawing conclusions at the individual level, it appears as an attractive and straightforward method to assess the relationship between complex multi-dimensional predictors, such as cortical thickness data, and a given clinical outcome. Although some authors have proposed individualised covariance networks using cortical thickness data to study neurodegeneration31,32, these have not yet been published in MS. Besides, the relationship between individual-level and group-level network disruptions in other neurodegenerative conditions has not yet been assessed either.

A further methodological consideration relates to the fact that thickness values from each region were treated equally in the statistical analysis, as previous studies have reported8. This implies that we did not account for the different volumes of the brain areas, although this might not be relevant in our case, since we were interested in the associations between cortical thicknesses, which are those that are believed to reflect underlying physical or functional connections8. Another consideration is that we constructed the weighted graphs using the absolute values of the Pearson’s correlation coefficients of the covariance matrices, as described previously8. Although a visual inspection of the correlation matrices at baseline and at 12-month follow-up suggest that increased baseline connectivity in patients (and especially in early MS converters) was mainly at the expense of negative correlations, a formal connectivity analysis for positive and negative graphs was not performed in this study. Although it may be interesting to assess independently networks derived from positive and negative correlations, this was beyond the scope of this study. This analysis would also pose other challenges with methodology and interpretation and may deserve attention in future studies.

Another consideration refers to the fact that we used the 2010 revisions to the McDonald Criteria18 for the diagnosis of McDonald MS, instead of the most recent ones, published in 201833. The 2010 revisions to the McDonald Criteria for the diagnosis of MS were the current criteria for the diagnosis of MS when this study was designed, in 2015. We acknowledge that the results might be slightly different from those that we present now if we had used the 2017 revisions of the McDonald criteria.

Finally, the small sample size, especially in controls, is an important limitation of the study. Although the lack of significant changes in controls could have theoretically been attributed to controls’ smaller sample size, when we compared MS converters with non-converters, where also covariance networks were built with small sample sizes, MS converters showed statistically significant changes that were not observed in non-converters. Importantly, because all our inferences related to SCNs were made through robust bootstrap-based approaches, it is possible that the observed significant results are in fact genuine. However, it is also true that the sample size was low even for using a bootstrap approach, therefore the robustness of the network of anatomical covariance analysis was not guaranteed, and this could have biased the statistical results. We therefore consider these results as preliminary and acknowledge the need to replicate them in larger cohorts.

In conclusion, SCNs of patients with their first inflammatory-demyelinating episode of the CNS, and especially of those who convert to MS, show clear reorganisational changes in the absence of detectable cortical atrophy. Thus, covariance network analysis not only is more sensitive than conventional analysis of structural data but also seems to unveil clinically relevant aspects of this condition. Of note, the low sample size indicates this is preliminary work and the results and conclusions derived from it should be taken with caution and be confirmed in larger cohorts. In any case, the fact that covariance network analysis can be applied retrospectively to large cohorts of patients who have undergone T1-weighted scans within other protocols makes this technique a powerful and straightforward method to help us understand pathological processes underlying disease progression in MS.

Methods

Subjects

We included consecutive patients with a diagnosis of optic neuritis as their first demyelinating attack of the central nervous system (i.e. CIS) within the previous four weeks. All patients underwent clinical assessments, at baseline, 3, 6 and 12 months. At each visit, apart from visual assessments, we recorded whether the patient had suffered a second clinical attack. Additionally, we also recorded the presence of new white matter T2 lesions and whether these occurred in different locations, fulfilling dissemination in time and space criteria and therefore fulfilling the diagnostic criteria of MS according to the 2010 revisions to the McDonald criteria18 (Table 1). This has been reported elsewhere16,17. We also included a group of age-matched healthy controls (HCs). All participants underwent MRI scans. The study was approved by the Ethics Committee of the National Hospital for Neurology and Neurosurgery, part of the University College London Hospitals NHS Foundation Trust. This is an observational study and all methods were performed in accordance with the relevant guidelines and regulations. All participants provided informed written consent.

MRI analysis

All MRI scans were performed with a 1.5 T GE Signa Echospeed MRI (Milwaukee, WI) scanner. The scanner maximum gradient strength was 33 mT m−1.

Acquisition of brain structural scans

For all subjects, we acquired the following images at baseline, 3, 6 and 12 months: axial oblique, proton-density (PD), dual echo, fast spin echo images (resolution: 0.9 × 0.9 × 5 mm3), as previously described14,15,16,17; axial three-dimensional fast prepared spoiled gradient recall (3D-FSPGR, 3D T1-weighted) (resolution: 1.2 × 1.2 × 1.2 mm3).

T2 hyperintense WM lesions were manually outlined by an experienced observer from the PD-weighted images using the semi-automated edge finding tool from JIM (JIM v6.0, Xinapse systems, Aldwincle, UK, http://www.xinapse.com). PD-weighted lesion masks were co-registered to the 3D-T1 images using a pseudo-T1 image generated by subtracting the PD from the T2-weighted image34. Lesion masks were transformed from native space to 3DT1 space. The 3DT1 images were filled using a non-local patch match lesion filling technique35.

Measurement of cortical thickness

We calculated cortical thicknesses for 68 bilateral brain cortical areas using the FreeSurfer version 5.3 longitudinal stream36, the technical details of which have been explained elsewhere36,37,38. Briefly, this included skull-stripping, intensity normalisation, non-linear registration to Talairach space, segmentation, estimation of brain surfaces, and surface parcellation. We visually assessed the final segmentation, and re-ran the pipeline after manual correction in cases of incorrect surface estimation. To perform unbiased longitudinal image analysis, we created a symmetric within-subject template39. Next, all the steps were re-initialised for each time point using the common information to increase reliability and statistical power36. We extracted cortical thickness values for each cortical parcellation according to Desikan-Killiany atlas38.

Structural covariance network analysis

Construction of weighted structural covariance networks (Fig. 5): We constructed two SCNs, one at baseline and one at 12-month follow-up, for each of these four groups – a) HCs, b) all CIS patients, c) MS-converter CIS patients and d) non-converter CIS patients, as follows (of note, although SCNs were also built at 3- and 6-month follow-up, the main analysis involved only baseline and 12-month follow-up SCNs):

  1. 1)

    In 3 patients who did not attend the 12-month follow-up but who attended the 0, 3 and 6-month time points, cortical thickness at 12-month follow-up was imputed using each patient’s cortical thickness trajectory, i.e. a single imputation technique based on simple linear regression.

  2. 2)

    Because we were interested in knowing the relationships between cortical thickness values across regions whilst adjusting for the effects of possible confounders, we removed variability in the cortical thickness data related to lesion load, cortical atrophy, age or gender, all cortical thickness values, for all subjects and time points, regressing all cortical thickness values at once over baseline lesion load and baseline cortical thickness, age and gender. Total intracranial volume (TIV) obtained through GIF segmentation40 of the T1-weighted images was also explored as a covariate. Yet, given its potential association with age and gender, it was only kept if significant at 5% level, to avoid collinearity. The residuals of these regression models were considered the new, adjusted, cortical thickness values to be used in subsequent steps8. For this step, controls were assigned a lesion load equal to zero mL, as previously performed17,41,42, to be able to obtain the residuals for controls through the same regression model as that used to obtain patients’ residuals. This allowed us to maintain the original relationships between patients and controls. The assumptions of the linear regression were checked through the analysis of the residuals.

  3. 3)

    Pairwise correlation matrices using adjusted cortical thickness values, with Pearson’s correlation coefficient as the association measure, were obtained for each group (i.e. CIS patients, HCs, CIS patients with early conversion to MS and CIS patients without early conversion to MS), at baseline and at 12-month follow-up, using MATLAB (The MathWorks, Inc., Natick, Massachusetts, USA) (Fig. 4).

  4. 4)

    Since we aimed to build weighted networks reflecting the strength of the correlation regardless of whether there was a positive or negative association8, we obtained correlation matrices with the absolute value of Pearson’s correlation coefficients.

Figure 5
figure 5

Correlation matrices. This figure shows the correlation matrices for each one of the groups (all CIS, HCs, CIS with early conversion, CIS without early clinical conversion), at baseline and at follow-up. From these correlation matrices, we extracted the absolute values of the correlations and built the weighted graphs, used to obtain network metrics. Abbreviations: CIS: clinically isolated syndrome; HCs: healthy controls; MS: multiple sclerosis.

Obtaining network topological metrics: For each weighted matrix, considered as the numerical representation of a network with 68 nodes (i.e. 68 cortical areas) and edges indicating the strength of the connection between two cortical areas, we obtained the following network topological metrics at the network level. That is, for each metric, we obtained a value for each network (i.e. for each group and time point). We used the freely available Brain Connectivity Toolbox43 (https://sites.google.com/site/bctnet/) in MATLAB. The metrics were divided into three groups:

  1. 1.

    Metrics of nodal connectivity: mean nodal strength and mean clustering coefficient

    These metrics describe the connectivity of each node in the network. Mean nodal strength is the average, across all nodes, of the nodal strength, defined as the sum of the correlation coefficients of the edges emerging from a given node. Thus, a node with a high strength indicates that the node is very well connected, and a network with high mean nodal strength indicates a very well-connected network. Importantly, in this context, highly connected networks indicate a high degree of similarity across cortical regions.

    Nodal clustering coefficient reflects the connectivity among the neighbours of a given node, and can be understood as the probability that each pair of nodes that are connected to a given node are also connected among themselves. It can be defined as in [1]:

    $${\rm{Clustering}}\,{\rm{coefficient}}\,{\rm{of}}\,{\rm{node}}\,i=\frac{Number\,of\,pairs\,of\,node\,i\mbox{'}\,s\,neighbours\,that\,are\,connected\,}{Number\,of\,pairs\,of\,node\,i\mbox{'}\,s\,neighbours}$$
    (1)

    Therefore, the mean clustering coefficient is the average clustering coefficient across all nodes of the network44. In this context, a network with a high mean clustering coefficient would indicate that cortical regions have, in general, strong similarities in terms of cortical thickness with neighbour regions.

  2. 2.

    Metrics of network integration: mean shortest path, global efficiency and local efficiency

    These metrics describe the ability of the network to efficiently integrate information across nodes. Smaller values of mean shortest path infer more efficient information transfer between nodes and greater information integration within the network45. Mean shortest path (or characteristic path length, L) is the average of the shortest path lengths between all pairs of nodes in the network43, and can be defined as in [2]:

    $${\rm{L}}=\frac{1}{N}\sum _{i}{l}_{i}=\frac{1}{N(N-1)}\,\sum _{i\ne j}{l}_{ij},$$
    (2)

    where N is number of nodes in the network; li is the average shortest path from node i to all other nodes; and lij is the shortest path length from node i to node j.

    Global efficiency (Eglob) is the reciprocal of the harmonic mean of the shortest path lengths of the entire network45, and is defined as in [3]:

    $${E}_{glob}=\frac{1}{L^{\prime} }=\frac{1}{N(N-1)}\sum _{i\ne j}\frac{1}{{l}_{ij}},$$
    (3)

    where L′ is the harmonic mean of the mean shortest path; N is number of nodes in the network; and lij is the shortest path length from node i to node j.

    Local efficiency, which is a node-level metric of integration of information, is the reciprocal of the harmonic mean of the shortest path lengths of the subgraph (i.e. subnetwork) defined by the neighbours of a given node45. Local efficiency of node i (Eloc (i)), is defined as in [4]:

    $${E}_{loc}(i)=\frac{1}{{N}_{{G}_{i}}({N}_{{G}_{i}}-1)}\,\sum _{j,h\in {G}_{i}}\frac{1}{{l}_{jh}},$$
    (4)

    where Gi is the subnetwork including all nodes that are neighbours of node i; \({N}_{{G}_{i}}\)is the number of nodes belonging to subnetwork Gi; and ljh is the shortest path length from node j to node h belonging to subnetwork Gi.

    Higher global and local efficiency values indicate greater ability to integrate information, globally and locally, respectively. Mean local efficiency is the across-node average of local efficiency values of the nodes of the network43.

  3. 3.

    Metric of network organisation: modularity coefficient

    Modularity coefficient (ModC) assesses how groups of nodes are clustered in local groups. That is, ModC, describes how well a network can be subdivided into groups of nodes (i.e. modules), highly correlated among each other, creating small communities inside the network.

Statistical analysis

Descriptive and structural imaging data analyses

Descriptive statistics are reported as mean (standard deviation [SD]) or median (range), depending on the nature of the variable.

We assessed changes in mean cortical thickness over one year as part of the preliminary analysis using mixed-effects models, where ‘mean cortical thickness’ was the dependent variable and ‘time’, a ‘binary group indicator’ (i.e. patient/control) and an interaction term such as ‘time X group indicator’ were the main explanatory variables. Whenever the interaction term was significant, it was assumed that the rate of change in cortical thickness differed between the two groups. All models were adjusted for age, gender and lesion load at baseline (HCs were assigned lesion load equal to zero). For the final analyses, total intracranial volume was not included among the covariates: it was not significant in any of the models and its introduction did not change the direction or size of the effects of the other covariates. Mixed-effects models allowed for repeated measures within the same subject17,46 and their random structure consisted of a random intercept (for each subject) and random slope (for time, which was nested in ‘subject’). Whenever residuals from mixed-effects regression models showed deviation from normality, 95% confidence intervals (95% CI) and p-values were obtained using bias-corrected non-parametric bootstrap with 1000 replicates.

All these statistical analyses were carried out with Stata 14.1.

SCN analysis in all CIS patients versus healthy controls

Since all network parameters were obtained at the network level, i.e. we had a value for each network, classical statistical approaches could not be used to compute the 95% confidence intervals (95% CI) of the metric values at baseline and their changes over time, so we employed a bootstrap-based approach with these steps:

  1. 1.

    For each network metric, we initially obtained our baseline value and 12-month follow-up value. From the subtraction of these two, we obtained the change in that network. These were our original estimates for baseline, 12-month follow-up and change values.

  2. 2.

    For the groups of CIS patients and HCs, we then created 1000 bootstrap samples (i.e. random samples of n subjects chosen –allowing sampling with replacement– from each group, being n = sample size of the original group).

  3. 3.

    In each bootstrap sample, covariance matrices for baseline and follow-up values were computed using exactly the same methodology as the one used with the original group.

  4. 4.

    From each covariance matrix, network topological measures were computed, for baseline and 12-month follow-up. The difference between baseline and follow-up metrics was also obtained for each bootstrap sample. Importantly, for this step (i.e. to compute the difference between baseline and follow-up), the same subset of bootstrap samples (subset of subjects) was used. These were our bootstrap estimates for baseline, 12-month follow-up and change values.

  5. 5.

    Once all bootstrap samples were analysed, distributions of bootstrap estimates were obtained. These were then compared against the original estimates and the bootstrap-based 95% CI were obtained for each metric and each group, at baseline and follow-up. Bootstrap-based 95% CI were also obtained for each change in the network metric. For the estimation of the change, whenever the 95% CI did not include the null value (=0), we assumed the change was significant over time. To compute differences between two groups, a 1000 × 1000 matrix was obtained, where all possible combinations of differences between the bootstrap estimates (for baseline, follow-up or change parameters) for the two groups were calculated. Original differences between the two groups that we wanted to compare were then plotted against such 1,000,000-item distribution of combinations and combined-bootstrap-based 95% CI were obtained for these original differences. Whenever these combined-bootstrap-based 95% CIs did not include the null value (=0), we inferred significant differences between the two groups for a particular baseline, follow-up or change parameter. All these analyses were carried out with MATLAB.

SCN analysis in CIS patients who convert versus those who do not convert to MS during follow up

The same steps described above were also applied to assess the 95% CI of baseline, follow-up and change values for the network metrics in the two groups of CIS patients with and without early conversion to MS. Different behaviours between the two groups in a given network parameter were interpreted as this metric being potentially relevant from a clinical point of view.

A post-hoc analysis comparing the networks of patients who were diagnosed with McDonald MS (i.e. either they had a second attack or fulfilled the criteria for dissemination in time and space during the follow-up) was also performed (results shown in Supplementary Tables 4 and 5).

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.