Consciousness & Brain Functional Complexity in Propofol Anaesthesia

Varley, Thomas F.; Luppi, Andrea I.; Pappas, Ioannis; Naci, Lorina; Adapa, Ram; Owen, Adrian M.; Menon, David K.; Stamatakis, Emmanuel A.

doi:10.1038/s41598-020-57695-3

Download PDF

Article
Open access
Published: 23 January 2020

Consciousness & Brain Functional Complexity in Propofol Anaesthesia

Scientific Reports volume 10, Article number: 1018 (2020) Cite this article

7177 Accesses
37 Citations
17 Altmetric
Metrics details

Subjects

Abstract

The brain is possibly the most complex system known to mankind, and its complexity has been called upon to explain the emergence of consciousness. However, complexity has been defined in many ways by multiple different fields: here, we investigate measures of algorithmic and process complexity in both the temporal and topological domains, testing them on functional MRI BOLD signal data obtained from individuals undergoing various levels of sedation with the anaesthetic agent propofol, replicating our results in two separate datasets. We demonstrate that the various measures are differently able to discriminate between levels of sedation, with temporal measures showing higher sensitivity. Further, we show that all measures are strongly related to a single underlying construct explaining most of the variance, as assessed by Principal Component Analysis, which we interpret as a measure of “overall complexity” of our data. This overall complexity was also able to discriminate between levels of sedation and serum concentrations of propofol, supporting the hypothesis that consciousness is related to complexity - independent of how the latter is measured.

Consciousness-specific dynamic interactions of brain integration and functional diversity

Article Open access 10 October 2019

Critical dynamics in spontaneous EEG predict anesthetic-induced loss of consciousness and perturbational complexity

Article Open access 05 August 2024

Complexity and 1/f slope jointly reflect brain states

Article Open access 07 December 2023

Introduction

The science of complex systems has gained increasing prominence in the 21st century. It combines the reductionist ideal of science, with the notion of emergence, whereby high-level phenomena can result from the interactions of simple constituent parts, confirming Aristotle’s saying that “the whole is more than the sum of its parts”¹. However, complexity science is also a discipline still in its infancy. In particular, due to its appealing and apparently intuitive nature, the notion of complexity has remained relatively ill-defined. The interdisciplinary nature of this science has resulted in different fields applying the term complexity to multiple quantities, variously measured. Complexity is perhaps best understood as the negation of simplicity. A system exhibits complex behaviour when it is not uniform, stereotyped, or predictable. However, there is a key assumption that this is not sufficient: complexity must emerge from the underlying orderly interactions of a system’s components, about which its behaviour must provide information – in other words, its unpredictability must be more than mere randomness, but rather the result of interesting behaviours emerging. Thus, a complex system lies between complete order – such as the perfectly predictable regularity of a crystal – and complete disorder, as exhibited for instance by the random motion of molecules of a gas. Historically, complex, self-organizing systems have been separated from both “simple” systems, and systems that display “disorganized complexity”².

Complexity can be identified in more than one dimension of the same system, too. It may be due to the structure of the interactions between components, such as the connections in a social or biological network. Or it may only become apparent over time, as when it is applied to signals and temporal patterns. Furthermore, there are different ways in which something can be said to be complex, reflected in the different ways that have been developed to estimate complexity. On the one hand, methods from algorithmic information theory such as Shannon entropy and Lempel-Ziv compressibility^3,4 emphasise unpredictability as the key property for complexity. One downside of such approach, however, is that they would treat a purely random sequence as maximally complex. Alternatively, methods from the physics of dynamical systems focus on the aspect of interactions in the process – whether between the system’s elements (e.g. synchronisability⁵), between its present and past states (e.g. Hurst exponent⁶), or between different scales⁷. In this work, we aim to explore the relation between algorithmic and process measures of complexity, in both the topological and temporal dimensions.

We choose to test these measures on a paradigmatic complex system: the human brain. Not only is the brain the source of humans’ widely diverse range of behaviours and accomplishments, which is itself suggestive of a highly complex underlying organisation; its structure is also that of a complex network of sub-networks, in turn made of multiple kinds of neurons obeying nontrivial plasticity rules for their interactions. For these reasons, it has been proposed that the brain’s complexity may explain another unique property it possesses: consciousness. Recent scientific theories of consciousness have emphasised, in one way or another, the brain’s complexity as a crucial requirement for consciousness^8,9,10,11. Anaesthetic drugs such as the GABA-ergic agonist propofol provide a way to control and reversibly modulate the brain’s state of consciousness. Its complexity, in various aspects, may then be assessed based on signals from noninvasive neuroimaging techniques. Previous research using electrophysiological imaging methods such as EEG has found that the complexity of brain activity changes with alteration of consciousness, decreasing under propofol sedation^12,13,14, increasing under the influence of psychedelic drugs like LSD or ketamine¹⁵, and decreasing during sleep¹⁶ or in patients with disorders of consciousness¹⁷. Nonlinear analysis of BOLD signals from functional MRI is a less explored area, largely because BOLD timeseries tend to comprise a limited number of timepoints and have a much more restricted frequency domain; however, there is fast-growing interest in this kind of analysis, owing to recent studies suggesting that this may be a rich field to explore^18,19,20. Moreover, functional MRI (fMRI) has the advantage of providing high spatial resolution, thus allowing for estimation of the brain’s network properties in greater detail than afforded by other methods such as EEG.

Here, we chose to evaluate whole-brain measures of algorithmic and process complexity applied to the temporal and topological (network) dimensions, derived from fMRI blood-oxygen-level-dependent (BOLD) signals of volunteers undergoing sedation with propofol. Our aim was twofold: first, to investigate the relationship between the different measures of complexity as applied to human fMRI data, with the hypothesis that results should be consistent across measures; secondly, we sought to provide a comprehensive investigation of the hypothesis that whole-brain complexity of the human brain measured from fMRI BOLD data is reduced when consciousness is lost as a result of anaesthetic-induced unconsciousness. We also replicated our results in an independent dataset of propofol anaesthesia, in order to demonstrate their robustness.

Results

Temporal Algorithmic Complexity

Lempel-Ziv Compressibility

The first measure of algorithmic complexity we used was normalised Lempel-Ziv compressibility^14,16 of BOLD signals. We found significant differences between conditions in both Datasets A and B. In Dataset A, Kruskal-Wallis Analysis of Variance found significant differences between all three conditions (H(10.57), p = 0.005), and post-hoc analysis with Wilcoxon Signed-Rank test found significant differences between the Awake and Mild conditions (W(21), p = 0.05), Awake and Moderate conditions (W(9), p = 0.006), and Mild and Moderate conditions (W(4), p = 0.002). In Dataset B, the Wilcoxon test found significant difference between the Awake and Deep conditions (H(19), p = 0.011). In both datasets, the Awake condition had the highest complexity, and as the depth of sedation increased, the associated LZC decreased. In Dataset A, the transition from Awake to Moderate showed $\Delta =-\,0.029\pm 0.027$, and in Dataset B, the transition from Awake to Deep showed Δ = −0.029 ± 0.039. We note that these two results are remarkably similar, although this is likely a coincidence. For full results from Dataset A, see Table 1, and for Dataset B, Table 2. In the propofol sedation conditions of Dataset A (Mild and Moderate), we found significant negative correlations between LZC and serum concentrations of propofol (r = −0.55, p = 0.002).

Table 1 The values for all the complexity measures, temporal and spatial, for Dataset A.

Full size table

Table 2 The values for all the complexity measures, temporal and spatial, for Dataset B.

Full size table

These results are consistent with previous findings that Lempel-Ziv compressibility of spontaneous brain activity is discriminative of level of consciousness in humans^14,16 and animals²¹.

Of all the time-series measures described, the LZC algorithm described here is distinct in that it communicates information about the spatial complexity as well as the temporal complexity. This is because, unlike other measures like Sample Entropy or Higuchi Fractal Dimension which are calculated on 234 individual time-series and then averaged, LZC is calculated on an entire dataset, which has been flattened column-wise, as was done in^14,15,16, by “stacking” each column on top of the next, resulting in a one-dimensional vector where the first 234 elements are the first column, the second 234 elements are the second column, etc. This means that the vector V (see Methods section) can be divided into 234 segments where every entry corresponds to the coarse activation of a distinct brain region at the same time. The result is that each entry in the dictionary D created by the Lempel-Ziv algorithm corresponds, not to a series of samples from a single ROI, but rather a distribution of cortical regions that are “on” or “off.”

Sample Entropy

We found significant decreases in the Sample Entropy of BOLD signals under anaesthesia in both Datasets A and B. In Dataset A, Kruskal-Wallis Analysis of Variance found a significant difference between all three conditions (H(12.94), p = 0.002) and post-hoc analysis with the Wilcoxon Signed-Rank test found significant differences between the Awake and Moderate conditions (W(6), p = 0.004), and Mild versus Moderate conditions (W(8), p = 0.005), but not the Awake versus Mild conditions. In Dataset B there was a significant difference between the Awake and Deep conditions (W(21), p = 0.015). As with the LZC analysis, the Awake condition had the highest Sample Entropy in both Datasets A and B, with the mean value decreasing with increasing sedation. In Dataset A, we observed $\Delta =0.036\pm 0.035$ from Awake to Moderate, and in Dataset B we observed $\Delta =-\,0.023\pm 0.031$. In the Mild and Moderate conditions of Dataset A, we found a significant negative correlation between serum concentration of propofol and Sample Entropy of BOLD signals (r = −0.53, p = 0.003).

These results are consistent with both the LZC results reported above and the findings of Ferenets et al., (2007), who found that Sample Entropy decreased with increasing sedation in much the same way that LZC does.

PCA of BOLD Signals

As with LZC, the PCA-based measure of BOLD signal complexity returns a measure of how compressible the set of data are as a proxy for complexity, by identifying the number of components required to explain a fixed proportion of the variance in the data. A larger number of components to explain the same amount of variance would indicate less compressibility of the data. Thus, we hypothesised that as level of sedation increased, so would the compressibility of BOLD signals, as measured by the number of components required to explain 95% of the variance. In Dataset A, Kruskal-Wallis Analysis of Variance found significant differences between all three conditions (H(8.13), p = 0.017), and post-hoc testing found significant differences between all three sets of conditions: Awake vs. Mild (W(9), p = 0.03), Awake vs. Moderate (W(6), p = 0.016), and Mild vs. Moderate (W(11), p = 0.048). In Dataset B, we found a significant difference between Awake and Deep (W(4), p = 0.002). As before, there was a consistent pattern of increasing mean compressibility (and a consequent decreasing number of required components) as sedation increased. In Dataset A, $\Delta =-\,2.214\pm 2.73$ from Awake to Moderate, and in Dataset B, $\Delta =-\,5.188\pm 4.68$. Here, the Δ is negative because the number of components decreased between the Awake and sedated conditions, and is non-integer because it is the average over all subjects in the datasets. Of all the measures of BOLD signal compressibility, this was the only measure that did not significantly correlate with serum propofol concentration in the Mild and Moderate conditions in Dataset A.

As with LZC and the SampEn, these results indicate that as propofol-induced sedation increases, the algorithmic complexity of BOLD signals decreases. All measures of complexity discussed so far support each other, despite being a variety of linear and non-linear algorithms.

Temporal Process Complexity

Hurst Exponent

The Hurst Exponent was the only measure that we hypothesised would increase as consciousness was lost, rather than decrease, since as a signal becomes more predictable, its Hurst Exponent tends towards unity⁶. In both Datasets A and B we found significant differences between conditions. In Dataset A, Kruskal-Wallis Analysis of Variance found an omnibus difference (H(9.11), p = 0.01), and post-hoc testing found significant differences between Awake and Mild (W(16), p = 0.022), Awake and Moderate (W(8), p = 0.005), and Mild and Moderate (W(17), p = 0.026). In Dataset B, we found significant differences between the Awake and Deep conditions (W(26), p = 0.02). Unlike the previous two metrics, and as we expected, we found a relative increase in the Hurst Exponent as sedation increased: in Dataset A, we found $\Delta =0.014\pm 0.014$ from Awake to Moderate sedation, and in Dataset B we found $\Delta =0.01\pm 0.016$ from Awake to Deep sedation. In Dataset A, we found a significant correlation between serum concentration of propofol and Hurst Exponent in the Mild and Moderate sedation conditions (r = 0.393, p = 0.039).

This is consistent with our initial hypothesis that as sedation increased and consciousness was lost, the BOLD signals would become more predictable, as measured by an increasing Hurst Exponent.

Higuchi Fractal Dimension

The Higuchi Fractal Dimension was one of the least sensitive measures of BOLD signal complexity sampled here. In Dataset A, Kruskal-Wallis Analysis of Variance found a significant difference between all three conditions (H(8.27), p = 0.016), and post-hoc analysis found significant differences between the Awake and Moderate conditions (W(15), p = 0.019) and the Mild and Moderate conditions (W(17), p = 0.026), but not the Awake and Mild conditions. In Dataset B we found a significant difference between the Awake and Deep conditions (W(23), p = 0.02). As with LZC and Sample Entropy, the Awake condition had the highest mean fractal dimension in both samples, which went down as sedation increased: in Dataset A $\Delta =-\,0.024\pm 0.032$ from Awake to Moderate and in Dataset B, $\Delta =-\,0.018\pm 0.027$ from Awake to Deep. Surprisingly, the Higuchi Fractal dimension showed a very strong negative correlation with serum propofol concentration in the Mild and Moderate conditions of Dataset A (r = −0.614, p = 0.0005).

The finding that Higuchi Fractal dimension was relatively less able to discriminate between level of consciousness than LZC or Sample Entropy but more predictive of serum propofol concentration is interesting. While it is hard to come up with a definitive interpretation, it may suggest that there is some variable factor in individuals that makes their level of consciousness more or less resistant to the changes in brain activity (as measured by Higuchi Fractal Dimension) induced by propofol, or that the plasma concentration data offer more resolution, extending beyond the three artificially imposed bins of Awake, Mild and Moderate sedation.

Topological Algorithmic Complexity

Graph Lempel-Ziv Compressibility

The final metric we tested, and the second measure of topological complexity, was the compressibility of functional connectivity adjacency matrices using the Lempel-Ziv algorithm. This was the weakest of all the measures explored: the only significant difference was in Dataset A, between the Awake and Moderate conditions (W(11), p = 0.009), although in Dataset B there was a similar trend that approached, but did not reach significance (W(32), p = 0.06). The general trend of Awake having the highest value which decreased under increasing sedation was conserved (although note the large standard deviations): in Dataset A $\Delta =-\,2885.71\pm 4937.34$ and in Dataset B $\Delta =-\,4923.44\pm 8967.29$. There was no significant correlation between graph compressibility and serum propofol concentrations in Dataset A.

While this is clearly the weakest result, in the context of the others, we still find its success at discriminating between the Awake and Moderate conditions of Dataset A intriguing, and suspect that in a larger set of data it may have more discriminative power. The relationship between consciousness and network compressibility may not be as direct as when performing analysis such as LZC on BOLD signals, but these results suggest this is an area worth exploring.

Topological Process Complexity

Algebraic Connectivity

Our first of two measures of functional network complexity is algebraic connectivity, which returns information about the robustness of the network to removal of elements²². In Dataset A, Kruskal-Wallis analysis found a significant difference in algebraic connectivity between all three conditions (H(9.654), p = 0.008). Post-hoc analysis found significant differences between the Awake and Moderate conditions (W(12), p = 0.011) and the Mild and Moderate conditions (W(15), p = 0.019), but not the Awake versus Mild conditions. In Dataset B, we found a significant difference between the Awake and Deep conditions (W(23), p = 0.02). As before, in Datasets A and B the Awake condition had the highest mean algebraic connectivity, with mean values dropping as sedation increased. In Dataset A, $\Delta =-\,107.67\pm 116.98$ from Awake to Moderate, while in Dataset B, −$906.09\pm 1235.07$. Despite the ability of algebraic connectivity to discriminate between conditions, there was no significant correlation with serum propofol concentration in the Mild and Moderate conditions of Dataset A.

These results suggest that, while graph theoretical measures may be predictive of level of consciousness in propofol anaesthesia, algebraic connectivity in particular seems to lack the discriminative power of direct analysis on BOLD signals. Nevertheless, these results are promising as they show that the topological complexity of functional brain networks can communicate information relevant to the level of consciousness of an individual.

Higher Order Analysis of Overall Complexity

Every metric, when correlated against every other metric, showed a highly significant correlation (see Fig. 1), all of which were significant with the sole exception of the correlation between the number of PCA components required to explain the majority of the variance and the Hurst exponent in Dataset A. We had hypothesised that, if the different kinds of complexity explored here (algorithmic and process-based, in both the temporal and topological dimensions) all were ways to quantify an underlying construct of “overall complexity”, then there should be a single component that explains the majority of the variance of the results. In Dataset A, we found that the principal component explained 67.07% of the variance in the set of results and in Dataset B the principal component explained 71.05% of the variance of the results. In both datasets, this component correlated extremely highly with each metric: in Dataset A it correlated most highly with LZC (r = −0.947, p ≤ 1 × 10⁻⁵), followed by Sample Entropy (r = −0.929, p ≤ 1 × 10⁻⁵). In Dataset B, these two were also the most highly correlated with the principal component, although the order was flipped, with Sample Entropy having the highest correlation (r = −0.95, p ≤ 1 × 10⁻⁵), followed by LZC (r = −0.932, p ≤ 1 × 10⁻⁵). When broken down by condition, in both Datasets, the principal component was able to discriminate between states of consciousness: in Dataset A the Kruskal-Wallis test found a significant difference between all three conditions (H(12.048, p = 0.002), and post-hoc testing found significant differences between the Awake and Moderate conditions (W(8), p = 0.005), and the Mild and Moderate conditions (W(3), p = 0.001) but not the Awake and Mild conditions. In Dataset B we found a significant difference between the Awake and Deep conditions (W(8), p = 0.002). In the Mild and Moderate conditions of Dataset A, the principal component significantly correlated with serum concentrations of propofol (r = 0.531, p = 0.004) (see Fig. 2). Thus, the principal component derived from multiple specific measures of complexity can be related to states of consciousness in the human brain, and may be identified with the “overall complexity” of the dataset. For visualisation of these results, see Fig. 3.

Discussion

In the present work, we have investigated measures of complexity from algorithmic information theory and the physics of dynamical systems, as they apply to the temporal and topological (network) dimensions of functional MRI brain data from individuals under different levels of propofol sedation. Two main insights can be derived from our results. The first is that, at least in the context of the human brain, different measures purporting to quantify “complexity” are indeed related to some underlying common construct, regardless of the dimension along which they measure complexity, or the aspect of complexity that they measure. This provides much-needed validation to the idea that a dataset - and the system from which it derives - can be considered complex tout court, rather than just being complex in a specific dimension, and according to a specific way of assessing complexity. We term this the “overall complexity” of the system or dataset. In turn, this suggests that it is appropriate to use the term “complexity” for the various specific measures, because there does seem to exist a common underlying property of the data that they tap into. In particular, we have demonstrated that the complexity of the human brain activity, as inferred from fMRI BOLD signals, is modulated by one’s state of consciousness - supporting previous results from macaque electrocorticography data²³. This was observed both with the individual measures - validating and extending previous results - and, most importantly, with the underlying construct of overall complexity, which demonstrates its validity as a construct. The latter is also reinforced by the fact that we were able to replicate this finding with a separate dataset.

Secondly, it is important to observe that different complexity measures, though correlated to each other and related to the same underlying construct of overall complexity, are nevertheless sensitive to different aspects of the data. In particular, measures operating along the temporal dimension appeared especially sensitive at discriminating between levels of sedation; conversely, topological measures failed to discriminate between Awake and Mild conditions in Dataset A, and also did not correlate with propofol serum levels. This suggests that the temporal dimension of the human brain’s complexity, as derived from BOLD signal timeseries (despite their limited temporal resolution compared to EEG), may be especially vulnerable to loss of consciousness, at least as it is induced by the GABA-ergic agent propofol. Further work may seek to identify whether this effect is uniform across cortical regions, or whether specific areas’ timeseries are more largely affected by propofol than others. This represents a novel insight regarding the ways in which anaesthetic drugs such as propofol intervene on the brain to cause unconsciousness. Additionally, it would be worth exploring whether this observation of different sensitivity of temporal and topological measures of complexity is drug-specific, or if instead it is a generalisable feature of how the brain loses consciousness. Thus, one future direction of research is to apply these same metrics to states of consciousness induced by different anaesthetic agents, whose molecular mechanisms of action can vary widely. Disorders of consciousness (DOC) due to severe brain injury may also represent a crucial future step for research: unlike anaesthetics, DOC involve changes in the physical structure of the brain, which is bound to impact the topology of brain networks. Investigating how this impacts the relation between different measures and dimensions of complexity will provide further understanding into the relation between complexity and consciousness in the brain. Additionally, as MRI is already a routine part of care for DOC patients, algorithms such as those explored here might be helpful in determining the presence or absence of consciousness in ambiguous states such as minimally conscious state.

Importantly, our results also show that, despite the relative temporal paucity of information in BOLD signals, these signals carry sufficient information to discriminate between states of consciousness. While preliminary, these findings suggest that the process complexity of individual BOLD signals is at least partially re-encoded as topological complexity when forming functional connectivity networks. One possible avenue of future work is to explore the parameters under which this conservation of complexity is maximised (different similarity functions, different thresholding procedures, etc), in order to increase the sensitivity of these measures. Crucially, even higher discriminative power may be achieved by applying the same analyses to measures with higher temporal information, such as EEG or ECoG²³, which may then improve anaesthetists’ ability to detect unwanted residual consciousness in patients, thereby avoiding the rare but extremely distressing condition known as intraoperative awareness²⁴.

Nevertheless, our work also presents a number of limitations, and these should be borne in mind when evaluating the present results. We do not claim to have exhaustively searched all measures of algorithmic or process complexity: as already evidenced, there are a vast number of measures to choose from (and many measures themselves have multiple implementations, eg. Lempel-Ziv compressibility features in multiple compression algorithms including Lempel-Ziv-Welch compressibility²⁵ and LZ77/8²⁶, as well as being a component of the perturbational complexity index¹⁷). Other measures of algorithmic complexity might include permutation entropy²⁷ or Shannon entropy^28,29. Alternative measures of process complexity might be multiscale sample entropy³⁰ or Lyapunov exponent³¹. When selecting which measures to include in this analysis, we included several criteria we hoped our measures would satisfy: the first is that they had been used in previous electrophysiological or fMRI studies of consciousness. The second criteria was that they should be relatively accessible theoretically and computationally. Finally, they shouldn’t require excessively long time-series and thus be amenable to BOLD signals (which tend to be $\ll $500 samples, excluding some measures like multiscale entropy).

Furthermore, as already mentioned the temporal information available in the BOLD signal is limited, and it is also not a direct measure of neural activity. More generally, the optimal way to construct brain graphs from BOLD signal data is an area of active investigation; although the approach we have taken here is among the most common in the literature³², alternative methods exist for defining nodes (for instance by using different parcellations³³, or components derived from Independent Components Analysis) and for defining edges, such as by using partial correlation³⁴, wavelet coherence³⁵, or normalised mutual information instead of Pearson correlation^36,37. In particular, our analysis pipeline involved removing the negative correlations between brain regions, before the network analysis. Removing negative correlations is the most commonly adopted approach in network neuroscience³² especially since their inclusion has been found to decrease reproducibility of brain network properties^38,39. However, the importance of negative correlations in the brain has been demonstrated for both waking cognition⁴⁰ and consciousness, with reductions in their prevalence observed during anaesthesia and other states of unconsciousness^41,42,43; thus, ignoring them may have different effects on conscious versus unconscious brain networks, which could explain the reduced sensitivity of topological measures. In future work, it would be worthwhile to explore other methods of constructing functional connectivity networks that do not return negative edges at all such as wavelet coherence³⁵ or normalised mutual information^36,37, or the soft thresholding approach proposed by Schwarz and McGonigle⁴⁴, thus avoiding the problem entirely.

Finally, in Dataset A the state of consciousness was determined based on the estimated propofol concentration, rather than behaviour, so that different individuals’ susceptibility to the drug may have led to different levels of sedation, despite the same level of propofol. However, this concern is mitigated by the replication of our results in Dataset B, where sedation was deeper and it was assessed behaviourally, so that all individuals met the same criteria. Finally, the measures of complexity explored here are but a subset of those that have been proposed over the years in the literature. Future research could benefit from expanding this repertoire, for instance including estimates of Phi, a measure of integrated information derived from neural complexity¹¹, which has been proposed to quantify a system’s consciousness^10,45.

Conclusion

We have investigated measures of algorithmic and process complexity of fMRI BOLD signal in both the temporal and topological dimensions, at various levels of consciousness induced by propofol sedation. Our results demonstrate that complexity measures are differently able to discriminate between levels of sedation, with temporal measures showing higher sensitivity. Additionally, all measures were strongly correlated, and most of the variance could be explained by a single underlying construct, which may be interpreted as a more general quantification of complexity, and which also proved capable of discriminating between levels of sedation, demonstrating a relation between consciousness and “complexity”, broadly defined, with a clear biological grounding given by the relation to propofol serum concentration. The finding that complexity measures formalized in very different ways are similarly useful suggests a deeper relationship between dynamical and algorithmic complexity, and the capacity of the human brain to support consciousness. Finally, these results provide strong evidence that many non-linear time-series analysis techniques that have previously been restricted to M/EEG imaging modalities can also be successfully applied to detect alterations of complexity in BOLD signals, expanding the repertoire of available fMRI image and time-series analyses.

Methods

Data Acquisition and Preprocessing

Ethics Statements

All data were collected in accordance with the relevant guidelines and under the oversight of the relevant ethical bodies. The specifics for ethical approvals are detailed below.

Dataset A

Ethical approval for these studies was obtained from the Cambridgeshire 2 Regional Ethics Committee, and all subjects gave informed consent to participate in the study. Twenty-five healthy volunteer subjects were recruited for scanning. The acquisition procedures are described in detail by Stamatakis et al.⁴⁶: MRI data were acquired on a Siemens Trio 3T scanner (WBIC, Cambridge). Each functional BOLD volume consisted of 32 interleaved, descending, oblique axial slices, 3 mm thick with interslice gap of 0.75 mm and in-plane resolution of 3 mm, field of view = 192 × 192 mm, repetition time = 2 s, acquisition time = 2 s, time echo = 30 ms, and flip angle 78. We also acquired T1-weighted structural images at 1 mm isotropic resolution in the sagittal plane, using an MPRAGE sequence with TR = 2250 ms, TI = 900 ms, TE = 2.99 ms and flip angle = 9 degrees, for localisation purposes. Of the 25 healthy subjects, 14 were ultimately retained: the rest were excluded, either because of missing scans (n = 2), or due of excessive motion in the scanner (n = 9, 5 mm maximum motion threshold).

Propofol Sedation

Propofol was administered intravenously as a “target controlled infusion” (plasma concentration mode), using an Alaris PK infusion pump (Carefusion, Basingstoke, UK). Three target plasma levels were used - no drug (baseline), 0.6 mg/ml (mild sedation) and 1.2 mg/ml (moderate sedation). A period of 10 min was allowed for equilibration of plasma and effect-site propofol concentrations. Blood samples were drawn towards the end of each titration period and before the plasma target was altered, to assess plasma propofol levels. In total, 6 blood samples were drawn during the study. The mean (SD) measured plasma propofol concentration was 304.8 (141.1) ng/ml during light sedation, 723.3 (320.5) ng/ml during moderate sedation and 275.8 (75.42) ng/ml during recovery. Mean (SD) total mass of propofol administered was 210.15 (33.17) mg, equivalent to 3.0 (0.47) mg/kg. The level of sedation was assessed verbally immediately before and after each of the scanning runs. The three conditions from this dataset are referred to as Awake, Mild and Moderate sedation respectively.

Two senior anesthetists were present during scanning sessions and observed the subjects throughout the study from the MRI control room and on a video link that showed the subject in the scanner. Electrocardiography and pulse oximetry were performed continuously, and measurements of heart rate, noninvasive blood pressure, and oxygen saturation were recorded at regular intervals.

Dataset B

These data were provided by the Brain and Mind Institute, Department of Psychology, The University of Western Ontario. All scans were collected at the Robarts Research Institute in London, Ontario (Canada) between May and November 2014. A total of 19 (18–40 years; 13 males) healthy, right- handed, native English speakers, with no history of neurological disorders were recruited. Each volunteer provided written informed consent, following relevant ethical guidelines, and received monetary compensation for their time. The Health Sciences Research Ethics Board and Psychology Research Ethics Board of Western University (Ontario, Canada) ethically approved this study. Due to equipment malfunction or physiological impediments to anaesthesia in the scanner, data from three participants (1 male) were excluded from analyses, leaving 16.

Scanning was performed using a 3 Tesla Siemens Tim Trio system with a 32-channel head coil, at the Robarts Research Institute in London, Ontario, Canada. Participants lay supine in the scanner. Function echo-planar images (EPI) were acquired (33 slices, voxel size: 3 × 3 × 3 mm; inter-slice gap of 25%, TR = 2000ms, TE = 30 ms, matrix size = 64 × 64, FA = 75 degrees). An anatomical volume was obtained using a T1-weighted 3D MPRAGE sequence (32 channel coil, voxel size: 1 × 1 × 1 mm, TA = 5 min, TE = 4.25 ms, matrix size = 240 × 256, FA = 9 degrees).

Propofol Sedation

Intravenous propofol was administered with a Baxter AS 50 (Singapore). The infusion pump was manually adjusted using step-wise increases to achieve desired levels of sedation of propofol (Ramsay level 5). Concentrations of intra-venous propofol were estimated using the TIVA Trainer (the European Society for Intravenous Aneaesthesia, eurosiva.eu) pharmacokinetic simulation program. If Ramsay level was lower than 5, the concentration was slowly increased by increments of 0.3 μg/ml with repeated assessments of responsiveness between increments to obtain a Ramsay score of 5. Ramsay level 5 was determined as being unresponsive to verbal commands and rousable only by physical stimulus. In contrast to Propofol Dataset A, the two conditions from this dataset are referred to by Awake and Deep sedation respectively, reflecting the substantial increase in sedation depth present in this dataset.

Image Pre-Processing

All of the collected images were preprocessed using the CONN functional connectivity toolbox⁴⁷ (http://www.nitrc.org/projects/conn), using the default pre-processing pipeline, which includes realignment and unwarping (motion estimation and correction), slice-timing correction, outlier detection, structural coregistration and spatial normalisation using standard grey and white matter masks, normalisation to the Montreal Neurological Institute space (MNI), and finally spatial smoothing with a 6 mm full width at half-maximum (FWHM) Gaussian kernel.

Temporal preprocessing included nuisance regression using anatomical CompCor to remove noise attributable to white matter and CSF components from the BOLD signal, as well as subject-specific realignment parameters (three rotations and three translations) and their first-order temporal derivatives⁴⁸. Linear detrending was also applied, as well as band-pass filtering in the default range of [0.008, 0.09] Hz⁴⁰. For a more detailed discussion of the details of the CONN default preprocessing pipeline, see Whitefield-Gabrieli and Nieto-Castanon, 2012.

Complexity of BOLD Signals

To explore the space of different formalisations of complexity, we used algorithms from algorithmic information theory (Lempel-Ziv compressibility, sample entropy, and principal component analysis), as well as from dynamical systems physics (Higuchi fractal dimension, Hurst exponent). Before analysis, the BOLD time-series were transformed by applying the Hilbert transform. The absolute value of the transformed signal was then taken, to remove negative frequencies and ensure that all series were positive. The Hilbert transform was also used to maintain consistency with earlier studies exploring the complexity of brain activity as it relates to consciousness^14,16.

Lempel-Ziv Complexity

The Lempel-Ziv algorithm is a computationally tractable method for quantifying the complexity of a data-series by calculating the number of distinct patterns present in the data. For sufficiently large datasets, it is a useful approximation of Kolmogorov complexity, which is famously uncomputable for most strings³. The method used here is described in Schartner et al., (2015). Briefly: for every ROI in our parcellated brain, a time-series $F(t)$ is binarised according to the following procedure:

$${F}_{B}({t}_{i})=\left\{\begin{array}{ll}1, & {\rm{if}}\,F({t}_{i})\ge mean(F(t))\\ 0, & {\rm{otherwise}}\end{array}\right.$$

The resulting time-series are stacked into a binary matrix $M(X,T)$, where every row corresponds to the time-series ${F}_{B}(t)$ for every ROI $x\in X$ and every column is a time-point $t\in T$. The matrix is then flattened orthogonally to T, resulting in a vector $V$ of length $X\times T$, on which the Lempel-Ziv analysis was performed.

The Lempel-Ziv algorithm creates a dictionary $D$, which is the set of binary patterns that make up $V$ and returns a value $L{Z}_{C}\propto |D|$. For every time-series ${F}_{B}(t)\in X$, a random time-series was created, by shuffling all the entries in $F(t)$. These were stacked into a binary matrix ${M}_{rand}$, with the same dimensions as $M$, however containing only noise. This random matrix was flattened and its $L{Z}_{C}$ value calculated. As the randomness of a string increases, $L{Z}_{C}\to 1$, so this value was used to normalise the “true” value of $L{C}_{C}$, which was divided by $L{Z}_{{C}_{Rand}}$ to ensure all values were within a range $(0,1)$.