Advances in neuroimaging have permitted the non-invasive examination of the human brain in pain. However, a persisting challenge is in the objective differentiation of neuropathic facial pain subtypes, as diagnosis is based on patients’ symptom descriptions. We use artificial intelligence (AI) models with neuroimaging data to distinguish subtypes of neuropathic facial pain and differentiate them from healthy controls. We conducted a retrospective analysis of diffusion tensor and T1-weighted imaging data using random forest and logistic regression AI models on 371 adults with trigeminal pain (265 classical trigeminal neuralgia (CTN), 106 trigeminal neuropathic pain (TNP)) and 108 healthy controls (HC). These models distinguished CTN from HC with up to 95% accuracy, and TNP from HC with up to 91% accuracy. Both classifiers identified gray and white matter-based predictive metrics (gray matter thickness, surface area, and volume; white matter diffusivity metrics) that significantly differed across groups. Classification of TNP and CTN did not show significant accuracy (51%) but highlighted two structures that differed between pain groups—the insula and orbitofrontal cortex. Our work demonstrates that AI models with brain imaging data alone can differentiate neuropathic facial pain subtypes from healthy data and identify regional structural indicates of pain.
Chronic pain conditions affect up to 20% of the population and are among the most common presenting complaints from adults seeking medical care1,2,3. An ongoing challenge for the clinical and research communities is the lack of objective assessment and diagnostic criteria for these conditions3,4. For example, trigeminal neuralgia (TN) is a common neuropathic facial pain that is frequently referred to neurosurgical care, yet its diagnosis can be complex. TN is often diagnosed in the context of neurovascular compression of the trigeminal nerve, however, neurovascular compression does not pre-define pain in TN. In fact, up to 40% of patients diagnosed with TN may not demonstrate this type of vascular compression on MR imaging5. Furthermore, lancinating pain is a common descriptive symptom for trigeminal pain syndromes such as classical trigeminal neuralgia (CTN), persistent idiopathic facial pain, and secondary trigeminal pain syndromes6,7,8. However, these syndromes may have very different etiologies9. The limited precision of clinical diagnosis of pain may result in patients receiving a treatment plan that does not specifically align with their form of chronic facial pain and provides limited relief3,10,11,12.
Extensive studies using functional and structural brain imaging have uncovered objective neuroanatomical signatures of chronic facial pain and consistently point to gray matter volume and thickness abnormalities, including at the insula, anterior cingulate cortex, thalamus, and hippocampus13,14,15. Yet, the impact of these candidate biomarkers remains poorly discerned, and not integrated within clinical practice due to the lack of generalizability and reproducibility16,17. Another factor that hinders the clinical utility of these studies is their incorporation of conventional statistical approaches to delineate structural differences, as the multiple tests problem makes classical statistical interference less tractable18. The use of advanced analysis methods such as artificial intelligence (AI) tools offer a more effective way of analyzing large datasets18,19,20. AI tools have been increasingly sought due to their ability to identify undiscovered patterns that connect complex and heterogeneous forms of neurological disease21,22,23,24. For example, neuroimaging datasets with diffusion tensor (DTI) and T1-weighted imaging information typically contain large numbers of features that cannot be easily analyzed using conventional statistical approaches23,25,26. The inherent complexity of these imaging datasets is highly suitable for AI models, as they can efficiently analyze unstructured data where the number of input variables exceeds the number of subjects18,19.
Multiple forms of AI tools, including unsupervised and supervised learning models, can be used to guide data-driven analyses. Unsupervised learning can identify hidden patterns of unlabeled data while supervised learning can precisely and consistently handle the labelled data, such as predicting the outcome of the surgical intervention, or identifying the diagnosis24,27,28. Both unsupervised and supervised learning AI techniques benefit from large sample sizes (n > 100) and can reveal new foci which may not be elucidated in traditional hypothesis-driven investigations24,28. Importantly, AI techniques frequently use multimodal data, including combining brain imaging with clinical charts data. Since retrospective clinical chart data points are inherently subjective, they can potentially introduce bias and decrease the accuracy of the model29,30. This moves us to inquire whether imaging data alone is sufficiently able to predict pain from non-pain states. If so, are AI tools capable of distinguishing between different types of chronic facial neuropathic pain24.
In the present study, we conduct a retrospective AI-driven analysis of brain imaging (DTI, T1-weighted imaging data) from chronic facial pain patients diagnosed with either CTN or trigeminal neuropathic pain (TNP) and healthy controls. We inquire whether unsupervised and supervised learning AI tools may be sufficiently powerful to distinguish chronic facial pain from healthy controls based on brain imaging data alone, and what are the neuroanatomical signatures that distinguish chronic facial pain subtypes from healthy controls. We hypothesized that unsupervised and supervised learning AI tools applied to neuroimaging data can distinguish CTN and TNP subtypes of chronic facial pain from healthy controls, and that AI tools can identify neuroimaging signatures that distinguish CTN and TNP from healthy control (HC) data.
We used data from 371 adults with chronic facial pain conditions and 108 HC. Facial pain subjects were divided into two groups (CTN and TNP) according to their clinical diagnoses. We identified 265 adults with CTN, and 106 with TNP. The TNP group included patients with idiopathic and secondary symptomatic variants of trigeminal pain (idiopathic facial pain, deafferentation pain and postherpetic neuralgia). No statistically significant differences were found in mean age, duration of pain, pain intensity nor the proportion of left/right-sided pain or proportion of males/females across populations. Complete demographic information and diagnosis codes are shown in Table 1.
Healthy control datasets
In order to confirm the usability of healthy control data from the CamCAN dataset and our internal collection, we conducted two-one-sided t-tests (TOST)31 on imaging data from age- and sex-matched subjects (n = 62 from each dataset) between these two sets. The TOST procedure showed significant p-values for all regions (p < 0.001), rejecting the null hypothesis of TOST (Supplemental Fig. 2, Supplemental Table 2).
In addition, the t-distributed stochastic neighbour embedding visualization of data uncorrected for the pain side showed no specific clustering with regard to the data source (supplemental Fig. 3). Given the similarity of the distributions of two datasets confirmed by two different methods, we conclude that these datasets are suitable for combined AI-driven analyses.
Unsupervised learning and data visualization
We used the principal component analysis (PCA) and t-distributed stochastic neighbour embedding (t-SNE) dimensionality reduction unsupervised learning approaches to visualize our multivariate data. The t-SNE analysis revealed two major clusters, with the primary factor contributing to the separation being the hemisphere affected by TN, along with the corresponding assignment of the ipsilateral hemisphere on the healthy controls’ data. Within each cluster, the separation of healthy controls from subjects with trigeminal pain was observed. Comparison of data uncorrected for the pain laterality showed a similar trend within one large cluster (Supplemental Fig. 3). Both t-SNE and PCA methods were able to differentiate between pain and HC groups. The differentiation was statistically significant in PC1 (p < 0.01). Visualization in PC1 and PC2 did not result in a clear separation of different groups (Supplemental Fig. 4). CTN and TNP appeared relatively similar on both t-SNE visualization of the T1 and DTI scalar data structure, and on PC1 (Fig. 1). This initial lower-dimensional visualization of imaging data informed of clustering of the groups in our multivariate brain imaging dataset.
CTN versus HC
Random forest (RF) and bagged logistic regression (LR) performances were assessed using areas under the receiver operator curve (AUC). CTN versus HC classifier models achieved prediction accuracies of 95% for LR (CTN recall 0.97, HC recall 0.92) and 86% for RF (CTN recall 0.87, HC recall 0.84) (Fig. 2A). Analysis of variance of the brain imaging features revealed that RF prioritized diffusion imaging metrics (FA, RD, AD) for this classification task, ranking these features significantly higher compared to the T1 measures (cortical thickness, surface area, and gray matter volume) (p < 0.05). In contrast, there was no significant difference in the weights of T1 and DTI measures for the LR model of CTN versus HC classification (Fig. 2B).
The top predictors of CTN versus HC differentiation identified with supervised learning models included gray matter thickness at the temporal area (bilaterally), contralateral insula, olfactory cortex and parahippocampal gyrus, white matter diffusivity at the fornix, corpus callosum and cingulum, and gray matter volume reductions at the contralateral inferior pulvinar and medial geniculate nuclei (see Fig. 3). Univariate statistics demonstrated significant differences between TNP and HC for 42 out of the 50 top predictors of LR and all of the top 50 features derived by RF model (p-corrected < 0.05) (Supplemental Table 3).
TNP versus HC
Classification of TNP and HCs demonstrated 84% accuracy for the RF (TNP recall 0.85, HC recall 0.84) and 91% accuracy for the LR (TNP recall 0.93, HC recall 0.88) (Fig. 4A). The accuracy was slightly lower than the performance of the CTN versus HC classification models (Fig. 4A). Analysis of variance of the features lists showed a similar trend as the CTN versus HC models for feature weights: RF prioritized DTI measures, whereas LR demonstrated no preference with regards to imaging modality-specific metrics (Fig. 4B). Our model identified several gray and white matter metrics as predictive for distinguishing TNP from HC, including corpus callosum, fornix and hippocampal cingulum diffusivity measures, contralateral insula and temporal lobe area gray matter (superior and inferior temporal gyri) changes bilaterally, and medial geniculate, ventral anterior and inferior pulvinar nuclei gray matter volume. Regional grey matter volume, thickness and surface area was reduced in TNP comparing to HCs (Fig. 3). Univariate statistics of top 50 features revealed 39 statistically significant LR predictors derived by LR and 48 statistically significant RF predictors (Supplemental Table 4).
CTN versus TNP
RF and LR classifiers of CTN and TNP showed approximately 51% accuracy with AUC equal to 0.51 (RF CTN recall 0.54, TNP recall 0.47) and 0.55 (LR CTN recall 0.65, TNP recall 0.41), respectively (Fig. 5A). However, univariate statistics found significant differences between CTN and TNP (p-corrected < 0.05) at two cortical measures: ipsilateral insular thickness and contralateral orbital sulcus thickness (Fig. 3 and Supplemental Table 5). These structures were significantly reduced in TNP group comparing to CTN. Both RF and LR CTN versus TNP models ranked T1 measures significantly higher than DTI measures for the CTN versus TNP classification task (p < 0.05) (Fig. 5B).
In this study, we focused on the use of multimodal brain imaging data alone to explore whether AI tools can distinguish chronic neuropathic facial pain subtypes from healthy control data. We found that AI tools with brain imaging data distinguished facial pain (CTN, TNP) from healthy controls with very high accuracy (up to 95% for CTN versus HC and up to 91% for TNP versus HC). Univariate statistics confirmed the directionality of changes in neuroanatomical substrates identified in both models (CTN versus HC and TNP versus HC) between compared groups (p-corrected < 0.01). Thus, using multiple AI models, we were able to identify neuroanatomical features that distinguish CTN from HC and TNP from HC. Importantly, while our models could not sufficiently discriminate between CTN and TNP, the models uncovered structural differences between CTN and TNP in the ipsilateral insula and contralateral orbital cortex, with subsequent univariate statistical analysis confirming the significant differences in these structures between the CTN and TNP groups (p-corrected < 0.05). Using a large dataset with heterogenous brain imaging data, we demonstrate that AI models can capture structural imaging-based facial pain signatures. The use of such a large-scale, dataset—to our knowledge the largest in chronic pain literature—permits precise and generalizable AI-driven investigation of chronic facial pain signatures20. Our findings open the pathway for a greater level of objectivity in the assessment of chronic neuropathic facial pain conditions.
AI for MR data analysis
AI can significantly improve the investigation of hidden factors in clinical data to obtain actionable gap-based information about patients and both prevent and rapidly detect disease32. This is important, especially for medical imaging, as AI is able to analyze and interpret large quantities of data without the subjective input of human factors33. Supervised learning offers highly focused and efficient analysis and can also highlight hidden predictive features of classified groups. In addition, AI can provide complementary insight when combined with conventional statistics—this has been demonstrated in the current study as well as in previous reports, summarized by Rajula et al.34.
An important requirement for AI models built with MR imaging data is their interpretability. RF and Bagged LR are ensemble methods which consist of multiple decision trees and logistic regressions, respectively. These models offer a high level of interpretability, necessary for effectively studying medical imaging data19,22,27,35. In addition, RF selects the best feature during training for splitting each node, therefore these classifiers work well with complex multivariate data36. Model performance and generalizability also depends on multiple factors, including sample size, heterogeneity of data, and the implementation of cross-validation procedures37,38,39. While larger training sample sizes increase the generalizability of study findings, it has been reported that studies with n > 150 actually suffer from lower model performance due to data heterogeneity40. Prior studies in chronic neuropathic facial pain used AI techniques such as support vector machines and Gaussian process classification with diffusion imaging to identify markers distinguishing CTN from HC. These studies achieved accuracies of 88% and 85% in distinguishing TN and HC subjects on relatively small datasets (46 and 72 subjects respectively)24. Compared to these attempts, we used different AI approaches and further refined our models to utilize multimodal (T1 and DTI) imaging data (from 371 CTN and TNP subjects and 108 healthy controls). This results in higher generalizability, and, more importantly, an increase in model performance (up to 95%).
The addition of clinical features (such as characteristics of pain presentation) to our models may further uncover the unique clinical attributes of pain syndromes. However, mixing the multimodal MR data with subjective categorical clinical variables would require the introduction of additional feature transformation approaches, potentially lowering the interpretability of the models, introducing sampling and cognitive biases to the analysis, resulting in model overfitting41. Therefore, our study focused on the objective multimodal MR data only as the model input.
AI-driven brain imaging biomarkers of chronic neuropathic facial pain
Prior literature on gray and white matter alterations has revealed regional brain abnormalities in neuropathic facial pain. Reduction of overall thalamic volume and pulvinar volume has been reported previously for CTN subjects42. Chronic and acute pain-related blood flow changes were demonstrated on facial pain patients with thalamic deep brain stimulation and significant differences in anterior insula and pulvinar regional blood flow between pain-free and non-stimulated pain patients has been reported43. In addition to group-level regional differences, grey matter volume decreases in the parahippocampal gyrus and temporal lobe have been correlated with the duration of TN44. Abnormalities in CNS white matter, in areas such as corpus callosum, cingulum and fornix have been previously reported, and are consistent with our observation (Fig. 3)45,46,47. While previous studies have used univariate statistical approaches to focus on specific regions of interest, our current unrestricted AI methodology has identified similar white matter regions of interest. This provides an important corroboration of the neuroanatomical regions using different methods of analysis.
Furthermore, the interpretability of the AI algorithms allowed us to both identify the previously described CNS structures associated with trigeminal pain, and highlight additional regions whose role in pain and sensory modulation should be further investigated (e.g., medial geniculate nucleus, posterior transverse collateral sulcus). It is plausible, for instance, that medial geniculate nucleus abnormalities are consistent with previous reports of patients with TN having an increased risk of developing auditory system disorders such as tinnitus48, and the concept that chronic pain may share some elements of pathogenesis with thalamocortical dysrhythmia49,50. A previous AI study also objectively identified occipital gray matter as predictive for surgical treatment response in TN patients28. Significant differences observed in pulvinar volume might also suggest that visual circuitry could be involved in chronic pain42. That supervised learning AI methodology is able to pinpoint these possible associations with structural features could be the subject of more focused research in both pathogenetic mechanisms and/or therapeutic interventions.
Orbitofrontal cortex and insula as key structures that differentiate between CTN and TNP
CTN and TNP classification models identified two important grey matter regions which demonstrated statistically significant differences between compared groups: ipsilateral insular gyri and contralateral orbitofrontal cortex. In the context of a comparison of CTN and TNP, these findings might suggest the link between severity, treatment outcomes and pain inhibition differences between these two syndromes. Insular abnormalities in chronic pain syndromes have been linked to the chronicity of pain51,52. Reward and pleasure-induced pain inhibition were reported to be modulated by orbitofrontal cortex areas53,54,55. De Souza et al. reported significant grey matter reduction in the insula and orbitofrontal cortex in classical and idiopathic TN patients56,57. Our results highlight the value of objective AI-based imaging analyses for the identification of group-level abnormalities, which may not be identifiable using univariate methods. For instance, Obermann et al. did not find morphological features differentiating constant and paroxysmal trigeminal pain subjects44.
While prior studies identified gray and white matter changes in TN using conventional statistical analysis methods, our study adds supervised learning AI algorithms as a modern way to investigate pain. We propose that these two sites (insula, OFC) should be studied further in the context of constant trigeminal pain. Given that patients with CTN and TNP have significant variability in pain relief after surgery, these structures may serve as potential predictors of surgical outcomes11,58,59.
Limitations and future directions
Our work focuses on brain imaging as the sole data source that informs the AI algorithms; however, we note that each brain imaging dataset is in fact labelled as either CTN or TNP, which are labels that arise from a clinical diagnosis. This dichotomy is an inherent limitation of supervised learning approaches in pain. As this is the early stage of applying AI in pain, we expect that new methods will lead to more complex ways of classifying facial pain syndromes and, consequently, more complex classification models.
The poor distinction between CTN versus TNP classification (~ 51%) can be considered as a limitation of this study. The clinical diagnosis of CTN and TNP is chiefly based on the clinical expression of pain. We acknowledge that there is a Venn diagram between these two entities, and it is possible that the clinical label is not perfectly accurate. This might contribute to the limited AI-based discrimination of these entities. Furthermore, atypical forms of trigeminal pain (constant pain, deafferentation pain and postherpetic neuralgia) consist in pooled data, of what may constitute several pain conditions that are different from each other but have a similar expression of pain. Future studies can focus on larger TNP and greater distinction among these conditions, to investigate whether there are any unique structural patterns which can be identified by more complex AI tools.
This retrospective study may provide insights into how structural changes in T1 and DTI imaging are associated with specific diagnoses of chronic pain and potentially identify a novel approach to classifying trigeminal pain syndromes. As our study focuses on the cortical and subcortical gray and white matter metrics, however, it does not include measurements from the trigeminal nerve, either along the cisternal segment, root entry zone or the pons. Prior reports of trigeminal nerve level abnormalities in TN describe the role of these regions in the prediction of the type of pain or the successful surgical outcome60,61. Possible future addition of the trigeminal nerve diffusivity metrics to the model can further improve the model performance.
Subjects with TN secondary to multiple sclerosis (MS-TN group) were excluded from analysis due to having predominantly MS-specific MR signatures. Classification of the CTN and MS-TN groups would reveal MS/non-MS comparison and therefore highlight MS-specific demyelination-related features. Since our primary focus is the trigeminal pain-specific imaging features, the analysis of the MS-TN group compared to the pain-free MS subjects would be more appropriate for future studies.
This study demonstrated the ability of AI to serve not only as a classification tool, but also as the brain imaging data analysis framework which can uncover structural insights into chronic pain. The use of AI in combination with multimodal MR imaging data may permit novel, objective data-driven patient management.
This retrospective study was approved by the University Health Network (UHN) Research Ethics Board (Protocol number: 18-5780). As patient data included in the study was collected retrospectively, additional informed consent was not required according to the University Health Network Research Ethics Board policy. Healthy control participants were recruited from the community and provided written informed consent before the study. All experiments were performed in accordance with relevant guidelines and regulations.
Data acquisition and selection
914 adult patients diagnosed with neuropathic facial pain conditions at the Toronto Western Hospital, locally followed up between 2005 and 2018, were identified as potential study subjects. All patients included in this study were diagnosed by a neurosurgeon (MH) based on the International Classification of Headache Disorders, 3rd edition (ICHD-3) and Burchiel Classification9,62. Of these, we initially selected 586 subjects with pre-surgical 3T T1- and diffusion-weighted imaging data as well as present pain symptoms at a time of the MR imaging acquisition. We also identified data from healthy individuals recruited locally (n = 50) and the publicly available Cambridge Centre for Ageing Neuroscience dataset (n = 58)25,63. An equivalence test (TOST) and t-SNE visualization were used to assess the similarity of data extracted from local cohort and CamCAN31,64. Exclusion criteria for the healthy control (HC) and patient data included: a history of any neurological disease, clinically evident sensory deficit, and MR data artifacts (e.g., sliced image, major processing errors resulting in a failure of the pipeline) (Supplementary Fig. 1). MS-TN subjects (n = 64) were also excluded from the further analysis given their clear DTI-based distinction from the non-MS population. After exclusion, including quality assurance of the processed T1 and DWI imaging data and verification of clinical information, data from 371 adults with neuropathic facial pain and 108 HC was used in this study.
All patients and healthy controls included in this study underwent similar clinical imaging protocols, detailed in Table 2.
T1-weighted MR images were converted to ‘nifti’ format and processed in FreeSurfer (version 7.0; https://surfer.nmr.mgh.harvard.edu/). Cortical thickness metrics were extracted from each subject with its standard image processing and segmentation pipeline (“recon-all” function)65,66. Outputs of the pipeline were used to extract regional grey matter measurements (gray matter thickness, surface area, volume). Regional cortical thickness and surface area metrics for each study subject were exported using Destrieux atlas segmentation67. We segmented individual thalamic nuclei using “segmentThalamicNuclei” and extracted gray matter volume for each nucleus68. All gray matter measurements were normalized by estimated total intracranial volume for each subject.
Diffusion-weighted images underwent eddy current, motion correction and skull-stripping in FSL (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki), followed by diffusion tensor estimation and derivation of three white matter diffusivity scalars; fractional anisotropy (FA), radial diffusivity (RD), and axial diffusivity (AD) using 3D Slicer (SlicerDMRI, https://www.slicer.org/). These DTI scalar volumes were aligned with the Montreal Neurological Institute (MNI) standard template space by nonlinear registration using Advanced Normalization Tools (ANTs; http://stnava.github.io/ANTs/)69. The Johns Hopkins University (JHU) White Matter Atlas was then used to extract regional DTI metrics for each scalar volume70. A summary of all imaging data processing is shown in Fig. 6.
All brain imaging features in the dataset were first standardized to unit variance and labelled with respect to pain laterality (ipsilateral or contralateral) to facilitate subsequent supervised and unsupervised ML analyses. This improves the generalizability and interpretability of constructed models71. If the subject is a healthy control, the ipsilateral hemisphere was randomly chosen from either the left or right hemispheres. This has been done to ensure that HC data follow the same transformation as the subjects in the pain dataset. All 490 features were used to perform unsupervised and supervised ML tasks.
Artificial intelligence models
We used dimensionality reduction algorithms to overcome the limitations of conventional voxel-wise statistics-based methods and to identify brain imaging data patterns that correspond to different chronic facial pain classes within our subject cohort. Specifically, we used principal component analysis (PCA) and t-distributed stochastic neighbour embedding (t-SNE)64,72, both of which allow a large number of redundant metrics to be condensed into relatively few new variables, which carry the data-representative information and can be easily visualized73.
A two-dimensional t-SNE was used to visualize the dataset. PCA was used to reduce the dimensionality of the dataset. The most important features in the PCA were defined as those with the highest absolute values for the PCA loadings for a particular component (PC1). PC1 of facial pain and healthy control subjects were compared using Student’s independent t-test under the null hypothesis of no difference between compared distributions.
We used two supervised learning methods—balanced Random Forest (RF) and bagged Logistic Regression (LR). The RF classifier model was selected given our multivariate data and the RF’s inherent ability to perform feature selection36. The metrics used to evaluate the models were: accuracy, recall, precision, and the receiver operating characteristic (ROC) curve. Default hyperparameters were used (Supplemental Table 1). The most important features were defined as the top 10 predictors that resulted in the highest mean impurity decrease in the RF classification.
We used bagged LR as a second classification algorithm. To mitigate class imbalance, a bagging classifier consisting of 10 regressions was trained, with each LR trained on a subset of the data that was randomly under-sampled from the majority class. Regressions were implemented in scikit-learn (Python), while the balanced bagging classifier was implemented in imbalanced-learn74,75. Usage of the bagged LR and RF with the tabulated data extracted from MR imaging allows for the interpretability of the models, which is a strong advantage compared to “black box” deep learning models and raw imaging data76,77.
We used leave-one-out cross-validation for all tasks. On each iteration, before training, we normalized all features in the training dataset to attempt to fit a standard normal distribution. To ensure that features are scaled to the same magnitude, the testing example was transformed using the mean and standard deviations for each feature found in the training set. This process was repeated 10 times for each classifier. Mean cross-validated prediction from all 10 training iterations was used to generate an overall ROC curve for each classifier and the mean accuracy had been reported.
Feature weights for the models were stratified by the imaging modality and compared using one-way analysis of variance (ANOVA) to identify whether the model prioritizes one MR modality over the other.
Post-hoc univariate statistics
Following the supervised learning tasks, we performed a post-hoc univariate statistical analysis of the top 50 predictors identified by each classifier, to identify the directionalities of regional gray and white matter alterations contributing to each model prediction. The cut-off was arbitrary and defined based on the feature weights decay of classifier models (Supplemental Fig. 5). Student’s independent t-tests were used to determine the p-value under the null hypothesis of no difference between groups (CTN versus TNP, CTN versus HC, and TNP versus HC), as well as the confidence interval for α = 0.05. To correct p-values for the use of multiple tests, the Holm-Sidak method was used to maintain a false discovery rate of 0.05 (the confidence intervals remained uncorrected). Statistically significant regions from 10 predictors of each classifier were used for visualization purposes.
Data and code availability
MR data from trigeminal pain subjects and code will be shared upon reasonable request. The CamCAN imaging data is publicly available (https://camcan-archive.mrc-cbu.cam.ac.uk/dataaccess/index.php).
Cambridge Centre for Ageing and Neuroscience
Central nervous system
Classical trigeminal neuralgia
Diffusion tensor imaging
The International Classification of Headache Disorders 3rd edition
Johns Hopkins University
Magnetic resonance imaging
Principal component analysis
Receiver operator curve
Trigeminal neuropathic pain
T-distributed Stochastic Neighbor Embedding
Yasaei, R., Peterson, E. & Saadabadi, A. Chronic Pain Syndrome (StatPearls, Treasure Island, 2021).
Hoy, D. et al. The global burden of low back pain: Estimates from the Global Burden of Disease 2010 study. Ann. Rheum. Dis. 73, 968–974 (2014).
Cohen, S. P., Vase, L. & Hooten, W. M. Chronic pain: An update on burden, best practices, and new advances. Lancet 397, 2082–2097 (2021).
Lund, I. & Lundeberg, T. Aspects of pain, its assessment and evaluation from an acupuncture perspective. Acupunct. Med. 24, 109–117 (2006).
Chun-Cheng, Q., Qing-Shi, Z., Ji-Qing, Z. & Zhi-Gang, W. A single-blinded pilot study assessing neurovascular contact by using high-resolution MR imaging in patients with trigeminal neuralgia. Eur. J. Radiol. 69, 459–463 (2009).
Maarbjerg, S., Di Stefano, G., Bendtsen, L. & Cruccu, G. Trigeminal neuralgia—diagnosis and treatment. Cephalalgia 37, 648–657 (2017).
Di Stefano, G., La Cesa, S., Truini, A. & Cruccu, G. Natural history and outcome of 200 outpatients with classical trigeminal neuralgia treated with carbamazepine or oxcarbazepine in a tertiary centre for neuropathic pain. J. Headache Pain 15, 1–5 (2014).
Cruccu, G. et al. Trigeminal neuralgia: New classification and diagnostic grading for practice and research. Neurology 87, 220–228 (2016).
Olesen, J. et al. The international classification of headache disorders, 3rd edition (beta version). Cephalalgia 33, 629–808 (2013).
Burchiel, K. J. & Slavin, K. V. On the natural history of trigeminal neuralgia. Neurosurgery 46, 152–155 (2000).
Tyler-Kabara, E. C. et al. Predictors of outcome in surgically managed patients with typical and atypical trigeminal neuralgia: comparison of results following microvascular decompression. J. Neurosurg. 96, 527–531 (2002).
Lavrov, I. et al. Pre-motor versus motor cerebral cortex neuromodulation for chronic neuropathic pain. Sci. Rep. 11, 12688 (2021).
Davis, K. D. & Moayedi, M. central mechanisms of pain revealed through functional and structural MRI. J. Neuroimmune Pharmacol. 8(3), 518–534 (2012).
Moayedi, M. et al. Contribution of chronic pain and neuroticism to abnormal forebrain gray matter in patients with temporomandibular disorder. Neuroimage 55, 277–286 (2011).
Murray, M. G. et al. Different pain, different brain: thalamic anatomy in neuropathic and non-neuropathic chronic pain syndromes. J. Neurosci. 31, 5956–5964 (2011).
Doshi, T. L., Nixdorf, D. R., Campbell, C. M. & Raja, S. N. Biomarkers in temporomandibular disorder and trigeminal neuralgia: A conceptual framework for understanding chronic pain. Can. J. Pain Rev. Can. la douleur 4(1), 1–18 (2020).
Marek, S. et al. Reproducible brain-wide association studies require thousands of individuals. Nature 603(7902), 654–660 (2022).
Bzdok, D., Altman, N. & Krzywinski, M. Statistics versus machine learning. Nat. Methods 15, 233 (2018).
Erickson, B. J., Korfiatis, P., Akkus, Z. & Kline, T. L. Machine learning for medical imaging. Radiographics 37, 505–515 (2017).
Mamoshina, P., Vieira, A., Putin, E. & Zhavoronkov, A. Applications of deep learning in biomedicine. Mol. Pharm. 13, 1445–1454 (2016).
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) vol. 9351 234–241 (Springer Verlag, 2015).
Kim, C. H. et al. Artificial intelligence techniques for prostate cancer detection through dual-channel tissue feature engineering. Cancers (Basel) 13, 1524 (2021).
Bakas, S. et al. Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge. arXiv 124 (2018).
Zhong, J. et al. Multivariate pattern classification of brain white matter connectivity predicts classic trigeminal neuralgia. Pain 159, 2076–2087 (2018).
Taylor, J. R. et al. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample. Neuroimage 144, 262–269 (2017).
Dong, H., Yang, G., Liu, F., Mo, Y. & Guo, Y. Automatic brain tumor detection and segmentation using U-net based fully convolutional networks. Commun. Comput. Inf. Sci. 723, 506–517 (2017).
Ashkezari, S. F. S. et al. Prediction of bleb formation in intracranial aneurysms using machine learning models based on aneurysm hemodynamics, geometry, location, and patient population. J. Neurointerv. Surg. 0, neurintsurg-2021–017976 (2021).
Hung, P.S.-P. et al. Regional brain morphology predicts pain relief in trigeminal neuralgia. Neuroimage (Amst.) 31, 102706 (2021).
Acosta, J. N., Falcone, G. J., Rajpurkar, P. & Topol, E. J. Multimodal biomedical AI. Nat. Med. 28(9), 1773–1784 (2022).
Cukurova, M., Giannakos, M. & Martinez-Maldonado, R. The promise and challenges of multimodal learning analytics. Br. J. Educ. Technol. 51, 1441–1449 (2020).
Lakens, D. Equivalence tests: a practical primer for t tests, correlations, and meta-analyses. Soc. Psychol. Pers. Sci. 8, 355–362 (2017).
Ahmed, Z., Mohamed, K., Zeeshan, S. & Dong, X. Q. Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database (Oxford) 2020, baaa010 (2020).
Greenspan, H., Van Ginneken, B. & Summers, R. M. Guest editorial deep learning in medical imaging: Overview and future promise of an exciting new technique. IEEE Trans. Med. Imaging 35, 1153–1159 (2016).
Rajula, H. S. R., Verlato, G., Manchia, M., Antonucci, N. & Fanos, V. Comparison of conventional statistical methods with machine learning in medicine: Diagnosis, drug development, and treatment. Medicina (B. Aires) 56, 455 (2020).
Rokach, L. & Maimon, O. Data Mining with Decision Trees Vol. 81 (WORLD SCIENTIFIC, Singapore, 2014).
Baka, N., Leenstra, S. & van Walsum, T. Random forest-based bone segmentation in ultrasound. Ultrasound Med. Biol. 43, 2426–2437 (2017).
Varoquaux, G. Cross-validation failure: Small sample sizes lead to large error bars. Neuroimage 180, 68–77 (2018).
Poldrack, R. A., Huckins, G. & Varoquaux, G. Establishment of best practices for evidence for prediction a review. JAMA Psychiat. 77, 534 (2020).
An, C. et al. Radiomics machine learning study with a small sample size: Single random training-test set split may lead to unreliable results. PLoS ONE 16, e0256152 (2021).
Schnack, H. G. & Kahn, R. S. Detecting neuroimaging biomarkers for psychiatric disorders: Sample size matters. Front. Psychiatry 7, 50 (2016).
Gosiewska, A., Kozak, A. & Biecek, P. Simpler is better: Lifting interpretability-performance trade-off via automated feature engineering. Decis. Support Syst. 150, 113556 (2021).
Tsai, Y. H. et al. Altered structure and functional connection in patients with classical trigeminal neuralgia. Hum. Brain Mapp. 39, 609–621 (2018).
Kupers, R. C., Gybels, J. M. & Gjedde, A. Positron emission tomography study of a chronic pain patient successfully treated with somatosensory thalamic stimulation. Pain 87, 295–302 (2000).
Obermann, M. et al. Gray matter volume reduction reflects chronic pain in trigeminal neuralgia. Neuroimage 74, 352–358 (2013).
Hayes, D. J. et al. Affective circuitry alterations in patients with trigeminal neuralgia. Front. Neuroanat. 11, 73 (2017).
Desouza, D. D., Hodaie, M. & Davis, K. D. Abnormal trigeminal nerve microstructure and brain white matter in idiopathic trigeminal neuralgia. Pain 155, 37–44 (2014).
Li, R. et al. The integrity of the substructure of the corpus callosum in patients with right classic trigeminal neuralgia. J. Craniofac. Surg. 32, 632–636 (2021).
Cheng, Y. F. et al. Increased risk of tinnitus following a trigeminal neuralgia diagnosis: A one-year follow-up study. J. Headache Pain 21, 1–7 (2020).
Vanneste, S., Song, J. J. & De Ridder, D. Thalamocortical dysrhythmia detected by machine learning. Nat. Commun. 9, 1–13 (2018).
Tu, Y. et al. Distinct thalamocortical network dynamics are associated with the pathophysiology of chronic low back pain. Nat. Commun. 11(1), 1–12 (2020).
Wiech, K. & Tracey, I. The influence of negative emotions on pain: behavioral effects and neural mechanisms. Neuroimage 47, 987–994 (2009).
Desouza, D. D., Hodaie, M. & Davis, K. D. Structural magnetic resonance imaging can identify trigeminal system abnormalities in classical trigeminal neuralgia. Front. Neuroanat. 10, 95 (2016).
Becker, S., Gandhi, W., Pomares, F., Wager, T. D. & Schweinhardt, P. Orbitofrontal cortex mediates pain inhibition by monetary reward. Soc. Cogn. Affect. Neurosci. 12, 651 (2017).
Younger, J., Aron, A., Parke, S., Chatterjee, N. & Mackey, S. Viewing pictures of a romantic partner reduces experimental pain: involvement of neural reward systems. PLoS ONE 5, e13309 (2010).
Roy, M., Piché, M., Chen, J. I., Peretz, I. & Rainville, P. Cerebral and spinal modulation of pain by emotions. Proc. Natl. Acad. Sci. U. S. A. 106, 20900–20905 (2009).
De Souza, D. D., Davis, K. D. & Hodaie, M. Reversal of insular and microstructural nerve abnormalities following effective surgical treatment for trigeminal neuralgia. Pain 156, 1112–1123 (2015).
DeSouza, D. D., Moayedi, M., Chen, D. Q., Davis, K. D. & Hodaie, M. Sensorimotor and pain modulation brain abnormalities in trigeminal neuralgia: A paroxysmal, sensory-triggered neuropathic pain. PLoS ONE 8, 66340 (2013).
Li, S. T. et al. Studies on the operative outcomes and mechanisms of microvascular decompression in treating typical and atypical trigeminal neuralgia. Clin. J. Pain 21, 311–316 (2005).
Miller, J. P., Acar, F. & Burchiel, K. J. Classification of trigeminal neuralgia: clinical, therapeutic, and prognostic implications in a series of 144 patients undergoing microvascular decompression: Clinical article. J. Neurosurg. 111, 1231–1234 (2009).
Tohyama, S., Hung, P., Zhong, J. & Hodaie, M. Early postsurgical diffusivity metrics for prognostication of long-term pain relief after Gamma Knife radiosurgery for trigeminal neuralgia. J. Neurosurg. 131, 539–548 (2018).
Hung, P., Tohyama, S., Zhang, J. Y. & Hodaie, M. Temporal disconnection between pain relief and trigeminal nerve microstructural changes after Gamma Knife radiosurgery for trigeminal neuralgia. J. Neurosurg. https://doi.org/10.3171/2019.4.JNS19380 (2019).
Burchiel, K. J. A new classification for facial pain. Neurosurgery 53, 1164–1167 (2003).
Snoek, L. et al. The Amsterdam Open MRI Collection, a set of multimodal MRI datasets for individual difference analyses. BioRxiv https://doi.org/10.1101/2020.06.16.155317 (2020).
Panta, S. R. et al. A tool for interactive data visualization: Application to over 10,000 brain imaging and phantom MRI data sets. Front. Neuroinformatics 10, 9 (2016).
Desikan, R. S. et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage 31, 968–980 (2006).
Fischl, B. et al. Automatically parcellating the human cerebral cortex. Cereb. Cortex 14, 11–22 (2004).
Destrieux, C., Fischl, B., Dale, A. & Halgren, E. Automatic parcellation of human cortical gyri and sulci using standard anatomical nomenclature. Neuroimage 53, 1–15 (2010).
Iglesias, J. E. et al. A probabilistic atlas of the human thalamic nuclei combining ex vivo MRI and histology. Neuroimage 183, 314–326 (2018).
Avants, B. B. et al. A reproducible evaluation of ANTs similarity metric performance in brain image registration. Neuroimage 54, 2033–2044 (2011).
Mori, S., Oishi, K. & Faria, A. V. White matter atlases based on diffusion tensor imaging. Curr. Opin. Neurol. 22, 362 (2009).
Han, J., Kamber, M. & Jian, P. Data Mining: Concepts and Techniques—Google Books. https://www.google.com/books/edition/Data_Mining_Concepts_and_Techniques/pQws07tdpjoC?hl=en&gbpv=0.
Hu, Y., Li, X., Wang, L., Han, B. & Nie, S. T-distribution stochastic neighbor embedding for fine brain functional parcellation on rs-fMRI. Brain Res. Bull. 162, 199–207 (2020).
Kobak, D. & Berens, P. The art of using t-SNE for single-cell transcriptomics. Nat. Commun. 10, 5416 (2019).
Pedregosa, F. et al. Scikit-learn: Machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Lemaître, G., Nogueira, F. & Aridas, C. K. Imbalanced-learn: A python toolbox to tackle the curse of imbalanced datasets in machine learning. J. Mach. Learn. Res. 18, 1–5 (2017).
Greener, J. G., Kandathil, S. M., Moffat, L. & Jones, D. T. A guide to machine learning for biologists. Nat. Rev. Mol. Cell Biol. 23(1), 40–55 (2021).
Shwartz-Ziv, R. & Armon, A. Tabular data: Deep learning is not all you need. Inf. Fusion 81, 84–90 (2021).
This work has been funded by the CIFAR-Temerty Innovation Catalyst Grant. Dr. Mojgan Hodaie is the surgical co-director of Joey & Toby Tanenbaum Family Gamma Knife Centre and an unpaid member of the scientific advisory board of the Facial Pain Research Foundation. Dr. Frank Rudzicz is the CIFAR Chair in AI at the Vector Institute for Artificial Intelligence, Associate Professor at Dalhousie University and at the University of Toronto. The authors declare no other financial or non-financial conflicts of interest.
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Latypov, T.H., So, M.C., Hung, P.SP. et al. Brain imaging signatures of neuropathic facial pain derived by artificial intelligence. Sci Rep 13, 10699 (2023). https://doi.org/10.1038/s41598-023-37034-y