Migraine headache (MH) classification using machine learning methods with data augmentation

Khan, Lal; Shahreen, Moudasra; Qazi, Atika; Jamil Ahmed Shah, Syed; Hussain, Sabir; Chang, Hsien-Tsung

doi:10.1038/s41598-024-55874-0

Article
Open access
Published: 02 March 2024

Lal Khan¹,
Moudasra Shahreen²,
Atika Qazi³,
Syed Jamil Ahmed Shah²,
Sabir Hussain⁴ &
…
Hsien-Tsung Chang^5,6,7

Scientific Reports volume 14, Article number: 5180 (2024)

Abstract

Migraine headache, a prevalent and intricate neurovascular disease, presents significant challenges in its clinical identification. Existing techniques that rely on subjective pain intensity measures are insufficiently accurate to support a reliable diagnosis. Although headaches are a common condition with poor diagnostic specificity, they have a significant negative influence on the brain, body, and general human function. In this era of deeply intertwined health and technology, machine learning (ML) has emerged as a crucial force in transforming every aspect of healthcare, and it has shown groundbreaking achievements in developing classifiers and automatic predictors. Deep learning models, in particular, have proven effective in solving complex problems spanning computer vision and data analytics. Consequently, the integration of ML in healthcare has become vital. In developing countries, where limited medical resources and lack of awareness prevail, the need to forecast and categorize migraines using artificial intelligence (AI) is even more urgent. This study focuses on leveraging state-of-the-art ML algorithms, including support vector machine (SVM), K-nearest neighbors (KNN), random forest (RF), decision tree (DST), and deep neural networks (DNN), to predict and classify various types of migraines. The models were trained on a publicly available dataset, with and without data augmentation, to classify seven types of migraine. The results show that DNN, SVM, KNN, DST, and RF achieved accuracies of 99.66%, 94.60%, 97.10%, 88.20%, and 98.50%, respectively, with data augmentation, highlighting the transformative potential of AI in enhancing migraine diagnosis.

Introduction

Headache is one of the most frequent symptoms seen by neurologists and is very common in the general population. A study¹ revealed that more than 90% of the population suffers from headaches. Patients in China spend 672.7 billion yuan each year² on related medical care. Although migraines may not pose a serious threat to human life, they can have a significant negative impact on work performance, quality of life, and the physiology and psyche of the patient³.

Headache is itself an agonizing and disabling neurological disease. According to the International Classification of Headache Disorders (ICHD), headache is broadly categorized into primary headaches (migraine, tension-type headache, and trigeminal autonomic cephalalgia or cluster headache), secondary headaches or facial pain, and painful cranial neuropathies⁴.

Primary headache disorders are the most prevalent globally; the estimated rates of active tension-type headache and migraine headache are approximately 40% and 10%, respectively. Cluster headache, also known as trigeminal autonomic cephalalgia, is rare by comparison, occurring in roughly 0.1% of the population^5,6. Severe migraine headache is a very common neurological disease, with a 1-year incidence of approximately 16% in the overall population. It is the second most prominent brain condition in the world and causes more impairment than all other neurological ailments combined⁷. Migraines are categorized into migraines with aura, migraines without aura, and chronic migraines. Around 25% of individuals experience migraines with aura. The symptoms of migraine with aura include the gradual appearance of visual, speech, and other central nervous system-related signs. A migraine with aura lasts an hour⁸. Migraines without aura mainly present as unilateral, pulsating, moderate-to-severe pain that is aggravated by, or causes avoidance of, daily physical activity. When not treated adequately, the headache lasts from 4 to 72 h and is associated with nausea, vomiting, phonophobia, and photophobia⁹. Chronic migraine, in contrast, refers to a recurrence pattern in which an individual has at least 15 headache days per month, including at least eight fully established migraine days per month¹⁰. Human beings now live in a digital environment in which everything, including their lives, is connected to data sources and recorded digitally¹¹.

In the medical world, each disease is distinguished from the others by its symptoms. In migraine sufferers, the following symptoms appear: vomiting and nausea, increased urination, lethargy (lack of energy), phonophobia (sensitivity to noise), photophobia (sensitivity to light), and throbbing headache.

Migraine is a frequently neglected disease. A patient’s migraine headache (MH) begins as a result of some triggers. A trigger is an event or behavior that appears to bring on a migraine episode. Common migraine triggers include stress, drinks (coffee), medication, sleep changes, changes in the weather, and hormonal changes in women.

Traditional imaging modalities used for migraine classification and prediction, such as magnetic resonance imaging (MRI), positron emission tomography (PET), and computed tomography (CT) scans, are very expensive. Additionally, highly experienced medical doctors are required to advise migraine patients. People living in developing countries, in particular, often cannot afford these services, and there is also a dearth of resources for migraine classification and prediction. Therefore, an automatic, affordable, and accessible approach is required for migraine classification and prediction. Fortunately, machine learning algorithms show state-of-the-art performance in tasks such as text classification^12,13,14,15, speech recognition^16,17,18,19, and many health-related automatic disease classifications and predictions^20,21.

Most existing models were trained on relatively small data sets; therefore, despite significant efforts on ML-based migraine classification, their performance is not up to par. Furthermore, existing models do not use real patient data sets. In contrast, we used a slightly larger data set with data augmentation to train our suggested DNN model. Our suggested model outperforms earlier studies because the corpus size was expanded using data augmentation. The suggested models are also trained using data from actual patients. As a result, the models give a realistic picture of migraine classification. The primary contributions of this investigation are as follows:

The key contribution of this paper is the investigation of various machine learning algorithms for classifying different types of migraines. The study explores the use of support vector machine (SVM), K-nearest neighbors (KNN), random forest (RF), decision tree (DST), and deep neural network (DNN) algorithms trained on a publicly available dataset. The study also investigates the impact of data augmentation on the performance of these algorithms. The results of the study show that the proposed models with data augmentation achieve high accuracy in classifying the different types of migraines, with the DNN model achieving the highest accuracy of 99.66%. The study shows that these machine learning algorithms could be effective tools for predicting and classifying migraine in patients, particularly in underdeveloped countries where there is a lack of medical instruments, staff, and doctors. Overall, this paper contributes to the growing body of literature on the use of machine learning in healthcare and demonstrates the potential of these techniques for improving diagnosis and treatment of migraine.
We apply data augmentation techniques, evaluate their effectiveness, and train and test the proposed models on actual patient data.
We used a data augmentation technique to expand the data set, which is another contribution. The augmented data set will be made public for further investigation. Finally, we provide Python code for the migraine classification framework. We believe that the enhanced migraine corpus and model library will be useful in fostering research on migraine classification in underdeveloped countries.
Another key contribution is the comparison of the applied machine learning algorithms after data augmentation to identify the best classifier. A set of machine learning models, namely SVM, random forest, KNN, DST, and DNN, was implemented to predict and classify migraine.

The rest of the paper is structured as follows: related research is described in the "Related work" section, and the proposed methodology and corpus are explained in the "Materials and method" section. The experiments and their findings are described in the "Experiments" section. The final sections present the conclusion and future work.

Related work

Although machine learning has been hugely successful in healthcare data-related tasks such as computer-aided diagnosis, image certification, image classification, image-guided medical aid, data repository acquisition, multidimensional image integration, and medical image dissection, a lot of work still needs to be done [chen2021ethical]. Within the medical industry, ML has so far had relatively little social influence. ML offers a way to lower the rising cost of treatment and to foster better clinician-patient dialogue. Common health-related applications of ML include helping physicians identify patient-specific medications and treatments, as well as helping patients decide when and whether they need to schedule follow-up consultations²². A vast amount of data is now available in the medical industry, including electronic medical records (EMRs), which contain both structured and unstructured data. Structured clinical data is easy to evaluate in a database and includes figures and classifications such as patient weights, as well as general signs like headaches and stomachaches²³. The majority of clinical data is unstructured and spread across a wide range of annotations, pictures, podcasts, assessments, and exit statements. A discussion between a provider and a patient is very difficult to quantify and evaluate since it is highly individualized and can go in many different directions²⁴. ML algorithms help locate complex correlations in large volumes of data. Healthcare procedures, notably those that depend on cutting-edge genomic and proteomic analysis, are exceptionally well suited to this technology. ML is frequently employed in the diagnosis and monitoring of many illnesses. ML algorithms can provide better therapeutic strategies for individuals in clinical settings by making recommendations for building beneficial medical systems²⁵.

Within the fields of machine learning and data science, several classification methods have been proposed. The most widely used techniques across a range of application areas include Naïve Bayes, linear discriminant analysis (LDA), logistic regression (LR), K-nearest neighbors (KNN), decision tree (DST), random forest (RF), fuzzy logic, support vector machine (SVM), and classification and regression tree (CART)²⁶.

In study²⁷, the authors used five supervised machine learning techniques to classify migraine based on the subjects' symptoms. The Weka data mining tool was used for implementation and classification. According to the results, Naïve Bayes was the best-suited and simplest algorithm among the chosen models.

EEG signals were used in study²⁸ to detect and classify migraine types with a computer-aided diagnostic (CAD) system that utilized deep learning models such as VGG16, ResNet101, and DenseNet121. In another study²⁹, electroencephalography (EEG) was used to support proficient decision-making in the automatic detection of migraine. The EEG dataset consisted of 18 migraine patients and 21 healthy volunteers. The results showed that the Bi-LSTM algorithm with 128 channels achieved the highest accuracy, 95.99%, compared with the other models (support vector machine, linear discriminant analysis, and random forest). In another study³⁰, the authors used clinical data from 400 patients, annotated by domain experts. They first collected symptom-based data and then identified and selected the 24 most relevant variables. After variable selection, an artificial neural network (ANN) and other traditional ML models were used to classify migraine. The ANN achieved the highest accuracy, 97%, on the migraine classification task compared with the other models used, such as SVM, LR, decision tree, and nearest neighbor.

In study³¹, somatosensory evoked potential features were built in the time and frequency domains for migraine classification using machine learning algorithms, including support vector machines, random forest, K-nearest neighbors, extreme gradient boosting trees, linear discriminant analysis, multilayer perceptron, and logistic regression. The authors achieved over 88% accuracy in distinguishing interictal or ictal migraine from healthy subjects.

An inception module-based CNN approach achieved an accuracy of 86.18%, an improvement over the traditional support vector machine, which reached 83.67%, in discriminating healthy controls from migraineurs along with the two subtypes of headaches³². In study³³, a feature selection method was introduced to improve the classification of the migraine group. With the Naïve Bayes, SVM, and AdaBoost classifiers, performance improved from 67 to 93%, 90 to 95%, and 93 to 94%, respectively. Similarly, another technique was proposed in³⁴ for early diagnosis of migraine using EEG signals. The ANN reached the highest accuracy, 88%, compared with SVM and logistic regression. In another study, functional near-infrared spectroscopy (fNIRS) was used to observe hemoglobin changes in the prefrontal cortex (PFC) during a mental arithmetic task (MAT). The model showed a specificity of 75% and a sensitivity of 100% for chronic migraine (CM), and a specificity of 100% and a sensitivity of 75% for medication-overuse headache (MOH). According to the results, fNIRS is more feasible for migraine classification when combined with machine learning³⁵.

In study²⁰, the authors' primary focus was to design and implement a decision support system for diagnosing tension-type and migraine headaches using machine learning. The logistic regression model achieved the best results, with an accuracy of 0.84, compared with other models such as gradient boosting algorithms and random forest.

Data mining techniques play a vital role in the field of medicine. In study³⁶, data mining classification techniques such as Naïve Bayes, KNN, SVM, and random forest were used. Among these, Naïve Bayes was the best classifier, with a precision of 0.905 and an accuracy of 0.475. In another work, a realistic scenario was proposed for monitoring hemodynamic variables from real patients using a wireless body sensor network (WBSN). N4SID models were developed that provide a low rate of false positives and average forecast windows of 47 min³⁷. A machine learning technique was also used to differentiate between healthy subjects and migraine patients by combining three functional measures of rs-fMRI³⁸.

Ufuk et al.³⁹ proposed a DNN for diagnosing migraine and achieved 95% accuracy. The authors used 8 attributes and diagnosed 3 types of migraine (migraine with aura, c, and migraine without aura). Ferroni⁴⁰ proposed a decision support system (DSS) for diagnosing migraine due to medication overuse and achieved 82% accuracy. Another study⁴¹ proposed a DSS for diagnosing primary headaches and achieved 80% accuracy. The authors compared four different machine learning techniques: Bagging, Naïve Bayes, Boosting, and Random Forest. Robert Keight³¹ proposed a DST for diagnosing types of primary headaches using 9 types of machine learning classifiers and achieved 95% accuracy. Hao Yang⁴² proposed a CNN for MRI-based classification of migraine and achieved 99% accuracy. Akben⁴³ implemented an ANN for diagnosing migraine and achieved 83.3% accuracy. Akben⁴⁴ deployed an SVM classifier for diagnosing migraine and achieved 85% accuracy. Subasi⁴⁵ used various versions of Random Forest for diagnosing migraine and achieved 85.95% accuracy. De la Hoz⁴⁶ used an ANN for diagnosing migraine and achieved 88% accuracy. Yolanda Garcia³³ suggested feature selection for diagnosing migraine and achieved 90% accuracy. Recently, scholars have increasingly focused on utilizing MRI and fMRI images for the detection and classification of migraine headaches^47,48,49.

Materials and method

Processing data requires considerable time and computing resources, even when using standard methods such as machine learning or deep learning. Extracting relevant insights therefore requires a robust machine learning framework, and creating such a framework is difficult. Personalization is made possible by modifying the parameters of a machine learning classifier. In this paper, the models were trained using various ML algorithms, and the parameters of the algorithms were tuned to the input dataset for enhanced classification. The machine learning algorithms used in this study, shown in Fig. 1, are support vector machine, K-nearest neighbor, decision tree, random forest, and deep neural network.

Pre-processing

It is crucial to pre-process the data before supplying it to machine learning models to get better results. On the migraine corpus, noisy data were eliminated, inconsistent data were made consistent, errors were identified, and the data were translated into numerical variables. Data augmentation was also applied during the pre-processing stage to increase the volume of the corpus. Noise was eliminated by transforming the object data and removing irregular patterns, such as atypical data, typing errors, and blank, incomplete, or inconsistent entries. Data augmentation is a method of increasing the volume of data by adding new data points derived from existing data. The Synthetic Minority Oversampling Technique (SMOTE)⁵⁰ is a data augmentation technique used to increase the data size and reduce class imbalance. SMOTE adds synthetic minority-class examples to the original data set. We used SMOTE to achieve a balanced data set.
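The core idea of SMOTE can be illustrated with a minimal sketch. The code below is a simplified, standard-library-only illustration and is not the authors' implementation: the function name `smote_oversample`, the neighbour count `k`, the random seed, and the toy data are all assumptions made for this example. Each synthetic point is placed at a random position on the line segment between a minority-class sample and one of its nearest same-class neighbours. In practice, a maintained implementation such as the `SMOTE` class in the imbalanced-learn package would normally be preferred.

```python
import math
import random
from collections import Counter


def smote_oversample(X, y, k=5, seed=0):
    """Oversample every class up to the size of the largest class.

    Simplified SMOTE-style sketch: each synthetic sample is a random
    interpolation between a sample and one of its k nearest neighbours
    from the same class.
    """
    rng = random.Random(seed)
    counts = Counter(y)
    target = max(counts.values())
    X_out, y_out = list(X), list(y)
    for label, n in counts.items():
        members = [x for x, lab in zip(X, y) if lab == label]
        if len(members) < 2:
            continue  # no same-class neighbour to interpolate with
        for _ in range(target - n):
            base = rng.choice(members)
            # k nearest same-class neighbours of the base point (excluding itself)
            neighbours = sorted(
                (m for m in members if m is not base),
                key=lambda m: math.dist(base, m),
            )[:k]
            other = rng.choice(neighbours)
            gap = rng.random()  # position along the segment base -> other
            X_out.append([b + gap * (o - b) for b, o in zip(base, other)])
            y_out.append(label)
    return X_out, y_out


# Toy data (hypothetical): class "A" has 4 samples, class "B" only 2.
X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [0.3, 0.3], [5.0, 5.0], [5.2, 4.9]]
y = ["A", "A", "A", "A", "B", "B"]
X_bal, y_bal = smote_oversample(X, y, k=1)
print(Counter(y_bal))
```

After oversampling, both toy classes contain four samples, and the two synthetic "B" points lie between the two original "B" points.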

Dataset

The medical histories of four hundred people were reviewed, all of whom had been diagnosed with one of several migraine-related conditions. Expert medical personnel from the Hospital Materno Infantil de Soledad collected the data. The patient’s name, address, insurance company, primary care physician, symptoms, age, patient ID, treatment plan, etc. were all stored in the dataset. This investigation, however, focuses only on clinical presentation; personally identifiable information about patients was not needed.

The dataset initially consisted of 400 patient records with 24 attributes. After data augmentation, the corpus size increased to 1447 patient records. The dataset was separated into training and testing sets: of the 1447 records, 1157 made up the training set, and the remaining 290 were used for testing. The dataset was significantly unbalanced before data augmentation, as shown in Fig. 3, whereas Fig. 4 demonstrates that the dataset is perfectly balanced after data augmentation. Figure 5 presents the age-wise distribution of patients. The dataset comprises 7 migraine classes: (a) Typical aura with migraine, (b) Migraine without aura, (c) Typical aura without migraine, (d) Familial hemiplegic migraine, (e) Sporadic hemiplegic migraine, (f) Basilar-type aura, and (g) Other.
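The reported split (1157 of 1447 records for training) corresponds to roughly 80% training and 20% testing. The text does not state how the records were assigned to each set, so the sketch below is only one plausible procedure: a stratified split that keeps the class proportions the same in both sets. The function name `stratified_split`, the `test_fraction` value, the seed, and the toy labels are assumptions for illustration.

```python
import random
from collections import Counter, defaultdict


def stratified_split(labels, test_fraction=0.2, seed=0):
    """Return (train_indices, test_indices), sampling each class separately."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    train_idx, test_idx = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_test = round(len(indices) * test_fraction)
        test_idx.extend(indices[:n_test])
        train_idx.extend(indices[n_test:])
    return sorted(train_idx), sorted(test_idx)


# Toy labels (hypothetical): 7 classes with 10 records each.
labels = [cls for cls in "abcdefg" for _ in range(10)]
train_idx, test_idx = stratified_split(labels)
test_counts = Counter(labels[i] for i in test_idx)
print(len(train_idx), len(test_idx), dict(test_counts))

# Ratio implied by the figures reported in the text.
reported_train_share = 1157 / 1447
print(round(reported_train_share, 3))
```

With these toy labels, each class contributes 2 of its 10 records to the test set, giving 56 training and 14 test indices.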

Classification models

After basic pre-processing and data augmentation, a set of machine learning classifiers (SVM, KNN, DST, and RF) and a deep neural network (DNN) model were evaluated on the migraine classification corpus. Figure 2 shows the basic architecture of the DNN. It comprises four layers in total: an input layer, two hidden layers, and a classification layer, designed for the migraine classification task.
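The four-layer structure described above can be sketched as a plain forward pass. This is an illustrative sketch with untrained random weights, not the authors' model: the input size of 23 (the number of symptom variables), 512 neurons in each of the two hidden layers, ReLU activations, a softmax output over 7 classes, and the weight initialization are assumptions made here, and only the overall layer layout follows the text.

```python
import math
import random

N_FEATURES, N_HIDDEN, N_CLASSES = 23, 512, 7  # assumed layer sizes


def dense(inputs, weights, biases):
    """Fully connected layer: one weighted sum plus bias per output neuron."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def relu(values):
    return [max(0.0, v) for v in values]


def softmax(values):
    peak = max(values)  # subtract the maximum for numerical stability
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def init_layer(n_in, n_out, rng):
    """Random weights (n_out rows of n_in values) and zero biases."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


rng = random.Random(0)
layers = [
    init_layer(N_FEATURES, N_HIDDEN, rng),  # input -> hidden 1
    init_layer(N_HIDDEN, N_HIDDEN, rng),    # hidden 1 -> hidden 2
    init_layer(N_HIDDEN, N_CLASSES, rng),   # hidden 2 -> classification layer
]


def forward(features):
    h1 = relu(dense(features, *layers[0]))
    h2 = relu(dense(h1, *layers[1]))
    return softmax(dense(h2, *layers[2]))


# One hypothetical patient record with 23 numeric symptom features.
probs = forward([rng.random() for _ in range(N_FEATURES)])
predicted_class = max(range(N_CLASSES), key=probs.__getitem__)
n_params = sum(len(w) * len(w[0]) + len(b) for w, b in layers)
print(predicted_class, n_params)
```

The output is a probability for each of the seven migraine classes; the predicted class is the index with the highest probability. Training the weights, for example by backpropagation, is outside the scope of this sketch.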

Experiments

This section describes an extensive set of experiments conducted on the dataset to validate the effectiveness of data augmentation.

Hyper-parameters

Table 1 describes the parameters that were used during the DNN experiments.

Table 1 Hyper-parameter used against various models.

Discussion

A corpus of 400 clinical records of patients with varied migraine-related pathologies was initially employed in this study. After applying the data augmentation technique, the number of patient records increased to 1447. The initial data were collected by skilled medical staff at the Hospital Materno Infantil de Soledad, Colombia, in early 2013. The experiments were run both before and after data augmentation, using the ML classifiers KNN, SVM, RF, DST, and DNN in each case. The dataset comprises patient ID, healthcare ID, physician ID, migraine symptoms, diagnosed disease, and treatment. However, our study used only the diagnosed disease and the migraine symptoms; identifiable data were not used in this paper. A total of 23 variables, such as age, family background, dizziness, and vomiting, were identified and selected to describe the symptoms that patients have during headaches. Additionally, one variable related to the diagnosis of the migraine type was selected. This diagnosis-related variable represents the migraine “Type” assigned by the treating physician based on the patient's medical history and symptoms, corresponding to one of the migraine types presented in Fig. 4.

The results were validated against the migraine type annotated by the human domain expert, using the measurements obtained from the classification experiments with the ML algorithms and the deep neural network (DNN). The training and validation accuracy of the DNN model is presented in Fig. 6. The highest accuracy was achieved by a deep neural network implemented with 512 hidden neurons in a dense layer, which reached an accuracy of 99.66% with data augmentation. Table 2 shows that the classifications produced by our proposed DNN with data augmentation matched those made by the domain expert (treating physician) in 97% of the 80 test instances. Due to its deep nature, the deep neural network achieves the highest accuracy in comparison with traditional ML models such as KNN, SVM, RF, and DST. As is well established, deep learning models are data-hungry and give better results when trained on considerably larger datasets.

In Table 3, the results of existing work on migraine classification are compared with the classification accuracy of our proposed model with data augmentation. The ML models and the DNN trained with our data augmentation strategy outperformed all the existing models. The data augmentation technique proved very effective, since the model’s performance increased by 2%.

Table 2 Classification Report of all used algorithms with and without data augmentation technique.

Table 3 Classification Report of all used algorithms with and without data augmentation technique.

Furthermore, in this research work, we applied various machine learning models and used the pandas library for preprocessing, including cleaning and removing outliers from the migraine dataset. Additionally, we evaluated various SVC kernels and preprocessing techniques for better performance. For model evaluation, we used different performance metrics, such as the classification report and the confusion matrix. The deep neural network outperformed all the traditional machine learning classifiers, with an accuracy score of 99.66% on the migraine dataset.
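To make the evaluation metrics concrete, the following minimal sketch computes a confusion matrix, overall accuracy, and per-class precision, recall, and F1 score from true and predicted labels. It uses only the Python standard library; the function names and the toy labels are assumptions for illustration and do not reproduce the study's results or the authors' evaluation code.

```python
def confusion_matrix(y_true, y_pred, labels):
    """Rows are true classes, columns are predicted classes."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for truth, pred in zip(y_true, y_pred):
        matrix[index[truth]][index[pred]] += 1
    return matrix


def accuracy(matrix):
    """Share of samples on the diagonal (correctly classified)."""
    total = sum(sum(row) for row in matrix)
    return sum(matrix[i][i] for i in range(len(matrix))) / total


def per_class_report(matrix, labels):
    """Precision, recall, and F1 score for each class."""
    report = {}
    for i, label in enumerate(labels):
        tp = matrix[i][i]
        fp = sum(row[i] for row in matrix) - tp  # predicted as i, but wrong
        fn = sum(matrix[i]) - tp                 # truly i, but missed
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        report[label] = {"precision": precision, "recall": recall, "f1": f1}
    return report


# Toy labels (hypothetical), using three of the migraine classes.
labels = ["Typical aura with migraine", "Migraine without aura", "Other"]
y_true = [labels[0], labels[0], labels[0], labels[1], labels[1], labels[2]]
y_pred = [labels[0], labels[0], labels[1], labels[1], labels[1], labels[2]]

matrix = confusion_matrix(y_true, y_pred, labels)
overall_accuracy = accuracy(matrix)
report = per_class_report(matrix, labels)
print(matrix)
print(round(overall_accuracy, 3))
```

In this toy case one "Typical aura with migraine" record is misclassified as "Migraine without aura", which lowers the recall of the first class and the precision of the second.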

Over the past few years, there has been a proliferation of automatic classification techniques based on machine learning. These include ANN, DT, logistic regression, Bayesian classifiers, nearest neighbors, support vector machines (SVMs), and multiple discriminant analysis^54,55. Table 3 lists the migraine-type classification results from previous studies that used accuracy as the evaluation measure. Compared with these earlier studies, our suggested data augmentation strategy using a deep neural network model with two hidden layers and 512 neurons achieved the best accuracy, 99.66%.

Conclusion

Migraine is a very common and complex neurovascular disorder. Medical doctors commonly use scales to identify migraine and classify it into its types. Because it is a common disease, affecting every second or third person, the ratio of patients to medical doctors is too high. In addition, underdeveloped and developing countries have a shortage of well-trained doctors, medical staff, and medical instruments. Therefore, machine learning models are needed to classify and predict migraine, since such models have shown state-of-the-art performance in text classification and automatic prediction tasks. In this study, we implemented four traditional machine learning algorithms (SVM, KNN, random forest, and DST) alongside a DNN. We used performance evaluation measures such as the confusion matrix, accuracy, and F1 score to compare the results of the proposed algorithms with existing studies. After the preprocessing steps, the proposed machine learning algorithms were trained on the publicly available corpus to classify migraine into its seven basic types. The results show that the DNN comprehensively outperforms the other traditional models applied.

Implications

The findings from this research on migraine diagnosis using machine learning (ML) have crucial implications for policymakers in the healthcare sector.

For policymakers: Extensive awareness campaigns are needed to overcome the difficulties in migraine diagnosis. Due to the limitations of subjective pain intensity ratings, migraine headache, a common and complex neurovascular disorder, presents considerable problems in clinical identification. It is critical to raise awareness of headaches among the general population because they have a profound influence on people’s brains, bodies, and overall functioning and have low diagnostic specificity.

Policymakers should give top priority to creating and implementing awareness campaigns that inform the public and healthcare professionals about the value of early detection, accurate diagnosis, and AI-based remedies. By raising awareness, policymakers can encourage people to seek prompt medical attention, make it easier to incorporate ML into healthcare systems, and eventually improve migraine management as a whole.

For researchers: Overall, this study demonstrates how machine learning models may be used to solve the difficulties in migraine diagnosis and categorization, particularly in areas with few medical resources.

Future work: Future developments can improve accuracy and change the migraine therapy landscape by utilizing cutting-edge algorithms and growing datasets.

Although the machine learning algorithms performed well, there is still room to improve performance in the future by using the latest transformer-based algorithms, such as Bidirectional Encoder Representations from Transformers (BERT). There is a dearth of publicly available datasets for migraine classification; therefore, we plan to build a new, comparatively large dataset for migraine classification to achieve higher accuracy.

Data availability

The migraine classification dataset is publicly available on the internet on various platforms such as Kaggle. The link to the dataset is: https://www.kaggle.com/datasets/weinoose/migraine-classification.

References

Hagen, K. et al. The epidemiology of headache disorders: A face-to-face interview of participants in hunt4. J. Headache Pain 19, 1–6 (2018).
Article Google Scholar
Yao, C. et al. Burden of headache disorders in china, 1990–2017: Findings from the global burden of disease study 2017. J. Headache Pain 20, 1–11 (2019).
Article Google Scholar
Takeshima, T. et al. Prevalence, burden, and clinical management of migraine in china, japan, and south Korea: A comprehensive review of the literature. J. Headache Pain 20, 1–15 (2019).
Article Google Scholar
Wu, Q. et al. Determining the efficacy and safety of acupuncture for the preventive treatment of menstrual migraine: A protocol for a prisma-compliant systematic review and meta-analysis. J. Pain Res. 16, 101–109 (2023).
Article ADS PubMed PubMed Central Google Scholar
Pacheco-Barrios, K. et al. Primary headache disorders in Latin America and the Aaribbean: A meta-analysis of population-based studies. Cephalalgia 43, 03331024221128265 (2023).
Article Google Scholar
Islam, J. et al. Modulation of trigeminal neuropathic pain by optogenetic inhibition of posterior hypothalamus in cci-ion rat. Sci. Rep. 13, 489 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Safiri, S. et al. The burden of Parkinson’s disease in the middle east and north Africa region, 1990–2019: Results from the global burden of disease study 2019. BMC Public Health 23, 107 (2023).
Article PubMed PubMed Central Google Scholar
Barral, E., Martins Silva, E., García-Azorín, D., Viana, M. & Puledda, F. Differential diagnosis of visual phenomena associated with migraine: Spotlight on aura and visual snow syndrome. Diagnostics 13, 252 (2023).
Article PubMed PubMed Central Google Scholar
Hansen, J. M. & Charles, A. Differences in treatment response between migraine with aura and migraine without aura: Lessons from clinical practice and rcts. J. Headache Pain 20, 1–10 (2019).
Article Google Scholar
Khanal, S. et al. A systematic review of economic evaluations of pharmacological treatments for adults with chronic migraine. J. Headache Pain 23, 122 (2022).
Article CAS PubMed PubMed Central Google Scholar
Cao, L. Data science: A comprehensive overview. ACM Comput. Surv. 50, 1–42 (2017).
Article CAS Google Scholar
Ashraf, N. et al. Multi-label emotion classification of URDU tweets. PeerJ Comput. Sci. 8, e896 (2022).
Article PubMed PubMed Central Google Scholar
Khan, L., Amjad, A., Ashraf, N., Chang, H.-T. & Gelbukh, A. Urdu sentiment analysis with deep learning methods. IEEE Access 9, 97803–97812 (2021).
Article Google Scholar
Khan, L., Amjad, A., Ashraf, N. & Chang, H.-T. Multi-class sentiment analysis of URDU text using multilingual Bert. Sci. Rep. 12, 5436 (2022).
Khan, L., Amjad, A., Afaq, K. M. & Chang, H.-T. Deep sentiment analysis using CNN-LSTM architecture of English and Roman Urdu text shared in social media. Appl. Sci. 12, 2694 (2022).
Amjad, A., Khan, L. & Chang, H.-T. Semi-natural and spontaneous speech recognition using deep neural networks with hybrid features unification. Processes 9, 2286 (2021).
Amjad, A., Khan, L., Ashraf, N., Mahmood, M. B. & Chang, H.-T. Recognizing semi-natural and spontaneous speech emotions using deep neural networks. IEEE Access 10, 37149–37163 (2022).
Amjad, A., Khan, L. & Chang, H.-T. Effect on speech emotion classification of a feature selection approach using a convolutional neural network. PeerJ Comput. Sci. 7, e766 (2021).
Amjad, A. & Khan, L. Data augmentation and deep neural networks for the classification of Pakistani racial speakers recognition. PeerJ Comput. Sci. 8, e1053 (2022).
Liu, F., Bao, G., Yan, M. & Lin, G. A decision support system for primary headache developed through machine learning. PeerJ 10, e12743 (2022).
Aggarwal, S. & Pandey, K. Early identification of PCOS with commonly known diseases: Obesity, diabetes, high blood pressure and heart disease using machine learning techniques. Expert Syst. Appl. 217, 119532 (2023).
Saini, A., Meitei, A. & Singh, J. Machine learning in healthcare: A review. In Proceedings of the International Conference on Innovative Computing & Communication (ICICC) (2021).
Tam, C. S. et al. Combining structured and unstructured data in EMRs to create clinically-defined EMR-derived cohorts. BMC Med. Inform. Decis. Mak. 21, 1–10 (2021).
Scheurwegs, E., Luyckx, K., Luyten, L., Daelemans, W. & Van den Bulcke, T. Data integration of structured and unstructured sources for assigning clinical codes to patient stays. J. Am. Med. Inform. Assoc. 23, e11–e19 (2016).
Akila, A., Parameswari, R. & Jayakumari, C. Big data in healthcare: Management, analysis, and future prospects. In Handbook of Intelligent Healthcare Analytics: Knowledge Engineering with Big Data Analytics 309–326 (2022).
Lutz, W. et al. Prospective evaluation of a clinical decision support system in psychological therapy. J. Consult. Clin. Psychol. 90, 90 (2022).
Gulati, S., Guleria, K. & Goyal, N. Classification of migraine disease using supervised machine learning. In 2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), 1–7 (IEEE, 2022).
Aslan, Z. Deep convolutional neural network-based framework in the automatic diagnosis of migraine. Circuits Syst. Signal Process. 42(5), 3054–3071 (2022).
Göker, H. Automatic detection of migraine disease from EEG signals using bidirectional long-short term memory deep learning model. Signal Image Video Process. 17(4), 1255–1263 (2022).
Sanchez-Sanchez, P. A., García-González, J. R. & Rúa Ascar, J. M. Automatic migraine classification using artificial neural networks. F1000Research 9, 618 (2020).
Zhu, B., Coppola, G. & Shoaran, M. Migraine classification using somatosensory evoked potentials. Cephalalgia 39, 1143–1155 (2019).
Yang, H., Zhang, J., Liu, Q. & Wang, Y. Multimodal MRI-based classification of migraine: Using deep learning convolutional neural network. Biomed. Eng. Online 17, 1–14 (2018).
Garcia-Chimeno, Y., Garcia-Zapirain, B., Gomez-Beldarrain, M., Fernandez-Ruanova, B. & Garcia-Monco, J. C. Automatic migraine classification via feature selection committee and machine learning techniques over imaging and questionnaire data. BMC Med. Inform. Decis. Mak. 17, 1–10 (2017).
Jindal, K. et al. Migraine disease diagnosis from EEG signals using non-linear feature extraction technique. In 2018 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), 1–4 (IEEE, 2018).
Chen, W.-T. et al. Migraine classification by machine learning with functional near-infrared spectroscopy during the mental arithmetic task. Sci. Rep. 12, 14590 (2022).
Sah, R. D., Sheetlani, J., Kumar, D. R. & Sahu, I. N. Migraine (headaches) disease data classification using data mining classifiers. J. Res. Env. Earth Sci. 3, 10–16 (2017).
Pagán, J. et al. Robust and accurate modeling approaches for migraine per-patient prediction from ambulatory data. Sensors 15, 15419–15442 (2015).
Chong, C. D. et al. Migraine classification using magnetic resonance imaging resting-state functional connectivity data. Cephalalgia 37, 828–844 (2017).
Celik, U., Yurtay, N. & Pamuk, Z. Migraine diagnosis by using artificial neural networks and decision tree techniques. AJIT-e Acad. J. Inform. Technol. 5, 79–90 (2014).
Ferroni, P. et al. Machine learning approach to predict medication overuse in migraine patients. Comput. Struct. Biotechnol. J. 18, 1487–1496 (2020).
Krawczyk, B., Simić, D., Simić, S. & Woźniak, M. Automatic diagnosis of primary headaches by machine learning methods. Open Med. 8, 157–165 (2013).
Chen, I. Y. et al. Ethical machine learning in healthcare. Ann. Rev. Biomed. Data Sci. 4, 123–144 (2021).
Akben, S. B., Tuncel, D. & Alkan, A. Classification of multi-channel EEG signals for migraine detection. Biomed. Res. 27, 743–748 (2016).
Akben, S. B., Subasi, A. & Tuncel, D. Analysis of repetitive flash stimulation frequencies and record periods to detect migraine using artificial neural network. J. Med. Syst. 36, 925–931 (2012).
Subasi, A., Ahmed, A., Aličković, E. & Hassan, A. R. Effect of photic stimulation for migraine detection using random forest and discrete wavelet transform. Biomed. Signal Process. Control 49, 231–239 (2019).
Casas Pulido, A. F., Hernandez Cely, M. M. & Rodriguez, O. M. H. Análisis experimental de flujo líquido-líquido en un tubo horizontal usando redes neuronales artificiales. Revista UIS Ingenierías 22, 49–56 (2023).
Dumkrieger, G., Chong, C. D., Ross, K., Berisha, V. & Schwedt, T. J. The value of brain MRI functional connectivity data in a machine learning classifier for distinguishing migraine from persistent post-traumatic headache. Front. Pain Res. 3, 1012831 (2023).
Nie, W., Zeng, W., Yang, J., Zhao, L. & Shi, Y. Classification of migraine using static functional connectivity strength and dynamic functional connectome patterns: A resting-state fMRI study. Brain Sci. 13, 596 (2023).
Marino, S. et al. Classifying migraine using PET compressive big data analytics of brain’s μ-opioid and D2/D3 dopamine neurotransmission. Front. Pharmacol. 14, 1173596 (2023).
Chawla, N. V., Bowyer, K. W., Hall, L. O. & Kegelmeyer, W. P. SMOTE: Synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002).
Uddin, S., Khan, A., Hossain, M. E. & Moni, M. A. Comparing different supervised machine learning algorithms for disease prediction. BMC Med. Inform. Decis. Mak. 19, 1–16 (2019).
Mitrović, K., Petrušić, I., Radojičić, A., Daković, M. & Savić, A. Migraine with aura detection and subtype classification using machine learning algorithms and morphometric magnetic resonance imaging data. Front. Neurol. 14, 1106612 (2023).
Kwon, J. et al. Machine learning-based automated classification of headache disorders using patient-reported questionnaires. Sci. Rep. 10, 14062 (2020).
Doupe, P., Faghmous, J. & Basu, S. Machine learning for health services researchers. Value Health 22, 808–815 (2019).
Waring, J., Lindvall, C. & Umeton, R. Automated machine learning: Review of the state-of-the-art and opportunities for healthcare. Artif. Intell. Med. 104, 101822 (2020).

Author information

Authors and Affiliations

Department of Computer Science, Ibadat International University Islamabad Pakpattan Campus, Pakpattan, Pakistan
Lal Khan
Department of Computer Science, Mir Chakar Khan Rind University, Sibi, Pakistan
Moudasra Shahreen & Syed Jamil Ahmed Shah
Centre for Lifelong Learning, Universiti Brunei Darussalam, Bandar Seri Begawan, Brunei Darussalam
Atika Qazi
Department of Agriculture, Mir Chakar Khan Rind University, Sibi, Pakistan
Sabir Hussain
Bachelor Program in Artificial Intelligence, Chang Gung University, Taoyuan, Taiwan
Hsien-Tsung Chang
Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan, Taiwan
Hsien-Tsung Chang
Department of Physical Medicine and Rehabilitation, Chang Gung Memorial Hospital, Taoyuan, Taiwan
Hsien-Tsung Chang

Contributions

All authors contributed equally.

Corresponding author

Correspondence to Hsien-Tsung Chang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Khan, L., Shahreen, M., Qazi, A. et al. Migraine headache (MH) classification using machine learning methods with data augmentation. Sci Rep 14, 5180 (2024). https://doi.org/10.1038/s41598-024-55874-0

Received: 26 May 2023
Accepted: 28 February 2024
Published: 02 March 2024
DOI: https://doi.org/10.1038/s41598-024-55874-0
