Self-report symptom-based endometriosis prediction using machine learning

Goldstein, Anat; Cohen, Shani

doi:10.1038/s41598-023-32761-8

Download PDF

Article
Open access
Published: 04 April 2023

Self-report symptom-based endometriosis prediction using machine learning

Anat Goldstein¹ &
Shani Cohen²

Scientific Reports volume 13, Article number: 5499 (2023) Cite this article

2201 Accesses
3 Citations
Metrics details

Subjects

Abstract

Endometriosis is a chronic gynecological condition that affects 5–10% of reproductive age women. Nonetheless, the average time-to-diagnosis is usually between 6 and 10 years from the onset of symptoms. To shorten time-to-diagnosis, many studies have developed non-invasive screening tools. However, most of these studies have focused on data obtained from women who had/were planned for laparoscopy surgery, that is, women who were near the end of the diagnostic process. In contrast, our study aimed to develop a self-diagnostic tool that predicts the likelihood of endometriosis based only on experienced symptoms, which can be used in early stages of symptom onset. We applied machine learning to train endometriosis prediction models on data obtained via questionnaires from two groups of women: women who were diagnosed with endometriosis and women who were not diagnosed. The best performing model had AUC of 0.94, sensitivity of 0.93, and specificity of 0.95. The model is intended to be incorporated into a website as a self-diagnostic tool and is expected to shorten time-to-diagnosis by referring women with a high likelihood of having endometriosis to further examination. We also report the importance and effectiveness of different symptoms in predicting endometriosis.

Machine learning algorithms as new screening approach for patients with endometriosis

Article Open access 12 January 2022

Clinical use of artificial intelligence in endometriosis: a scoping review

Article Open access 04 August 2022

Real world data on symptomology and diagnostic approaches of 27,840 women living with endometriosis

Article Open access 14 October 2021

Introduction

Endometriosis is a chronic gynecological condition that affects 5–10% of women of reproductive age^1,2. Women with endometriosis have endometrial-type tissue outside of the uterus^1,3. In exceptional cases, endometriosis lesions may reach organs distant from the pelvis such as the membranes of the lungs, heart, limbs, and brain. As a result, and in response to the substances that this tissue produces, the immune system is activated, and a chronic inflammatory process is triggered, leading to the formation of adhesions, scars, and cysts between the pelvic and abdominal organs. Endometriosis tissue can also penetrate various organs in the body, including the digestive and urinary systems, and attach to nerves^4,5.

Endometriosis is associated with a wide variety of symptoms such as pain, abnormal bleeding, gastrointestinal symptoms, urinary system problems, and even emotional effects^2,4,6. This variety, together with a lack of awareness, may explain the relatively long duration until the condition is typically diagnosed: currently, the average time-to-diagnosis of women suffering from the disease is about 6–10 years from symptom onset⁷.

Usually, an endometriosis diagnosis includes a pelvic exam, ultrasound imaging of reproductive organs, an MRI, and laparoscopy. These tests are expensive and invasive and require the involvement of a physician. The literature recognizes the need for non-invasive screening tools to simplify the diagnostic process and shorten time-to–diagnosis^8,9, and various studies have investigated the feasibility of several non-invasive tools. One example of such non-invasive indicators are biomarkers obtained from blood-tests^10,11,12,13. For example, Nisenblat et al.¹² reviewed works that combined non-invasive blood tests and transvaginal ultrasound to improve the diagnostic accuracy of pelvic endometriosis. However, they found that the accuracy obtained in those works was insufficient to replace laparoscopy. Another non-invasive tool whose effectiveness for endometriosis prediction has been studied is genomic data^{14,15,16,17,18}. Studies have identified several biomarker genes that are indicative of endometriosis¹⁴ and developed ML-based models for endometriosis prediction^14,15. The use of patient-reported symptoms is another non-invasive approach that has been investigated in previous studies. However, most of these studies have incorporated not only symptoms, but also imaging and clinical parameters, which are often available only in later diagnosis stages, are costly, and require the involvement of physician^5,19,20,21. In fact, in a review study, Surrey et al.¹⁹ found only one study that used a questionnaire based exclusively on patients’ self-reported symptoms²². This study applied multiple logistic regression to subfertile women undergoing laparoscopy and analyzed the associations between seven self-reported symptoms and endometriosis. However, only one symptom, period pain, was found to be significantly different between women with endometriosis and women with a normal pelvis.

In recent years, machine learning (ML) has been used as a promising approach for patient classification, with excellent results in various medical fields^{23,24,25,26,27}. ML has also been used for endometriosis prediction and diagnosis^{3,14,15,24,28,29}. Indeed, ML is promising because it facilitates the discovery of complex, non-linear relationships between a set of variables (such as patient characteristics or symptoms) and a target variable (such as the patient’s likelihood of having endometriosis). A recent review by Sivajohan et al.³ found 36 studies that applied ML in endometriosis prediction, diagnosis, and research. Only three of these studies^6,24,30 used self-report questionnaires to develop ML-based models for endometriosis prediction. However, in addition to symptoms experienced, these models also included clinical data, which were available since these studies focused on women who underwent or were scheduled for laparoscopy, that is, women who were in advanced diagnosis stages and could provide such data.

Our research, in contrast, aims to serve women who are only beginning their medical investigative journey and who have not yet received any test results or formal diagnosis. For this population, we develop an easy-to-use self-diagnostic tool based exclusively on self-reported symptoms, rather than on information that is available to women who went through medical investigation.

Thus, the main goal of the presented research is to develop an ML-based model that predicts the likelihood of having endometriosis based on patient-completed questionnaires, in which they report their experienced symptoms. Such a model is intended to serve as a preliminary tool for self-test, which women can take to provide them with indication or likelihood for having endometriosis. A second goal is to identify a sufficient subset of symptoms that are most relevant for endometriosis prediction.

Our investigation generated a set of 24 symptoms that were found to be most effective for endometriosis prediction. This model obtained sensitivity of 0.93 and specificity of 0.95 on holdout data. The developed model is intended to be incorporated into a website that offers women a questionnaire they can complete about the symptoms they experience and that returns their likelihood of having endometriosis. The model and the website are expected to shorten the currently long time-to-diagnosis. We also offer insights on the importance of the different symptoms and their effectiveness in predicting endometriosis.

Materials and methods

Data collection

To collect the data for our endometriosis prediction model, we distributed a survey (see Supplementary Information) via Facebook to women over the age of 18 who were and were not diagnosed with endometriosis. To reach women with endometriosis, we distributed the survey in Facebook groups dedicated to women who suffer from the disease. Members of these groups included women from Europe, United States, Australia, and Israel, however, no demographic information related to respondents’ age or ethnicity was recorded to maintain respondents’ anonymity.

The survey included 56 endometriosis symptoms that were compiled based on an extensive review of relevant literature^{2,5,7,19,20,21,22,30,31,32,33,34}. Respondents indicated (true/false) whether they experienced each symptom in the past month. Informed consent was obtained from all responders and that the study was approved by the ethics committee of Ariel University and performed in accordance with all relevant guidelines and regulations.

Descriptive statistics analysis

We started with model-free analysis to better understand the frequency of symptom occurrence in the two groups of women (diagnosed/undiagnosed). We used chi-square tests to investigate the differences between the frequencies of each symptom in the two groups. A large difference between the two groups in the occurrence rate of a symptom indicates the symptom’s predictive power of endometriosis.

Machine learning

We applied several ML algorithms to train multiple endometriosis prediction models. Specifically, we applied decision trees, Random Forest, Gradient Boosting Classifier (GBC), and Adaptive Boosting (AdaBoost). Besides generating predictions, these models also provide an importance analysis feature, which can be used to identify and remove non-contributing features from future surveys.

Model performance was evaluated using common ML metrics: accuracy, sensitivity (recall), specificity, precision, F1-score, and area under the ROC curve (AUC). To ensure significance of the results, we used a ten-fold cross-validation procedure.

Machine learning algorithms

We applied several ML algorithms to train four types of classification models:

Decision Tree classifier—This is a simple, tree-structured classifier, where internal nodes represent the features of a dataset, branches represent the decision rules, and each leaf node represents the outcome (class). The tree structure (organization of nodes) is determined based on the importance of the nodes using an attribute selection measure, such as information gain or Gini index^35,36. The model’s simplicity is both its weakness and its strength: On the one hand, this model is limited in its capacity to capture complex relationships between variables, yet on the other hand, its classification process is simple to interpret.
Random Forest classifier—This model generates a “forest” of decision trees, such that each tree is trained on a random subset of the features. The Random Forest model uses the entire collection of decision trees to classify a given sample, and eventually determines the classification output based on the trees’ majority vote, that is, the class that is the output of by most trees^37,38.
Gradient Boosting Tree classifier—This model is an ensemble of multiple decision trees (weak learners) that are added together to create a strong predictive model. In the training process of this model, trees are added to the model in an effort to minimize the error of the model, as in a gradient descent procedure. Gradient Boosting models are known to be effective at classifying complex datasets³⁹.
Adaptive Boosting (AdaBoost) classifier is a boosting technique used as an ensemble method. It is called adaptive because weights are reassigned to each sample such that higher weights are assigned to incorrectly classified samples⁴⁰.

Symptom importance analysis

Based on the descriptive statistics and feature (symptom) importance obtained by the trained models, we analyzed each symptom’s contribution to the model’s ability to correctly classify women. We also analyzed the correlation between the symptoms. A high correlation may indicate that a symptom is redundant. Because symptom values are binary (indicate whether a respondent does or does not experience the symptom), we use the Jaccard index⁴¹, which is commonly applied for measuring similarity between two binary datasets (in our case, representing symptom values). We analyzed the performance of the models after removing symptoms that are highly correlated with other symptoms (Jaccard Index close to 1).

To further analyze the importance of the different symptoms in the various types of models, we extracted from each model its feature importance ranking (we used the built-in ‘feature_importances_’ attribute of scikit-learn classifier classes). We then trained and tested each model using its first n important symptoms, where n = 1, 2, …, 56 (using ten-fold cross-validation), and compared each symptom’s contribution to the model’s performance, in order to identify the optimal set of symptoms.

Results

Descriptive statistics

In total, 886 responders completed the survey. Of these, 474 had a diagnosis of endometriosis and 412 had no diagnosis, that is, did not undergo a diagnostic procedure. We note that it is possible that some proportion of the undiagnosed women suffer from endometriosis but have not yet been diagnosed. Such respondents may introduce bias into our model and cause false negatives. Nevertheless, as the percentage of endometriosis is estimated between 5 and 10%, we expect such bias to be relatively small.

Table 1 presents descriptive statistics of the symptoms, including their frequency, that is, the percentage of women who suffer from each symptom (mean value) in each group (1—with endometriosis, 0—without), the absolute difference between the mean values, and the p-values (chi-square-test) indicating the significance of mean differences. Symptoms are listed in descending order of absolute mean differences. Table 1 also includes symptom importance according to an AdaBoost model.

Table 1 Descriptive statistics that indicate the importance of each symptom.

Full size table

Endometriosis classification models

Four types of classification models were trained: Decision Tree, Random Forest, Gradient Boosting and Adaptive Boosting (AdaBoost). Table 2 summarizes the performance of these models. To ensure significance, we used a ten-fold cross-validation procedure, and we report the mean and standard deviation (in parentheses) of the following performance metrics: recall (sensitivity), specificity, precision, F1-score, accuracy, and AUC.

Table 2 Classification models performance metrics.

Full size table

We find while all models demonstrate high performance, the AdaBoost achieves the best results with AUC and accuracy of 94%.

Symptom importance

Table 1 presents symptom occurrence frequency by group. A large difference in a symptom’s frequency between the two groups indicates that the symptom may be effective for an endometriosis diagnosis classification. The rows in Table 1 are sorted by the absolute difference between group means (frequencies) in descending order of symptoms’ importance for classification. Although, as seen in Table 1, all differences are statistically significant (all p-values are smaller than 0.01), the symptoms (features) at the bottom of the table may be non-contributing and may even cause overfitting of the models.

High correlations between symptoms may indicate redundancy. To identify symptoms that are highly correlated with other symptoms, we calculated the correlation between each pair of symptom values using the Jaccard Index. Figure 1 shows a heatmap of the Jaccard Index values, indicating the correlation between each pair of symptom values. In this figure, the yellow rows/columns indicate that the symptom is highly correlated with many other symptoms. We identified six symptoms that are highly correlated (Jaccard Index > 0.8) with more than 30% of the symptoms: fever, abnormal uterine bleeding, syncope (fainting, passing out), infertility, constant bleeding, and malaise/sickness. Five of these symptoms appear at the bottom of Table 1.

To investigate whether removing these potentially redundant features improves the models’ classification performance, we trained the models again without these six symptoms. Table 3 present the performance results of the different models. After removing the highly correlated symptoms, the performance of the Decision Tree model improved, whereas the performance of the remaining models diminished slightly.

Table 3 Classification models performance metrics, after excluding 6 variables.

Full size table

As discussed above, for each model type we also analyzed the effect of adding each symptom in the order of its importance based on the feature importance ranking derived from initial classification models (the models that were trained on the entire set of features, as shown in Table 2). Figure 2 demonstrates the improvement in the performance using AUC and F1-score (ten-fold cross-validation mean values) of the Decision Tree (a), Random Forest (b), Gradient Boosting Trees (c) and AdaBoost (d) models when adding features one by one.

These results provide insights on the performance of each model type and how performance changes when additional symptoms are added to the model. For example, we see that the Decision Trees model generates the best results (AUC of 0.898) when the model includes 14 symptoms, and adding additional symptoms hampers the model’s performance. In contrast, the performance of the Random Forest model improves as symptoms are added to the model, and provides the best performance with 55 symptoms (AUC of 0.938).

For each model type we selected the number of features (n) that yields the best AUC. We then trained each model using only that selected number of features, that is, the n most important features. Table 4 presents the performance metrics of each model (mean values and standard deviations of tenfold cross-validation).

Table 4 Performance metrics when including the first n important features of each model.

Full size table

The AdaBoost model remains the best performing model, with AUC of 93.9%. It is based on only 24 symptoms. Other symptom subsets selected on the basis of criteria other than feature importance, may yield better performance. However, because it is impossible to check all possible subsets, the method used here, based on feature importance, should be effective for identifying relevant subsets of features and for creating optimal models.

To further verify that no additional symptoms should be removed, we iteratively removed each feature, and then retrained and tested all the models. In all cases, performance metrics became worse.

The features included in the best performing model (the 24-feature AdaBoost model) are, in descending order of importance: heavy/extreme menstrual bleeding, irregular/missed periods, abnormal uterine bleeding, menstrual pain (dysmenorrhea), painful bowel movements, bowel pain, pelvic pain, IBS-like symptoms, painful cramps during period, fatigue/chronic fatigue, loss of appetite, constant bleeding, painful ovulation, hormonal problems, malaise, fever, cramping, bloating, painful/burning pain during intercourse (dyspareunia), extreme/severe pain, pain/chronic pain, ovarian cysts, fertility issues, and feeling sick.

Sample size adequacy

As a robustness check, to confirm that we used an adequate number of samples, we trained the 24-symptom AdaBoost model on different dataset sizes and measured model performance. Figure 3 shows the model’s AUC and F1-score (ten-fold cross-validation means) when trained on different dataset sizes. It shows that adding the dataset samples beyond 600 samples has little effect on the model’s performance and indicates that our sample size is sufficient.

Discussion and conclusion

In this study, we developed several classification models for endometriosis prediction, based exclusively on self-reported symptoms. We compared four types of classification models, namely, Decision Tree, Random Forest, Gradient Boosting Trees and AdaBoost, and showed that the AdaBoost model obtained the best results, with AUC, accuracy, and F1-score of 0.94; sensitivity of 0.93; and specificity of 0.95. We also applied multiple approaches to analyze the importance of each symptom and found that the best performing AdaBoost model is based on a subset of 24 of the original 56 symptoms.

While numerous studies developed questionnaire-based models and indices to predict or indicate endometriosis, these models include clinical parameters that were correlated with macroscopic/microscopic presence or absence of endometriosis^5,7,21,22. Other studies investigated the relationship between different symptoms and the likelihood of endometriosis, however most were unable to successfully predict whether a patient has endometriosis^22,31,34,42. For example, Forman et al.²² found that severe period pain (dysmenorrhea) was the single symptom found to be predictive of endometriosis, yet were unable to sufficiently distinguish women with endometriosis from women with a normal pelvis using the questionnaire used in their study. Calhaz-Jorge et al.³⁴ focused on subfertile women and found subfertility, dysmenorrhea, chronic pelvic pain, oral contraception use (ever), and obesity (inverse relationship) to be predictive of endometriosis.

Only few studies have employed ML to develop endometriosis prediction models based on self-reported symptoms. As discussed above, ML models can capture complex and non-linear relationships between a set of independent variables and a target variable and are thus expected to be effective for linking between sets of symptoms and endometriosis diagnosis. Most of the models developed in these studies were trained on data that were collected from women who had or were planned to have laparoscopy^6,7,20,24,30 and included information that is not available in the early phases of the diagnosis process. For example, Nnoaham et al.³⁰ included indications of past surgeries, ultrasound evidence, etc. Their model has sensitivity of 83% and specificity of 76%. Bendifallah et al.²⁴ also used patient history and treatment data and developed a model with sensitivity of 93% and specificity of 92% (no information on significance is provided). Chapron et al.⁶ also used previous surgery for endometriosis as a predictor. Their model has sensitivity of 75% and specificity of 69%. Yeung²⁰ in contrast, used only standard pain symptoms and quality-of-life questions. They studied women with chronic pelvic pain before surgery and developed a logistic regression model that had sensitivity of 80.5% and specificity of 57.7%.

Nevertheless, classification models that were trained on women in advanced diagnosis stages (e.g.,^6,7,20,24,30), are expected not to work well when applied to the general population of women at reproductive age. First, these models are expected to learn to give less weight to symptoms experienced by all women who started medical investigation– whether they were eventually diagnosed with endometriosis or not. For example, an ML model that was trained on women with chronic pelvic pain before surgery, will give less weight to the pelvic pain symptom, whereas for women in the general population pelvic pain is considered a common symptom, which strongly differentiates women with and without endometriosis. Second, because these models also rely on data that were obtained during the diagnosis process (e.g., results of a laparoscopy) and are unavailable to women in the early stage of the diagnostic process, these models may falsely classify women as not having endometriosis because they are missing this information. Thus, a model that intends to serve women who have not yet begun a medical investigation and will be applied to the general population of women, should be trained exclusively on experienced symptoms and only on data that are available to women who are at that point in their medical journey.

Two recent studies developed endometriosis classification models based on symptom-only questionnaires^2,7. Chapron et al.⁷ applied multiple logistic regressions on pain symptoms and patient data obtained through pre-surgery interviews to predict endometriosis at different stages of the condition, and showed that patient questionnaires can be used to identify women at high risk of endometriosis (sensitivity of 91% for a highly sensitive model and sensitivity of 73% and specificity of 75% in a model that maximizes both sensitivity and specificity). Fauconnier et al.² used a 21-symptom questionnaire on women with endometriosis confirmed by histology, asymptomatic women, and women without endometriosis diagnosis who suffer from pain/infertility. They applied binary logistic regression analysis to predict endometriosis and obtained AUC of 92%.

Similarly to these studies, in our study we also developed models for predicting the likelihood of endometriosis based only on symptoms experienced (or not experienced) by women. Our study differs from these studies in two main respects: First, it uses tree ensemble models, which are able to capture complex and non-linear relationships between the variables. Second, we used Facebook to collect data rather than patient interviews. While this is a convenient way to collect data and allowed us to collect responses from almost 1000 women within a few months, it gives us less information on the respondents.

The developed models, and in particular the 24-feature AdaBoost model, can be self-administered by women who suffer from symptoms and are at the beginning their diagnostic investigation to discover the likelihood that their symptoms are caused by endometriosis. It should, however, be noted that as our models are trained on women who were clinically diagnosed with endometriosis and on women who were not diagnosed (rather than who were clinically found not to have endometriosis), our models may be biased by women who have endometriosis yet were not diagnosed. Nevertheless, since this may affect only a small percentage of the non-diagnosed group, the effect on the models’ classification performance is expected to be relatively small and the best performing model is expected to identify most of those women who have endometriosis. Moreover, had we tested the models on women with a positive or negative clinical diagnosis of endometriosis, the models’ performance would have been even better, as false positive samples would have become true positives.

It should also be noted that our data did not include information on respondents’ demographics (e.g., ethnicity, geographic location, and age) and thus our models did not account for these variables. Future research should validate these models on different populations. Future research should also investigate the effectiveness of the models (i.e., their predictive power) for women at different stages of diagnosis and account for additional variables that are available to women who have not started a medical investigation, such as use of contraception and hormones.

To summarize, the contribution of our study is threefold. First, we developed a questionnaire for self-reporting of endometriosis symptoms based on 56 symptoms that are commonly found in the literature. Second, we analyzed the importance of these symptoms for endometriosis prediction. We also analyzed the frequency of each symptom in the group of women with endometriosis, compared to the frequency in the general population. We further identified a subset of symptoms that provided the highest endometriosis prediction accuracy. Third, we developed a model that is able to predict endometriosis in the general population of women with high accuracy (94%), based on a subset of 24 self-reported symptoms. The developed model is expected to shorten time-to-diagnosis, which is currently 6 to 10 years from symptom onset. Furthermore, the developed model is intended to be incorporated into a website that women can use to self-test themselves and discover their likelihood of suffering from endometriosis. This website is intended to refer women to conduct further examinations for endometriosis at an early stage in the diagnostic investigation.

Data availability

The data and code used in the current study are available from the corresponding author upon reasonable request.

References

Taylor, H. S., Kotlyar, A. M. & Flores, V. A. Endometriosis is a chronic systemic disease: Clinical challenges and novel innovations. Lancet 397, 839–852 (2021).
Article CAS PubMed Google Scholar
Fauconnier, A. et al. Early identification of women with endometriosis by means of a simple patient-completed questionnaire screening tool: A diagnostic study. Fertil. Steril. 116, 1580–1589 (2021).
Article PubMed Google Scholar
Sivajohan, B. et al. Clinical use of artificial intelligence in endometriosis: A scoping review. NPJ Dig. Med. 5, 109 (2022).
Article Google Scholar
Murphy, A. A. Clinical aspects of endometriosis. Ann. N. Y. Acad. Sci. 955, 1–10 (2002).
Article ADS PubMed Google Scholar
Eskenazi, B. et al. Validation study of nonsurgical diagnosis of endometriosis. Fertil. Steril. 76, 929–935 (2001).
Article CAS PubMed Google Scholar
Chapron, C. et al. Presurgical diagnosis of posterior deep infiltrating endometriosis based on a standardized questionnaire. Hum. Reprod. 20, 507–513 (2005).
Article CAS PubMed Google Scholar
Chapron, C. et al. A new validated screening method for endometriosis diagnosis based on patient questionnaires. Eclinicalmedicine 44, 101263 (2022).
Article PubMed PubMed Central Google Scholar
Duffy, J. M. N. et al. Top 10 priorities for future infertility research: An international consensus development study. Hum. Reprod. 35, deaa342 (2020).
Google Scholar
Horne, A. W., Saunders, P. T. K., Abokhrais, I. M., Hogg, L. & Appendix, E. P. S. P. S. G. Top ten endometriosis research priorities in the UK and Ireland. Lancet 389, 2191–2192 (2017).
Article PubMed Google Scholar
Dutta, M. et al. A metabonomics approach as a means for identification of potential biomarkers for early diagnosis of endometriosis. Mol. Biosyst. 8, 3281–3287 (2012).
Article CAS PubMed Google Scholar
Wang, L., Zheng, W., Mu, L. & Zhang, S. Identifying biomarkers of endometriosis using serum protein fingerprinting and artificial neural networks. Int. J. Gynecol. Obstet. 101, 253–258 (2008).
Article CAS Google Scholar
Nisenblat, V. et al. Combination of the non-invasive tests for the diagnosis of endometriosis. Cochrane Db. Syst. Rev. 2016, CD012281 (2016).
Google Scholar
Nisenblat, V. et al. Blood biomarkers for the non-invasive diagnosis of endometriosis. Cochrane Db. Syst. Rev. 2016, CD012179 (2016).
Google Scholar
Akter, S. et al. Machine learning classifiers for endometriosis using transcriptomics and methylomics data. Front. Genet. 10, 766 (2019).
Article CAS PubMed PubMed Central Google Scholar
Akter, S. et al. GenomeForest: An ensemble machine learning classifier for endometriosis. AMIA Jt. Summits Transl. Sci. Proc. 2020, 33–42 (2020).
PubMed PubMed Central Google Scholar
Li, B., Wang, S., Duan, H., Wang, Y. & Guo, Z. Discovery of gene module acting on ubiquitin-mediated proteolysis pathway by co-expression network analysis for endometriosis. Reprod. Biomed. Online 42, 429–441 (2021).
Article CAS PubMed Google Scholar
Bouaziz, J. et al. How artificial intelligence can improve our understanding of the genes associated with endometriosis: Natural language processing of the pubmed database. Biomed. Res. Int. 2018, 6217812 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fassbender, A. et al. Combined mRNA microarray and proteomic analysis of eutopic endometrium of women with and without endometriosis. Hum. Reprod. 27, 2020–2029 (2012).
Article CAS PubMed Google Scholar
Surrey, E. et al. Patient-completed or symptom-based screening tools for endometriosis: A scoping review. Arch. Gynecol. Obstet. 296, 153–165 (2017).
Article PubMed PubMed Central Google Scholar
Yeung, P., Bazinet, C. & Gavard, J. A. Development of a symptom-based, screening tool for early-stage endometriosis in patients with chronic pelvic pain. J. Endometr. Pelvic Pain Disord. 6, 174–189 (2014).
Article Google Scholar
Fasciani, A. et al. Endometriosis index: A software-derived score to predict the presence and severity of the disease. J. Endometr. Pelvic Pain Disord. 2, 79–86 (2010).
Google Scholar
Forman, R. G., Robinson, J. N., Mehta, Z. & Barlow, D. H. Patient history as a simple predictor of pelvic pathology in subfertile women. Hum. Reprod. 8, 53–55 (1993).
Article CAS PubMed Google Scholar
Raphaeli, O. et al. Feeding intolerance as a predictor of clinical outcomes in critically ill patients: A machine learning approach. Clin. Nutr. Espen 46, S546–S547 (2021).
Article Google Scholar
Bendifallah, S. et al. Machine learning algorithms as new screening approach for patients with endometriosis. Sci. Rep.-UK 12, 639 (2022).
Article ADS CAS Google Scholar
Rajkomar, A., Dean, J. & Kohane, I. Machine learning in medicine. New Engl. J. Med. 380, 1347–1358 (2019).
Article PubMed Google Scholar
Adler, E. D. et al. Improving risk prediction in heart failure using machine learning. Eur. J. Heart Fail. 22, 139–147 (2020).
Article PubMed Google Scholar
Islam, Md. M. et al. Breast cancer prediction: A comparative study using machine learning techniques. SN Comput. Sci. 1, 290 (2020).
Article Google Scholar
Urteaga, I., McKillop, M. & Elhadad, N. Learning endometriosis phenotypes from patient-generated data. NPJ Dig. Med. 3, 88 (2020).
Article Google Scholar
Kleczyk, E. J. et al. Predicting endometriosis onset using machine learning algorithms. NPJ Dig. Med. https://doi.org/10.21203/rs.3.rs-135736/v1 (2020).
Article Google Scholar
Nnoaham, K. E. et al. Developing symptom-based predictive models of endometriosis as a clinical screening tool: Results from a multicenter study. Fertil. Steril. 98, 692-701.e5 (2012).
Article PubMed PubMed Central Google Scholar
Ballard, K., Lane, H., Hudelist, G., Banerjee, S. & Wright, J. Can specific pain symptoms help in the diagnosis of endometriosis? A cohort study of women with chronic pelvic pain. Fertil. Steril. 94, 20–27 (2010).
Article PubMed Google Scholar
Abdulai, A.-F. et al. Developing an educational website for women with endometriosis-associated dyspareunia: Usability and stigma analysis. JMRI Hum. Fact. https://doi.org/10.2196/31317 (2022).
Article Google Scholar
World Endometriosis Research Foundation WHSS Questionnaire. https://www.endometriosisfoundation.org/WERF-WHSS-Questionnaire-English.pdf (2022).
Calhaz-Jorge, C., Mol, B. W., Nunes, J. & Costa, A. P. Clinical predictive factors for endometriosis in a Portuguese infertile population. Hum. Reprod. 19, 2126–2131 (2004).
Article CAS PubMed Google Scholar
Kumar, P. & Kumar, D. Decision tree classifier: A detailed survey. Int. J. Inf. Decis. Sci. 12, 246–269 (2020).
Google Scholar
Safavian, S. R. & Landgrebe, D. A survey of decision tree classifier methodology. IEEE Trans. Syst. Man Cybern. 21, 660–674 (1991).
Article MathSciNet Google Scholar
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
Article MATH Google Scholar
Biau, G. & Scornet, E. A random forest guided tour. TEST 25, 197–227 (2016).
Article MathSciNet MATH Google Scholar
Natekin, A. & Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 7, 21 (2013).
Article PubMed PubMed Central Google Scholar
Schapire, R. E. & Freund, Y. Boosting: Foundations and algorithms. Kybernetes 42, 164–166 (2013).
Article MATH Google Scholar
Fletcher, S. & Islam, M. Z. Comparing sets of patterns with the Jaccard index. Austral. J. Inf. Syst. 2017, 22 (2017).
Google Scholar
Hackethal, A. et al. A structured questionnaire improves preoperative assessment of endometriosis patients: A retrospective analysis and prospective trial. Arch. Gynecol. Obstet. 284, 1179–1188 (2011).
Article PubMed Google Scholar

Download references

Author information

Authors and Affiliations

Department of Industrial Engineering and Management, Ariel University, 65 Ramat HaGolan St., Ariel, Israel
Anat Goldstein
Department of Computer Science, Ariel University, 65 Ramat HaGolan St., Ariel, Israel
Shani Cohen

Authors

Anat Goldstein
View author publications
You can also search for this author in PubMed Google Scholar
Shani Cohen
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.C. conceived the study and collected the data. A.G. and S.C. analyzed the data and developed the models. A.G. wrote the manuscript. S.C. and A.G. read and agreed to the submitted version.

Corresponding author

Correspondence to Anat Goldstein.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Goldstein, A., Cohen, S. Self-report symptom-based endometriosis prediction using machine learning. Sci Rep 13, 5499 (2023). https://doi.org/10.1038/s41598-023-32761-8

Download citation

Received: 23 November 2022
Accepted: 01 April 2023
Published: 04 April 2023
DOI: https://doi.org/10.1038/s41598-023-32761-8

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.