Clinical use of artificial intelligence in endometriosis: a scoping review

Sivajohan, Brintha; Elgendi, Mohamed; Menon, Carlo; Allaire, Catherine; Yong, Paul; Bedaiwy, Mohamed A.

doi:10.1038/s41746-022-00638-1

Review Article
Open access
Published: 04 August 2022

npj Digital Medicine volume 5, Article number: 109 (2022)

Abstract

Endometriosis is a chronic, debilitating, gynecologic condition with a non-specific clinical presentation. Globally, patients can experience diagnostic delays of ~6 to 12 years, which significantly hinders adequate management and places a significant financial burden on patients and the healthcare system. Through artificial intelligence (AI), it is possible to create models that can extract data patterns to act as inputs for developing interventions with predictive and diagnostic accuracies that are superior to conventional methods and current tools used in standards of care. This literature review explored the use of AI methods to address different clinical problems in endometriosis. A total of 1309 unique records were found across four databases; among those, 36 studies met the inclusion criteria. Studies were eligible if they involved an AI approach or model to explore endometriosis pathology, diagnostics, prediction, or management and if they reported evaluation metrics (sensitivity and specificity) after validating their models. Only articles accessible in English were included in this review. Logistic regression was the most popular machine learning method, followed by decision tree algorithms, random forest, and support vector machines. Approximately 44.4% (n = 16) of the studies analyzed the predictive capabilities of AI approaches in patients with endometriosis, while 47.2% (n = 17) explored diagnostic capabilities, and 8.33% (n = 3) used AI to improve disease understanding. Models were built using different data types, including biomarkers, clinical variables, metabolite spectra, genetic variables, imaging data, mixed methods, and lesion characteristics. Regardless of the AI-based endometriosis application (either diagnostic or predictive), pooled sensitivities ranged from 81.7 to 96.7%, and pooled specificities ranged between 70.7 and 91.6%.
Overall, AI models displayed good diagnostic and predictive capacity in detecting endometriosis using simple classification scenarios (i.e., differentiating between cases and controls), showing promising directions for AI in assessing endometriosis in the near future. This timely review highlighted an emerging area of interest in endometriosis and AI. It also provided recommendations for future research in this field to improve the reproducibility of results and comparability between models, and further test the capacity of these models to enhance diagnosis, prediction, and management in endometriosis patients.

Introduction

Endometriosis is a chronic, gynecologic condition¹ estimated to affect 190 million women worldwide². This benign, but often debilitating condition is thought to impact ~10% of women based on extrapolations of pelvic pain and subfertility in the general population³ and of those that are symptomatic, the prevalence is thought to be 30% to 50%⁴. True prevalence rates are difficult to estimate because this condition is often underreported, undiagnosed or misdiagnosed¹. In Canada, the national societal burden of endometriosis is estimated at CAD $1.8 billion annually based on treatment costs, caregiver costs, quality of life and work absenteeism⁵. Endometriosis poses a large economic and disease burden on society and the precise scope of the problem remains unknown.

Endometriosis is characterized by extrauterine growth of endometrial-like tissue in areas of the pelvis (e.g., ovaries), bowels, bladder, and peritoneum⁶. Rarely, these growths are found in the thoracic region and other organ systems^7,8. Endometriosis has three predominant phenotypes: superficial endometriosis, endometriomas and deep endometriosis (DE)^8,9. There are many staging systems for endometriosis, including the American Society for Reproductive Medicine classification system: stage I (minimal), stage II (mild), stage III (moderate), and stage IV (severe)^10,11. However, given the complexity of this disease, it is difficult to universally stage and characterize under the present systems. Significant research has been done in recent years in attempts to elucidate the pathogenesis of this disease and many etiological factors are currently being explored including immune-mediated, inflammatory, genetic and environmental components^12,13.

The signs and symptoms of this disease are non-specific and can vary in severity, creating clinical heterogeneity, which adds to the diagnostic difficulty associated with this disease⁸. Patients can present with a range of symptomatology depending on the type of endometriosis, location of implants, stage, and severity including but not limited to dysmenorrhea, dyspareunia, abdominal pain, chronic pelvic pain, menorrhagia, bowel symptoms, urinary symptoms, and subfertility or infertility⁸. Due to the combination of non-specific symptoms, a long differential list, lack of provider awareness, unnecessary investigations, and a lack of non-invasive diagnostic tools, many patients experience significant delays in receiving an endometriosis diagnosis^1,14,15,16. The current literature has documented diagnostic delays of up to 6 to 12 years globally before patients receive a definitive diagnosis and adequate management^1,17,18. Currently, the gold standard diagnostic procedure for endometriosis remains laparoscopic visualization of lesions followed by histologic confirmation of ectopic endometriotic implants⁸, a costly and invasive process that requires a skilled clinician. Transvaginal ultrasonography is a commonly used clinical technique in endometriosis screening and diagnosis, given its non-invasive nature and widespread accessibility⁸.

In the past 5 years, the emergence of artificial intelligence (AI) has spread rapidly into healthcare; it has demonstrated marked potential in disease diagnostics, treatments, and a higher-level analysis of large biomedical datasets^19,20. With the increase in digitization in healthcare, AI presents novel opportunities to decrease the amount of time required for diagnosis and to streamline care in many settings¹⁹. Machine learning (ML) is a subset of AI and includes common methods such as logistic regression with the use of training and test sets and support vector machines (SVMs)¹⁹. Currently, AI has been used to analyze multi-omics, clinical, behavioral/wellness, environmental and research and developmental data¹⁹, and it has been applied to decision-making, patient self-management, triage, understanding disease mechanisms, and drug discovery^21,22. However, AI methods require an expert’s oversight to help inform the model’s development since clinical problems are often complex and multifaceted¹⁹. Additionally, the privacy and the security of patient data remain a consideration when introducing new technology into healthcare; thus researchers should be aware of any risks associated with AI models¹⁹.

From fetal heart monitoring to reproductive medicine, AI technologies have been used in the field of obstetrics and gynecology and have demonstrated the potential to significantly aid in prediction of outcomes^22,23,24,25. Given the diversity of its use in the clinical context, there is great potential to apply AI to the complex challenges presented by endometriosis and improve non-invasive diagnostics to reduce the delays and human error associated with diagnosis²². However, clinicians face significant challenges in the field of AI applications including a widespread lack of understanding about different AI methods and the competencies and limitations of AI technologies²¹. This review examines the different ways AI methods have been applied to solve pressing issues in endometriosis diagnostics, prediction, and research as shown in Fig. 1. By providing a thorough understanding of the different models and their application to clinical problems, and by analyzing their strengths and limitations, recommendations will be provided to help future researchers adequately develop AI models to advance the field of endometriosis.

**Fig. 1: Potential area of use for artificial intelligence applications in endometriosis.**

Results

Study selection

A total of 1309 titles were identified by searching the PubMed, Medline-OVID, EMBASE, and CINAHL databases, and 115 full texts were eligible for screening after studies were excluded during the title and abstract-screening stages. Of these, 79 papers were excluded based on our exclusion criteria and 36 studies were included in the final review (Fig. 2). A summary of the eligible studies and extracted study characteristics is shown in Table 1. Most studies used retrospective designs (n = 20) with data from large clinical databases and registries, and the remainder used prospective designs (n = 16); no randomized studies were included. Sample sizes ranged from 26 patients with endometriosis²⁶ to 1396 symptomatic patients²⁷, with an average sample size of 245 individuals for studies exploring diagnosis and prediction in endometriosis.

Table 1 Description of the studies.

Study characteristics

In the field of endometriosis, AI utilization spanned three overarching categories: predicting outcomes in endometriosis populations, building diagnostic models, and improving research efficacy. Most interventions were developed to assist with prediction of endometriosis in patients. However, the type, stage and specific characteristics of endometriosis that these interventions predicted differed among the studies, depending on the research question generated by the authors. Approximately 44.4% (n = 16) of the studies analyzed the predictive capabilities of AI approaches in patients with endometriosis, while 47.2% (n = 17) explored diagnostic capabilities. The predictive capabilities differed between studies but included many aims such as predicting fertility therapy success in endometriosis patients, the likelihood of endometriosis versus other pelvic pain pathologies, predicting the presence of DE, and many more as seen in Table 1. Only 8.33% (n = 3) of the studies used AI technologies to advance the understanding of disease pathophysiology^28,29,30. The AI methods that were used included: logistic regression, K-nearest neighbor, Naïve Bayes, random forest, decision tree, SVMs, neural networks, classification tree analysis, genetic algorithm, least squares support vector machines (LSSVMs), partial least squares discriminant analysis (PLSDA), margin tree classification, quick classifier algorithm, quadratic discriminant analysis (QDA), natural language processing (NLP), principal component analysis (PCA), adaptive boosting, eXtreme gradient boosting, voting classifier (hard/soft), deep learning and new ensemble ML classifiers. However, logistic regression (n = 15) was the AI intervention that was most frequently used to build predictive and diagnostic models.
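
The category shares quoted above follow directly from the study counts. A minimal Python sketch of that arithmetic, using only the counts given in the text (the dictionary keys are illustrative labels, not the authors' terminology):

```python
# Study counts by purpose, as reported in the text.
counts = {"predictive": 16, "diagnostic": 17, "disease understanding": 3}
total = sum(counts.values())  # 36 included studies

# Share of each category as a percentage, rounded to one decimal place.
shares = {purpose: round(100 * n / total, 1) for purpose, n in counts.items()}

for purpose, share in shares.items():
    print(f"{purpose}: {counts[purpose]}/{total} = {share}%")
```

The three categories sum to the 36 included studies, so the percentages are mutually consistent.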

The types of inputs used in different AI models varied among the studies. Four studies used biomarkers as the specific inputs for their final predictive model, but the types of biomarkers differed, including angiogenic factors, cytokines, serum microRNA signatures, and other metabolite biomarkers. Some studies also used metabolite spectra as inputs for their AI models (n = 10); however, there was significant diversity in the spectrometry method (e.g., Raman spectrometry versus hydrogen nuclear magnetic resonance [1H-NMR] Carr-Purcell-Meiboom-Gill [CPMG] spectrometry) and in the specific mass-to-charge ratio (m/z, mass divided by charge number) peak ranges that were used among the studies. Other studies also used genetic variables such as large transcriptomics datasets (n = 5) and clinical factors (n = 6) as inputs for their final models. The clinical factors that were used in different models demonstrated some similarity, with age, history of pelvic surgery, dysmenorrhea, and pelvic pain being commonly used variables. However, many studies used different combinations, thresholds and classifiers for these variables in their models. For instance, various combinations of severe dysmenorrhea, primary dysmenorrhea, and secondary dysmenorrhea were used in different ML models.

Although the AI approaches were heterogeneous, most models generally achieved sensitivity and specificity above 85%, as demonstrated in Table 1. All of the diagnostic and predictive studies (n = 33) validated their AI models, either with a form of cross-validation (e.g., bootstrapping, leave-one-out cross-validation) or with a validation/test cohort not used in the initial training set. Table 1 also reports on sensitivity and specificity for the models.
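
To make the validation terminology concrete, the following Python sketch runs leave-one-out cross-validation and then computes sensitivity and specificity from the held-out predictions. It is an illustration only: the single-threshold "model", the function names, and the marker values are all invented here and do not come from any included study.

```python
from statistics import mean

def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels (1 = case, 0 = control)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)

def fit_threshold(xs, ys):
    """Toy model: midpoint between the mean marker value of cases and of controls."""
    case_mean = mean(x for x, y in zip(xs, ys) if y == 1)
    control_mean = mean(x for x, y in zip(xs, ys) if y == 0)
    return (case_mean + control_mean) / 2

def leave_one_out_predictions(xs, ys):
    """Predict each sample with a threshold fitted on all the other samples."""
    preds = []
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        threshold = fit_threshold(train_x, train_y)
        preds.append(1 if xs[i] > threshold else 0)
    return preds

# Hypothetical marker values (assumed higher in cases); invented for illustration.
xs = [2.1, 2.4, 3.0, 3.3, 5.0, 5.2, 5.9, 6.1, 3.9, 4.6]
ys = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0]

preds = leave_one_out_predictions(xs, ys)
se, sp = sensitivity_specificity(ys, preds)
print(f"sensitivity = {se:.2f}, specificity = {sp:.2f}")
```

Because each sample is scored by a model that never saw it, the resulting metrics estimate performance on unseen patients rather than fit to the training data; a separate holdout cohort serves the same purpose.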

Given the heterogeneity in the purpose of the AI intervention, type and stage of endometriosis being examined, type of AI methodology used, and evaluation metrics, the included studies were grouped into six categories based on the inputs used to create the AI models. These categories are discussed in detail below.

Diagnostic or predictive models for endometriosis using biomarkers

Four different studies^{31,32,33,34,35} examined the use of biomarkers as inputs to create diagnostic or predictive AI models in endometriosis populations. As seen in Table 2, the type of biomarkers used differed among the studies. Knific et al.³¹ was the only study that used protein ratios while others used metabolites³³, miRNAs³⁵ and other biomarkers³⁴. Knific et al.³¹ and Bendifallah et al.³⁵ were the only studies in this category to use the random-forest method to develop a diagnostic model for endometriosis. The accuracy of Knific et al.’s³¹ model was reported to be 59%, the lowest accuracy of all the models in this category, while Bendifallah et al.’s³⁵ model performed considerably better, with a sensitivity (SE) and specificity (SP) of 96.8% and 100%, respectively. One study used LSSVMs³⁴ and the accuracy of this method was 79% with an SE and SP of 82% and 75%, respectively. One study also used SVMs to develop a diagnostic model for endometriosis using lipidomic profiling of endometrial fluid in patients with ovarian endometriosis³³. The accuracy of this method was reported to be 85.7% with an SE and SP of 58.3% and 100%, respectively. It should be noted that among the four studies that were examined, there were no commonalities in the specific biomarker inputs used; thus, it is difficult to compare the accuracy of each AI model given the differences in the inputs used. The pooled SE and SP for each study’s most accurate model were 85.6% and 85%, respectively^33,34,35.
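
Accuracy is a prevalence-weighted average of sensitivity and specificity, which explains how a model can report 85.7% accuracy alongside a sensitivity of only 58.3%. The Python sketch below solves that relationship for the fraction of cases in the evaluation set. It assumes the three reported figures describe the same evaluation set, which this review does not state; the function name is illustrative.

```python
def implied_case_fraction(accuracy, sensitivity, specificity):
    """Solve accuracy = se * p + sp * (1 - p) for p, the fraction of cases."""
    if sensitivity == specificity:
        raise ValueError("case fraction is not identifiable when SE equals SP")
    return (accuracy - specificity) / (sensitivity - specificity)

# Figures reported for the lipidomic SVM model in the paragraph above.
p = implied_case_fraction(accuracy=0.857, sensitivity=0.583, specificity=1.0)
print(f"implied fraction of cases in the evaluation set: {p:.3f}")
```

Under that assumption roughly a third of the evaluated samples were cases, so the perfect specificity on the larger control group dominates the accuracy figure. This is one reason accuracy alone is a weak basis for comparing models across studies with different case mixes.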

Table 2 Diagnostic and predictive models built using biomarkers.

Diagnostic or predictive models for endometriosis using protein spectra

Ten studies^{26,36,37,38,39,40,41,42,43,44} used various metabolite spectra as their primary inputs to develop diagnostic and predictive models in endometriosis populations. In this specific problem formulation, it is important to note the methodology that is used. The most popular method to determine metabolite spectra for model development was surface-enhanced laser desorption/ionization time-of-flight mass spectrometry, which was used by four studies^26,41,43,44. The pooled SE for the models with highest accuracy in each study was 91.7%, while the pooled SP was 81.1%^{26,37,38,39,40,41,42,43,44}. Table 3 presents the other methods of spectrometry and spectroscopy that were used to determine the metabolite spectra of interest for the model inputs.
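
This section does not say how the pooled values were computed. Two common conventions give different answers, as the Python sketch below shows: summing confusion-matrix counts across studies (larger studies weigh more) or averaging the per-study values (every study weighs the same). The counts are hypothetical and are not taken from any included study.

```python
def pooled_by_counts(studies):
    """Pool by summing confusion-matrix counts, so larger studies weigh more."""
    tp = sum(s["tp"] for s in studies)
    fn = sum(s["fn"] for s in studies)
    tn = sum(s["tn"] for s in studies)
    fp = sum(s["fp"] for s in studies)
    return tp / (tp + fn), tn / (tn + fp)

def pooled_by_mean(studies):
    """Pool by averaging per-study values, so every study weighs the same."""
    se = [s["tp"] / (s["tp"] + s["fn"]) for s in studies]
    sp = [s["tn"] / (s["tn"] + s["fp"]) for s in studies]
    return sum(se) / len(se), sum(sp) / len(sp)

# Hypothetical counts, invented for illustration only.
studies = [
    {"tp": 18, "fn": 2, "tn": 15, "fp": 5},
    {"tp": 45, "fn": 5, "tn": 80, "fp": 20},
    {"tp": 8, "fn": 2, "tn": 9, "fp": 1},
]

se_counts, sp_counts = pooled_by_counts(studies)
se_mean, sp_mean = pooled_by_mean(studies)
print(f"count-pooled: SE = {se_counts:.3f}, SP = {sp_counts:.3f}")
print(f"mean-pooled:  SE = {se_mean:.3f}, SP = {sp_mean:.3f}")
```

Reporting which convention was used matters most when sample sizes differ widely between studies, as they do in this category.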

Table 3 Diagnostic and predictive models built using protein spectra.

Among the studies in this category, artificial neural networks (ANNs) were the most popular method, used in three of the models^26,38,44. However, although these three studies used the same type of AI intervention, the inputs varied greatly between them. Two studies used PLSDA to compute their final models^36,42, albeit using different methodologies (mass spectroscopy³⁶ and 1H-NMR spectrophotometer⁴²). While the inputs also varied between both models, they had similar correct classification rates of 84%³⁶ and 86.67%⁴². Further studies between similar inputs are needed to determine if PLSDA is an appropriate AI intervention to compute diagnostic and predictive models in endometriosis populations.

Diagnostic or predictive models for endometriosis using clinical variables and symptoms

Six studies^{45,46,47,48,49,50} grouped in this category strongly preferred using logistic regression; two studies^50,51 used decision tree methods to build a model and one study⁵⁰ also used random forest, eXtreme gradient boosting and voting classifier (soft/hard) ML algorithms as shown in Table 4. Interestingly, many studies in this category examined predictive and diagnostic model capabilities in patients with some form of deep endometriosis (n = 5). The pooled SE for the models with highest accuracy in each study was 81.7% while the pooled SP was 91.6%^47,48,49,50. Specific inputs into each model varied as seen in previous categories, with Bendifallah et al.⁵⁰ using the largest number of clinical features for their models. However, there were some commonalities in the types of inputs that were used in each model. Patient age was the most frequently used input (n = 5) in diagnostic and predictive models using clinical variables. Given that endometriosis most commonly presents in reproductive-aged women, it is not surprising that age is the most frequent input in a diagnostic/predictive AI model. Other significant inputs included the presence or severity of dysmenorrhea, presence or severity of dyspareunia, visual analog scale for dyspareunia, infertility, and previous surgery for endometriosis or pelvic surgery. Among the studies that did report SE and SP metrics, the SE values ranged from 51% to 95% and SP values ranged from 77.1% to 95.7%^47,48,49,50.

Table 4 Diagnostic and predictive models built using clinical variables and symptoms.

Diagnostic or predictive models for endometriosis using genetic variables

Models that were built using genetic variables as their primary inputs used a significantly larger number of inputs than any of the other input categories referenced in this review. Only five studies^{52,53,54,55,56} used genetic variables to build their predictive and diagnostic models; however, the type of input varied between individual gene candidates^52,56, large protein-coding gene datasets from transcriptomics and methylomics data^53,55, and 16S rRNA gene amplicon data⁵⁴. The AI methods used in this category included: deep ML algorithm, decision tree, GenomeForest (a new ensemble ML classifier), random-forest-based ML classification analysis, PLSDA, SVM, random forest, and margin tree classification. The pooled SE for the models with highest accuracy in each study was 96.7%, while the pooled SP was 70.7%^52,53,55.

Two studies compared the use of large transcriptomics and methylomics datasets to build different AI models that were compared with each other^53,55. As seen in Table 5, regardless of which AI method was used, the models built using the transcriptomics dataset outperformed the models built with the methylomics dataset, albeit marginally. Akter⁵³ used GenomeForest, a novel ensemble technique based on chromosomal partitioning, to classify endometriosis and control samples using both transcriptomics and methylomics datasets. The authors concluded that this new classifier could help identify candidate biomarkers for endometriosis; they further demonstrated that three different ML models (GenomeForest, decision tree, and Biosigner) independently identified NOTCH3 as a candidate gene with differential expression in the endometriosis samples^53,55. ML methods may be of particular use when analyzing very large genomic datasets to help identify candidate genes that have altered expression in endometriosis patients versus control samples.

Table 5 Diagnostic and predictive models built using genetic variables.

Diagnostic or predictive models for endometriosis using mixed variables

Three studies^27,57,58 used mixed variable types to create predictive or diagnostic models for endometriosis as shown in Table 6. All three studies used logistic regression as the methodology to construct models and the sample sizes ranged from 119 patients⁵⁷ to 1396 patients²⁷. Inputs included clinical variables collected from patient medical history, physical exam findings, ultrasonography evidence, and MRI visualization. It should be noted that Chattot et al.⁵⁷ had the smallest sample size. The study with the largest sample size²⁷ reported a SE and SP of 82.6% and 75.8%, respectively. The accuracy for studies in this category was relatively consistent compared to other categories with similar SE and SP.

Table 6 Diagnostic and predictive models built using mixed variables.

Diagnostic or predictive models for endometriosis using imaging

Only three studies^59,60,61 explored the use of imaging variables as their primary inputs for their AI models as seen in Table 7. Guerriero⁵⁹ built models specifically for rectosigmoid endometriosis and compared the accuracy of the different AI methods using the same inputs for each model. This specific study allows one to draw conclusions about the accuracy of different methodologies in developing predictive models to increase suspicion for rectosigmoid endometriosis. The Naïve Bayes and SVM approaches produced the models with the highest accuracy (75%) in this study and K-nearest neighbor produced the lowest accuracy (69%). SVM also produced the highest SE at 84% while Naïve Bayes and decision tree showed the highest SP (77%). The pooled SE for the models with highest accuracy in each study was 88% while the pooled SP was 89.7%^59,60,61.

Table 7 Diagnostic and predictive models built using imaging.

Reid et al.⁶⁰ also produced two logistic regression models using different imaging variables; the accuracy of both models was higher than that of the logistic regression model produced by Guerriero et al.⁵⁹, indicating that perhaps the inputs for Reid’s model⁶⁰ played a role in the higher accuracy, SE and SP. All three studies in this category explored “sliding sign” on transvaginal ultrasound as an important feature in their models.

Maicus et al.⁶¹ was the only study to use a deep learning model called Resnet (2 + 1)D to classify the state of the pouch of Douglas with regards to adhesions indicative of endometriosis in patients. Their model was trained, internally validated, and externally tested on a dataset to evaluate the sliding sign on ultrasound, demonstrating an accuracy of 88.8%.

Discussion

In the field of endometriosis, AI interventions have proven to be heterogeneous in terms of their purpose, methodology, input selection and accuracy. Given the wide range of problems that exist in the field of endometriosis diagnosis, prediction and research, it is not surprising that models were built to tackle many different problem formulations. This study performed a thorough scoping review on the literature intersecting endometriosis and AI, and it provides a timely understanding of AI technology in the field of endometriosis. A meta-analysis of the data was not possible due to the diverse nature of studies included in this scoping review. Our study identified six major categories of model inputs that were used to build AI interventions in addition to three studies that used AI methods to improve research techniques^28,29,30 and one study that only used lesion characteristics to build a predictive model⁶². Of the six major input categories, biomarkers, clinical variables, genetic variables and metabolite spectra were the most frequently used input types for building diagnostic and predictive AI models.

AI interventions that were built using biomarker inputs included diagnostic and predictive models for ultrasound-negative endometriosis³⁴, and ovarian endometriomas³³. Biomarker inputs for these models included plasma biomarkers collected in all phases of the menstrual cycle³⁴, lipidomic profiling of endometrial fluid³³, and serum miRNA markers³⁵. AI interventions built using metabolite spectra as their primary input included detecting endometriosis in serum samples^43,44, screening for biomarkers in eutopic endometrium²⁶, diagnosing ultrasound-negative endometriosis⁴⁰, diagnosing endometriosis using messenger RNA expression in endometrium biopsies⁴¹, identifying predictive serum biomarkers⁴², diagnosing and staging endometriosis using peptide profiling³⁹, determining classifier metabolites for early prediction risk³⁸, and diagnosing stage 3 and stage 4 endometriosis in infertile patients³⁶. Studies that used genetic variables to build AI interventions included classifying endometriosis using RNA-seq and enrichment-based DNA-methylation datasets⁵³, diagnosing endometriosis using gut and/or vaginal microbiome profiles⁵⁴, using transcriptomics or methylomics to classify endometriosis⁵⁵, and staging pelvic endometriosis using genomic data⁵⁶. Some studies also used clinical signs and symptoms collected when obtaining a patient’s medical history as well as other clinical variables to build models. These AI interventions included predicting the presence of posterior deep endometriosis in patients with chronic pelvic pain symptoms⁴⁹, predicting pregnancy rates in patients with endometriosis⁴⁸, predicting medical care decision rules for patients with recurrent endometriomas⁵¹, diagnosing DE pre-operatively for patients with endometriomas⁴⁷ and differentiating between patients with and without endometriosis⁵⁰.

Our scoping review was able to evaluate the current literature and map out the field of study to demonstrate that AI applications in endometriosis look promising for improving diagnostics, research efficacy and outcome prediction in this patient population. Pooled SE ranged between 81.7 and 96.7% and pooled SP ranged between 70.7 and 91.6%. Our review included a range of heterogeneous study designs, large retrospective analyses, various ML interventions and diverse research questions in the field of endometriosis. This is a timely review providing clinicians and computer scientists with an extensive understanding of AI applications in endometriosis. Clinical decision-making by humans is often prone to errors, biases and heuristics⁶³. However, this review shows strong promise for AI’s ability to mitigate these human errors and provide superior outcome prediction with high SE and SP. Although many of the studies included in this review relied on a human component for data analysis/collection and determining feature extraction, AI technologies (especially when using standardized and validated models) may present the potential to reduce diagnostic error that can result from individual practicing biases and clinical heuristics. Future studies with human comparators are required to determine this. This review also demonstrated how AI can be used to improve research efficacy, particularly through the use of natural language processing²⁸ and identification of potential biomarkers³⁰ and diseases²⁹ associated with endometriosis pathophysiology. Lastly, this scoping review adds to future recommendations for research in this field and supports the need for standardized guidelines for ML applications in medicine.

Approximately 44.4% (n = 16) of AI interventions were predictive models meant to predict various outcomes in patients with endometriosis or undifferentiated symptomatic patients. Models were built to predict the presence of posterior DE in patients with chronic pelvic pain⁴⁹, the clinical pregnancy rate in patients with endometriosis⁴⁸, and many other outcomes in this patient population. However, many of these studies were conducted retrospectively and they did not adequately compare the AI’s ability to outperform existing decision tools and clinical diagnostics. Additionally, none of the studies involved a human comparator (since many models were trained and validated on retrospectively diagnosed patient datasets), which makes it difficult to comment on AI’s superiority as a tool clinicians can use for predictive modeling.

The type and stage of endometriosis varied among the included studies; thus, the AI approaches to prediction and diagnosis also differed. This makes it difficult to compare AI models used in the studies. Many studies lacked detailed information on the methods used to verify patients with endometriosis with regards to a reference standard, while others cited gold standard laparoscopic visualization with subsequent histopathologic confirmation as the modality of diagnosis. Additionally, the heterogeneity of the study designs, input data used, and AI interventions, made it difficult to compare the accuracy and efficacy of the different models. Many studies lacked transparent descriptions of their modeling making it difficult to critique methodology and determine if the right AI model was being used to predict the outcome in question.

Applying AI to assess endometriosis is relatively new, and most AI methods used are still relatively simple. Various data types continue to be explored; however, to date, each data type has generally been used on its own. As can be seen from the tables, the use of protein spectra continues to be perhaps the most common approach, but generally only with small sample sizes. In the future, the increasing adoption of AI in assessing endometriosis will also likely play an essential role in women’s healthcare.

Our recommendations, based on this review and challenges of employing AI, are as follows:

1. The types and stages of endometriosis included in the study sample need to be clearly defined, and models should specify what type/stage of endometriosis they are built to predict, classify or diagnose.
2. The gold standard (the reference against which the AI model is compared) has to be defined and justified to assess reliability.
3. The evaluation metrics (e.g., sensitivity and specificity) need to be tested and reported clearly.
4. Transparent descriptions of the AI model used are needed for reproducibility.
5. Multiple AI models should be applied to determine the most accurate one for specific outcomes and diagnostic goals.
6. A large sample size with a diverse age group is required to achieve generalizability.
7. Training and testing phases need to be clearly explained, specifically stating whether cross-validation or holdout is implemented.
8. Logistic regression models incorporating a training and test/validation cohort would be more effective in establishing external validation of the model; and
9. Studies using retrospective analyses of large clinical datasets to build models should attempt to validate their models in prospective controlled clinical trials. Controlled clinical trials are required to determine whether AI can outperform human decision-making and remove any potential biases. Although internal validation samples are essential to test a model’s performance, these models should also be tested through prospective controlled trials to ensure that they are generalizable in a clinical context and that their performance is not limited to an artificial set of parameters.
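
Several of these recommendations amount to a reporting checklist. One way a study team might track them is sketched below in Python. The `ModelReport` class, its field names, and the example values are hypothetical and are not part of the review or of any published reporting standard.

```python
from dataclasses import dataclass, fields
from typing import Optional

@dataclass
class ModelReport:
    """Hypothetical reporting record; field names are illustrative only."""
    endometriosis_type_or_stage: Optional[str] = None  # recommendation 1
    reference_standard: Optional[str] = None           # recommendation 2
    sensitivity: Optional[float] = None                # recommendation 3
    specificity: Optional[float] = None                # recommendation 3
    model_description: Optional[str] = None            # recommendation 4
    sample_size: Optional[int] = None                  # recommendation 6
    validation_scheme: Optional[str] = None            # recommendation 7

def missing_items(report):
    """Return the names of checklist fields that have not been filled in."""
    return [f.name for f in fields(report) if getattr(report, f.name) is None]

# Invented example values; not drawn from any included study.
example = ModelReport(
    endometriosis_type_or_stage="deep endometriosis",
    reference_standard="laparoscopy with histologic confirmation",
    sensitivity=0.82,
    specificity=0.76,
    sample_size=200,
)
print(missing_items(example))
```

A record like this makes gaps visible before submission; here the model description and the validation scheme are still unreported.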

Of the 36 studies included in this review, 50% were published in the last 5 years, indicating that there is recent and rapidly growing interest in AI applications to improve diagnostic, predictive and research capabilities for a complex disease such as endometriosis. Further research should be conducted using human comparators and should include comparisons with existing scoring systems and diagnostic tools to determine AI’s superiority for predictive and diagnostic modeling in endometriosis. These AI algorithms should also be externally validated or tested through prospective controlled trials to ensure that they contribute to advancing real-world clinical practice and diagnostics. This review was able to identify this interest in AI and highlight the benefits and shortcomings of AI interventions to improve future models for endometriosis.

Methods

Study guidelines

Given the heterogeneity and breadth of research in this field, a scoping review was performed to summarize the use of AI applications in endometriosis research, diagnostics, and prediction to help identify gaps in knowledge and address broad research questions⁶⁴. The guidelines of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses for Scoping Review (PRISMA-ScR)⁶⁵ and Arksey and O’Malley’s recommendations for scoping review methodology⁶⁶ were followed. An a priori review protocol was drafted using the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Protocols⁶⁷ for internal use within the research team, but it was not externally published or prospectively registered.

Search strategy and study eligibility

The PubMed, Medline-OVID, EMBASE, and CINAHL databases were searched sequentially from January 2000 to March 2022 for all English-language papers using the following search strategy (adapted for each database): [(Endometriosis) OR (Endometrioma)] AND [(AI) OR (ML) OR (Prediction Model) OR (Classification)]. Gray literature was not included in this scoping review in an attempt to include only peer-reviewed studies. This timeframe was chosen to reflect advances in AI technologies and applications in medicine. The scope of the search was not restricted to a particular type or stage of endometriosis. The search for this scoping review was completed in March 2022.

Inclusion and exclusion criteria

The following inclusion criteria were used to determine study eligibility for this review: (1) the study involved assessing an AI approach or model to advance prediction, diagnosis, management, or disease understanding in the field of endometriosis; (2) the study reported a quantitative metric on the accuracy/performance of the AI method; (3) the study was conducted using humans; (4) the article was accessible in English; and (5) the study used a validation method to test its model. Studies were excluded if they: (1) were not conducted using humans; (2) did not assess or evaluate an AI approach or model; (3) did not pertain to the field of endometriosis; or (4) developed a logistic regression model without the use of a training and test/validation set. One reviewer (B.S.) conducted the literature search, and two reviewers (B.S. and M.E.) independently screened the titles, abstracts, and full texts for potentially eligible studies. Reference lists of eligible studies were also hand-searched, but no additional studies were included on this basis.

Study selection and data extraction

One author (B.S.) conducted the literature search, and two authors (B.S. and M.E.) independently screened the titles and abstracts for potentially eligible studies. Each study considered for inclusion underwent full-text screening and was assessed to extract study-specific information and data; Table 1 presents a summary of the title, lead author, publication year, study design, AI intervention, purpose/aim, sample size, type of inputs used in the AI method, specific inputs in the final model, evaluation metrics used, and AI accuracy. The two reviewers (B.S. and M.E.) independently conducted the full-text screening and extracted information from potentially eligible studies. They then cross-checked the identified studies to determine eligibility through discussion and resolved discrepancies by consensus. The information collated in the initial evidence table was used to aggregate data and determine the main themes of AI use in endometriosis in the currently published literature. Where studies explored more than one AI model, the model with the highest accuracy was assessed and included in the review.

Pooled evaluation metric

Pooled sensitivities and specificities were calculated for studies within the same input category. The following formula⁶⁸ was used to combine means across studies, where SE or SP is the pooled mean sensitivity or specificity:

$$\mathrm{SE}\ \mathrm{or}\ \mathrm{SP} = \frac{N_1 X_1 + N_2 X_2 + \cdots}{N_1 + N_2 + \cdots} \tag{1}$$

where, for example, N₁ is the number of participants in study 1 and X₁ is the value of the reported sensitivity or specificity in study 1.
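
The pooling in Eq. (1) is a sample-size-weighted mean, which can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' own code: the function name `pooled_metric` and the two example studies are hypothetical.

```python
def pooled_metric(sample_sizes, values):
    """Sample-size-weighted mean of per-study sensitivities or specificities (Eq. 1)."""
    if not sample_sizes or len(sample_sizes) != len(values):
        raise ValueError("sample_sizes and values must be non-empty and equally long")
    total = sum(sample_sizes)
    if total <= 0:
        raise ValueError("total number of participants must be positive")
    # Numerator: N_1*X_1 + N_2*X_2 + ...; denominator: N_1 + N_2 + ...
    return sum(n * x for n, x in zip(sample_sizes, values)) / total


# Hypothetical example: two studies with 50 and 150 participants
sizes = [50, 150]
sensitivities = [0.90, 0.80]

pooled = pooled_metric(sizes, sensitivities)
print(f"pooled sensitivity = {pooled:.3f}")
```

In this hypothetical example the larger study pulls the pooled value toward its own sensitivity, (50 × 0.90 + 150 × 0.80) / 200 = 0.825, rather than toward the unweighted average of 0.85.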

Data availability

The authors declare that all data supporting the findings of this study are available within the paper.

References

1. Nnoaham, K. E. et al. Impact of endometriosis on quality of life and work productivity: a multicenter study across ten countries. Fertil. Steril. 96, 366 (2011).
2. Zondervan, K. T., Becker, C. M. & Missmer, S. A. Endometriosis. N. Engl. J. Med. 382, 1244–1256 (2020).
3. Shafrir, A. L. et al. Risk for and consequences of endometriosis: a critical epidemiologic review. Best. Pract. Res. Clin. Obstet. Gynaecol. 51, 1–15 (2018).
4. Barbieri, R. L. Etiology and epidemiology of endometriosis. Am. J. Obstet. Gynecol. 162, 565–567 (1990).
5. Levy, A. R. et al. Economic burden of surgically confirmed endometriosis in Canada. J. Obstet. Gynaecol. Can. 33, 830–837 (2011).
6. Practice bulletin no. 114: Management of endometriosis. Obstet. Gynecol. 116, 223–236 (2010).
7. Johnson, N. P. et al. World Endometriosis Society consensus on the classification of endometriosis. Hum. Reprod. 32, 315–324 (2017).
8. Zondervan, K. T. et al. Endometriosis. Nat. Rev. Dis. Prim. 4, 9 (2018).
9. International working group of AAGL, ESGE, ESHRE and WES et al. An international terminology for endometriosis. J. Minim. Invasive Gynecol. 28, 1849–1859 (2021).
10. Canis, M. et al. Revised American Society for Reproductive Medicine classification of endometriosis. Fertil. Steril. 67, 817–821 (1997).
11. Gruppo Italiano per lo Studio dell’Endometriosi. Relationship between stage, site and morphological characteristics of pelvic endometriosis and pain. Hum. Reprod. 16, 2668–2671 (2011).
12. Zondervan, K. T., Cardon, L. R. & Kennedy, S. H. The genetic basis of endometriosis. Curr. Opin. Obstet. Gynecol. 13, 309–314 (2001).
13. Mihalyi, A. et al. Role of immunologic and inflammatory factors in the development of endometriosis: indications for treatment strategies. Clin. Pract. 2, 623 (2005).
14. Gao, X. et al. Economic burden of endometriosis. Fertil. Steril. 86, 1561–1572 (2006).
15. Kennedy, S. et al. ESHRE guideline for the diagnosis and treatment of endometriosis. Hum. Reprod. 20, 2698–2704 (2005).
16. Chiaffarino, F. et al. Endometriosis and irritable bowel syndrome: a systematic review and meta-analysis. Arch. Gynecol. Obstet. 303, 17–25 (2021).
17. Matsuzaki, S. et al. Relationship between delay of surgical diagnosis and severity of disease in patients with symptomatic deep infiltrating endometriosis. Fertil. Steril. 86, 1314–1316 (2006).
18. Prast, J. et al. Costs of endometriosis in Austria: a survey of direct and indirect costs. Arch. Gynaecol. 288, 569–576 (2013).
19. Wang, F. & Preininger, A. AI in health: state of the art, challenges, and future directions. Yearb. Med. Inform. 28, 016–026 (2019).
20. Wang, R. et al. Artificial intelligence in reproductive medicine. Reproduction 158, R139–R154 (2019).
21. Chen, M. & Decary, M. Artificial intelligence in healthcare: An essential guide for health leaders. Healthc. Manag. Forum 33, 10–18 (2020).
22. Yoldemir, T. Artificial intelligence and women’s health. Climacteric 23, 1–2 (2020).
23. Siristatidis, C. & Pouliakis, A. Artificial Intelligence in IVF: a need. Syst. Biol. Reprod. Med. 57, 179–185 (2011).
24. Lutomski, J. E., Meaney, S., Greene, R. A., Ryan, A. C. & Devane, D. Expert systems for fetal assessment in labour. Cochrane Database Syst. Rev. 4 https://doi.org/10.1002/14651858.CD010708 (2015).
25. Elgendi, M., Allaire, C., Williams, C., Bedaiwy, M. A. & Yong, P. J. Machine learning revealed new correlates of chronic pelvic pain in women. Front. Digit. Health 2, 600604 (2020).
26. Wang, L. et al. Identification biomarkers of eutopic endometrium in endometriosis using artificial neural networks and protein fingerprinting. Fertil. Steril. 93, 2460–2462 (2010).
27. Nnoaham, K. E., Hummelshoj, L., Kennedy, S. H., Jenkinson, C. & Zondervan, K. T. Developing symptom-based predictive models of endometriosis as a clinical screening tool: Results from a multicenter study. Fertil. Steril. 98, 692–701 (2012).
28. Bouaziz, J. et al. How artificial intelligence can improve our understanding of the genes associated with endometriosis: natural language processing of the pubmed database. BioMed Res. Int. https://doi.org/10.1155/2018/6217812 (2018).
29. Lee, J. H., Kwon, S. Y., Chang, J. & Yuk, J. S. Machine learning approach to find the relation between endometriosis, benign breast disease, cystitis and non-toxic goiter. Sci. Rep. 9, 1–7 (2019).
30. Matta, K. et al. Associations between persistent organic pollutants and endometriosis: A multipollutant assessment using machine learning algorithms. Environ. Pollut. 260, 114066 (2020).
31. Knific, T. et al. Multiplex analysis of 40 cytokines do not allow separation between endometriosis patients and controls. Sci. Rep. 9, 1–12 (2019).
32. Cosar, E. et al. Serum microRNAs as diagnostic markers of endometriosis: a comprehensive array-based analysis. Fertil. Steril. 106, 402–409 (2016).
33. Domínguez, F. et al. Lipidomic profiling of endometrial fluid in women with ovarian endometriosis. Biol. Reprod. 96, 772–779 (2017).
34. Vodolazkaia, A. et al. Evaluation of a panel of 28 biomarkers for the non-invasive diagnosis of endometriosis. Hum. Reprod. 27, 2698–2711 (2012).
35. Bendifallah, S. et al. MicroRNome analysis generates a blood-based signature for endometriosis. Sci. Rep. 12, 4051 (2022).
36. Braga, D. P. A. F. et al. Metabolomic profile as a noninvasive adjunct tool for the diagnosis of grades III and IV endometriosis-related infertility. Mol. Reprod. Dev. 86, 1044–1052 (2019).
37. Parlatan, U. et al. Raman spectroscopy as a non-invasive diagnostic technique for endometriosis. Sci. Rep. 9, 1–7. https://doi.org/10.1038/s41598-019-56308-y (2019).
38. Ghazi, N. et al. 1H NMR-based metabolomics approaches as non-invasive tools for diagnosis of endometriosis. Int J. Reprod. BioMed. 14, 1–8 (2016).
39. Wang, L., Liu, H. Y., Shi, H. H., Lang, J. H. & Sun, W. Urine peptide patterns for non-invasive diagnosis of endometriosis: a preliminary prospective study. Eur. J. Obstet. Gynecol. Reprod. Biol. 177, 23–28 (2014).
40. Fassbender, A. et al. Proteomics analysis of plasma for early diagnosis of endometriosis. Obstet. Gynecol. 119, 276–285 (2012).
41. Fassbender, A. et al. Combined mRNA microarray and proteomic analysis of eutopic endometrium of women with and without endometriosis. Hum. Reprod. 27, 2020–2029 (2012).
42. Dutta, M. et al. A metabonomics approach as a means for identification of potential biomarkers for early diagnosis of endometriosis. Mol. Biosyst. 8, 3281–3287 (2012).
43. Wölfler, M. M. et al. Mass spectrometry and serum pattern profiling for analyzing the individual risk for endometriosis: promising insights? Fertil. Steril. 91, 2331–2337 (2009).
44. Wang, L., Zheng, W., Mu, L. & Zhang, S. Z. Identifying biomarkers of endometriosis using serum protein fingerprinting and artificial neural networks. Int. J. Gynecol. Obstet. 101, 253–258 (2008).
45. Vesale, E. et al. Predictive approach in managing voiding dysfunction after surgery for deep endometriosis: a personalized nomogram. Int. Urogynecol. J. 32, 1205–1212 (2021).
46. Benoit, L. et al. Predicting the likelihood of a live birth for women with endometriosis-related infertility. Eur. J. Obstet. Gynecol. Reprod. Biol. 242, 56–62 (2019).
47. Lafay Pillet, M. C. et al. A clinical score can predict associated deep infiltrating endometriosis before surgery for an endometrioma. Hum. Reprod. 29, 1666–1676 (2014).
48. Ballester, M. et al. Nomogram to predict pregnancy rate after ICSI-IVF cycle in patients with endometriosis. Hum. Reprod. 27, 451–456 (2012).
49. Chapron, C. et al. Presurgical diagnosis of posterior deep infiltrating endometriosis based on a standardized questionnaire. Hum. Reprod. 20, 507–513 (2005).
50. Bendifallah, S. et al. Machine learning algorithms as new screening approach for patients with endometriosis. Sci. Rep. 12, 639 (2022).
51. Wang, Y. F. et al. Mining medical data: A case study of endometriosis. J. Med. Syst. 37, 9899 (2013).
52. Li, B., Wang, S., Duan, H., Wang, Y. & Guo, Z. Discovery of gene module acting on ubiquitin-mediated proteolysis pathway by co-expression network analysis for endometriosis. Reprod. BioMed. Online 42, 429–441 (2021).
53. Akter, S. et al. GenomeForest: an ensemble machine learning classifier for endometriosis. AMIA Summits Transl. Sci. Proc. 33–42. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7233069/ (2020).
54. Perrotta, A. R. et al. The vaginal microbiome as a tool to predict rASRM stage of disease in endometriosis: a pilot study. Reprod. Sci. 27, 1064–1073 (2020).
55. Akter, S. et al. Machine learning classifiers for endometriosis using transcriptomics and methylomics data. Front. Genet. 10, 1–17. https://doi.org/10.3389/fgene.2019.00766 (2019).
56. Tamaresis, J. S. et al. Molecular classification of endometriosis and disease stage using high-dimensional genomic data. Endocrinology 155, 4986–4999 (2014).
57. Chattot, C. et al. ENDORECT: a preoperative score to accurately predict rectosigmoid involvement in patients with endometriosis. Hum. Reprod. Open 2, https://doi.org/10.1093/hropen/hoz007 (2019).
58. Guo, Z., Feng, P., Chen, X., Tang, R. & Yu, Q. Developing preoperative nomograms to predict any-stage and stage III-IV endometriosis in infertile women. Front. Med. 7, 695 (2020).
59. Guerriero, S. et al. Artificial intelligence (AI) in the detection of rectosigmoid deep endometriosis. Eur. J. Obstet. Gynecol. Reprod. Biol. 261, 29–33 (2021).
60. Reid, S., Lu, C. & Condous, G. Can we improve the prediction of pouch of Douglas obliteration in women with suspected endometriosis using ultrasound-based models? A multicenter prospective observational study. Acta Obstet. Gynecol. Scand. 94, 1297–1306 (2015).
61. Maicas, G. et al. Deep learning to diagnose pouch of Douglas obliteration with ultrasound sliding sign. Reprod. Fertil. 2, 236–243 (2021).
62. Stegmann, B. J. et al. A logistic model for the prediction of endometriosis. Fertil. Steril. 91, 51–55 (2009).
63. Beam, A. L. & Kohane, I. S. Translating artificial intelligence into clinical care. JAMA 316, 2368–2369 (2016).
64. Peters, M. D. et al. Guidance for conducting systematic scoping reviews. JBI Evid. Implant. 13, 141–146 (2015).
65. Tricco, A. C. et al. PRISMA Extension for Scoping Reviews (PRISMA-ScR): checklist and explanation. Ann. Intern. Med. 169, 467–473 (2018).
66. Arksey, H. & O’Malley, L. Scoping studies: towards a methodological framework. Int. J. Soc. Res. Methodol. 8, 19–32 (2005).
67. Moher, D. et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst. Rev. 4, 1–9 (2015).
68. Bird, K. et al. Assessment of hypertension using clinical electrocardiogram features: a first-ever review. Front. Med. 7, 583331 (2020).

Author information

These authors contributed equally: Brintha Sivajohan, Mohamed Elgendi.

Authors and Affiliations

Schulich School of Medicine & Dentistry, Western University, London, ON, Canada
Brintha Sivajohan
Department of Obstetrics and Gynecology, Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
Brintha Sivajohan, Mohamed Elgendi, Catherine Allaire, Paul Yong & Mohamed A. Bedaiwy
Biomedical and Mobile Health Technology Laboratory, Department of Health Sciences and Technology, ETH Zurich, Zurich, Switzerland
Mohamed Elgendi & Carlo Menon
British Columbia Women’s Hospital, Vancouver, BC, Canada
Catherine Allaire, Paul Yong & Mohamed A. Bedaiwy

Contributions

B.S., M.E., C.M., C.A., P.Y., and M.B. conceived the study. B.S. and M.E. developed the search strategy and made the inclusion decisions and the quality assessment. C.M., C.A., P.Y., and M.B. provided methodological and clinical expertise. B.S. and M.E. wrote the draft of the paper. B.S. and M.E. created all figures. All authors approved the final paper.

Corresponding author

Correspondence to Mohamed A. Bedaiwy.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Sivajohan, B., Elgendi, M., Menon, C. et al. Clinical use of artificial intelligence in endometriosis: a scoping review. npj Digit. Med. 5, 109 (2022). https://doi.org/10.1038/s41746-022-00638-1

Received: 21 December 2021
Accepted: 24 June 2022
Published: 04 August 2022
DOI: https://doi.org/10.1038/s41746-022-00638-1
