Learning endometriosis phenotypes from patient-generated data

Urteaga, Iñigo; McKillop, Mollie; Elhadad, Noémie

doi:10.1038/s41746-020-0292-9

Download PDF

Article
Open access
Published: 24 June 2020

Learning endometriosis phenotypes from patient-generated data

npj Digital Medicine volume 3, Article number: 88 (2020) Cite this article

8811 Accesses
19 Citations
21 Altmetric
Metrics details

Subjects

Abstract

Endometriosis is a systemic and chronic condition in women of childbearing age, yet a highly enigmatic disease with unresolved questions: there are no known biomarkers, nor established clinical stages. We here investigate the use of patient-generated health data and data-driven phenotyping to characterize endometriosis patient subtypes, based on their reported signs and symptoms. We aim at unsupervised learning of endometriosis phenotypes using self-tracking data from personal smartphones. We leverage data from an observational research study of over 4000 women with endometriosis that track their condition over more than 2 years. We extend a classical mixed-membership model to accommodate the idiosyncrasies of the data at hand, i.e., the multimodality and uncertainty of the self-tracked variables. The proposed method, by jointly modeling a wide range of observations (i.e., participant symptoms, quality of life, treatments), identifies clinically relevant endometriosis subtypes. Experiments show that our method is robust to different hyperparameter choices and the biases of self-tracking data (e.g., the wide variations in tracking frequency among participants). With this work, we show the promise of unsupervised learning of endometriosis subtypes from self-tracked data, as learned phenotypes align well with what is already known about the disease, but also suggest new clinically actionable findings. More generally, we argue that a continued research effort on unsupervised phenotyping methods with patient-generated health data via new mobile and digital technologies will have significant impact on the study of enigmatic diseases in particular, and health in general.

The Project Baseline Health Study: a step towards a broader mission to map human health

Article Open access 05 June 2020

Smartphone accelerometer data as a proxy for clinical data in modeling of bipolar disorder symptom trajectory

Article Open access 14 December 2022

Detecting the impact of subject characteristics on machine learning-based diagnostic applications

Article Open access 11 October 2019

Introduction

Endometriosis is a chronic and systemic disease in women of reproductive age with no known cure^1,2,3. Although complex multi-factorial causes (i.e., biological and environmental factors) are likely to be of relevance, the etiology of the disease is still unknown. Disease pathology is traditionally described by tissue similar to the endometrium—the lining of the uterus—growing outside the uterine cavity, which may form lesions in pelvic, gastrointestinal, and other areas. The disease is currently diagnosed by direct visualization of such lesions through laparoscopic surgery.

Endometriosis is prevalent in women, with estimates of affecting 10% of those in reproductive age, and has high morbidity and impact on quality of life^4,5. Nevertheless, it is a highly enigmatic condition, with heterogeneous symptoms documented by patients: stereotypical evidence like pain and infertility are known, but a wide range of other symptoms with systemic effects are reported as well⁶. However, these variety of symptoms have not been well characterized yet for all endometriosis patients, with unclear associations between some symptoms and the disease: it is still uncertain why some treatments are effective for some patients, and not for others. Besides, there are no known biomarkers of the disease for non-invasive diagnosis or for monitoring its progression, and it currently takes an average of 8 years for patients to receive a diagnosis. Although several stages of the disease have been proposed, they do not explain the diversity of symptoms experienced by patients, they do not correlate with their severity⁷, nor have unequivocal connection with disease progression⁸.

Due to its poor clinical characterization, identifying signatures across individuals that correspond to phenotypes of endometriosis would allow for better treatment, as well as to generate new hypotheses about potential causes and means of diagnosis⁹. An accurate characterization of endometriosis through disease subtypes is critical for earlier diagnosis, as well as for targeted treatment and management strategies of the disease. Traditional clinical phenotyping approaches based on available electronic health record data are limited, mostly due to the lack of sufficient evidence of the symptomatic manifestations of the disease. Furthermore, there is no existing grouping for characterization of the disease in the context of non-clinical, but easily observable, variables: e.g., signs and symptoms, such as pelvic pain, mood variations, or period characteristics, experienced by patients.

Recently, wearable sensors^10,11 and smartphones^12,13 have been proposed as a powerful way to connect medical researchers to patients, and vice versa. With these mobile technologies, patients can provide longitudinal, real-world evidence of their experience of a particular disease. Recent software platforms like ResearchKit¹⁴ and ResearchStack¹⁵ facilitate the use of mobile technology to recruit and consent patients into studies¹⁶. The first wave of app-based studies have shown that patients can provide valuable information, with the appropriate recruitment and retention strategies¹⁷, to advance our understanding of disorders over time, generating new insights about diseases^18,19 and overall health^20,21.

This work contributes to the emerging area of research on digital phenotyping from patient-generated health data, specifically from data collected through smartphone applications^13,22. Digital phenotyping aims at the automatic characterization of a patient’s phenotype using electronic data. In conjunction with the advance of data science and machine learning techniques, along with the pervasive use of smartphones, other personal digital devices and wearables, it holds considerable potential for analyzing patient-generated data^20,23 for medical research purposes^{12,13,16,24,25,26,27}.

In this work, we explore the use of unsupervised data-driven methods to identify subtypes of endometriosis, where patients are grouped together based on their signs and symptoms, quality of life, and treatments. We use self-tracking data obtained through an smartphone app specifically designed to characterize endometriosis at scale. We extend a mixed-membership model—which partitions collections of data into mixtures of a shared set of latent groups—to accommodate the idiosyncrasies of the data at hand: i.e., the multimodality and uncertainty of the tracked variables. We probabilistically model a wide range of observations (i.e., participant symptoms, quality of life, treatments) to obtain interpretable descriptions of endometriosis phenotypes.

We validate our approach both intrinsically and extrinsically via (1) the evaluation of its ability to model unseen data, (2) the interpretability of the identified subtypes by endometriosis experts, (3) the matching of unsupervised phenotype assignments against clinical experts grouping, and (4) the association between subtypes and responses to clinically validated standard surveys for endometriosis.

Our experiments show that (i) our approach identifies phenotypes that are robust to biases of self-tracked data (e.g., wide variations in tracking frequency amongst participants), as well as to hyperparameter choices for the model; and (ii) jointly modeling a wide range of observations self-tracked by participants (symptoms, quality of life, treatments) yields clinically meaningful disease subtypes, both validating what is already known about endometriosis and suggesting new hypothesis about the condition as well. Overall, we show the promise of unsupervised learning of endometriosis phenotypes from self-tracked participant data collected via digital mobile platforms.

Results

Patient-generated data

We collected two types of patient-generated data for this study. Once participants consented, they were asked to self-track their symptoms in the Phendo research app, as well as to fill out an electronic version of the WERF survey, a validated clinical survey by and for the endometriosis research community. The unsupervised phenotype learning task relied only on the self-tracking data from Phendo, while the WERF survey data was used to assess the quality of the learned phenotypes.

Patient-generated data—Phendo self-tracking data

Phendo is a Columbia University IRB-approved smartphone app for women to self-track endometriosis (Fig. 1), available for both iOS²⁸ and Android²⁹ based phones. The app was specifically designed to capture the patient experience of the disease, as well as to engage participants in self-tracking the condition over time^30,31. App users were recruited through patient advocacy groups, and active recruitment efforts were sustained throughout the study period, leveraging a wide range of strategies including social media (Twitter, Facebook, Instagram, and Medium), emails, radio, news articles, celebrity endorsement through social media posts, blog posts, and scientific articles.

**Fig. 1: Example screenshots of Phendo, the endometriosis research app.**

Once enrolled in the Phendo study, users can self-track a variety of variables of their interest at the frequency with which they experience them. Some—pain for example—moment-by-moment (i.e., when and as many times as participants experience it), while others—like “How was your day?”—are tracked daily. The app is purposely designed with these flexible options to collect data as close in time as to when the relevant events occur.

The moment-level tracking comprises reports about pain across specific body locations and severity levels, gastrointestinal and genitourinary issues relevant for endometriosis—with their associated severity levels—other signs and symptoms commonly reported by participants (e.g., “blurry vision”, “hot flashes”, “fatigue”) and their severity, participants’ bleeding patterns, and customized medication and hormonal intake reports. Users can track a functional assessment of their day (from “Great” to “Unbearable”), which daily living activities were hard for them to do, menstruation patterns, sexual activity and potential dyspareunia, as well as other personalized answers for hormonal treatments, diet and exercise items they want to keep track of.

We selected a cohort of Phendo participants who had self-reported diagnosis of endometriosis, and had at least one self-tracked entry in one of the available questions between December 2016 (launch of the app) and end of December 2018, resulting in 4368 participants—mostly white and non-hispanic, with a mean age of 29 (see Table 1 for the cohort characteristics).

Table 1 Phendo cohort (N = 4368) demographics.

Full size table

In this study, we focused on the following subset of questions related to: (1) pain location with 39 potential answers, (2) pain description with 15 potential answers, (3) pain severity with 3 potential answers, (4) gastrointestinal and genitourinary (GI/GU) symptoms with 14 potential answers, (5) their severity with 3 potential answers, (6) other symptoms with 21 potential answers, (7) their severity with 3 potential answers, (8) period flow with 3 potential answers, (9) bleeding patterns with 3 potential answers, (10) sexual activity with 6 potential answers, (11) difficult daily living activities with 23 potential answers, (12) medications including hormonal treatments with 64 potential answers, and (13) quality of life with 5 potential answers. The details for the potential answers per-question are provided in the Supplementary Results.

Since the Phendo data (with 776,855 observations in total for the cohort) are self-tracked at the participants’ discretion, they are heterogeneous both in their frequency and their amounts collected per participant. The aggregated statistics over all the observations per tracked variable are described in Table 2.

Table 2 Summary statistics per-tracked question.

Full size table

Patient-generated data—WERF survey data

The WERF EPHect survey is a standardized questionnaire designed by the endometriosis research community³², and it represents the gold-standard for clinical characterization of endometriosis. The survey was optional for our study participants, and it was provided as part of the profile tab in the Phendo app. We selected a subset of questions related to menstrual and endometriosis history, family history of endometriosis, family history of chronic pelvic pain, and surgical history (Table 3), as well as diagnosed comorbidities, general health and activities of daily living (Table 4) for our analysis. Of the 4368 participants who contributed self-tracking data, 533 participants completed the WERF survey.

Table 3 WERF survey statistics for participants’ medical history (N = 533).

Full size table

Table 4 WERF survey statistics for participants’ comorbidities (N = 533).

Full size table

Unsupervised phenotype modeling

The proposed unsupervised mixed-membership method—fully described in the Methods section—models per-participant and per-question observations with a latent joint mixture of distributions, and outputs both groupings of responses that describe endometriosis phenotypes, as well as probabilistic assignments of each participant to the learned subtypes.

We evaluated the accuracy of the proposed model in describing unseen data (see results in Table 5), and observed a significant improvement of our method when compared to a vanilla mixed-membership baseline model—where responses to all questions are modeled together as in the topic model in³³. We note the robustness of the learning process—there are no significant differences—with respect to specific choices of the hyperparameters of the model.

Table 5 10-fold cross-validated test data log-likelihood of the proposed method Vs vanilla LDA.

Full size table

The enigmatic nature of endometriosis and its poor clinical characterization makes indispensable the interpretability of the phenotyping model. The probabilistic posteriors learned by our model are highly interpretable and discriminative: the per-question posteriors describe how likely are participants within a phenotype to track specific responses. Due to the flexibility of our model in accommodating per-question modalities, the method is capable of capturing signal within each of the self-tracked variables separately, resulting in a better discrimination between endometriosis phenotypes. As such, our model selection is primarily guided by interpretability criteria.

In general, sparsity—using few per-question answers to describe each phenotype—helps experts understand the model outputs (i.e., the learned per-phenotype and per-question posterior distributions) better, as fewer answers become significant in discriminating among phenotypes. The selected model learned four phenotypes (as it captured distinguishing features, while models with more subtypes did not provide new discriminating insights) with sparse parameters (α = β = 0.001) that allowed endometriosis experts to easily interpret the provided outputs.

Unsupervised phenotype modeling—Learned endometriosis phenotypes

We present a summary of the outputs of the learned model for the whole study cohort in Figs. 2 and 3. The first illustrates the per-question posterior distribution for each phenotype, where for visual clarity, only the top 10 (most likely) vocabulary items of the posterior are displayed (the full vocabulary per-question posteriors are provided in the Supplementary Results). The second is an answer-cloud summary visualization of each phenotype (the per-question and per-phenotype answer-clouds are provided in the Supplementary Results). These figures reflect not only which responses are more commonly reported per phenotype (i.e., how likely is a participant within each subtype to track any of the per-question symptoms), but also how they correlate with each other in the Phendo cohort.

**Fig. 2: Visualization of learned posteriors for endometriosis phenotypes.**

**Fig. 3: Answer-cloud visualization of learned endometriosis phenotypes.**

We report the following two main findings from the learned endometriosis phenotypes. First, each of the four phenotypes is uniquely characterized by distinct signs and symptoms, behaviors, and treatment strategies. Second, the learned phenotypes characterize endometriosis according to its severity—consistently across all signs and symptoms (pain, GI/GU, other symptoms)—and the burden on participants’ daily lives, hinting at the systemic aspect of the disease.

Phenotype A, specifically, describes a particularly severe endometriosis subtype. Furthermore, while the learned phenotypes reflect the state-of-knowledge about endometriosis, they highlight new insights and correlations across signs, symptoms, and treatments. We provide a detailed description of each phenotype per question, i.e., the posteriors in Fig. 2.

Across all learned endometriosis subtypes, chronic pain-related symptoms are common. However, there is a significant difference for phenotype A, as it is the only phenotype with significant posterior mass for “severe pain” (see Fig. 2c). The severity of other reported symptoms, such as gastrointestinal, genitourinary, and other symptoms, is also highest for phenotype A (Fig. 2i, 2e illustrate this, respectively).

For all participants in the cohort, the most salient pain locations tracked are pelvic, lower back, ovary and uterus—see overall answer-clouds in Fig. 3 and per-question visualizations in the Supplementary Results. A wider and more specific range of pain locations are likely to be reported by participants in phenotype A: there is significant evidence of deep vagina, vagina entrance and inner thigh pain, as well as cervix, rectum and intestine pain. On the contrary, phenotypes B and C are associated with pelvis, uterus or vagina pain primarily, while phenotype D has a less prominent, but broader association with pain locations. The tracked pain is commonly described as aching or cramping across all phenotypes, while phenotype A has higher likelihood of deep pain reports, and is uniquely likely to report burning, throbbing and nauseating pain.

Phenotypes learned by the model capture common endometriosis GI/GU symptoms of bloated abdomen (i.e., “endo belly”), as well as reports of constipation, diarrhea, and nausea. Phenotype A is more likely to report both nausea and irritable-bowel-like symptoms—congruent with the high prevalence of such syndromes in the disease—as well as to do so with higher severity. Phenotype A shows urinary-related symptoms as well.

Tracking of other symptoms of endometriosis (collected via the question “What else are you experiencing?” in Phendo) demonstrates the overall chronic nature of the disease. Fatigue, headache, mental fogginess, and dizziness are tracked across all learned phenotypes. Phenotype A uniquely experiences more systemic symptoms, like hot flashes, sweaty, and numbness; while phenotypes C and D are characterized by some symptoms of the upper abdomen, like chest pressure. Both phenotype A and D are likely to track noise- and touch-sensitivity, as well as sinus congestion.

In Fig. 2f, 2g, we observe that phenotypes B, C and D are likely to track light menstrual flow (with some evidence for medium flow as well), with spotting bleeding outside the period reported more significantly in phenotypes B and D. Phenotype A shows evidence of very irregular menstruation, and is the only subtype with heavy flow reports. Subtype A has higher likelihood of menorrhagia and clots, which appear less likely in phenotypes B and D.

Across all learned phenotypes, we observe a wide range of issues with daily activities, such as walking, standing, getting out of bed, using the toilet, sitting down, getting dressed, socializing, and working. Notice how salient these difficulties are for phenotype A, with basic functionalities like walking, standing or getting out of bed being commonly reported.

In general, phenotype A experiences low quality of life with high probability. Specifically, subtype A is uniquely associated with “bad” days—see high posterior mass in Fig. 2j—while the rest of the phenotypes are likely to track on the other side of the spectrum: i.e., “manageable” and “good” days. This effect is also evident with regards to sex, as phenotype A is the only subtype where sex is explicitly avoided, or reported to be painful (see Fig. 2l).

Finally, we observe that medications and hormones are highly discriminative of how different patients experience endometriosis. From the learned phenotypic posteriors (see Fig. 2m), we conclude that phenotype A is uniquely associated with the use of narcotics and neuropathic pain medications, phenotype B with hormonal treatments, phenotype C with no medical treatments, and phenotype D with a wider variety of treatments (hormonal, narcotic and antidepressants).

Unsupervised phenotype modeling—learned participant phenotypic assignments

Fig. 4 provides the probabilistic assignment of participants to the learned phenotypes. While the model provides for each participant membership probabilities across all phenotypes, we see that most participants are clearly assigned (with probability above 0.9) to a single phenotype.

**Fig. 4: Posterior assignment probability of each participant across the phenotypes learned by the model.**

One possible question when learning unsupervised clustering of participants is whether the self-tracking patterns of the participants is responsible for their underlying phenotype assignments or, rather, whether their assignments are uncovering actual endometriosis characteristics. In our data, we note that the average number of days tracked in all learned phenotypes are similar (34, 48, 41, and 27 on average), although participants associated with phenotype A tracked slightly more observations (on average, 116, 80, 80, and 66, respectively).

In contrast, the phenotypic assignments of participants do not correlate with the number of days or the observations participants tracked, nor their ratio (see Fig. 5). The learned phenotypes do not capture spurious self-tracking patterns related to engagement with the app, but rather represent participants based on their answers to endometriosis relevant Phendo questions.

**Fig. 5: Learned phenotype assignments are not correlated with the number of days, number of observations tracked, nor the ratio of observations per day tracked by participants.**

Endometriosis phenotype evaluation

On top of the checks presented in the previous section related to the coherent representation of the learned phenotypes, as well as to a meaningful clustering of different types of endometriosis patients, we further assess the quality of the learned phenotypes in two ways: how they correlate with expert endometriosis groupings, and how they associate with responses to the WERF survey.

Phenotype evaluation—agreement between expert clustering and phenotyping

The responses collected by the Phendo app of randomly selected 40 participants were reviewed by two endometriosis experts, who were asked to group them based on their clinical understanding of patient signs and symptoms (see guidelines description in the Methods section). In general, experts tended to categorize participants based on the symptomatic intensity (mild Vs severe) and the clinical management of the disease (no medical involvement Vs clinically managed).

The assignments by the experts and the model are compared, via confusion matrices (provided in Tables 6 and 7). High cluster purity values were attained for both the severe phenotype A (0.9 and 0.8) and the mildest phenotype B (0.775 and 0.7)—see Tables 8 and 9—indicating a clear agreement between our model and the experts on which participants were assigned to the two ends of the endometriosis spectrum (the inter-expert purity is 0.85 and 0.75 for the severe and mild cases, respectively).

Table 6 Phenotype confusion matrix for Expert 1.

Full size table

Table 7 Phenotype confusion matrix for Expert 2.

Full size table

Table 8 Confusion matrices for severe cases.

Full size table

Table 9 Confusion matrices for mild cases.

Full size table

The cluster purity for the full phenotypic assignments learned by the model is lower (0.6 and 0.55), reflecting the hard time experts had splitting some participants into 2 subtypes within the moderate group. We noticed that, for some of the participants for which the experts had assignment uncertainty, there were few self-tracked variables (both in quantity and in clinical relevance). Besides, after revealing the model assignments to the experts, they noticed how the model was distinguishing between moderate phenotypes based on certain variables that were non-critical in state-of-the-art recommendations, such as treatment choices, menstruation flow and sex-reports, which they had not previously considered.

Phenotype evaluation—associations between learned phenotypes and survey answers

To further validate the insights from the proposed unsupervised model, we study the statistical association between the learned phenotypes and the participant responses to the WERF survey. In general, the severity and quality of life indicators of endometriosis (as specified by WERF standards) align well with how our model discriminates patients. Specifically, the most significant associations occur for daily living limitations, the surgical burden associated with the disease, and their overall health.

Quality of life is considerably impacted for participants assigned to phenotype A: they are significantly more likely to rate their overall health as poor in their WERF-EPHect responses, with those in phenotypes B and C being associated with good or excellent self-evaluations. More precisely, those in phenotype A are distinctively associated with responses acknowledging limitations on activities like bending, kneeling, stooping, lifting or carrying groceries, bathing, dressing, walking or climbing stairs. They are also associated with limitations for running, lifting heavy objects or participating in other strenuous sports. Participants assigned to both phenotypes A and D have reported significant pelvic pain preventing them from going to work or school, as well as from carrying out other daily activities.

The severity of endometriosis for participants in phenotype A is evident when looking at the surgical burden as well: they are more likely to have undergo abdominal surgeries (e.g., gallbladder surgery), and are associated with more surgical procedures for endometriosis (average of 2.32 for phenotype A, versus 1.62, 1.51, 1.46, respectively for other phenotypes), as well as laparoscopies (1.76 versus 1.40, 1.40, and 1.26 respectively). It is interesting to observe that phenotype A and D are both associated with evidence of fibromyalgia and sigmoidoscopy or colonoscopy procedures. Hormone-induced menstruation is uniquely associated with phenotype B, while participants assigned to phenotype C are the only ones associated with regular periods.

We found that participants assigned to phenotype A are most likely to have pelvic inflammatory diseases, with some evidence of high blood pressure associated with phenotypes A and C. Migraine is associated with phenotype A, while chronic fatigue syndrome and anxiety disorders requiring medication or therapy were associated with both participants in phenotypes A and D. In general, even if several comorbidities such as PCOS or interstitial cystitis are high in the overall cohort (see Table 4), no significant association was found with any particular learned subtype.

We conclude by noting that we find a weak association between participants assigned to phenotype A and higher body mass index (BMI), while no significant correlations are found between phenotypes and age, race, time to diagnoses, or reports of diagnosis of endometriosis within the family.

Discussion

Our joint modeling of multiple self-tracked variables through mixed-membership models show that we can produce robust, clinically meaningful groupings of self-tracked signs and symptoms collected via patient-centered mobile and digital platforms.

We find that the proposed unsupervised method learns robust phenotypes, with respect to specific choices of the hyperparameters of the model and the randomness associated with inference. We observe that the log-likelihood of the selected model is stable for different realizations of the inference algorithm, as well as to different train/test splits. Overall, the learned phenotypes show the same discriminative features, and the set of significant associations between the participant phenotypic assignments and the WERF questionnaire responses are consistent across realizations.

Even if the available data is heterogeneous, both in type and quantity across participants, the proposed method is robust to the inherent uncertainties of self-tracked data, and does not pick up spurious signals—the learned phenotypes do not correlate with the number of observations or days tracked, nor other variables like age or race of participants.

The proposed model characterizes the burden of endometriosis across all the learned phenotypes. The learned (unsupervised) subtypes, along with participant phenotypic assignments, align well with previous clinical knowledge about endometriosis, but also suggest novel findings. Our approach reflects direct patient experiences with endometriosis, and provide potentially novel insights about the disease.

The reports from the WERF survey confirm that patients with endometriosis have a higher number of known comorbidities than the general US population (see Table 4). These include autoimmune, endocrine-based, and mental health disorders, such as irritable-bowel syndrome³⁴, Hashimoto’s disease³⁵, fibromyalgia³⁶, anxiety disorders³⁷, asthma³⁸, chronic fatigue syndrome³⁹, depression⁴⁰, migraine⁴¹, and PCOS⁴².

The clusters of symptoms learned for the different phenotypes confirm, as well, the chronic nature of endometriosis: fatigue, headaches, mental fogginess, gastrointestinal problems, and pain reports are common across all phenotypes. These symptoms (specially fatigue and mental fogginess or dizziness) are similar to those experienced in other complex chronic conditions, and are characteristic of low grade inflammation⁴³.

The observed commonality of pelvic and lower back pain symptoms across phenotypes is expected for endometriosis patients⁴⁴, as well as having gastrointestinal symptoms related to irritable-bowel syndrome^45,46. Our analysis shows spotting and bleeding outside of the period to be characteristic of all participants in our cohort, which matches findings connecting premenstrual spotting with histologically confirmed endometriosis⁴⁷.

The phenotypes learned by the proposed model separate participants’ experiences according to their severity, consistently across all signs and symptoms (pain, GI/GU, other symptoms). Specifically, Phenotype A describes a particularly severe endometriosis subtype.

First, we observe (both in the learned posteriors and in the computed associations) that patients assigned to subtype A track symptoms related to several comorbidities already reported in the literature. Diagnosis of endometriosis has been linked to anxiety, depression, and other mood disorders^48,49, migraines⁵⁰, high blood pressure⁵¹, PCOS⁵², and chronic fatigue syndrome^6,53,54. The significant associations found for phenotype A reflected a higher surgical burden, and a lack of adequate treatment of the disease. This finding is consistent with the existing literature studying endometriosis diagnosis^55,56,57.

The severe genitourinary symptoms characteristic of phenotype A (e.g., painful urination or dysuria) have been previously reported in the literature^58,59,60, but their association with the collection of other symptoms tracked within this phenotype is novel. Associations with the WERF survey were consistent with current knowledge regarding menstruation, but also demonstrated novel patterns of the disease. Specifically, menstrual irregularity has been shown to be associated with endometriosis before, but not with a specific subgroup of participants^61,62. Phenotype A shows a higher likelihood of disordered periods (with heavier flows and menorrhagia). Besides, participants assigned to this subtype have tracked menstrual bleeding, and are associated with irregular periods in their WERF survey responses as well—only participants assigned to phenotype C were associated with regular periods. Even if menorrhagia is a common endometriosis symptom⁶³, it has not been previously associated with a particular subgroup of endometriosis patients. Furthermore, hormone-induced menstruation is uniquely associated with phenotype B, which aligns well with the presence of hormonal treatments found in the medication posterior of Fig. 2m.

Painful sex is a widely known symptom for endometriosis^64,65,66. We here find dyspareunia to distinctively correlate with phenotype A. This finding is consistent with the highly systemic nature of the disease, the impact of gastrointestinal and genitourinary symptoms, and pain locations—intestines, cervix pain, vagina entrance pain—specifically highlighted by the posteriors learned for phenotype A. The literature has previously documented sexual problems and active avoidance of sexual activity by women with endometriosis^67,68. However, we here find a novel association between dyspareunia and a specific subtype of the disease.

The learned phenotypes provide evidence of the different treatment alternatives for the disease, each endometriosis subtype being characterized by distinct medication intakes. A first line of treatment for endometriosis symptoms is often a combination of progestin and/or hormonal medications⁶⁹, which interestingly are highly associated with learned phenotype B, while phenotype C is not correlated with any particular medication. On the contrary, phenotype A is characterized by a heavy use of narcotics, and a more likely use of antidepressants and neuropathic pain medications (with some evidence of this also appearing in phenotype D). This finding reflects the psychological and physiological impact of the disease, as neuropathic pain often develops when there is damage to the somatosensory nervous system: evidence suggests that women with endometriosis, and in particular those with pain in the upper anterior-lateral part of the thigh (which is uniquely represented in pain locations for phenotype A), tend to experience neuropathic pain⁷⁰.

The impact of the disease on the quality of life aligns with the severity of symptoms across the learned phenotypes. Problems with day-to-day functioning of endometriosis patients have been previously documented⁷¹, and the associated loss of productivity and reduced quality of life is well known in the literature. However, evaluating the differences among patient subgroups is yet unexplored^71,72,73. Here, we find that “Bad days” and “Poor health” reports—in the Phendo app, as well as in the WERF survey—are uniquely associated with phenotype A, while participants in other phenotypes don’t report such negative experiences. The impact of the disease on quality of life and daily activities is supported by both the learned phenotype posteriors and the responses to the WERF survey. There is a clear and significant association between problems with daily living activities and participants assigned to phenotype A.

The exact etiology of endometriosis remains unclear⁷⁴. Among studies that examined heritability of the disease, there seems to be both maternal and paternal genes involved in the development of endometriosis, but the majority appear paternally inherited⁷⁵. In our study, 38% of participants reported a diagnosis of endometriosis within the family, but no significant etiology association was found at the phenotype level. Underweight BMI has traditionally been thought of as a risk factor for endometriosis, but recent research suggests that among woman who are obese, the disease is more severe⁷⁶. Our analysis points to a weak association between BMI and a more severe experience of the disease.

Finally, we also found some reports of tinnitus—ringing in ears—and itchiness (mostly for phenotype D), which have not been documented as important symptoms for endometriosis in the literature. Participants associated with this phenotype may be impacted by changes in hormone levels, which at least for menopausal women, have been associated with tinnitus⁷⁷ and itchiness⁷⁸.

As a first step towards investigating phenotyping of endometriosis based on self-tracked data, this study has ignored the temporal aspect of the condition, and have instead aggregated all tracked observations for each participant. We acknowledge that the heterogeneity in tracking might vary within a given participant’s timeline as well. Even if it is plausible that there is signal across learned phenotypes and disease progression, there is a lack of medical evidence as to whether endometriosis phenotypes indicate progression of disease. Specifically, there is little evidence that superficial endometriosis progresses to deep endometriosis⁸. Furthermore, our analysis shows no correlation between the discovered phenotypes and age or time to diagnosis.

Future work should consider modeling the temporality of the signs and symptoms of endometriosis, particularly since it is estrogen dependent and linked to the menstrual cycle. We acknowledge that how robust the learned phenotypes are when compared to other advanced computational phenotyping techniques, such as⁷⁹, is an open research question. We also note that our association analysis may be limited, both in terms of the type of questions available in the WERF survey, and the number of participants for whom we were able to collect responses.

Nevertheless, we argue that the analysis in this study already sheds novel insights into the understanding of endometriosis subtypes, and demonstrates the value of patient-generated data and unsupervised learning methods in medical research. This paper contributes to research in digital phenotyping from self-tracking data, and highlights how patient-powered mobile and digital technologies can be leveraged, in combination with unsupervised machine learning techniques, to study diseases and health outcomes.

In the case of endometriosis, a particularly enigmatic condition with a dire need for phenotyping, our method identified four subtypes of patients, grouped by severity of their condition and other factors of interest. Moreover, clinically meaningful novel associations beyond what is currently known about the disease were identified.

Methods

Unsupervised phenotyping model

We aim at understanding how self-tracked data from smartphones—a set of heterogeneous signs and symptoms from an enigmatic disease—can be grouped into different phenotypic experiences. Self-tracking data raises several considerations—it is irregularly sampled, noisy and contains several different data types—that we need to account for.

The process of extracting clinically relevant characteristics from a collection of data is generally defined as computational phenotyping. One family of phenotyping approaches are the generalized low-rank models (GLRMs), where the clinical data is put into a matrix form A, and a low-rank decomposition into factors X and Y is searched for⁸⁰. The factor X represents each observation in A in terms of low-rank features Y, which encodes a low-rank feature representation of the original data. This factorization is found via an optimization procedure that consists of a loss function and corresponding regularizing terms. Particular choices of loss and regularization functions result in many well-known models. For instance, a mean squared-loss and no regularization is mathematically equivalent to principal components analysis (PCA). After finding a good low-rank representation, clustering techniques (such as K-means) are applied in their latent feature representation to derive cluster centroids (i.e., phenotypes are vectors in the embedded space). We provide a description of GLRM baselines and their performance in the Supplementary Methods, which did not discover clinically meaningful endometriosis phenotypes.

The goal of these GLRMs⁸⁰ and other methods, such as non-negative tensor factorization⁸¹, is to autonomously identify clusters, usually in the learned latent space. Even if progress has been made on learning sparse and diverse phenotypes⁷⁹, interpretation of the learned clusters to clinicians is challenging. In general, a cluster centroid vector in latent space lacks clinical meaning, while the explanation of the centroid in the original space demands a complicated understanding and explanation of a high-dimensional vector of clinical features. Besides, when using non-linear embedding functions, the mapping from latent to original features becomes even more convoluted.

In this work, we leverage an unsupervised probabilistic method to account for the lack of gold-standard labels (i.e., supervised methods are not applicable), and the heterogeneity of the symptomatic experience (i.e., we aim at a probabilistic assignment of shared signs and symptoms across patients). We propose an extended mixed-membership model^82,83, which is a Bayesian generative model that can accommodate the inherent heterogeneity and uncertainty of the data, to capture the latent structure of collections of groups of self-tracked signs and symptoms.

Topic models⁸⁴ are one of the primary examples of mixed-membership models, where one infers the latent topics of a corpora of documents. Intuitively, if a document is about a particular topic, one would expect specific words to appear in the document more or less frequently. However, a document typically covers multiple topics in different proportions. Topic models capture this intuition mathematically, based on the statistics of the observed words in each document, and outputs what the topics might be, as well as the document’s proportion of topics³³.

Here, we cast the set of self-tracked responses per participant as “documents”, all generated from the “corpus” of endometriosis patients. As such, each set of tracked observations is modeled as a mixture model, where the mixture components (i.e., the phenotypes) are shared across the population, but the mixture proportions vary per participant.

The available self-tracked data however is not a standard document, but a collection of responses to different questions—for the unsupervised learning of phenotypes, we only use the self-tracked data, not the WERF EPHect questionnaire data, which is left-out for evaluation purposes. The Phendo app already provides a fixed set of possible responses to most of the questions, and medications and hormones were mapped to their corresponding medication classes of a fixed size (see per-question vocabularies in the Supplementary Results).

As a competitive baseline for the task at hand, we consider the mixed-membership model known as Latent Dirichlet Allocation³³. For this approach, the collection of responses to different questions q = {1, ⋯, Q} are concatenated. The input to this baseline is a high-dimensional (V₁ + V₂ + ⋯ + V_Q) multinomial vector per participant, where V_q is the vocabulary size of each question q, which the method uses to learn “topics” (i.e., phenotypes) and the per-participant assignments to each phenotype.

We here extend as in ref. ⁸³ the mixed-membership model to accommodate for multi-modal data, where each modality is an specific question q = {1, ⋯, Q} with its vocabulary size V_q. The proposed mixed-membership model infers phenotypes based on the co-occurrence of observations across the set of per-question responses and participants. The probabilistic graphical model and full details of the relevant statistical functions are provided in the Supplementary Methods and ref. ⁸³. The proposed unsupervised method outputs groupings of per-question responses to self-tracked variables that describe endometriosis phenotypes. The learned probabilistic posteriors per-question (see Fig. 2) describe how likely are certain terms to be tracked for each phenotypic profile.

In order to determine the hyperparameters for the task at hand, we perform held-out data log-likelihood comparisons (10-fold cross-validation), where the data are split with a 80/20 train/test ratio, the hyperparameters are varied within K ∈ {2, 3, 4, 5}, α ∈ {0.1, 0.01, 0.001}, and β ∈ {0.1, 0.01, 0.001}. Since computing the log-likelihood of mixed-membership models for unseen data is nontrivial—see discussion in ref. ⁸⁵—we extend the “left-to-right” method proposed in ref. ⁸⁵ to our per-question mixed-membership model.

Phenotype visualization

To allow for easy and visually appealing clinical evaluation, we provide posterior heatmaps, and a visual summary of each phenotype’s most prominent responses via answer-clouds (see Figs. 2 and 3, respectively). The former allows for a clear identification of the most salient responses, as they show the most discriminative vocabulary items per-question. Answer-clouds (also known as tag-clouds or word-clouds) are a novelty visual representation of text data. Shown answers are single vocabulary items per-question in the Phendo app (full list of answers are provided in the first section of the Supplementary Results), where the color indicates the question type, and the font size reflects the importance of each item in the learned phenotype. This format is commonly used for quickly presenting the most prominent terms to determine its relative prominence in the data. Due to the different vocabulary sizes for each considered Phendo question, comparing posteriors with different support is challenging. In this work, the answer-clouds are plotted by conditioning on the vocabulary items that cover 80% of the posterior mass per-question. As such, the relative size of visualized responses match the proportions of the conditional probability ratios. This allows for a more clear identification of the most salient responses per-question, even with different sized vocabularies per-question.

Agreement between expert clustering and unsupervised phenotyping

We randomly selected 40 participants from the cohort, who had at least 30 days of activity with more than 100 tracked observations, for the experts to review. We selected 8 participants per phenotype that had high posterior probability (above 95% percent) of being assigned to a unique phenotype, and 8 additional participants for which the model output was uncertain (where at least 80% of the probability of phenotype assignment was shared by more than one subtype). The participant responses collected by the Phendo app were reviewed by two endometriosis experts, who were asked to group them based on their clinical understanding of patient signs and symptoms. The guidelines for the experts to review were written separately from the execution of the proposed unsupervised modeling algorithm. Specifically, endometriosis experts where instructed to categorize participants into groups according to their clinical understanding of patient signs and symptoms, i.e., following their endometriosis knowledge and expertise. As a secondary task, they were asked to provide an explanation of how they used the available data (i.e., the self-tracked responses to the Phendo questions, which are different from state-of-the-art clinical data) to group the participants, and how such data supported their understanding of the disease. The assignments by the experts and the model are compared via confusion matrices.

Associations

We compute statistical associations between phenotypes learned by the model and responses to the questions from the WERF EPHect questionnaire³². After learning the model, participants were assigned to phenotypes based on the maximum per-phenotype posterior probability, and associations computed between responses to the WERF responses of participants within each subtype. For categorical questions, the chi-square test of independence of variables in the contingency table per phenotype was computed⁸⁶. For questions with continuous outcomes, the Kruskal—Wallis H-test for independent samples per phenotype was computed⁸⁷. This is a non-parametric version of ANOVA that works on 2 or more independent samples, which may have different sizes, and tests the null hypothesis that the population median of all of the groups are equal. We report correlations at a significance level of 0.05.

Ethics

Data collection and the analysis presented in this work were carried out under Research Protocol #AAAQ9812 approved by Columbia University IRB. We obtained signed informed consent from all participants in the study.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

Please contact the authors to obtain access to a de-identified version of the data that supports the findings of this study through a data-use agreement.

Code availability

Our code has been developed using open source tools in Python with common statistical libraries (e.g., NumPy, SciPy and Pandas). The code required for processing the data and producing the presented results is available in the public GitHub repository https://github.com/iurteaga/phendo.

References

Carmina, E. & Lobo, R. A. Polycystic ovary syndrome (PCOS): arguably the most common endocrinopathy is associated with significant morbidity in women. J. Clin. Endocrinol. Metab. 84, 1897–1899 (1999).
Article CAS PubMed Google Scholar
Giudice, L. C. Endometriosis. N. Engl. J. Med. 362, 2389–2398 (2010).
Article CAS PubMed PubMed Central Google Scholar
Barbosa, C. P., Souza, A. B. D., Bianco, B. & Christofolini, D. The effect of hormones on endometriosis development. Minerva Ginecologica: A J. Obstet. Gynecol. 63, 375–386 (2011).
Google Scholar
Simoens, S. et al. The burden of endometriosis: costs and quality of life of women with endometriosis and treated in referral centres. Hum. Reprod. 27, 1292–1299 (2012).
Article PubMed Google Scholar
Wheeler, J. Epidemiology of endometriosis-associated infertility. J. Reprod. Med. 34, 41–46 (1989).
CAS PubMed Google Scholar
Kvaskoff, M. et al. Endometriosis: a high-risk population for major chronic diseases? Hum. Reprod. Update 21, 500–516 (2015).
Article PubMed PubMed Central Google Scholar
Vercellini, P. et al. Association between endometriosis stage, lesion type, patient characteristics and severity of pelvic pain symptoms: a multivariate analysis of over 1000 patients. Hum. Reprod. 22, 266–271 (2007).
Article CAS PubMed Google Scholar
Brosens, I. & Brosens, J. Redefining endometriosis: Is deep endometriosis a progressive disease? Hum. Reprod. 15, 1–3 (2000).
Article CAS PubMed Google Scholar
Johnson, N. P. et al. World Endometriosis Society consensus on the classification of endometriosis. Hum. Reprod. 32, 315–324 (2017).
Article PubMed Google Scholar
Hua, A. et al. Accelerometer-based predictive models of fall risk in older women: a pilot study. npj Digital Med. 1, 25 (2018).
Gresham, G. et al. Wearable activity monitors to assess performance status and predict clinical outcomes in advanced cancer patients. npj Digital Med. 1, 27 (2018).
Egger, H. L. et al. Automatic emotion and attention analysis of young children at home: a ResearchKit autism feasibility study. npj Digital Med. 1, 20 (2018).
Torous, J. et al. Characterizing the clinical relevance of digital phenotyping data quality with applications to a cohort with schizophrenia. npj Digital Med. 1, 15 (2018).
ResearchKit: open source framework to create medical research apps. http://researchkit.org/ (2020).
ResearchStack: An SDK for building research study apps on Android. http://researchstack.org/ (2020).
Byambasuren, O., Sanders, S., Beller, E. & Glasziou, P. Prescribable mHealth apps identified from an overview of systematic reviews. npj Digital Med. 1, 12 (2018).
Pratap, A. et al. Indicators of retention in remote digital health studies: a cross-study evaluation of 100,000 participants. npj Digital Med. 3, 21 (2020).
Bot, B. M. et al. The mPower study, Parkinson disease mobile data collected using ResearchKit. Sci. Data 3, 160011 (2016).
Article CAS PubMed PubMed Central Google Scholar
Chan, Y.-F. Y. et al. The Asthma Mobile Health Study, a large-scale clinical observational study using ResearchKit. Nat. Biotechnol. 35, 354 (2017).
Article CAS PubMed PubMed Central Google Scholar
Althoff, T. Population-scale pervasive health. IEEE Pervasive Comput. 16, 75–79 (2017).
Article PubMed PubMed Central Google Scholar
Li, K. et al. Characterizing physiological and symptomatic variation in menstrual cycles using self-tracked mobile health data. npj Digital Med. In press (2020).
Zhan, A. et al. Using smartphones and machine learning to quantify parkinson disease severity: the mobile parkinson disease score. JAMA Neurol. 75, 876–880 (2018).
Article PubMed PubMed Central Google Scholar
Althoff, T. et al. Large-scale physical activity data reveal worldwide activity inequality. Nature 547 (2017).
Webster, D. E. et al. The Mole Mapper Study, mobile phone skin imaging and melanoma risk data collected using ResearchKit. Scientific Data 4, 170005 (2018).
Dagum, P. Digital biomarkers of cognitive function. npj Digital Med. 1, 10 (2018).
Smets, E. et al. Large-scale wearable data reveal digital phenotypes for daily-life stress detection. npj Digital Med. 1, 67 (2018).
Ata, R. et al. Clinical validation of smartphone-based activity tracking in peripheral artery disease patients. npj Digital Med. 1, 66 (2018).
Elhadad, N. Phendo app available at Apple’s App store. https://itunes.apple.com/us/app/phendo/id1145512423 (2020).
Elhadad, N. Phendo app available at Google Play. https://play.google.com/store/apps/details?id=com.appliedinformaticsinc.phendo (2020).
McKillop, M., Voigt, N., Schnall, R. & Elhadad, N. Exploring self-tracking as a participatory research activity among women with endometriosis. J. Participatory Med. 8, e17 (2016).
McKillop, M., Mamykina, L. & Elhadad, N. Designing in the dark: eliciting self-tracking dimensions for understanding enigmatic disease. In Proc. 2018 CHI Conference on Human Factors in Computing Systems 565 https://doi.org/10.1145/3173574.3174139 (ACM, 2018).
Vitonis, A. F. et al. World Endometriosis Research Foundation Endometriosis Phenome and Biobanking Harmonization Project: II. Clinical and covariate phenotype data collection in endometriosis research. Fertil. Steril. 102, 1223–1232 (2014).
Article PubMed PubMed Central Google Scholar
Blei, D. M., Ng, A. Y. & Jordan, M. I. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003).
Google Scholar
Canavan, C., West, J. & Card, T. The epidemiology of irritable bowel syndrome. Clin. Epidemiol. 6, 71 (2014).
PubMed PubMed Central Google Scholar
Staii, A., Mirocha, S., Todorova-Koteva, K., Glinberg, S. & Jaume, J. C. Hashimoto thyroiditis is more frequent than expected when diagnosed by cytology which uncovers a pre-clinical state. Thyroid Res. 3, 11 (2010).
Article PubMed PubMed Central Google Scholar
Jones, G. T. et al. The prevalence of fibromyalgia in the general population: a comparison of the American College of Rheumatology 1990, 2010, and modified 2010 classification criteria. Arthritis Rheumatol. 67, 568–575 (2015).
Article PubMed Google Scholar
Remes, O., Brayne, C., Linde, R. V. D. & Lafortune, L. A systematic review of reviews on the prevalence of anxiety disorders in adult populations. Brain Behav. 6, e00497 (2016).
Article PubMed PubMed Central Google Scholar
Halldin, C. N., Doney, B. C. & Hnizdo, E. Changes in prevalence of chronic obstructive pulmonary disease and asthma in the US population and associated risk factors. Chronic Respiratory Dis. 12, 47–60 (2015).
Article Google Scholar
Johnston, S., Brenu, E. W., Staines, D. & Marshall-Gradisnik, S. The prevalence of chronic fatigue syndrome/myalgic encephalomyelitis: a meta-analysis. Clin. Epidemiol. 5, 105 (2013).
Article PubMed PubMed Central Google Scholar
Centers for Disease Control and Prevention and others. Depression in the US household population, 2009–2012 (National Center for Health Statistics, Division of Health Interview Statistics, 2014).
Victor, T., Hu, X., Campbell, J., Buse, D. & Lipton, R. Migraine prevalence by age and sex in the United States: a life-span study. Cephalalgia 30, 1065–1072 (2010).
Article CAS PubMed Google Scholar
Jalilian, A. et al. Prevalence of polycystic ovary syndrome and its associated complications in Iranian women: A meta-analysis. Iran. J. Reprod. Med. 13, 591 (2015).
PubMed PubMed Central Google Scholar
Holgate, S. T., Komaroff, A. L., Mangan, D. & Wessely, S. Chronic fatigue syndrome: understanding a complex illness. Nat. Rev. Neurosci. 12, 539 (2011).
Article CAS PubMed Google Scholar
Chiantera, V., Abesadze, E. & Mechsner, S. How to understand the complexity of endometriosis-related pain. J. Endometr. Pelvic Pain. Disord. 9, 30–38 (2017).
Article Google Scholar
Ek, M. et al. Gastrointestinal symptoms among endometriosis patients: a case-cohort study. BMC Women’s Health 15, 59 (2015).
Article PubMed PubMed Central CAS Google Scholar
Luscombe, G. M., Markham, R., Judio, M., Grigoriu, A. & Fraser, I. S. Abdominal bloating: an under-recognized endometriosis symptom. J. Obstet. Gynaecol. Can. 31, 1159–1171 (2009).
Article PubMed Google Scholar
Heitmann, R. J., Langan, K. L., Huang, R. R., Chow, G. E. & Burney, R. O. Premenstrual spotting of ≥2 days is strongly associated with histologically confirmed endometriosis in women with infertility. Am. J. Obstet. Gynecol. 211, 358–e1 (2014).
Article PubMed Google Scholar
Pope, C. J., Sharma, V., Sharma, S. & Mazmanian, D. A systematic review of the association between psychiatric disturbances and endometriosis. J. Obstet. Gynaecol. Can. 37, 1006–1015 (2015).
Article PubMed Google Scholar
Laganà, A. S. et al. Anxiety and depression in patients with endometriosis: impact and management challenges. Int. J. Women’s Health 9, 323 (2017).
Article Google Scholar
Yang, M.-H. et al. Women with endometriosis are more likely to suffer from migraines: a population-based study. PLoS ONE 7, e33941 (2012).
Article CAS PubMed PubMed Central Google Scholar
Mu, F. et al. Association between endometriosis and hypercholesterolemia or hypertensionnovelty and significance. Hypertension 70, 59–65 (2017).
Article CAS PubMed Google Scholar
Holoch, K. J. et al. Coexistence of polycystic ovary syndrome and endometriosis in women with infertility. J. Endometr. Pelvic Pain. Disord. 6, 79–83 (2014).
Google Scholar
Sinaii, N., Cleary, S. D., Ballweg, M., Nieman, L. K. & Stratton, P. High rates of autoimmune and endocrine disorders, fibromyalgia, chronic fatigue syndrome and atopic diseases among women with endometriosis: a survey analysis. Hum. Reprod. 17, 2715–2724 (2002).
Article CAS PubMed Google Scholar
Ramin-Wright, A. et al. Fatigue–a symptom in endometriosis. Hum. Reprod. 33, 1459–1465 (2018).
Hadfield, R., Mardon, H., Barlow, D. & Kennedy, S. Delay in the diagnosis of endometriosis: a survey of women from the USA and the UK. Hum. Reprod. 11, 878–880 (1996).
Article CAS PubMed Google Scholar
Arruda, M., Petta, C., Abrao, M. & Benetti-Pinto, C. Time elapsed from onset of symptoms to diagnosis of endometriosis in a cohort study of Brazilian women. Hum. Reprod. 18, 756–759 (2003).
Article CAS PubMed Google Scholar
Greene, R., Stratton, P., Cleary, S. D., Ballweg, M. L. & Sinaii, N. Diagnostic experience among 4,334 women reporting surgically diagnosed endometriosis. Fertil. Steril. 91, 32–39 (2009).
Article PubMed Google Scholar
Villa, G. et al. Relationship between site and size of bladder endometriotic nodules and severity of dysuria. J. Minim. invasive Gynecol. 14, 628–632 (2007).
Article PubMed Google Scholar
Denny, E. & Mann, M. C. H. A clinical overview of endometriosis: a misunderstood disease. Br. J. Nurs. 16, 1112–1116 (2007).
Article PubMed Google Scholar
Kolodziej, A., Krajewski, W., Dolowy, L. & Hirnle, L. Urinary tract endometriosis. Urol. J. 12, 2213–2217 (2015).
PubMed Google Scholar
Signorello, L. B., Harlow, B. L., Cramer, D. W., Spiegelman, D. & Hill, J. A. Epidemiologic determinants of endometriosis: a hospital-based case-control study. Ann. Epidemiol. 7, 267–274 (1997).
Article CAS PubMed Google Scholar
Wei, M., Cheng, Y., Bu, H., Zhao, Y. & Zhao, W. Length of menstrual cycle and risk of endometriosis: a meta-analysis of 11 case–control studies. Medicine 95, e2922 (2016).
Darrow, S. L. et al. Menstrual cycle characteristics and the risk of endometriosis. Epidemiology 4, 135–142 (1993).
Ferrero, S. et al. Quality of sex life in women with endometriosis and deep dyspareunia. Fertil. Steril. 83, 573–579 (2005).
Article PubMed Google Scholar
Hummelshoj, L., Graaff, A. D., Dunselman, G. & Vercellini, P. Let’s talk about sex and endometriosis. J. Fam. Plann Reprod. Health Care 40, 8–10 (2014).
Article PubMed Google Scholar
Shabanov, S. et al. When sex hurts the couple: the case of endometriosis. Rev. Med. Suisse 13, 612–616 (2017).
PubMed Google Scholar
Denny, E. & Mann, C. H. Endometriosis-associated dyspareunia: the impact on women’s lives. BMJ Sex. Reprod. Health 33, 189–193 (2007).
Google Scholar
Vercellini, P. et al. Surgical versus medical treatment for endometriosis-associated severe deep dyspareunia: I. Effect on pain during intercourse and patient satisfaction. Hum. Reprod. 27, 3450–3459 (2012).
Article CAS PubMed Google Scholar
Schrager, S., Falleroni, J. & Edgoose, J. Evaluation and treatment of endometriosis. Am. Fam. Physician 87, 107–113 (2013).
PubMed Google Scholar
Pacchiarotti, A. et al. Pain in the upper anterior-lateral part of the thigh in women affected by endometriosis: study of sensitive neuropathy. Fertil. Steril. 100, 122–126 (2013).
Article PubMed Google Scholar
Jia, S.-Z., hua Leng, J., Shi, J.-H., Sun, P.-R. & Lang, J.-H. Health-related quality of life in women with endometriosis: a systematic review. J. Ovarian Res. 5, 29 (2012).
Article PubMed PubMed Central Google Scholar
Culley, L. et al. The social and psychological impact of endometriosis on women’s lives: a critical narrative review. Hum. Reprod. Update 19, 625–639 (2013).
Article PubMed Google Scholar
Giuliani, M. et al. Quality of life and sexual satisfaction in women suffering from endometriosis: An Italian preliminary study. Sexologies 25, e12–e19 (2016).
Article Google Scholar
Asghari, S., Valizadeh, A., Aghebati-Maleki, L., Nouri, M. & Yousefi, M. Endometriosis: Perspective, lights, and shadows of etiology. Biomed. Pharmacother. 106, 163–174 (2018).
Article PubMed Google Scholar
Baranov, V. S., Ivaschenko, T. E., Liehr, T. & Yarmolinskaya, M. I. Systems genetics view of endometriosis: a common complex disorder. Eur. J. Obstet. Gynecol. Reprod. Biol. 185, 59–65 (2015).
Article PubMed Google Scholar
Holdsworth-Carson, S. J. et al. The association of body mass index with endometriosis and disease severity in women with pain. J. Endometriosis Pelvic Pain Disorders https://doi.org/10.1177/2284026518773939 (2018).
Lee, S.-S., do Han, K. & Joo, Y.-H. Association of perceived tinnitus with duration of hormone replacement therapy in Korean postmenopausal women: a cross-sectional study. BMJ Open 7, e013736 (2017).
Article PubMed PubMed Central Google Scholar
Hall, G. & Phillips, T. J. Estrogen and skin: the effects of estrogen, menopause, and hormone replacement therapy on the skin. J. Am. Acad. Dermatol. 53, 555–568 (2005).
Article PubMed Google Scholar
Henderson, J. et al. Granite: diversified, sparse tensor factorization for electronic health record-based phenotyping. In 2017 IEEE International Conference on Healthcare Informatics (ICHI), 214–223 https://ieeexplore.ieee.org/document/8031150 (IEEE, 2017).
Schuler, A. et al. Discovering patient phenotypes using generalized low rank models. In Biocomputing 2016: Proceedings of the Pacific Symposium, 144–155 https://pubmed.ncbi.nlm.nih.gov/26776181/ (World Scientific, 2016).
Ho, J. C., Ghosh, J. & Sun, J. Marble: high-throughput phenotyping from electronic health records via sparse nonnegative tensor factorization. In Proc. 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 14, 115–124 https://doi.org/10.1145/2623330.2623658 (Association for Computing Machinery, New York, NY, USA, 2014).
Pivovarov, R. et al. Learning probabilistic phenotypes from heterogeneous EHR data. J. Biomed. Inform. 58, 156–165 (2015).
Article PubMed PubMed Central Google Scholar
Urteaga, I., McKillop, M., Lipsky-Gorman, S. & Elhadad, N. Phenotyping endometriosis through mixed membership models of self-tracking data. Preprint at https://arxiv.org/abs/1811.03431 (2018).
Blei, D. M. Probabilistic topic models. Commun. ACM 55, 77–84 (2012).
Article Google Scholar
Wallach, H. M., Murray, I., Salakhutdinov, R. & Mimno, D. Evaluation methods for topic models. In Proc. 26th Annual International Conference on Machine Learning, ICML ’09, 1105–1112 (ACM, New York, NY, USA, 2009). https://doi.org/10.1145/1553374.1553515.
Cressie, N. & Read, T. R. C. Multinomial goodness-of-fit tests. J. R. Statistical Soc. Ser. B (Methodol.) 46, 440–464 (1984).
Google Scholar
Kruskal, W. H. & Wallis, W. A. Use of ranks in one-criterion variance analysis. J. Am. Stat. Assoc. 47, 583–621 (1952).
Article Google Scholar

Download references

Acknowledgements

The authors are deeply grateful to all study participants whose de-identified data have been used for this study, and acknowledge the many endometriosis advocacy groups that helped in recruiting them. This work was supported by the Endometriosis Foundation of America, the National Science Foundation award SCH #1344668, National Library of Medicine award R01 LM013043, and the National Library of Medicine award T15 LM007079. We also thank Dr. Shadi Safar Gholi and Dr. Arnold Advincula for their time and expertise, and Sharon Lipsky Gorman for data preparation.

Author information

Authors and Affiliations

Department of Applied Physics and Applied Mathematics, Columbia University, New York, NY, 10027, USA
Iñigo Urteaga
Data Science Institute, Columbia University, New York, NY, 10027, USA
Iñigo Urteaga & Noémie Elhadad
Department of Biomedical Informatics, Columbia University, New York, NY, 10032, USA
Mollie McKillop & Noémie Elhadad

Authors

Iñigo Urteaga
View author publications
You can also search for this author in PubMed Google Scholar
Mollie McKillop
View author publications
You can also search for this author in PubMed Google Scholar
Noémie Elhadad
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

I.U., M.M., and N.E. conceived the proposed research and designed the experiments. I.U. processed the dataset, conducted the experiments, and wrote the first draft of the manuscript. M.M. and N.E. contributed to the writing of the manuscript. All authors read, reviewed, and approved the manuscript.

Corresponding author

Correspondence to Noémie Elhadad.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Urteaga, I., McKillop, M. & Elhadad, N. Learning endometriosis phenotypes from patient-generated data. npj Digit. Med. 3, 88 (2020). https://doi.org/10.1038/s41746-020-0292-9

Download citation

Received: 26 November 2019
Accepted: 26 May 2020
Published: 24 June 2020
DOI: https://doi.org/10.1038/s41746-020-0292-9

This article is cited by

Self-report symptom-based endometriosis prediction using machine learning
- Anat Goldstein
- Shani Cohen
Scientific Reports (2023)
AI in health and medicine
- Pranav Rajpurkar
- Emma Chen
- Eric J. Topol
Nature Medicine (2022)
Machine learning algorithms as new screening approach for patients with endometriosis
- Sofiane Bendifallah
- Anne Puchar
- Emile Daraï
Scientific Reports (2022)

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Patient-generated data

Patient-generated data—Phendo self-tracking data

Patient-generated data—WERF survey data

Unsupervised phenotype modeling

Unsupervised phenotype modeling—Learned endometriosis phenotypes

Unsupervised phenotype modeling—learned participant phenotypic assignments

Endometriosis phenotype evaluation

Phenotype evaluation—agreement between expert clustering and phenotyping

Phenotype evaluation—associations between learned phenotypes and survey answers

Discussion

Methods

Unsupervised phenotyping model

Phenotype visualization

Agreement between expert clustering and unsupervised phenotyping

Associations

Ethics

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links