Abstract
Recently, many phenotyping algorithms for high-throughput cohort identification have been developed. Prospective genome cohort studies are critical resources for precision medicine, but there are many hurdles in the precise cohort identification. Consequently, it is important to develop phenotyping algorithms for cohort data collection. Hypertensive disorders of pregnancy (HDP) is a leading cause of maternal morbidity and mortality. In this study, we developed, applied, and validated rule-based phenotyping algorithms of HDP. Two phenotyping algorithms, algorithms 1 and 2, were developed according to American and Japanese guidelines, and applied into 22,452 pregnant women in the Birth and Three-Generation Cohort Study of the Tohoku Medical Megabank project. To precise cohort identification, we analyzed both structured data (e.g., laboratory and physiological tests) and unstructured clinical notes. The identified subtypes of HDP were validated against reference standards. Algorithms 1 and 2 identified 7.93% and 8.08% of the subjects as having HDP, respectively, along with their HDP subtypes. Our algorithms were high performing with high positive predictive values (0.96 and 0.90 for algorithms 1 and 2, respectively). Overcoming the hurdle of precise cohort identification from large-scale cohort data collection, we achieved both developed and implemented phenotyping algorithms, and precisely identified HDP patients and their subtypes from large-scale cohort data collection.
Similar content being viewed by others
Introduction
In recent years, many phenotyping algorithms have been developed for scalable and high-throughput cohort identification from electronic health records (EHRs)1,2,3,4,5 for genome- and phenome-wide association studies6,7 and further biomedical analyses8. In past studies, many phenotyping algorithms have been developed using EHRs9,10,11,12,13. Phenotyping is also expected to stratify patients into subgroups that reflect the molecular and clinical diversity of diseases14, which is expected to have substantial potential to contribute to risk estimation15, diagnostic support and therapeutic development16 and the determination of the best available care for each patient based on their stratified subgroups.
Prospective genome cohort studies are known as one of the most important resources for developing precision medicine17,18. In these cohort studies, a wide range of data, including lifestyle, genomic, molecular, medical record and phenotypic data19,20,21, are collected and analyzed to elucidate clinical problems. The Birth and Three-Generation Cohort Study (BirThree Cohort Study)19 of the Tohoku Medical Megabank (TMM) project is a large-scale prospective genome cohort study that recruited more than 70,000 subjects, including more than 20,000 pregnant women and their family members in 48 hospitals in Miyagi Prefecture from regional populations.
Hypertensive disorders of pregnancy (HDP) is one of the leading cause of maternal morbidity and mortality, affecting approximately 2–8%22 of all pregnancies, and is a clinical syndrome that mainly includes three subgroups: gestational hypertension (GH), preeclampsia (PE), and superimposed preeclampsia (SPE)23. It is known that there are major differences between the subgroups in the likelihood to develop severe complications and in pathogenesis24,25,26,27,28,29. However, accurately diagnosing these subgroups is quite difficult in clinical practice, and the same treatment has been applied across the several subgroups. In addition to the subgrouping, further classification into early onset (EO) and late onset (LO) based on the timing of onset has gained attention because of the different clinical forms with different pathophysiological features30,31. Because of these major differences in the subgroups of HDP, the precise classification of patient diseases into subgroups by phenotyping algorithms is necessary for genomic, epidemiologic and other biomedical analyses.
In hospital-based clinical research, clinical research coordinators (CRC) are able to identify patients precisely by creating a case report form (CRF) using clinical data, including clinical notes. However, there are severe limitations in the phenotyping of patients in large-scale cohort studies that lack of complete unstructured clinical notes and have limited accessible EHR data for partial subjects due to the lack of linkages in cohort data collection because the subjects of the cohort study are not patients of a hospital but are part of the regional population. Recently, the UK Biobank32 and All of Us33, which are large-scale prospective genome cohort studies, collected structured data on EHRs, such as diagnoses, laboratory results, medications, and billing codes, using record linkages; however, collection of clinical notes were still not achieved. Because of this limitation, it is not realistic to perform phenotyping of hundreds of thousands of subjects in a large-scale cohort using the conventional approach in clinical research. Consequently, it is necessary to develop new high-throughput and objective phenotyping algorithms that exploit clinical notes.
In this study, we developed two rule-based HDP phenotyping algorithms and programmatically implemented them that follow American and Japanese clinical guidelines because there are no internationally standardized guidelines for HDP, but domestic guidelines have been developed34,35,36,37. The developed algorithms were applied to phenotype approximately 23,000 pregnant women in the BirThree cohort. We also compared the phenotyped subgroups with a clinician chart review to evaluate the consistency of the clinical concepts between the developed algorithms and the clinical guidelines.
Methods
Development of hypertensive disorders of pregnancy (HDP) phenotyping algorithm 1
We developed HDP phenotyping algorithm 1 according to the American College of Obstetricians and Gynecologists (ACOG) guidelines23. Algorithm 1 detects HDP patients based on hypertensive disease history, blood pressure (BP), proteinuria (PU), timing of onset and PE-related clinical conditions (see “Details of the identification rules for the HDP subgroups in algorithm 1” section in the Supplemental Methods).
PU in early pregnancy is caused by several possibilities both related and unrelated to HDP, such as some degree of renal insufficiency and chronic kidney disease (CKD). To avoid overestimating the HDP group, we implemented rules to exclude subjects with PU in early pregnancy from the HDP group.
Development of hypertensive disorders of pregnancy (HDP) phenotyping algorithm 2
We developed HDP phenotyping algorithm 2 according to the guidelines of the Japan Society of Obstetrics and Gynecology (JSOG) (see “Details of the identification rules for the HDP subgroups in algorithm 2” section in the Supplemental Methods). In algorithm 2, hypertensive subjects without PU were assigned to SPE or PE when the following conditions were met: maternal organ dysfunction (see Supplementary Table 1) or light-for-date. (see “Selection of subjects with light-for-date” section in the Supplemental Methods). The patients having maternal organ dysfunction were determined by analyzing clinical notes (see “Selection of subjects with maternal organ dysfunction” section in the Supplemental Methods). More details about the process used to reach the target conditions are provided in the “Details of the process used to analyze unstructured clinical notes” section in the Supplemental Methods.
Implementation of the developed phenotyping algorithms
The developed phenotyping algorithms according to American and Japanese clinical guidelines were implemented by Python 3.8.10 and Perl 5.16.3 (https://github.com/ogishimalab/HDP_phenotyping_algorithms). The implementations take no arguments and take both data for phenotyping and the lists of subjects who have specific phenotype about exclusion and inclusion criteria of HDP and their subtypes. The lists were required to prepare before phenotyping by analysis of clinical notes performed by such as pattern matching based on rules (see “Details of the processes used to analyze unstructured clinical notes” and “Selection of subjects with light-for-date” in Supplemental Methods).
Data sources used to apply the developed algorithms
We selected research subjects from approximately 23,000 pregnant women who were included in the BirThree Cohort study, which is a population-based prospective cohort study. We excluded subjects who did not have medical records at delivery, who did not have BP and PU data, who did not have a live birth and who withdrew from the study. Using these criteria, we included 22,452 research subjects in the dataset. The baseline characteristics of all pregnant women in the BirThree Cohort study are provided in a previous report38.
All data in our dataset were transcribed from the clinical records of usual care by trained genome medical research coordinators (GMRCs) from the interview conducted at the first visit, prenatal checkups and records at delivery (left side of Fig. 1). The average number of visits for prenatal checkups and measurements of BP and PU for pregnant women are shown in Supplementary Table 2. All items used in our phenotyping algorithms are listed in Supplementary Table 3.
Comparison of phenotyped subgroups and diagnoses
We conducted a clinician chart review to compare the subgroups phenotyped by the algorithms with the diagnoses. The comparison was performed to evaluate the consistency of clinical concepts between the developed algorithms and the clinical guidelines. A clinician chart review was performed based on the American guidelines23 for 252 subjects who met the following criteria: (1) subjects recruited at Tohoku University Hospital between September 2015 and November 2016 and (2) subjects who were able to access their medical records. The clinician chart review was conducted by two obstetricians, a mid-career and a senior obstetrician. If the subgroups assigned by the two doctors differed, the subgroups were determined by discussion. The chart review was performed blindly and annotated in a binary manner for each subgroup of HDP, CH and normotensive. The baseline characteristics of the 252 subjects are shown in Supplementary Table 4.
We calculated positive predictive values (PPVs), negative predictive values (NPVs), accuracies, sensitivities and specificities to compare our algorithms against the diagnoses. To calculate sensitivity and specificity, we identified true positive (TP), true negative (TN), false positive (FP), false negative (FN) by comparison of identified subgroups between phenotyping algorithms and diagnosis of 252 subjects. Based on the four indices, we identified sensitivity and specificity as TP/(TP + FN) and TN/(TN + FP). We identified accuracy using scikit-learn in python3. The diagnosed subgroups of the 252 subjects are listed in Supplementary Table 5. In this study, there were differences in data coverage between the cohort data for phenotyping and the diagnosis dataset that only diagnosis dataset included both intrapartum and postpartum data and ultrasound measurements, in addition to cohort data (right side of Fig. 1). The all items used for the diagnoses are listed in Supplementary Table 6.
Ethics declarations
This study was approved by the ethical committee of Tohoku Medical Megabank Organization, Tohoku University. We confirm that all methods took place in accordance with relevant guidelines and regulations, under informed consent from all participants.
Results
The developed HDP phenotyping algorithms
The rules of the developed HDP phenotyping algorithms 1 and 2 are shown in Fig. 2a and b, respectively. The detailed results of the identification of subjects with a hypertensive disease history and maternal organ dysfunction by analyzing unstructured clinical notes are described in the “Details of the identified subjects with a hypertensive disease history and maternal organ dysfunction” section in the Supplemental Results. The detailed numbers of HDP subgroups phenotyped by the developed phenotyping algorithms are described in “The number of phenotyped subgroups of HDP” section.
The number of phenotyped subgroups of HDP
Of the 22,452 research subjects, 1781 (7.93%) were phenotyped as having HDP by phenotyping algorithm 1. Among the subgroups of HDP, 890 subjects (3.96%) had GH, 304 (1.35%) had SPE, and 587 (2.61%) had PE. According to phenotyping with algorithm 2, 1813 (8.08%) subjects had HDP. Among the subgroups of HDP, 825 subjects (3.67%) had GH, 336 (1.50%) had SPE, and 652 (2.90%) had PE (Table 1). Compared with algorithm 1, fewer subjects had GH or CH, while more had PE or SPE according to algorithm 2 because the algorithm 2 classifies hypertensive subjects without PU into PE or SPE based on the criteria such as subjects having epigastralgia, HELLP syndrome, eclampsia and FGR, in addition to the criteria of algorithm 1.
Among the subgroups phenotyped by algorithm 1 and 2, 97 (0.44%) subjects were classified into different subgroups of HDP (Supplementary Table 7).
The causes of the differences in the phenotyped HDP subgroups between algorithm 1 and 2 are shown in Supplementary Table 8. The sum of the subjects was not equal to 97 because one person may have had multiple symptoms and differences in the timing of the symptoms.
Comparison of the subgroups identified by HDP phenotyping algorithms with diagnoses
Of the 252 subjects included in the diagnosis dataset, 36 (14.2%) were diagnosed as HDP. The subgroups of the 252 subjects are shown in Supplementary Table 5. The performance of phenotyping algorithm 1 and 2 compared to the diagnoses is shown in Tables 2 and 3, respectively. Comparisons for GH EO were not applicable because none of the 252 subjects was diagnosed with GH EO.
For phenotyping of HDP overall, the PPVs, NPVs, accuracies, and specificities of both algorithms 1 and 2 were high (0.96, 0.96, 0.96, and 1.00 for the PPVs, NPVs, accuracy and specificity of algorithm 1 and 0.90, 0.96, 0.95, and 0.99 for the PPV, NPV, accuracy and specificity of algorithm 2, respectively), and the sensitivities were modest (0.72 and 0.75 for algorithm 1 and 2, respectively). Of the 252 subjects in the diagnosis datasets, 15 and 16 subjects were classified into different subgroups by algorithm 1 and algorithm 2, respectively. Of the 15 subjects with different subgroups by algorithm 1, 10 of the differences were caused by missing intrapartum and postpartum data, 1 was caused by a missing creatinine level, 1 was caused by transcription error and 3 were caused by headache that was not accepted as an unexplained new-onset headache that was unresponsive in the chart review (see Supplementary Table 9).
Of the 16 subjects who showed differences by phenotyping algorithm 2, three were new because the subgroups phenotyped by algorithm 1 matched the diagnosed subgroups (see Supplementary Table 10).
Discussion
In this study, we developed two HDP phenotyping algorithms, algorithm 1 and 2. Algorithm 1 identifies HDP patients based on the hypertensive disease history, BP, PU, timing of onset and PE-related clinical conditions. In addition to the factors included in algorithm 1, algorithm 2 conducts phenotyping by considering maternal organ dysfunction and light-for-date, which originate from placental dysfunction in early pregnancy39. In these phenotyping results, 97 (0.44%) subjects were phenotyped into different subgroups by the two algorithms, and they actually had clinical conditions related to maternal organ dysfunction, such as epigastralgia and light-for-date (Supplementary Table 8). The choice of which algorithm to use depends on whether the phenotyping will be performed with the clinical concepts of the American or Japanese clinical guidelines used in algorithm 1 and 2, respectively. The Japanese clinical guidelines of HDP have been repeatedly updated in the past35,36,40,41 and will be continually updated according to the progress of research to understand the pathogenesis of HDP. Therefore, algorithm 2 will be updated continuously according to updates to the Japanese guidelines.
In this study, the number of identified subjects with a hypertensive disease history (115; 0.51%) and underlying hepatic or renal disorders (323; 1.44%) by analysis of unstructured clinical notes was slightly smaller than that in previous reports42,43. This result suggests the possibility of the underestimation of subjects with these conditions because of differences between the diagnoses and the self-reported interviews, which were the sources of the transcribed data regarding disease history.
Of the 22,452 subjects in the BirThree cohort study, the proportions of the phenotyped HDP patients by algorithms 1 and 2 were almost identical to the previously reported prevalence rate of HDP38,44,45.
To evaluate the performances of our developed phenotyping algorithms, we performed chart review by two obstetricians for 252 among 22,452 research subjects (see “Comparison of phenotyped subgroups and diagnoses” section). Among 252 subjects, 203 (80.56%) were identified as normotensive, 13 (5.16%) were as CH, and 36 (14.29%) were as HDP patients. For HDP subgroups, we identified 9 subjects (3.57%) as GH, 15 (5.95%) as PE, and 12 (4.76%) as SPE (Supplementary Table 5).
The prevalence of HDP in the diagnosis dataset was 14.29%, which was higher than that in the full cohort (Supplementary Table 5). The baseline characteristics of the diagnosis dataset were higher risk of HDP in terms such as BMI and age than the full cohort (Supplementary Table 4). These differences may have occurred because the subjects in the diagnosis dataset were recruited at Tohoku University Hospital, which is a leading hospital that actively accepts high-risk cases, including patients with HDP.
In the evaluation of the performances of our developed algorithms, our phenotyping algorithms showed high PPVs, NPVs, accuracies and specificities and modest sensitivities, despite the limitation of data availability of intrapartum and postpartum data (Table 2, 3). In the results comparison between algorithm 1 and diagnosis, 15 of the 252 subjects in the diagnosis dataset were phenotyped into different subgroups of HDP by algorithm 1 (Supplementary Table 9). All of these 15 differences will be resolved by linking EHRs and cohort data19 because all differences were caused by lacking intrapartum and postpartum data and detailed description about headaches in cohort data. In the comparison for algorithm 2, 16 of 252 subjects were phenotyped into different subgroups of HDP (Supplementary Table 10). In particular, three of the sixteen differences are caused by fit to criteria of algorithm 2 added to algorithm 1, and these are not included in the diagnostic criteria. These results show that the major cause of miss-phenotyping was not inadequacy of phenotyping algorithms but differences of data sources between phenotyping and diagnosis. Based on these results, we concluded that there was consistency in clinical concepts between phenotyping algorithms and clinical guidelines. Furthermore, these results showed that our phenotyping algorithms are ready to be applied in HDP phenotyping tasks.
We used the transcribed medical records of the BirThree Cohort study as data sources for the phenotyping algorithms. The advantage of using the BirThree Cohort study data is the high representativeness of over 70% of the pregnant women in the regional population during the study period. This study is premised on the representativeness of the data sources because we evaluated the performances of our algorithms by comparing the number of subjects having HDP between the study population and the prevalence of HDP in previous reports, and this was large enough to compare the proportion of phenotyped subgroups of HDP with the prevalence rates.
Conclusions
We developed two phenotyping algorithms to identify subjects with HDP and their subtypes from large-scale cohort data collection to perform further biomedical analyses. Our algorithms were developed according to American and Japanese guidelines that differ in the criteria for HDP and its subtypes. Compared to the diagnosis, our phenotyping algorithms showed high PPVs (0.96 and 0.90 for algorithm 1 and 2, respectively), NPVs (0.96 for algorithms 1 and 2), accuracies (0.96 and 0.95 for algorithm 1 and 2, respectively) and specificities (1.00 and 0.99 for algorithm 1 and 2, respectively), and modest sensitivities (0.72 and 0.75 for algorithm 1 and 2, respectively), despite the limitation of data availability for intrapartum and postpartum data in the cohort collection. The application of our phenotyping algorithms to large-scale cohort data will substantially accelerate HDP research by facilitating the high-throughput and precise identification of patients with HDP.
Data availability
The data underlying this article were provided by the Tohoku Medical Megabank project with permission. Data will be shared on request to the Tohoku Medical Megabank project with permission.
References
Wilke, R. A. et al. Use of an electronic medical record for the identification of research subjects with diabetes mellitus. Clin. Med. Res. 5, 1–7. https://doi.org/10.3121/cmr.2007.726 (2007).
McCarty, C. A. et al. The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Med. Genomics 4, 13. https://doi.org/10.1186/1755-8794-4-13 (2011).
Hripcsak, G. & Albers, D. J. Next-generation phenotyping of electronic health records. J. Am. Med. Inform. Assoc. 20, 117–121. https://doi.org/10.1136/amiajnl-2012-001145 (2013).
Gottesman, O. et al. The electronic medical records and genomics (eMERGE) network: Past, present, and future. Genet. Med. 15, 761–771. https://doi.org/10.1038/gim.2013.72 (2013).
Esteban, S. et al. Development and validation of various phenotyping algorithms for Diabetes Mellitus using data from electronic health records. Comput. Methods Programs Biomed. 152, 53–70 (2017).
Ritchie, M. D. et al. Genome- and phenome-wide analyses of cardiac conduction identifies markers of arrhythmia risk. Circulation 127, 1377–1385. https://doi.org/10.1161/circulationaha.112.000604 (2013).
Shameer, K. et al. A genome- and phenome-wide association study to identify genetic variants influencing platelet count and volume and their pleiotropic effects. Hum. Genet. 133, 95–109. https://doi.org/10.1007/s00439-013-1355-7 (2014).
Li, L. et al. Identification of type 2 diabetes subgroups through topological analysis of patient similarity. Sci. Transl. Med. 7, 314. https://doi.org/10.1126/scitranslmed.aaa9364 (2015).
Shivade, C. et al. A review of approaches to identifying patient phenotype cohorts using electronic health records. J. Am. Med. Inform. Assoc. 21, 221–230. https://doi.org/10.1136/amiajnl-2013-001935 (2014).
Wei, W. Q. et al. Impact of data fragmentation across healthcare centers on the accuracy of a high-throughput clinical phenotyping algorithm for specifying subjects with type 2 diabetes mellitus. J. Am. Med. Inform. Assoc. 19, 219–224. https://doi.org/10.1136/amiajnl-2011-000597 (2012).
Kagawa, R. et al. Development of type 2 diabetes mellitus phenotyping framework using expert knowledge and machine learning approach. J. Diabetes Sci. Technol. 11, 791–799. https://doi.org/10.1177/1932296816681584 (2017).
Teixeira, P. L. et al. Evaluating electronic health record data sources and algorithmic approaches to identify hypertensive individuals. J. Am. Med. Inform. Assoc. 24, 162–171. https://doi.org/10.1093/jamia/ocw071 (2017).
Connolly, J. CHOP. ADHD phenotype algorithm. PheKB. https://phekb.org/phenotype/179, 2013 (accessed 28 July 2021).
Robinson, P. N. Deep phenotyping for precision medicine. Hum. Mutat. 33, 777–780. https://doi.org/10.1002/humu.22080 (2012).
Deo, R. C. Machine learning in medicine. Circulation 132, 1920–1930. https://doi.org/10.1161/circulationaha.115.001593 (2015).
Wishart, D. S. Emerging applications of metabolomics in drug discovery and precision medicine. Nat. Rev. Drug Discov. 15, 473–484. https://doi.org/10.1038/nrd.2016.32 (2016).
Collins, F. S. & Varmus, H. A new initiative on precision medicine. N. Engl. J. Med. 372, 793–795. https://doi.org/10.1056/NEJMp1500523 (2015).
Ashley, E. A. The precision medicine initiative: A new national effort. JAMA 313, 2119–2120. https://doi.org/10.1001/jama.2015.3595 (2015).
S. Kuriyama, N. Yaegashi, F. Nagami, T. Arai, Y. Kawaguchi, N. Osumi, M. Sakaida, Y. Suzuki, K. Nakayama, H. Hashizume, G. Tamiya, H. Kawame, K. Suzuki, A. Hozawa, N. Nakaya, M. Kikuya, H. Metoki, I. Tsuji, N. Fuse, H. Kiyomoto, J. Sugawara, A. Tsuboi, S. Egawa, K. Ito, K. Chida, T. Ishii, H. Tomita, Y. Taki, N. Minegishi, N. Ishii, J. Yasuda, K. Igarashi, R. Shimizu, M. Nagasaki, S. Koshiba, K. Kinoshita, S. Ogishima, T. Takai-Igarashi, T. Tominaga, O. Tanabe, N. Ohuchi, T. Shimosegawa, S. Kure, H. Tanaka, S. Ito, J. Hitomi, K. Tanno, M. Nakamura, K. Ogasawara, S. Kobayashi, K. Sakata, M. Satoh, A. Shimizu, M. Sasaki, R. Endo, K. Sobue, The Tohoku Medical Megabank Project Study Group, M. Yamamoto. The Tohoku medical megabank project: design and mission. J. Epidemiol. 26, 493–511. https://doi.org/10.2188/jea.JE20150268 (2016).
Pembrey, M. The avon longitudinal study of parents and children (ALSPAC): A resource for genetic epidemiology. Eur. J. Endocrinol. 151, U125–U129. https://doi.org/10.1530/eje.0.151u125 (2004).
Fraser, A. et al. Cohort profile: the avon longitudinal study of parents and children: ALSPAC mothers cohort. Int. J. Epidemiol. 42, 97–110. https://doi.org/10.1093/ije/dys066 (2013).
World Health Organization. The World Health Report 2005: Make Every Mother and Child Count (World Health Organization, Geneva, Switzerland, 2005).
American College of Obstetricians Gynecologists. Hypertension in pregnancy. Report of the American college of obstetricians and gynecologists’ task force on hypertension in pregnancy. Obstet. Gynecol. 122, 1122–1131. https://doi.org/10.1097/01.aog.0000437382.03963.88 (2013).
Sibai, B., Dekker, G. & Kupferminc, M. Pre-eclampsia. Lancet 365, 785–799. https://doi.org/10.1016/s0140-6736(05)17987-2 (2005).
Uzan, J., Carbonnel, M., Piconne, O., Asmar, R. & Ayoubi, J. M. Pre-eclampsia: Pathophysiology, diagnosis, and management. Vasc. Health Risk Manag. 7, 467–474. https://doi.org/10.2147/vhrm.s20181 (2011).
Al-Jameil, N., Aziz Khan, F., Fareed Khan, M. & Tabassum, H. A brief overview of preeclampsia. J. Clin. Med. Res. 6, 1–7. https://doi.org/10.4021/jocmr1682w (2014).
Chaiworapongsa, T., Chaemsaithong, P., Yeo, L. & Romero, R. Pre-eclampsia part 1: Current understanding of its pathophysiology. Nat. Rev. Nephrol. 10, 466–480. https://doi.org/10.1038/nrneph.2014.102 (2014).
Vanek, M., Sheiner, E., Levy, A. & Mazor, M. Chronic hypertension and the risk for adverse pregnancy outcome after superimposed pre-eclampsia. Int. J. Gynaecol. Obstet. 86, 7–11. https://doi.org/10.1016/j.ijgo.2004.03.006 (2004).
Magee, L. A., Pels, A., Helewa, M., Rey, E. & von Dadelszen, P. Diagnosis, evaluation, and management of the hypertensive disorders of pregnancy. Pregnancy Hypertens. 4, 105–145. https://doi.org/10.1016/j.preghy.2014.01.003 (2014).
Madazli, R. et al. Comparison of clinical and perinatal outcomes in early- and late-onset preeclampsia. Arch. Gynecol. Obstet. 290, 53–57. https://doi.org/10.1007/s00404-014-3176-x (2014).
Steegers, E. A., von Dadelszen, P., Duvekot, J. J. & Pijnenborg, R. Pre-eclampsia. Lancet 376, 631–644. https://doi.org/10.1016/s0140-6736(10)60279-6 (2010).
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209. https://doi.org/10.1038/s41586-018-0579-z (2018).
The All of Us Research Program Investigators. The “all of us” research program. N. Engl. J. Med. 381, 668–676. https://doi.org/10.1056/NEJMsr1809937 (2019).
National high blood pressure education program working group report on high blood pressure in pregnancy, Am. J. Obstet. Gynecol. 163: 1691–1712. https://doi.org/10.1016/0002-9378(90)90653-o (1990).
Brown, M. A., Lindheimer, M. D., de Swiet, M., Van Assche, A. & Moutquin, J. M. The classification and diagnosis of the hypertensive disorders of pregnancy: Statement from the international society for the study of hypertension in pregnancy (ISSHP), Hypertens. Pregnancy 20, IX–XIV. https://doi.org/10.1081/prg-100104165 (2001).
Watanabe, K., Naruse, K., Tanaka, K., Metoki, H. & Suzuki, Y. Outline of definition and classification of “pregnancy induced hypertension (PIH)”. Hypertens. Res. Pregnancy 1, 3–4. https://doi.org/10.14390/jsshp.1.3 (2013).
Watanabe, K. et al. New definition and classification of “hypertensive disorders of pregnancy (HDP)". Hypertens. Res. Pregnancy 5, 39–40. https://doi.org/10.14390/jsshp.HRP2017-018 (2017).
Sugawara, J. et al. Maternal baseline characteristics and perinatal outcomes: the Tohoku Medical Megabank project birth and three-generation cohort study. J. Epidemiol. https://doi.org/10.2188/jea.JE20200338 (2020).
Gyselaers, W. Preeclampsia is a syndrome with a cascade of pathophysiologic events. J. Clin. Med. 9, 2245. https://doi.org/10.3390/jcm9072245 (2020).
Japan Society for the Study of Toxemia of Pregnancy, Historical Perspective of Study of Pregnancy-Induced Hypertension in Japan, Medical View Co., Ltd., Tokyo, (2005).
Japan Society for the Study of Hypertension in Pregnancy, Guideline 2009 for Care and Treatment of Hypertension in Pregnancy (PIH), Medical View Co., Ltd., Tokyo, (2009).
Bateman, B. T. et al. Prevalence, trends, and outcomes of chronic hypertension: A nationwide sample of delivery admissions. Am. J. Obstet. Gynecol. 206(134), e1-8. https://doi.org/10.1016/j.ajog.2011.10.878 (2012).
Munkhaugen, J. et al. Kidney function and future risk for adverse pregnancy outcomes: A population-based study from HUNT II, Norway. Nephrol. Dial. Transplant. 24, 3744–3750. https://doi.org/10.1093/ndt/gfp320 (2009).
Villar, J. et al. Preeclampsia, gestational hypertension and intrauterine growth restriction, related or independent conditions?. Am. J. Obstet. Gynecol. 194, 921–931. https://doi.org/10.1016/j.ajog.2005.10.813 (2006).
Seely, E. W. & Ecker, J. Clinical practice. Chronic hypertension in pregnancy. N. Engl. J. Med. 365, 439–446. https://doi.org/10.1056/NEJMcp0804872 (2011).
Acknowledgements
The authors would like to thank all the participants and staff of the TMM Organization, Tohoku University (a full list of members is available at https://www.megabank.tohoku.ac.jp/english/a230901/) and the Iwate TMM Organization. This work was supported by the Ministry of Education, Culture, Sports, Science and Technology (MEXT) KAKENHI (Grant Number 16K20899), the Japan Agency for Medical Research and Development (AMED) (Grant Numbers JP21tm0124005 and JP21tm0424601).
Author information
Authors and Affiliations
Contributions
S.M.: participated in the methodology, formal Analysis and writing original draft. M.W. : participated in the methodology, investigation and data curation. S.N. : participated in the methodology and review & editing. M.I. and T.O : participated in the investigation and data curation. G.T., H.T. and N.Y. : participated in the methodology, investigation and supervision. S.K. : participated in the investigation and supervision. M.Y. : participated in the supervision and writing—review & editing. J.S. : participated in the methodology, investigation, data curation and supervision. S.O. : participated in the project administration and writing—review & editing. All authors have seen and approved the final version.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mizuno, S., Wagata, M., Nagaie, S. et al. Development of phenotyping algorithms for hypertensive disorders of pregnancy (HDP) and their application in more than 22,000 pregnant women. Sci Rep 14, 6292 (2024). https://doi.org/10.1038/s41598-024-55914-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-55914-9
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.