Introduction

Preterm birth, as defined by the World Health Organization (WHO), is birth before 37 completed weeks of gestation and poses a significant global health challenge1. A substantial proportion, approximately 65–70% of preterm births, occur spontaneously2, and its associated deaths are more common in low and middle-income countries (LMICs) 3,4. An estimated 15 million infants are born prematurely each year, constituting a considerable public health concern5.

In 2014, 14.8 million (10.6%) live births were preterm, with 80% of cases concentrated in Asia and Sub-Saharan Africa. The rate of preterm birth varies between countries, from 8.7% in Europe to 13.4% in North Africa. India, China, Nigeria, Bangladesh, and Indonesia account for 44.6% (6.6 million) of preterm births worldwide6. within Ethiopia alone, 320,000 neonates are born preterm annually, reflecting a prevalence rate of approximately 10.5% 7,8.

The burden of preterm birth is a serious public health problem that contributes to significant neonatal morbidity and mortality9. Preterm birth is associated with short and long-term morbidities for survivors, incurring high costs for the healthcare system and psychological and financial consequences on the family10,11. Preterm birth is one of the most common obstetric complications12. In 2016, prematurity was the leading cause of neonatal death during the first weeks of life for children under five years13. Approximately 35% of neonatal deaths14 and 18% of all deaths of under-five children4 were attributable to preterm birth. In Ethiopia, preterm birth complications account for 10% of all deaths of children under five years7. Maternal age less than 20 years15, short stature (≤ 155 cm)16, smoking17, anemia, malaria infection18, intimate partner violence19, multiple pregnancies, pre-existing chronic conditions20, rural residence, short birth interval, history of abortion21, history of preterm birth8 and household food insecurity22 are the potential risk factors of preterm birth. Stress is also a widespread psychological health problem among pregnant women, and it contributes to preterm birth23,24.

Despite preterm birth being a global public health priority, success in reducing adverse outcomes through evidence-based policies during antenatal care has been limited9,20. Early identification and quantification of individual women at risk of preterm birth could help to improve the quality of care during pregnancy, ensuring that all women have a positive pregnancy experience and outcomes5. Predicting preterm birth would allow earlier intervention to reduce infant morbidity and mortality, benefiting families, society, and healthcare.

The potential preterm birth risk factors have been identified, but the predictive value of their combination remains unclear in rural settings. Prediction models are vital for healthcare providers to estimate probabilities of preterm birth and allow for timely intervention to reduce adverse outcomes25. Ultrasound examinations26,27 and biomarker tests28,29 have been utilized to predict preterm birth, but these methods are less practical in resource-limited settings due to a lack of specialized medical equipment and trained health care providers30,31. Furthermore, existing studies on the prediction of preterm birth often exhibit limitations, such as lack of inclusivity of important prognostic factors32,33,34, focused on hospitalized women 35,36,37,38 or exclusively on multiple pregnancies39,40 and reliance on high-level health care settings33,34. However, none of these studies were accurate enough to be applied in daily clinical practice in the primary care setting due to disparities in the potential predictors, domains, and levels of healthcare settings. We, therefore, set out to develop a simple prediction tool to identify pregnant women at higher risk of preterm birth in early pregnancy in resource-limited primary care settings.

The current study aimed to develop and validate a simplified risk prediction model for preterm birth and evaluate the added value of maternal stress in predicting preterm birth using the routine characteristics of pregnant women in the rural area of Ethiopia.

Methods

Study setting and design

We used data from the Butajira Nutrition, Mental Health, and Pregnancy (BUNMAP) project among pregnant women and their newborns in Southern Ethiopia. The BUNMAP project was established in 2016 under the Butajira Health and Demographic Surveillance Site (BHDSS), consisting of nine rural and one urban dweller association. The BHDSS is one of the oldest Demographic and Health Survey sites in the Southern Ethiopia region41.

The BUNMAP project was an open, prospective cohort of pregnant women and their offspring (up to 59 months) collected between 2017 and 2019. We developed a prediction model for preterm birth using the baseline characteristics of pregnant women and birth history. The theoretical design was the incidence of preterm birth as a function of multiple predictors of ambulatory pregnant women. Health extension workers in the BHDSS identified pregnant women aged 15 to 49 through house-to-house surveillance.

Study population

At baseline, all pregnant women aged 15–49 years with gestational age between 8 and 24 completed weeks, living in the Butajira HDSS, and planning to deliver in the study area were included in the study. Pregnant women who were severely malnourished or had a mid-upper arm circumference (MUAC) of less than 17.5 cm and severely anemic women with hemoglobin (Hgb) levels less than seven g/dL were excluded from the cohort. Our study included pregnant women enrolled in the cohort who met the inclusion criteria and had a well-determined gestational age at birth (Fig. 1).

Figure 1
figure 1

Flow chart of the study design for preterm births in the Butajira Nutrition, Mental Health, and Pregnancy cohort.

Predictors and outcome assessment

After enrolment, all women were requested to travel to the nearest health facility for a comprehensive baseline assessment.

An experienced sonographer used transabdominal, portable diagnostic imaging, and full-color flow mapping ultrasound to confirm gestational age at baseline. The primary outcome, preterm birth, was defined as birth before 37 completed weeks of gestation. The study questionnaire was adopted and developed from the validated tools and Ethiopian demography and health survey to collect data on the following predictors: Intimate partner violence (IPV), maternal stress, maternal age, marital status, educational status, pregnancy type, substance use, Mid-upper arm circumference (MUAC), Comorbidity, history of previous and current Antenatal care (ANC) visits, history of abortion, history of preterm birth, gravidity, contraceptive use, deworming, and comorbidity. Then, potential predictors were categorized by considering the clinical relevance thresholds for adverse birth outcomes.

Mid-upper arm circumference was measured using a standard MUAC tape, and less than 23 cm was considered maternal malnutrition42.

Intimate partner violence was assessed using the Hurt, Insult, Threaten, and Scream (HITS) screening tool. This four-item questionnaire asks respondents how frequently their partner physically hurt, insulted, threatened harm, and screamed at them during pregnancy using a five-point Likert scale ranging from "never" (coded as 1) to "frequently" (coded as 5). The total score ranges from 4 to a maximum of 20. Positive or exposure to IPV was considered if the HITS score was more significant than ten43.

Maternal stress was assessed based on the ten-item classic perceived stress scale (PSS) assessment instrument for one month before data collection day using a five-point Likert scale ranging from "never" (coded as 0) to "frequently" (coded as 4). The individual score on the PSS can range from 0 to a maximum of 40. Exposure to low, moderate, or high maternal stress was considered if the PSS score was 0–13, 14–26, or 27–40, respectively44.

Maternal anemia was assessed by measuring Hgb concentration in red blood cells by taking a finger-prick blood sample using a Hemo-Cue (Hb-201) instrument. Pregnant women with Hgb concentration < 11 g/dl were considered anemic45.

Maternal comorbidity was considered when one or more of the following medical conditions exist: cardiac disease, diabetes, thyroid disease, chronic hypertension, HIV infection, malaria, typhoid, or renal disease in the baseline assessment46.

Substance use was examined in pregnant women who consumed local alcohol or beer or chewed khat at least once a week during pregnancy.

Statistical analysis

The data were collected using the Open Data Kit (ODK) platforms and were exported to the R statistical programming software version 4.2.047. The baseline characteristics of the women were summarised in a table with frequencies and proportions. The distribution was assessed using histograms for maternal age, gestational age, and MUAC at baseline. The median and interquartile range (IQR) for the pregnant women’s age, gestational age, and MUAC were presented. The mean and standard deviation (SD) were also used to present the baseline Hgb level of women. We performed Little’s missing completely at random (MCAR) test and checked the pattern of missing values. The p-value (< 0.001) indicated that the missing was not MCAR. However, the test result is insufficient to indicate whether the missing is not missing at random (NMAR) or missing at random (MAR)48. We then performed multiple imputations by chained equation using the “mice” package with ten imputations and 20 iterations49. Sensitivity analyses were performed to determine whether the MAR assumption was valid. The MAR assumption was valid, and the complete case and imputed data analysis results were comparable (Supplementary Table S1).

Model development

Individual predictors that significantly contribute to the risk of preterm birth were examined using univariable logistic regression analysis. The Likelihood Ratio Test (LRT) was used to determine the p-value for each model. Variables with a p-value for the LRT less than 0.25 were considered eligible to be included in the multivariable logistic regression analysis. The backward stepwise elimination technique with a p-value ≤ 0.15 for the LRT was fitted to build a final multivariable logistic regression model. The predictive accuracy of the final model was checked using discrimination (AUC) and calibration (calibration graphs and Hosmer–Lemeshow goodness of fit test) parameters. The Hosmer–Lemeshow goodness of fit test with a p-value greater than 0.05 indicates good calibration, which means that the probability of preterm birth estimated by the model is similar to the observed probability. An AUC of 0.5 indicates no discrimination ability, while an AUC of 1 indicates perfect discrimination.

Moreover, the added value of maternal stress in predicting preterm birth was assessed using logistic regression analysis. The model’s performance, including maternal stress, was evaluated using discrimination and calibration. The AUC of this model was compared with the reduced model to evaluate the improvement in prediction performance.

Internal validation

The model was internally validated using a bootstrap technique50 to estimate the degree of over-optimism of the final model when applied to a similar population. Internal validation was performed on the regression coefficient with a 95% confidence interval (CI) and the AUC of the model using 2,000 random bootstrap samples. The AUC difference between the bootstrap and the original full sample measured the optimism of the predictive model.

Simplified risk score development

Based on the final model's regression coefficients hierarchy, a simplified risk score was computed to provide an easily applicable prediction model. Each regression coefficient of the predictors in the final model was divided by the smallest regression coefficient, and the result was rounded to the nearest integers. The risk score performance was assessed and compared with the original regression coefficient model using the AUC. The simplified risk score is also arbitrarily classified based on the size of each interval and its potential public health relevance.

The TRIPOD (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis) checklist51 was used to guide the development and validation of the prediction model and reporting (Supplementary Table S2).

Ethics declarations

The study has been reviewed, and ethical clearance was obtained from the Institutional Review Boards of Addis Ababa University, College of Health Sciences (code: 099/17/SPH). Written informed consent and parental assent were obtained from study participants. This manuscript was approved by the Faculty of Medicine and Health Sciences, University of Antwerp, Belgium, and all methods were carried out in accordance with relevant guidelines and regulations.

Result

Characteristics of pregnant women

Of the 618 pregnant women, 137 were excluded from the analysis due to absence or inappropriate determination of gestational age at birth. At enrolment, the median age was 26 with an IQR of 6 years, and 38 (8.1%) were younger than 20 years (Table 1). All women (100%) were married, and 205 (42.7%) did not receive formal education.

Table 1 Baseline sociodemographic, obstetric, and clinical characteristics of study participants (n = 481).

Prediction model

Preterm birth occurred in 94 (19.5%) of the 481 women who gave birth in the BHDSS. The multivariable logistic regression analysis included variables with a p-value of less than 0.25 for the LRT in the univariable logistic regression analysis, such as MUAC, gravidity, history of abortion, anemia, comorbidity, IPV, and history of ANC for the current pregnancy. The final prediction model combined seven predictors, including MUAC, gravidity, history of abortion, comorbidity, IPV, anemia, and history of ANC for the current pregnancy (Table 2).

Table 2 Association between predictors and preterm birth in South Ethiopia (n = 481).

Model performance and validation

The AUC of the final model was 0.687 (95%CI: 0.620, 0.753). The calibration test of the model had a p-value of 0.7134, indicating good agreement between the predicted and observed probability of preterm birth (Fig. 2).

Figure 2
figure 2

Model performance of the prediction model for preterm birth. (A) The area under the receiver operating characteristics curve to evaluate the discrimination. The AUC suggests that the model has a 68.7% (95%CI:62.0%,75.3%) chance to correctly distinguish a high risk for preterm birth from normal pregnancy based on the characteristics of pregnant women in resource limited setting. (B) Calibration plot to evaluate the calibration of the prediction model. The visual calibration plot between the observed and prediction risk in different percentiles of the predicted values. The p-value of the calibration plot is 0.713.

The bootstrapping technique showed that we expect low optimism when applied to newly pregnant women in a similar population. The adjusted AUC of the model was 0.689 (95% CI: 0.622, 0.755) with an AUC overoptimism coefficient of 0.002 (p-value = 0.796) (Fig. 3).

Figure 3
figure 3

Area under the receiver operating characteristic curve of an internal validation of the prediction model.

Clinical importance of maternal stress

In the univariable and multivariable analysis, the maternal stress had a p-value for the LRT of 0.031 and 0.008, respectively. The calibration plot indicates that the model, including maternal stress, had good calibration (p-value = 0.825) (Fig. 4). The AUC increased from 0.687 (95%CI: 0.620, 0.753) to 0.693 (95% CI: 0.620–0.766) with a p-value of 0.784. Therefore, the addition of maternal stress to the prediction model has not made a significant difference in the discrimination performance of the model.

Figure 4
figure 4

Model performance after adding maternal stress in the prediction model. (A) Area under the receiver operating characteristic curve of the final model with and without maternal stress. (B) Calibration plot to evaluate the calibration of the addition of maternal stress to the final model. The added value maternal stress was assessed through a calibration plot, revealing a good calibration (a p-value of 0.825).

Simplified risk score per individual

A MUAC less than 23 cm had the smallest regression coefficient and was weighted as 1. The total risk score ranged from 0 to 13. The simplified risk score had an AUC of 0.678 (95% CI: 0.612–0.743) and a p-value of 0.08 compared to the original regression coefficient model. Total risk score formula = (1*MUAC < 23 cm + 1*primigravida + 2*being anemic + 2*positive IPV + 3*had a history of abortion + 2* had no ANC history for the current pregnancy + 2*presence of comorbidity) (Table 3).

Table 3 The simplified risk score and rounded weight to predict preterm birth.

Table 4 indicates that the risk of preterm birth increased as the simplified risk score increased. The proportion of preterm births was 244 (13.9%) with a simplified risk score of ≤ 1 and 18 (50%) with a simplified risk score of ≥ 5.

Table 4 Risk of preterm birth per individual risk score to predict high risk for preterm birth.

Table 5 presents results from the likelihood ratio of each risk score interval. The likelihood ratio increased as the simplified risk score for the prediction model increased. The simplified risk score between 5 and 13 increased the probability of having a preterm birth in pregnant women by 3.8 times compared to those who did not have a preterm birth.

Table 5 Interval likelihood ratio of risk score intervals for the prediction model.

Based on Youden’s index52, the optimal cut-off point was a probability of ≥ 0.221 with a sensitivity of 41% (95% CI: 30, 53), a specificity of 82% (95%CI: 77, 86), and a positive predictive value (PPV) of 36% (95%CI: 26, 48), and a negative predictive value (NPV) of 84% (95%CI: 80, 88). The positive likelihood ratio (LR+) was 2.24 (1.56, 3.21), and the negative likelihood ratio (LR−) was 0.72 (95% CI: 0.60, 0.88). Similarly, the optimal cut-off for the simplified risk score was ≥ 2 using Youden's index. One hundred twenty-seven (34.2%) pregnant women were classified as high risk for preterm birth, while 244 (65.8%) were classified as low risk, with a sensitivity of 55% and specificity of 71% (Table 6).

Table 6 Performance of simplified risk sore at different cut-off points to predict preterm birth.

Discussion

In the present study, one-fifth of the babies were born prematurely. We developed a prediction model for preterm birth based on the baseline characteristics of pregnant women in rural settings. The model combined MUAC, gravidity, comorbidity, ANC follow-up, history of abortion, IPV, and anemia to predict preterm birth. We developed a simplified risk score to improve the practical applicability of the prediction model in clinical practice in primary healthcare settings without requiring advanced diagnostic tests and imaging.

Previous studies developed a model for predicting preterm birth during pregnancy using ultrasound examination26,27 and biomarker tests28,29. However, these procedures are limited by their expense and complexity, requiring trained healthcare professionals and specialized equipment. In LMICs, such as Ethiopia, most primary care settings lack access to these advanced laboratory and imaging procedures. Consequently, estimating the risk of preterm birth in resource-limited settings using these methods is challenging. Hence, we developed a model using easily obtained and routinely collected maternal characteristics applicable in primary care settings in Ethiopia.

Prediction models were developed for women with signs of preterm birth in tertiary care centers35,36,38. The potential predictors of preterm birth in hospitalized women with preterm birth symptoms may differ from those in non-hospitalized women. The healthcare system in tertiary healthcare is different, and the diagnostic procedures are more advanced and expensive. However, the current study estimated the risk of preterm birth using available characteristics of women in the primary care setting.

Prediction models for preterm birth were initially developed in comprehensive specialized hospitals in northern Ethiopia, utilizing a retrospective study design33,34. These studies, however, overlooked fundamental predictors recommended by WHO, such as IPV, Substance use, MUAC, history of contraceptive use, deworming, and maternal stress. The exclusive use of a specialized hospital may have limited the generalizability of findings, potentially reflecting specific patient characteristics or a more selective group seeking specialized care. In contrast, our prediction model was deployed in southern Ethiopia using a prospective study design considering the basic WHO recommendations for potential preterm birth predictors. This was crucial in minimizing recall bias and strengthening model accuracy. Additionally, our research was conducted in a rural community setting, where diagnostic methods and healthcare professionals are more limited compared to specialized hospitals. Consequently, the previously developed model may not be applicable for predicting preterm birth in primary care settings.

Schaaf et al.53 predicted the risk of preterm birth with poor calibration and an AUC of 0.63 using a combined 13 potential predictors of preterm birth, including fetal sex and vaginal bleeding before 20 weeks of gestation. However, confirmation of fetal sex requires ultrasound examination, and vaginal bleeding before 20 weeks of pregnancy necessitates advanced diagnostic procedures. Therefore, the study is less applicable in clinical settings with scarce resources. Huang et al.32 also developed a prediction model for preterm birth by combining stress and metabolic predictors. Maternal stress biomarkers (cortisol) and metabolites were measured in the serum samples to predict preterm birth. The prediction model yielded an optimum AUC value of 0.895. The current study assessed maternal stress using a perceived stress scale. The addition of maternal stress in the prediction model increases the AUC from 0.687 to 0.693, but the difference was insignificant compared to the original model.

Schaaf et al.53 calculated the predictive model at two arbitrary cut-off points of the predictive probability. The incidence of preterm birth has occurred in 3.8% of pregnancies. At a predictive probability of 0.1, the sensitivity was 4.2%, and the specificity was 99.3%. At this cut-off point, the PPV was 19.4%, and the NPV was 96.3%. While in our study, we developed different cut-off points using the Youden index. The optimal predicted probability cut-off point was 0.221, with a sensitivity of 41%, a specificity of 82%, a PPV of 36%, and an NPV of 84%. The LR + value was 2.24, and the LR− value was 0.72. The simplified risk score is highly applicable and easier to use in daily clinical practice.

Our study suggests that a prediction model using the characteristics of pregnant women might be useful in identifying pregnant women at high risk of preterm birth in resource-limited settings. Women categorized as high-risk could be referred for further assessment and therapeutic intervention to prevent preterm birth. A simplified risk score would help produce relevant information for the community, policymakers, and clinical interventions to reduce neonatal morbidity and mortality rates.

The current study included highly applicable and routinely collected characteristics of pregnant women in LMICs. This can be used in daily clinical practice in primary care settings to identify high-risk pregnant women. We further used an internal validation based on a bootstrapping technique to provide unbiased estimates of the high risk for preterm birth. We developed different optimal cut-off points based on the coefficient and risk score, which helps to choose various cut-off points depending on the program goal and the availability of resources.

Our study is subject to limitations. The missingness data, mainly the history of abortion, preterm birth, and ANC for the previous pregnancy, had high missing values. However, we tried to minimize the risk of bias by using multiple imputations to develop the estimated regression coefficient of the prediction model.

When pregnant women experience risk or complication during their pregnancy, healthcare providers may intervene to induce preterm birth, affecting the prediction model and implying a computing risk. Future researchers should consider such issues using advanced analysis techniques. Potential predictors of preterm birth and complications might be different in the first and second trimesters. Developing a prediction model in each trimester thus improves the model's practical applicability and predictive capacity. Future research should consider conducting an extensive and multicentre study to improve the prediction model's generalization and external validation.

Conclusions

In this study, we developed a prediction model and a score for preterm birth risk stratification in rural Ethiopia. The model can identify pregnant women at high risk of preterm birth. Prediction of high-risk women based on individual characteristics could help to strengthen the clinical decision-making of the primary health care providers to reduce obstetric complications. It would also help produce relevant information for the community, policymakers, and clinical interventions on preventing and treating preterm birth and reducing neonatal morbidity and mortality rates.