Introduction

Globally, geographic and socioeconomic differences in childhood mortality are a major public health concern, especially in low- and middle-income countries (LMICs)1,2,3. Geographic inequalities in mortality in LMICs are usually substantial and partly explain socioeconomic differences in childhood mortality4. According to the World Health Organization (WHO), children in developing countries are 10 times more likely to die before their fifth birthday compared with children in developed countries2,5. In sub-Saharan Africa regions, 1 in 8 children dies before age five, which is nearly 20 times the average of 1 in 167 for developed regions5,6.

There has been a substantial decline in childhood mortality on the global stage7. The high rate of childhood mortality in Nigeria (132 deaths per 1000 births) is still one of the main public health concerns8. In Nigeria, every minute one child under-5 years dies i.e. 1440 deaths daily9. Given that the childhood mortality rate is a major indicator of the overall health of a population and level of economic development of any country10,11, Nigeria requires a minimum target of 44% reduction in under-five mortality rates (U5MR) every 5 years to reach the target of < 25 U5MR of the Sustainable Development Goal (SDG) by the year 203012. This involves a concerted and sustainable effort in removing various barriers to healthcare access and utilization13. However, the public health concern is the lack of attention to the large and growing inequities in the provision of and access to services within low-middle-income countries14.

The issue of child mortality does not draw media attention because it is occurring amongst the poorest, most vulnerable, and voiceless communities12. Differential distribution of child mortality also exists across different geographical regions in LMICs. In Nigeria, for example, U5MR varies across the six geopolitical zones (North Central, North East, North West, South East, South South, and South West), ranging from 89 deaths per 1000 births in the South West to 222 deaths per 1000 births in the North East6. The country loses far too many under-five children, making it one of the largest contributors to global under-five mortality rates15.

To date, some cross-sectional studies have analysed geographic and socioeconomic determinants of U5MR in Nigeria6,10,16,17. However, these studies did not take into consideration the time-to-event censured data analysis. To address this gap in the literature, this study uses the Kaplan–Meier survival estimates and the Cox proportional-hazards regression model instead of the traditional regression model to assess socioeconomic and geographical differences in U5MR. The Kaplan–Meier and the Cox proportional-hazards regression models are robust and preferred over the logistic regression, which ignores survival time and censoring information18,19,20,21. Since the major causes of child mortality are largely avoidable and/or preventable22, identifying the geographic and socioeconomic factors influencing U5MR in Nigeria helps policy and decision-makers to understand differences in U5MR between the most and least deprived social and geographic groups who may consequently benefit from focused interventions to prevent child mortality.

Methods

Study setting

The study setting is Nigeria, with an estimated population of around 200 million people and by implication the most populous sub-Saharan African country9. Administratively, the country is divided into six geopolitical zones, comprising 36 states and the Federal Capital Territory, Abuja. The country is culturally, ethnically, and geographically heterogeneous, with more than 250 identifiable ethnic groups16. Of the ethnic groups, the largest and politically dominant ethnic groups are the Hausa (North), the Igbo (South East), and the Yoruba (South West)23. Cultural values and practices unique to ethnic groups influence child health outcomes. For example, Yoruba and Igbo girls tend to marry in the third decade of life, while early marriage, before age 16 years, is common among the Hausa/Fulani/Kanuri ethnic groups23. The proportion of educated people is high among Igbo, Yoruba, and minority ethnic groups compared to the less-educated Hausa/Fulani/Kanuri tribes16. While more than 70% of the northern geopolitical zones live below the poverty line, this figure is less than 35% in the southern geopolitical zones24. Due to the cultural differences among Nigerians, there are significant ethnic variations in health care utilization in the country25. Nevertheless, many primary health facilities have fallen into disrepair and most public services are not trusted due to poor service delivery9.

Study design

This is a retrospective cross-sectional quantitative study using the latest 2018 Nigeria Demographic and Health Survey (2018 NDHS) conducted from August 14, 2018 to December 29, 20188.

Data source

The NDHS is a nationally representative health survey conducted every 5 years using a multistage sampling procedure, standardized tools and well-trained interviewers8. It is the world's largest survey, eliciting information on infant and child mortality rates and demographic factors generated from birth histories obtained from the mothers interviewed2. The DHS also contains information on socioeconomic and geographic characteristics including household ownership of assets, maternal education, and rural/urban residence4,26. As measuring inequalities in childhood mortality requires information on age at death of under-five, and socioeconomic status, the 2018 NDHS Birth Recode data file was used. It contained 127,545 sample sizes with a response rate of 99%8. Analysis in this study was restricted to children aged 0–59 months (n = 33,741).

Variables

The dependent variable is descriptive binary (1 if a live-born child dying before its fifth birthday, 0 otherwise) plus time in months until the event (death) occurs. In line with previous studies19,21,25,27,28, the independent variables comprise child gender, mother’s age (at first birth), household wealth index, maternal and paternal education (as an indicator for socioeconomic status), ethnic origin, religion, place of residence, and geopolitical zone. Table 1 presents the description of variables used in the analysis.

Table 1 Description of variables used in the study.

Statistical analysis

We used a survival analysis method to examine survival time-to-event data. Of the three survival analysis techniques (parametric modelling, semiparametric modelling and nonparametric analysis), we adopted the later (nonparametric) for analysing censored data21 and for it capacity of letting the dataset speak for itself29. We used the two most common nonparametric methods in survival analysis viz., Kaplan–Meier survival estimates and Log-rank test statistic to assess the survival function and pattern of under-five mortality20,30,31,32.

The Kaplan–Meier survival estimates were used to present graphically a survival curve that plots survival probability against time20. The conditional probability of a child’s survival increases as he/she progresses in age33. Kaplan–Meier provides a useful summary of the data that can be used to estimate measures such as median survival time34.

Survival data are modelled in terms of two related functions:-, the survivor function and the hazard function34. Assume \(T\) to be a random variable representing the survival time of subjects in the population, and \(t\) be the realization of \(T\). The cumulative distribution function of \(T\) is expressed as:

$$F\left(t\right)=P\left(T<t\right).$$
(1)

where \(t\) denotes the actual survival time of a child, \(T\) indicates a random variable associated with the survival time, and \(F(t)\) is the probability density function for the survival time.

The distribution function of \(T\) and survival function \(S\left(t\right)\) show the proportion of children that survive longer than \(t\) from the first day of birth and is expressed as:

$$F\left(t\right)=P\left(T>t\right)= \int\limits_{0}^{t}f\left(u\right)du$$
(2)
$$S\left(t\right)=P\left(T \ge t)=1-F(t\right)$$
(3)

The hazard function, \(h\left(t\right)\), represents the probability that an individual dies at a time, conditional on having survived to that time. That is, the function represents the instantaneous death rate for an individual surviving to time \(t\):

$$h\left(t\right)=\underset{\Delta \to 0}{\mathrm{lim}}\frac{P(t<T\le t+\Delta t\backslash T>t}{\Delta t}=\frac{f(t)}{S(t)}.$$
(4)

where \(h\left(t\right)\) is the hazard function, \(T\) is the survival time,\(S\left(t\right)\) is the survival function, and \(\Delta \) is the instantaneous change19,21.

The Cox proportional hazards regression model, the most widely used model in the analysis of survival data, was used to assess the influence of various covariates in the survival times of individuals through the hazard function19. It provides a hazard ratio to compare survival times of two or more population groups. The observation is right-censored, that is the survival status of the individual might not be known at the time of the survey19,25. In the model, the exponentiated linear regression portion of the model explains the effects of explanatory variables on hazard ratio19.

The Cox hazard is modelled as follows:

$$h\left(t\right)={h}_{0 }\left(t\right)exp.\left({b}_{1}{X}_{1}+{b}_{2}{X}_{2}+{b}_{3}{X}_{3}+\cdots {b}_{k}{X}_{k}\right),$$
(5)

where \({X}_{1}\) to \({X}_{k}\) are \(k\) explanatory variables and \({h}_{0}(t)\) is the baseline hazard at time \(t\), representing the hazard for a person with the value 0 for all the explanatory variables. By dividing both sides of Eq. (5) by \({h}_{0}(t)\) and taking logarithms, we obtain:

$$ln\frac{h(t)}{{h}_{0}(t)}={b}_{1}{X}_{1}+{b}_{2}{X}_{2}+{b}_{3}{X}_{3}+\cdots +{b}_{k}{X}_{k},$$
(6)

where \(\frac{h(t)}{{h}_{0}(t)}\) is the hazard ratio. The coefficients \({b}_{1}\;\text{to}\;{b}_{k}\) are estimated by the Cox regression25.

Three models were fitted into the Cox proportional hazards model for analysis to investigate the influence of the predictor variables on the under-five mortality. Model 1 contained demographic factors as the only predictor variables while model 2 added socioeconomic variables. Model 3 included environmental and geopolitical variables along with the previous variables in models 1 and 2 for analysis21.

In the Cox proportional hazards model, the outcome is described in terms of the hazard ratio. The Cox regression model gives the hazard function as a product of a baseline hazard involving \(t\) and an exponential expression involving \(X{^{\prime}}s\) without \(t\). The exponential part of the equation ensures that the fitted model will always give estimated hazards that are non-negative35.

The hazard ratio represents the instantaneous risk over the study time. The measures of association were expressed as hazard ratios (HR) with 95% confidence intervals (CI)20. A hazard ratio of 1 means lack of association, a hazard ratio > 1 suggests an increased risk and a hazard ratio < 1 suggests a smaller risk. In general, the survivor function focuses on not having an event, while the hazard function focuses on the event occurring34. All analyses were weighted to ensure representativeness of the survey sample. A p-value less than 0.05 is considered statistically significant. We conducted all the analysis using Stata version 13.1. More so, all methods were performed in accordance with the relevant guidelines and regulations.

Ethical approval and consent to participate

The Demographic and Health Surveys (DHS) Program has granted approval to use the Nigeria DHS dataset for this study. The DHS adhered to informed consent and we observed anonymity and confidentiality under the data terms of use.

Results

Table 2 presents the descriptive statistics of variables used in the study and distribution of under-five (0–59 months) deaths across the demographic, geographic and socioeconomic characteristics. The results show that the proportion of under-five mortality (U5M) is about 10%, and higher among the male (51.0%) than the female (49.0%) gender. Although the proportion of age of child (in months) is highest (82.2%) among the childhood (i.e. 12–59 months) group, the proportion from total U5M is the highest (39.5%) among the neonatal (< 1 month) age bracket.

Table 2 Descriptive statistics of variables used in the study and distribution of under-five (0–59 months) deaths across the demographic, geographic and socioeconomic characteristics.

Further, the result indicates that the proportion of U5M is prevalent among children born to mothers’ age ≤ 19 at first birth (58.0%), especially among the Hausa/Fulani/Kanuri ethnic origin (48.9%) and the Muslim groups (64.1%). Moreover, the socioeconomic variables show that the U5M was higher among children born to mother (46.5%) and father (37.5%)] with no formal education compared to the children born to mother or father with primary, secondary, or tertiary education. In the same vein, of the household wealth index quintiles, 65.4% of the under-five death occurred among the three lowest wealth quintiles: poorest (22.2%), poorer (22.6%) and middle (20.6%).

About 61.4% of the U5M occurred in rural areas, while the remaining 38.6% occurred in urban areas in Nigeria. Of the six geopolitical zones, 68.3% of U5M occur in the three northern geopolitical zones: North West (36.5%), North East (18.3%), and North Central (13.5%)] compared to their southern counterparts (31.7%). The majority of U5M occurs among those residing in rural areas (73.5%).

Figure 1 presents proportion of under-five death by geopolitical zones in a Nigeria map. Of the six geopolitical zones, North West (30.4%) followed by the North East (21.3%) and North Central (17.3%), had the highest under-five death compared to the South East (11.2%), South West (10.4%) and South South (9.4%).

Figure 1
figure 1

(Source: http://www.gadm.org (data); software: ArcMap version 10.8.1 (Esri), http://www.esri.com).

Proportion of under-five (U5) death by geopolitical zones of Nigeria

Figure 2 shows the Kaplan–Meier estimates of the survival graph for all the under-five children. The horizontal axis indicates the time to event in months, while the vertical axis shows the survival probability or the proportion of under-five children surviving. At time 0, the survival probability is 1.00 (i.e. 100% of the participants are alive). Thus, the result indicates most under-five death occurs at earlier months after birth.

Figure 2
figure 2

Plot of the overall estimate of the Kaplan–Meier survivor function of under-five mortality in Nigeria.

Figure 3 graphically presents Kaplan–Meier survival estimates of under-five mortality by household wealth index quintiles. The graph shows that the top wealth quintiles had higher under-five survival probability than the bottom quintiles. The survival probability is high for the richest but relatively low for the poorest. Even so, the survival probability of the poorer and the poorest were almost the same. The statistically significant value of the log-rank test for equality of survivor functions for household wealth index (χ2 = 217.89 P < 0.001) indicates differences in survival probability among different socioeconomic groups.

Figure 3
figure 3

Plot of the Kaplan–Meier survival estimates of under-five mortality by household wealth index quintiles in Nigeria, 2018.

Figure 4 plots the Kaplan–Meier survival estimates of under-five mortality by geopolitical zones. The graph indicates that the North West followed by the North East and the North Central have the most at risk of survival among the six geopolitical zones while South South, South East and South West have a lower risk of survival. The geographic zones log-rank test for equality of survivor functions is statistically significant (χ2 = 307.45, P < 0.001).

Figure 4
figure 4

Plot of the Kaplan–Meier survival estimates of under-five mortality by geopolitical zones in Nigeria, 2018.

Table 3 presents the results of the Cox proportional hazard regression analysis. It assesses the influence of different factors on the survival time of under-five children. The demographic variables included in Model 1 assessed the independent influence of demographic factors on the risk of under-five deaths. The result shows that U5M is 15.5% higher among male children (HR: 1.155; 95% CI 1.070–1.247, p < 0.001) than female children. Moreover, U5M is 25.4% higher in children born to mothers age ≤ 19 at first birth (HR: 1.254; 95% CI 1.148–1.369, p < 0.000) than children born to mothers age 20–39 at first birth. Further, the results show that mortality by ethnic origin is 47.8% (HR: 0.522; 95% CI 0.437–0.625, p < 0.001); 24.7% (HR: 0.753; 95% CI 0.625–0.907, p < 0.003) and 25.9% (HR: 0.741; 95% CI 0.661–0.830, p < 0.000) lower for children whose mothers are from the Yoruba, Igbo and other ethnic origin respectively, compared to the children whose mothers are of Hausa/Fulani/Kanuri origin. More so the U5M is 26.2% higher among children born to Muslim mothers (HR: 1.26.2; 95% CI 1.110–1.434, p < 0.001) than children born to Christian mothers.

Table 3 The Cox's proportional hazard ratios (HR) with 95% confidence interval (CI) for factors affecting under-five mortality in Nigeria 2018.

Model 2 examines the influence of demographic and socioeconomic variables on the risk of under-five deaths. Results indicate that U5M is 15.2% higher for male children (HR: 1.152; 95% CI 1.064–1.247, p < 0.001) than females. In addition, U5M is 15.7% higher in children born to mothers aged ≤ 19 at first birth (HR: 1.157; 95% CI 1.052–1.271, p < 0.003). Children of Yoruba (HR: 0.667; 95% CI 0.548–0.812, p < 0.000) and other ethnic origins (HR: 0.797; 95% CI 0.708–0.896, p < 0.001) had 33.3% and 20.3%, respectively lower U5M compared with the Hausa/Fulani/Kanuri ethnic origin. About 22.0% U5M occurs among children of Muslim mothers than children of Christian mothers. These findings show a significant association between U5M and paternal education. Results show that children born to fathers with no formal education (HR: 1.328; 95% CI 1.107–1.592, p < 0.002), primary education (HR: 1.252; 95% CI 1.035–1.515, p < 0.021) and secondary education (HR: 1.180; 95% CI 1.000–1.392, p < 0.050) had 32.8%, 25.2% and 18.0%, respectively, have a higher risk of U5M compared to children born to fathers with tertiary education. Of the household wealth index quintile, the richest (HR: 0.639; 95% CI 0.522–0.783, p < 0.001) and richer (HR: 0.804; 95% CI 0.689–0.937, p < 0.005) groups had 36.1% and 19.6%, respectively, lower risk of U5M compared with the poorest quintile group.

Finally, the results of Model 3 for demographic factors were consistent with Models 1 and 2 results. In the same vein, results indicate that children born to fathers with no formal education (HR: 1.360; 95% CI 1.133–1.631, p < 0.001), primary education (HR: 1.279; 95% CI 1.056–1.550, p < 0.012) and secondary education (HR: 1.204; 95% CI 1.020–1.421, p < 0.028) had 36.0%, 27.9% and 20.4%, respectively, higher risk of U5M compared to children born to fathers with tertiary education. Moreover, results suggest that the richest (HR: 0.703; 95% CI 0.567–0.872, p < 0.001) and richer (HR: 0.865; 95% CI 0.735–1.018, p < 0.082) wealth quintile groups had 29.7% and 13.5%, respectively, lower risk of U5M compared with the poorest quintile. Of all the geopolitical zones, children born to mothers living in the North West (HR 1.634; 95% CI 1.238–2.156, p < 0.001) had 63.4% higher risk of U5M than South West zone.

Discussion

Evidently, under-five mortality rate (U5MR) is the highest in sub-Saharan African countries and Nigeria in particular. Notwithstanding that, these deaths are preventable in part by addressing the associated demographic, geographic, and socioeconomic factors18. This study investigates the geographic and socioeconomic survival differences of under-five in Nigeria.

Findings from the Kaplan–Meier survival estimates show the most U5Ms occur within 12 months after birth with the poorest most at risk of U5M while the richest are the least affected across household wealth index quintiles. The findings are in tandem with the UNICEF report that under-five deaths are increasingly concentrated in the neonatal period5,36. Besides, our finding is in line with the assertion by Lartey and colleagues that the probability of a child’s survival increases as the child progresses in age and that the survival probability is lower for children from the poorest families but higher for the children from the richest families33. Therefore, U5M reduction interventions may target children under 12 months of birth given their fragile immune systems. This is to protect them from the perennial environmental threats to child health such as malaria (from mosquito bites), lack of clean water, and poor sanitation2,17.

Further, the finding shows that of the six geopolitical zones in Nigeria, the northern zones especially North West, North East and North Central are at the most risk of U5M. Earlier studies6,23,37 corroborate that the risk of under-five deaths is higher in the North West and North East regions owing to higher proportions of home delivery and complications during childbirth, younger age at birth of first child, and poor utilization of modern health facilities compared to the southern region. The finding establishes that geopolitical setting strongly influence the health and survival chances of children1, thus, implying that U5M risks could be contained depending on the geopolitical environment children in which find themselves16.

As a corollary, our study shows that children born to mothers living in rural areas experience higher U5M compared to their urban counterparts. Often children born to poor mothers in rural areas are delivered at home38. This is not surprising as modern health care is not easily available in rural areas as in the urban area, hence, urban areas are reported to have lower U5Ms than rural areas6,18,27.

The Cox proportional hazard regression models show that paternal education is negatively related to U5M: increased paternal education leads to a reduction in U5M and vice versa. This is in tandem with the assertion that parental education increases a child’s survival probability30. However, contrary to expectations the maternal education was not statistically significant although mothers’ education has a relatively higher impact on child mortality than fathers’ education and many other socioeconomic factors27,33,38,39. This could be due to the finding that mothers in northern Nigeria have a higher proportion of no education or primary education37 due to early marriage (before age 16 years)23 and combined with the fact that culturally, husbands are the overall decision-makers and breadwinner, especially in the regions40.

Moreover, the Cox regression model of the household wealth index shows that the rich had lower risk of U5M compared to the poor as shown in Cox models 2 and 3. A study in India affirms that the risk of child mortality is the highest among the poor41. It presupposes that a targeted intervention to the poor is necessary to close the gap.

This study also shows that children from Hausa/Fulani/Kanuri ethnic origin (northern region) experience more under-five mortality compared to the southern region. Earlier studies in Nigeria, indicates that a child born in the North West has a 2.5 times higher probability of dying before age five than one born in the South East42. This could be due to the preponderance of early marriage commonly practiced in northern Nigeria. Thus, given that education is a fundamental factor to consider in terms of child survival irrespective of region, formal education sensitization particularly in northern Nigeria would help alleviate childhood mortality in the country6.

The results consistently indicate that male children have higher mortality compared with female children. This is supported by literature that males have a higher mortality in infancy in both Nigeria and globally43. It brings to fore the findings of Wegbom and colleagues that U5M in Nigeria is diversely affected by health-related factors and non-health sector factors such as demographic, economic, environmental, social, and security21. Therefore, it requires a health equity-in-all policies approach to tackle under-five mortality44.

The strength of this work is the use of the Kaplan–Meier survival estimates and Cox proportional hazard model for analysing time-to-event data and censored observations. Nevertheless, this study is subject to some limitations. Firstly, although DHS is a renowned reliable data source on child mortality2,36, we acknowledge the inherent data collection challenges that could manifest through misreporting of age of child or age at death in months. Secondly, our analyses focused on U5M over the last 5 years, and the household wealth index constructed for the survey year was used as one of the proxies for socioeconomic status. Since, changes in household wealth often occur in the long-run, the current measures of wealth index can be a valid proxy for past values45. Notwithstanding this, the current measure of both dependent and independent variables would have been ideal for the analysis. Lastly, as our analyses were based on retrospective cross-sectional data, temporality could not be established between explanatory variables and geographic or socioeconomic inequality in U5M; thus, impeding causal inference.

Conclusion

Achieving a reduction in U5M is a public health concern that requires a multi-dimensional approach. There is a need to tailor U5M reduction interventions to the critical survival time of 12 months after birth. A target intervention in geopolitical zones especially the North West, North East and North Central will be of utmost importance to increase access to needed health care services. In addition, increased formal education particularly in northern Nigeria is vital for U5M reduction in the country, given that education is a fundamental factor to consider in terms of child survival irrespective of region.