Introduction

Optimal breastfeeding has been linked to improved cognitive development and higher intelligence test scores in several studies. For example, in a systematic review and meta-analysis of studies mainly from high-income countries that controlled for home stimulation and measures of socioeconomic status, breastfed children scored 3.4 points higher on intelligence tests than non-breastfed children1. Not only has breastfeeding been linked to improved cognitive functions, but there is growing evidence of a positive association between breastfeeding and educational outcomes2,3,4,5. In high-income countries, the evidence supporting these positive associations has been relatively consistent over the last few decades. Heikkila et al.2, for example, found higher educational achievement among five-year-olds in the United Kingdom who were exclusively breastfed for a longer duration. In Australia, ten-year-olds who were predominantly breastfed for 6 months had higher academic scores than those who were breastfed for less than 6 months4. Among adolescents in the United States, being breastfed was associated with a higher grade point average compared to those not breastfed5.

While the exact mechanism is unclear, it has been proposed that certain components of human breastmilk, including human milk oligosaccharides, arachidonic acid, and docosahexaenoic acid, modulate the neurodevelopmental processes of the newborn brain in the first few months after birth, a time of rapid growth and maturation of the brain6,7,8,9. From this perspective, if breastfeeding truly has a biological (direct nutritional) effect on intelligence and educational outcomes, we should observe it across all populations and settings, not just in high-income countries. For instance, breastfed children have a reduced risk of childhood gastrointestinal and respiratory tract infections across populations regardless of country income level10.

Nevertheless, in low and middle-income countries (LMICs), the evidence on the effects of breastfeeding on intelligence and educational outcomes is unclear and mostly inconsistent with those from high-income countries11,12. In Brazil, for example, Victora et al.13, found that children breastfed for 12 months or more had 0.91 more years of education than those breastfed for less than 1 month. Among children aged 6–18 in India, breastfeeding for more than 12 months was associated with 0.12–0.19 more years of education than breastfeeding for less than 12 months14. However, no association was observed between breastfeeding and educational attainment when Horta et al.15, studied birth cohorts in the Philippines, Guatemala, India, and Brazil. Similarly, a recent study in Turkey found no evidence of an association between breastfeeding and academic performance16.

Some have questioned the consistent positive effects of breastfeeding on educational outcomes in studies conducted in high-income countries because of the conflicting results from low-income settings. It has been suggested that the positive effects observed in high-income countries may be attributable to residual confounding resulting from insufficient statistical control for socioeconomic status and not a direct effect of breastmilk12,13,17,18. This hypothesis is supported by the fact that in high-income settings, women with higher socioeconomic status breastfeed longer than those with lower socioeconomic status, and maternal education strongly correlates with offspring cognitive abilities and educational outcomes13,19,20,21.

Studies from sub-Saharan Africa are scarce in this decades-old debate. In a recent systematic review, we found that only two studies have investigated the effects of breastfeeding on educational attainment in sub-Saharan Africa12. Both studies were conducted in South Africa. One found weak evidence for an association between exclusive breastfeeding and grade repetition22, while the other study found no evidence of an association between breastfeeding and school attainment or completion15. However, generalising the South African findings to other African countries would be naive, given the varying associations reported in earlier studies from different settings. There are also important differences in childhood adversities (such as HIV exposure) and non-socioeconomic (sociocultural, political, and educational system-related) factors that affect educational attainment across sub-Saharan African countries23. In addition, previous studies, including those carried out in high-income settings, examined educational attainment at one point in time. It is possible that these studies might have missed a child's educational attainment at other time points because academic growth is not always linear. Using longitudinal birth cohort data from the Karonga Health and Demographic Surveillance System (HDSS) in Malawi, we examined primary school progression over eight years and assessed the effects of exclusive breastfeeding duration on educational attainment and age-for-grade attainment trajectories from age six when the children started primary school through to age 13.

Methods

Participants and data

We used prospective, longitudinal data from the Karonga HDSS in northern Malawi. The HDSS is situated in the southern part of the Karonga District and includes 42,000 individuals living in 7000 households24,25. Most of the population lives in rural areas, and a majority of them engage in subsistence farming, fishing, and trading. The HDSS undertakes continuous demographic surveillance in the area to collect socioeconomic, demographic, and household-level data and information on vital events and migration of individuals of all ages26. It started with a baseline census between 2002 and 2004. Births to women in the baseline survey were reported, and field staff enrolled the children as part of the demographic surveillance follow-up visits27. Data were collected on maternal and child characteristics, including child anthropometry, birth characteristics, breastfeeding and other feeding practices, and household socioeconomic status when the children were 0–3 months old. The birth cohort was revisited at age 12 months, and the feeding data and other infant and family characteristics were updated. Annual surveys and re-census were conducted in the HDSS to periodically update existing data and collect new information, including children's schooling, anthropometry, and vital events. For the present analysis, we restricted the study population to birth cohort members between the ages of six and 13 (primary school age in Malawi) with at least one data point on current primary school grade. In Malawi, school enrolment at public primary schools is free, and children do not have to pass an entrance examination to enrol in school24.

Study measures

Outcome variable

The main outcome of this study was age-for-grade, defined as the age at which a child would be expected to enter a given grade if they had begun primary school on time and advanced without repeating or skipping a grade28. In this cohort, 96.3% (n = 953) of the children started primary school at age six or earlier, 3.4% (n = 34) started at age 7, and 0.3% (n = 3) started at age 8, so age-for-grade was primarily a measure of progress. Information on the current grade of each child in the cohort was collected at multiple time points starting in 2007, when they were about four years old and updated annually until 2015, when they were about thirteen years old. Public primary schools in Malawi consist of eight grades, and children are expected to enter grade 1 at age six and continue for eight years until they complete grade eight at age thirteen. For example, children are expected to enter grades 5, 6, and 7 at ages 10, 11, and 12, respectively. We calculated age-for-grade based on whether a child was at, over, or under the official expected age for entry into a grade for all the time points for which data were available for a given child. A child was considered under-age for grade if they were one or more years younger than the official age for the grade, on-time if they were of the official age or one year older than the official age for the grade, and over-age if they were two or more years older than the official age for the grade24,29. Over-age for grade is a sign that a child is struggling academically, leading them to repeat grades. Six children did not have data on age-for-grade and were excluded from further analysis (Fig. 1). Three of the six children with no current grade record were never enrolled, and three left school (one each at the ages of 5, 6, and 11), and their records were not taken.

Figure 1
figure 1

Flow of study subjects (The cohort was defined as in Sunny et al.27 which gives more detail on the exclusions).

Exposure variable

Exclusive breastfeeding in the first 6 months after birth is the main exposure. Mothers reported information on breastfeeding and other infant feeding practices at baseline when the children were 0–3 months old and at follow-up when the average age of children was 12 months (SD = 0.03 months). Mothers were asked if the child was breastfeeding and how old the child was when other kinds of milk, formula, water, solid foods, and other foods were introduced. We defined exclusive breastfeeding as feeding a child with only breastmilk in the first 6 months after birth. The duration of exclusive breastfeeding was categorised into 0–2 months, 3–5 months, and 6 months. We excluded from the analysis the few children (n = 32, 3.1%) who were exclusively breastfed for longer than the recommended 6 months.

Potential confounders and moderators

Potential confounding variables were identified based on prior research5,12,30 and included child sex, household wealth at birth, age of mother at birth, birth order of child, maternal HIV status, maternal education, paternal education, maternal occupation, paternal occupation, and distance to a tarmac road. The estimation of household wealth at birth was based on a principal component analysis of household items, dwelling characteristics, ownership of consumer durables, and utility access, as described in an earlier publication27. In addition, we decided a priori to assess whether the association between breastfeeding and educational attainment varied by sex and age at school assessment (6–9 and 10–13 years).

Data analysis

We estimated the number of children exclusively breastfed for 0–2 months, 3–5 months, and 6 months to determine the prevalence of exclusive breastfeeding. The characteristics of the children were summarised and stratified by the exclusive breastfeeding groups. Age-for-grade was recoded as a binary response variable with 1 = over-age for grade and 0 = underage/on-time for grade. Three sets of analyses were performed to determine the association between exclusive breastfeeding and age-for-grade.

First, binary logistic regression was used to model the association between exclusive breastfeeding and being over-age for grade using one observation per child and restricting the analysis to schooling assessments at the point closest to age 11.5 years (mid-point between 10 and 13), as most children had information on current grade at these ages. However, this approach does not make use of the longitudinal nature of the data and repeated schooling assessment.

In the second approach, we used generalised estimation equations (GEE)31 with an exchangeable correlation structure to estimate the effect of exclusive breastfeeding duration on age-for-grade measured at multiple time points between ages 6 and 13, taking into account the lack of independence between repeated school grade measurements for each child. In addition, the GEE approach can handle unbalanced data (uneven number of assessments per individual) and generate unbiased estimates even if the within-cluster correlation structure is misspecified32,33. We fitted separate GEE models for children aged 6–9 and 10–13 and for girls and boys because we hypothesised that the effect of exclusive breastfeeding on age-for-grade is likely to differ by age and sex5,12.

However, since the GEE approach estimates the average population effects of exposure on the outcome, this can mask important differences between hidden subgroups of the study population since academic growth can be nonlinear. For example, some children may perform poorly in early grades but catch up in the later grades, while others who did well in the early grades may do poorly in later grades. When such heterogeneity is ignored, results may be biased34.

Hence, the third approach used Latent Class Growth Analysis (LCGA)35 to identify unobserved homogenous subgroups (latent classes) within the study population that shared common trajectories of grade progression from age 6–13 using the “traj” macro in Stata36. LCGA is a person-centred technique that uses repeated measures of a single outcome variable across age or time to delineate a latent class model in which the classes represent different trajectories for the outcome variable37,38. The underlying assumption of LCGA is that individuals designated to the same latent class follow the same pattern of change over time39,40.

Unconditional models with one through to five latent classes were tested through an iterative process using a combination of linear, quadratic, and cubic polynomial functions (trajectory shape) in a logit model for the longitudinal binary outcome, age-for-grade. Models with more than five classes produced anomalous growth curves, and the model fit statistics started to deteriorate and, in most cases, failed to converge. Age at school assessment was the indicator of time. We included only children with at least three school assessments in the LCGA. The maximum probability assignment rule41 was used to assign participants to the latent class they had the highest posterior membership probability.

To determine the optimal number of latent classes, we compared the Bayesian information criterion [BIC] and Akaike Information Criterion [AIC]) values of the fitted latent class models. Lower BIC and AIC values indicate a balance between good model fit and parsimony40,42,43,44,45. However, because they are highly sensitive to sample size, researchers are advised to use BIC and AIC values in conjunction with other measures of model adequacy42,43,44. Thus, in addition to the information criteria, we utilised a number of model adequacy diagnostics, including the average posterior probability (AvePP) of class assignment, entropy (a measure of the accuracy of class classification), class size, and the practical interpretability (meaningfulness) of the latent class trajectories. Values of entropy near 1 (minimum 0.5), AvePP ≥ 0.70, and at least 5% of participants in each class have been proposed as indicators of high assignment accuracy though there are no conventional cut-offs40,42,43,44,45. We plotted individual-level grade progressions in each class of the latent class models to examine the meaningfulness and plausibility of the trajectories and how well they captured individual-level age-for-grade patterns. Because there are no definitive decision criteria, it is suggested that the selection of the optimal model balances between meaningful trajectories and model adequacy35,40,42,43. We used the approach described to model separate latent class trajectories for girls and boys because of differences in their academic progression46.

The participants probabilities of belonging to each trajectory class were used as a categorical variable in a multivariable multinomial logistic regression model to determine the association between exclusive breastfeeding duration and the latent age-for-grade trajectories. We used the trajectory with better grade progression as the reference group. In all the analyses, we adjusted for the baseline confounders outlined in the directed acyclic graph in Fig. 2, with these covariates included as categorical variables using the groupings presented in Table 1. All analyses were performed using Stata version 17.

Figure 2
figure 2

Directed acyclic graph showing the relationship between the exposure, outcome, and confounding variables.

Table 1 Distribution of exclusive breastfeeding duration according to sociodemographic characteristics of the study population.

Missing data

The amount of data missing at baseline was low, ranging from 0.3% (maternal education and maternal occupation) to 5.4% (paternal occupation) (see online Supplementary Table 1). Missing data mechanisms of baseline confounding variables were investigated and assumed to be missing at random. We performed multiple imputations by chained equations (MICE) in Stata to minimise bias and loss of precision and power from discarding observations with missing values47. Thirty imputed datasets were generated under fifteen iterations for each imputation model. All variables in the substantive analyses were included in the imputation models to predict the distribution of the missing values. However, only six of the ten baseline confounding variables with missing data were imputed (maternal education, maternal occupation, paternal education, paternal occupation, household wealth, and birth order). We did not impute missing values in the outcome variable. The substantive analyses described above were conducted on the imputed dataset, and the estimates combined and standard errors estimated using Rubin's rules48. For the multinomial logistic regression analysis, separate MICE models were fitted for the entire sample as well as for boys and girls.

In sensitivity analysis, we repeated the binary logistic regression, multinomial logistic regression, and GEE analyses using only individuals with complete data to assess the reliability of our substantive analysis that used imputed data.

Ethical consideration

The National Health Sciences Research Committee in Malawi (protocol number #1072) and the Research Ethics Committee of the London School of Hygiene and Tropical Medicine (protocol number #6303) granted ethical approval for the primary study. Permission and informed consent were sought from traditional leaders and village headmen, and verbal informed consent for participation was obtained from each household head and participating household members (parents provided informed consent for their children's participation) after the purpose of the surveillance were explained. Those who refused to give consent were not interviewed25,27. Ethics approval for the present analysis was granted by the research ethics committee of the London School of Hygiene and Tropical Medicine (Ethics Ref: 26468). In addition, we confirm that the analysis was performed in accordance with relevant guidelines/regulations, and in accordance with the Declaration of Helsinki.

Results

Characteristics of the study sample

There were 1761 births to women in the HDSS between 2002 and 2004, and 1044 were successfully followed up for at least 4 years (Fig. 1). Loss to follow-up was largely due to migration, deaths, and missing early data. 1021 children had information about breastfeeding practices and were prospectively followed for six to thirteen years. Descriptive statistics in Table 1 show that slightly more than half (51.9%) of the children were male, and half (50.2%) lived within 1 km of a tarmac road. There were 227 (23.1%) first-order births, and 58.8% of the mothers were between the ages of 20 and 30 when the children in the cohort were born. While 40.7% of the children’s fathers had at least some secondary education, only 20.2% of their mothers had the same level of education. Both parents were largely subsistence farmers. The majority (87.5%) of the mothers were HIV-negative, and only 2.7% were known to be HIV-positive. Almost all the children were breastfed (99.9%).

Duration of exclusive breastfeeding

Most (45.8%) children were exclusively breastfed for 3–5 months (Table 1). Only 35.9% were exclusively breastfed for the recommended 6 months. There was no sex difference in the prevalence of exclusive breastfeeding and no significant association between maternal education and duration of exclusive breastfeeding. However, the prevalence of exclusive breastfeeding for 6 months was higher with increasing household wealth and paternal education. Mothers who did not work exclusively breastfed for a longer duration than those who were subsistence farmers. Similarly, children whose fathers were unemployed were more likely to be exclusively breastfed for 6 months than those whose fathers were subsistence farmers. The order of births was associated with exclusive breastfeeding, with a greater percentage of children of higher birth order more likely to be exclusively breastfed for 6 months than those of lower birth order.

Age-for-grade

Of the eight possible time points at which age-for-grade was assessed, 1.9% had all eight recorded, 24.1% had 7, 43.3% had 6, and 13.8% had 5, and the remainder had between one and four school assessments. At age six, data on age-for-grade was collected for 895 children. The number of children increased to 951 at age seven but subsequently decreased to 902 at age nine, 821 at age 11, 485 at age 12, and 63 at age 13 (Fig. 3A). The earliest children began to fall behind in school was age eight, and the proportion of children under-age or on time for grades steadily decreased. By age 13, no child was underage-for-grade, and 41.9% of the children were 1 year older than their grade level. A similar upward trend was observed until age twelve for children who were 2–3 years over-age for their grades. Figure 3B shows that more boys than girls were assessed at each age of schooling. However, girls were more likely than boys to be underage and on-time for a grade.

Figure 3
figure 3

Distribution of age-for-grade by (A) age and (B) age and sex [F = female M = male]. Note: underage = one or more years younger than the official grade age; on time = at the official grade age; +1yr overage = one year older than the official grade age; +2yrs overage = two years older than the official grade age; and 3+yrs overage = three or more years older than the official grade age. Numbers in parentheses represent the number (n) of children in each group.

Association between exclusive breastfeeding duration and age-for-grade at age 11.5

Using binary logistic regression, we found no evidence of an association between exclusive breastfeeding duration and age-for-grade between ages 10–13 in the overall sample and sex-stratified analyses (Table 2). Results based on complete case analysis excluding participants with missing data were consistent with this finding (Supplementary Table 5).

Table 2 Binary logistic regression analysis of the association between exclusive breastfeeding duration and age-for-grade attainment at age 11.5 (mid-point of age 10–13) in Malawi (n = 887).

Association between exclusive breastfeeding duration and repeated measures of age-for-grade attainment from age 6 through to 13

In the generalised estimations equation analysis (Table 3), adjusted results provided some evidence that children exclusively breastfed for 6 months were less likely to be over-age for grade than those exclusively breastfed for less than 3 months (aOR = 0.82; 95%CI = 0.64–1.06). When we stratified the analysis by age, results from the adjusted analysis showed evidence of a dose–response effect of exclusive breastfeeding duration on age-for-grade between ages 6 and 9. Children exclusively breastfed for 6 months had 36% lower odds of being over-age for grades between ages six and nine than those exclusively breastfed for less than 3 months (aOR = 0.64; 95%CI = 0.43–0.94). There was no evidence of an association between exclusive breastfeeding duration and age-for-grade between ages 10 and 13. In the sex-stratified analysis, the adjusted results showed no evidence of an association between exclusive breastfeeding and age-for-grade. Complete case analysis excluding participants with missing data produced consistent results (Supplementary Table 6).

Table 3 Generalised estimating equations analysis of the effects of exclusive breastfeeding duration on age-for-grade attainment among children aged 6–13 in Malawi (N = 982).

Latent class growth trajectories and patterns of grade progression

The few children with less than three school grade records were excluded from LCGA (n = 63). Following the evaluation of the goodness of fit statistics and model adequacy diagnostics for the fitted latent classes (Supplementary Table 2), the three-class and four-class models were considered suitable candidates to appropriately capture the grade progression of the study sample. These models both had more than 20% of participants in each class and entropy values within the acceptable range (three-class model = 0.82; four-class model = 0.64). The four-class model had the lowest BIC (− 1544.69) and AIC (− 1515.54) values, whereas the classes in the three-class model had the highest AvePP of assignment values and odds of correct classification. However, the trajectories of the four-class model best represented individual-level grade progression, demonstrated acceptable class separation, and were practically meaningful (Supplementary Fig. 1). Based on these assessments, the four-class model was selected as the optimal model.

Each trajectory was named based on the shape of the curve and the age at which the children in that trajectory started to fall behind in school (Fig. 4). Trajectory 1 represented children over-age for grade in the early grades of primary school, starting from around age 8 (falling behind from early grades). Children in trajectory 2 were over-age for grades in middle primary school when they were about 9–10 years (falling behind from middle grades). Those in trajectory 3 became over-age for grades at the terminal stages of primary school when they were about 11–12 (falling behind from terminal grades). Trajectory 4 represents children on time for all the grades from age six to thirteen (consistently on time for grades).

Figure 4
figure 4

Age-for-grade attainment trajectories based on latent class growth modelling of schooling data from age 6 to 13 (Early grades = grades 1–3, middle grades = grades 4–6 and terminal grades = grades 7–8).

We evaluated the sex-stratified latent class models using the approach described (Supplementary Tables 3, 4, Supplementary Figs. 2 and 3) and found that a four-class model was optimal for both girls and boys (Fig. 5A and B).

Figure 5
figure 5

Age-for-grade attainment trajectories of (A) Girls and (B) Boys based on latent class growth modelling of schooling data from age 6 to 13 (Early grades = grades 1–3, middle grades = grades 4–6 and terminal grades = grades 7–8).

Table 4 shows the distribution of the overall age-for-grade trajectories across the participants’ sociodemographic characteristics. Overall, 24.1% of the children progressed without falling behind; 26.9% started falling behind in the terminal grades, and 28.7% and 20.3% started falling behind in the middle and early grades, respectively. A greater percentage of the children who fell behind in early grades were boys. The percentage of children consistently on time for grades increased with increasing paternal education and household wealth. Being consistently on time for grade was more common among later-order births and children living within 1 km of a tarmac road.

Table 4 Distribution of the overall age-for-grade trajectories of children with three or more repeated school assessments according to sociodemographic characteristics of the study sample (n = 952).

Association between exclusive breastfeeding duration and age-for-grade trajectories identified through latent class growth modelling

Compared to being consistently on time for grades, children exclusively breastfed for 6 months were less likely to fall behind in early grades than those exclusively breastfed for less than 3 months in the unadjusted (OR = 0.48; 95%CI = 0.30–0.77) and adjusted (aOR = 0.66; 95%CI = 0.41–1.08) analyses (Table 5). There was no association between exclusive breastfeeding and later age-for-grade trajectories. In the sex-stratified analysis, there was no association between exclusive breastfeeding and age-for-grade trajectories in both the girls and boys subgroups. However, the pattern of the odds ratios was more apparent and consistent for boys than it was for girls. Complete case analysis excluding participants with missing data produced similar results (Supplementary Table 7).

Table 5 Multinomial logistic regression analysis of the association between exclusive breastfeeding duration and age-for-grade trajectories among children aged six to thirteen in Malawi.

Discussion

We used prospective, longitudinal data to examine whether exclusive breastfeeding duration in the first 6 months after birth is associated with educational attainment and age-for-grade trajectories between ages 6 and 13. We saw some evidence of a reduced likelihood of over-age for grade in children exclusively breastfed for 6 months compared to those exclusively breastfed for less than 3 months. Latent class growth analysis indicated that the effect was on those falling behind in the early grades of primary school rather than those who fell behind later. While the width of the confidence intervals and modest effects should be interpreted with caution, they are similar to the association found in the saving brains cohort in KwaZulu-Natal, South Africa, where researchers investigated the effects of exclusive breastfeeding on grade repetition in children aged 7–11 years22. Additionally, in a prespecified age subgroup analysis, we found evidence of a dose–response relationship between exclusive breastfeeding and educational attainment. Children exclusively breastfed for a longer duration in infancy were less likely to be over-age for grade at 6–9 years.

Our finding on the effects of exclusive breastfeeding on age-for-grade trajectories is novel, as no previous study has investigated breastfeeding’s effect on longitudinal trajectories of grade progression among homogenous subgroups. The majority of studies, including those that assessed educational attainment at multiple time points, used methods that assumed study participants progressed at the same level and estimated the average effect of breastfeeding on attainment, ignoring the nuances in progression in the population. The use of various analytical methods adds to the robustness of our findings. Unlike previous studies, our study population was almost entirely breastfed, and maternal education was not associated with the duration of exclusive breastfeeding. Even though adjustment for confounders, including socioeconomic status, generally attenuated the association between exclusive breastfeeding and age-for-grade in some subgroups, our ORs were consistent with reduced odds of falling behind in early school grades with longer duration of exclusive breastfeeding.

Our results are consistent with several earlier studies in varying settings2,4,13,14,49. Richards et al.50, for example, found a dose–response relationship between breastfeeding and higher educational attainment in the 1946 British birth cohort. In Brazil, longer breastfeeding duration was linked to more years of schooling completed at age 18 than breastfeeding for less than 1 month in a population with no socioeconomic patterning of breastfeeding49. Using data on sibling pairs from the National Longitudinal Study of Adolescent Health in the United States, Rees and Sabia found that breastfeeding was associated with a higher grade point average and a higher likelihood of graduating from high school and enrolling in college51.

The mechanisms underlying the improved educational attainment of optimally breastfed children have yet to be confirmed by research. However, one of the primary ways that breastfeeding may improve academic performance is through its role in brain development. Breastmilk contains a variety of nutrients and bioactive compounds necessary for brain development and cognitive function52,53,54,55. Breastmilk, for example, contains a high concentration of long-chain polyunsaturated fatty acids (LCPUFAs), which are required for brain and nervous system development52,53. Docosahexaenoic acid (DHA), one of the LCPUFAs in breastmilk, plays a crucial role in developing neural connections essential for learning and cognition52,53,56. Indeed, brain images of children and adolescents revealed that breastfed children have greater cortical thickness and significant increases in white and grey matter volume in many brain regions compared to non-breastfed children57,58,59,60,61. In addition to its direct nutritional benefits, breastfeeding promotes mother–child attachment and bonding62, which have been linked to enhanced cognitive and socioemotional development63, both of which are essential for academic success. However, studies have found positive effects of breastfeeding on cognitive development even after controlling for home stimulation3,21.

Our findings confirmed heterogeneity in grade progression across primary school. To promote good primary school progression, educational interventions should consider the trajectories identified in this study, noting when children begin to fall behind. Interventions should prioritise children whose parents are illiterate, children from low-income families, and children whose parents are subsistence farmers, as these children are more vulnerable to falling behind in the early and middle grades. Future research should investigate the mechanisms that underpin the association between breastfeeding and educational outcomes, as well as sex differences and possible mediators. More studies on the subject are warranted in sub-Saharan Africa to contribute to the growing body of literature and broaden understanding of the educational benefits of breastfeeding in the region.

One notable limitation of our study is that only a small number of children were assessed at ages 12 and 13. These small numbers could have reduced the statistical power to detect a difference in older children at the terminal grades. Our analysis did not account for birthweight and gestational age. However, this concern is minimal given that we controlled for known key confounders like parental education, family income, and parental occupation. Since data on exclusive breastfeeding duration were based on maternal recall, we cannot completely rule out the possibility of overreporting due to recall and social desirability bias. However, given that the data were collected in the first 3 months after birth and updated only a few months after the recommended duration for exclusive breastfeeding, any such effects should be minimal. Indeed, our estimated prevalence of exclusive breastfeeding at 6 months is comparable to the 34% reported in the 2015 Malawi Demographic and Health Survey64 and the 40% reported by Chipojola et al.65. Furthermore, it has been shown that, within the first three years after birth, maternal recall is a valid and reliable method for estimating infant breastfeeding66.

Conclusions

Our findings suggest that a longer duration of exclusive breastfeeding during infancy may promote better grade progression in primary school and decrease the likelihood of being over-age for grade. This finding supports the World Health Organization’s recommendation on exclusive breastfeeding by adding to the growing body of evidence that the benefits of exclusive breastfeeding for 6 months extend beyond the known improvements in infant health and survival. It also suggests that the beneficial effects on educational attainment reported in high-income settings are not entirely due to socioeconomic confounding. Policies encouraging exclusive breastfeeding as the primary feeding method for the first 6 months of life may help improve children's and adolescents’ educational attainment. We encourage more research into the educational benefits of breastfeeding and its possible mechanisms in sub-Saharan Africa to inform infant feeding recommendations, as the literature on the subject from the region is still sparse.