Predicting low cognitive ability at age 5 years using perinatal data and machine learning

Bowe, Andrea K.; Lightbody, Gordon; O’Boyle, Daragh S.; Staines, Anthony; Murray, Deirdre M.

doi:10.1038/s41390-023-02914-6

Download PDF

Population Study Article
Open access
Published: 04 January 2024

Predicting low cognitive ability at age 5 years using perinatal data and machine learning

Andrea K. Bowe ORCID: orcid.org/0000-0002-1874-234X¹,
Gordon Lightbody^1,2,
Daragh S. O’Boyle¹,
Anthony Staines³ &
…
Deirdre M. Murray^1,4

Pediatric Research (2024)Cite this article

442 Accesses
Metrics details

Abstract

Background

There are no early, accurate, scalable methods for identifying infants at high risk of poor cognitive outcomes in childhood. We aim to develop an explainable predictive model, using machine learning and population-based cohort data, for this purpose.

Methods

Data were from 8858 participants in the Growing Up in Ireland cohort, a nationally representative study of infants and their primary caregivers (PCGs). Maternal, infant, and socioeconomic characteristics were collected at 9-months and cognitive ability measured at age 5 years. Data preprocessing, synthetic minority oversampling, and feature selection were performed prior to training a variety of machine learning models using ten-fold cross validated grid search to tune hyperparameters. Final models were tested on an unseen test set.

Results

A random forest (RF) model containing 15 participant-reported features in the first year of infant life, achieved an area under the receiver operating characteristic curve (AUROC) of 0.77 for predicting low cognitive ability at age 5. This model could detect 72% of infants with low cognitive ability, with a specificity of 66%.

Conclusions

Model performance would need to be improved before consideration as a population-level screening tool. However, this is a first step towards early, individual, risk stratification to allow targeted childhood screening.

Impact

This study is among the first to investigate whether machine learning methods can be used at a population-level to predict which infants are at high risk of low cognitive ability in childhood.
A random forest model using 15 features which could be easily collected in the perinatal period achieved an AUROC of 0.77 for predicting low cognitive ability.
Improved predictive performance would be required to implement this model at a population level but this may be a first step towards early, individual, risk stratification.

Big data, machine learning, and population health: predicting cognitive outcomes in childhood

Article Open access 09 June 2022

Predicting mortality risk for preterm infants using random forest

Article Open access 31 March 2021

Machine learning methods to predict attrition in a population-based cohort of very preterm infants

Article Open access 22 June 2022

Introduction

Early life is a unique period where the developing brain has great plasticity and huge potential for adaptability.¹ There is consensus agreement that individual interventions to improve cognitive development should be initiated early.^2,3 A failure to achieve early foundational cognitive skills may result in a permanent loss of opportunity to reach full academic potential.⁴ This, in turn, may adversely affect outcomes throughout the life course including educational attainment,⁵ mental health,⁶ social mobility,⁷ financial well-being,⁸ and physical health.⁹

Internationally, many countries rely on universal screening programmes to identify children who may benefit from early intervention. The majority of developmental screening assessments are based on the presence of a delay in developmental milestones.¹⁰ A limitation of this approach is that opportunities for intervention in the period of optimal neuroplasticity are lost, and intervention begins when an infant is already substantially behind their typically developing peers. There is increasing evidence that early pre-emptive interventions, initiated prior to overt signs of delay or difficulty, can alter neurodevelopmental outcomes.^11,12,13 The challenge we face is predicting at an individual level, using accurate and scalable methods, who the highest risk infants are.

In the United States, the population-based early intervention programmes Head Start and Early Head Start, base eligibility primarily on a family income at or below the poverty level.¹⁴ Adverse socioeconomic conditions are among the strongest predictors of poor cognitive outcomes in childhood, but there are other important psychosocial, biological, genetic, and environmental influences, which often have complex and interactive relationships both with each other and with cognitive outcomes.^15,16 There is now increasing potential to statistically model complex interactive relationships using machine learning techniques.^17,18 This has not been sufficiently explored for prediction of poor cognitive outcomes in childhood at a population level.¹⁹

In this study we aim to develop an explainable predictive algorithm, using population-based cohort data, for identifying infants at risk of low cognitive ability (LCA) at school-age. The objectives of the study are to train a variety of machine learning models to predict LCA at age 5; to test these models on an independent unseen test-set; to compare model performance using a range of measures; and to identify the most important features for prediction.

Methods

Data

Data are from the Growing Up in Ireland (GUI) Infant Cohort, a nationally representative survey of infants and their primary caregivers (PCG). Wave 1 of data collection commenced in 2008 at infant age 9-months, Wave 2 occurred at age 3 years, and Wave 3 at 5 years. The sample was drawn from the National Child Benefits Register, a universal welfare entitlement in the Republic of Ireland. It was selected on a systematic basis with a random start and constant sampling fraction, and was pre-stratified by marital status, county of residence, nationality, and number of children in the family. There were 11,134 families who participated at Wave 1, of whom 9001 completed Wave 3, representing 80.8% of the original sample. Full details of the sample design, response, and survey instruments are available.²⁰ Eligible for inclusion in this study were the 8858 infants who completed cognitive assessments at age 5 and their PCGs, who in 99.7% of cases were the infant’s mother. A flow chart of the study population is contained in Supplementary Material Fig. S1.

Outcome

Cognitive ability at age 5 years was directly assessed using two core subtests of the British Ability Scales (BAS) Early Years Battery Second Edition, administered in the child’s home by a trained interviewer. The BAS consists of a battery of individually administered subtests (detailed in Supplementary Material Table S1) and has demonstrated construct validity as a measure of cognitive ability and high test-retest reliability.²¹ To minimise participant burden in the GUI study, there was an upper limit of 90 min contact time in the home. Therefore, it was not feasible to administer the full battery of tests, and two core subtests which most closely align with measures of crystallised and fluid cognitive ability were chosen.²²

The Naming Vocabulary test measures verbal ability in the English language and consists of the child naming everyday items displayed from a picture book. The Picture Similarities test measures non-verbal ability and consists of the child being shown four pictures and requested to match a fifth picture, based on a shared characteristic or construct. A standardised score for each scale is provided in the dataset and is adjusted for both item difficulty and age (within a 3 month age band).

Multiple BAS subtest scores can be combined with summation to produce a General Conceptual Ability Score. To produce composite scores using fewer subtests principal components analysis (PCA) was used, as in previous research.²³ PCA of the two BAS subtests confirmed the presence of a general underlying cognitive ability factor. Principal component 1 (PC1) accounted for 64% of the total variance among the subtests. The Pearson correlation between this factor and the observed variable was 0.80 for Picture Similarities and 0.80 for Naming Vocabulary subtests. PC1 was then standardised to produce a general cognitive ability (GCA) score with a mean of 100 and a standard deviation (sd) of 15.

There is no consensus agreement on a cut-off that defines LCA in childhood. The International Classification of Disease 11th Revision (ICD-11) use a standardised test score that is ≥2 standard deviations (SD) below the mean to define a disorder of intellectual development, while other research in this area has used cut-offs of ≥1 or 1.5 SDs below the mean.^16,24,25 As this study was exploratory in nature, we examined both a 1 and 1.5 SD cut-off. For clarity of presentation the 1.5 SD cut off is used in the main study and children scoring below this cut-off are referred to as having low cognitive ability (LCA). The results using a 1 SD cut off are included in online-only material and this is referred to as below average cognitive ability (BACA).

Data preparation

No feature had more than 12% missing values. Missing values were imputed using the ‘missForest’ package, a random forest imputation method.²⁶ The post-imputation dataset was stratified by the outcome and then randomly split into a training set, containing 70% (n = 6202) of participants, and a testing set, containing 30% (n = 2656).

Feature selection

The GUI dataset contains more than 600 variables measured at wave 1 (9-months) across domains of health, education, cognitive development, social class, and neighbourhood characteristics. The framework used in the selection of relevant features is shown in Fig. S2. These features capture pregnancy and birth, maternal and infant characteristics, the socioeconomic circumstances of family, and the infant’s early environment. Only features based on information that could be easily obtained at a population level without the need for invasive testing were eligible for inclusion. Features which could be collected in the perinatal period were preferable to enable prediction soon after birth. It is well established that the early learning environment is very important in cognitive development, and a set of features intended to measure this were included.²⁷ Previous published work by our group has focussed on feature selection to predict low IQ in a separate cohort and this was also considered.²⁸ All features considered are described in Supplementary Material Table S2.

To remove redundant features, Pearson correlation coefficients were calculated and plotted. Features with a correlation greater than 0.6 were examined for redundancy. In choosing which features to remove, the potential timing of collection, objectivity, and clinical opinion were considered. Three feature sets were then created (Table S3). Set 1 contained the features identified as most important in our previous work.²⁸ Set 2 contained only features that could potentially be collected in the perinatal period. Set 3 included those representing the child’s early environment that would require later measurement. Recursive feature elimination (RFE), with five-fold cross validation repeated five times, was performed for each feature set to determine the optimal combination for the final models. The features included in the final models are indicated by an asterisk in Table S3.

Modelling

The dataset was imbalanced with regard to the outcome of interest. Training a model on class imbalanced data risks producing a model bias in favour of the majority class, with poor predictive performance for the minority class.²⁹ To address this Synthetic Minority Oversampling Technique (SMOTE) was applied to the training set only.³⁰ Unlike other oversampling techniques which simply duplicate minority class cases, SMOTE utilises an over-sampling approach where the minority class is over-sampled by creating synthetic examples.^29,30 Random forest (RF), logistic regression (LR), and support vector machine (SVM) algorithms were trained and optimal hyperparameters selected using the rebalanced training dataset and ten-fold cross validated grid search.

Evaluation

To select the most appropriate machine learning algorithm, accuracy across ten-fold cross-validation was compared. After selection of the algorithm, the final models were tested on an independent unseen test set. Area under the receiver operating curve (AUROC) was used to evaluate overall model performance. Explanation of performance metrics is provided in online-only material. A summary of the modelling process is contained in Fig. 1.

**Fig. 1: Overview of modelling process.**

Explainability

Feature importance plots were created using the permutation method in the ‘vip’ package.³¹ First, baseline model performance is measured using a measure set by the user (in this study—AUROC). The feature of interest is then randomly shuffled and model performance is measured again. The difference between the two measures is used as a measure of feature importance. For each feature shuffling was simulated ten times and importance was averaged across the simulations. It would be expected that randomly shuffling the values of an important feature would degrade model performance.³¹

The relationships between the most important features and the outcome were examined using partial dependence plots (PDPs), which provide a visualisation of the relationship between a feature and the response while accounting for the average effect of the other features in the model.³² A lower value on the y-axis of the PDP suggests that the positive class (low cognitive ability) is less likely at that value of the feature on x-axis according to the model.

Results

Characteristics of study population

There were 8858 infants included, of whom n = 573 (6.5%) had LCA. A summary of characteristics are described in Table 1, with a complete description provided in Table S4. The LCA group, which was comprised of 60.6% boys, had a lower mean maternal age (30.6 vs. 32.1 years, p < 0.001), a higher proportion of mothers who smoked (30.4% vs. 22.4%, p < 0.001), and a lower proportion of mothers reporting English as their native language (53.9% vs. 86.9%, p < 0.001). In the LCA group, 43.8% of mothers reported their highest education level as being secondary or primary only, compared with 27.5% of those without LCA. There were significant differences between the groups with regard to all socioeconomic characteristics. The LCA group had a lower median family income (€31,200 vs. €48,000, p < 0.001), a lower proportion of families living in owner occupied accommodation (38.9% vs. 73.5%, p < 0.001), and a higher proportion in the lower social classes.

Table 1 Pregnancy, maternal, infant, socioeconomic and early environmental characteristics of study population.

Full size table

Rebalanced training dataset

In the study dataset, n = 573/8858 (6.5%) children had LCA. As detailed in Fig. 1, in the training dataset, which contained 70% of participants, n = 402/6202 had LCA. After application of SMOTE, the rebalanced training dataset consisted of n = 1206 (42.9%) children with LCA and n = 1608 (57.1%) without. This rebalanced dataset was used to train the models. The testing dataset was not rebalanced and contained the original 30% of participants, of whom n = 171/2656 (6.5%) had LCA. This was the dataset used to evaluate the models.

Feature and algorithm selection

Following RFE 8 features were retained for Model 1, 15 for Model 2 and 23 features for Model 3. As shown in Table S5, the random forest algorithm was the best performing algorithm for all three models, in repeated ten-fold cross validation, and was selected for the final models.

Final model evaluation

The independent test set contained n = 2656 participants, of whom n = 171 had LCA. Models 2 and 3 achieved the highest AUROC of 0.77 and 0.78 respectively (Fig. S3). At a decision threshold of 0.5 the models correctly predicted the cognitive outcome of 87% of participants at age 5 (Table 2). The odds of having low cognitive ability were 5.9 times higher in the group with a ‘low’ prediction. Model 2 was deemed to be the best performing model overall, achieving similar performance to Model 3 but using only 15 features, all of which had potential to be collected in the perinatal period. To further explore its performance, the decision threshold was altered in increments of 0.5 and the corresponding sensitivities, specificities, positive and negative predictive values were examined using the independent test set. The optimal threshold of 0.28, using Youden’s index, yielded a sensitivity of 72% with a specificity of 66% (Table S6).

Table 2 Performance metrics of final models tested on independent test set.

Full size table

A worked example of how the model, at a decision threshold of 0.5, would function in the real world is provided in Table S7. In 2021 there were almost 60,000 births in Ireland. Assuming a prevalence of 6.5%, 3900 infants will have LCA at age 5, of whom 1560 would be detected in infancy (sensitivity 0.40). Of the 56,100 infants with average or above cognitive ability at age 5, 50,490 would be correctly identified as not at risk (specificity 0.90). There would be 5610 false positives and 2340 false negatives.

The modelling and results for predicting LCA using an alternative cut off of a GCA score more than 1 SD below the mean are shown in Supplementary Material Fig. S4 and Tables S8 and S9.

Explainability

The five most important features in Model 2 were PCG alcohol intake (measured when infant was 9-months old), household social class, PCG highest education, number of bedrooms in the home, and household equivalised income (Fig. 2). PDPs are shown in Fig. 3, alongside a histogram showing the distribution of each feature in the dataset.

**Fig. 2: Ten most important features in random forest model 2.**

**Fig. 3: Partial dependence plots and histograms examining the relationship between the six most important features in Random Forest Model 2 and low cognitive ability.**

The five most important features were all markers of socioeconomic status (SES). To determine whether any of these markers of SES could achieve similar predictive performance alone, five separate random forest models were trained and tested using each feature alone as a predictor. As shown in Supplementary Table S10, none of the features alone achieved as high an AUROC as the final model. Family income achieved an AUROC of 0.64 (95% CI 0.59–0.68). Each feature alone could achieve a high accuracy, but this was driven by high specificity with relatively poor sensitivity for detecting those with low cognitive ability.

Discussion

We have shown by evaluating a variety of machine learning algorithms in a large population-based cohort that a RF model based on 15 features, all of which have potential to be collected in the perinatal period, achieved an AUROC of 0.77 for predicting low cognitive ability at age 5. At a decision threshold of 0.5, the model could correctly predict the cognitive outcome of 87% of infants, however this was largely driven by a high specificity and its ability to detect the majority of children with normal cognitive ability. When the alternative cut off of 1 SD was used, an AUROC of 0.70 was achieved using 31 features in a RF model. Model performance was similar to that reported by Camargo et al., who developed a LR model with an AUROC of 0.75, to predict low IQ (defined by a z-score greater than 1 standard deviation below the mean) at age 6. Their model, however, included predictors that could not be measured until the infant was 12-months old.¹⁶ No other similar predictive models were identified in the literature for comparison, and to our knowledge, this is the first study to examine of potential of machine learning for the prediction of low cognitive ability in childhood.

It is difficult to make direct comparisons of model performance with other screening tools due to differences in cohort characteristics, cognitive tests, and cut-off scores used to determine LCA. The available literature examining the performance of the Ages and Stages Questionnaire (ASQ) suggests very similar performance. The ASQ is one of the most widely used parent-reported screening tools and is currently recommended for early screening of developmental delay at 2 years of age in many countries, including by the American Academy of Pediatrics. The ASQ at 24 and 36 months have reported AUROCs of 0.64 and 0.78 respectively, for predicting low IQ at age 5, defined as an IQ < 1 SD below the cohort mean and an IQ < 85 in the respective studies.^33,34 A study examining the performance of the ASQ performed between 8–30 months for predicting low IQ in the early years of schooling reported similarly low sensitivities ranging from 28–50%, with specificities ranging from 79–96% across a range of cut-offs.³⁵ Our data suggest that similar prediction can be made at birth allowing early intervention in the most high risk cases.

The statistical approach used in this study was designed to optimise prediction, not investigate causal relationships. However, it is notable that among the ten most important predictors, six (alcohol intake, social class, education, income, bedrooms in the home, and maternal age) are inherently associated with socioeconomic status.^36,37 This is in keeping with a wealth of literature, both interventional and observational, which has consistently shown that the socioeconomic environment an infant is born into is one of the strongest predictors of cognitive outcomes in childhood.^{38,39,40,41,42} The findings of this study would suggest that these features, while all representative of socioeconomic status, may contribute to risk in different ways as their cumulative effects better predicted the outcome than the effect of any single feature.

Previous research has shown that children exposed to multiple early risk factors represent a more vulnerable subgroup for adverse childhood outcomes. The more risk factors a child is exposed to the worse these outcomes tend to be.⁴³ The most commonly used statistical approach to combining risk factors for the prediction of cognitive outcomes has been to combine them in an additive fashion, with the main effect of each factor accounted for.^15,16 However, we know that risk factor effects are not necessarily the same for all children and effect modification and interaction does exist.⁴⁴ The effect of one risk factor may be accentuated or diminished by another exposure. For example, the co-occurrence of brain injury due to preterm birth, a biological risk factor, and socioeconomic disadvantage, a social risk factor, have synergistic adverse effects on neurodevelopmental outcomes.⁴⁵ An advantage of the ML approach used in this study is that the algorithms can combine features in interactive relationships without the need for prior specification.

Our study has many limitations to consider. There is much debate about the merits and limitations of standardised cognitive assessments. Both the ICD-11 and the Diagnostic and Statistical Manual of Mental Disorders 5th Edition (DSM-V) now include the impact on adaptive functioning as a diagnostic criteria for IDDs, reflecting a move away from using standardised testing alone.^24,46 The purpose of this study was not to develop a diagnostic model, but a prognostic model that could identify high-risk children whose cognitive outcome could adversely impact other aspects of life, such as educational attainment and mental health.^47,48 In this regard, the use of a binary cut-off is justified, although there is no consensus on the most appropriate one. In this study cut offs of 1 and 1.5 SD were examined.

Cognitive ability was directly measured using standardised assessments, however children taking the naming vocabulary test, for whom English is not their native language, may be disadvantaged, with apparent poor performance. A more appropriate measure of cognitive ability in children from multilingual backgrounds may include total vocabulary across their languages, or novel language-independent cognitive assessments.^49,50 Children in GUI did not complete all BAS subtests and PCA was therefore used to calculate a GCA score. This may not be acceptable for a formal diagnosis of an IDD, however, a cited advantage of the BAS is that not all tests in the battery are required to assess performance and tests are individually interpretable.²¹ While there is overlap in the mental processes used in different subtests, the two subtests chosen in the GUI study largely measure verbal and non-verbal ability, and are not primarily intended to measure numerical ability, spatial ability, perceptual speed, or memory. Performing the full battery of BAS subtests would provide a more robust assessment of cognitive ability, but for large population-based cohort studies may not be feasible.^21,51

There is now a substantial body of evidence to suggest that early interventions are most effective when started in the first year of life.^11,12,13 For this reason, our study focussed on features which could be collected in the perinatal period at a population level. However, this approach may come at the cost of reduced predictive performance as detailed information on the child’s early home and school environment were not included in the model. Information on the number of books in the home and the time spent on learning activities at age 3 were included in feature set C but predictive performance was not significantly better than with perinatal features alone. This likely reflects the fact that the early learning environment is intimately intertwined with the socioeconomic background of the family.

While all 15 features in the final model could be collected in the perinatal period, in the GUI study they were collected at infant age 9-months. Validation of the model in a cohort with data collection in the perinatal period is required, but this is challenging. There are more than 110 birth cohorts in Europe, however, there are no consensus guidelines on measurements, scales, data sources, or timing.⁵² This leads to significant challenges with merging and harmonising datasets which could provide large pools of data for validation. Work in this area is ongoing.⁵³

This study included a large sample size, and a careful sampling strategy was employed in recruitment of the cohort.²⁰ Attrition in the GUI study was relatively low and the sample included in this study who completed cognitive assessments at age 5 represented 79.6% of the original cohort recruited at 9-months. However, attrition was higher among those from disadvantaged backgrounds, who are also more likely to have poor cognitive outcomes.⁵⁴ Ideally, the model should be validated using population-based registry data which is not currently collected in Ireland.

Extensive validation is required to ensure wider generalisability of predictive models. If the relationship between predictor and outcome changes, model performance will be affected. For example, in Ireland only 35% of babies receive any breastmilk at 3 months, and this is significantly associated with socioeconomic status (SES).⁵⁵ In our model breastfeeding may be a surrogate marker of SES, a relationship which may not exist in other countries where there are very high rates of breastfeeding.

An important challenge with ML models is explainability, the concept that the prediction a model makes can be explained in an acceptable way on a human level. In this study PDP plots were used to help understand how the model was making predictions. However, these must be interpreted with caution. For example, the PDP plot examining the relationship between PCG alcohol intake and risk of low cognitive ability in the child would suggest that those with moderate alcohol consumption have the lowest risk, while those with no alcohol or very high alcohol consumption have the highest risk. This U-shaped relationship curve between alcohol consumption and health outcomes has been described previously in the literature.⁵⁶ In our study, among those who reported English was not their native language 41.4% reported no alcohol intake, compared to only 12% of those for whom English was the native language. Therefore, it is plausible that the increased risk seen for those who reported no alcohol consumption is due to confounding, and the relationship is actually being driven by immigration, cultural, religious, language, or socioeconomic factors.

Finally the individual, health-system, and resource implications of a risk-based approach must be considered. In its current iteration this model may not be suitable for use at a population level. The sensitivity is too low and the resource implications of the high false positive rate, when the decision threshold is lowered, is too great. However, it is a first step towards enabling an early personalised, prediction, which could assist decisions on further intervention. This is not without consequence. Labelling a child so early in life could have detrimental effects on both child and family. In addition to adverse impacts on the child’s self-concept, it can perpetuate a self-fulfilling prophecy due to effects on parent and teacher expectations.^57,58 Using a label based largely on social factors, often rooted in inequities generated by social and political policy, may shift the blame of societal failures onto the individual child and family. Any risk stratification must come from a position of support, environmental enrichment, and education.

In conclusion, the current practice of waiting for overt signs of developmental delay before intervention goes against a large body of developmental literature. In this study, it was possible to develop a model, using 15 simple features available at birth and readily incorporated into an electronic health record, that correctly predicted the cognitive ability of 87% of children at age 5. Whilst further improvements in predictive performance would be required to use this as a population level screening tool, it provides a strong basis for further research. New methods of direct assessment of early cognitive function are lacking. Neurophysiological measures such as electroencephalography and eye-tracking show some promise.^42,59 Combined with targeted direct assessment, this model could, in the future, form the foundation for an early targeted screening protocol similar to that now being widely implemented for the early diagnosis of cerebral palsy.^60,61 Improved capacity to collect large volumes of rich data and to apply novel statistical methods, particularly suited to prediction, provides an opportunity for researchers and clinicians to investigate alternative approaches.

Data availability

The anonymised Growing Up in Ireland data from the Infant Cohort is available for request for bona fide research purposes through the Irish Social Science Data Archive https://www.ucd.ie/issda/data/growingupinirelandgui/.

References

Cioni, G., Inguaggiato, E. & Sgandurra, G. Early intervention in neurodevelopmental disorders: underlying neural mechanisms. Dev. Med. Child Neurol. 58, 61–66 (2016).
Article PubMed Google Scholar
Ramey, C. T. & Ramey, S. L. Prevention of intellectual disabilities: early interventions to improve cognitive development. Prev. Med. 27, 224–232 (1998).
Article CAS PubMed Google Scholar
Pungello, E. P. et al. Early educational intervention, early cumulative risk, and the early home environment as predictors of young adult outcomes within a high-risk sample. Child Dev. 81, 410–426 (2010).
Article PubMed PubMed Central Google Scholar
Spencer, N., Raman, S., O’Hare, B. & Tamburlini, G. Addressing inequities in child health and development: towards social justice. BMJ Paediatr. Open 3, e000503 (2019).
Article PubMed PubMed Central Google Scholar
Lager, A., Bremberg, S. & Vågerö, D. The association of early IQ and education with mortality: 65 year longitudinal study in Malmö, Sweden. BMJ 339, b5282 (2009).
Article CAS PubMed PubMed Central Google Scholar
Alesi, M., Rappo, G. & Pepi, A. Emotional profile and intellectual functioning: a comparison among children with borderline intellectual functioning, average intellectual functioning, and gifted intellectual functioning. SAGE Open. 5, 21582440155 (2015)
Forrest, L. F., Hodgson, S., Parker, L. & Pearce, M. S. The influence of childhood IQ and education on social mobility in the Newcastle Thousand Families birth cohort. BMC Public Health 11, 895 (2011).
Article PubMed PubMed Central Google Scholar
Furnham, A. & Cheng, H. Childhood cognitive ability predicts adult financial well-being. J. Intell. 5, 3 (2016).
Article PubMed PubMed Central Google Scholar
Whalley, L. J. & Deary, I. J. Longitudinal cohort study of childhood IQ and survival up to age 76. BMJ 322, 819 (2001).
Article CAS PubMed PubMed Central Google Scholar
Hirai, A. H. et al. Prevalence and variation of developmental screening and surveillance in early childhood. JAMA Pediatr. 172, 857–866 (2018).
Article PubMed PubMed Central Google Scholar
Campbell, F. A. et al. Early childhood education: young adult outcomes from the abecedarian project. Appl. Dev. Sci. 6, 42–57 (2002).
Article Google Scholar
Spittle, A. J., Orton, J., Doyle, L. W. & Boyd, R. Early developmental intervention programs post hospital discharge to prevent motor and cognitive impairments in preterm infants. Cochrane Database Syst. Rev. 18, CD005495 (2007).
Whitehouse, A. J. O. et al. Effect of preemptive intervention on developmental outcomes among infants showing early signs of autism: a randomized clinical trial of outcomes to diagnosis. JAMA Pediatr. 175, e213298 (2021).
Article PubMed PubMed Central Google Scholar
Office of the Assistant Secretary for Planning and Evaluation. Poverty Guidelines. https://aspe.hhs.gov/topics/poverty-economic-mobility/poverty-guidelines (2023).
Eriksen, H. L. et al. Predictors of intelligence at the age of 5: family, pregnancy and birth characteristics, postnatal influences, and postnatal growth. PLoS ONE 8, e79200 (2013).
Article PubMed PubMed Central Google Scholar
Camargo-Figuera, F. A., Barros, A. J., Santos, I. S., Matijasevich, A. & Barros, F. C. Early life determinants of low IQ at age 6 in children from the 2004 Pelotas Birth Cohort: a predictive approach. BMC Pediatr. 14, 308 (2014).
Article PubMed PubMed Central Google Scholar
Azzolina, D. et al. Machine learning in clinical and epidemiological research: Isn’t it time for biostatisticians to work on it? Epidemiol. Biostat. Public Health 16 https://doi.org/10.2427/13245 (2019).
Obermeyer, Z. & Emanuel, E. J. Predicting the future—big data, machine learning, and clinical medicine. N. Engl. J. Med. 375, 1216–1219 (2016).
Article PubMed PubMed Central Google Scholar
Bowe, A. K. et al. Big data, machine learning, and population health: predicting cognitive outcomes in childhood. Pediatr. Res. 93, 300–307 (2023).
Article PubMed Google Scholar
Quail, A., Williams, J., McCrory, C., Murray, A. & Thornton, M. A Summary Guide to Wave 1 of the Infant Cohort of Growing Up in Ireland. https://www.growingup.gov.ie/pubs/Summary-Guide_Infant-Cohort_Wave-1.pdf (2011).
Elliott, C. D., Smith, P. & McCulloch, K. British Ability Scales Second Edition (BAS II). Administration and Scoring Manual (Nelson, 1996).
Williams, J., Thornton, M., Murray, A. & Quail, A. Growing Up in Ireland Design, Instrumentation and Procedures for Cohort ’08 at Wave 3 (5 Years). https://www.growingup.gov.ie/pubs/20190404-Cohort-08-at-5years-design-instrumentation-and-procedures.pdf (2019).
Jones, E. M. & Schoon, I. Child Cognition and Behaviour. Millennium Cohort Study Third Survey: A User’s Guide to Initial Findings. https://core.ac.uk/download/pdf/111052317.pdf#page=126 (2008).
World Health Organisation. ICD-11 for Mortality and Morbidity Statistics: 6A00 Disorders of Intellectual Development. https://icd.who.int/browse11/l-m/en#/http%253A%252F%252Fid.who.int%252Ficd%252Fentity%252F605267007 (2023).
Karrasch, M. et al. Cognitive outcome in childhood-onset epilepsy: a five-decade prospective cohort study. J. Int. Neuropsychol. Soc. 23, 332–340 (2017).
Article PubMed PubMed Central Google Scholar
Stekhoven, D. J. & Bühlmann, P. MissForest–non-parametric missing value imputation for mixed-type data. Bioinformatics 28, 112–118 (2012).
Article CAS PubMed Google Scholar
McCormick, B. J. J. et al. Early life experiences and trajectories of cognitive development. Pediatrics 146, e20193660 (2020).
Article PubMed Google Scholar
Bowe, A. K. et al. Predicting low cognitive ability at age 5-feature selection using machine learning methods and birth cohort data. Int. J. Public Health 67, 1605047 (2022).
Article PubMed PubMed Central Google Scholar
Blagus, R. & Lusa, L. SMOTE for high-dimensional class-imbalanced data. BMC Bioinforma. 14, 106 (2013).
Article Google Scholar
Chawla, N. et al. SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 341–348 (2002).
Article Google Scholar
Greenwell, B. & Boehmke, B. Variable importance plots—an introduction to the vip package. R. J. 12, 343–366 (2020).
Article Google Scholar
Greenwell, B. pdp: an R package for constructing partial dependence plots. R. J. 9, 421–436 (2017).
Article Google Scholar
Charkaluk, M. L. et al. Ages and stages questionnaire at 3 years for predicting IQ at 5-6 years. Pediatrics 139, e20162798 (2017).
Article PubMed Google Scholar
Bowe, A. K. et al. The predictive value of the ages and stages questionnaire in late infancy for low average cognitive ability at age 5. Acta Paediatr. 111, 1194–1200 (2022).
Article PubMed PubMed Central Google Scholar
Schonhaut, B. L. et al. Validez del Ages & Stages questionnaires para predecir el desempeño cognitivo en los primeros años de educación escolar [Predictive value of Ages & Stages Questionnaires for cognitive performance at early years of schooling]. Rev. Chil. Pediatr. 88, 28–34 (2017).
Google Scholar
Daly, M. C., Duncan, G. J., McDonough, P. & Williams, D. R. Optimal indicators of socioeconomic status for health research. Am. J. Public Health 92, 1151–1157 (2002).
Article PubMed PubMed Central Google Scholar
van Roode, T., Sharples, K., Dickson, N. & Paul, C. Life-course relationship between socioeconomic circumstances and timing of first birth in a birth cohort. PLoS ONE 12, e0170170 (2017).
Article PubMed PubMed Central Google Scholar
Bradley, R. H. & Corwyn, R. F. Socioeconomic status and child development. Annu. Rev. Psychol. 53, 371–399 (2002).
Article PubMed Google Scholar
Larson, K., Russ, S. A., Nelson, B. B., Olson, L. M. & Halfon, N. Cognitive ability at kindergarten entry and socioeconomic status. Pediatrics 135, e440–8 (2015).
Article PubMed Google Scholar
Tong, S., Baghurst, P., Vimpani, G. & McMichael, A. Socioeconomic position, maternal IQ, home environment, and cognitive development. J. Pediatr. 151, 284–288 (2007).
Article PubMed Google Scholar
von Stumm, S. & Plomin, R. Socioeconomic status and the growth of intelligence from infancy through adolescence. Intelligence 48, 30–36 (2015).
Article Google Scholar
Troller-Renfree, S. V. et al. The impact of a poverty reduction intervention on infant brain activity. Proc. Natl. Acad. Sci. USA 119, e2115649119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Evans, G. W. A multimethodological analysis of cumulative risk and allostatic load among rural children. Dev. Psychol. 39, 924–933 (2003).
Article PubMed Google Scholar
Evans, G. W., Li, D. & Whipple, S. S. Cumulative risk and child development. Psychol. Bull. 139, 1342–1396 (2013).
Article PubMed Google Scholar
Benavente-Fernández, I. et al. Association of socioeconomic status and brain injury with neurodevelopmental outcomes of very preterm children. JAMA Netw. Open 2, e192914 (2019).
Article PubMed PubMed Central Google Scholar
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders 5th edn (American Psychiatric Association, 2013).
Deary, I. J., Strand, S., Smith, P. & Fernandes, C. Intelligence and educational achievement. Intelligence 35, 13–21 (2007).
Article Google Scholar
Bowe, A. K., Staines, A. & Murray, D. M. Below average cognitive ability—an under researched risk factor for emotional-behavioural difficulties in childhood. Int. J. Environ. Res. Public Health 18, 12923 (2021).
Article PubMed PubMed Central Google Scholar
Hoff, E. & Core, C. What clinicians need to know about bilingual development. Semin. Speech Lang. 36, 89–99 (2015).
Article PubMed PubMed Central Google Scholar
Twomey, D. M. et al. Feasibility of using touch screen technology for early cognitive assessment in children. Arch. Dis. Child 103, 853–858 (2018).
Article PubMed Google Scholar
The British Psychological Society. Test Review British Ability Scales Second Edition (BAS II) Early Years. https://explore.bps.org.uk/binary/bpsworks/8c6cc30167a91a68/49da8e25db69400d9493abefd966956872cb302b488b2e54a8b2acdf2c769cb6/1854335138.pdf (2014).
Pansieri, C. et al. An inventory of European birth cohorts. Int. J. Environ. Res. Public Health 17, 3071 (2020).
Article PubMed PubMed Central Google Scholar
Jaddoe, V. W. V. et al. The LifeCycle Project-EU Child Cohort Network: a federated analysis infrastructure and harmonized data of more than 250,000 children and parents. Eur. J. Epidemiol. 35, 709–724 (2020).
Article PubMed PubMed Central Google Scholar
Murray, A., Williams, J., Quail, A., Neary, M. & Thornton, M. Growing Up in Ireland A Summary Guide to Wave 3 of the Infant Cohort (at 5 Years). https://www.growingup.gov.ie/pubs/Summary-Guide-_Infant-Cohort_Wave-3.pdf (2019).
Gallegos, D. et al. Understanding breastfeeding behaviours: a cross-sectional analysis of associated factors in Ireland, the United Kingdom and Australia. Int. Breastfeed. J. 15, 103 (2020).
Article PubMed PubMed Central Google Scholar
San José, B., van de Mheen, H., van Oers, J. A., Mackenbach, J. P. & Garretsen, H. F. The U-shaped curve: various health measures and alcohol drinking patterns. J. Stud. Alcohol 60, 725–731 (1999).
Article PubMed Google Scholar
Jussim, L. & Harber, K. D. Teacher expectations and self-fulfilling prophecies: knowns and unknowns, resolved and unresolved controversies. Pers. Soc. Psychol. Rev. 9, 131–155 (2005).
Article PubMed Google Scholar
Shifrer, D. Stigma of a label: educational expectations for high school students labeled with learning disabilities. J. Health Soc. Behav. 54, 462–480 (2013).
Article PubMed Google Scholar
Dean, B. et al. Eye-tracking for longitudinal assessment of social cognition in children born preterm. J. Child Psychol. Psychiatry 62, 470–480 (2021).
Article PubMed Google Scholar
Novak, I. et al. Early, accurate diagnosis and early intervention in cerebral palsy: advances in diagnosis and treatment. JAMA Pediatr. 171, 897–907 (2017).
Article PubMed PubMed Central Google Scholar
Maccow, G. Overview of the Differential Ability Scales 2nd edn (Pearson Clinical, 2023).

Download references

Funding

A.K.B. is funded by the Irish Clinical Academic Training (ICAT) Programme. The ICAT programme is supported by the Wellcome Trust and the Health Research Board (Grant Number 203930/B/16/Z), the Health Service Executive, National Doctors Training and Planning and the Health and Social Care, Research and Development Division, Northern Ireland. The funding body had no role in the study design, data collection and analysis, or manuscript preparation. Open Access funding provided by the IReL Consortium.

Author information

Authors and Affiliations

INFANT Research Centre, University College Cork, Cork, Ireland
Andrea K. Bowe, Gordon Lightbody, Daragh S. O’Boyle & Deirdre M. Murray
Department of Electrical and Electronic Engineering, University College Cork, Cork, Ireland
Gordon Lightbody
School of Nursing, Psychotherapy, and Community Health, Dublin City University, Dublin, Ireland
Anthony Staines
Department of Paediatrics, Cork University Hospital, Cork, Ireland
Deirdre M. Murray

Authors

Andrea K. Bowe
View author publications
You can also search for this author in PubMed Google Scholar
Gordon Lightbody
View author publications
You can also search for this author in PubMed Google Scholar
Daragh S. O’Boyle
View author publications
You can also search for this author in PubMed Google Scholar
Anthony Staines
View author publications
You can also search for this author in PubMed Google Scholar
Deirdre M. Murray
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.K.B.: conceptualisation, data analysis and interpretation, writing (original draft), guarantor of study. G.L.: conceptualisation, supervision, writing (review and editing). D.S.O.: data analysis and interpretation, writing (review and editing). A.S.: conceptualisation, supervision, writing (review and editing). D.M.M.: conceptualisation, supervision, writing (review and editing).

Corresponding author

Correspondence to Andrea K. Bowe.

Ethics declarations

Competing interests

The authors declare no competing interests.

Ethical approval

Ethical approval for the Growing Up in Ireland Infant Cohort was granted by a dedicated Research Ethics Committee set up by the Department of Health and Children in Ireland. Secondary analysis of the anonymised dataset does not require further ethical approval.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bowe, A.K., Lightbody, G., O’Boyle, D.S. et al. Predicting low cognitive ability at age 5 years using perinatal data and machine learning. Pediatr Res (2024). https://doi.org/10.1038/s41390-023-02914-6

Download citation

Received: 29 May 2023
Revised: 06 October 2023
Accepted: 03 November 2023
Published: 04 January 2024
DOI: https://doi.org/10.1038/s41390-023-02914-6

Abstract

Background

Methods

Results

Conclusions

Impact

Similar content being viewed by others

Big data, machine learning, and population health: predicting cognitive outcomes in childhood

Predicting mortality risk for preterm infants using random forest

Machine learning methods to predict attrition in a population-based cohort of very preterm infants

Introduction

Methods

Data

Outcome

Data preparation

Feature selection

Modelling

Evaluation

Explainability

Results

Characteristics of study population

Rebalanced training dataset

Feature and algorithm selection

Final model evaluation

Explainability

Discussion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Ethical approval

Additional information

Supplementary information

Supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links