Main

Early identification of infants at high risk of CP is essential in clinical practice to select an appropriate follow-up, and in research to develop and evaluate early interventions. Despite well-known risk factors for CP like preterm birth and perinatal asphyxia, a majority of those infants will have normal motor development (1). Less than 15% of infants born before 28 weeks of gestation develop CP (2), and, among extremely preterm infants with major brain injury identified on cerebral ultrasound, only a third develop CP (3). Lack of accurate and early markers for CP hampers the organization of resource-effective follow-up programs and research on early, high-intensity training programs.

Prognostic models based on gestational age, neonatal morbidities, and therapeutic interventions like the use of steroids in the neonatal period (4), as well as neurological assessment tools (5, 6), magnetic resonance imaging (7), or a combination of these (8, 9), may be used to predict outcomes in high-risk infants. A common finding is that accuracy of neurological assessment increases with age on assessment, delaying interventions beyond the time when brain plasticity is assumed to be at its highest (10, 11). The General Movement Assessment (GMA) has been found to have the best accuracy and predictive power for CP in young infants (12). The absence of so-called fidgety movements (FMs) at 3 months post term age has been shown to identify infants with CP with sensitivities between 90 and 98% and specificities between 90 and 94% (refs 12, 13, 14). Recently, GMA was recommended as a screening tool for CP in high-risk infants in Australia (13).

The GMA requires ample training, frequent recalibration of analytical skills, and time for video observation and analysis. The GMA is based on Gestalt perception, and similar to other observational pattern-recognition methods for analysis of complex phenomena, it is in its nature subjective. Although inter-rater reliability appears high for experienced observers (15, 16), results are inconsistent (17). There will be a high demand for skilled observers if GMA is to be used in large-scale screening. Because of these limitations, several automated, computer-based methods for pattern recognition of infants’ spontaneous movements are under development (18). We have previously published results from a feasibility study of a computer-based video analysis software for quantitative analysis of infants’ general movements (GMs) (19, 20). In a convenience sample of high-risk infants, the software identified infants with FMs with a sensitivity and specificity of 81.5% and 85%, respectively. On the basis of these findings, we wanted to validate the method in a prospectively collected sample of high-risk infants who were routinely enrolled in a high-risk follow-up program at discharge from the neonatal intensive care unit.

The aim of the present study was to explore whether selected computer-based movement variables were associated with the temporal organization of FMs in infants at risk of perinatal brain injury. Furthermore, we wanted to explore a risk stratification model based on thresholding of computer-based movement variables to differentiate between infants with normal vs. absence of or sporadic FMs.

Methods

Participants

A total of 165 infants born between 1 October 2008 and 31 December 2012 and designated to one of the four participating Norwegian hospitals’ follow-up clinics at discharge from the neonatal intensive care unit were included. All infants were born and had their initial treatment in one of the three tertiary centers enrolling patients in the study. Infants who did not have their follow-up in a participating center, or infants with peripheral neuromotor diseases, e.g., brachial plexus injury, or iatrogenic restricted movements (e.g., cast for congenital talipes equinovarus), were not included in the study. Twelve infants were excluded because of errors in the standardized camera setup (n=10) or crying during recording (n=2). Three infants (all with GA<28 weeks) were classified with exaggerated FMs. They were excluded from further analysis because their FMs were not classified according to their temporal organization. Table 1 shows the reason for referral to follow-up among the remaining 150 infants. Although one infant could have several risk factors for adverse development, all infants were classified into one risk group according to their main reason for referral. Extremely preterm infants (GA<28 weeks and/or BW≤1,000 g) were classified as such irrespective of other risk factors. Among the eight infants exposed to perinatal asphyxia who were not treated with therapeutic hypothermia (TH), three had a GA below 36 weeks and were not considered eligible for TH. The remaining five infants were term infants, of whom one had an assumed prenatal insult, three had hypoxic–ischemic encephalopathy (HIE) stages I–II, and were not considered to fulfill the neurological criteria for TH, and one infant had HIE stage II, which was not recognized until after 6 h of life.

Table 1 Demographic variables and risk groups

Video Recording

Parents were invited to bring their infant for video recording once or twice (mainly depending on travel distance) between 10 and 15 weeks post term age. All recordings were performed during active wakefulness using a standardized setup with a commercially available digital video camera (Sanyo VPC-HD2000, SANYO Electric Co., Ltd., Osaka, Japan) on a tripod with a standardized distance and angle to the mattress where the infant was placed (18, 20). All infants were dressed in a white “onesie”. Gestalt assessment of GMs and the computer-based analysis were based on video recordings performed according to the GMA methodology of Prechtl and co-workers (21). Thirty-eight (13.6%) of 279 recordings, all from infants with two recordings, were excluded because of technical problems (n=21), crying (n=9), displacement of the infant on the mattress (n=6), or unknown factors (n=2). Ninety-one (61%) of the included infants had two recordings, and fifty-nine (39%) infants had one recording available for analysis. A total of 241 recordings from 150 infants were analyzed with both GMA and computer-based video analysis. The median post-menstrual age at recording was 52.0 (range 49–57) weeks. The median length of recording was 300 s (range 50–343).

Observation of GMs

Two certified observers, who had successfully completed the GMs Trust training courses, classified the GMs. Both observers were blinded to the medical history of the infants and observed the videos independently. In case of disagreement, the observers re-assessed the video together and reached consensus. FMs are small movements of moderate speed and variable acceleration, of neck, trunk, and limbs, in all directions (21), and their temporal organization was classified according to how frequently they occurred and the duration of interspersed pauses. FMs were defined as normal if present intermittently or continually. Continual FMs were interspersed with only short pauses, whereas intermittent FMs were interspersed with longer pauses, giving the impression that FMs were present for only half the recording time. Sporadic FMs were interspersed with even longer pauses. Exaggerated FMs were defined as presence of FMs, but excessive in amplitude and speed (21).

Computer-Based Video Analysis of GMs

The video analysis software has been described in detail previously (19, 20). The video contains 25 frames per second with a resolution of 1,280 × 720 pixels. By subtracting subsequent frames in the video stream (frame differencing), the pixels changing between frames are identified and used to create a “motion image”. A motion image thus represents the motion happening between two consecutive frames in the video stream (22), and can further be used to extract quantitative data features. A motion image built from a matrix containing only zeros indicates that no movement occurs between the frames, whereas a motion image from a matrix with positive values represents movement. All videos in the present study were cropped to remove any non-infant movement (e.g., parents interfering with the infant), and so that only a window containing the mattress with the infant was available for analysis.

The quantitative variables were based on variables that have been shown previously to differ between infants with presence and absence of FMs (20). Quantity of motion (Q) is calculated for each frame as the sum of all pixel values (active pixels indicating movement) in the motion image divided by the total number of pixels, to create a resolution-independent feature. The centroid of motion (C) is the spatial center of the pixels with positive values in the motion image. This variable reflects the center point of the overall movement of the infant, and it will continuously change position during a video sequence. The SD of the centroid of motion (CSD) was calculated as the vector magnitude of the x (horizontal) and y (vertical) directions of the spatial center of motion. Evenly distributed movements in all body parts in all directions will be the source of low variability of the CSD, regardless of large or small movement amplitudes. Infant movements characterized by unsteady limb movements (i.e., uneven upper and lower limb activity with respect to laterality) will typically have higher CSD values.

Statistical Analysis

Data were analyzed using SPSS Statistics version 21.0 (IBM SPSS Statistics, Chicago, IL). To correct for variable body size between infants, all computer-based movement variables were normalized for the trunk area, which was calculated by multiplying the trunk length with the trunk width measured from the video image (cm2).

The computer-based movement variables CSD and Q were compared between the groups of FMs’ temporal organization using a linear mixed model. A subject-specific, random intercept was included to account for a possible within-subject correlation for the individuals with two repeated measurements. Temporal organization of FM was included as a categorical covariate, and gestational age as a continuous covariate.

To study the predictive abilities of the CSD for the binary outcome of normal vs. absence of or sporadic FMs, a generalized linear mixed model was applied using CSD as a covariate. The area under the curve was used to assess the discrimination between normal and absence of or sporadic FMs based on the generalized linear mixed model. A triage model comprising the three groups “refer to early intervention”, “refer to GMA”, and “reassure” was selected based on combinations of sensitivities and specificities.

Results were considered statistically significant for P values less than 0.05.

Ethical Approval

The study was approved by the regional committee for medical and health research ethics (REC Central-Committee 4.2007.2327). Written parental consent was obtained before inclusion.

Results

Table 2 shows the temporal organization of FMs according to clinical risk group for all recordings. The highest proportion of absence of and sporadic FMs was found in recordings from infants diagnosed with neonatal arterial ischemic stroke and those with perinatal asphyxia not treated with TH.

Table 2 Temporal organization of FMs according to risk group for all recordings

The CSD was significantly lower in infants with normal (0.320; 95% confidence interval (CI) 0.309, 0.330) vs. those with absence of or sporadic (0.380; 95% CI 0.361, 0.398) FMs (P<0.001). There was a significant increase in CSD from continual to intermittent to absence of or sporadic FMs (Figure 1a). No significant overall difference in Q was found between different categories of temporal organization of FMs (Figure 1b). Adjusting for gestational age did not affect any of the results.

Figure 1
figure 1

Computer-based movement variables and temporal organization of fidgety movements. (a) Centroid of motion (CSD) according to temporal organization of fidgety movements (FMs) in all recordings. Statistically significant difference between absence of and sporadic FMs vs. intermittent and continual FMs with P<0.001. (b) Quantity of motion (Q) according to temporal organization of FMs in all recordings. No significant differences between groups. CI, confidence interval.

The area under curve for the CSD was 0.73 (95% CI 0.658, 0.800) for normal (intermittent or continual) vs. absence of FMs or sporadic FMs using CSD as the only covariate (generalized linear mixed model). Different triage models were explored. Choosing a model with sensitivity and specificity of 80% resulted in a 27% referral rate to GMA. With this model, there was a false-negative rate for the computer-based video analysis (a CSD value below the lower threshold despite absence of or sporadic FMs assessed with GMA) of 20%. Increasing the sensitivity to 90% resulted in a referral rate to GMA of 40% and a false-negative rate of 10% (Table 3). With both models, 50% of recordings above the upper threshold (refer to early intervention) were assessed with absence of or sporadic FMs using GMA.

Table 3 Triage threshold analysis of variability of CSD

Discussion

This study confirms that computer-based video analysis applied on standardized recordings of GMs in young infants can provide objective information about the presence or absence of normal FMs. A larger variability of the spatial center of movements throughout a video was strongly associated with the absence of normal FMs, and thereby is a high risk for later CP. The lowest variation of the spatial center of movement was seen in infants with continual presence of FMs. This automated movement analysis can easily be applied for screening of high-risk infants to select those who need referral to an observational GMA.

Eligible infants in the present study were those who are routinely enrolled in follow-up clinics because of increased risk of adverse neurodevelopment. Criteria for such follow-up will differ between hospitals, but commonly comprise extremely preterm infants as well as more mature infants with perinatal morbidities known to increase the risk of perinatal brain injury. Associations between Gestalt perception and computer-based analysis were consistent across the different risk groups and were not affected by gestational age. Consequently, the included sample mirrors the diversity seen in the group of infants who are at risk of perinatal brain injury, and the results are not restricted to infants with specific underlying brain pathology.

Increased frequency of FMs in a video recording, from absence or sporadic to intermittent to continual, was strongly associated with a decrease in the variability of the spatial center of motion. This association between the temporal organization of FMs and the CSD has also been found in healthy, term-born infants (23). An explanation may be that FMs, which involve the whole body in an evenly distributed way, lead to a more stable center of motion. During periods with no FMs in a video, the variability of the center of motion will be the result of more unevenly distributed limb and body movements, such as isolated limb movements, asymmetric posture, and potentially stiff, jerky, or monotonous movements. The quantity of motion (Q) did not increase significantly with increasing presence of FMs in the video. This variable most likely measures a combination of FMs and concurrent movements, and will not differentiate between healthy and abnormal concurrent movements.

Several methods for automated movement analysis have shown discriminative power with regard to later CP or other developmental delays in small groups of high-risk infants (24, 25, 26). Using magnetic sensors, Philippi et al. (26) demonstrated that stereotypic arm movements predicted CP with high accuracy in a group of 49 infants. Kanemaru et al. (25), using reflective tape as limb sensors, found that children born preterm who exhibited developmental delay at 3 years of age had less active spontaneous movements at term-equivalent age. The video analysis in the present study is based on a sensor-less system to increase clinical feasibility. The movement variables extracted from the videos are not limited to one or more limbs, but reflect a more holistic measure of movement patterns. We suggest that this is more in accordance with the Gestalt perception used in GMA (27). However, adding features like frequency of limb movements (separately or in combination) may add to the accuracy of our method and needs to be further explored using more advanced computer vision methods (28). It also remains to be systematically tested as to how changes in camera distance or angle, or different infant clothing, influence the robustness of the computer-based method. This is particularly important if a future goal is to facilitate the implementation of computer-based methods in smart phone applications developed for remote GMA.

The purpose of early identification of CP risk is to refer infants to high-intensity intervention, highlighting the importance of high sensitivity of the test. However, avoidance of unnecessary follow-up and reassurance to parents of a healthy motor development will also have high priority for those working with children at risk. In the present study, a sensitivity of 80% had a corresponding specificity of 53%, meaning that a rather large proportion of infants with normal FMs would be regarded as potentially abnormal based on the automated video analysis. A triage model, where those between upper and lower threshold are referred for an observational GMA, is suggested to provide more certainty to parents and reduce unnecessary worries. The accuracy of early identification may also be improved by combining computer-based analysis with other assessments like neurological examination or magnetic resonance imaging, depending on the clinical setting and availability of resources.

Both observers who classified GMA in the present study have completed two advanced GMA courses each and have more than 15 years’ experience using GMA in clinical practice and research. Predictive accuracy of GMA during the FMs’ period for later CP has been reported to be very high (13, 27), and inter-rater reliability appears to be high (15, 16). A recent report from Australia showed very high sensitivity and specificity for abnormal development at 1 year when GMA was assessed by several observers in a structured GMA network (13). That study, however, used only three categories of FMs, i.e., absent, present, or abnormal (exaggerated). To our knowledge, inter-rater reliability of temporal organization of FMs using all available categories has not been published. The definition of sporadic FMs as lasting 1–3 s with long interspersed pauses (up to 1 min) came recently (29), and the significance of sporadic FMs for later CP development is not entirely clear. Quantitative movement analysis of infants during the FMs’ period comprising all categories of temporal organization may contribute to increased knowledge and understanding of FMs as a phenomenon.

Interpretation

This study confirms that automated movement analysis captures movement features relevant for the classification of FMs in high-risk infants. A low variability of the spatial center of motion throughout a video was strongly associated with normal FMs. A triage model based on automated movement analysis reduced the need for observational GMA, and half of the infants with absence of or sporadic FMs could be referred to early intervention based on automated movement analysis alone. Further studies including long-term outcomes are needed to explore the use of automated movement analysis as an adjunct to observational GMA for prediction of CP in infants at risk of perinatal brain injury.