Introduction

Accurately interpreting facial and vocal expressions is a key ability for effectively navigating the social world. Adult aging is often associated with difficulties in emotion recognition1,2, which may have a negative impact on social functioning, health, and psychological well-being3. Most aging studies have investigated emotion recognition by presenting facial expressions (often static pictures) or vocal expressions in isolation2, whereas in real social situations emotions are expressed by a combination of dynamic facial, vocal and bodily expressions4. In the current study, we therefore used dynamic facial and vocal stimuli to investigate effects of adult aging on recognition of a wide range of both positive and negative emotions.

Aging and emotion recognition

Facial expressions

The majority of previous aging studies have focused on recognition of facial expressions of anger, sadness, fear, disgust, and happiness, and to a lesser extent, surprise. For example, studies report that older adults are less accurate in recognizing anger and sadness from faces5,6, and to some extent, fear7,8,9. In contrast, older adults sometimes perform similarly to younger adults in facial disgust recognition10 and sometimes even outperform their younger counterparts5. Some studies also report age-related stability in recognition of happiness9, although findings for happiness may be difficult to interpret because of a ceiling effect caused by the often very high recognition rates for happiness2,7. Meta-analyses of this literature suggest that older adults show the largest emotion recognition difficulties for anger, fear, and sadness, less difficulty for happiness, and no age-related differences for disgust2,11.

A recent meta-analysis1 further showed that task characteristics can have a large impact on aging effects for individual emotions. This was the case for disgust recognition, for which the facial stimulus dataset emerged as a potential moderator. Specifically, older adults showed robust difficulties in recognizing disgust for most of the datasets included in the meta-analysis, with the Pictures of Facial Affect (POFA12) dataset being the only exception. The authors therefore concluded that the direction of age effects for recognition of disgust was systematically related to the stimuli used, and that studies using the POFA dataset may have largely contributed to the overall null effect observed for disgust1. Another important task characteristic was whether studies used static pictures of facial expressions or dynamic stimuli1. Age differences were more uniform across emotions for videos than for static pictures, which may suggest that the relative sparing of happiness recognition compared to anger, fear, and sadness is in part a function of task design. Dynamic facial stimuli provide more contextual cues and are richer in information than static images13,14, and this may contribute to improved emotion recognition13. Indeed, studies have reported fewer or no age-related differences in emotion recognition when dynamic stimuli are used15,16.

Vocal expressions

Compared to facial expressions, there are fewer studies on age effects in vocal expression recognition2,17. Similar to facial expressions, vocal expression studies report less accurate recognition of anger and sadness for older compared to younger adults5,18,19,20. Some studies have also reported age-related difficulties for recognition of fear18,20,21, disgust18, and sometimes also happiness5,21.

Aging vocal expression studies have mainly investigated emotionally inflected speech, but non-linguistic vocalizations may be a more effective means of vocal expression than emotional speech22. Non-linguistic vocalizations consist of various human sounds such as laughs, cries, screams, sighs, groans, and gasps, and such sounds can convey a particularly wide variety of emotions23,24,25. Few aging studies have investigated the recognition of emotions from vocalizations, and results have been mixed. For example, Lima et al.25 investigated anger, amusement, disgust, fear, pleasure, relief, sadness, and triumph vocalizations. In contrast to some previous studies that have reported age impairments for recognition of negative vocalizations and anger in particular21,26, Lima et al.25 instead reported age-related differences for all emotions regardless of valence. These results suggest that age impairments can also emerge for positive emotions (i.e., amusement, pleasure, relief, triumph) when more than one positive emotion is included25.

Multimodal expressions

Studies of age-related differences in emotion recognition usually present stimuli unimodally (i.e., only faces or only voices), which reduces ecological validity and may not reflect how people perceive emotions in daily life13. Studies using multimodal stimulus presentation (i.e., a combination of facial and vocal expressions) have reported that older adults achieve lower accuracy for recognition of negative emotions compared to younger adults17,18. However, studies have also reported that the magnitude of age-related differences decreases considerably when multimodal information is available16,26. When the three modalities (visual, auditory, and multimodal) are compared, both younger and older adults may benefit from the multimodal condition and perform worst in the unimodal auditory condition18.

Mechanisms underlying age-related differences in emotion recognition

Potential mechanisms explaining age-related declines have been proposed at the perceptual, cognitive, biological, and social levels. One influential motivational theory of aging is socioemotional selectivity theory (SST). According to SST, constraints in time horizon (i.e., mortality) may become more prominent with advancing age and lead to changes in motivation, influencing goal selection and goal pursuit27,28. For example, older adults may shift their focus to prioritize close interpersonal relationships and to foster emotional well-being and emotion regulation, and may thereby reduce exposure to negative affect28. The term positivity effect refers to a relative preference in older adults for positive over negative information (for reviews see29,30). An alternative explanation of the positivity effect is given by dynamic integration theory, which states that processing negative information imposes greater cognitive demands, leading older adults to automatically process positive information instead31. Thus, by avoiding negative information early on, older adults are able to preserve their cognitive processing and still achieve affect optimization. In SST, by contrast, the attentional shift is considered to be more conscious and voluntary, involving top-down processes32.

Another possible explanation concerns the neural basis of emotion processing in younger and older adults. Brain regions such as the frontal and temporal lobes are associated with visual and auditory emotional processing and undergo substantial age-related declines2,33. For example, some studies have suggested reduced amygdala activation in older compared to younger adults when viewing negative but not positive emotional pictures34,35. Age-related changes in specific brain regions may therefore explain why some emotions are more affected by aging than others2,33.

Losses in cognitive and sensory functions have also been discussed as possible explanations for age-related differences in emotion recognition. Cognitive functions (e.g., processing speed and working memory) as well as visual and auditory perception are involved in emotion recognition, and a decline in these abilities may be observed with increasing age36. However, studies suggest that normal age-related hearing or vision loss does not account for age-related differences in the recognition of facial and vocal expressions17,21,25,37.

Reports of older adults’ lower performance in recognition of negative emotions (mainly anger and sadness), but not positive emotions, are often interpreted as support for the positivity effect30. However, because most previous research has included only one positive emotion (happiness; but see25), the question of whether age-related differences are specific to happiness, or whether they extend to other positive emotions, remains open. In addition, when only one positive emotion is included in an emotion recognition task, correct recognition of that emotion can be achieved based on valence-specific information alone, without the need to use emotion-specific information. A better understanding of the pattern of age-related differences across different emotions could provide additional clues about the mechanisms behind the effects of aging on emotion recognition.

The present study

Prior aging studies have generally assessed recognition of only a few emotions (i.e., anger, disgust, fear, happiness, sadness, surprise; see1), which contrasts with recent studies (on young populations) suggesting that a wide range of positive as well as negative emotions can be communicated14,23. The current study expands upon previous research by examining how aging affects recognition of a larger than usual number of emotions, including several positive emotions other than happiness. We also used dynamic facial and vocal stimuli, which better reflect how emotions are perceived in daily life13. We used two tasks designed to provide a reasonably full picture of the participants’ emotion recognition abilities. In Task 1, emotions were expressed through facial and bodily expressions and emotionally inflected speech, and stimuli contained both unimodal (video only, audio only) and multimodal (audio-visual) items. In Task 2, emotions were instead expressed through non-linguistic vocalizations, which are considered especially suited for the expression of positive emotions24 and have rarely been included in aging research. Task 1 contained 12 emotions, and Task 2 contained 18 emotions.

Based on the literature reviewed above, we expected overall age-related decreases in emotion recognition2 for both tasks. For Task 1, we expected age effects to be greatest for the auditory-only condition, followed by the visual-only condition, and with the smallest group differences in the multimodal condition18,26. Furthermore, for both tasks, we expected older adults to show a positivity effect with relatively better recognition of positive compared to negative emotions28,31.

Results

Unimodal and multimodal emotion recognition (Task 1)

Confusion patterns

In order to gain a complete picture of the participants’ emotion judgments, a confusion matrix is shown in Table 1. It shows the intended emotions in the columns and the percentage of judgments in the rows, separately for each emotion (across the visual, auditory, and multimodal presentation blocks) and age group. The main diagonal denotes the percentage of correct recognitions, and all other cells represent the percentage of incorrect answers by emotion category. Both age groups recognized the intended emotions with accuracy above chance level (the chance level in a 12-alternative forced-choice task is 1 out of 12 = 8.33%), as indicated by 95% confidence intervals. Confusions were most common between conceptually similar emotions and were in general similar for both age groups. The most common confusions for both young and older adults included despair mistaken for sadness, anxiety mistaken for fear, fear for despair, sadness for fear, and pride for happiness. There were also some notable differences between young and older adults with regard to confusions. Older, but not young, adults tended to mistake irritation for interest, sadness for relief, and interest for relief.
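
As an illustration of the above-chance comparison, the sketch below contrasts a 95% confidence interval for an observed proportion of correct judgments with the 12-alternative guessing rate of 8.33%. The counts and the Wilson interval method are assumptions for illustration only; the paper does not state which interval method was used.

```python
from statsmodels.stats.proportion import proportion_confint

chance = 1 / 12                 # 12-alternative forced choice = 8.33%
correct, total = 30, 72         # hypothetical number of correct judgments
low, high = proportion_confint(correct, total, alpha=0.05, method="wilson")
print(f"95% CI: [{low:.3f}, {high:.3f}]; above chance: {low > chance}")
```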

Table 1 Confusion matrix showing the proportion of judgments in Task 1 (the ERAM test) for each intended emotion and for both age groups.

Emotion recognition accuracy

We used unbiased hit rate (Hu)38 as the dependent variable in our analyses of age-related differences in emotion recognition. This measure accounts for the possibility that systematic biases in participants’ responses could artificially inflate their response accuracy. Hu gives a measure of perceptual sensitivity and is defined as the “joint probability both that a stimulus is correctly identified (given that it is presented) and that a response is correctly used (given that it is used)” (38, p. 16). Hu is calculated as the hit rate multiplied by one minus the false alarm rate (i.e., the proportion of incorrect uses of that emotion category as a response), and Hu values range from 0 to 1. A score of 1 would indicate that all stimuli of an emotion were correctly classified and that the respective emotion was never misclassified as another emotion. It is not possible to calculate Hu values for a single participant’s judgment of a single stimulus. We therefore calculated Hu values for each participant and (a) each presentation modality (visual, auditory, and multimodal) across all emotions, (b) each emotion (across all presentation modalities), and (c) both positive and negative valence (across all presentation modalities). We did not calculate Hu values for individual emotions separately for each presentation modality due to the small number of items per emotion and modality.
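
For concreteness, the following is a minimal sketch (not the authors’ analysis code) of how Hu could be computed from a single participant’s confusion matrix of counts, assuming intended emotions in rows and chosen responses in columns.

```python
import numpy as np

def unbiased_hit_rates(confusion):
    """Wagner's (1993) unbiased hit rate (Hu) per emotion category.

    Hu = (hits / stimuli presented) * (hits / times the response was used),
    i.e., the hit rate multiplied by one minus the false alarm rate.
    """
    confusion = np.asarray(confusion, dtype=float)
    hits = np.diag(confusion)            # correct classifications
    presented = confusion.sum(axis=1)    # stimuli per intended emotion
    used = confusion.sum(axis=0)         # times each response was chosen
    with np.errstate(divide="ignore", invalid="ignore"):
        return np.where(used > 0, (hits / presented) * (hits / used), 0.0)

# Toy example with three emotions and six stimuli per emotion
toy = [[5, 1, 0],
       [2, 3, 1],
       [0, 1, 5]]
print(unbiased_hit_rates(toy))  # approx. [0.595, 0.300, 0.694]
```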

Three separate mixed analyses of variance (ANOVAs) were conducted. The first ANOVA examined effects of age (young, old) and presentation modality (visual, auditory, multimodal) on emotion recognition (Hu) across all emotions in a 2 × 3 design. The second ANOVA instead examined effects of age (young, old) and emotion (anger, anxiety, despair, disgust, fear, happiness, interest, irritation, pleasure, pride, relief, sadness) on emotion recognition (Hu) across all presentation modalities in a 2 × 12 design. The third ANOVA examined effects of age (young, old) and valence (positive emotions, negative emotions) on emotion recognition (Hu) in a 2 × 2 design. Age was always entered as a between-subjects factor, while presentation modality, emotion and valence were entered as within-subject factors. Post hoc multiple comparisons (t tests) and effect sizes (Hedges’s g) are presented for all pairwise comparisons between young and old adults in Supplementary Table S1.
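
As a rough illustration of this analysis setup, the sketch below runs the 2 (age) × 3 (modality) mixed ANOVA on Hu scores and Bonferroni-adjusted pairwise follow-ups with Hedges’s g, here using the pingouin package on simulated data with hypothetical column names; it does not apply the Huynh–Feldt correction reported below.

```python
import numpy as np
import pandas as pd
import pingouin as pg

# Simulated long-format data standing in for the real data set:
# one Hu value per participant and presentation modality.
rng = np.random.default_rng(0)
rows = []
for p in range(131):
    group = "young" if p < 71 else "old"
    for modality in ("visual", "auditory", "multimodal"):
        rows.append({"participant": p, "age_group": group,
                     "modality": modality, "hu": rng.uniform(0, 1)})
df = pd.DataFrame(rows)

# 2 (age) x 3 (modality) mixed ANOVA on unbiased hit rates
aov = pg.mixed_anova(data=df, dv="hu", within="modality",
                     subject="participant", between="age_group")
print(aov)

# Bonferroni-adjusted pairwise comparisons with Hedges's g effect sizes
posthoc = pg.pairwise_tests(data=df, dv="hu", within="modality",
                            subject="participant", between="age_group",
                            padjust="bonf", effsize="hedges")
print(posthoc)
```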

The first ANOVA yielded a significant main effect of age F(1, 129) = 15.07, p < 0.001, ηp2 = 0.11, indicating that younger participants (M = 0.40) overall performed more accurately than older participants (0.33). (Note that identical main effects of age also emerged in the analyses of emotion and valence below, but to avoid redundancy these are not reported). The main effect of presentation modality was also significant, F(2, 258) = 156.23, p < 0.001, ηp2 = 0.55 (Huynh–Feldt corrected). Multiple comparisons (dependent samples t tests) showed that multimodal expressions (0.49) had significantly higher recognition rates compared to visual expressions (0.35), which in turn were better recognized than auditory expressions (0.28, ps < 0.001). These main effects were further qualified by a significant interaction between age and modality, F(2, 258) = 3.96, p = 0.020, ηp2 = 0.03 (Huynh–Feldt corrected). Multiple comparisons using independent t tests (with Bonferroni adjusted alpha levels of 0.017) revealed that older adults (0.22) had significantly lower accuracy than younger adults (0.32) in the auditory condition (p < 0.001) (Fig. 1a). However, group differences were not significant for the multimodal (old = 0.46; young = 0.51; p = 0.05) or visual condition (old = 0.32; young = 0.37; p = 0.03).

Figure 1
figure 1

Emotion recognition (unbiased hit rates) in Task 1 (the ERAM test) as a function of age and (a) presentation modality and (b) emotion. Error bars denote 95% CI. ***p < .001. Ang anger, Irr irritation, Fea fear, Anx anxiety, Des despair, Sad sadness, Dis disgust, Hap happiness, Pri pride, Ple pleasure, Rel relief, Int interest.

The second ANOVA yielded a significant main effect of emotion, F(10.46, 1349.02) = 67.55, p < 0.001, ηp2 = 0.34 (Huynh–Feldt corrected). For example, anger and pleasure were recognized better, and anxiety and sadness worse, than the other emotions (see Supplementary Table S2). The age by emotion interaction was also significant, F(10.46, 1349.02) = 7.43, p < 0.001, ηp2 = 0.05 (Huynh–Feldt corrected), see Fig. 1b. Multiple comparisons (independent samples t tests, Bonferroni adjusted alpha level = 0.004) revealed that older participants had more difficulty than younger participants in correctly recognizing expressions of anger, irritation, and relief (ps < 0.001).

The third ANOVA yielded a significant main effect of valence F(1, 129) = 60.44, p < 0.001, ηp2 = 0.32, indicating that positive expressions (M = 0.42) were recognized more accurately than negative expressions (0.33) across age groups. The age by valence interaction was not significant, F(1, 129) = 0.01, p = 0.94.

Recognition of emotions from non-linguistic vocalizations (Task 2)

Confusion patterns

Tables 2 and 3 show confusion matrices separately for positive and negative emotions. Both age groups recognized all positive and negative emotions with better-than-chance accuracy (the chance level in a 9-alternative forced-choice task is 1 out of 9 = 11.11%), as indicated by the 95% confidence intervals displayed for the percentage accuracy values in the diagonals of Tables 2 and 3. Among positive emotions, common confusions for both age groups included affection mistaken for interest and serenity, amusement mistaken for happiness, happiness for amusement, interest for positive surprise, lust for serenity, pride for interest, and serenity for lust and relief (see Table 2). Older, but not younger, adults also tended to mistake lust for affection, and they mistook pride for positive surprise more frequently than young adults did. In fact, for older adults, correct recognition of pride was less frequent than its most common misclassifications.

Table 2 Confusion matrix showing the proportion of judgments in Task 2 (positive nonlinguistic vocalizations) for each intended emotion and for both age groups.
Table 3 Confusion matrix showing the proportion of judgments in Task 2 (negative nonlinguistic vocalizations) for each intended emotion and for both age groups.

For negative emotions (see Table 3), common confusions for young and older adults included contempt mistaken for negative surprise, distress mistaken for fear, fear for distress, guilt for negative surprise and shame, and shame for distress and guilt. Older adults showed more frequent misclassifications of sadness as fear compared to young adults.

Emotion recognition accuracy

We performed two separate mixed ANOVAs, using Hu scores for positive and negative vocalizations, respectively, to investigate effects of age on emotion recognition from vocalizations. Age was entered as a between-subjects factor, and emotion (positive: affection, amusement, happiness, interest, sexual lust, pride, positive surprise, relief, and serenity; negative: anger, contempt, disgust, distress, fear, guilt, negative surprise, sadness, and shame) was entered as a within-subject factor in the respective analysis. Note that two older participants did not complete this task, so the analyses included 58 older participants.

For positive vocalizations we observed a significant main effect of age, F(1, 127) = 106.34, p < 0.001, ηp2 = 0.46, indicating an advantage for younger (M = 0.32) compared to older (0.19) participants. There was also a main effect of emotion, F(7.18, 911.98) = 90.66, p < 0.001, ηp2 = 0.42 (Huynh–Feldt corrected). For example, relief and lust were recognized better than all other positive emotions, whereas affection, amusement, and pride had lower Hu scores than all other emotions (see Supplementary Table S3 for more details). These main effects were further qualified by a significant interaction between age and emotion, F(7.18, 911.98) = 16.80, p < 0.001, ηp2 = 0.12 (Huynh–Feldt corrected). Post hoc multiple comparisons (independent t tests, Bonferroni adjusted alpha = 0.0055) indicated worse recognition of interest, lust, pride, positive surprise, relief, and serenity vocalizations with advancing age (ps < 0.001) (Fig. 2a).

Figure 2
figure 2

Emotion recognition accuracy (unbiased hit rates) in Task 2 (nonlinguistic vocalizations) as a function of age and emotion for (a) positive emotions and (b) negative emotions. Error bars denote 95% CI. ***p ≤ .002. Positive emotions: Aff affection, Amu amusement, Hap happiness, Int interest, Lus lust, Pri pride, Psur positive surprise, Rel relief, Ser serenity. Negative emotions: Ang anger, Con contempt, Disg disgust, Dist distress, Fea fear, Gui guilt, Nsur negative surprise, Sad sadness, Sha shame.

For negative vocalizations, there was a main effect of age, F(1, 127) = 84.10, p < 0.001, ηp2 = 0.40, showing that older adults (M = 0.28) had significantly lower overall recognition rates than younger adults (0.42). The main effect of emotion was also significant, F(6.26, 795.24) = 241.51, p < 0.001, ηp2 = 0.66 (Huynh–Feldt corrected). For example, anger, sadness, and disgust had higher, whereas guilt and shame had lower, Hu scores compared to the rest of the emotions (see Supplementary Table S4 for more details). The age by emotion interaction, F(8, 1016) = 9.81, p < 0.001, ηp2 = 0.07 (Huynh–Feldt corrected), showed that older compared to younger participants had lower recognition rates for all negative vocalizations (ps ≤ 0.002), except for shame (p = 0.04) (independent samples t tests, Bonferroni adjusted alpha = 0.0055) (Fig. 2b).

A separate ANOVA investigated effects of valence (positive, negative). A significant main effect of valence showed that negative vocalizations (M = 0.35) were recognized better than positive vocalizations (0.26) across age groups, F(1, 127) = 227.54, p < 0.001, ηp2 = 0.64. The age by valence interaction was not significant, F(1, 127) = 1.26, p = 0.26.

Discussion

The current study evaluated the effects of adult aging on the recognition of a variety of positive and negative emotions using dynamic facial and vocal stimuli. Across two emotion recognition tasks, we found that overall recognition rates were better for younger compared to older adults. Group differences were observed for positive and negative emotions in both tasks, and occurred mainly for vocally expressed emotions. In Task 1, we observed the largest effects for auditory stimuli, and smaller differences for visual and multimodal stimuli, which is in line with previous research18,26. Across all emotion categories, age-related differences were only significant in the auditory condition. When looking at individual emotions (across presentation modalities), significant age effects were observed for anger, irritation, and relief expressions, although group differences were not significant for most of the emotions (i.e., anxiety, despair, disgust, fear, happiness, interest, pleasure, pride, and sadness). In Task 2, younger adults instead performed better than older adults for 6 out of 9 positive emotions (interest, sexual lust, pride, positive surprise, relief, and serenity), and for 8 out of 9 negative emotions (anger, contempt, disgust, distress, fear, guilt, negative surprise, and sadness) expressed through non-linguistic vocalizations. No significant differences were observed for the positive emotions affection, amusement, and happiness, or for the negative emotion shame. Misclassifications occurred most frequently between conceptually similar emotions and confusion patterns were similar for younger and older adults in both tasks. For example, in Task 1 common misclassifications included mistaking despair for sadness, and fear for despair; and in Task 2 amusement and happiness were frequently confused with each other, and guilt was mistaken for shame.

The above pattern of results suggests that advancing age is associated with difficulties in recognizing positive and negative emotions to a similar degree. In addition, the interaction between age and valence (positive, negative) did not have a significant effect on recognition rates in either emotion recognition task. Consequently, our results do not provide support for a positivity effect whereby older adults would show relatively better recognition of positive compared to negative emotions28. We suggest that ceiling effects in recognition of happiness may have contributed to the positivity effect in previous studies7 and that this effect fades when several positive expressions and response options are included. When happiness is the only positive expression, it is possible to infer the correct alternative based on valence-specific rather than emotion-specific information. In addition, our emotion recognition tasks were designed to avoid ceiling effects due to (too) high recognition rates.

Our findings also extend previous research by showing that recognition of both basic (e.g.,39) and non-basic, more complex emotions shows age-related differences. For example, group differences were observed for the basic emotions anger (in both Task 1 and 2) and contempt, disgust, fear, happiness, and sadness (in Task 2), as well as for several emotions and affective states that are not usually considered “basic” (e.g., relief in both Task 1 and 2; and distress, guilt, interest, lust, pride, and serenity in Task 2). Although recognition of basic emotions does not seem to be preferentially spared in adult aging, it could be suggested that basic emotions attract prioritized processing resources due to their survival value and may therefore be recognized faster and more automatically than more complex emotions. Our study was not designed to investigate the speed and automaticity of emotion recognition, but future aging studies could conduct such comparisons between basic and more complex emotions (see40), and also between different presentation modalities.

If general cognitive decline were the main driver of emotion recognition deficits, we would expect age-related differences to be more or less similar for all emotions. This could be the case for Task 2, where older adults showed lower recognition rates for most of the included emotion categories. We note that no age differences emerged for affection, amusement, happiness, and shame in Task 2, but as these emotions were hard to recognize for both age groups, the lack of group differences may have been due to a floor effect. However, the results from Task 1 instead showed group differences only for a few emotions (anger, irritation, and relief). The ERAM test employed in Task 1 is, however, not ideal for investigating differences between specific emotions because it has a small number of items per emotion, which makes it difficult to generalize patterns (other than the finding that both positive and negative emotions showed differences). Future studies could develop tasks that allow for the investigation of how aging may affect recognition of specific emotions in different ways depending on whether the emotion is conveyed through dynamic visual, auditory, or multimodal expressions.

It is also difficult to interpret our findings in relation to the suggestion that age-related changes in specific brain regions lead to impaired recognition of specific emotions2. Apart from the fact that recognition rates for anger and relief were lower for older vs. younger participants in both Task 1 and Task 2, age-related differences occurred for different emotions in the two tasks. It could be speculated that age-related changes in, for example, the orbitofrontal cortex might be involved in group differences in the processing of visual and auditory expressions of anger2,5. However, investigating explanations at the neural level would require different methodology than what was employed in the current study.

A recent meta-analysis1 reported an overall age effect of Hedges’s g = 0.40 for recognition of facial expressions. The corresponding effect size in our study (from the visual-only condition in Task 1) was similar in magnitude, g = 0.38 (see Supplementary Table S1). It should be noted that the estimate in this meta-analysis1 was based mainly on studies of static pictures and few emotions, whereas our study contained dynamic expressions of relatively many emotions and also included some bodily and gestural cues to emotion. Nevertheless, our results are in line with the previous literature and suggest a moderate effect of age on facial expression recognition.

Because our study included more emotions than most previous studies, the tasks also contained more response options than most previous studies. It has been proposed that tasks involving larger numbers of response options place larger demands on working memory, possibly leading to larger age-related differences in recognition accuracy36. We speculate that the similarity in effect sizes (despite the larger number of response options) could be explained by increased motivation among the elderly due to dynamic stimuli being perceived as less artificial and closer to real-life interactions than static stimuli. If so, this increased motivation could potentially have compensated for lower working memory performance in older adults41,42, but this hypothesis needs to be tested in future studies.

We observed large effects of adult aging on auditory emotion recognition in both Task 1 and 2. The auditory modality may involve more complex processes and require rapid attentional processing and processing speed, placing larger demands on perceptual and cognitive skills and resulting in a disadvantage for older adults1. Alternatively, the results may suggest that age declines in emotion recognition are domain specific43, happening first in the auditory modality and affecting other modalities to a lesser extent. Although previous studies suggest that age-related differences in vocal emotion recognition are not primarily caused by hearing loss17,37, we cannot rule out the possibility that individual differences in hearing abilities may have moderated the processing of vocal expressions in the current study. Speaking against this explanation is the fact that aging effects varied across emotions although the sound level was similar (normalized) for all stimuli. Nevertheless, audiometric test procedures could be included in future studies. Future studies could also include more detailed assessment of cognitive functioning, as it may influence emotion recognition accuracy11.

Another possible limitation is that the order of the emotion recognition tasks was not counterbalanced. Because recognition of non-linguistic vocalizations in Task 2 was always assessed after the first task had been completed, we cannot rule out that fatigue contributed to the larger age-related differences in Task 2 vs. Task 1. However, the facts that aging effects were larger for auditory stimuli also in Task 1, and that Task 1 was brief and only took 15–20 min to complete, speak against fatigue playing a major role.

The stimuli used in both emotion recognition tasks were language free and thus suitable for use in many cultural settings. However, while all participants were living in Sweden, the stimuli were originally produced by actors from other countries. Although the recognition rates show that the intended emotions were generally well recognized, the stimuli could nevertheless contain some culture-specific cues to emotion44. The stimuli in both tasks were further produced using a natural acting procedure intended to yield expressions that are as authentic as possible4,45. However, it remains possible that acted emotion expressions differ in some important respects from spontaneous expressions46. We would thus welcome future aging studies that investigate cross-cultural aspects and possible effects of perceived authenticity of stimuli.

Furthermore, we suggest that future studies could investigate individual facial cues to emotion (e.g., in terms of facial action unit activations) together with vocal cues (e.g., pitch, loudness, speech rate) to get a more complete picture of why older adults show emotion recognition difficulties. Finally, to understand fully when difficulties in emotion recognition appear, cross-sectional studies across the life-span and longitudinal studies need to be considered. Brain imaging studies that include a wide range of emotions and multimodal stimuli could further contribute to this aim.

Methods

Participants

Seventy-one younger (Mage = 23.42 years, SD = 2.83; range = 18–30 years; 36 female) and 60 older (Mage = 69.22 years, SD = 3.16; range = 58–75 years; 40 female) adults participated in this study. Exclusion criteria were poor Swedish language proficiency, presence of a psychiatric, neurological, or neurodegenerative disorder, use of psychiatric medication, substance abuse, and poor vision or hearing. Younger individuals were recruited from university bulletin boards or designated websites, and older individuals were recruited from a research database and from the community. The number of participants was determined so that we would achieve power of 0.80 with an alpha of 0.05 to detect a medium-sized (Cohen’s d = 0.50) difference between age groups using independent t tests.
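
For reference, the stated power analysis can be reproduced approximately with the sketch below, assuming a two-sided independent-samples t test (the sidedness of the test is not stated in the text).

```python
from statsmodels.stats.power import TTestIndPower

# Participants per group needed to detect Cohen's d = 0.50
# with 80% power at alpha = .05 (two-sided independent t test).
n_per_group = TTestIndPower().solve_power(effect_size=0.50, power=0.80,
                                          alpha=0.05, alternative="two-sided")
print(round(n_per_group))  # roughly 64 per group
```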

Both young and old participants completed the Mini-Mental State Examination (MMSE47) before taking part in the emotion recognition tasks. A cutoff score of 26 (adapted for Swedish populations48) was used to indicate potential cognitive impairment. All participants scored higher than this cutoff, and no participants were excluded based on this criterion. MMSE was not correlated with any of the main dependent variables for older participants (see Supplementary Table S5), and there were no significant differences in MMSE scores between younger (M = 28.99, SD = 1.37) and older participants (M = 29.23, SD = 0.95), t(128) = 1.19, p = 0.24. However, there was a significant difference in years of education between older (M = 15.88, SD = 4.15) and younger participants (M = 11.11, SD = 5.67), t(127) = − 5.34, p < 0.001. Correlations between years of education and the dependent variables were generally low and are shown in Supplementary Table S6. Written informed consent was obtained prior to participation. The study was approved by the Stockholm area Regional Ethical Board and adhered to the Declaration of Helsinki. Each participant received cinema tickets as compensation for their participation.

Procedure

We used two emotion recognition tasks (described below) that contained dynamic multimodal emotion expressions (Task 1) and non-linguistic vocalizations (Task 2). Instructions were first given verbally for each task and the participants also read the same instructions on the computer screen prior to beginning the tasks. There was no time constraint for any of the tasks. For both tasks, participants made their choices by clicking the respective response alternative on the computer screen using a mouse. Task 1 was completed first, followed by Task 2. Participants also took part in a separate social cognitive assessment (reported elsewhere49) after they had completed the two emotion recognition tasks. An overview of both emotion recognition tasks is provided in Supplementary Figure 1.

Emotion recognition tests

Task 1: Multimodal emotion recognition test (ERAM)

Individual ability to perceive emotions was assessed with the Emotion Recognition Assessment in multiple Modalities (ERAM) test50. The ERAM test includes stimuli from the Geneva Multimodal Emotion Portrayal corpus4, a database of 10 actors (5 men, 5 women) depicting emotions while speaking a sentence in a pseudo-language (e.g., “nekal ibam soud molen!”). The pseudo-sentences were selected to avoid possible confounding effects of language. The ERAM test consists of dynamic video clips that provide facial, postural, gestural, and vocal cues, and contains 72 unique items conveying 12 emotions (anger, anxiety, despair, disgust, fear, happiness, interest, irritation, pleasure, pride, relief, and sadness). Items were presented in 3 blocks in a fixed order: first 24 video-only items, followed by 24 audio-only items, and finally 24 multimodal (audio and video) items. Within each block, items were presented in a pseudo-random fixed order. Each emotion appeared twice in each block, and there were 3 male and 3 female items for each emotion.

After the presentation of each item, participants were asked to choose which of the response alternatives best described the emotion that the actor had portrayed, in a 12-alternative forced-choice paradigm. The response alternatives were the same as the 12 intended emotions and appeared on the screen after each item was presented. Each item was presented only once and could not be replayed. The items were no longer visible (or audible) when the response alternatives were shown on the screen. Participants first completed a brief training session containing 3 items that were not used in the test set in order to familiarize themselves with the procedure. Testing was conducted individually using Authorware software (Adobe Systems Inc., San Jose, CA) to present stimuli and collect responses. Video stimuli were presented on 22″ LED computer screens and auditory stimuli were presented through headphones (AKG K619, AKG Acoustics GmbH, Vienna, Austria). The sound level of the items was normalized separately for each actor4 and the headphone volume setting was kept the same for all participants. The duration of the items ranged from 1 to 5 seconds and the total administration time was 15–20 minutes.

Correlations between the recognition rates (Hu) for each emotion category are shown in Supplementary Table S7. Recognition rates for most emotions were positively correlated regardless of their valence, with happiness being the exception. Accuracy for happiness expressions was only significantly correlated with accuracy for other positive expressions (i.e., interest, pleasure, and relief).

Task 2: Non-linguistic vocalization test (VENEC)

Participants’ ability to perceive emotional vocalizations was assessed using vocalizations from the Vocal Expressions of Nineteen Emotions across Cultures (VENEC) corpus, which is a cross-cultural database of vocal emotional expressions portrayed by 100 actors45. In the present study, we used non-linguistic vocalizations of 9 positive (affection, amusement, happiness, interest, sexual lust, pride, positive surprise, relief, and serenity) and 9 negative (anger, contempt, disgust, distress, fear, guilt, negative surprise, sadness, and shame) emotions51. Some examples of vocalizations include sighs, breathing sounds, crying, hums, grunts, laughter, and shrieks. Positive and negative emotions were judged in separate experiments containing 108 vocalizations each (12 vocal stimuli per emotion). The order of experiments and the order of items within each experiment was randomized for each participant.

After presentation of an item, participants were requested to select the response alternative that they thought best captured the emotion conveyed by the vocalization, in a 9-alternative forced-choice task. The response alternatives were the same as the 9 intended positive or negative emotions. Testing was conducted individually using MediaLab software52 to present the stimuli and record responses. Sound levels were normalized in order to soften the contrast between stimuli that would otherwise have been disturbingly loud (e.g., screams) or inaudibly quiet (e.g., whispers)51. Participants were allowed to replay the vocalizations as many times as needed in order to make a judgment. Completing both vocalization experiments took about 30 min.

Correlations between the recognition rates (Hu) for all emotions are shown in Supplementary Table S8. Recognition rates for most emotions were positively correlated with each other. Exceptions included recognition rates for affection, amusement, happiness, and shame, which were not highly correlated with recognition rates for the other emotions.