Background & Summary

Faces contain rich information useful for social interaction1. Researchers have widely used facial stimuli to explore cognitive and emotional processing in both healthy individuals and those with disorders2,3,4. Emotional faces are among the most common stimuli in emotion research, and standardized emotional face datasets, such as the Radboud Faces Database5, FACES6, and the American Multiracial Faces Database7, have been created to provide research materials with relatively uniform facial features and good image quality. However, researchers have found significant cross-cultural differences in emotion recognition across racial groups8.

To overcome the influence of culture on facial expression recognition, researchers in China have established localized emotional face databases based on the basic emotion model (comprising happiness, anger, fear, sadness, disgust, and surprise). The most widely used database is the Chinese Affective Face Picture System9, which was collected from recruited actors posing different emotions. However, some of the posed expressions may appear exaggerated or artificial to observers. Furthermore, because some volunteers could not pose all six emotions well, many subjects in these datasets are missing some expressions. Therefore, a more natural and standardized Chinese emotional face dataset is needed.

Our faces change as we grow older, yet age information is rarely considered in face-related studies. Initial research showed that social judgments of young faces are primarily organized along two dimensions: trustworthiness and dominance10,11. However, when age was included as a factor and observers evaluated faces of diverse ages, attractiveness emerged as a third dimension of facial social judgment12. Moreover, classical face models propose that age, along with emotion and sex, plays a critical role in facial perception13. These findings underscore the importance of facial age as a critical factor in social judgments. Nevertheless, facial age information is frequently unavailable in current facial datasets, which impedes age-related research on faces.

Additionally, the facial expressions encountered in social interaction are typically dynamic. Dynamic facial expressions evoke stronger emotional responses than static ones and are recognized more accurately14,15. Dynamic face datasets therefore help explain how people perceive the dynamic properties of faces. Although researchers have created a dynamic face dataset based on Caucasian women and men14, a Chinese counterpart is still lacking. A Chinese dynamic facial expression dataset is needed to capture the characteristics of dynamic facial expression changes in Chinese individuals and to support cross-cultural comparisons.

Although several Chinese facial datasets are available9,16,17, the authenticity of the expressions posed by volunteers and the diversity of facial ages are limited, which compromises the credibility and validity of research findings based on such datasets. Furthermore, recruiting a large number of volunteers across diverse ages is challenging and requires a substantial investment of time and resources to train them to display the required emotions. Recent artificial intelligence (AI) technologies can help overcome this bottleneck in data collection18.

Compared with collecting real faces, using AI-generated faces offers advantages in experimental control, standardization, and the ease of obtaining novel stimuli7. We propose a method that introduces facial action units into a pre-trained StyleGAN to achieve high-quality expression editing. The approach produces naturally synthesized expressions without artifacts. Furthermore, we trained our model on Chinese faces with well-controlled identities, so that consistent basic emotions can be generated for each individual. Our method also supports age progression and dynamic attribute editing. The proposed method can thus extend currently available facial datasets, enhancing their quality, authenticity, and diversity.

In this study, a Generative Adversarial Network (GAN), namely the StyleGAN model, was employed to generate facial images. Our contribution is the creation of a comprehensive face dataset, SZU-EmoDage, which comprises facial images of 120 individuals (equally divided between men and women) with six basic emotions, various ages, and dynamic emotions. Specifically, the StyleGAN model enables the manipulation of facial expressions and age, producing all six basic facial emotions for each individual. To meet the growing interest in facial age perception, facial images representing ages from 10 to 70 years in 10-year increments were also generated. Notably, the SZU-EmoDage dataset incorporates dynamic and continuous changes in facial expressions, providing a valuable resource for further research in the field.

In summary, we present SZU-EmoDage, the first facial dataset synthesized using AI technologies for face perception research. Notably, the authenticity of the expressions and the diversity of faces across age groups surpass those of existing face datasets. This dataset makes a valuable contribution to the field of facial perception, particularly in areas such as cross-cultural analysis, dynamic facial perception, and facial age perception. Additionally, the extensive variation in face material can serve as an effective tool for detecting mental disorders. The dataset generated in this study represents a significant expansion of currently available facial materials and, owing to its improved quantity and diversity, is likely to have a substantial impact on related research.

Methods

Participants

We recruited 120 participants (60 men and 60 women, aged 18 to 28 years; M ± SD: 20.47 ± 1.83) for the study. All participants reported no history of mental illness and had normal or corrected-to-normal vision. All participants signed an informed consent form, and we followed the principles of voluntary withdrawal and no harm. Participants were paid 100 RMB after completing the experiment. The study was performed in accordance with the Declaration of Helsinki and approved by the local ethics committee of Shenzhen University.

Procedure

The procedure comprised three parts. (1) To ensure that the generated faces align with Chinese facial features, we used open face datasets5,9,17,19,20 to train a StyleGAN-based editing model and applied it to transform a neutral face into six different expressions. All data used in this research were obtained with the informed consent of the participants. (2) During the transformation of a neutral face into different expressions, interpolation of latent vectors was employed, enabling the generation of dynamic expressions with varying intensities. (3) Finally, to generate neutral faces of different ages, the open-source SAM (Style-based Age Manipulation) model21 was used. Starting from the neutral face of a subject, this model generated faces ranging from 10 to 70 years old.

Specifically, we used StyleGAN22-based AU (action unit) editing to change the expression of facial images. An AU corresponds to the contraction or relaxation of one or more facial muscles. Because facial expressions can be decomposed into combinations of multiple AUs23, changing a group of AUs can synthesize a desired expression on a facial image, as illustrated in the sketch below.
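To make the AU representation concrete, the following sketch builds a 17-dimensional AU intensity vector of the kind used later in the Methods. The specific AU list shown here follows the intensity outputs of a commonly used extractor (OpenFace 2.0) and is an assumption for illustration only, as is the happiness prototype (AU6 cheek raiser plus AU12 lip corner puller, a standard FACS example); in the actual pipeline the target AU vectors are extracted from labelled reference images rather than specified by hand.

```python
import numpy as np

# 17 action units whose intensities are estimated by common AU extractors
# such as OpenFace 2.0 (an assumed convention; the extractor used in this work is ref. 26).
AU_NAMES = ["AU01", "AU02", "AU04", "AU05", "AU06", "AU07", "AU09", "AU10",
            "AU12", "AU14", "AU15", "AU17", "AU20", "AU23", "AU25", "AU26", "AU45"]

def au_vector(active, intensity=3.0):
    """Build a 17-dimensional AU intensity vector with the given AUs switched on."""
    vec = np.zeros(len(AU_NAMES), dtype=np.float32)
    for au in active:
        vec[AU_NAMES.index(au)] = intensity
    return vec

# Illustrative FACS prototype for happiness: cheek raiser (AU6) + lip corner puller (AU12).
au_happiness = au_vector(["AU06", "AU12"])
print(dict(zip(AU_NAMES, au_happiness.tolist())))
```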

Our model comprises three main modules: the StyleGAN encoder, the AU fusion module, and the StyleGAN generator. The StyleGAN encoder uses the encoder architecture and pretrained model from Pixel2Style2Pixel24 and remains frozen throughout training. Its primary function is to extract image features and encode them into the latent space of StyleGAN, yielding the latent vector corresponding to the image. The AU fusion module consists of the AU encoder, the Style extractor, and the Style fusioner. The AU encoder maps the input target AU intensity vector into the latent-vector space, capturing specific AU attributes and target expression information; a 5-layer multi-layer perceptron (MLP) is employed for this mapping. The Style extractor and the Style fusioner are also 5-layer MLPs. The Style extractor extracts features such as identity and background from the latent vector, which are then concatenated with the target AU latent vector produced by the AU encoder. The concatenated vector is fed into the Style fusioner, which combines the style attribute features with the expression features and generates a new latent vector carrying the desired AUs. Through the AU fusion module, AUs and expressions can thus be manipulated in the latent space. The StyleGAN generator uses the StyleGAN FFHQ pretrained model25 and remains frozen throughout training; given a latent vector carrying the target AUs, it outputs the face with the desired expression.
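As a concrete illustration of the AU fusion module, the PyTorch sketch below wires together the three 5-layer MLPs described above. The hidden widths, activation functions, W+ latent layout (18 × 512), and the 0–5 AU intensity range are our assumptions for illustration; the description above specifies only the 5-layer MLP structure, and random tensors stand in for the outputs of the frozen pSp encoder and AU extractor.

```python
import torch
import torch.nn as nn

LATENT_DIM = 512    # dimensionality of one StyleGAN latent layer
N_LAYERS = 18       # W+ latent layers for a 1024x1024 StyleGAN (assumption)
N_AU = 17           # number of action-unit intensities

def mlp(in_dim, out_dim, hidden=512, depth=5):
    """A 5-layer MLP, as used for the AU encoder, Style extractor and Style fusioner."""
    layers, dim = [], in_dim
    for _ in range(depth - 1):
        layers += [nn.Linear(dim, hidden), nn.LeakyReLU(0.2)]
        dim = hidden
    layers.append(nn.Linear(dim, out_dim))
    return nn.Sequential(*layers)

class AUFusionModule(nn.Module):
    """Maps (latent vector w, target AU intensities) -> edited latent vector."""
    def __init__(self):
        super().__init__()
        self.au_encoder = mlp(N_AU, LATENT_DIM)                            # target AUs -> AU latent
        self.style_extractor = mlp(N_LAYERS * LATENT_DIM, LATENT_DIM)      # w -> style features
        self.style_fusioner = mlp(2 * LATENT_DIM, N_LAYERS * LATENT_DIM)   # fused -> new w

    def forward(self, w, au):
        style = self.style_extractor(w.flatten(1))    # identity/background features
        au_feat = self.au_encoder(au)                 # expression features
        fused = torch.cat([style, au_feat], dim=1)
        return self.style_fusioner(fused).view(-1, N_LAYERS, LATENT_DIM)

# Stand-ins for the outputs of the frozen pSp encoder and the AU extractor.
w_source = torch.randn(1, N_LAYERS, LATENT_DIM)   # latent of the source face
au_target = torch.rand(1, N_AU) * 5.0             # target AU intensities (assumed 0-5 scale)

# The edited latent would be decoded by the frozen StyleGAN generator.
w_edited = AUFusionModule()(w_source, au_target)
print(w_edited.shape)                             # torch.Size([1, 18, 512])
```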

In the training process, we paired different expression images of the same person to obtain an original expression image I1 and a target expression image I2. We then obtained the latent vector w1 corresponding to image I1 using the StyleGAN encoder24, and an AU vector au2 representing the contraction or relaxation of 17 AUs of face image I2 using an AU extractor26. The latent vector w1 was fed into the Style extractor to extract style features, which were concatenated with the output of the AU encoder applied to the target AU vector au2; the concatenated vector was then fed into the Style fusioner to obtain a new latent vector w2’ encoding the target expression. Finally, w2’ was fed into the StyleGAN generator22 to synthesize the face image I2’ with the target expression. To generate different expressions of a face image Is, a set of AU vectors AUt = (aut1, …, aut7) for the 7 target expressions (including neutral) was extracted from reference images with the seven expression labels. The latent vector ws of Is was then input, together with each target AU vector auti (i ∈ [1, 7]), into the trained model to obtain the latent vector wt, which was then used by the StyleGAN generator to synthesize a face image It with the target expression (Fig. 1).

Fig. 1
figure 1

The workflow of facial expression editing.

All images were mapped by StyleGAN into a smooth latent space, W. Two latent vectors that are close in this space generate similar images. As a result, interpolation in the latent space W can be used to generate intermediate expressions between the face with the original expression Is and the target expression It. Specifically, we performed linear interpolation between the original expression latent vector ws and the target expression latent vector wt to generate multiple intermediate latent vectors. The closer an intermediate latent vector is to ws, the more similar the expression image generated by the StyleGAN generator is to Is, and vice versa. In this way, we obtained many faces with intermediate expressions interpolated between the two expression images, which were concatenated in sequence to form a dynamic expression clip.
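A minimal sketch of this interpolation step is given below. The latent shape (18 × 512) and the number of interpolation steps are illustrative assumptions, and random tensors stand in for the encoded source latent and the AU-edited target latent; decoding each interpolated latent with the frozen StyleGAN generator would yield one frame of the dynamic clip.

```python
import torch

def interpolate_latents(w_s, w_t, steps=30):
    """Linearly interpolate between a source latent w_s and a target latent w_t.
    Early frames stay close to the source expression; later frames approach
    the target expression."""
    alphas = torch.linspace(0.0, 1.0, steps)
    return [(1.0 - a) * w_s + a * w_t for a in alphas]

# Stand-ins for the encoded neutral-face latent and an AU-edited target latent.
w_neutral = torch.randn(18, 512)
w_target = torch.randn(18, 512)

frames = interpolate_latents(w_neutral, w_target, steps=30)
print(len(frames), frames[0].shape)   # 30 torch.Size([18, 512])
```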

For age synthesis, SAM21 was used to obtain images with the desired age, Iage, which can be mapped into latent vectors wage in the StyleGAN latent space. The GAN prior embedded network (GPEN)27 was further used to increase the resolution of the facial images. Similar to expression interpolation, dynamic age changes can be realized by interpolating latent vectors between faces of different ages. In total, we generated the seven basic expressions, aging faces, and dynamic emotional faces for 180 individuals (half men, half women).
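For the dynamic age changes, the interpolation can be chained through the latents of consecutive age anchors. The sketch below illustrates this piecewise-linear scheme; the latent shape, the number of frames per segment, and the random tensors standing in for the SAM-produced latents are assumptions, and the GPEN super-resolution step is omitted.

```python
import torch

AGES = list(range(10, 71, 10))   # anchor ages: 10, 20, ..., 70 years

def aging_sequence(age_latents, steps_per_segment=15):
    """Build a smooth aging clip by piecewise-linear interpolation through the
    latent vectors of consecutive age anchors (10 -> 20 -> ... -> 70 years).
    Decoding each latent with the StyleGAN generator yields one frame."""
    frames = []
    for w_a, w_b in zip(age_latents[:-1], age_latents[1:]):
        for a in torch.linspace(0.0, 1.0, steps_per_segment):
            frames.append((1.0 - a) * w_a + a * w_b)
    return frames

# Stand-ins for one subject's SAM-produced latents at each anchor age.
age_latents = [torch.randn(18, 512) for _ in AGES]
clip = aging_sequence(age_latents)
print(len(clip))   # (len(AGES) - 1) * 15 = 90 frames
```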

To validate the efficacy of the proposed data generation method, our method was compared with several state-of-the-art expression editing methods, including HiSD28, GANimation29, Expression-Manipulator (ExprMAN)30, and InterfaceGAN31 (Fig. 2). Each method was used to generate the neutral and six basic expressions for the same individual.

After using StyleGAN to generate the facial images, we recruited participants to rate the morphed faces using 9-point scales9; the development process followed that of a related facial dataset9. Eight participants were first invited to evaluate the emotional representation of the pictures and to perform a preliminary screening. Finally, the faces of 60 men and 60 women were selected as the formal experimental materials. These 120 individuals yielded 840 emotional faces in total, which were then evaluated for emotional category and emotional dimensions (valence, arousal, and dominance). To prevent fatigue from judging numerous faces, we divided the assessment into three parts and recruited 40 college students (20 men and 20 women) for each part. The first group of participants, aged 18 to 23 (M ± SD: 19.88 ± 1.65), evaluated the emotional category of the presented faces. The second group, aged 18 to 25 (M ± SD: 20.50 ± 1.88), evaluated valence (positive, neutral, or negative), arousal (from 1 = “not at all excited” to 9 = “very excited”), dominance (from 1 = “weak sense of dominance” to 9 = “strong sense of dominance”), and authenticity (from 1 = “not authentic at all” to 9 = “very authentic”) of the faces. The third group, aged 19 to 28 (M ± SD: 21.18 ± 1.77), evaluated the ages of the faces with neutral expressions (Fig. 3).

Fig. 2
figure 2

Facial expressions generated by different algorithms.

Fig. 3
figure 3

Overview of face acquisition. The dataset includes faces with dynamic expressions, different ages, and different emotions.

Data Records

The face dataset is freely available at https://osf.io/7a5fs/ under a CC license32. The face images and videos of different emotions, ages, and dynamic expressions are stored in three separate compressed folders. Within each folder, the face images or videos generated from the same individual are organized into a subfolder named “<gender> <id>”, where “gender” and “id” refer to the gender and ID of the individual. Face images are named according to the corresponding expression or age, while videos are named according to the corresponding expression and video duration.
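For convenience, a short Python sketch for indexing a local copy of the extracted archives is given below. The root path and the subset folder name used here (“SZU-EmoDage”, “emotions”) are placeholders rather than a guaranteed listing of the OSF repository; only the “<gender> <id>” subfolder convention is taken from the description above.

```python
from pathlib import Path

ROOT = Path("SZU-EmoDage")   # placeholder path to the extracted archives

def index_subjects(subset):
    """Group files by subject within one of the three subsets
    (e.g. the emotion, age, or dynamic-expression folder)."""
    subset_dir = ROOT / subset
    if not subset_dir.is_dir():
        raise FileNotFoundError(f"Subset folder not found: {subset_dir}")
    index = {}
    for subject_dir in sorted(subset_dir.iterdir()):
        if subject_dir.is_dir():
            # Subfolders follow the "<gender> <id>" convention described above.
            index[subject_dir.name] = sorted(p.name for p in subject_dir.iterdir())
    return index

if __name__ == "__main__":
    for subject, files in index_subjects("emotions").items():   # "emotions" is a placeholder name
        print(subject, len(files), "files")
```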

Technical Validation

We conducted a comparative analysis between our method and several state-of-the-art expression editing methods, including HiSD, GANimation, ExprMAN, and InterfaceGAN. Both HiSD and GANimation have difficulty editing the expressions accurately and generate low-quality images with noticeable artifacts. InterfaceGAN generates fewer artifacts but produces expressions that appear unnatural. In comparison, our method produces high-quality images with minimal artifacts and natural expressions, thereby outperforming the other methods (Fig. 2).

We compared the intended expression categories of the 840 faces in our dataset with the categories labeled by the recruited volunteers; the matching proportions are listed in Table 1. On average, the matching percentage exceeds 70%. Happiness has the highest matching rate (100%), followed by neutral (98%), surprise (83%), sadness (82%), disgust (71%), anger (57%), and fear (51%). Furthermore, a confusion matrix was computed to illustrate the matching rate of each type of facial expression (Fig. 4).

Table 1 The percentage of different matching rates of seven emotions (%).
Fig. 4
figure 4

Confusion matrix of rated facial expressions. Columns represent the facial expressions perceived by raters, while rows represent the intended (generated) expressions.
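Such a confusion matrix can be reproduced from the raw rating records with a few lines of code. The sketch below uses randomly generated stand-in labels purely for illustration; in the actual analysis, `intended` would hold the expression each face was generated with and `perceived` would hold each rater's chosen category.

```python
import numpy as np
import pandas as pd

EMOTIONS = ["neutral", "happiness", "surprise", "sadness", "disgust", "anger", "fear"]

# Stand-in data: random labels with roughly 80% agreement (for illustration only).
rng = np.random.default_rng(0)
intended = rng.choice(EMOTIONS, size=500)
perceived = np.where(rng.random(500) < 0.8, intended, rng.choice(EMOTIONS, size=500))

# Row-normalized confusion matrix: rows = intended expression,
# columns = perceived expression; diagonal entries are the matching rates.
confusion = pd.crosstab(pd.Series(intended, name="intended"),
                        pd.Series(perceived, name="perceived"),
                        normalize="index").reindex(index=EMOTIONS, columns=EMOTIONS, fill_value=0)

print((100 * confusion).round(1))
```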

We compared the accuracy of basic emotion recognition in SZU-EmoDage with that in other Chinese expression databases, including the facial-expression database of Chinese (FEDC)-Han20, FEDC-Hui20, FEDC-Tibetan20, the Tsinghua facial-expression database17, the first version of CAFPS (CAFPS1)16, and the updated version of CAFPS (CAFPS2)33. The accuracy of basic emotion recognition in SZU-EmoDage was similar to that in the other databases for neutral, happy, surprised, disgusted, and sad expressions. The accuracy for disgusted and fearful expressions in the two versions of the Chinese Facial Affective Picture System was below 30%, whereas in SZU-EmoDage it was 51% or higher (see Table 2 and Fig. 5). These results demonstrate the potential of deep learning to generate facial expressions that are recognized reliably and accurately.

Table 2 The accuracy rate of basic emotion recognition in different databases (%).
Fig. 5
figure 5

The accuracy rate of basic emotion recognition in SZU-EmoDage, the Facial-Expression Database of Chinese Han, Hui, and Tibetan people, the Tsinghua facial expression database, and the two versions of the Chinese Facial Affective Picture System.

Table 3 shows the percentage of emotional valence ratings for each emotion. The negative emotions anger, disgust, and sadness were predominantly rated as negative in valence, with percentages ranging from 65.35% to 68.67%. Fear was also rated as negative in valence, but by a lower percentage of raters (37.96%). In contrast, happiness expressions were rated as positive in valence in 98.08% of cases. Neutral and surprise were predominantly rated as neutral in valence (94.33% and 67.31%, respectively).

Table 3 The percentage of the emotional valence rating (%).

We compared arousal and dominance ratings across the different emotions. Happiness was rated as the most arousing emotion, while neutral and disgust were rated as the least arousing. Anger was rated as the most dominant emotion, while neutral faces were rated as the least dominant. To assess the extent to which the emotions were expressed naturally, participants also rated the authenticity of each facial expression. The average authenticity rating for every emotion was above five, indicating that participants perceived the facial expressions as at least somewhat genuine. Happy expressions were rated as the most authentic (Table 4).

Table 4 The degree of arousal, dominance and authenticity of seven emotions.
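The dimension scores in Table 4 are simple aggregates of the second rating group's responses. The sketch below shows this aggregation on randomly generated stand-in data (for illustration only; the real analysis would load the collected rating records), assuming one row per rater-face pair with 9-point arousal, dominance, and authenticity scores.

```python
import numpy as np
import pandas as pd

EMOTIONS = ["neutral", "happiness", "surprise", "sadness", "disgust", "anger", "fear"]

# Stand-in rating records: one row per (rater, face) pair (illustration only).
rng = np.random.default_rng(1)
ratings = pd.DataFrame({
    "emotion": rng.choice(EMOTIONS, size=1000),
    "arousal": rng.integers(1, 10, size=1000),       # 9-point scale
    "dominance": rng.integers(1, 10, size=1000),
    "authenticity": rng.integers(1, 10, size=1000),
})

# Mean and SD per emotion: the aggregation underlying a table such as Table 4.
summary = ratings.groupby("emotion")[["arousal", "dominance", "authenticity"]].agg(["mean", "std"])
print(summary.round(2))
```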

To assess the stability and reliability of the expression ratings, we analyzed the internal consistency of each emotion category for arousal, dominance, and authenticity. All seven emotion categories demonstrated high reliability, with Cronbach’s alpha values all above 0.9 (see Table 5), indicating that the evaluation of the selected faces in the database was highly stable and reliable.

Table 5 The Cronbach alpha internal consistency reliability coefficient of each facial expression in the dimension of arousal, dominance and authenticity.
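For reference, Cronbach’s alpha can be computed as sketched below. The function follows the standard formula; the random stand-in scores, the matrix dimensions, and the treatment of raters as rows and the faces of one emotion category (on one dimension, e.g. arousal) as columns are assumptions for illustration, since the exact analysis layout is not detailed above.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for an observations x items score matrix
    (here assumed: raters as rows, the faces of one emotion category
    rated on a single dimension as columns)."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]                              # number of items
    item_vars = scores.var(axis=0, ddof=1).sum()     # sum of per-item variances
    total_var = scores.sum(axis=1).var(ddof=1)       # variance of total scores
    return k / (k - 1) * (1.0 - item_vars / total_var)

# Stand-in data: 20 raters scoring 120 faces on a 9-point scale (illustration only;
# random scores naturally give a low alpha, unlike the real ratings).
rng = np.random.default_rng(2)
scores = rng.integers(1, 10, size=(20, 120))
print(round(cronbach_alpha(scores), 3))
```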

The current dataset also includes faces aged from 10 to 70 years, in 10-year intervals. The rating results indicate that the proportions of faces rated as falling in the age ranges of 10–20, 30–50, and 60–70 years were 25.2%, 34.1%, and 40.7%, respectively.

Usage Notes

The SZU-EmoDage dataset and the proposed method contribute significantly to face perception research. Deep-learning models serve as powerful tools for balancing experimental control and ecological validity18, ultimately helping to generate naturalistic and standardized datasets. Researchers can leverage our AU-integrated StyleGAN model to generate a large number of faces as required. However, using the method requires some basic technical knowledge, including deep-learning fundamentals and proficiency in Python programming, as well as access to computational resources such as GPUs with large memory to accelerate image generation. Additionally, StyleGAN can be further developed to model new Chinese facial datasets related to social attributes, including facial attractiveness, trustworthiness, and dominance10,11,12. This would allow the investigation of further scientific questions related to social cognition and the development of new face models for improving facial-perception technology. The generated datasets can also serve as stimuli for detecting individual differences in facial expression recognition, particularly those related to emotional disorders, and for investigating cross-cultural disparities in facial perception.