Assessing the Big Five personality traits using real-life static facial images

Kachur, Alexander; Osin, Evgeny; Davydov, Denis; Shutilov, Konstantin; Novokshonov, Alexey

doi:10.1038/s41598-020-65358-6

Download PDF

Article
Open access
Published: 22 May 2020

Assessing the Big Five personality traits using real-life static facial images

Scientific Reports volume 10, Article number: 8487 (2020) Cite this article

133k Accesses
51 Citations
349 Altmetric
Metrics details

Subjects

Abstract

There is ample evidence that morphological and social cues in a human face provide signals of human personality and behaviour. Previous studies have discovered associations between the features of artificial composite facial images and attributions of personality traits by human experts. We present new findings demonstrating the statistically significant prediction of a wider set of personality features (all the Big Five personality traits) for both men and women using real-life static facial images. Volunteer participants (N = 12,447) provided their face photographs (31,367 images) and completed a self-report measure of the Big Five traits. We trained a cascade of artificial neural networks (ANNs) on a large labelled dataset to predict self-reported Big Five scores. The highest correlations between observed and predicted personality scores were found for conscientiousness (0.360 for men and 0.335 for women) and the mean effect size was 0.243, exceeding the results obtained in prior studies using ‘selfies’. The findings strongly support the possibility of predicting multidimensional personality profiles from static facial images using ANNs trained on large labelled datasets. Future research could investigate the relative contribution of morphological features of the face and other characteristics of facial images to predicting personality.

Facial recognition technology can expose political orientation from naturalistic facial images

Article Open access 11 January 2021

Individual differences and the multidimensional nature of face perception

Article 28 March 2022

Professional actors demonstrate variability, not stereotypical expressions, when portraying emotional states in photographs

Article Open access 19 August 2021

Introduction

A growing number of studies have linked facial images to personality. It has been established that humans are able to perceive certain personality traits from each other’s faces with some degree of accuracy^1,2,3,4. In addition to emotional expressions and other nonverbal behaviours conveying information about one’s psychological processes through the face, research has found that valid inferences about personality characteristics can even be made based on static images of the face with a neutral expression^5,6,7. These findings suggest that people may use signals from each other’s faces to adjust the ways they communicate, depending on the emotional reactions and perceived personality of the interlocutor. Such signals must be fairly informative and sufficiently repetitive for recipients to take advantage of the information being conveyed⁸.

Studies focusing on the objective characteristics of human faces have found some associations between facial morphology and personality features. For instance, facial symmetry predicts extraversion⁹. Another widely studied indicator is the facial width to height ratio (fWHR), which has been linked to various traits, such as achievement striving¹⁰, deception¹¹, dominance¹², aggressiveness^13,14,15,16, and risk-taking¹⁷. The fWHR can be detected with high reliability irrespective of facial hair. The accuracy of fWHR-based judgements suggests that the human perceptual system may have evolved to be sensitive to static facial features, such as the relative face width¹⁸.

There are several theoretical reasons to expect associations between facial images and personality. First, genetic background contributes to both face and personality. Genetic correlates of craniofacial characteristics have been discovered both in clinical contexts^19,20 and in non-clinical populations²¹. In addition to shaping the face, genes also play a role in the development of various personality traits, such as risky behaviour^22,23,24, and the contribution of genes to some traits exceeds the contribution of environmental factors²⁵. For the Big Five traits, heritability coefficients reflecting the proportion of variance that can be attributed to genetic factors typically lie in the 0.30–0.60 range^26,27. From an evolutionary perspective, these associations can be expected to have emerged by means of sexual selection. Recent studies have argued that some static facial features, such as the supraorbital region, may have evolved as a means of social communication²⁸ and that facial attractiveness signalling valuable personality characteristics is associated with mating success²⁹.

Second, there is some evidence showing that pre- and postnatal hormones affect both facial shape and personality. For instance, the face is a visible indicator of the levels of sex hormones, such as testosterone and oestrogen, which affect the formation of skull bones and the fWHR^30,31,32. Given that prenatal and postnatal sex hormone levels do influence behaviour, facial features may correlate with hormonally driven personality characteristics, such as aggressiveness³³, competitiveness, and dominance, at least for men^34,35. Thus, in addition to genes, the associations of facial features with behavioural tendencies may also be explained by androgens and potentially other hormones affecting both face and behaviour.

Third, the perception of one’s facial features by oneself and by others influences one’s subsequent behaviour and personality³⁶. Just as the perceived ‘cleverness’ of an individual may lead to higher educational attainment³⁷, prejudice associated with the shape of one’s face may lead to the development of maladaptive personality characteristics (i.e., the ‘Quasimodo complex’³⁸). The associations between appearance and personality over the lifespan have been explored in longitudinal observational studies, providing evidence of ‘self-fulfilling prophecy’-type and ‘self-defeating prophecy’-type effects³⁹.

Fourth and finally, some personality traits are associated with habitual patterns of emotionally expressive behaviour. Habitual emotional expressions may shape the static features of the face, leading to the formation of wrinkles and/or the development of facial muscles.

Existing studies have revealed the links between objective facial picture cues and general personality traits based on the Five-Factor Model or the Big Five (BF) model of personality⁴⁰. However, a quick glance at the sizes of the effects found in these studies (summarized in Table 1) reveals much controversy. The results appear to be inconsistent across studies and hardly replicable⁴¹. These inconsistencies may result from the use of small samples of stimulus faces, as well as from the vast differences in methodologies. Stronger effect sizes are typically found in studies using composite facial images derived from groups of individuals with high and low scores on each of the Big Five dimensions^6,7,8. Naturally, the task of identifying traits using artificial images comprised of contrasting pairs with all other individual features eliminated or held constant appears to be relatively easy. This is in contrast to realistic situations, where faces of individuals reflect a full range of continuous personality characteristics embedded in a variety of individual facial features.

Table 1 Effect sizes found in studies that used integral facial images to predict personality traits.

Full size table

Studies relying on photographic images of individual faces, either artificially manipulated^2,42 or realistic, tend to yield more modest effects. It appears that studies using realistic photographs made in controlled conditions (neutral expression, looking straight at the camera, consistent posture, lighting, and distance to the camera, no glasses, no jewellery, no make-up, etc.) produce stronger effects than studies using ‘selfies’²⁵. Unfortunately, differences in the methodologies make it hard to hypothesize whether the diversity of these findings is explained by variance in image quality, image background, or the prediction models used.

Research into the links between facial picture cues and personality traits faces several challenges. First, the number of specific facial features is very large, and some of them are hard to quantify. Second, the effects of isolated facial features are generally weak and only become statistically noticeable in large samples. Third, the associations between objective facial features and personality traits might be interactive and nonlinear. Finally, studies using real-life photographs confront an additional challenge in that the very characteristics of the images (e.g., the angle of the head, facial expression, makeup, hairstyle, facial hair style, etc.) are based on the subjects’ choices, which are potentially influenced by personality; after all, one of the principal reasons why people make and share their photographs is to signal to others what kind of person they are. The task of isolating the contribution of each variable out of the multitude of these individual variables appears to be hardly feasible. Instead, recent studies in the field have tended to rely on a holistic approach, investigating the subjective perception of personality based on integral facial images.

The holistic approach aims to mimic the mechanisms of human perception of the face and the ways in which people make judgements about each other’s personality. This approach is supported by studies of human face perception, showing that faces are perceived and encoded in a holistic manner by the human brain^43,44,45,46. Put differently, when people identify others, they consider individual facial features (such as a person’s eyes, nose, and mouth) in concert as a single entity rather than as independent pieces of information^47,48,49,50. Similar to facial identification, personality judgements involve the extraction of invariant facial markers associated with relatively stable characteristics of an individual’s behaviour. Existing evidence suggests that various social judgements might be based on a common visual representational system involving the holistic processing of visual information^51,52. Thus, even though the associations between isolated facial features and personality characteristics sought by ancient physiognomists have emerged to be weak, contradictory or even non-existent, the holistic approach to understanding the face-personality links appears to be more promising.

An additional challenge faced by studies seeking to reveal the face-personality links is constituted by the inconsistency of the evaluations of personality traits by human raters. As a result, a fairly large number of human raters is required to obtain reliable estimates of personality traits for each photograph. In contrast, recent attempts at using machine learning algorithms have suggested that artificial intelligence may outperform individual human raters. For instance, S. Hu and colleagues⁴⁰ used the composite partial least squares component approach to analyse dense 3D facial images obtained in controlled conditions and found significant associations with personality traits (stronger for men than for women).

A similar approach can be implemented using advanced machine learning algorithms, such as artificial neural networks (ANNs), which can extract and process significant features in a holistic manner. The recent applications of ANNs to the analysis of human faces, body postures, and behaviours with the purpose of inferring apparent personality traits^53,54 indicate that this approach leads to a higher accuracy of prediction compared to individual human raters. The main difficulty of the ANN approach is the need for large labelled training datasets that are difficult to obtain in laboratory settings. However, ANNs do not require high-quality photographs taken in controlled conditions and can potentially be trained using real-life photographs provided that the dataset is large enough. The interpretation of findings in such studies needs to acknowledge that a real-life photograph, especially one chosen by a study participant, can be viewed as a holistic behavioural act, which may potentially contain other cues to the subjects’ personality in addition to static facial features (e.g., lighting, hairstyle, head angle, picture quality, etc.).

The purpose of the current study was to investigate the associations of facial picture cues with self-reported Big Five personality traits by training a cascade of ANNs to predict personality traits from static facial images. The general hypothesis is that a real-life photograph contains cues about personality that can be extracted using machine learning. Due to the vast diversity of findings concerning the prediction accuracy of different traits across previous studies, we did not set a priori hypotheses about differences in prediction accuracy across traits.

Results

Prediction accuracy

We used data from the test dataset containing predicted scores for 3,137 images associated with 1,245 individuals. To determine whether the variance in the predicted scores was associated with differences across images or across individuals, we calculated the intraclass correlation coefficients (ICCs) presented in Table 2. The between-individual proportion of variance in the predicted scores ranged from 79 to 88% for different traits, indicating a general consistency of predicted scores for different photographs of the same individual. We derived the individual scores used in all subsequent analyses as the simple averages of the predicted scores for all images provided by each participant.

Table 2 BF personality trait prediction accuracy for ANN models.

Full size table

The correlation coefficients between the self-report test scores and the scores predicted by the ANN ranged from 0.14 to 0.36. The associations were strongest for conscientiousness and weakest for openness. Extraversion and neuroticism were significantly better predicted for women than for men (based on the z test). We also compared the prediction accuracy within each gender using Steiger’s test for dependent sample correlation coefficients. For men, conscientiousness was predicted more accurately than the other four traits (the differences among the latter were not statistically significant). For women, conscientiousness was predicted more accurately, and openness was predicted less accurately compared to the three other traits.

The mean absolute error (MAE) of prediction ranged between 0.89 and 1.04 standard deviations. We did not find any associations between the number of photographs and prediction error.

Trait intercorrelations

The structure of the correlations between the scales was generally similar for the observed test scores and the predicted values, but some coefficients differed significantly (based on the z test) (see Table 3). Most notably, predicted openness was more strongly associated with conscientiousness (negatively) and extraversion (positively), whereas its association with agreeableness was negative rather than positive. The associations of predicted agreeableness with conscientiousness and neuroticism were stronger than those between the respective observed scores. In women, predicted neuroticism demonstrated a stronger inverse association with conscientiousness and a stronger positive association with openness. In men, predicted neuroticism was less strongly associated with extraversion than its observed counterpart.

Table 3 Trait intercorrelations for observed (first line) and predicted (in parentheses) Big Five scores.

Full size table

To illustrate the findings, we created composite images using Abrosoft FantaMorph 5 by averaging the uploaded images across contrast groups of 100 individuals with the highest and the lowest test scores on each trait. The resulting morphed images in which individual features are eliminated are presented in Fig. 1.

Discussion

This study presents new evidence confirming that human personality is related to individual facial appearance. We expected that machine learning (in our case, artificial neural networks) could reveal multidimensional personality profiles based on static morphological facial features. We circumvented the reliability limitations of human raters by developing a neural network and training it on a large dataset labelled with self-reported Big Five traits.

We expected that personality traits would be reflected in the whole facial image rather than in its isolated features. Based on this expectation, we developed a novel two-tier machine learning algorithm to encode the invariant facial features as a vector in a 128-dimensional space that was used to predict the BF traits by means of a multilayer perceptron. Although studies using real-life photographs do not require strict experimental conditions, we had to undertake a series of additional organizational and technological steps to ensure consistent facial image characteristics and quality.

Our results demonstrate that real-life photographs taken in uncontrolled conditions can be used to predict personality traits using complex computer vision algorithms. This finding is in contrast to previous studies that mostly relied on high-quality facial images taken in controlled settings. The accuracy of prediction that we obtained exceeds that in the findings of prior studies that used realistic individual photographs taken in uncontrolled conditions (e.g., selfies⁵⁵). The advantage of our methodology is that it is relatively simple (e.g., it does not rely on 3D scanners or 3D facial landmark maps) and can be easily implemented using a desktop computer with a stock graphics accelerator.

In the present study, conscientiousness emerged to be more easily recognizable than the other four traits, which is consistent with some of the existing findings^7,40. The weaker effects for extraversion and neuroticism found in our sample may be because these traits are associated with positive and negative emotional experiences, whereas we only aimed to use images with neutral or close to neutral emotional expressions. Finally, this appears to be the first study to achieve a significant prediction of openness to experience. Predictions of personality based on female faces appeared to be more reliable than those for male faces in our sample, in contrast to some previous studies⁴⁰.

The BF factors are known to be non-orthogonal, and we paid attention to their intercorrelations in our study^56,57. Various models have attempted to explain the BF using higher-order dimensions, such as stability and plasticity⁵⁸ or a single general factor of personality (GFP)⁵⁹. We discovered that the intercorrelations of predicted factors tend to be stronger than the intercorrelations of self-report questionnaire scales used to train the model. This finding suggests a potential biological basis of GFP. However, the stronger intercorrelations of the predicted scores can be explained by consistent differences in picture quality (just as the correlations between the self-report scales can be explained by social desirability effects and other varieties of response bias⁶⁰). Clearly, additional research is needed to understand the context of this finding.

We believe that the present study, which did not involve any subjective human raters, constitutes solid evidence that all the Big Five traits are associated with facial cues that can be extracted using machine learning algorithms. However, despite having taken reasonable organizational and technical steps to exclude the potential confounds and focus on static facial features, we are still unable to claim that morphological features of the face explain all the personality-related image variance captured by the ANNs. Rather, we propose to see facial photographs taken by subjects themselves as complex behavioural acts that can be evaluated holistically and that may contain various other subtle personality cues in addition to static facial features.

The correlations reported above with a mean r = 0.243 can be viewed as modest; indeed, facial image-based personality assessment can hardly replace traditional personality measures. However, this effect size indicates that an ANN can make a correct guess about the relative standing of two randomly chosen individuals on a personality dimension in 58% of cases (as opposed to the 50% expected by chance)⁶¹. The effect sizes we observed are comparable with the meta-analytic estimates of correlations between self-reported and observer ratings of personality traits: the associations range from 0.30 to 0.49 when one’s personality is rated by close relatives or colleagues, but only from −0.01 to 0.29 when rated by strangers⁶². Thus, an artificial neural network relying on static facial images outperforms an average human rater who meets the target in person without any prior acquaintance. Given that partner personality and match between two personalities predict friendship formation⁶³, long-term relationship satisfaction⁶⁴, and the outcomes of dyadic interaction in unstructured settings⁶⁵, the aid of artificial intelligence in making partner choices could help individuals to achieve more satisfying interaction outcomes.

There are a vast number of potential applications to be explored. The recognition of personality from real-life photos can be applied in a wide range of scenarios, complementing the traditional approaches to personality assessment in settings where speed is more important than accuracy. Applications may include suggesting best-fitting products or services to customers, proposing to individuals a best match in dyadic interaction settings (such as business negotiations, online teaching, etc.) or personalizing the human-computer interaction. Given that the practical value of any selection method is proportional to the number of decisions made and the size and variability of the pool of potential choices⁶⁶, we believe that the applied potential of this technology can be easily revealed at a large scale, given its speed and low cost. Because the reliability and validity of self-report personality measures is not perfect, prediction could be further improved by supplementing these measures with peer ratings and objective behavioural indicators of personality traits.

The fact that conscientiousness was predicted better than the other traits for both men and women emerges as an interesting finding. From an evolutionary perspective, one would expect the traits most relevant for cooperation (conscientiousness and agreeableness) and social interaction (certain facets of extraversion and neuroticism, such as sociability, dominance, or hostility) to be reflected more readily in the human face. The results are generally in line with this idea, but they need to be replicated and extended by incorporating trait facets in future studies to provide support for this hypothesis.

Finally, although we tried to control the potential sources of confounds and errors by instructing the participants and by screening the photographs (based on angles, facial expressions, makeup, etc.), the present study is not without limitations. First, the real-life photographs we used could still carry a variety of subtle cues, such as makeup, angle, light facial expressions, and information related to all the other choices people make when they take and share their own photographs. These additional cues could say something about their personality, and the effects of all these variables are inseparable from those of static facial features, making it hard to draw any fundamental conclusions from the findings. However, studies using real-life photographs may have higher ecological validity compared to laboratory studies; our results are more likely to generalize to real-life situations where users of various services are asked to share self-pictures of their choice.

Another limitation pertains to a geographically bounded sample of individuals; our participants were mostly Caucasian and represented one cultural and age group (Russian-speaking adults). Future studies could replicate the effects using populations representing a more diverse variety of ethnic, cultural, and age groups. Studies relying on other sources of personality data (e.g., peer ratings or expert ratings), as well as wider sets of personality traits, could complement and extend the present findings.

Methods

Sample and procedure

The study was carried out in the Russian language. The participants were anonymous volunteers recruited through social network advertisements. They did not receive any financial remuneration but were provided with a free report on their Big Five personality traits. The data were collected online using a dedicated research website and a mobile application. The participants provided their informed consent, completed the questionnaires, reported their age and gender and were asked to upload their photographs. They were instructed to take or upload several photographs of their face looking directly at the camera with enough lighting, a neutral facial expression and no other people in the picture and without makeup.

Our goal was to obtain an out-of-sample validation dataset of 616 respondents of each gender to achieve 80% power for a minimum effect we considered to be of practical significance (r = 0.10 at p < 0.05), requiring a total of 6,160 participants of each gender in the combined dataset comprising the training and validation datasets. However, we aimed to gather more data because we expected that some online respondents might provide low-quality or non-genuine photographs and/or invalid questionnaire responses.

The initial sample included 25,202 participants who completed the questionnaire and uploaded a total of 77,346 photographs. The final combined dataset comprised 12,447 valid questionnaires and 31,367 associated photographs after the data screening procedures (below). The participants ranged in age from 18 to 60 (59.4% women, M = 27.61, SD = 12.73, and 40.6% men, M = 32.60, SD = 11.85). The dataset was split randomly into a training dataset (90%) and a test dataset (10%) used to validate the prediction model. The validation dataset included the responses of 505 men who provided 1224 facial images and 740 women who provided 1913 images. Due to the sexually dimorphic nature of facial features and certain personality traits (particularly extraversion^1,67,68), all the predictive models were trained and validated separately for male and female faces.

Ethical approval

The research was carried out in accordance with the Declaration of Helsinki. The study protocol was approved by the Research Ethics Committee of the Open University for the Humanities and Economics. We obtained the participants’ informed consent to use their data and photographs for research purposes and to publish generalized findings. The morphed group average images presented in the paper do not allow the identification of individuals. No information or images that could lead to the identification of study participants have been published.

Data screening

We excluded incomplete questionnaires (N = 3,035) and used indices of response consistency to screen out random responders⁶⁹. To detect systematic careless responses, we used the modal response category count, maximum longstring (maximum number of identical responses given in sequence by participant), and inter-item standard deviation for each questionnaire. At this stage, we screened out the answers of individuals with zero standard deviations (N = 329) and a maximum longstring above 10 (N = 1,416). To detect random responses, we calculated the following person-fit indices: the person-total response profile correlation, the consistency of response profiles for the first and the second half of the questionnaire, the consistency of response profiles obtained based on equivalent groups of items, the number of polytomous Guttman errors, and the intraclass correlation of item responses within facets.

Next, we conducted a simulation by generating random sets of integers in the 1–5 range based on a normal distribution (µ = 3, σ = 1) and on the uniform distribution and calculating the same person-fit indices. For each distribution, we generated a training dataset and a test dataset, each comprised of 1,000 simulated responses and 1,000 real responses drawn randomly from the sample. Next, we ran a logistic regression model using simulated vs real responses as the outcome variable and chose an optimal cutoff point to minimize the misclassification error (using the R package optcutoff). The sensitivity value was 0.991 for the uniform distribution and 0.960 for the normal distribution, and the specificity values were 0.923 and 0.980, respectively. Finally, we applied the trained model to the full dataset and identified observations predicted as likely to be simulated based on either distribution (N = 1,618). The remaining sample of responses (N = 18,804) was used in the subsequent analyses.

Big Five measure

We used a modified Russian version of the 5PFQ questionnaire⁷⁰, which is a 75-item measure of the Big Five model, with 15 items per trait grouped into five three-item facets. To confirm the structural validity of the questionnaire, we tested an exploratory structural equation (ESEM) model with target rotation in Mplus 8.2. The items were treated as ordered categorical variables using the WLSMV estimator, and facet variance was modelled by introducing correlated uniqueness values for the items comprising each facet.

The theoretical model showed a good fit to the data (χ² = 147854.68, df = 2335, p < 0.001; CFI = 0.931; RMSEA = 0.040 [90% CI: 0.040, 0.041]; SRMR = 0.024). All the items showed statistically significant loadings on their theoretically expected scales (λ ranged from 0.14 to 0.87, M = 0.51, SD = 0.17), and the absolute cross-loadings were reasonably low (M = 0.11, SD = 0.11). The distributions of the resulting scales were approximately normal (with skewness and kurtosis values within the [−1; 1] range). To assess the reliability of the scales, we calculated two internal consistency indices, namely, robust omega (using the R package coefficientalpha) and algebraic greatest lower bound (GLB) reliability (using the R package psych)⁷¹ (see Table 4).

Table 4 Reliability indices per scale.

Full size table

Image screening and pre-processing

The images (photographs and video frames) were subjected to a three-step screening procedure aimed at removing fake and low-quality images. First, images with no human faces or with more than one human face were detected by our computer vision (CV) algorithms and automatically removed. Second, celebrity images were identified and removed by means of a dedicated neural network trained on a celebrity photo dataset (CelebFaces Attributes Dataset (CelebA), N > 200,000)⁷² that was additionally enriched with pictures of Russian celebrities. The model showed a 98.4% detection accuracy. Third, we performed a manual moderation of the remaining images to remove images with partially covered faces, those that were evidently photoshopped or any other fake images not detected by CV.

The images retained for subsequent processing were converted to single-channel 8-bit greyscale format using the OpenCV framework (opencv.org). Head position (pitch, yaw, roll) was measured using our own dedicated neural network (multilayer perceptron) trained on a sample of 8 000 images labelled by our team. The mean absolute error achieved on the test sample of 800 images was 2.78° for roll, 1.67° for pitch, and 2.34° for yaw. We used the head position data to retain the images with yaw and roll within the −30° to 30° range and pitch within the −15° to 15° range.

Next, we assessed emotional neutrality using the Microsoft Cognitive Services API on the Azure platform (score range: 0 to 1) and used 0.50 as a threshold criterion to remove emotionally expressive images. Finally, we applied the face and eye detection, alignment, resize, and crop functions available within the Dlib (dlib.net) open-source toolkit to arrive at a set of standardized 224 × 224 pixel images with eye pupils aligned to a standard position with an accuracy of 1 px. Images with low resolution that contained less than 60 pixels between the eyes, were excluded in the process.

The final photoset comprised 41,835 images. After the screened questionnaire responses and images were joined, we obtained a set of 12,447 valid Big Five questionnaires associated with 31,367 validated images (an average of 2.59 images per person for women and 2.42 for men).

Neural network architecture

First, we developed a computer vision neural network (NNCV) aiming to determine the invariant features of static facial images that distinguish one face from another but remain constant across different images of the same person. We aimed to choose a neural network architecture with a good feature space and resource-efficient learning, considering the limited hardware available to our research team. We chose a residual network architecture based on ResNet⁷³ (see Fig. 2).

This type of neural network was originally developed for image classification. We dropped the final layer from the original architecture and obtained a NNCV that takes a static monochrome image (224 × 224 pixels in size) and generates a vector of 128 32-bit dimensions describing unique facial features in the source image. As a measure of success, we calculated the Euclidean distance between the vectors generated from different images.

Using Internet search engines, we collected a training dataset of approximately 2 million openly available unlabelled real-life photos taken in uncontrolled conditions stratified by race, age and gender (using search engine queries such as ‘face photo’, ‘face pictures’, etc.). The training was conducted on a server equipped with four NVidia Titan accelerators. The trained neural network was validated on a dataset of 40,000 images belonging to 800 people, which was an out-of-sample part of the original dataset. The Euclidean distance threshold for the vectors belonging to the same person was 0.40 after the training was complete.

Finally, we trained a personality diagnostics neural network (NNPD), which was implemented as a multilayer perceptron (see Fig. 2). For that purpose, we used a training dataset (90% of the final sample) containing the questionnaire scores of 11,202 respondents and a total of 28,230 associated photographs. The NNPD takes the vector of the invariants obtained from NNCV as an input and predicts the Big Five personality traits as the output. The network was trained using the same hardware, and the training process took 9 days. The whole process was performed for male and female faces separately.

Data availability

The set of photographs is not made available because we did not solicit the consent of the study participants to publish the individual photographs. The test dataset with the observed and predicted Big Five scores is available from the openICPSR repository: https://doi.org/10.3886/E109082V1.

References

Kramer, R. S. S., King, J. E. & Ward, R. Identifying personality from the static, nonexpressive face in humans and chimpanzees: Evidence of a shared system for signaling personality. Evol. Hum. Behav. https://doi.org/10.1016/j.evolhumbehav.2010.10.005 (2011).
Walker, M. & Vetter, T. Changing the personality of a face: Perceived big two and big five personality factors modeled in real photographs. J. Pers. Soc. Psychol. 110, 609–624 (2016).
Article Google Scholar
Naumann, L. P., Vazire, S., Rentfrow, P. J. & Gosling, S. D. Personality Judgments Based on Physical Appearance. Personal. Soc. Psychol. Bull. 35, 1661–1671 (2009).
Article Google Scholar
Borkenau, P., Brecke, S., Möttig, C. & Paelecke, M. Extraversion is accurately perceived after a 50-ms exposure to a face. J. Res. Pers. 43, 703–706 (2009).
Article Google Scholar
Shevlin, M., Walker, S., Davies, M. N. O., Banyard, P. & Lewis, C. A. Can you judge a book by its cover? Evidence of self-stranger agreement on personality at zero acquaintance. Pers. Individ. Dif. https://doi.org/10.1016/S0191-8869(02)00356-2 (2003).
Penton-Voak, I. S., Pound, N., Little, A. C. & Perrett, D. I. Personality Judgments from Natural and Composite Facial Images: More Evidence For A “Kernel Of Truth” In Social Perception. Soc. Cogn. 24, 607–640 (2006).
Article Google Scholar
Little, A. C. & Perrett, D. I. Using composite images to assess accuracy in personality attribution to faces. Br. J. Psychol. 98, 111–126 (2007).
Article Google Scholar
Kramer, R. S. S. & Ward, R. Internal Facial Features are Signals of Personality and Health. Q. J. Exp. Psychol. 63, 2273–2287 (2010).
Article Google Scholar
Pound, N., Penton-Voak, I. S. & Brown, W. M. Facial symmetry is positively associated with self-reported extraversion. Pers. Individ. Dif. 43, 1572–1582 (2007).
Article Google Scholar
Lewis, G. J., Lefevre, C. E. & Bates, T. Facial width-to-height ratio predicts achievement drive in US presidents. Pers. Individ. Dif. 52, 855–857 (2012).
Article Google Scholar
Haselhuhn, M. P. & Wong, E. M. Bad to the bone: facial structure predicts unethical behaviour. Proc. R. Soc. B Biol. Sci. 279, 571 LP–576 (2012).
Article Google Scholar
Valentine, K. A., Li, N. P., Penke, L. & Perrett, D. I. Judging a Man by the Width of His Face: The Role of Facial Ratios and Dominance in Mate Choice at Speed-Dating Events. Psychol. Sci. 25, (2014).
Carre, J. M. & McCormick, C. M. In your face: facial metrics predict aggressive behaviour in the laboratory and in varsity and professional hockey players. Proc. R. Soc. B Biol. Sci. 275, 2651–2656 (2008).
Article Google Scholar
Carré, J. M., McCormick, C. M. & Mondloch, C. J. Facial structure is a reliable cue of aggressive behavior: Research report. Psychol. Sci. https://doi.org/10.1111/j.1467-9280.2009.02423.x (2009).
Haselhuhn, M. P., Ormiston, M. E. & Wong, E. M. Men’s Facial Width-to-Height Ratio Predicts Aggression: A Meta-Analysis. PLoS One 10, e0122637 (2015).
Article Google Scholar
Lefevre, C. E., Etchells, P. J., Howell, E. C., Clark, A. P. & Penton-Voak, I. S. Facial width-to-height ratio predicts self-reported dominance and aggression in males and females, but a measure of masculinity does not. Biol. Lett. 10, (2014).
Welker, K. M., Goetz, S. M. M. & Carré, J. M. Perceived and experimentally manipulated status moderates the relationship between facial structure and risk-taking. Evol. Hum. Behav. https://doi.org/10.1016/j.evolhumbehav.2015.03.006 (2015).
Geniole, S. N. & McCormick, C. M. Facing our ancestors: judgements of aggression are consistent and related to the facial width-to-height ratio in men irrespective of beards. Evol. Hum. Behav. 36, 279–285 (2015).
Article Google Scholar
Valentine, M. et al. Computer-Aided Recognition of Facial Attributes for Fetal Alcohol Spectrum Disorders. Pediatrics 140, (2017).
Ferry, Q. et al. Diagnostically relevant facial gestalt information from ordinary photos. Elife 1–22 https://doi.org/10.7554/eLife.02020.001 (2014).
Claes, P. et al. Modeling 3D Facial Shape from DNA. PLoS Genet. 10, e1004224 (2014).
Article Google Scholar
Carpenter, J. P., Garcia, J. R. & Lum, J. K. Dopamine receptor genes predict risk preferences, time preferences, and related economic choices. J. Risk Uncertain. 42, 233–261 (2011).
Article Google Scholar
Dreber, A. et al. The 7R polymorphism in the dopamine receptor D4 gene (<em>DRD4</em>) is associated with financial risk taking in men. Evol. Hum. Behav. 30, 85–92 (2009).
Article Google Scholar
Bouchard, T. J. et al. Sources of human psychological differences: the Minnesota Study of Twins Reared Apart. Science (80-.). 250, 223 LP–228 (1990).
Article ADS Google Scholar
Livesley, W. J., Jang, K. L. & Vernon, P. A. Phenotypic and genetic structure of traits delineating personality disorder. Arch. Gen. Psychiatry https://doi.org/10.1001/archpsyc.55.10.941 (1998).
Bouchard, T. J. & Loehlin, J. C. Genes, evolution, and personality. Behavior Genetics https://doi.org/10.1023/A:1012294324713 (2001).
Vukasović, T. & Bratko, D. Heritability of personality: A meta-analysis of behavior genetic studies. Psychol. Bull. 141, 769–785 (2015).
Article Google Scholar
Godinho, R. M., Spikins, P. & O’Higgins, P. Supraorbital morphology and social dynamics in human evolution. Nat. Ecol. Evol. https://doi.org/10.1038/s41559-018-0528-0 (2018).
Rhodes, G., Simmons, L. W. & Peters, M. Attractiveness and sexual behavior: Does attractiveness enhance mating success? Evol. Hum. Behav. https://doi.org/10.1016/j.evolhumbehav.2004.08.014 (2005).
Lefevre, C. E., Lewis, G. J., Perrett, D. I. & Penke, L. Telling facial metrics: Facial width is associated with testosterone levels in men. Evol. Hum. Behav. 34, 273–279 (2013).
Article Google Scholar
Whitehouse, A. J. O. et al. Prenatal testosterone exposure is related to sexually dimorphic facial morphology in adulthood. Proceedings. Biol. Sci. 282, 20151351 (2015).
Article Google Scholar
Penton-Voak, I. S. & Chen, J. Y. High salivary testosterone is linked to masculine male facial appearance in humans. Evol. Hum. Behav. https://doi.org/10.1016/j.evolhumbehav.2004.04.003 (2004).
Carré, J. M. & Archer, J. Testosterone and human behavior: the role of individual and contextual variables. Curr. Opin. Psychol. 19, 149–153 (2018).
Article Google Scholar
Swaddle, J. P. & Reierson, G. W. Testosterone increases perceived dominance but not attractiveness in human males. Proc. R. Soc. B Biol. Sci. https://doi.org/10.1098/rspb.2002.2165 (2002).
Eisenegger, C., Kumsta, R., Naef, M., Gromoll, J. & Heinrichs, M. Testosterone and androgen receptor gene polymorphism are associated with confidence and competitiveness in men. Horm. Behav. 92, 93–102 (2017).
Article CAS Google Scholar
Kaplan, H. B. Social Psychology of Self-Referent Behavior. https://doi.org/10.1007/978-1-4899-2233-5. (Springer US, 1986).
Rosenthal, R. & Jacobson, L. Pygmalion in the classroom. Urban Rev. https://doi.org/10.1007/BF02322211 (1968).
Masters, F. W. & Greaves, D. C. The Quasimodo complex. Br. J. Plast. Surg. 204–210 (1967).
Zebrowitz, L. A., Collins, M. A. & Dutta, R. The Relationship between Appearance and Personality Across the Life Span. Personal. Soc. Psychol. Bull. 24, 736–749 (1998).
Article Google Scholar
Hu, S. et al. Signatures of personality on dense 3D facial images. Sci. Rep. 7, 73 (2017).
Article ADS Google Scholar
Kosinski, M. Facial Width-to-Height Ratio Does Not Predict Self-Reported Behavioral Tendencies. Psychol. Sci. 28, 1675–1682 (2017).
Article Google Scholar
Walker, M., Schönborn, S., Greifeneder, R. & Vetter, T. The basel face database: A validated set of photographs reflecting systematic differences in big two and big five personality dimensions. PLoS One 13, (2018).
Goffaux, V. & Rossion, B. Faces are ‘spatial’ - Holistic face perception is supported by low spatial frequencies. J. Exp. Psychol. Hum. Percept. Perform. https://doi.org/10.1037/0096-1523.32.4.1023 (2006).
Schiltz, C. & Rossion, B. Faces are represented holistically in the human occipito-temporal cortex. Neuroimage https://doi.org/10.1016/j.neuroimage.2006.05.037 (2006).
Van Belle, G., De Graef, P., Verfaillie, K., Busigny, T. & Rossion, B. Whole not hole: Expert face recognition requires holistic perception. Neuropsychologia https://doi.org/10.1016/j.neuropsychologia.2010.04.034 (2010).
Quadflieg, S., Todorov, A., Laguesse, R. & Rossion, B. Normal face-based judgements of social characteristics despite severely impaired holistic face processing. Vis. cogn. 20, 865–882 (2012).
Article Google Scholar
McKone, E. Isolating the Special Component of Face Recognition: Peripheral Identification and a Mooney Face. J. Exp. Psychol. Learn. Mem. Cogn. https://doi.org/10.1037/0278-7393.30.1.181 (2004).
Sergent, J. An investigation into component and configural processes underlying face perception. Br. J. Psychol. https://doi.org/10.1111/j.2044-8295.1984.tb01895.x (1984).
Tanaka, J. W. & Farah, M. J. Parts and Wholes in Face Recognition. Q. J. Exp. Psychol. Sect. A https://doi.org/10.1080/14640749308401045 (1993).
Young, A. W., Hellawell, D. & Hay, D. C. Configurational information in face perception. Perception https://doi.org/10.1068/p160747n (2013).
Calder, A. J. & Young, A. W. Understanding the recognition of facial identity and facial expression. Nature Reviews Neuroscience https://doi.org/10.1038/nrn1724 (2005).
Todorov, A., Loehr, V. & Oosterhof, N. N. The obligatory nature of holistic processing of faces in social judgments. Perception https://doi.org/10.1068/p6501 (2010).
Junior, J. C. S. J. et al. First Impressions: A Survey on Computer Vision-Based Apparent Personality Trait Analysis. (2018).
Wang, Y. & Kosinski, M. Deep neural networks are more accurate than humans at detecting sexual orientation from facial images. J. Pers. Soc. Psychol. 114, 246–257 (2018).
Article Google Scholar
Qiu, L., Lu, J., Yang, S., Qu, W. & Zhu, T. What does your selfie say about you? Comput. Human Behav. 52, 443–449 (2015).
Article Google Scholar
Digman, J. M. Higher order factors of the Big Five. J.Pers.Soc.Psychol. https://doi.org/10.1037/0022-3514.73.6.1246 (1997).
Musek, J. A general factor of personality: Evidence for the Big One in the five-factor model. J. Res. Pers. https://doi.org/10.1016/j.jrp.2007.02.003 (2007).
DeYoung, C. G. Higher-order factors of the Big Five in a multi-informant sample. J. Pers. Soc. Psychol. 91, 1138–1151 (2006).
Article Google Scholar
Rushton, J. P. & Irwing, P. A General Factor of Personality (GFP) from two meta-analyses of the Big Five: Digman (1997) and Mount, Barrick, Scullen, and Rounds (2005). Pers. Individ. Dif. 45, 679–683 (2008).
Article Google Scholar
Wood, D., Gardner, M. H. & Harms, P. D. How functionalist and process approaches to behavior can explain trait covariation. Psychol. Rev. 122, 84–111 (2015).
Article Google Scholar
Dunlap, W. P. Generalizing the Common Language Effect Size indicator to bivariate normal correlations. Psych. Bull. 116, 509–511 (1994).
Article Google Scholar
Connolly, J. J., Kavanagh, E. J. & Viswesvaran, C. The convergent validity between self and observer ratings of personality: A meta-analytic review. Int. J. of Selection and Assessment. 15, 110–117 (2007).
Article Google Scholar
Harris, K. & Vazire, S. On friendship development and the Big Five personality traits. Soc. and Pers. Psychol. Compass. 10, 647–667 (2016).
Article Google Scholar
Weidmann, R., Schönbrodt, F. D., Ledermann, T. & Grob, A. Concurrent and longitudinal dyadic polynomial regression analyses of Big Five traits and relationship satisfaction: Does similarity matter? J. Res. in Personality. 70, 6–15 (2017).
Article Google Scholar
Cuperman, R. & Ickes, W. Big Five predictors of behavior and perceptions in initial dyadic interactions: Personality similarity helps extraverts and introverts, but hurts “disagreeables”. J. of Pers. and Soc. Psychol. 97, 667–684 (2009).
Article Google Scholar
Schmidt, F. L. & Hunter, J. E. The validity and utility of selection methods in personnel psychology: Practical and theoretical implications of 85 years of research findings. Psychol. Bull. 124, 262–274 (1998).
Article Google Scholar
Brown, M. & Sacco, D. F. Unrestricted sociosexuality predicts preferences for extraverted male faces. Pers. Individ. Dif. 108, 123–127 (2017).
Article Google Scholar
Lukaszewski, A. W. & Roney, J. R. The origins of extraversion: joint effects of facultative calibration and genetic polymorphism. Pers. Soc. Psychol. Bull. 37, 409–21 (2011).
Article Google Scholar
Curran, P. G. Methods for the detection of carelessly invalid responses in survey data. J. Exp. Soc. Psychol. 66, 4–19 (2016).
Article Google Scholar
Khromov, A. B. The five-factor questionnaire of personality [Pjatifaktornyj oprosnik lichnosti]. In Rus. (Kurgan State University, 2000).
Trizano-Hermosilla, I. & Alvarado, J. M. Best alternatives to Cronbach’s alpha reliability in realistic conditions: Congeneric and asymmetrical measurements. Front. Psychol. https://doi.org/10.3389/fpsyg.2016.00769 (2016).
Liu, Z., Luo, P., Wang, X. & Tang, X. Deep Learning Face Attributes in the Wild. in 2015 IEEE International Conference on Computer Vision (ICCV) 3730–3738 https://doi.org/10.1109/ICCV.2015.425 (IEEE, 2015).
He, K., Zhang, X., Ren, S. & Sun, J. Deep Residual Learning for Image Recognition. in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 770–778 https://doi.org/10.1109/CVPR.2016.90 (IEEE, 2016).

Download references

Acknowledgements

We appreciate the assistance of Oleg Poznyakov, who organized the data collection, and we are grateful to the anonymous peer reviewers for their detailed and insightful feedback.

Author information

Authors and Affiliations

Artificial Intelligence LLC (AIPictor), BP Mirland, 2-ya Khutorskaya ul. 38Ас15, Moscow, 127287, Russia
Alexander Kachur
National Research University Higher School of Economics, Department of Psychology, International Laboratory of Positive Psychology of Personality and Motivation, Myasnitskaya ul. 20, Moscow, 101000, Russia
Evgeny Osin
Open University for the Humanities and Economics, Department of Psychology and Pedagogy, Nizhegorodskaya ul. 32s4, Moscow, 109029, Russia
Denis Davydov
BestFitMe Ltd, 67 Grosvenor St, Mayfair, London, W1K 3JN, United Kingdom
Konstantin Shutilov & Alexey Novokshonov

Authors

Alexander Kachur
View author publications
You can also search for this author in PubMed Google Scholar
Evgeny Osin
View author publications
You can also search for this author in PubMed Google Scholar
Denis Davydov
View author publications
You can also search for this author in PubMed Google Scholar
Konstantin Shutilov
View author publications
You can also search for this author in PubMed Google Scholar
Alexey Novokshonov
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.K., E.O., D.D. and A.N. designed the study. K.S. and A.K. designed the ML algorithms and trained the ANN. A.N. contributed to the data collection. A.K., K.S. and D.D. contributed to data pre-processing. E.O., D.D. and A.K. analysed the data, contributed to the main body of the manuscript, and revised the text. A.K. prepared Figs. 1 and 2. All the authors contributed to the final version of the manuscript.

Corresponding authors

Correspondence to Alexander Kachur or Evgeny Osin.

Ethics declarations

Competing interests

A.K., K.S. and A.N. were employed by the company that provided the datasets for the research. E.O. and D.D. declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kachur, A., Osin, E., Davydov, D. et al. Assessing the Big Five personality traits using real-life static facial images. Sci Rep 10, 8487 (2020). https://doi.org/10.1038/s41598-020-65358-6

Download citation

Received: 12 April 2019
Accepted: 28 April 2020
Published: 22 May 2020
DOI: https://doi.org/10.1038/s41598-020-65358-6

This article is cited by

Personality of organizational social media accounts and its relationship with characteristics of their photos: analyses of startups’ Instagram photos
- Yunhwan Kim
BMC Psychology (2024)
Seeing the darkness: identifying the Dark Triad from emotionally neutral faces
- Danielle Haroun
- Yaarit Amram
- Joseph Glicksohn
Current Psychology (2024)
A Richer Vocabulary of Chinese Personality Traits: Leveraging Word Embedding Technology for Mining Personality Descriptors
- Yigang Ding
- Feijun Zheng
- Yiyun Jia
Journal of Psycholinguistic Research (2024)
Unravelling the many facets of human cooperation in an experimental study
- Victoria V. Rostovtseva
- Mikael Puurtinen
- Franz J. Weissing
Scientific Reports (2023)
Using deep learning to predict ideology from facial photographs: expressions, beauty, and extra-facial information
- Stig Hebbelstrup Rye Rasmussen
- Steven G. Ludeke
- Robert Klemmensen
Scientific Reports (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Prediction accuracy

Trait intercorrelations

Discussion

Methods

Sample and procedure

Ethical approval

Data screening

Big Five measure

Image screening and pre-processing

Neural network architecture

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links