A multimodal psychological, physiological and behavioural dataset for human emotions in driving tasks

Li, Wenbo; Tan, Ruichen; Xing, Yang; Li, Guofa; Li, Shen; Zeng, Guanzhong; Wang, Peizhi; Zhang, Bingbing; Su, Xinyu; Pi, Dawei; Guo, Gang; Cao, Dongpu

doi:10.1038/s41597-022-01557-2

Download PDF

Data Descriptor
Open access
Published: 06 August 2022

A multimodal psychological, physiological and behavioural dataset for human emotions in driving tasks

Wenbo Li ORCID: orcid.org/0000-0003-0407-3517^1,2,
Ruichen Tan ORCID: orcid.org/0000-0001-9585-5674²,
Yang Xing ORCID: orcid.org/0000-0002-3786-2865³,
Guofa Li¹,
Shen Li⁴,
Guanzhong Zeng¹,
Peizhi Wang¹,
Bingbing Zhang¹,
Xinyu Su¹,
Dawei Pi⁵,
Gang Guo¹ &
…
Dongpu Cao²

Scientific Data volume 9, Article number: 481 (2022) Cite this article

8123 Accesses
21 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Human emotions are integral to daily tasks, and driving is now a typical daily task. Creating a multi-modal human emotion dataset in driving tasks is an essential step in human emotion studies. we conducted three experiments to collect multimodal psychological, physiological and behavioural dataset for human emotions (PPB-Emo). In Experiment I, 27 participants were recruited, the in-depth interview method was employed to explore the driver’s viewpoints on driving scenarios that induce different emotions. For Experiment II, 409 participants were recruited, a questionnaire survey was conducted to obtain driving scenarios information that induces human drivers to produce specific emotions, and the results were used as the basis for selecting video-audio stimulus materials. In Experiment III, 40 participants were recruited, and the psychological data and physiological data, as well as their behavioural data were collected of all participants in 280 times driving tasks. The PPB-Emo dataset will largely support the analysis of human emotion in driving tasks. Moreover, The PPB-Emo dataset will also benefit human emotion research in other daily tasks.

Measurement(s)	electroencephalogram measurement • driving behaviour measurement • face expression, body gesture and road scenario measurement • emotion and persionality
Technology Type(s)	electroencephalography (EEG) • driving simulator • visual observation method • self-reported scale

K-EmoCon, a multimodal sensor dataset for continuous emotion recognition in naturalistic conversations

Article Open access 08 September 2020

Cheul Young Park, Narae Cha, … Uichin Lee

A multimodal physiological dataset for driving behaviour analysis

Article Open access 12 April 2024

Xiaoming Tao, Dingcheng Gao, … Yanjun Qin

A dataset of continuous affect annotations and physiological signals for emotion analysis

Article Open access 09 October 2019

Karan Sharma, Claudio Castellini, … Friedhelm Schwenker

Background & Summary

Human emotions are integral to daily life, and impacts a variety of cognitive abilities^1,2. Specifically,they can direct attention to key features of the environment, optimize sensory intake, tune decision making, ready behavioural responses, promote social interaction, and enhance episodic memory^3,4. Consequently, human emotions affect different tasks that humans experience every day, such as, learning, sleeping, driving, entertainment. Driving is now a typical daily task^5,6. According to the statistical report, the average driving time per person in the world exceeds 1 hour per day^7,8, which also is accompanied by serious injuries, fatalities and related costs caused by negative human emotions in driving tasks^9,10. For a long time, how to reduce accidents risk caused by emotions through studying human emotions in driving tasks has been an important research topic in many fields such as psychology, physiology, engineering, and ergonomics, and has been extensively studied^{11,12,13,14,15}.

With the advancement of sensing, machine learning, and computing systems, extensively emerging intelligent vehicles have been developed to connect with vehicles, pedestrians, infrastructures, and clouds in the transportation network^16,17. Thus, intelligent vehicles have become an intelligent mobile terminal that carries rich functions and services^18,19, which not only expand and deepen the scope of human-machine interaction but also provide an emerging and challenging research field to further study human emotions in driving tasks, namely, emotion-aware human-machine interactions^20,21. Specifically, emotion-aware human-machine interactions consists of multi-modal emotion detection and regulation, which will enhance the safety of humans in driving tasks, and will also improve their comfort and driving experience^22,23. To study the emotion-aware human-machine interactions in driving tasks, cross-disciplinary knowledge is required, including cognitive psychology, brain science, human factors, automotive engineering, and affective computing^19,24. The latest technological innovations like wearable devices have boosted the study of emotions, leading to a growing number of studies investigating the positive or negative impact of certain emotions during driving (e.g., anger²⁵, sadness²⁶). In order to measure emotions of human beings, psychological studies have revealed the multi-modal expression of emotion²⁷. Further, affective computing studies have focused on the detection of human emotion states based on different emotional expressions^6,28. Moreover, the human-machine-interface like visual and auditory interface could affect the human’s emotions in driving tasks^15,29.

To study human emotion in driving tasks, researchers need rich and repeatable datasets³⁰. Over the past decade, researchers have shared multiple human emotion datasets in driving. Table 1 summarizes the datasets that capture information of drivers in the surveyed papers. Although these datasets have contributed to successful support in human emotion studies, there is still a lack of multi-modal datasets (including psychological, physiological and behavioural data) that are dedicated to human emotions research in driving. Thus, creating a multi-modal human emotion dataset in driving is an essential step in emotion-aware human-machine interactions studies. However, as far as we know, there is no publicly available multimodal dataset of human emotions in driving tasks.

Table 1 The summary of reviewed publicly available datasets for human emotion research in driving.

Full size table

Here, we present the multimodal dataset of psychological, behavioural and physiological data for human emotion (PPB-Emo) in driving tasks. As shown in Fig. 1, we conducted three experiments to collect PPB-Emo dataset. In Experiment I, 27 participants were recruited, the in-depth interview method was employed to explore the driver’s viewpoints on driving scenarios that induce different emotions, and the results were used to develop a questionnaire. For Experiment II, 409 participants were recruited, a questionnaire survey was conducted to obtain driving scenarios information that induces human drivers to produce specific emotions, and the results were used as the basis for selecting video-audio stimulus materials. Experiment III used the video-audio clips selected in Experiment I and Experiment II as the stimulus materials for human driver’s emotion induction. In Experiment III, 40 participants were recruited, and the psychological data (self-reported dimensional emotions and discrete emotions, personality traits) and physiological data (EEG), as well as their behavioural data (driving behaviour, facial expressions, body posture, road scenario) were collected of all participants in 280 times driving tasks. The PPB-Emo dataset will largely support the analysis of human emotion-cognition-behaviour-personality in driving tasks, as well as the study in emotion detection algorithms and adaptive emotion regulation strategies. To the best of our knowledge, The PPB-Emo dataset is currently the only publicly available multimodal dataset of human emotions in driving tasks, and the PPB-Emo dataset will also benefit human emotion research in other daily tasks.

Methods

Ethics statement

This study was carried out under the requirements of the Declaration of Helsinki and the later amendments of it. The content and procedures of this study were noticed and approved by the Ethics Committee of Chongqing University Cancer Hospital(Approval number: 2019223).

The written informed consent were given by all participants before they joined in this study. A statement was informed to the participants that results of this study might be published in academic journals or books. During the experiments, participants were told about the rights they would have in experiments. They were allowed to withdraw at any time during the experiments.

The permissions to make the processed data records known to public were gained from all the participants at the end of the study. Since PPB-Emo is to be open to public access, separate consent was obtained for the disclosure of the data that contains personally identifiable information, which is the facial expression of participants during driving tasks. Additional permission was used to inform them about the data types that would be shared in public and the potential risks of re-identification that might be caused by sharing the date and time of the processed data records. The sharing permissions were given by all participants in this study.

Experiment I: in-depth interview to collect drivers’ viewpoints

Experiment I focused on the investigation of drivers’ viewpoints on driving scenarios that induce different emotions in humans.

Participants

In-depth interviews with 27 participants were conducted. The 27 participants included 6 females (22.22%) and 21 males (77.78%). The age range of the participants was 19–55 years old, with an average age of 36.81 years old (standard deviation (SD) = 9.27). Participants’ driving experience ranged from 1 to 25 years, with an average driving experience of 8.93 years(SD = 6.49). The occupations of the participants include workers, teachers, students, farmers, staffs, civil servants, drivers, etc.

Procedure

The aim of in-depth interviews was to obtain real-life scenario information that induces different emotions of human drivers and use the results to develop questionnaires. The scenario information collection procedure includes semi-structured interviews with human drivers. The interviews were based on the interview guide method³¹. All participants first signed the demographic questionnaire, and collected personal and demographic information including age, gender, driving experience, and occupation. Then, through interviews with the participants, the participants answered a set of open-ended questions (e.g., question “Could you share an experience that you felt scared while driving or even when you recalled it?”). During the answering process, the interviewer guided the participants to use their own words to recall and describe driving scenarios that trigger different emotions, including roads, weather and lighting conditions; other road users’ behaviours; events; and other contributing factors (e.g., answer “One time when I was driving on a mountain road at night, there was no one on the road. I felt very sleepy. My eyes closed a little uncontrollably. When I opened my eyes, I found that I was in a sharp bend. I stepped on the brakes. It made me feel scared.”). Each participant answered seven driving scenarios questions corresponding to different emotions. The interview time for each participant was about 30 minutes and the process was recorded.

Results of collected drivers’ viewpoints

All audio recording and on-site notes of the in-depth interviews were transcribed verbatim and analyzed using Excel files. First, the original transcripts of the 27 interviewees were broken into complete sentences. Next, the two researchers (1 male and 1 female) with expert knowledge and rich experience in drivers’ emotions analysis evaluated sorted the sentences separately and the main scenario information corresponding to the seven emotions in the statement were determined under the consensus of them. After summarizing, there are eleven kinds of scenarios that induce anger in human drivers; sixteen kinds of scenarios that induce happiness in human drivers; ten kinds of scenarios that make human drivers fear; eleven kinds of scenarios that trigger human drivers to feel disgusted; There are ten kinds of scenarios that cause human drivers to feel surprised; Relatively, few scenarios that trigger sadness and neutral of human drivers are five and six respectively. Table 2 summarizes the top five driving scenarios that induce each emotion according to the number of participants.

Table 2 Description of the top five driving scenarios that induce each emotion according to the number of participants.

Full size table

Experiment II: online questionnaire for stimulus selection

Experiment II focuses on obtaining seven driving scenarios that most effectively induce the corresponding emotions of human drivers through questionnaire surveys, as the basis for the selection of video-audio stimulus materials.

Participants

409 Chinese participants were recruited from four countries, including China, the United States, Canada, and Singapore. They were asked to complete an online questionnaire, including 146 women (35.61%) and 263 men (64.39%). The age range of the participants is 18–71 years old, and the average age is 31.34 years old (SD = 10.64). Participants’ driving experience ranges from 1–41 years, with an average driving experience of 5.87 years (SD = 6.69).

Procedure

Because online surveys can avoid geographical restrictions on data collection, and previous studies have also verified the effectiveness of online tools in assessing driving behaviour^32,33. Therefore, an online survey was conducted to collect the data in Experiment II. Based on the outcomes of Experiment I, the online questionnaire consists of two parts and a total of ten questions. The first part is the demographic background. There are three questions, including gender, age, and driving experience. The second part is based on the results of Experiment I and developed seven questions for driving scenarios that induce different emotions in human drivers. These questions correspond to seven emotions that need to be investigated. Each question describes five different driving scenarios. These scenario descriptions are derived from the top five more frequently mentioned scenarios in Experiment I. Participants were asked to select the scenarios most likely to induce corresponding emotions from the five scenarios and they can select more than one scenario (up to five) if they want. It takes about 10 minutes to complete the questionnaire.

The professional online survey platform Sojump (www.sojump.com) was used to design and post the questionnaire. Participants’ answers, region, and answer time were automatically recorded. The survey was distributed in the chat groups of social software (WeChat and QQ). To increase the involvement in the survey, participants will receive a reward of five RMB after completing the survey.

Results of stimulus selection

Participants reported the corresponding scenarios that easily induce seven kinds of emotion states (anger, fear, disgust, sadness, surprise, happiness and neutral) during driving. Table 3 presents the frequency and percentage of scenarios that easily induce 7 kinds of emotions among the 409 participants. Among them, a total of 344 participants (84.11%) thought that the scenario of “Others keep the high beam on while meeting the car, which affects the vision.” was most likely to induce their anger. 310 participants (75.79%) mentioned “Driving on a mountain road with high cliff beside.” that would make them feel fear. 351 participants (85.82%) felt disgusted when they saw the scenario “The driver in front keeps throwing garbage, water bottles, and spitting out.” A total of 271 participants (66.26%) thought that the scenario of “Witnessing an accident while driving.” was the easiest to make them sad. 307 participants (75.06%) reported that “Seeing some pedestrians walking on the highway.” would make them surprise. Regarding the happiness, 299 participants (73.11%) reported that “Noticing interesting things happened on the road and the scenery outside is very beautiful.” is the easiest to make them happy. The corresponding frequencies of scenarios are shown in Fig. 2. In addition, 273 participants (66.75%) felt neutral when driving while listening to soft music.

Table 3 Results of the online questionnaire survey for 409 participants.

Full size table

The emotions of human drivers need to be induced by appropriate stimuli to collect emotion data. Video-audio clips have been proven to reliably trigger the emotions of human driver^6,34,35. Based on the results of the questionnaire survey, we manually selected the seven most effective (the highest percentage of participants were selected in each emotional scenario) video-audio clips on the Bilibili website (https://www.bilibili.com/) to induce the corresponding emotions of the human driver in Experiment III. Bilibili is a Chinese video-sharing site where users can upload videos of their lives, and video viewers can tag or add comments to videos through a scrolling commenting system nicknamed “bullet-screen comments”, which will help us evaluate video viewers’ emotional feelings induced by the video-audio clips.

To select the most effective video-audio clips based on the results of the online survey, two research experts (1 male and 1 female) with rich experience in drivers’ emotions analysis evaluated more than 100 video-audio clips. The consensus of the two experts determined the choice of video-audio clips, and finally, 7 videos were selected for Experiment III. Notebaly, in order to make the driver feel more immersive and induce the correct emotion in Experiment III, all the selected video-audio clips in Experiment II were first-perspective of the human driver. Table 4 describes the contents of these seven clips.

Table 4 Contents description of the selected seven video-audio stimulus for human driver emotion induction.

Full size table

Experiment III: multi-modal human emotion data collection in driving tasks

The aim of Experiment III is to collect the multimodal psychological, physiological and behavioural dataset for human emotions in driving tasks.

Participants

A total of 41 drivers from Chongqing were recruited for this data collection experiment. Among these participants, the data of participant 1 was found incomplete and invalid after the collection process. The reason might due to the unexpected technical problems. Therefore, the data of 40 participants (age range = 19–58 years old, average age = 28.10 years old, SD = 9.47)) were valid in this experiment, including 31 males and 9 females. All participants had a valid driver’s license and had at least one year of driving experience (driving experience range = 1–32 years, average driving experience = 5.58 years, SD = 6.02). All participants had normal/corrected vision and hearing. Their health statuses were reported before the start of the experiment. Participants were suggested to have a regular 24-hour schedule and took no stimulating drugs or alcohol before the experiment. Each participants received a reward of 200 RMB after the experiment.

Experiment setup

The multi-modal data collection system used in this experiment mainly includes the psychological data collection module, physiological data collection module, behavioural data collection module, driver emotion induction module, driving scenarios, and data synchronization. Figure 3 shows the setup of the overall multi-modal data collection experiment. The contents of the specific modules are as follows:

Psychological data collection module

In this experiment, three self-reported scales were used to collect psychological data, including self-assessment manikin (SAM), differential emotion scale (DES), Eysenck personality questionnaire (EPQ). SAM³⁶ was used for participants to subjectively annotate their dimensional emotions. Representations of non-verbal graphical were used in SAM to evaluate the level of three dimensions (arousal, valence, and dominance). The 9-point scale (1 = “not at all”, 9 = “extremely”) SAM was used for assessment in the experiment procedure. DES³⁷ was used for participants to subjectively annotate their discrete emotions. DES is a multidimensional self-report scale for human’s emotions assessment, including ten fundamental emotions: sadness, anger, contempt, fear, shame, interest, joy, surprise, disgust, and guilt. In the experiment, the 9-point scale DES (1 = “not at all”, 9 = “extremely”) was chosen as the method to evaluate the intensity of self-reported emotions in each dimension. EPQ³⁸ with a total of 88 questions was used to assess the personality traits of participants in the experiment. EPQ is a multi-dimensional psychological measurement³⁸, which can measure the personality traits of humans, including P-Psychoticism/Socialisation, E-Extraversion/Introversion, N-Neuroticism/Stability, L-Lie/Social Desirability. The experiment used iPad (Apple, Cupertino, USA) for participants’ self-reported emotions during driving.

Physiological data collection module

An EnobioNE (Neuroelectrics, Barcelona, Spain) was used in the experiment to collect participant’s EEG physiological data. EnobioNE is a 32-channel wireless EEG device that uses a neoprene cap to fix the channel at the desired brain location. The electrical activity of the brain was recorded using the EnobioNE-32 system. Dry copper electrodes (coated with a silver layer) fixed on the cap was used to guarantee the good contact with the participant’s scalp. The amplitude resolution of EnobioNE we used was 24 bi (0.05 μV), the sampling rate was 500 Hz, and the band-pass filter was between 2 and 40 Hz. The signal was directly captured by the NIC2 software, and The software contained programs for acquiring and processing signals. During the experiment, the software filtered out electrooculogram (EOG), electromyography (EMG) and electrocardiographic (ECG) signals simultaneously. In addition, the NIC2 software associated the channels with the variable position in the international 10–10 positioning system dynamically. The alpha wave, beta wave, gamma wave, delta wave and theta wave at these positions were directly output to the computer through the NIC2 software.

Before the experiment, the researcher suggested that the participants should wash their hair in advance to avoid the poor contact of the EEG cap electrodes. After the participants put on the device correctly, the contact status of all electrodes in the EnobioNE system was checked and adjusted till a good fit was reached. In addition, a common-mode sensing electrode clamped on the right earlobe was used as a ground reference.

Behavioural data collection module

Behavioural data collection module consists of driving behaviour data collection and video data collection. Driver behaviour data was obtained using a fixed-based driving simulator (Realtime Technologies, Ann Arbor, USA). The simulator consists of a half-cab platform and an automatic transmission, providing a 270° field of view. The simulator is equipped with a rear-view mirror with a simulated projection, allowing the driver to monitor the traffic behind. Furthermore, the sound of the engine and ambient is emitted through two speakers. The woofer in the simulator simulates the vibration of the vehicle under the driver’s seat. In addition, the simulator dashboard was an LCD (resolution 1920×720, 60 Hz) screen, which was used to display the speedometer, tachometer and gear position. The data of driver behaviour, road information and vehicle posture generated by operating the driving simulator during the driver’s driving process were synchronized and recorded in real-time in the background of the main control computer.

The video data collection composed of six high-definition cameras. Five RGB cameras and one infrared camera were used in this experiment to collect the driver’s face expression, body gesture and road scenario data. The RGB camera we used was Pro Webcam C920 (Logitech, Newark, USA) with a resolution of 1920×1080 pixels, which collected data at a frame rate of 30 fps. The infrared camera we used was an industrial-grade camera with a resolution of 1080×720 pixels, a lens focal length of 2.9 mm and a shooting angle of 90 degrees without distortion. Data collection was at a frame rate of 30 fps. Six cameras were arranged in the cockpit of the driving simulator, of which three RGB cameras were located in front of the participant’s face at 40° on the left and right sides. These cameras were used to collect facial expression data, and one RGB camera was arranged in the front pillar of the driving simulator to collect the driving posture data of participants, and one RGB camera was arranged at the position of the rear-view mirror of the driving simulator to collect road scenario information during driving. Infrared cameras were placed directly in front of the participants’ faces and were also used to collect facial expression data. In addition, the camera was also used to collect the voice information of the participants during emotional driving. The LiveView software (EVtech, Changsha, China) was used to record video information simultaneously from the six high-definition cameras.

Driver’s emotion induction module

A 20-inch simulator central display (resolution 1280×1024, 60 Hz) was used in the experiment to display video-audio stimulus materials. Stereo Bluetooth speakers (Xiaomi, Shenzhen, China) were used to play audio, and the audio was set to a relatively large volume. Meanwhile, each participant was asked if the volume was comfortable for them to ensure clear hearing volume was adjusted before the experiment. Video-audio stimulus materials selected in Experiment II was used in Experiment III. To ensure that there was no human intervention in the emotion induction of participants during the experiment, the emotion induction system in this experiment was mainly composed of a master computer, a remote display and a remote Bluetooth audio playback device.

Driving scenarios

In this experiment, two simulated driving scenarios were designed: a formal experimental scenario and a simulated driving practice scenario. The practice scenario setting aims to improve the participants’ control and familiarity with the driving simulator through the practice before the formal experiment. The scenario for practice driving was an 8 km straight section of highway with bidirectional four traffic lanes. The formal experiment scenario is a two-way two-lane straight-line section with a total length of 3 km. The reason for setting these two scenarios is to minimize the requirements of complex driving conditions on the driver’s performance, to show the real multimodal responses elicited by driver emotion to the greatest extent³⁹. Participants were asked to drive in the right lane throughout the experiment, keeping the speed at about 80 km/h. The specific configuration parameters of the two experimental scenarios are shown in Table 5. The entire driving scenario uses SimVista and SimCreator software to build the driving scenarios.

Table 5 Driving scenarios details of Experiment III.

Full size table

Data synchronization

To collect and store all data synchronously, this experiment used the D-lab data collection synchronization platform (Ergoneers, Gewerbering, Germany) to collect data in multiple channels, including EEG, driving behaviour data, video data are recorded synchronously on a common time axis to achieve subsequent synchronous analysis. In addition, D-Lab was also used to manage and control the experiment.

Experiment procedure

The whole experiment process is divided into three parts: preparation, emotional driving experiments and post-experiment interviews. The overall process is shown in Fig. 4.

Experiment preparation

1.
Experiment introduction: After the participants arrive in the waiting room, the participants will be explained the purpose, the duration, and the research significance of this experiment. At the same time, the participants will be informed that the data collection apparatus of this experiment is non-invasive and radiation-free, and apparatus will not have any impact or harm on the participants’ health, and the participate voluntarily of participants will be ensured.
2.
Sign the participant inform consent form: instruct participants to read the “Participant Inform Consent Form”, the researchers will number the participants and register the basic information.
3.
Complete the health form for experiment participants: to check the health of the participants in their daily lives, and whether they have taken psychotropic drugs, cold allergy drugs or alcohol in the past 12 hours. The researchers will evaluate the situation of the participants and see if it is suitable for them to participate in the experiment.
4.
Wear testing apparatus: the researchers help the participants to wear the EEG. After wearing the EEG cap, the researchers will adjust the comfort level to observe whether the electrodes are fit and whether the signal collection is normal.
5.
Simulator practice driving: the researchers will lead the participants to sit in the cockpit and adjust the positions of the seat to a suitable position. Then, the researchers will help the participants to adapt to the speed control of the driving simulator, and remind participants to drive according to the speed signs. In the process of practice driving, the researchers will explain the formal experiment process, steps and attention to the participants in the co-pilot position.
6.
Fill in the driving simulator sickness questionnaire: check whether the participant has any physical discomfort during the driving simulator experiment.

Multimodal human emotion data collection

In the formal experiment, participants were asked to complete the driving tasks in seven emotional states (anger, sadness, fear, disgust, surprise, happiness, and neutral), in which the order of emotional induction was randomly selected. After each experiment, a 3-minute emotional cooling period was set up to allow participants to calm down from the previous period of emotions.

1.
Emotion induction: The researcher loads the preset driving scenarios program to the driving simulator, and at the same time randomly plays the video-audio clips to the participants for emotion induction. The participants watch the video-audio stimulus material and try to maintain their emotions while driving.
2.
Emotional driving: After the participant finished watching the emotion induction material, the participant starts emotional driving in D (Driving) gear, and the experimental platform starts recording data simultaneously. Participants were told to keep the speed at around 80 km/h during the emotional driving phase.
3.
Self-reported emotion: After completing a time of emotional driving, participants were required to recall the state of their emotions during their driving scenarios by completing self-assessment of the SAM scale and DES scale questionnaires.
4.
Repeat the above two steps until the participant completes seven emotional driving. After the participant completes the corresponding SAM scale and DES scale, the researcher will record the experiment process.

Post-experiment interview

After completing all the emotional driving experiments, the researcher will help the participants to remove the experimental apparatus from their bodies, and then guided the participants to complete the EPQ questionnaire.

Data Records

Dataset summary

This section discusses the organisation of PPB-Emo dataset in the Figshare⁴⁰. Table 6 summarizes the data collection of PPB-Emo dataset. After the above data collection, each participant completed 7 simulator driving and data recording. Therefore, for 40 participants, a total of 280 times driving were completed, and the length of each driving was about 135 s. To verify whether the participant experienced the target emotion in certain driving scenario, we carried out the target emotion induction success check.

Table 6 PPB-Emo dataset collection summary.

Full size table

The DES of each participant was used as the ground truth to verify whether the target emotion was generated by the participant during the emotional driving. The self-reported emotion would be selected as the ground truth when it was not consistent with the target emotion.The outcomes showed that for each emotional driving, namely anger, sadness, fear, disgust, surprise, happiness and neutral driving, 34, 38, 36, 25, 34, 36 and 37 participants were successfully induced into target emotion, respectively. At the same time, we deleted each set of data that was not successfully induced.

The resulting PPB-Emo dataset contains 240 sets of valid multimodal data from 40 participants, totaling 540 minutes of raw data. It includes psychological data, physiological data and behavioural data of 40 participants during driving tasks. Table 7 summarizes the details of the PPB-Emo dataset dataset.

Table 7 PPB-Emo dataset content.

Full size table

Dataset content

The information in participant-level was pre-processed to accomplish de-identification in accordance with the General Data Protection Regulation (GDPR)⁴¹. For time synchronization across data, we convert all timestamps from UTC + 8 to UTC + 0 and clipped the raw data. Previous studies have shown that the physiological expression of human emotions can last at least 30 s^15,42. Therefore, based on the gear change information of the driving behaviour data, we regard the multi-modal data 30 s after the participants start driving as the most effective data in data processing.

For the EEG data and driving behaviour data of each participant, we first exported the raw data from D-Lab, the data format is.txt. Then we converted the data format to.csv, and clipped the first 30 s of driving behaviour and EEG data as the most effective data, renamed and stored them respectively. For the video data of each participant, we first exported the original video data from Liveview, the data format is.mp4. Subsequently, we clipped the first 30 s of the video data as the most effective data, and divided the original shot into 6 images, respectively stored and renamed, including road scenarios video, infrared middle facial video, driving body gesture video, RGB left facial video, RGB middle face video, RGB right face video. Please note that any unedited video and raw log-level data recordings will not be provided. Meanwhile, the code for pre-processing of these data will not be in public either, because the privacy-sensitive information contained exceed the boundaries of the information we are allowed to share. More details can be found in the section of code availability.

In the root path of the dataset, it was organized into the following seven main directories: psychological data, physiological data, driving behavioural data, facial expression data, body gesture data, road scenario data, and scripts. The README.TXT file in these directories will give a detailed explanation. For each participant, a unique two-digit participant ID is assigned.

Psychological_data

This directory contains the data of participant biographic, self-reported emotion labels and personality traits as three.xlsx files.

BIO.XLSX. Each row contains the biographical data of a participant, organized by participant ID, gender (1 = male, 2 = female), age and driving age.

Emotion label.XLSX. Each row contains the self-reported data of a participant’s experienced emotions, organized according to participant ID, valence, arousal, dominance, category, and intensity. The SAM scale was used to measure valence, arousal, and dominance, the DES scale was used to measure category and intensity. The organization of content in each row is shown in the Table 10.

EPQ.XLSX. Each row contains the Eysenck personality questionnaire data of a participant, organized according to participant ID, P-score, E-score, N-score, L-score, Where P reï¬‚ects psychoticism/socialisation, E is extraversion/introversion, N means neuroticism/stability, and L is lie/social desirability.

Physiological_data

40 sub-folders are further divided in this directory, each sub-folder contains the data of all the EEG signal per participant. These sub-folders were named after the participant ID and include multiple CSV files. Each CSV file corresponds to valid emotional driving. Each row contains the EEG data at an instantaneous measurement and is organized according to rec-time, UTC, 32 channels EEG data, α, β, γ, δ, θ frequency bands data for each channel. The organization of content in each row is shown in the Table 8. Besides, the EEG montage description file is contained in the directory. The TXT file described the channels’ information created to display activity over the entire head and to provide lateralizing and localizing information, which will help the understanding and analysis of EEG data.

Table 8 Organization of the content in EEG.CSV.

Full size table

Driving_behavioural_data

40 sub-folders are further divided in this directory, each sub-folder contains all the driving behavioural data (DBD) of one participant. These sub-folders were named after the participant ID and include multiple CSV files. Each CSV file corresponds to valid emotional driving. Each row contains the driving behavioural data at an instantaneous measurement and is organized according to rec-time, UTC, acceleration, lateral-acceleration, gas-pedal-position, brake-pedal-force, gear, steering-wheel-position, velocity, lateral-velocity, x-position, y-position, z-position. The organization of content in each row is shown in the Table 9.

Table 9 Organization of the content in driving behavioural data.CSV.

Full size table

Table 10 Organization of the content in emotion labels.CSV, Note: AD = angry driving, SAD = sad driving, FD = fear driving, DD = disgust driving, SD = surprise driving, HD = happy driving, ND = neutral driving.

Full size table

Facial_expression_data

40 sub-folders are further divided in this directory, each sub-folder contains the data of all the facial expression per participant. The sub-folders are named after the participant ID and include 4 sub-sub folders which are central RGB (CRGB) facial expression data, left RGB (LRGB) facial expression data, right RGB (RRGB) facial expression data, and central infrared (CIR) facial expression data. Each folder contains multiple MP4 files, and each MP4 file corresponds to valid emotional driving. The facial expression data of a participant’s emotional driving record is shown in the Fig. 5.

Body_gesture_data

40 sub-folders are further divided in this directory, each sub-folder contains the data of body gesture per participant. These sub-folders were named after the participant ID and include multiple MP4 files. Each MP4 files file corresponds to valid emotional driving. The body gesture data of a participant’s emotional driving record is shown in the Fig. 5.

Road_scenario_data

This directory is divided into 40 sub-folders, each of which contains the data of driving road scenario per participant. These sub-folders were named after the participant ID and include multiple MP4 files. Each MP4 files file corresponds to valid emotional driving. The driving road scenario data of a participant’s emotional driving record is shown in the Fig. 5.

Selected_stimulus_clips

This directory contains one sub-folder of selected video-audio clips and one clip raw links.xlsx file.

Selected video-audio clips. This sub-folder contains seven selected video-audio clips for driver emotion induction. The target emotions corresponding to the seven clips are anger, sadness, disgust, fear, happiness, neutral and surprise.

Selected video-audio clips raw links.XLSX. Each row contains the selected video-audio clips raw link of a target emotion, organized by target emotion and raw link.

Scripts

The preprocessing and main analysis codes (Python scripts) are summarized in this directory. All results in the technical verification section can be copied using these scripts. For more details, please read the instructions in the README.md file.

Technical Validation

Our validation consists of reliability validation of emotion labels, quality validation of physiological and behavioural data, as well as correlation analysis of physiology, behaviour and emotion labels.

Reliability validation of emotion labels

In this section, a K-Means cluster algorithm⁴³ was performed to provide a intuitive visualization analysis for the distribution of 40 participants’ subjective rating scores. Then the distribution of each emotion labels was summarized using 3-dimension histograms.

Clustering and visualization analysis of emotion labels

The SAM we used evaluated emotions in three dimensions: valence, arousal and dominance. To validate the reliability, we performed data clustering and visualization analysis towards this three dimensions. To guarantee that each feature is equally treated, we non-dimensionalized the data by projecting all of the subjective scores to a range of 0 to 1 using the max-min normalization method. Then, the values of valence, arousal, and dominance were used as the coordinate values of the scatter diagram. Figure 6(A) shows the distribution of 40 participants’ rating score after normalization.

The rating scores were clustered using a K-Means algorithm, and the center of each cluster is shown in the Fig. 6(A). Seven clusters represent seven discrete emotion including, anger driving, fear driving, disgust driving, sad driving, surprise driving, happy driving and neutral driving. The center points of each cluster have no overlap and the classification of clusters is relatively obvious, especially the Happy Driving, Angry Driving and Fear Driving scenario. Other clustered points are partially overlapped due to the complexity of emotions and the fact that the participants’ different understanding of certain emotions while they did the scoring. To provide a more comprehensive display, we projected the 3-D scatter diagram into 2-D diagrams. Figure 6(B–D) shows the results after projection, and the center points of each cluster still have no overlap and the classification of clusters is relatively obvious.

Distribution analysis of emotion labels

The participants’ ratings scores of valence, arousal and dominance, emotion category and intensity were summarized in Fig. 7. The x-coordinate represents seven emotion category, the y-coordinate represents the values of the subjective scores, and the z-coordinate represents the total number of each item.

Figure 7(A) shows the distribution of valence. Valence means the positivity or negativity of an emotion according to the definition. Positive emotions have higher valence scores while negative emotions have lower valence scores. The data distribution shown in Fig. 7(A) conforms to this pattern. The valence scores of negative emotions (such as sad, angry, disgust and fear) are mainly located between 1 and 4 while the valence scores of positive or neutral emotions (such as neutral, happy and surprised) are mainly located between 5 and 9. By conforming to the regular distribution pattern, the valence score is verified.

Figure 7(B) shows the distribution of arousal. According to the definition, arousal ranges from excitement to relaxation. The data distribution shown in Fig. 7(B) consist with this pattern. As Fig. 7(B) shows, the arousal scores of neutral only located at the 1, which means the participants feel relaxed and have no positive or negative emotions at this moment. This feature meet our objective experience, which validate the reliability of arousal data.

Figure 7(C) shows the distribution of dominance. According to the valence-arousal-dominance model, dominance ranges from submissive to dominant. Fear is low-dominance and anger is high-dominance. The distribution of fear driving shown in Fig. 7(C) concentrated mainly between 1 and 3, which conforms to this pattern. The distributions of other emotions mainly located between 4 and 6, which shows that participants had a mid-level control towards these emotions. By conforming to the regular distribution pattern, the reliability of dominance score is verified.

Figure 7(D) shows the distribution of emotion intensity. Based on the definition of DES, the score of intensity and the strength of different emotions are positively related. The distribution verifies that participants’ emotions have been successfully stimulated since all of the data are distributed between 5 and 9 and most of them located between 6 and 9.

Quality validation of physiological and behavioural data

For each variable involved in the research, we performed a visual display and a quality control, and the quality of data measurements has been thoroughly tested. The relevant signals were extracted as time function. The overall results per drive after quality control is shown in Figs. 8 and 9.

Physiological data

The device we used to obtain EEG signals is a head-mounted device. The data was obtained through the electrodes. Therefore, the quantity and length of participants’ hair will affect the contact of the electrodes as well as the results. Thus, good contact is necessary to get effective data. The valid data was extracted and shown in Fig. 8, which contained the trend curves of 32 channels of EEG signals under different emotional conditions. As we can see from Fig. 8, the signals collected by most channels have obvious similarities and trends except channel 20. The signals from channel 20 are invalid due to poor contact during the experiment.

Driving behavioural data

These variables include 11 dynamics parameters that represent driving behaviour, including accelerations, degree/force of gas/brake-pedal, velocities and positions, which provide a comprehensive description of driving behaviour. Figure 9 shows the time function of different driving behaviours under seven different emotions. Each sub-figure in Fig. 9 contains 40 curves and each curve represents the time functions of a participant’s driving signals. All of the driving behaviour data were acquired through a driving simulator mentioned above. As shown in Fig. 9, the tendency of driving behaviour data is consistent in the same emotion.

Correlation analysis of physiology, behaviour and emotion labels

Figure 10 shows the Spearman correlations analysis⁴⁴ among 11 driving behaviours, 32-channel EEG signals and 3 dimensions of emotions. The correlations were shown in the form of heatmap. In Fig. 10, the data was first non-dimensionalized using the max-min normalization methods. Then, the mean and variance values of the data were calculated separately for each participant and the processed data was combined together with 3 dimensions of emotions scores. The Spearman’s correlations analysis was then used to obtain the correlation coefficients. The correlation coefficients are in a range of −1 to 1.

Physiological data and emotion labels

Fig. 10(A) shows the correlation heatmap between mean values of EEG signals and 3 dimensions scores of emotions. Correlations with high significance were noticed for the groups of valence-channel(CH)5/CH11/CH14/CH27, arousal-CH12/CH27/CH31 and dominance-CH3/CH26. Figure 10(B) shows the correlation heatmap between variance values of EEG signals and 3 dimensions scores of emotions. Correlations with high significance were noticed for the groups of valence-CH9/CH12/CH13/CH17, arousal-CH4/CH7/CH9/CH13/CH16/CH17/CH28/CH30/CH31 and dominance-CH5/CH6/CH8/CH20 /CH28/CH29.

Behavioural data and emotion labels

Fig. 10(C) shows the correlation heatmap between mean values of Driving behavioural data and 3 dimensions scores of emotions. The correlation coefficients were normalized before the process. Correlations with high significance were noticed for the group of arousal-brake pedal force. Figure 10(D) shows the correlation heatmap between variance values of Driving behavioural data and 3 dimensions scores of emotions. Correlations with high significance were noticed for the groups of valence-gas pedal degree, arousal-brake pedal force, arousal-vertical velocity/velocity and dominance-velocity.

Usage Notes

The user can use any video playback tool (e.g., QuickTime Player) to open the.MP4 file. The user can use any spreadsheet or workbook software to open the.CSV file. The data can be directly imported into Python, Matlab and other statistical or programming tools for analysis. We recommend that users check the sample report in the database for further clarification.

Potential applications

With the help of various data mining techniques, the dataset can be used for the analysis of the relationship between the emotion-physiology-behaviour-personality trait of human drivers^45,46. It can be used to analyze the driving risks caused by the emotions of human drivers^13,47. It can also be used to analyze the difference between human emotion expression in driving scenes and life scenes^6,48. The dataset can also be used to analyze the cognitive and behaviour changes of human drivers in different emotions in the driving environment, and then conduct research on human drivers’ emotion regulation strategies^46,49.

Moreover, by applying various machine learning techniques, based on the collected driving behaviour, EEG, facial expressions, driving posture, and road scene information, the dataset can be used to develop single-modal/multi-modal driver emotion monitoring algorithms^35,50. Accurate and efficient emotion monitoring algorithms will help the emotion-aware interaction between human drivers and intelligent vehicles to improve driving safety and comfort, and increase human trust in machines^46,51. Besides, The PPB-Emo dataset will also benefit human emotion research in other daily tasks.

Limitations and future works

Driver emotion induction. This study used video-audio clips to induce driver emotions. Although all these video-audio clips have been validated to be effective in eliciting expected human driver emotions, this study cannot completely rule out the possibility that there may be difference between video-audio clips induced emotion and real on-road driving scenarios induced emotion. In the future, we will further conduct on-road driving experiments to study the emotions of human drivers in real on-road driving scenarios.

Participants.There was a gender imbalance in all three experiments in this study, with a 3:1 male to female ratio. Although this is basically the same as the male-to-female ratio of Chinese drivers⁵², this may affect the use of the dataset. Future research should maintain a balanced gender ratio as much as possible. In addition, in Experiment II, 409 Chinese participants were invited to answer the online questionnaires. Since the scenarios to elicit the same emotions might vary in different cultures⁵³, the induction effect of the driving scenarios provided in Experiment II on different culture groups needs further research.

Code availability

The data pre-processing methods and procedures of validation mentioned in the technical validation section were carried out in Jupyter Notebook. Python version 3.5.8 was used throughout. The correlation analysis and distribution display are conducted using seaborn, sciki-learn⁵⁴ and pandas packages. The codes and a brief description(readme.md) have been uploaded.

References

Williams, S. E., Lenze, E. J. & Waring, J. D. Positive information facilitates response inhibition in older adults only when emotion is task-relevant. Cognition and Emotion 34, 1632–1645, https://doi.org/10.1080/02699931.2020.1793303 (2020).
Article PubMed Google Scholar
Pessoa, L. How do emotion and motivation direct executive control? Trends in cognitive sciences 13, 160–166, https://doi.org/10.1016/j.tics.2009.01.006 (2009).
Article PubMed PubMed Central Google Scholar
Gross, J. J. Handbook of emotion regulation (Guilford publications, New York City, 2013).
Hoemann, K. & Feldman Barrett, L. Concepts dissolve artificial boundaries in the study of emotion and cognition, uniting body, brain, and mind. Cognition and Emotion 33, 67–76, https://doi.org/10.1080/02699931.2018.1535428 (2019).
Article PubMed Google Scholar
Koppel, S. et al. A comparison of older drivers’ driving patterns during a naturalistic on-road driving task with patterns from their preceding four-months of real-world driving. Safety science 125, 104652, https://doi.org/10.1016/j.ssci.2020.104652 (2020).
Article Google Scholar
Li, W. et al. A spontaneous driver emotion facial expression (defe) dataset for intelligent vehicles: Emotions triggered by video-audio clips in driving scenarios. IEEE Transactions on Affective Computing 0, 1–1, https://doi.org/10.1109/TAFFC.2021.3063387 (2021).
Article Google Scholar
AAA. New american driving survey: Updated methodology and results from july 2019 to june 2020. Print at https://newsroom.aaa.com/wp-content/uploads/2021/07/New-American-Driving-Survey-Fact-Sheet-FINAL.pdf (2020).
MTPRC. The number of vehicles in china reached 380 million. Print at http://www.caam.org.cn/chn/11/cate_76/con_5233729.html (2021).
Dingus, T. A. et al. Driver crash risk factors and prevalence evaluation using naturalistic driving data. Proceedings of the National Academy of Sciences 113, 2636–2641, https://doi.org/10.1073/pnas.1513271113 (2016).
Article ADS CAS Google Scholar
Wickens, C. M., Mann, R. E., Ialomiteanu, A. R. & Stoduto, G. Do driver anger and aggression contribute to the odds of a crash? a population-level analysis. Transportation research part F: traffic psychology and behaviour 42, 389–399, https://doi.org/10.1016/j.trf.2016.03.003 (2016).
Article Google Scholar
Ellison-Potter, P., Bell, P. & Deffenbacher, J. The effects of trait driving anger, anonymity, and aggressive stimuli on aggressive driving behavior. Journal of Applied Social Psychology 31, 431–443, https://doi.org/10.1111/j.1559-1816.2001.tb00204.x (2001).
Article Google Scholar
Mesken, J., Hagenzieker, M. P., Rothengatter, T. & De Waard, D. Frequency, determinants, and consequences of different drivers’ emotions: An on-the-road study using self-reports,(observed) behaviour, and physiology. Transportation research part F: traffic psychology and behaviour 10, 458–475, https://doi.org/10.1016/j.trf.2007.05.001 (2007).
Article Google Scholar
Stephens, A. N. & Groeger, J. A. Anger-congruent behaviour transfers across driving situations. Cognition & emotion 25, 1423–1438, https://doi.org/10.1080/02699931.2010.551184 (2011).
Article Google Scholar
Deng, Z. et al. A probabilistic model for driving-style-recognition-enabled driver steering behaviors. IEEE Transactions on Systems, Man, and Cybernetics: Systems 52, 1838–1851, https://doi.org/10.1109/TSMC.2020.3037229 (2020).
Article Google Scholar
Li, W. et al. Visual-attribute-based emotion regulation of angry driving behaviours. IEEE Intelligent Transportation Systems Magazine 14, 10–28, https://doi.org/10.1109/MITS.2021.3050890 (2021).
Article CAS Google Scholar
Bigman, Y. E. & Gray, K. Life and death decisions of autonomous vehicles. Nature 579, E1–E2, https://doi.org/10.1038/s41586-020-1987-4 (2020).
Article CAS PubMed Google Scholar
Amolika, S., Sai, C., Vincent, V., Huang, C. & Vinayak, D. Crash and disengagement data of autonomous vehicles on public roads in california. Scientific Data 8, 0–0, https://doi.org/10.1038/s41597-021-01083-7 (2021).
Article Google Scholar
Zepf, S., Hernandez, J., Schmitt, A., Minker, W. & Picard, R. W. Driver emotion recognition for intelligent vehicles: a survey. ACM Computing Surveys 53, 1–30, https://doi.org/10.1145/3388790 (2020).
Article Google Scholar
Olaverri-Monreal, C. Promoting trust in self-driving vehicles. Nature Electronics 3, 292–294, https://doi.org/10.1038/s41928-020-0434-8 (2020).
Article Google Scholar
Nunes, A., Reimer, B. & Coughlin, J. F. People must retain control of autonomous vehicles. Print at https://www.nature.com/articles/d41586-018-04158-5 (2018).
McDuff, D. & Czerwinski, M. Designing emotionally sentient agents. Communications of the ACM 61, 74–83, https://doi.org/10.1145/3186591 (2018).
Article Google Scholar
ERTRAC. Connected automated driving roadmap. Print at https://www.ertrac.org/uploads/documentsearch/id57/ERTRAC-CAD-Roadmap-2019.pdf (2019).
ISO/TR-21959-1. Road vehicles - human performance and state in the context of automated driving - part 1: Common underlying concepts. Print at https://www.iso.org/standard/78088.html (2020).
Li, S. et al. Drivers’ attitudes, preference, and acceptance of in-vehicle anger intervention systems and their relationships to demographic and personality characteristics. International Journal of Industrial Ergonomics 75, 102899, https://doi.org/10.1016/j.ergon.2019.102899 (2020).
Article Google Scholar
Berdoulat, E., Vavassori, D. & Sastre, M. T. M. Driving anger, emotional and instrumental aggressiveness, and impulsiveness in the prediction of aggressive and transgressive driving. Accident Analysis & Prevention 50, 758–767, https://doi.org/10.1016/j.aap.2012.06.029 (2013).
Article Google Scholar
Jeon, M. Don’t cry while you’re driving: Sad driving is as bad as angry driving. International Journal of Human–Computer Interaction 32, 777–790, https://doi.org/10.1080/10447318.2016.1198524 (2016).
Article Google Scholar
Keltner, D., Sauter, D., Tracy, J. & Cowen, A. Emotional expression: Advances in basic emotion theory. Journal of nonverbal behavior 1–28, https://doi.org/10.1007/s10919-019-00293-3 (2019).
Lee, B. G. et al. Wearable mobile-based emotional response-monitoring system for drivers. IEEE Transactions on Human-Machine Systems 47, 636–649, https://doi.org/10.1109/THMS.2017.2658442 (2017).
Article Google Scholar
Braun, M., Weber, F. & Alt, F. Affective automotive user interfaces–reviewing the state of driver affect research and emotion regulation in the car. ACM Computing Surveys 54, 1–26, https://doi.org/10.1145/3460938 (2021).
Article Google Scholar
Zeng, Z., Pantic, M., Roisman, G. I. & Huang, T. S. A survey of affect recognition methods: Audio, visual, and spontaneous expressions. IEEE transactions on pattern analysis and machine intelligence 31, 39–58, https://doi.org/10.1109/TPAMI.2008.52 (2009).
Article PubMed Google Scholar
Patton, M. Q. Qualitative research & evaluation methods: Integrating theory and practice (Sage publications, Thousand Oaks, 2014).
Zhang, T. et al. The roles of initial trust and perceived risk in public’s acceptance of automated vehicles. Transportation research part C: emerging technologies 98, 207–220, https://doi.org/10.1016/j.trc.2018.11.018 (2019).
Article Google Scholar
Zhang, T., Chan, A. H., Li, S., Zhang, W. & Qu, X. Driving anger and its relationship with aggressive driving among chinese drivers. Transportation research part F: traffic psychology and behaviour 56, 496–507, https://doi.org/10.1016/j.trf.2018.05.011 (2018).
Article Google Scholar
Roidl, E., Siebert, F. W., Oehl, M. & Höger, R. Introducing a multivariate model for predicting driving performance: The role of driving anger and personal characteristics. Journal of safety research 47, 47–56, https://doi.org/10.1016/j.jsr.2013.08.002 (2013).
Article PubMed Google Scholar
Li, W. et al. Cogemonet: A cognitive-feature-augmented driver emotion recognition model for smart cockpit. IEEE Transactions on Computational Social Systems 9, 667–678, https://doi.org/10.1109/TCSS.2021.3127935 (2021).
Article Google Scholar
Bradley, M. M. & Lang, P. J. Measuring emotion: the self-assessment manikin and the semantic differential. Journal of behavior therapy and experimental psychiatry 25, 49–59, https://doi.org/10.1016/0005-7916(94)90063-9 (1994).
Article CAS PubMed Google Scholar
Gross, J. J. & Levenson, R. W. Emotion elicitation using films. Cognition & emotion 9, 87–108, https://doi.org/10.1080/02699939508408966 (1995).
Article Google Scholar
Eysenck, H. J. & Eysenck, S. B. G. Manual of the Eysenck Personality Questionnaire (junior & adult) (Hodder and Stoughton Educational, London, 1975).
Taamneh, S. et al. A multimodal dataset for various forms of distracted driving. Scientific data 4, 1–21 (2017).
Article Google Scholar
Li, W. PPB-Emo: A multimodal psychological, physiological and behavioural dataset for human emotions in driving tasks, figshare, https://doi.org/10.6084/m9.figshare.c.5744171.v1 (2022).
EU. General data protection regulation (gdpr). Print at https://gdpr.eu/tag/gdpr/ (2019).
Levenson, R. W., Carstensen, L. L., Friesen, W. V. & Ekman, P. Emotion, physiology, and expression in old age. Psychology and aging 6, 28, https://doi.org/10.1037/0882-7974.6.1.28 (1991).
Article CAS PubMed Google Scholar
Likas, A., Vlassis, N. & Verbeek, J. J. The global k-means clustering algorithm. Pattern recognition 36, 451–461, https://doi.org/10.1016/S0031-3203(02)00060-2 (2003).
Article ADS Google Scholar
Zar, J. H. Significance testing of the spearman rank correlation coefficient. Journal of the American Statistical Association 67, 578–580, https://doi.org/10.1080/01621459.1972.10481251 (1972).
Article MATH Google Scholar
Fairclough, S. H. & Dobbins, C. Personal informatics and negative emotions during commuter driving: effects of data visualization on cardiovascular reactivity & mood. International Journal of Human-Computer Studies 144, 102499, https://doi.org/10.1016/j.ijhcs.2020.102499 (2020).
Article Google Scholar
Schlegel, K. et al. A meta-analysis of the relationship between emotion recognition ability and intelligence. Cognition and emotion 34, 329–351, https://doi.org/10.1080/02699931.2019.1632801 (2020).
Article PubMed Google Scholar
Šeibokaitė, L., Endriulaitienė, A., Sullman, M. J., Markšaitytė, R. & Žardeckaitė-Matulaitienė, K. Difficulties in emotion regulation and risky driving among lithuanian drivers. Traffic injury prevention 18, 688–693, https://doi.org/10.1080/15389588.2017.1315109 (2017).
Article PubMed Google Scholar
Matsumoto, D. & Ekman, P. American-japanese cultural differences in intensity ratings of facial expressions of emotion. Motivation and emotion 13, 143–157, https://doi.org/10.1007/BF00992959 (1989).
Article Google Scholar
Sani, S. R. H., Tabibi, Z., Fadardi, J. S. & Stavrinos, D. Aggression, emotional self-regulation, attentional bias, and cognitive inhibition predict risky driving behavior. Accident Analysis & Prevention 109, 78–88, https://doi.org/10.1016/j.aap.2017.10.006 (2017).
Article Google Scholar
Lan, Z., Sourina, O., Wang, L., Scherer, R. & Müller-Putz, G. R. Domain adaptation techniques for eeg-based emotion recognition: a comparative study on two public datasets. IEEE Transactions on Cognitive and Developmental Systems 11, 85–94, https://doi.org/10.1109/TCDS.2018.2826840 (2018).
Article Google Scholar
Yokoi, R. & Nakayachi, K. Trust in autonomous cars: exploring the role of shared moral values, reasoning, and emotion in safety-critical decisions. Human factors 63, 1465–1484, https://doi.org/10.1177/0018720820933041 (2020).
Article PubMed Google Scholar
CAAM. the national motor vehicle ownership exceeded 395 million. Print at http://www.caam.org.cn/chn/11/cate_120/con_5235344.html (2022).
Matsumoto, D. & Hwang, H. S. Culture and emotion: The integration of biological and cultural contributions. Journal of Cross-Cultural Psychology 43, 91–118 (2012).
Article Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in python. the Journal of machine Learning research 12, 2825–2830, https://doi.org/10.5555/1953048.2078195 (2011).
Article MathSciNet MATH Google Scholar
Ma, Z., Mahmoud, M., Robinson, P., Dias, E. & Skrypchuk, L. Automatic detection of a driver’s complex mental states. In International Conference on Computational Science and Its Applications, 678–691, https://doi.org/10.1007/978-3-319-62398-6_48 (Springer, Cham, 2017).
Angkititrakul, P., Petracca, M., Sathyanarayana, A. & Hansen, J. H. Utdrive: Driver behavior and speech interactive systems for in-vehicle environments. In Intelligent Vehicles Symposium, 566–569, https://doi.org/10.1109/IVS.2007.4290175 (IEEE, New York, 2007).
Healey, J. A. & Picard, R. W. Detecting stress during real-world driving tasks using physiological sensors. IEEE Transactions on intelligent transportation systems 6, 156–166, https://doi.org/10.1109/TITS.2005.848368 (2005).
Article Google Scholar

Download references

Acknowledgements

This study was supported by the National Key Research and Development Program of China (Grant Number: 2019YFB1706102) and National Natural Science Foundation of China, the Major Project (Grant Number: 52131201). The authors would like to thank Huafei Xiao, Yujing Liu, Jiyong Xue, Yinghao Zhong, Ying Lin, Chengmou Li, Juncheng Zhang, Hao Chen for data collection.

Author information

Authors and Affiliations

College of Mechanical and Vehicle Engineering, Chongqing University, Chongqing, 400044, China
Wenbo Li, Guofa Li, Guanzhong Zeng, Peizhi Wang, Bingbing Zhang, Xinyu Su & Gang Guo
School of Vehicle and Mobility, Tsinghua University, Beijing, 100084, China
Wenbo Li, Ruichen Tan & Dongpu Cao
Department of Aerospace, Transport, and Manufacturing, Cranfield University, Cranfield, MK430AL, UK
Yang Xing
Department of Civil Engineering, Tsinghua University, Beijing, 100084, China
Shen Li
School of Mechanical Engineering, Nanjing University of Science and Technology, Nanjing, 210094, China
Dawei Pi

Authors

Wenbo Li
View author publications
You can also search for this author in PubMed Google Scholar
Ruichen Tan
View author publications
You can also search for this author in PubMed Google Scholar
Yang Xing
View author publications
You can also search for this author in PubMed Google Scholar
Guofa Li
View author publications
You can also search for this author in PubMed Google Scholar
Shen Li
View author publications
You can also search for this author in PubMed Google Scholar
Guanzhong Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Peizhi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Bingbing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xinyu Su
View author publications
You can also search for this author in PubMed Google Scholar
Dawei Pi
View author publications
You can also search for this author in PubMed Google Scholar
Gang Guo
View author publications
You can also search for this author in PubMed Google Scholar
Dongpu Cao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

W.L. designed, prepared, conducted the data collection of experiment I/II/III, performed data curation analysis, wrote and edited the manuscript. R.T. performed the technical validation, wrote and edited the manuscript. Y.X. contributed to the data analysis methodology and edited the manuscript. G.L. co-designed the experiment III and edited the manuscript. S.L. edited and revised the manuscript. G.Z. performed data curation and processing. P.W. prepared, conducted the data collection of experiment III. B.Z. prepared, conducted the data collection of experiment III. X.S. conducted the data collection of experiment I/II. D.P. advised, revised, and reviewed the manuscript. G.G. supervised the experiment, provided the resources, and reviewed the manuscript. D.C. supervised the experiment, advised, revised, and reviewed the manuscript. All authors have read and agreed to this version of the manuscript.

Corresponding authors

Correspondence to Gang Guo or Dongpu Cao.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, W., Tan, R., Xing, Y. et al. A multimodal psychological, physiological and behavioural dataset for human emotions in driving tasks. Sci Data 9, 481 (2022). https://doi.org/10.1038/s41597-022-01557-2

Download citation

Received: 14 January 2022
Accepted: 30 June 2022
Published: 06 August 2022
DOI: https://doi.org/10.1038/s41597-022-01557-2

This article is cited by

A multimodal physiological dataset for driving behaviour analysis
- Xiaoming Tao
- Dingcheng Gao
- Yanjun Qin
Scientific Data (2024)
Review and Perspectives on Human Emotion for Connected Automated Vehicles
- Wenbo Li
- Guofa Li
- Keqiang Li
Automotive Innovation (2024)
Driver Steering Behaviour Modelling Based on Neuromuscular Dynamics and Multi-Task Time-Series Transformer
- Yang Xing
- Zhongxu Hu
- Chen Lv
Automotive Innovation (2024)

Subjects

Abstract

Similar content being viewed by others

K-EmoCon, a multimodal sensor dataset for continuous emotion recognition in naturalistic conversations

A multimodal physiological dataset for driving behaviour analysis

A dataset of continuous affect annotations and physiological signals for emotion analysis

Background & Summary

Methods

Ethics statement

Experiment I: in-depth interview to collect drivers’ viewpoints

Participants

Procedure

Results of collected drivers’ viewpoints

Experiment II: online questionnaire for stimulus selection

Participants

Procedure

Results of stimulus selection

Experiment III: multi-modal human emotion data collection in driving tasks

Participants

Experiment setup

Psychological data collection module

Physiological data collection module

Behavioural data collection module

Driver’s emotion induction module

Driving scenarios

Data synchronization

Experiment procedure

Experiment preparation

Multimodal human emotion data collection

Post-experiment interview

Data Records

Dataset summary

Dataset content

Psychological_data

Physiological_data

Driving_behavioural_data

Facial_expression_data

Body_gesture_data

Road_scenario_data

Selected_stimulus_clips

Scripts

Technical Validation

Reliability validation of emotion labels

Clustering and visualization analysis of emotion labels

Distribution analysis of emotion labels

Quality validation of physiological and behavioural data

Physiological data

Driving behavioural data

Correlation analysis of physiology, behaviour and emotion labels

Physiological data and emotion labels

Behavioural data and emotion labels

Usage Notes

Potential applications

Limitations and future works

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

A multimodal physiological dataset for driving behaviour analysis

Review and Perspectives on Human Emotion for Connected Automated Vehicles

Driver Steering Behaviour Modelling Based on Neuromuscular Dynamics and Multi-Task Time-Series Transformer

Search

Quick links