A mind-brain-body dataset of MRI, EEG, cognition, emotion, and peripheral physiology in young and old adults

We present a publicly available dataset of 227 healthy participants comprising a young (N=153, 25.1±3.1 years, range 20–35 years, 45 female) and an elderly group (N=74, 67.6±4.7 years, range 59–77 years, 37 female) acquired cross-sectionally in Leipzig, Germany, between 2013 and 2015 to study mind-body-emotion interactions. During a two-day assessment, participants completed MRI at 3 Tesla (resting-state fMRI, quantitative T1 (MP2RAGE), T2-weighted, FLAIR, SWI/QSM, DWI) and a 62-channel EEG experiment at rest. During task-free resting-state fMRI, cardiovascular measures (blood pressure, heart rate, pulse, respiration) were continuously acquired. Anthropometrics, blood samples, and urine drug tests were obtained. Psychiatric symptoms were identified with Standardized Clinical Interview for DSM IV (SCID-I), Hamilton Depression Scale, and Borderline Symptoms List. Psychological assessment comprised 6 cognitive tests as well as 21 questionnaires related to emotional behavior, personality traits and tendencies, eating behavior, and addictive behavior. We provide information on study design, methods, and details of the data. This dataset is part of the larger MPI Leipzig Mind-Brain-Body database.

We present a publicly available dataset of 227 healthy participants comprising a young (N = 153, 25.1 ± 3.1 years, range 20-35 years, 45 female) and an elderly group (N = 74, 67.6 ± 4.7 years, range 59-77 years, 37 female) acquired cross-sectionally in Leipzig, Germany, between 2013 and 2015 to study mind-bodyemotion interactions. During a two-day assessment, participants completed MRI at 3 Tesla (resting-state fMRI, quantitative T1 (MP2RAGE), T2-weighted, FLAIR, SWI/QSM, DWI) and a 62-channel EEG experiment at rest. During task-free resting-state fMRI, cardiovascular measures (blood pressure, heart rate, pulse, respiration) were continuously acquired. Anthropometrics, blood samples, and urine drug tests were obtained. Psychiatric symptoms were identified with Standardized Clinical Interview for DSM IV (SCID-I), Hamilton Depression Scale, and Borderline Symptoms List. Psychological assessment comprised 6 cognitive tests as well as 21 questionnaires related to emotional behavior, personality traits and tendencies, eating behavior, and addictive behavior. We provide information on study design, methods, and details of the data. This dataset is part of the larger MPI Leipzig Mind-Brain-Body database.
Design Type(s) parallel group design • data collection and processing objective

Background & Summary
Emotions -intrinsically related to the body -have a huge influence on our behavior 1 . The connection between emotions and the body has been acknowledged by "folk psychology" 2 , language metaphors (e.g., "heart-breaking"), and scientists: in classic theories, emotions arise from our perception of bodily changes, which is understood as more 3 or less strongly influenced 4 by cognitive-evaluative processes.
Hence, emotionslike other mental processesdepend on interactions between the brain and the rest of the body. While in psychology, a lot of research measured the physiological effects of psychological manipulations, the inverse (body-mind) direction has been less frequently studied 5 . In clinical research, the opposite is true: While psychological changes after physical or somatic illness, such as depressive symptoms after stroke 6 or a cancer diagnosis [7][8][9] have been investigated, mental factors contributing to bodily diseases have received comparatively less scientific recognition. For example, psychological stress has negative influences on somatic and mental health 10 and emotional episodes like depressive symptoms have been discussed as a risk factor for stroke 11 , coronary heart disease 12 or diabetes 13 . Informed by these recent studies, we investigate psychological factors that play a key role in the pathogenesis, development, and treatment of somatic diseases in a multi-modular approach. This "mindbody-emotion" approach emphasizes the bi-directionality of brain-body interactions as they underlie mental phenomena and the importance of psychological factors for somatic health and disease. In the "Leipzig Study for Mind-Body-Emotion Interactions" (LEMON), we acquired a large dataset of physiological, psychological, and neuroimaging measures in younger and older healthy adults.
The LEMON dataset provides the following advantages: artificially separated, which hinders their comprehensive investigation. This holds true for broader fields like cognition and emotion 16 but also for more specific psychological processes like emotion regulation and value-based decision-making 17 . Particularly from a clinical viewpoint, a more integrative approach is beneficial, as risk factors for disease usually do not occur in isolationand for example mental distress, hypertension, and obesity often co-occur [18][19][20] .
In summary, the LEMON dataset is particularly suited to comprehensively relate cognitive and emotional traits or states to physiological characteristics of brain and body. While focusing on fundamental mind-body-emotion interactions in healthy younger and older adults, our data and results may inform clinical research. Here, we present the study's objectives, design, and methods together with available data types, their quality and quantities.
The dataset presented here was acquired as one of two complementary data acquisition protocols on a partially overlapping cohort of participants which constitute the MPI-Leipzig Mind-Brain-Body database. All MRI data of both projects were acquired on the same scanner. Taken in conjunction with the data acquired in the complementary project by Mendes et al. 21 , the MPI-Leipzig Mind-Brain-Body (MPILMBB) database aims to enable exploration of individual variance across a wide range of cognitive, emotional, physiological phenotypes in relation to the brain.

Participants
The total sample included 227 participants in two age groups. The young age group was between 20-35 years old (N = 153, 45 females, median age = 24 years, mean age = 25.1 years, standard deviation (SD) = 3.1) and the older age group was between 59-77 years old (N = 74, 37 females, median age = 67 years, mean age = 67.6 years, SD = 4.7). All participants were tested at the Day Clinic for Cognitive Neurology of the University Clinic Leipzig and the Max Planck Institute for Human and Cognitive and Brain Sciences (MPI CBS) in Leipzig, Germany. The study was carried out in accordance with the Declaration of Helsinki and the study protocol was approved by the ethics committee at the medical faculty of the University of Leipzig (reference number 154/13-ff).

Recruitment and Exclusion Criteria
Participants were recruited via public advertisements, leaflets, online advertisements, and information events at the University of Leipzig. Eligibility for the study was determined in two steps that are referred to as Day 0 in Fig. 1. First, we prescreened prospective participants via telephone with a semi-structured interview for study eligibility (N = 695). Individuals that did not meet any exclusion criteria in the prescreening (Table 1) were invited to MPI CBS to receive detailed information about the study in a www.nature.com/sdata/ SCIENTIFIC DATA | 6:180308 | https://doi.org/10.1038/sdata.2018.308 group briefing. There, they were informed about the study procedure and its measures with a special focus on MRI acquisition and safety. Following the group briefing, the study physician performed a second, individual screening of every participant to ensure that none of the exclusion criteria were fulfilled. Participants who were included in the study provided written informed consent prior to any data acquisition for the study (including agreement to their data being shared anonymously). Participants received monetary compensation for volunteering in the study after the completion of all assessment days. A participant was excluded because of brain pathology after completion of study, thus the total number of included participants became 227.
In round 3, 58 (23 females, young N = 29, 14 females, mean age = 25, SD = 3.7, older N = 29, 9 females, mean age = 66.6, SD = 4.6, 25.6% of total sample) participants were tested between 10/2014 and 03/2015. Round 4 consisted of 57 young males only (mean age = 25.6, SD = 2.6, 25.1% of total sample) and was acquired from 03/2015-09/2015. In round 4 we limited the sample to only male participants, since these participants were included in a follow-up stress experiment (not described here) which only included males (this was due to the attempt to replicate a previous study performed in male soldiers).
The following general study procedure was established. During the process of the study some measures were adapted and are thus not available for the total sample. Table 2 and Table 3 give a detailed overview of all measures and their availability for assessment day 1 and day 2 and Table 4 gives detailed overview of all measures and their availability for follow-up assessment days.
Participants completed two assessment days of approximately 4 hours duration each (Fig. 1). The first assessment day (day 1) included a cognitive test battery, MRI scanning, blood pressure and anthropometric measurements as well as acquisition of a blood sample. On the second assessment day (day 2), we acquired resting-state electroencephalogram (EEG) data and participants completed a psychological assessment including an emotion and personality test battery as well as a psychiatric interview. Participants were also invited for follow-up experiments, of which some measures are included here (3rd occasion of blood pressure, Future Time Perspective questionnaire, Multidimensional Mood State Questionnaire).
A complementary project by Mendes et al. 21 included 194 participants of which 109 participants completed both protocols which enables repeated-measures (e.g., test-retest) analyses. Some data from Mendes et al. will be released as part of the study described here (e.g. continuous peripheral physiological recordings during resting-state (rs) fMRI).

Psychological Assessment
Cognitive Test Battery. Cognitive tests were administered by undergraduate psychology students specifically trained in neuropsychological assessment following a standardized protocol. On day 1, participants underwent cognitive testing session of six cognitive tests in a fixed order (cf. Table 5). The subtests ("Alertness", "Incompatibility", and "Working Memory") of the "Test of Attentional Exclusion criteria via telephone screening: Self-reported ► Diagnosis of hypertension without intake of antihypertensive medication ► Any other cardiovascular disease (current and/or previous heart attack or congenital heart defect) ► History of psychiatric diseases that required inpatient treatment for longer than 2 weeks, within the last 10 years (psychosis, attempted suicide, post-traumatic stress disorder) ► History of neurological disorders (multiple sclerosis, stroke, epilepsy, brain tumor, meningoencephalitis, severe concussion) ► History of malignant diseases ► Intake of one of the following medications (centrally active medication, beta-and alpha-blocker, cortisol, any chemotherapeutic or psychopharmacological medication) ► Positive drug anamnesis (extensive alcohol, MDMA, amphetamines, cocaine, opiates, benzodiazepine, cannabis) ► MRI exclusion criteria (metallic implants, braces, non-removable piercings, tattoos, pregnancy, claustrophobia, tinnitus, surgical operation in the last 3 months) ► Previous participation in any scientific study within the last 10 years ► Previous or current enrollment in undergraduate, graduate or postgraduate psychology studies Performance" (TAP) were administered electronically via computer. An overview of this cognitive testing session is shown in Table 5, and detailed information on all measures is provided in the subsequent section.
California Verbal Learning Task (CVLT): The California Verbal Learning Task (CVLT) 22 assesses verbal learning and memory capacity. Participants are acoustically presented with 16 words, which have to be memorized and recalled or recognized several times. By quantifying, the CVLT provides how much information has been acquired over the rounds andby generating a variety of measuresit can provide information about different learning strategies. The task has two main parts (CVLT-part 1 and CVLT-part 2) and for the second part another free recall takes place after 20 min. During the interval between recall 1 and 2, typically a different non-verbal task is administered, for instance other cognitive tasks. In the present study the TAP-Test (see below) was administered, because the items are not supposed to interfere with verbal learning.
Test of Attentional Performance (TAP): Test of Attentional Performance (TAP) 23 measures different aspects of attentional processing. Here, the TAP version 2.3.1 was used. Three subtests assessed a participant's capacity of sustained attention ("TAP Alertness", and "TAP Incompatibility"-i.e. Simon effect) and working memory ("TAP Working Memory"-i.e. 2-back task) 23 . Mistakes, omissions, and reaction times in these subtests were recorded as measures of performance.
Trail Making Test (TMT): The Trail Making Test (TMT) measures cognitive flexibility, and it consists of subtest A (TMT-A) and subtest B (TMT-B) 24 . Participants are asked to quickly and correctly connect circles which are randomly distributed on a piece of paper. In TMT-A, these circles contain numbers from 1 to 25. In TMT-B, numbers and letters have to be connected in alternating and increasing order. The reaction time quantifies visual attention and executive functioning.
Wortschatztest (WST): The Vocabulary Test (Wortschatztest, WST) 25 indicates the measurement of verbal intelligence level and the assessment of language comprehension. By determining the vocabulary of a person, the WST allows estimation of his/her crystallized intelligence. It consists of 43 rows with 6 words each. In each row, participants have to identify the one word that actually exists in German.
Subtest 3 of the "Leistungsprüfsystem 2" (LPS-2): Subtest 3 of the Performance Testing System (Leistungsprüfsystem 2, LPS-2) 26 measures logical or inferential thinking and quantifies fluid intelligence. In subtest 3, participants are asked to identify the one item in a series of symbols that does not follow the logical rule of that series. The goal is to find as many items as possible within three minutes.
Regensburger Wortflüssigkeitstest (RWT): The Regensburger Word Fluency Test (Regensburger Wortflüssigkeitstest, RWT) 27 quantifies the verbal fluency of a person. In the section of "S-Words", participants have two minutes to name as many valid German words as possible that start with the letter "S". In the "Animals" section, as many animals as possible should be named within two minutes. The correct number of words quantifies formal lexical ("S-Words") or categorical-semantic fluency ("Animals").
Emotion and Personality Test Battery. On day 2, participants were asked to answer electronic version of 18 emotion-related questionnaires (cf. first18 sections below) in a randomized sequence on a computer (LimeSurvey version 2.0) 28 . The whole questionnaire completion took on average 1.5 hours to 2.5 hours with a short break after 45 min.
Besides those electronic testing sesion of 18 questionnaires, which were answered on a computer via LimeSurvey, there were pen-and-paper version of three other emotion-related questionnaires (cf. last 3 sections below) that were filled out at different time points. The Multidimensional Mood State Questionnaire (German MDBF) was answered on each of the two assessment days. After the MRI scanning session, participants filled out the New York Cognition Questionnaire (NYC-Q). The questionnaire of Future Time Perspective (FTP) was assessed during LEMON Rounds 1-3 only at the  beginning of a follow-up experiment. An overview of the individual questionnaires can be found in Table 6 (available online only) and a more detailed description is given in the section below.
Big-Five of Personality (NEO-FFI): We used the German adaptation of NEO-Five-Factor Inventory 29 to assess Costa and McCrae's Big-Five of Personality Inventory (NEO-FFI) 30 . The 60 items can be divided into the five factors of "Neuroticism", "Extraversion", "Openness to experience", "Agreeableness", and "Conscientiousness". Answers are given on a 5-point Likert scale ranging from 0 (strong denial) to 4 (strong approval).

Behavioral Inhibition and Approach System (BIS/BAS):
The German version 33 of the Behavioral Inhibition and Approach System (BIS/BAS) 34 was applied to measure reactivity of the aversive "Behavioral Inhibition" and the appetitive "Behavioral Approach" motivational systems in response to punishment or reward. This measure consists of a BIS subscale and three BAS subscales "Drive", "Reward Responsiveness", and "Fun Seeking", each consisting of 7 items. A total of 24 items are rated on a 4-point Likert-type response format ranging from 1 (does not apply to me at all) to 4 (fully applies to me).   Emotion Regulation Questionnaire (ERQ): To measure inter-individual differences in habitual emotion regulation, participants completed the German version 35 of the emotion regulation questionnaire (ERQ) 36 , which has 10 items that are answered on a 7-point Likert-type scale ranging from 1 (strongly disagree) to 7 (strongly agree). Six of the 10 items measure the tendency to use reappraisal for emotion regulation, and the other 4 items assess habitual expressive suppression.
Cognitive Emotion Regulation Questionnaire (CERQ): The Cognitive Emotion Regulation Questionnaire (CERQ) evaluates the cognitive aspects of emotion regulation 37,38 . It contains nine scales that measure five adaptive (acceptance, positive refocusing, refocusing on planning, positive reappraisal, putting into perspective) and four maladaptive emotion regulation strategies (self blame, rumination, catastrophising, blaming others) on a 5-point Likert scale from 0 (almost never) to 4 (almost always).

Affect Regulation Style (MARS):
We used the German version (external official translation, not validated yet) of the Measure of Affect Regulation Style (MARS) 39 to evaluate cognitive and behavioral aspects of emotion regulation. The scale consists of six subscales of "Behavioral Distraction", "Cognitive Distraction", "Situation-focused Strategies", "Affect-focused Strategies", "Disengagement", and "Avoidance". Ratings are given on a 7-point Likert scale ranging from 0 (not at all) to 6 (almost always). Since this data refers to the first version of a German validationa process which is still on-goingthe data should be used with caution.
Social Support Questionnaire (F-SozU K-22): Perceived social support was assessed using the German Fragebogen zur Sozialen Unterstützung 40 , the 22-item short version Social Support Questionnaire (F-SozU K-22) 41 . The scale comprises subscales of "Emotional Support", "Practical support", "Social Integration", "Availability of Trusted Person", and "Satisfaction with Social Support". The 22 items are answered on a 5point Likert scale ranging from 1 (does not apply at all) to 5 (strongly applies).

Multidimensional Scale of Perceived Social Support (MSPSS):
The German version of the Multidimensional Scale of Perceived Social Support (MSPSS) 42 was used to evaluate the perceived availability of social resources in the area of friends, family and significant others. In addition to the three subscales of the sources of social support, a sum score can be computed. Ratings can be provided on a 7point Likert scale ranging from 1 (not true at all) to 7 (very true).
Coping Orientations to Problems Experienced (Brief COPE): We used the German adaptation of the 28-item version of Brief COPE Inventory 43 to assess participants' Coping Orientations to Problems Experienced (Brief COPE) 44 . The measure consists of four subscales of "Positive Coping", "Active Coping", "Support Coping", and "Evasive Coping". The answers are rated on a 4-point Likert scale ranging from 1 (not at all) to 4 (very much).
Optimism Pessimism Questionnaire-Revised (LOT-R): The German version 45 of Life Orientation Test-Revised (LOT-R) was used to assess individual differences in generalized optimism versus pessimism 46 . The 10 items are added to an overall optimism score ranging from 0-24, with higher scores representing greater positive expectation. Answers are rated on a 5-point Likert scale ranging from 0 (does not apply at all) to 4 (strongly applies).
Perceived Stress Questionnaire (PSQ): We used the German version 47 of 20-item short version of the Perceived Stress Questionnaire (PSQ) 48 in order to assess the perception, appraisal, and processing of stressors during the last two years. The scale contains four subscales of "Worries", "Tension", "Joy", and "Demands". Answers are rated on a 4-point Likert scale from 1 (almost never) to 4 (usually).  Trier Inventory of Chronic Stress (TICS): To assess aspects of chronic stress we applied the German version 49 of the Trier Inventory of Chronic Stress (TICS) 50 . The 57-item scale comprises nine factors of chronic stress: "Work Overload", "Social Overload", "Pressure to Perform", "Work Discontent", "Excessive Demands at Work", "Lack of Social Recognition", "Social Tension", "Social Isolation", and "Chronic Worrying". Answers are rated on a 5-point Likert scale ranging from 0 (never) to 4 (very often).
Eating Behavior (FEV): The three-factor eating questionnaire (TFEQ) 51 , German version Fragebogen zum Essverhalten (FEV) 52 , was used to assess three domains of eating behavior. 'Cognitive Restraint of Eating' measures whether eating behavior is under cognitive, rather than physiological control, 'Disinhibition of Eating' measures the lack of control over eating, especially in the presence of tempting external cues or situations, and 'Susceptibility to Hunger' measures the experience of prominent and disturbing subjective hunger feelings. The 60 items are answered in different response formats ranging from dichotomous scales (applies, does not apply) to 4-point Likert scales from 1 (always) to 4 (never) or 1 (very much) to 4 (not). Item 58-60 are rated by selecting from a list of behavior descriptions.
Addicted Eating Behavior (YFAS): We applied the German version 53 of Yale Food Addiction Scale (YFAS) 54 in order to classify food-dependent eating behavior. Twenty of the total 27 items measure the seven DSM-IV-TR criteria of dependence 55 , two items measure if the eating behavior causes a clinically significant impairment, three items ask for particular foods related to the problematic eating behavior, and three items act as a primer for the other questions. The items are rated either on a 5-point Likert scale from 0 (never) to 4 (four times a week to daily) or dichotomous 0 (never) or 1 (yes).
Emotional Intelligence Questionnaire (TEIQue-SF): The 30-item short version of the Trait Emotional Intelligence Questionnaire (TEIQue-SF) 56 of German adaptation 57 was used to measure emotion-related dispositions and self-perception abilities. The scale contains the four subscales of "Well-being", "Selfcontrol", "Emotionality", and "Sociability", which can be averaged to one "Global Trait Emotional Intelligence" score. Answers are rated in a 7-point Likert format, ranging from 1 (do not agree at all) to 7 (agree completely).

State-Trait Anxiety Inventory (STAI-G-X2):
We applied the German version 58 of the Trait Scale of the State-Trait Anxiety Inventory (STAI-G-X2) short version 59 for the assessment of a situationindependent general condition of anxiety. This subscale consists of 20 items rated on a 4-point Likert scale ranging from 1 (almost never) to 4 (nearly always).

State-Trait Anger Expression Inventory (STAXI):
We used the 44-item German version 60 of the State-Trait Anger Expression Inventory (STAXI) 61 to measure the habitual experience, expression, and control of anger. We applied the four trait scales "Trait-anger", the individual anger-disposition, "Angerin", the tendency to suppress and non-verbalization of angry feelings, "Anger-out", the verbal or physical expression of anger towards others or self, and "Anger-control", which measures the attempt to control anger-expressions. All ratings were ranked rated on a 4-point Likert scale either from 1 (not at all or hardly ever) to 4 (very much or nearly always).

Toronto-Alexithymia Scale (TAS):
The German version 62 of the 26-item Toronto-Alexithymia Scale (TAS) 63 was used to measure alexithymia, difficulty experiencing, and expressing emotional states. We applied all three subscales: "difficulty with identifying feelings", "difficulty with expressing and describing feelings", and "externally-oriented thinking". Answers are rated on 5-point Likert scale from 1 (does not apply at all) to 5 (applies completely).

Multidimensional Mood State Questionnaire (MDBF):
The 24-item German version of the Multidimensional Mood State Questionnaire (German MDBF) 64 was completed by the participants on each assessment day. Mood ratings ("happy", "nervous", etc.) are ranked on a 5-point Likert scale from 1 (not at all) to 5 (very much). Three subscales can be computed along the dimensions of "good-bad", "awake-tired", "calm-nervous". New York Cognition Questionnaire (NYC-Q): After completion of the scanning session, participants filled out the New York Cognition Questionnaire (NYC-Q) 66 , which measures content and form of selfgenerated thoughts with 31 statements. The first part "Content of thoughts" is ranked on a Likert scale from 1 (did not describe my thoughts at all) to 9 (completely described my thoughts), while the second part "form of self-generated thoughts" is rated on a scale ranging from 1 (does not characterize my experience at all) to 9 (completely characterizes my experience). of an Axis 1 psychiatric disorder according to DSM IV 68 . The SCID I is a semi-structured interview that covers the major DSM-IV Axis I diagnoses. Interviews were either led by a trained psychologist or by a psychology student who had been trained to use the SCID I and supervised by a licensed psychiatrist. Documentation includes full current or history of Axis I diagnosis as well as a column with notes on noteworthy current or past subclinical symptoms beyond full fulfillment of diagnostic criteria (e.g., occasional use of an illegal drug, subclinical symptoms).

Assessment of Past and Present
Screening of Depressive Symptoms or Borderline Symptomatology (HAM-D and BSL-23): Any reported depressive symptoms were additionally assessed by a trained psychologist or trained research assistant using the Hamilton Depression Scale (HAM-D) 69 . Documentation includes the Hamilton sum score. Note that our psychiatric assessment focused primarily on present or past Axis I disorders. In addition, the Borderline Symptoms List (short version BSL-23) 70 was applied in 170 participants. This questionnaire is a self-rating instrument for borderline-typical symptomatology. Documentation includes sum scores of the BSL-23 and an additional sum score regarding borderline-typical behaviors. Additionally, participants were asked about their relationship status ("yes"/"no").
Screening for Alcohol Abuse: We also assessed alcohol consumption during the last 28 days using the Time Line Follow Back Questionnaire 71 . Using a calendar, participants self-report retrospectively the number of alcohol units consumed on each day in this period. Documentation includes the number of alcohol units consumed. The Alcohol Use Disorder Identification Test (AUDIT) 72 questionnaire was administered to screen for any indication of alcohol abuse. We additionally asked for family history of addiction in participants' 1st to 3rd degree relatives. Documentation includes presence or absence of family history of addiction.
Screening for Substance Abuse: In addition to the semi-structured interview, an in-vitro urine drug screening was performed using the "Multi 8/2 Drogen-Tauchtest" (Diagnostik Nord, Schwerin, Germany) to assess present substance use. The test detects the following substances simultaneously and up to two weeks after their administration: buprenorphine (cut-off 10 ng/mL), amphetamine (cut-off 1000 ng/mL), benzodiazepine (cut-off 300 ng/mL), cocaine (cut-off 300 ng/mL), methamphetamine (cutoff 1000 ng/mL), morphine/heroine (cut-off 300 ng/mL), methadone (cut-off 300 ng/mL), and THC (Marihuana, cut-off 50 ng/mL). Cut-off values of the tests were chosen according to recommendations of the American National Institute on Drug Abuse (NIDA) 73 . Documentation includes name of the substance detected (if any). The drug screening was performed on the second day of assessments, which was randomly assigned in order for participants not to know the date of the urine drug screening ahead of time. Moreover, it covered more than 1 week presence of any substance in urine, thus covering also assessment day 1.

Physiological data
MRI. Magnetic resonance imaging (MRI) was performed on a 3 Tesla scanner (MAGNETOM Verio, Siemens Healthcare GmbH, Erlangen, Germany) equipped with a 32-channel head coil. Over the course of MRI data acquisition, the scanner remained stable and did not undergo any major maintenance or updates which would systematically affect the quality of data provided here. This is also true in relation to the complementary protocol by Mendes et al., ensuring comparability between the studies.
The data were acquired with a very large coverage using simultaneous multi-slice acquisition to include the brain and the cerebellum. Diffusion data were acquired parallel to the AC-PC line and the volume (149.6 mm height) covered the entire brain including the cerebellum in all participants. The fMRI data were angulated by -15°(backwards) with respect to the AC-PC line. The slice block (147 mm) also covered the entire brain including the full cerebellum. The figures in Supplementary Figure S1 show cross-subject coverage of the fMRI (left) and DWI (right) data normalized to the MNI brain.
During rs-fMRI, electrocardiography (ECG), pulse, beat-to-beat blood pressure and respiration were recorded simultaneously (see section Continuous peripheral physiological recordings during rs-fMRI). Before imaging started, participants filled out the first MDBF questionnaire. Once imaging was completed, participants were asked to fill out the New York Cognition Questionnaire (NYC-Q, for details on the questionnaires see section Emotion and Personality Test Battery).
Resting-state fMRI (rs-fMRI): A T2 * -weighted gradient echo echo planar imaging (EPI) multiband BOLD rs-fMRI scan [79][80][81] was acquired to enable functional connectivity analyses. Participants were instructed to remain awake and lie still with their eyes open while looking at a low-contrast fixation cross. Data regarding sleep/wake for the rs-fMRI as such does not exist, but it is assumed that the participants were awake throughout the duration of the scan because they were requested to do so. The sequence parameters were specified as follows: TR = 1400 ms and the total number of volumes = 657 (for more www.nature.com/sdata/ SCIENTIFIC DATA | 6:180308 | https://doi.org/10.1038/sdata.2018.308 details see Table 7). The total acquisition time for rs-fMRI was 15 min 30 s. To enable correction for geometric distortions in EPI images from rs-fMRI, a gradient echo fieldmap scan and two pairs of spin echo EPI images with reversed phase encoding direction were acquired.
Resting-state fMRI Data Preprocessing: The preprocessing of the rs-fMRI data was implemented in Nipype and the details of it can be found in the complementary project by Mendes et al. 21  Axial acquisition orientation, phase encoding = A ≫ P, voxel size = 2.    87 .
Structural MRI T1 and T2: The MP2RAGE 78 sequence was acquired for assessment of brain structure with a voxel resolution of 1 mm (isotropic). Resulting T1-weighted images and quantitative T1 maps can be used for analyses of gray and white matter (e.g., cortical thickness, voxel-based morphometry), as well as for the assessment of myelin content 88,89 . Importantly, these T1-weighted images differ from MPRAGE T1-weighted images as they are uniform and free of other imaging properties (i.e. proton density, T2 * ) which can affect morphometric measurements 90 . The total acquisition time for MP2RAGE was 8 min 22 s. In addition, a standard T2-weighted volume with 1 mm isotropic resolution was acquired within 4 min 43 s (for details see Table 7).

T1 Data Preprocessing:
The preprocessing of the T1 MP2RAGE data was implemented in Nipype and the details of it can be found in the complementary project by Mendes et al. 21 . The pipeline is available at https://github.com/NeuroanatomyAndConnectivity/pipelines/tree/master/src/lsd_lemon and comprised the following steps: The background of the uniform T1-weighted image was removed using CBS Tools 91 , and the masked image was used for cortical surface reconstruction using FreeSurfer's full version of recon-all 92,93 . A brain mask was created based on the FreeSurfer segmentation results. Diffeomorphic nonlinear registration as implemented in ANTs SyN algorithm 87 was used to compute a spatial transformation between the individual's T1-weighted image and the MNI152 1mm standard space. To remove identifying information from the structural MRI scans, a mask for defacing was created from the MP2RAGE images using CBS Tools 91 . This mask was subsequently applied to all anatomical scans.
Fluid-Attenuated Inversion Recovery (FLAIR): T2-weighted FLAIR images were used for clinical screening of incidental findings. The scan was changed from a low-resolution 2D FLAIR to a 3D SPACE sequence with fluid-attenuated inversion-recovery preparation after the first 112 participants. Acquisition time for the 2D image was 4 min 42 s and 7 min 2 s for the 3D volume (for details see Table 7).
Diffusion-Weighted Imaging (DWI): We acquired axial whole brain high angular resolution diffusion-weighted images to analyze structural connectivity. The images were acquired with 1.7mm isotropic resolution using a multi-band accelerated sequence 79,81,94 and an in-plane GRAPPA 95 (60 diffusion directions, b-value = 1000 s/mm 2 , 7 b0 images, for details see Table 7). The total DWI scanning time was 9 min 27 s. To correct for geometric distortions, two volumes with reversed phase encoding (AP and PA) were acquired after the DWI sequence, lasting 1 min 59 s each. A new version of the DWI sequence (CMRR) with a faster calibration procedure was introduced after the first 112 participants which reduced the scanning time to 8 min 38 s and the time for the two scans with reversed phase encoding to 1 min 10 s each.
Susceptibility-weighted data acquisition: The visualization of magnetic susceptibility tissue differences is most commonly achieved via gradient echo data acquired using a single-or multi-echo spoiled-gradientrecalled-echo (GRE) sequence 96 . The Susceptibility-Weighted Imaging (SWI) technique capitalizes on the contrast inherent in the magnitude and phase images to improve susceptibility contrast by combining both images to enhance contrast between grey-/white-matter and water/fat, in addition to enhancing the contrast of paramagnetic elements exhibiting high densities in the brain (e.g. iron). SWI has a number of applications in the clinical setting including the diagnosis of cerebral vascular pathology and the detection of abnormal accumulation of mineral deposition. On the other hand, Quantitative Susceptibility Mapping (QSM) is a recently established technique that allows the determination of the intrinsic magnetic susceptibility properties of tissues based on signal from the phase image 97,98 . Susceptibility-weighted data were acquired using a three-dimensional (3D) flow-compensated fast low-angle shot (FLASH) sequence (for parameter details see Table 7) in a sub-sample which was introduced only after 112 participants. Highquality phase maps (i.e. excluding coil-combination pole artifacts) were reconstructed from multi-channel complex signals using an automated, data-driven coil combination method (SVD-ESPIRiT) 99,100 . Both magnitude and phase images are provided for SWI and QSM calculation which could be achieved using varied techniques 96,101 . The total time of acquisition was 7 min 50 s.
Continuous Peripheral Physiological Recordings During rs-fMRI. During the 15 min 30 s acquisition of resting-state fMRI, continuous beat-to-beat blood pressure (NIBP), electrocardiography (ECG), pulse, and respiration were recorded non-invasively with MR-compatible devices. Blood pressure and pulse via photoplethysmography were recorded with a BIOPAC MP150 acquisition system (BIOPAC Systems Inc., Goleta, CA, USA) and the acquisition software AcqKnowledge (Version 4.0, BIOPAC Systems Inc., Goleta, CA, USA). In addition to the MP150 main hardware unit, blood www.nature.com/sdata/ SCIENTIFIC DATA | 6:180308 | https://doi.org/10.1038/sdata.2018.308 EEG. Resting-state EEG (rs-EEG) was obtained in 216 participants who completed the second MDBF just before the EEG recording and underwent the Multi 8/2 drug strip test. The whole experiment session took approximately 1.5 hours (including the 16-minute EEG recording). The raw rs-EEG data with preprocessed rs-EEG and localizer files are openly available.
Resting-state EEG: A 16-min rs-EEG was recorded with a BrainAmp MR plus amplifier in an electrically shielded and sound-attenuated EEG booth using 62-channel (61 scalp electrodes plus 1 electrode recording the VEOG below the right eye) active ActiCAP electrodes (both Brain Products GmbH, Gilching, Germany) attached according to the international standard 10-20 extended localization system, also known as 10-10 system, 103 and referenced to FCz. The ground was located at the sternum and skin electrode impedance was kept below 5 KΩ. The amplitude resolution was set to 0.1 μV. EEG was recorded with a bandpass filter between 0.015 Hz and 1 kHz and digitized with a sampling rate of 2500 Hz. The EEG session comprised a total of 16 blocks, each 60 s long, 8 with eyes-closed (EC) and 8 with eyes-open (EO) (EO and EC segments being interleaved), where the recording started with eyes-closed condition. The blocks were introduced using Presentation software (version 16.5, Neurobehavioral Systems Inc., Berkeley, CA, USA). Participants were seated in front of a computer screen and asked to stay awake while fixating eyes on a black cross presented on a white background (during the eyes-open sessions).
Digitized EEG channel locations: Starting from the second round (54th participant), a Polhemus PATRIOT Motion Tracking System (Polhemus, Colchester, VT, USA) localizer was used together with the Brainstorm toolbox 104 to digitize the exact location of each 62 electrode on a participant's head relative to three fiducial points (plus 1 electrode referenced to FCz).
Resting-State EEG Data Preprocessing: Data from 13 participants were excluded due to missing event information, different sampling rate, mismatching header files or insufficient data quality. The raw EEG data from 203 participants used for preprocessing was downsampled from 2500 Hz to 250 Hz, bandpass filtered within 1-45 Hz (8th order, Butterworth filter) and split into EO and EC conditions for the subsequent analyses. Outlier channels were rejected after visual inspection for frequent jumps/shifts in voltage and poor signal quality. Data intervals containing extreme peak-to-peak deflections or large bursts of high frequency activity were identified by visual inspection and removed. Intervals containing traces from eye blinks or eye movements were not removed at this stage. Further data preprocessing was done in EEGLAB 105 (version 14.1.1b) for MATLAB (Delorme and Makeig, 2004). The dimensionality of the data was reduced using principal component analysis (PCA), by keeping PCs (NZ30) that explain 95% of the total data variance. Next, independent component analysis 106 (ICA) was performed using the Infomax (runica) algorithm. Components reflecting eye movement, eye blink or heartbeat related artifacts were removed. Retained independent components for EO (mean: 19.7, range: 9-30) and EC (mean: 21.4, range: 14-28) conditions were back-projected to sensor space for further analysis.
Additional Measures. Seated Resting Blood Pressure: Blood pressure (BP) was measured using an automatic oscillometric blood pressure monitor (OMRON M500, OMR HEM-7213-D) and a 22-42 cm arm cuff (OMRON HEM-RML30, both OMRON Medizintechnik Handelsgesellschaft mbH, Mannheim, Germany) after a seated resting period of 5 min. The BP measurements took place on three occasions throughout the course of the study: 1) before the MRI session (BP1), 2) after the MRI session (BP2), 3) at the beginning of follow-up experiments (BP3). All BP measurements were recorded at the left arm. Before the MRI session, an additional measurement at the right arm was recorded to rule out pathologic differences between right and left arm measurements. Accompanying pulse measurements at the arm (Pulse1, Pulse2) were saved during BP measurements 1 and 2. As part of the complementary project by Mendes et al. 21 , one blood pressure measurement at the left arm was taken from 91 additional participants before a rs-fMRI session that also included continuous peripheral physiological recordings (see section Continuous Peripheral Physiological Recordings During rs-fMRI).
Peripheral Blood Sample Collection and Analysis: A blood sample of approximately 70 ml in total was collected on the first assessment day after acquisition of MRI data. If the blood drawing was not possible on this date, it was acquired on the following assessment days and documented as such. The new date is mentioned in the data files. The blood was collected with four different types of sampling tubes: Serum, EDTA, Citrate and RNA. A portion of the sample was directly sent to the Institute for Laboratory Medicine, Clinical Chemistry and Molecular Diagnostics (ILM) of the Medical Faculty at the Leipzig University; the remaining samples were stored for later use. One serum tube (S-Monovette® 7.5 ml, Sarstedt), one EDTA tube (S-Monovette® 2.7 ml K3E, Sarstedt), and one citrate tube (S-Monovette® 3.0 ml 9NC, Sarstedt) were sent for direct analysis to the ILM. The remaining blood samples were divided into 10 microtainers of 2.0 ml size. Together with three EDTA tubes (S-Monovette® 2.7 ml K3E, Sarstedt) and 2 RNA tubes (Tempus TM , Applied Biosystems)-containing stabilization solution-the remaining aliquots were stored at −80°C for later use.
Anthropometry: Classical anthropometric measurements were taken according to standardized procedures by trained medical persons. Body weight was measured using an electronic scale (SECA 813, Seca Gmbh & Co KG) with a precision of 0.01 kg. The participants were barefoot, dressed with empty pockets and without outer garments. Body height of barefoot participants was measured using a stadiometer (SECA 216) to the nearest 0.1 cm. During measurement, the body of the participants were erect and centered placing feet together, the heels and the occiput touching the wall. The waist was measured 1 cm above the belly button, and the hip was measured around the widest part of the hip, with all outer garments removed. The waist and the hip were assessed by using an ergonometric circumference measuring tape (SECA 201) to the nearest 0.1 cm.
Hair Sample: To obtain the required amount of hair for the sample the hair had to be a minimum 4 cm long. Participants with colored/dyed hair were also included (as suggested by the analyzing lab), dreadlocked hair was an exclusion criterion. The hair sample was taken from the back ofand as close as possible tothe scalp (posterior vertex position). The strands were carefully placed in aluminum foil and the proximal end was marked. The sample was weighed before being sent to the laboratory. Hair sampling followed the procedure described here: http://poolux.psychopool.tu-dresden.de/dat/videos/ hmd1.mpg. Hair samples were delivered to Technische Universität Dresden (TU Dresden) laboratory for analysis (lab of Prof. Dr. C. Kirschbaum). However, the results for cortisol and other hormonal measurements contained an unusual high percentage of 0-values (cortisol 13%, progesterone 63%, corticosterone 73%). Therefore hair-derived corticosteroid measurements were deemed unreliable and will not be released.

Code Availability
All code that was implemented for MRI data acquisition and processing pipelines is available online: (https://github.com/NeuroanatomyAndConnectivity/pipelines/tree/v2.0/src/lsd_lemon/). Data handling and computation of summary measures were implemented in Python.

Data Records Data Security and Data Anonymization Procedures
To protect health information prevent direct identification, the participant were given special LEMON IDs. All the data, whether pen-and-paper, computer administered, as well as LimeSurvey, were saved only with these LEMON IDs. For public data sharing we anonymized them once more these IDs into BIDS 6digit format (010000). Thus our participants are given now IDs such as sub-010000. For internal use, the data was first saved on a MPI-CBS in-house local secured network. Later, the data for all participants was stored on our instance of the eXtensible Neuroimaging Archive Toolkit (XNAT 35) v.1.6.5. at the MPI-CBS. Access to the initial project was restricted (via XNAT's private project mode) to members of the Leipzig Study for Mind-Body-Emotion Interactions and Neuroanatomy & Connectivity Group at MPI-CBS for initial curation and quality assessment of data. All data comprised in the MPI-Leipzig Mind-Brain-Body database were derived from MPI-CBS so data import into XNAT was done from our local secured network. A specially customized XNAT uploader was used to upload all participants' data to XNAT.
The native DICOM format was used for MRI data, whilst a standard ASCII ( * .csv, * .txt) format was employed to upload all other experimental data such as surveys, test batteries, and demographical data in XNAT also in local secured network. The anonymization measures applied to the MRI data consisted of removal of DICOM header tags containing information which could lead to the identification of test participants as well as the defacing of all structural (NIFTI) scans.
This applied mainly for internal use. For releasing the data publicly, the MRI data in NIFTI files in JavaScript Object Notation (*.json) with (*.tsv) format is stored. More details regarding publicly released data can be found below in Usage Notes section.

MRI Data
All MRI datasets are made available in NIFTI format, and all anatomical scans have been defaced. For more details see Mendes et al. The dataset is organized in concordance with the Brain Imaging Data Structure (BIDS) format. This facilitates data analysis, for example with BIDS-Apps 107 (http://bids-apps. neuroimaging.io). BIDS-Apps encapsulate standard MRI analysis tools within an application that understands the BIDS format and allows automatic access to relevant data and metadata. The MRI raw and preprocessed data can be found in GWDG (https://ftp.gwdg.de/pub/misc/MPI-Leipzig_Mind-Brain-Body-LEMON/) as well as in Functional Connectomes Project International Neuroimaging Data-Sharing Initiative/Child Mind Institute (Data Citation 1) and OpenNeuro repository (Data Citation 2).

EEG Data
The raw rs-EEG data folder contains raw resting state EEG data files (Brain Vision files). The marker codings are S200 for eyes open at rest and S210 for eyes closed at rest.
The preprocessed resting state EEG data folder contains preprocessed EEG (see method section for details) saved in the standard EEGLAB 100 file format (.set and .fdt files). For each participant (N = 203) eyes-closed (EC) and eyes-open (EO) conditions are stored separately thus each having 4 files (2 for EC condition and 2 for EO), with the following naming structure: sub-BIDS condition.fdt (.set) and conditions: eyes closed (EC) or eyes open (EO). This preprocessed data has already been used in another EEG study about non-sinusoidal nature of neuronal oscillations 108 .
The digitized EEG channel locations (62) with Polhemus PATRIOT Motion Tracking System are stored in separate folder as MATLAB (.mat) files. The EEG raw and preprocessed data can be found at GWDG (https://ftp.gwdg.de/pub/misc/MPI-Leipzig_Mind-Brain-Body-LEMON/) or Functional Connectomes Project International Neuroimaging Data-Sharing Initiative/Child Mind Institute (Data Citation 1).

Emotion and Cognition Test Batteries, Assessments, and Other Protocols
The data from most questionnaires are reported as summary scores. Whenever summary scores do not provide an adequate measure, we report raw item scores, for instance the New York cognition (NYC-Q).
Questionnaires that do not come with summary scores are released as raw item scores, namely: New York Cognition Questionnaire (NYC-Q).
For each participant, the average age across the course of the study was reported which was the same in both supplementary studies of MPI Leipzig Mind-Brain-Body database. For the purpose of anonymity, the mean age was then binned into five year width (5-year bins). Cutoff values for binning were 20.0, 25.0, 30.0 and so forth. A meta file with demographic summary in * .csv format includes: gender, age (5-year bins), handedness, formal education, drug test results on day 2, smoking status, SKID, HAM-D, BLS-23, AUDIT, and relationship status. Separate subject-specific (.csv) files with information given in Text (.txt) file include the results of blood sample, blood pressure (for 3 occasions), and anthropometry.
In addition to this meta file, we include a data availability (.csv) file which includes all the LEMON data available for each specific data acquisition section with subscales (1 = available, 0 = not available).

Technical Validation
Before inclusion in the database, we manually double-checked all datasets for missing or corrupt data. Further quality control of the data was applied to the MRI and behavioral measures, as described below.

MRI Data Quality Assessment
As described in Mendes et al. 21 , we assessed the quality of preprocessed resting-state fMRI images using the mriqc package 109 , implemented in Python. Mriqc creates a report for each individual scan based on different parameters like motion, coregistration, and temporal signal-to-noise (tSNR). The details of tSNR and fieldmap correction can be found in the previous article of Mendes et al 21 . Resting-state fMRI data from 8 participants were excluded from preprocessing due to errors during data acquisition (ghosting artifact n = 2, incomplete scan n = 1), anatomical preprocessing (n = 4) or functional preprocessing (n = 1).
We visually inspected the quality assessment reports for each participant. Furthermore, frames with high motion were marked according to the framewise displacement, which was calculated as the sum of the absolute values of the six realignment parameters 110 . For comparison, all individual-level scores are displayed with respect to the group-level distribution (N = 219). In Fig. 2, the mean and maximum framewise displacement for all participants (N = 219), as well as separately for young (N = 152) and old (N = 67) participants, are given. Overall, the summary of the motion parameters of our MRI data shows that 91.78% of runs have less than one voxel (2.3 mm) maximum framewise displacement, and mean framewise displacement of 0.202 mm (SD = 0.101 mm), demonstrating sufficient quality. Mean framewise displacement was 0.165 mm (SD = 0.046 mm) in young participants and a slightly higher in the elderly group (M = 0.289, SD = 0.134).

Behavioral Measures Quality Assessment
We calculated descriptive statistics and reliability estimates of each subscale of the emotion and personality battery to ensure their general usability (see Table 6 (available online only)). Since we used mainly German questionnaires with validated factor structures (besides NYC-Q and MARS, which were translated by a professional translator), we report both the Cronbach's alpha coefficients from the original validation studies of the respective questionnaires as well as the ones calculated from our data in Table 6 (available online only). We did not compute the Cronbach's alpha coefficient for the NYC-Q as the heterogeneity of items within these questionnaires neither describe a unitary phenomenon nor are they designed to be internal consistent 111 . We recommend a factor analytic approach 66 to derive behavioral scores from this questionnaire. Moreover, for YFAS we calculated the internal consistency based on Kuder-Richardson's alpha coefficient 112 .
To further facilitate the evaluation of emotion and personality data, we plotted densities of all subscale scores for younger and older participants (see Supplementary Figure S2), which suggest sensible distributions: For example, normal distributions were observed for most personality traits (NEO-FFI), whereas skewed distributions were observed for social support variables (FSozU, MSPSS). Age-group differences observed after conservative Bonferroni correction for multiple comparisons (α = 0.0005) are in general alignment with previous reports of changes in emotional processing during aging 113,114 . Significant differences between younger and older adults emerged on 20 of 98 subscales (see Supplementary Figure S2). These results further underline the value of the LEMON dataset to examine associations of emotions and brain-body functions in healthy aging.

Usage Notes
The public dataset, protocols, and software used in the acquisition and processing of the data are documented, curated, and available for research purposes. The datasets are provided with three different tiers of access. Users are kindly asked to first agree to the terms of data usage, especially for access to behavioral data, which prohibits identifying individuals on these phenotyping data.
The complete LEMON data can be retrieved from the first location (GWDG) under point 1 (a) below. Moreover, with the complementary project by Mendes et al. the raw MRI data are currently available from the OpenNeuro and INDI mentioned under point 1 (b) below. All MRI datasets are shared in NIFTI files, and all anatomical scans have been defaced. A standard ASCII ( * .csv, * .txt) format was employed to upload all other experimental data such as surveys, test batteries, and demographical data in GWDG, and in NITRC.
b. Additionally, the complete LEMON Data is accessible via Functional Connectomes Project International Neuroimaging Data-Sharing Initiative (INDI) at Child Mind Institute (Data Citation 1).

Only MPILMBB LEMON MRI Raw Data
The OpenNeuro.org platform also hosts the raw data (Data Citation 2). The OpenNeuro repository provides API access available via https://openneuro.org/dataset/api/. In addition, similar to all other datasets in OpenNeuro, our dataset is available via Amazon Web Services S3 object data store (Data Citation 2).