Little evidence for sex or ovarian hormone influences on affective variability

Weigard, Alexander; Loviska, Amy M.; Beltz, Adriene M.

doi:10.1038/s41598-021-00143-7

Download PDF

Article
Open access
Published: 22 October 2021

Little evidence for sex or ovarian hormone influences on affective variability

Alexander Weigard¹,
Amy M. Loviska¹ &
Adriene M. Beltz¹

Scientific Reports volume 11, Article number: 20925 (2021) Cite this article

44k Accesses
14 Citations
742 Altmetric
Metrics details

Subjects

Abstract

Women were historically excluded from research participation partly due to the assumption that ovarian hormone fluctuations lead to variation, especially in emotion, that could not be experimentally controlled. Although challenged in principle and practice, relevant empirical data are limited by single measurement occasions. The current paper fills this knowledge gap using data from a 75-day intensive longitudinal study. Three indices of daily affective variability—volatility, emotional inertia, and cyclicity—were evaluated using Bayesian inferential methods in 142 men, naturally cycling women, and women using three different oral contraceptive formulations (that “stabilize” hormone fluctuations). Results provided more evidence for similarities between men and women—and between naturally cycling women and oral contraceptive users—than for differences. Even if differences exist, effects are likely small. Thus, there is little indication that ovarian hormones influence affective variability in women to a greater extent than the biopsychosocial factors that influence daily emotion in men.

A systematic review and multivariate meta-analysis of the physical and mental health benefits of touch interventions

Article Open access 08 April 2024

Microdosing with psilocybin mushrooms: a double-blind placebo-controlled study

Article Open access 02 August 2022

Sleep quality, duration, and consistency are associated with better academic performance in college students

Article Open access 01 October 2019

Introduction

Female animals and humans were excluded from biomedical, neurological, and social research for decades because cyclic fluctuations in ovarian hormones were assumed to induce variability that would undermine statistical inferences or experimental manipulations^1,2. For decades, this assumption lacked empirical support. With funding and policy mandates, however, recent years have seen increased inclusion of females in scientific research^3,4,5, and relevant data are beginning to emerge.

To-date, these data generally concern biological variability in rodents. Syntheses in several large-scale reviews and meta-analyses show that female mice are not more variable (and may even be less variable) than are males^6,7,8. Of particular relevance is a meta-analysis on sex differences in physiological and trait variability that included over 300 studies and 6,000 data points in rats⁹. No sex differences in the variability of behavior, electrophysiology, histology, or neurochemistry measures were found, nor was variability impacted by the estrous cycle.

A recent study in mice¹⁰ went beyond these investigations of inter-individual variability (how females vary from each other) to focus on intra-individual variability (how an individual varies across assessments). This is an important distinction because estrous cycle fluctuations are known to affect intra-individual variability even if they are of little consequence for inter-individual variability. Continuous measurement of locomotion and body temperature across 12 days revealed that males were actually more variable across a day, but that females had a four-day cyclic pattern that coincided with their estrous cycle. Thus, there were sex differences in intra-individual variability, but the direction depended upon timescale (i.e., 24 versus 96 h).

Although studies about intra-individual variability in rodents are not directly applicable to human biology (let alone behavior), they are nonetheless important because there is scant evidence concerning sex differences in intra-individual variability of psychological traits in humans—even traits that fluctuate with time and are associated with ovarian hormones, such as emotion. Emotion varies meaningfully across moments, days, and the lifespan^11,12. When assessed within and between days, intra-individual emotional variability can reflect individual differences and can even be considered a valid and reliable trait in its own right^11,13. It is also related to personality, especially to neuroticism^11,14,15, and to internalizing psychopathology in comprehensive meta-analyses^16,17,18.

Unfortunately, there have been few investigations of sex differences in intra-individual affective variability, despite an extensive literature on inter-individual sex differences in emotion¹⁹. As noted by others¹⁴, most studies of intra-individual emotion variability simply fail to consider gender or attempt to statistically “control” for it^{20,21,22,23,24,25,26,27,28}. This illustrates an interesting shift in the study of sex differences. Women were historically excluded due to concerns that variability would confound results, and now, women are included, but potential effects of sex are often deliberately ignored, posing problems for interpretation and effective prevention and intervention^3,4,5.

Findings from the few studies that do consider affective variability are inconsistent. For instance, there is some indication that women have greater self-reported variability, especially for negative emotions, than do men^29,30, but in other research, men and women do not differ in self reports of day-to-day distress³¹. These disparate findings are challenging to interpret because gender was not a focal variable in these aging samples (with declining ovarian function). Thus, as suggested by the unclear role of gender in a largescale meta-analysis on emotion variability, “more targeted research is needed”¹⁷ [p. 924].

Insight about sex differences in intra-individual variability in emotion, however, can be gleaned from studies of sex hormones. These studies show that sex hormone levels influence the brain and behavior, implying that emotion changes in concert with hormonal changes. Menstrual cycle studies (of natural rises and falls in endogenous ovarian hormones) and oral contraceptive (OC) studies (of self-administered exogenous ovarian hormones) are particularly insightful³². Menstrual cycle studies typically consider phases, including menstruation, follicular, and luteal. Although samples are often small and there are challenges in defining cycle phases³³, findings generally show increased negative affect preceding and during menstruation when estrogen and progesterone are low^34,35,36. This may be an indirect effect of physical premenstrual symptoms and menstruation discomfort and not a direct effect of ovarian hormones on emotion³⁷.

OC studies typically include women following regimens of active hormone and placebo pills. Active pills vary in formulation and dose, with monophasic combined OCs containing a constant dose of estradiol and a progestin, and triphasic combined OCs containing variable doses of estradiol and/or a progestin. There is no clear evidence of exogenous hormone influences on emotion³⁸. For instance, randomized trials suggest that OCs only impact emotion (both positively and negatively) for some women³⁹, as a partial function of age, length of OC use, or pre-existing conditions. Within OC users and broadly consistent with findings from menstrual cycle studies, though, there is indication of worsening mood late in the active pill phase, extending into the placebo phase for some sub-samples of women^40,41,42. Most studies also fail to consider effects of different pill formulations³²—a critical limitation because work in other domains shows that especially the androgenicity of progestins matters for outcomes⁴³.

Unique insight into ovarian hormone influences on affective variability comes from comparisons of naturally cycling (NC) women and OC users. A review of early studies is consistent with the intuition that OC users, especially users of monophasic pills, show less affective variability than NC women⁴⁴, and recent research is generally consistent^35,42,45,46. However, most studies concern inter-individual variability (despite the often rich repeated emotion assessments across phases), so it remains unclear whether OC users have reduced intra-individual variability in emotion.

Current study

Despite emerging work on the importance of affective variability as a construct, questions remain about whether this construct displays systematic sex differences that may be associated with ovarian hormonal milieus. This study aimed to fill this knowledge gap by assessing evidence for sex-linked inter-individual differences in intra-individual variation in emotion. This was accomplished by using 75-day intensive longitudinal data from men, NC women, and women using three different types of OCs (two monophasic differing in progestin androgenicity and one triphasic). Intra-individual variability was operationalized by three indices of different emotion timescales—volatility (intra-individual standard deviation), emotional inertia (first order autoregression), and cyclic patterns (maximum significant autoregression that can reflect phases)—and compared across groups. Bayesian inferential methods were used for group comparisons, as the research question concerned quantification of the degree of evidence for group differences and the effect sizes of these differences.

Methods

Data came from a parent study on sex hormones and behavior. Subsets concerning neuroticism and physical health⁴⁷ as well as gender self-concept⁴⁸ have been previously reported.

Participants

Participants were 142 men (n = 30), NC women (n = 28), and OC users (n = 84), between 18 and 38 years old (M = 21.59; SD = 3.26) recruited from a university community and small U.S. city. Most were White/Caucasian (70%) and non-Hispanic (96%), with 23% Asian, 6% Black/African American, and < 1% multi-racial. Participants were not taking psychotropic or neuroendocrine medications, and they did not report physical health or reproductive conditions impacting hormone function (e.g., polycystic ovary syndrome). Women were never pregnant, and NC women reported regular menstrual cycles⁴³. OC users were taking one of three pill formulations for at least three months⁴³: monophasic containing ethinyl estradiol and the anti-androgenic progestin drospirenone (OCd; n = 22; e.g., Yaz, Nikki), monophasic containing ethinyl estradiol and the moderately androgenic progestin norethindrone acetate (OCna; n = 30; e.g., Microgestin, Loestrin), and triphasic containing ethinyl estradiol and three doses of the mildly androgenic progestin norgestimate (OCng; n = 32; e.g., Ortho-tri-cyclen, Trinessa).

Participants completed at least 80% of the 75 daily assessments; an additional 93 participants began the intensive longitudinal study, but 60 dropped out or were removed because they had response rates under 50%, and 31 completed with response rates between 51 and 79%; 2 participants were excluded for using an alternative OC formulation. The 80% cut-off was informed by research indicating that 20% missing time series data does not significantly impact inferences⁴⁹, and empirical measure validation in this sample. Thus, 235 participants began the intensive longitudinal study, but 142 completed it and were included here.

Participant groups did not display statistically significant differences in age, F(2, 140) = 0.62, p = 0.542, or ethnicity, χ²(2) = 1.89, p = 0.389, but they did differ in race, χ²(6) = 12.97, p = 0.044, with more men endorsing an Asian identity (40%) than NC women (25%) or OC users (17%). One participant did not provide their age, four did not report ethnicity, and one did not endorse any racial identity. NC women and OC users also did not display a statistically significant difference in self-reported weight (M = 138.25, SD = 23.18), t(110) = 0.29, p = 0.772.

Procedures

In an hour-long laboratory-based session, participants provided informed consent and completed questionnaires, including reports of their medication use and reproductive health history; OC users also presented their pill packet (or a picture thereof).

The following day, participants began the 75-day intensive longitudinal study. Every night, they completed an approximately 20-min online survey that assessed daily affect on any Internet-capable device using Qualtrics. They received a unique link to each day’s survey at 5:00PM through the Qualtrics system. Participants were asked to complete the survey after 8:00PM or before going to bed; the links expired the following day at noon. All research procedures were approved by the University of Michigan Institutional Review Board for Health Sciences and Behavioral Sciences (IRB-HSBS) and were conducted in accordance with the ethical standards outlined by the Declaration of Helsinki. All participants provided written informed consent.

Participants were compensated with either course credit or $15 for the laboratory session and up to $200 for the daily assessments. They received $1 for every assessment they completed. Compensation increased to $2 if they completed at least 80%, and they received a $50 bonus if they completed at least 90%. If their completion rate fell below 50% after 30 days, then they were withdrawn from the study. The average overall study completion rate (N = 235; including all who completed 100 days) was 71%, and the average completion rate for the sample reported here was 94%.

Measures

In each daily assessment, participants completed the reliable and externally valid Positive and Negative Affect Schedule PANAS⁵⁰, which has been widely used in investigations of inter-individual variability and employed in recent investigations of intra-individual variability¹⁷. Participants rated the extent to which they experienced 10 positive affect items (e.g., Happy, Proud) and 10 negative affect items (e.g., Irritable, Afraid) in the past 24 h on a 5-point scale (from 1 = “very slightly/not at all” to 5 = “extremely”). Daily composites were created by averaging across positive and negative affect items, respectively.

Three different indices of affective variability were generated for both positive and negative affect using R⁵¹; they each summarize variability across 75 days for each participant, but consider different timescales. First, volatility was operationalized by the intra-individual standard deviation (iSD). It is the extent to which each individual’s affect varied from their own mean. It is not locked to a timescale, and so is not affected by missing data and reflects a composite of emotional peaks and valleys. It is also the most common measure of affective intra-individual variability¹⁷. Second, emotional inertia was operationalized by autocorrelation, (i.e., autoregressive coefficients for a lag of t–1); missing values were imputed using time series-based linear interpolation. This index reflects the extent to which today’s affect predicts tomorrow’s affect, or the daily carryover or persistence of emotion, and has been informative in recent investigations of affective variability¹⁷. Third, cyclicity was operationalized by the longest significant lag from t–1 through t–7 (i.e., the extent to which today’s affect predicts affect 7 days from now) again using linear interpolation to account for missing data. This novel index reflects the length of cyclic temporal patterns in daily emotion. For instance, a significant lag 5 could indicate a weekday/weekend cycle, which has been detected in previous work⁵².

Statistical analyses

Group comparisons were conducted for each index of positive and negative affective variability. Sex differences in affective variability were examined by comparing men (reference group) and NC women (focusing on endogenous ovarian hormones), and by comparing men to all women (both NC and OC, reflecting the general population, which is a mix of women who do, and do not, use hormonal contraceptives). Although the former comparison provides a controlled test of the hypothesis for naturally-occurring sex differences in affect variability, the latter has practical implications for samples recruited from the general population. Next, differences between NC women (reference group) and OC users (considering OCd, OCna, and OCng users separately) were examined to assess effects of exogenous ovarian hormones on women’s affective variability.

For all group comparisons, Bayesian and frequentist t-tests were conducted using JASP Version 0.8.6^53,54. Bayesian t-tests involve the comparison of prior and posterior probability distributions for a standard effect size parameter, δ. The prior distribution represents beliefs about the value of δ before seeing the data. As there is little information that can be used to form strong priors for δ here, the JASP default prior was used: a zero-centered Cauchy distribution with a scale of 0.707. This prior represents relatively weak beliefs about possible values of δ, holding that the effect is most likely to be small, while not ruling out the possibility of large effects (e.g., δ > 1) in either direction. The posterior distribution represents updated beliefs about the value of δ after “seeing” the data. The focus here is on the median of the posterior distribution, which indicates the most likely δ value, and on the 95% posterior credible interval (CI), which indicates the range in which δ has a 95% chance of falling.

Posteriors are only interpretable if the effect in question exists (i.e., δ ≠ 0). To quantify evidence that the effect exists, the difference between the density of the prior and posterior distributions at 0 is used to compute Bayes factors, which represent how beliefs about the null hypothesis (δ = 0) and alternative hypothesis (δ ≠ 0) change given the data. Bayes factors for the null (BF₀₁) and alternative (BF₁₀) hypotheses are interpreted as odds ratios; for example, a BF₀₁ of 3 (identical to a BF₁₀ = 0.33) indicates the data are three times more likely under the null than under the alternative hypothesis. Although Bayes factors are interpreted on a continuous scale, conventions for categorizing degrees of evidence have also been proposed: Values between 1 and 3 provide “anecdotal” evidence for a hypothesis, values between 3 and 10 provide “moderate” evidence, and values > 10 provide “strong” evidence⁵³.

Results

Results are presented in two parts. First, interrelations among the affective variability measures are reported and illustrated via plots of daily data from example participants. Second, group comparison results from Bayesian t-tests are presented. Bayesian analyses were of primary interest, but results from frequentist t-tests, which generally aligned with Bayesian results, are reported in the Supplemental Materials for completeness (Supplemental Table 1).

Descriptive statistics

Correlations among affective variability indices are shown in Table 1; descriptive statistics are reported in Supplemental Table 2. Correlations had an average magnitude of 0.22, reflecting limited overlap among emotional volatility, inertia, and cyclicity, although correlations did range from negligible to large. Individual scores on each index are shown in scatterplots in Supplemental Fig. 1 by group. To the naked eye, the means and ranges appear similar across groups, especially for volatility and inertia.

Table 1 Bayesian estimates of correlations among measures of daily positive and negative affective variability, including volatility (intra-individual standard deviation), inertia (strength of first order autoregressions), and cyclicity (longest order from 0 through 7 with a significant autoregression).

Full size table

To illustrate individual differences in indices of affective variability, Fig. 1 depicts the positive affect time series from example participants who had relatively low (< 25th percentile) and relatively high (> 75th percentile) values on each index. The participant with high volatility displays positive affect scores with repeated high amplitudes that drop off compared to the participant with low volatility, whose scores seem to hover around an imaginary mean line. Individual differences in emotional inertia are comparatively subtle. Although the participant with high inertia has scores that vary over a small range, the primary distinction is that there are only small jumps from day-to-day. Interestingly, those jumps lead to progressive shifts in positive affect across days (e.g., decline through day 15 followed by increase until days 50–60). There is little predictability, however, in positive affect scores for the participant with low emotional inertia. The participant with high cyclicity displays a clear pattern of increases followed by decreases within most 7-day periods (denoted by gray lines). The participant with a low, 0-day cycle displays no such patterns.

Group differences

Comparisons between men and NC women provided the primary test of sex differences in affective variability, and detailed Bayesian t-test results for these comparisons are shown in Fig. 2. The density plots illustrate the weak prior probability distribution for effect size δ (dashed line), the posterior probability distribution (solid line), and the difference in the density of these distributions at δ = 0 (gray dots), which is used to estimate the Bayes factor. For comparisons of all affective variability indices, the point null hypothesis (δ = 0, or no sex difference) was more likely after consideration of the data. This is reflected in the BF₀₁ values, displayed for men versus NC comparisons in Fig. 2 and for all comparisons in Fig. 3, which indicate that the null is between 2.1 and 3.7 times more likely than the alternative hypothesis (δ ≠ 0, or a sex difference). Furthermore, the posterior density plots and CIs, displayed for men versus NCs in Fig. 2 and for all comparisons in Fig. 4 (with reference lines indicating effect sizes typically considered “small”, δ =|0.20|, “medium”, δ =|0.50|, and “large” δ =|0.80|), show that, in the event that the effect size is not exactly 0, the most likely effect sizes are very small. Hence, these group differences are unlikely to be of major consequence, even if they do exist.

Comparisons between men and all women revealed moderate evidence for the null in all cases except for positive volatility, in which evidence was only anecdotal (BF₀₁ = 2.11). The range of effect sizes for positive volatility, however, was modest and mostly negative (Fig. 4), indicating that if a difference exists, then men may actually be slightly more variable than women. Posterior medians for the remaining comparisons between men and all women were close to 0 and CIs rarely exceeded small effects.

Similar patterns held for comparisons between NC women and the OCd and OCna users, visualized in Figs. 3 and 4: There was more evidence for the null hypothesis than for group differences, and the most likely effect sizes ranged from small to near-0 with CIs showing only slight overlap with medium effect sizes. Evidence for some comparisons between NC women and the OCng group, however, was mixed, as there was anecdotal evidence for differences in positive inertia (BF₁₀ = 2.12) and positive cyclicity (BF₁₀ = 1.35). The posterior medians and CIs for these comparisons indicated that, if such differences exist, women using triphasic pills with norgestimate have greater inertia and cyclicity, suggesting reduced variability.

Discussion

Implicit assumptions about how fluctuations in ovarian hormones relate to variability in emotion have contributed to the exclusion of women from research, and thus, have stymied understanding of female behavior. Although recent animal research provides little support for sex differences in trait variability, conclusions concerning human emotion—despite increasing evidence of affective variability as a marker of individual differences—are unclear. The goal of the present study was to directly address this knowledge gap in a sample of men and women with varying ovarian hormonal milieus using three indices of affective variability calculated from 75 daily assessments per person: (1) volatility; (2) emotional inertia; and (3) cyclicity.

Results provide little evidence for sex differences, and thus, empirically undermine the notion that excluding females from research studies improves inferences about affect or emotion. Specifically, evidence from Bayes factors for the lack of sex differences was an average of 3.3 times greater than evidence for the existence of sex differences. Moreover, even if sex differences exist, their most likely effect sizes (δ posterior medians) were very small and suggested that, if anything, men had more variability in positive affect than women.

These findings converge with early work on the lack of sex differences in rodent trait variability, suggesting that ovarian hormones either do not matter for affective variability, or that “male[s] feature their own sex-specific variability”⁸ [p. 3]. Thus, men and women may have similar levels of affective variability, but the mechanisms or biopsychosocial processes that underlie this variability may systematically differ between the sexes. One contributing mechanism could be ovarian hormones, which influence emotion in some women^38,42, and contributing mechanisms for some men could be testosterone or personality traits linked to dominance or aggression⁵⁵. As effects are unlikely to be homogeneous even within the sexes, future work should broadly consider sources of individual differences in affective variability.

Results also provide little evidence that affective variability is altered by systematic shifts in ovarian hormones. In comparisons between NC women and women using OCs with different formulations (i.e., varying in progestin androgenicity) and doses (i.e., monophasic versus triphasic administrations), evidence for the lack of group differences was an average of 2.7 times greater than evidence for differences. Most effect size estimates for differences were also very small to small, consistent with conclusions from early work and—once again—suggesting that ovarian hormones have only a small influence on emotion that is often superseded by other biopsychosocial influences⁵⁶. The comparison between NC women and women using triphasic OCs with norgestimate (OCng), however, ran counter to this trend. Specifically, there was modest evidence that OC users may have greater positive emotional inertia and cyclicity. This finding must be replicated, especially because past work provides little interpretative context, as it largely focused on inter-individual variability and negative affect, and did not consider OC formulations and doses^38,44.

Together, results provide little evidence for ovarian hormone-related inter-individual differences in intra-individual variation in emotion across 75 days. Past research has documented many influences on both sex hormones, ranging from biological cycles to daily experiences, including diet, sleep, social interactions, and physical exertion³³, as well as direct links between ovarian hormone levels and emotions^34,35,36. This study does not speak to those direct associations. Instead, it speaks to overall variation in emotion—whatever its origins—and whether it systematically differs in men and women or across women with varying hormonal milieus. Indeed, if ovarian hormone influences on emotion variation were so great in the everyday lives of women—great enough to exclude women from scientific research participation, for instance—then women would be expected to show clear differences in overall emotion variation from men or from each other according to their hormone profiles (e.g., having a natural menstrual cycle versus using OCs with varying pharmacokinetic properties). This study, however, does not support such conclusions.

Study considerations

There are five key features of the study that warrant additional comment. The first is linked to sample characteristics and concerns generalizing study findings. Participants were from a Western, industrialized nation, and the majority were young, White, non-Hispanic, and affiliated with a university. There were also disproportionately more Asian men than Asian NC women or OC users in the sample. Given evidence for varying sex hormone processes and the ways in which they relate to lifestyle factors across development, ethnoracial identities, and cultures^57,58, it is imperative that future work recruit individuals of diverse ages and identities, and from international, non-industrialized communities.

The second concerns emotion assessment. Although widely-used indicators of positive and negative affect were employed^50,59, corroborating measures of emotion were not available from the parent study, and findings may differ for clinical indicators of mood, particularly depression, which shows a sex difference and link to hormonal contraceptive use^60,61. Similarly, variability in daily emotion was operationalized via three indicators, but measures from different timescales (e.g., hourly) or different operationalizations (e.g., from spectral or network analyses) could provide additional insight^18,62,63. Finally, it is unclear whether time of day was related to emotion reports, especially because participants could complete daily surveys up until noon the next day, but there is no reason to think that men and women responded at systematically different times. The daily indices used here are likely reasonable: They align with a trait-like conceptualization of affective variability¹¹, have relatively low inter-relations¹⁴, and yet, produced consistent evidence.

The third concerns ovarian hormones. It is important to emphasize that findings do not have implications for the understanding of direct links between ovarian hormones and emotion (e.g., whether high levels of progesterone during the luteal phase of the menstrual cycle predict negative affect), although an exciting opportunity for future research concerns delineating these and other lifestyle factors that influence emotion variability in men and women. The study focused on group differences in trait-like affective variability (using repeated within-person assessments across two cycles)—not on comparing cycle or pill phases. Phase determination without daily hormone assessments, which are unfortunately not available in this sample, and without granular knowledge of OC use (e.g., number and timing of placebo pill days) is a process riddled with error³³.

The fourth concerns the relatively small group sizes. In frequentist hypothesis testing, they were only large enough to reliably detect moderate-to-large effects at conventional thresholds, which is why p-values were provided only for reference in Supplemental Materials. However, Bayesian hypothesis testing produced continuous estimates of evidence for and against effects in these small samples by directly quantifying uncertainty about their possible presence and size. Hence, Bayesian methods leveraged the information provided by the small groups, which are unprecedented in terms of within-person measurement for women with varying ovarian hormone profiles, in order to draw converging conclusions: Sex and ovarian hormone-related differences in affective variability are unlikely to be large enough to be practically meaningful.

The fifth concerns the use of Bayesian inferential methods. The focus was on evaluating evidence for each effect compared to a point null hypothesis using Bayes factors, and then investigating the range of possible effect sizes with posterior medians and CIs. Although the latter is only applicable if the null hypothesis is false⁵³, it is nonetheless valuable to consider CIs because evidence in favor of the null was not overwhelming (i.e., BF₀₁ was generally between 2 and 4), and because the point null hypothesis of δ = 0 is often implausible^64,65.

Conclusions

The goal of this study was to fill the significant knowledge gap concerning sex differences in affective variability and its ovarian hormone links. Using 75 daily assessments of emotion (indexed by three different timescales) in men, NC women, and women using three different types of OCs (that are thought to diminish ovarian hormone fluctuations), Bayesian inferential methods generally found evidence for group similarities (i.e., the null hypothesis) to be roughly three times greater than evidence for differences (i.e., the alternative hypothesis), and indicated that effect sizes, even if differences do exist, are likely to be very small. Thus, daily emotion fluctuates to similar extents in men and women: There may be sex differences in the factors that influence emotion, such as ovarian hormones, but those factors do not ultimately produce different outcomes with respect to affective variability. Biomedical, neurological, and social scientists are encouraged to adjust their conceptual and statistical priors accordingly.

Data availability

The data generated and analyzed during the current study are available from the corresponding author upon reasonable request.

References

Beery, A. K. & Zucker, I. Sex bias in neuroscience and biomedical research. Neurosci. Biobehav. Rev. 35(3), 565–572 (2011).
Article PubMed Google Scholar
Kim, A. M., Tingen, C. M. & Woodruff, T. K. Sex bias in trials and treatment must end. Nature 465(7299), 688–689 (2010).
Article ADS CAS PubMed Google Scholar
Beltz, A. M., Beery, A. K. & Becker, J. B. Analysis of sex differences in pre-clinical and clinical data sets. Neuropsychopharmacology 44(13), 2155–2158 (2019).
Article PubMed PubMed Central Google Scholar
Mazure, C. M. & Jones, D. P. Twenty years and still counting: Including women as participants and studying sex and gender in biomedical research. BMC Womens Health. 15, 94 (2015).
Article PubMed PubMed Central Google Scholar
McCarthy, M. M., Woolley, C. S. & Arnold, A. P. Incorporating sex as a biological variable in neuroscience: What do we gain?. Nat. Rev. Neurosci. 18, 1–2 (2017).
Article Google Scholar
Itoh, Y. & Arnold, A. P. Are females more variable than males in gene expression? Meta-analysis of microarray datasets. Biol. Sex Differ. 6, 18 (2015).
Article PubMed PubMed Central CAS Google Scholar
Prendergast, B. J., Onishi, K. G. & Zucker, I. Female mice liberated for inclusion in neuroscience and biomedical research. Neurosci. Biobehav. Rev. 40, 1–5 (2014).
Article PubMed Google Scholar
Mogil, J. S. & Chanda, M. L. The case for the inclusion of female subjects in basic science studies of pain. Pain 117(1–2), 1–5 (2005).
Article PubMed Google Scholar
Becker, J. B., Prendergast, B. J. & Liang, J. W. Female rats are not more variable than male rats: A meta-analysis of neuroscience studies. Biol. Sex Differ. 7, 34 (2016).
Article PubMed PubMed Central Google Scholar
Smarr, B. L., Grant, A. D., Zucker, I., Prendergast, B. J. & Kriegsfeld, L. J. Sex differences in variability across timescales in BALB/c mice. Biol. Sex Differ. 8, 7 (2017).
Article PubMed PubMed Central CAS Google Scholar
Eid, M. & Diener, E. Intraindividual variability in affect: Reliability, validity, and personality correlates. J. Pers. Soc. Psychol. 76(4), 662–676 (1999).
Article Google Scholar
Charles, S. T., Reynolds, C. A. & Gatz, M. Age-related differences and change in positive and negative affect over 23 years. J. Pers. Soc. Psychol. 80(1), 136–151 (2001).
Article CAS PubMed Google Scholar
Penner, L. A., Shiffman, S., Paty, J. A. & Fritzsche, B. A. Individual differences in intraperson variability in mood. J. Pers. Soc. Psychol. 66(4), 712–721 (1994).
Article CAS PubMed Google Scholar
Eaton, L. G. & Funder, D. C. Emotional experience in daily life: Valence, variability, and rate of change. Emotion 1(4), 413–421 (2001).
Article CAS PubMed Google Scholar
McConville, C. & Cooper, C. Personality correlates of variable moods. Pers. Individ. Differ. 26(1), 65–78 (1999).
Article Google Scholar
Gruber, J., Kogan, A., Quoidbach, J. & Mauss, I. B. Happiness is best kept stable: Positive emotion variability is associated with poorer psychological health. Emotion 13(1), 1–6 (2013).
Article PubMed Google Scholar
Houben, M., Van den Noortgate, W. & Kuppens, P. The relation between short-term emotion dynamics and psychological well-being: A meta-analysis. Psychol Bull. 141(4), 901–930 (2015).
Article PubMed Google Scholar
Kuppens, P., Van Mechelen, I., Nezlek, J. B., Dossche, D. & Timmermans, T. Individual differences in core affect variability and their relationship to personality and psychological adjustment. Emotion 7(2), 262–274 (2007).
Article PubMed Google Scholar
Kring, A. M. & Gordon, A. H. Sex differences in emotion: Expression, experience, and physiology. J. Pers. Soc. Psychol. 74(3), 686–703 (1998).
Article CAS PubMed Google Scholar
Brose, A., Scheibe, S. & Schmiedek, F. Life contexts make a difference: Emotional stability in younger and older adults. Psychol. Aging. 28(1), 148–159 (2013).
Article PubMed Google Scholar
Brose, A., Schmiedek, F., Lovden, M. & Lindenberger, U. Normal aging dampens the link between intrusive thoughts and negative affect in reaction to daily stressors. Psychol. Aging. 26(2), 488–502 (2011).
Article PubMed Google Scholar
Kardum, I. Affect intensity and frequency: Their relation to mean level and variability of positive and negative affect and Eysenck’s personality traits. Pers. Individ. Differ. 26(1), 33 (1999).
Article Google Scholar
Timmermans, T., Van Mechelen, I. & Kuppens, P. The relationship between individual differences in intraindividual variability in core affect and interpersonal behaviour. Eur. J. Personal. 24(8), 623–638 (2010).
Article Google Scholar
Mill, A., Realo, A. & Allik, J. Emotional variability predicts tiredness in daily life an experience sampling study. J. Individ. Differ. 37(3), 181–193 (2016).
Article Google Scholar
Sliwinski, M. J., Almeida, D. M., Smyth, J. & Stawski, R. S. Intraindividual change and variability in daily stress processes: Findings from two measurement-burst diary studies. Psychol. Aging. 24(4), 828–840 (2009).
Article PubMed PubMed Central Google Scholar
Stawski, R. S., Almeida, D. M., Sliwinski, M. J. & Smyth, J. M. Reported exposure and emotional reactivity to daily stressors: The roles of adult age and global perceived stress. Psychol. Aging. 23(1), 52–61 (2008).
Article PubMed PubMed Central Google Scholar
Ram, N., Gerstorf, D., Lindenberger, U. & Smith, J. Developmental change and intraindividual variability: Relating cognitive aging to cognitive plasticity, cardiovascular lability, and emotional diversity. Psychol. Aging. 26(2), 363–371 (2011).
Article PubMed PubMed Central Google Scholar
Röcke, C., Li, S. & Smith, J. Intraindividual variability in positive and negative affect over 45 days: Do older adults fluctuate less than young adults?. Psychol. Aging. 24(4), 863–878 (2009).
Article PubMed Google Scholar
Neiss, M. & Almeida, D. M. Age differences in the heritability of mean and intraindividual variation of psychological distress. Gerontology 50(1), 22–27 (2004).
Article PubMed Google Scholar
Wang, L. J., Hamaker, E. & Bergeman, C. S. Investigating inter-individual differences in short-term intra-individual variability. Psychol. Methods. 17(4), 567–581 (2012).
Article PubMed Google Scholar
Almeida, D. M. & Kessler, R. C. Everyday stressors and gender differences in daily distress. J. Pers. Soc. Psychol. 75(3), 670–680 (1998).
Article CAS PubMed Google Scholar
Beltz, A. M. & Moser, J. S. Ovarian hormones: A long overlooked but critical contributor to cognitive brain structures and function. Ann. N.Y. Acad. Sci. 1464(1), 156–180 (2020).
Article ADS PubMed Google Scholar
Hampson, E. A brief guide to the menstrual cycle and oral contraceptive use for researchers in behavioral endocrinology. Horm. Behav. 119, 104655 (2020).
Article CAS PubMed Google Scholar
Marriott, A. & Faragher, E. B. An assessment of psychological state associated with the menstrual cycle in users of oral contraception. J. Psychosomat. Res. 30(1), 41–47 (1986).
Article CAS Google Scholar
Hamstra, D. A., de Kloet, E. R., de Rover, M. & van der Does, W. Oral contraceptives positively affect mood in healthy PMS-free women: A longitudinal study. J. Psychosomat. Res. 103, 119–126 (2017).
Article Google Scholar
Harvey, A. T., Hitchcock, C. L. & Prior, J. C. Ovulation disturbances and mood across the menstrual cycles of healthy women. J. Psychosomat. Obstet. Gynecol. 30(4), 207–214 (2009).
Article Google Scholar
Hengartner, M. P. et al. Negative affect is unrelated to fluctuations in hormone levels across the menstrual cycle: Evidence from a multisite observational study across two successive cycles. J. Psychosomat. Res. 99, 21–27 (2017).
Article Google Scholar
Robakis, T., Williams, K. E., Nutkiewicz, L. & Rasgon, N. L. Hormonal contraceptives and mood: Review of the literature and implications for future research. Curr. Psychiatry Rep. 21, 57 (2019).
Article PubMed Google Scholar
Lundin, C. et al. Combined oral contraceptive use is associated with both improvement and worsening of mood in the different phases of the treatment cycle: A double-blind, placebo-controlled randomized trial. Psychoneuroendocrinology 76, 135–143 (2017).
Article CAS PubMed Google Scholar
Coffee, A. L., Kuehl, T. J., Willis, S. & Sulak, P. J. Oral contraceptives and premenstrual symptoms: Comparison of a 21/7 and extended regimen. Am. J. Obstet. Gynecol. 195(5), 1311–1319 (2006).
Article CAS PubMed Google Scholar
Gingnell, M. et al. Oral contraceptive use changes brain activity and mood in women with previous negative affect on the pill: A double-blinded, placebo-controlled randomized trial of a levonorgestrel-containing combined oral contraceptive. Psychoneuroendocrinology 38(7), 1133–1144 (2013).
Article CAS PubMed Google Scholar
Oinonen, K. A. & Mazmanian, D. Effects of oral contraceptives on daily self-ratings of positive and negative affect. J. Psychosomat. Res. 51(5), 647–658 (2001).
Article CAS Google Scholar
Beltz, A. M., Hampson, E. & Berenbaum, S. A. Oral contraceptives and cognition: A role for ethinyl estradiol. Horm. Behav. 74, 209–217 (2015).
Article CAS PubMed Google Scholar
Oinonen, K. A. & Mazmanian, D. To what extent do oral contraceptives influence mood and affect?. J. Affect. Disord. 70(3), 229–240 (2002).
Article CAS PubMed Google Scholar
Jarva, J. A. & Oinonen, K. A. Do oral contraceptives act as mood stabilizers? Evidence of positive affect stabilization. Arch. Womens Ment. Health. 10, 225–234 (2007).
Article CAS PubMed Google Scholar
Ott, M. A., Shew, M. L., Ofner, S., Tu, W. & Fortenberry, J. D. The influence of hormonal contraception on mood and sexual interest among adolescents. Arch. Sex Behav. 37(4), 605 (2008).
Article PubMed PubMed Central Google Scholar
Kelly, D. P., Weigard, A. & Beltz, A. M. How are you doing? The person-specificity of daily links between neuroticism and physical health. J Psychosomat Res. 137, 110194 (2020).
Article Google Scholar
Beltz, A. M., Loviska, A. M. & Weigard, A. Daily gender expression is associated with psychological adjustment for some people, but mainly men. Sci. Rep. 11, 9114 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Rankin, E. D. & Marsh, J. C. Effects of missing data on the statistical analysis of clinical time series. Soc. Work Res. Abstr. 21(2), 13–16 (1985).
Article Google Scholar
Watson, D., Clark, L. A. & Tellegen, A. Development and validation of brief measures of positive and negative affect: The PANAS scales. J. Pers. Soc. Psychol. 54(6), 1063–1070 (1988).
Article CAS PubMed Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing. (R Foundation for Statistical Computing, 2018).
Stone, A. A., Hedges, S. M., Neale, J. M. & Satin, M. S. Prospective and cross-sectional mood reports offer no evidence of a “blue Monday” phenomenon. J. Pers. Soc. Psychol. 49(1), 129 (1985).
Article Google Scholar
Wagenmakers, E. J. et al. Bayesian inference for psychology. Part II: Example applications with JASP. Psychon. Bull. Rev. 25(1), 58–76 (2018).
Article PubMed Google Scholar
JASP Team. JASP (Version 0.8.6). (2018).
Carre, J. M. & Archer, J. Testosterone and human behavior: The role of individual and contextual variables. Curr. Opin. Psychol. 19, 149–153 (2018).
Article PubMed Google Scholar
McFarlane, J., Martin, C. L. & Williams, T. M. Mood fluctuations: Women versus men and menstrual versus other cycles. Psychol. Women Q. 12, 201–223 (1988).
Article Google Scholar
Eckert-Lind, C. et al. Worldwide secular trends in age at pubertal onset assessed by breast development among girls: A systematic review and meta-analysis. JAMA Pediatr. 174(4), e195881 (2020).
Article PubMed PubMed Central Google Scholar
Jasienska, G. & Jasienski, M. Interpopulation, interindividual, intercycle, and intracycle natural variation in progesterone levels: A quantitative assessment and implications for population studies. Am. J. Hum. Biol. 20(1), 35–42 (2008).
Article PubMed Google Scholar
Houben, M. et al. The emotion regulation function of nonsuicidal self-injury: A momentary assessment study in inpatients with boderline personality disorder features. J. Abnorm. Psychol. 126(1), 89–95 (2017).
Article PubMed Google Scholar
Kessler, R. C. Epidemiology of women and depression. J. Affect. Disord. 74(1), 5–13 (2003).
Article PubMed Google Scholar
Skovlund, C., Morch, L. S., Kessing, L. V. & Lidegaard, O. Association of hormonal contraception with depression. JAMA Psychiat. 73(11), 1154–1162 (2016).
Article Google Scholar
Larsen, R. J. The stability of mood variability: A spectral analytic approach to daily mood assessments. J. Pers. Soc. Psychol. 52(6), 1195–1204 (1987).
Article Google Scholar
Bringmann, L. F. et al. Assessing temporal emotion dynamics using networks. Assessment 23(4), 425–435 (2016).
Article PubMed Google Scholar
McShane, B. B., Gal, D., Gelman, A., Robert, C. & Tackett, J. L. Abandon statistical significance. Am. Stat. 73, 235–245 (2019).
Article MathSciNet Google Scholar
Meehl, P. E. Theoretical risks and tabular asterisks: Sir Karl, Sir Ronald, and the slow progress of soft psychology. J. Consult. Clin. Psychol. 46, 806–834 (1978).
Article Google Scholar

Download references

Acknowledgements

We thank the research assistants in the Methods, Sex differences, and Development—M(SD)—Lab at the University of Michigan who worked tirelessly to collect, organize and curate the data used in this manuscript. We also thank the participants without whom this important work would not be possible.

Funding

Adriene Beltz was supported by the Jacobs Foundation. Alexander Weigard was supported by T32 AA007477 (to Frederic Blow).

Author information

Authors and Affiliations

Department of Psychology, The University of Michigan, 2227 East Hall 530 Church Street, Ann Arbor, MI, 48109, USA
Alexander Weigard, Amy M. Loviska & Adriene M. Beltz

Authors

Alexander Weigard
View author publications
You can also search for this author in PubMed Google Scholar
Amy M. Loviska
View author publications
You can also search for this author in PubMed Google Scholar
Adriene M. Beltz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.B. designed the original study and worked with A.L. to acquire the data set. All authors were involved in conceptualizing the research questions and analyses included in this manuscript. A.W. analyzed and interpreted the data with assistance and consultation from A.B. and A.L. A.W. and A.B. wrote the manuscript draft with critical input and assistance from A.L. All authors approved the final version of the manuscript for submission and agree to be accountable for all aspects of the work.

Corresponding author

Correspondence to Adriene M. Beltz.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Weigard, A., Loviska, A.M. & Beltz, A.M. Little evidence for sex or ovarian hormone influences on affective variability. Sci Rep 11, 20925 (2021). https://doi.org/10.1038/s41598-021-00143-7

Download citation

Received: 16 November 2020
Accepted: 28 September 2021
Published: 22 October 2021
DOI: https://doi.org/10.1038/s41598-021-00143-7

This article is cited by

Why the estrous cycle matters for neuroscience
- Devin Rocks
- Heining Cham
- Marija Kundakovic
Biology of Sex Differences (2022)
Depressive affect moderates the effects of biological sex on the recognition of facial emotion
- Elizabeth Hampson
- Jessica A. Chow
- Cathleen Fleury
Archives of Women's Mental Health (2022)
Tracking Women’s Reproductive Data in Sport: Practical Limitations, Perils and Pitfalls
- Kathleen V. Casto
Sports Medicine (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.