Cultural familiarity and musical expertise impact the pleasantness of consonance/dissonance but not its perceived tension

Lahdelma, Imre; Eerola, Tuomas

doi:10.1038/s41598-020-65615-8

Download PDF

Article
Open access
Published: 26 May 2020

Cultural familiarity and musical expertise impact the pleasantness of consonance/dissonance but not its perceived tension

Imre Lahdelma¹ &
Tuomas Eerola¹

Scientific Reports volume 10, Article number: 8693 (2020) Cite this article

12k Accesses
32 Citations
28 Altmetric
Metrics details

Subjects

Abstract

The contrast between consonance and dissonance is vital in making music emotionally meaningful. Consonance typically denotes perceived agreeableness and stability, while dissonance disagreeableness and a need of resolution. This study addresses the perception of consonance/dissonance in single intervals and chords with two empirical experiments conducted online. Experiment 1 explored the perception of a representative sample of intervals and chords to investigate the overlap between the seven most used concepts (Consonance, Smoothness, Purity, Harmoniousness, Tension, Pleasantness, Preference) denoting consonance/dissonance in all the available (60) empirical studies published since 1883. The results show that the concepts exhibit high correlations, albeit these are somewhat lower for non-musicians compared to musicians. In Experiment 2 the stimuli’s cultural familiarity was divided into three levels, and the correlations between the key concepts of Consonance, Tension, Harmoniousness, Pleasantness, and Preference were further examined. Cultural familiarity affected the correlations drastically across both musicians and non-musicians, but in different ways. Tension maintained relatively high correlations with Consonance across musical expertise and cultural familiarity levels, making it a useful concept for studies addressing both musicians and non-musicians. On the basis of the results a control for cultural familiarity and musical expertise is recommended for all studies investigating consonance/dissonance perception.

Microdosing with psilocybin mushrooms: a double-blind placebo-controlled study

Article Open access 02 August 2022

Determinants of behaviour and their efficacy as targets of behavioural change interventions

Article 03 May 2024

Contextual and combinatorial structure in sperm whale vocalisations

Article Open access 07 May 2024

Introduction

The origins of consonance and dissonance have been investigated since the days of Pythagoras in ancient Greece, and its elusive and mercurial nature baffles scholars to this day. The contrast between consonance and dissonance is a crucial feature of Western music, and it plays a vital role in making music emotionally meaningful by providing a sense of variety and motion^1,2,3. Typically, consonant denotes connotations like harmonious, agreeable, and stable, while dissonant, in turn, connotations like disagreeable, unpleasant, and in need of resolution⁴. Consonance/dissonance has both a vertical and a horizontal aspect: single isolated intervals (two concurrent pitches) and chords (three or more concurrent pitches) represent vertical consonance/dissonance, while the sequential relationships between these in melodies and chord progressions represent horizontal consonance/dissonance².

Aesthetic responses to consonance/dissonance (hereafter referred to as C/D and implying exclusively its vertical aspect) are surmised to have both biological and cultural roots, and the debate over which prevails represents a classical nature vs. nurture setting (e.g. ref. ³). In addition to disputes over its origins, also the very definition of C/D is notoriously problematic. As Tenney⁵ points out, “there is surely nothing in the language of discourse about music that is more burdened with purely semantic problems than are the terms consonance and dissonance” (p. 1). The concept itself is semantically loaded, and it has been volatile in a historical context as well: certain intervals (e.g., the major and minor thirds) became consonant only over time in the framework of Western music⁶. The inconsistencies arise not only from debates over which acoustic (e.g., roughness, harmonicity, fusion) and cultural phenomena (familiarity on both on a cultural and on an individual level, i.e., exposure) and their possible interactions might explain the underlying cause of C/D, but the term itself means different things to different scholars ranging from the most commonly associated definition pleasantness (e.g. ref. ⁷) to concepts like preference (e.g. ref. ⁸), smoothness (e.g. ref. ⁹), clearness¹⁰, purity (e.g. ref. ¹¹), tension (e.g. ref. ¹²), and harmoniousness (e.g. ref. ¹³). While there have been a couple of attempts to compare the overlap between some of these associated concepts^12,14, it is striking how most scholars do not problematise their definitions of C/D and take them at face value despite clear caveats in previous literature of automatically equating consonance with for example pleasantness or preference^15,16,17,18. Moreover, Ritossa and Rickard¹⁹ suggest that pleasantness and preference are not directly linked concepts in music perception, yet these have been used as synonyms in C/D research by e.g., Bones et al.⁷, Prete et al.²⁰, and McDermott et al.²¹.

The current study’s Experiment 1 aims to empirically explore the perception of the stimuli (single intervals and chords isolated from musical context) across those seven concepts (Consonance, Smoothness, Purity, Harmoniousness, Tension, Pleasantness, Preference) that have been most used to denote vertical C/D across all the available empirical studies reported since 1883 (in total 60). A related aim is to investigate the possible role of timbre in this by playing the stimuli with both the piano and the sine wave timbres as timbre can influence the perception of C/D¹⁴. Experiment 2 aims to further investigate the five key concepts (Consonance, Tension, Harmoniousness, Pleasantness, Preference) by addressing specific acoustic (roughness, harmonicity) and cultural (familiarity measured with the frequency of occurrence of the stimuli in actual music) contributors that might affect the perception of the stimuli across these. Moreover, both experiments aim to investigate the role of musical expertise in the perception of C/D as it has been suggested that the concepts of consonance and pleasantness correlate differently among musicians and non-musicians^14,22. Also, both experiments will address the influence of the total number of pitches present in the stimuli (referred to as numerosity) on the ratings of C/D and related concepts as numerosity can affect the perception of C/D^23,24.

Experiment 1

Methods

Experiment 1 is reported as one experiment but is actually a combination of seven separate sub-experiments. In each sub-experiment, participants rated through an online interface the stimuli on one of the seven concepts denoting C/D that have been most used in the previous empirical studies conducted since 1883. The review of past studies was carried out by the current authors and included exclusively those studies that used isolated, vertical pitch combinations (intervals and chords) as the experiment stimuli. The included studies were found with the aid of Web of Science, an online subscription-based scientific citation indexing service. The applied search terms were “consonance dissonance” (326 results), “consonance perception” (322 results), “interval consonance” (183 results), and “chord consonance” (125 results). As Web of Science keeps track of publications only from the year 1900 onwards, studies older than this were searched for manually.

The seven most common concepts to denote C/D are 1) Pleasantness (used in 31 studies), 2) Consonance (used in 15 studies), 3) Smoothness (used in 13 studies), 4) Purity (used in five studies), 5) Harmoniousness (used in four studies), 6) Preference (used in three studies), and 7) Tension (used in three studies). Those terms that evidently denote the same perceptual concept (e.g., antonyms like smoothness/roughness) were collapsed under one concept. The majority of the concepts have been used consistently during the 20th century and are in use to the present day. All of the concepts have been used to denote the perception of both intervals and chords; the concepts of fusion (used in eight studies), beauty (used in five studies), and euphony (used in five studies) were excluded as they have been used in studies involving exclusively intervals as the experiment stimuli. To minimise the effect of different interpretations of the concepts between participants, each one was explained on the basis of how the concepts are typically defined in previous research or in dictionary entries (see the Appendix). In the explanations, care was taken not to confound the pivotal concept of Consonance with the rest of the concepts.

Participants

As culture has been reported to affect the perception of C/D^21,25, only Western participants (self-identified native English speakers) were recruited to avoid a cultural confound. The rationale behind choosing both musicians and non-musicians as participants was data-driven, as including both of these groups is the most common procedure (used in 26 studies) in the previous C/D studies. The participants were recruited through Prolific Academic, an online crowdsourcing platform targeted especially for research purposes. Previous research suggests that Prolific Academic participants consistently complete questionnaires carefully and the platform has high reliability^26,27.

The participants’ musical expertise was measured with the six self-report rank items (Which title best describes you?) taken from the Ollen Musical Sophistication Index²⁸. The six items were (1) Non-musician, (2) Music-loving non-musician, (3) Amateur musician, (4) Serious amateur musician, (5) Semiprofessional musician, and (6) Professional musician. Participants identifying themselves as belonging to groups 1–2 were categorised as “non-musicians”, while those belonging to groups 3–6 as “musicians”. For the benefits of using this strategy to assess musical expertise, see Zhang and Schubert²⁹. In addition, participants’ age, gender, and music preference was assessed within the survey. The latter was divided into four meta-genres based on Rentfrow and Gosling³⁰ by providing example genres as proxies for the four dimensions (Reflective & Complex - Classical/Ethnic, Intense & Rebellious - Rock/Heavy, Upbeat & Conventional - Pop/Electro, and Energetic & Rhythmic - Other). The participants were asked to choose one of these four genres to indicate their music preference. Informed consent was obtained from all participants. The experiment was approved by the ethics committee of the Department of Music at Durham University and was conducted in accordance with its guidelines and regulations.

The total amount of participants after removing outliers (see Procedure) was 407. The mean age of the participants was 35.04 (SD = 12.55, 57.2% females). Participants were randomly allocated to each sub-experiment from the overall pool in order to have a balanced sample of both musicians and non-musicians. This pool size was estimated on the basis of a previous experiment by Bowling et al.³¹ where thirty participants (15 musicians and 15 non-musicians) gave consonance ratings for all 12 dyads, 66 trichords, and 220 tetrachords (played with the piano timbre) that can be formed using the intervals specified by the chromatic scale over one octave. Our aim was to have twice the number of musicians and non-musicians in each concept to be able to evaluate the consistencies within the concepts reliably (see Supporting Information Table 1).

Materials

For a representative continuum of C/D, the stimuli were chosen on the basis of the above-mentioned experiment conducted by Bowling et al.³¹ on the perception of C/D in intervals, trichords, and tetrachords. All intervals, trichords and tetrachords that were rank ordered according to perceived consonance by Bowling et al.³¹ were ordered into five quintiles of the mean consonance ratings. Out of these quintiles five intervals, 10 trichords, and 10 tetrachords were chosen in a randomised manner to represent a continuum of consonance, as two- (used in 42 studies), three- (used in 20 studies), and four-pitch (used 11 studies) combinations are the most used stimuli in the previous experiments conducted on C/D perception. Due to the smaller overall number of intervals than trichords and tetrachords, only one interval per quintile could be chosen to represent the respective consonance levels. With trichords and tetrachords there were always two chords representing a quintile of consonance. The total number of stimuli was thus 25 × 2 timbres = 50 (see Table 1). As per the procedure by Bowling et al.³¹, the fundamental frequencies (F₀s) of the pitches in each interval and chord were adjusted so that the mean F₀ of all pitches was C₄ (261.63 Hz). The timbres used were the piano and sine wave, these being the two most commonly utilised timbres (the piano used in 18 studies, the sine wave in 14 studies) in the previous experiments conducted on C/D perception (see Fig. 1 for examples of the stimuli). The stimuli were played exclusively in equal temperament: again, this is the most common procedure in the previous C/D studies (used in 40 studies).

Table 1 The stimuli.

Full size table

The piano stimuli were generated with Ableton Live 9 (a music sequencer software), using the Synthogy Ivory Grand Pianos II plug-in. The applied sound font was Steinway D Concert Grand. No reverb was used, and the intervals and chords had a fixed velocity (65) in order to have a neutral and even sound. The sine wave stimuli were generated with five partials with exponential decay in the successive amplitudes, \({a}_{n}={e}^{6-n}/{e}^{5}\). The temporal envelope of the sound was shaped with a half-Hanning window (duration of 2.0 seconds). All stimuli were normalised (to −3 db) with Adobe Audition CC 2019 (a digital audio workstation) to control for any amplitude differences due to pitch numerosity and timbre dissimilarities. The sound files were converted to stereo (same signal in both channels) as 44.1 kHz, 32 bits per sample waveform audio files. These files were rendered as constant bit rate 320 kbps high quality stereo mp3 files for compatibility with the survey design software used in the experiment (see Procedure). The length of each interval and chord was exactly 2.0 seconds. The stimuli can be found online at https://osf.io/tupzq/.

Procedure

The online experiment was conducted with the Qualtrics Survey Software, a web-based survey tool. First, the participants’ demographic background data was collected (musical expertise, music preference, gender, age). Before the evaluation of the stimuli, the participants received written instructions and were asked to rate each interval and chord on the presented concept (see the Appendix). Each concept was rated on a Likert scale ranging from 1 to 5, the concepts’ bipolar extremes taken from previous research literature. With Pleasantness, the bipolar extremes were 1 = Unpleasant and 5 = Pleasant (e.g. ref. ²²). With Consonance, the extremes were 1 = Dissonant and 5 = Consonant (e.g. ref. ³²). With Smoothness, the extremes were 1 = Rough and 5 = Smooth (e.g. ref. ¹²). With Purity, the extremes were 1 = Impure and 5 = Pure (e.g. ref. ³³). With Harmoniousness, the extremes were 1 = Inharmonious and 5 = Harmonious (e.g. ref. ¹³). With Preference, the extremes were 1 = I don’t like it and 5 = I like it (e.g. ref. ²⁰). With Tension, the extremes were 1 = Tense and 5 = Relaxed (e.g. ref. ¹⁴). Participants were randomly allocated to one of the seven concept sub-experiments, and the order of the stimuli presentation was also randomised. All of the 50 separate pitch combinations were repeated once, resulting in 100 stimuli altogether. As there was a clear link between fast overall survey completion time and random response patterns, those participants (n = 58) who completed the experiment faster than the minimal time estimated for reasonable assessment (< 400 s overall, i.e., < 4 s/trial) were removed.

To summarise the experiment design, there are seven between-subject sub-experiments (one for each concept), all having the same stimuli (n = 100) broken down into four stimulus factors (Consonance: 5 levels, Numerosity: 3 levels, Timbre: 2 levels, Repeat: 2 levels) and four participant factors (Musical Expertise: 2 levels, Music Preference: 4 levels, Gender: 2 levels, Age).

Results

The results will first focus on the concepts’ inter-rater reliability and their overall correlations and will then continue to the role of specific factors on the evaluations across the seven concepts. The internal consistencies of the concepts were measured with mean r correlation coefficients due to inflated values of the Cronbach alphas (αs > 0.93 for musicians, > 0.82 for non-musicians). Interestingly, by far the highest consistency among musicians was on the concept of Harmoniousness (0.52), followed by Pleasantness (0.45). For non-musicians the highest consistency was on the concept of Tension (0.30), followed by Harmoniousness (0.24). All in all, the consistencies were considerably higher for musicians than non-musicians (see Table 2).

Table 2 Correlations across the seven concepts for musicians and non-musicians (df = 98) and average correlations across the participants (reliability).

Full size table

As can be seen from the correlation table (Table 2) the coefficients between Consonance and the rest of the concepts were conspicuously high and consistent especially in the case of musicians (all correlations > 0.90). For non-musicians the correlations were somewhat lower, but also consistent (all correlations > 0.80, with the exception of Preference’s 0.78). The highest correlations with Consonance for musicians were on the concepts of Pleasantness (0.96), Harmoniousness (0.96), and Smoothness (0.96), while for non-musicians on the concepts of Purity (0.89), Harmoniousness (0.87), and Pleasantness (0.87). For both groups the lowest correlations with Consonance were on the concept of Preference (0.92 for musicians and 0.78 for non-musicians).

To explore the differences between the concepts and factors, first a repeated MANOVA was conducted across the seven concepts and the eight factors (Numerosity, Consonance Level, Repeat, Timbre, Expertise, Age, Gender, Music Preference) with the participants as random effects. Strong main effects for Concept (df = 403, t = 2.02, p ≤ 0.05), Numerosity (df = 40286, t = −9.62, p ≤ 0.001), Consonance Level (df = 40286, t = 40.82, p ≤ 0.001), Timbre (df = 40286, t = −2.09, p ≤ 0.05), and Expertise (df = 532.9, t = −3.171, p ≤ 0.01) were observed, but no significant main effects for Repeat (df = 401, t = −0.009, p = 0.993), Age (df = 401, t = 1.62, p = 0.106), Gender (df = 401, t = −1.90, p = 0.058), and Music Preference (df = 401, t = 0.30, p = 0.77).

A more detailed generalised linear mixed model (GLMM) analysis was carried out within each concept to better highlight the different ways the factors operated across the concepts. Table 3 shows the breakdown of the GLMM analyses across the seven concepts and four factors with the participants as random effects. To save space, only the estimates and the p values are shown for the main effects across the concepts. Supporting Information Tables 2–8 displays the full statistical table with interactions.

Table 3 GLMM estimates across the seven concepts and four factors.

Full size table

Figure 2 summarises the ratings for one concept (Consonance) and the three most important factors (Consonance Level, Numerosity, and Expertise) as an example. The complete breakdown across different factor combinations can be seen from Supporting Information Figs. 1, 2, and 3.

Numerosity

Numerosity affected all seven concepts. On the concept of Consonance the intervals were perceived as more consonant, except in the case of the most dissonant sonorities. This tendency was exactly the same for the concept of Purity. All in all, higher numerosity created more perceived dissonance, roughness, impurity, inharmoniousness, and tension especially on the middle level of C/D in the stimuli. Notably, this was not mirrored in perceived Pleasantness and Preference, where higher numerosity yielded slightly higher ratings across various levels of C/D (see Supporting Information Fig. 2).

Consonance

In all seven sub-experiments, the Consonance Level showed a significant effect across the five levels (see Table 3 for statistical significance, and also see Supporting Information Fig. 1 for the full pattern). Ratings typically increased from dissonant to consonant levels in a linear fashion (reverse for Tension).

Timbre

All of the concepts were affected by timbre statistically significantly, with the exception of Harmoniousness. The sine wave timbre was generally perceived as more dissonant, unpleasant, impure, and tense, and it was preferred less than the piano timbre. The difference between the two timbres was especially conspicuous on the concepts of Tension and Preference. However, on the concept of Smoothness this pattern was broken, where the most consonant intervals and tetrachords as well as the most dissonant trichords and tetrachords were perceived slightly smoother when played on the sine wave timbre (see Supporting Information Fig. 2).

Expertise

None of the concepts were affected by musical expertise statistically significantly with the exception of Purity, where non-musicians perceived the more dissonant stimuli as noticeably purer than musicians (see Supporting Information Fig. 3). This implies that for non-musicians, consonance and purity are not completely overlapping concepts when the stimuli are highly dissonant.

Discussion

It is striking how high and consistent the correlations between the seven concepts were especially for musically trained participants. For musically less-trained participants the correlations were somewhat lower, but also consistent. The only notable exception was the concept of Preference which had a somewhat lower correlation (0.78) in the case of non-musicians. The results imply that both groups – especially musicians – have virtually a blueprint of an acoustic concept (vertical consonance and dissonance) that they rate similarly across semantically quite distantly related concepts (e.g., purity vs. pleasantness). It is worth noting that the concept of Preference had the lowest correlation with Consonance across both musicians and non-musicians and showed by far the lowest internal consistency in the case of musicians; this raises concerns about its validity to reliably measure the perception of consonance.

With regard to different factors, higher numerosity typically resulted in higher perceived dissonance, roughness, impurity, inharmoniousness, and tension especially on the middle level of C/D in the stimuli. This is notably in line with the notion that the addition of pitches to a chord typically increases its roughness^23,24, an acoustic component seen as prevalent in dissonant, but not in consonant musical chords³⁴. However, the current results imply that higher pitch numerosity does not automatically result in a lack of preference and pleasantness despite a higher amount of perceived dissonance. On the contrary, it seems to increase ratings of pleasantness and preference in the case of consonant chords; this finding is line with previous research on the perception of isolated chords³⁵.

In terms of timbre, the sine wave sound was typically perceived as more dissonant, unpleasant, impure, and tense, and it was preferred less than the piano. A plausible explanation for this is that the sine wave sound is simply less familiar than the common piano sound. The difference according to timbre was especially prominent on the concepts of Tension and Preference where the piano timbre was perceived less tense and was also preferred more. This finding is line with previous research conducted with isolated chords where both perceived preference³⁶ and pleasantness¹⁴ were affected by timbre. Interestingly, in the current study timbre did not affect the concept of Harmoniousness. This implies that using this particular concept may have advantages in C/D research when multiple timbres are involved; it also exhibited good inter-rater reliability across both musicians and non-musicians.

Experiment 2

As Experiment 1 was concerned only with representing a seamless continuum of C/D without addressing specific acoustic or cultural contributors, the question of cultural familiarity was not yet investigated. There is a consensus that the overall perception of C/D in Western sonorities is presumably based on a combination of roughness, harmonicity, and familiarity (e.g. refs. ^2,37). Roughness denotes the sound quality that arises from the beating of frequency components (e.g. refs. ^10,34), and harmonicity indicates how closely a sonority’s spectrum corresponds to a harmonic series (e.g. ref. ³⁸). The order of importance between these two acoustic factors on the perception of C/D is debated³⁷. In addition to the acoustic phenomena of roughness and harmonicity, exposure (i.e., familiarity on both on a cultural and on an individual level) has been surmised to be an essential contributor to perceived C/D^17,39, and its important role has been empirically demonstrated both in the case of intervals⁴⁰ and chords^41,42. As cultural familiarity is evidently an important factor in C/D perception, the current experiment quantifies the stimuli’s cultural familiarity with the aid of a corpus-based familiarity model by Harrison and Pearce³⁷. As explained by Harrison and Pearce, their model is based on the hypothesis that listeners become familiar with vertical pitch combinations in proportion to their frequency of occurrence in the listener’s musical culture, and that this familiarity positively influences consonance through the mere exposure effect³⁷. Their model simulates a Western listener’s musical exposure by counting the frequencies of occurrence of different vertical pitch combinations in the Billboard Data Set⁴³, a large corpus of music sampled from the US charts published between 1958 and 1991.