Introduction

In daily life, friends and peers often ask us questions such as, ‘Do you feel I will like the restaurant you visited?’ or ‘Do you think I will be satisfied with the lecture you attended last year?’. How can we accurately predict others’ future satisfaction in such situations1,2,3,4? An intuitive approach is to speculate on how much the other person would enjoy the experience. However, existing studies5,6,7 have noted that such speculations tend to be inaccurate, even when one has known the other person for a long time. In contrast, some researchers7,8,9 have shown that our own impression can be a good predictor: simply reporting the extent to which we ourselves appreciated the experience (hereinafter referred to as the ‘Own’ opinion) can predict others’ future satisfaction to some extent.

This study proposes a method that can further improve these predictions. It works as follows: for a given question (e.g. ‘Do you think I would like the restaurant?’), the predictor considers the item from two different perspectives. In addition to the Own opinion, they estimate the public’s view (hereinafter called the ‘Estimated’ opinion); specifically, they guess the extent to which the average person would like the item. The two opinions are then averaged (hereinafter referred to as the ‘Blended’ opinion). This study’s central hypothesis is that the Blended opinion is a better predictor than the Own opinion.
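As a minimal sketch of this averaging (the rating values below are hypothetical, not taken from the study):

```r
# Minimal sketch of the proposed method, using hypothetical 0-100 ratings
own       <- 70   # Own opinion: 'How much do I like this item?'
estimated <- 40   # Estimated opinion: 'How much would the average person like it?'
blended   <- (own + estimated) / 2   # Blended opinion: unweighted average
blended   # 55
```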

The theoretical basis of the method can be explained as follows. We developed it primarily on the basis of wisdom-of-crowd research10,11,12,13,14,15,16,17,18, which has established that the aggregate of multiple judgements can be more accurate than individual judgements. Note, however, that the judgement problems studied are generally matters of fact, for which an objective truth exists (e.g. ‘What percentage of the world’s airports are in the United States?’). Importantly, some previous studies2,3,19 show that the wisdom-of-crowd effect can also emerge for matters of taste, for which there is no objective, universal truth. Put simply, these studies show that, as the number of people voicing their opinion increases, so does the accuracy of the prediction (i.e. the aggregate of their opinions). Furthermore, recent studies4,26,27,28,29,30,31,32,33,34,35 demonstrate that an individual can harness the within-person wisdom of the crowd (called ‘the wisdom of the inner crowd’). In these studies, an individual produced multiple responses to a problem (typically a matter of fact; see ‘Discussion’), and the aggregated answer was more accurate than a single one. Overall, we reasoned that an individual can improve their predictions if they can harness the within-person wisdom of the crowd for matters of taste.

As mentioned above, participants in our study were asked to produce two evaluations of the same item: the Own and the Estimated opinion. To produce the Estimated opinion, participants had to examine the item from a perspective other than their own; in other words, our method was expected to elicit two quasi-independent opinions from each participant. This approach to obtaining ‘the wisdom of the inner crowd’ is in line with the methods of previous studies4,20,21,22,23,24,25,26,27,28,29. To specify the details of the Estimated opinion (the simulated public opinion), we drew on findings from cognitive and social psychology, where many studies have examined how people can think differently from their own standpoint. In particular, it is well known that individuals can adopt others’ viewpoints across a variety of judgements, a capacity called ‘perspective-taking’30. For example, perspective-taking enables a person to reduce stereotypic biases31, engage in coordinated behaviours32, change preferential values33, and curb egocentric thinking34. We therefore decided to follow the perspective-taking paradigm. Among possible ‘others’, we adopted the general crowd’s viewpoint, because many studies have reported that people believe the general crowd differs from themselves in several ways (e.g. in intelligence35,36,37,38, risk attitude39,40, and judgemental estimations26). Accordingly, we hypothesised that, by taking the general crowd’s perspective, participants could produce evaluations of the same target that differ from their own opinions.

This is the broad background of our method. In the following sections, we report a comprehensive investigation of its effectiveness. We first conducted two experimental studies to collect evaluation data (see details in ‘Methods’); the studies differed in the category of stimuli presented: paintings in Study 1 and music in Study 2. In both studies, participants evaluated the stimuli successively using our method. We then used the evaluation data in a computer simulation to assess the effectiveness of our method.

Results

Analysis

In the analysis, we focused on the following situation: for a given target, one person (the ‘Giver’) gives an opinion to another (the ‘Receiver’). Figure 1a illustrates the definition of the ‘helpfulness’ of the Giver’s opinion. The left side depicts the following situation: the Giver has already experienced a target and gives an opinion about it (for example, 70 in the figure) to a Receiver who has not yet experienced it. The right side shows the outcome of this opinion giving, once the Receiver has also encountered the target and formed their own judgement (that is, the Receiver’s Own opinion). The upper row on the right depicts a case in which the Receiver formed a similar opinion (80) of the target. In this case, we regard the Giver’s opinion as ‘helpful’, as it accurately predicted the Receiver’s future satisfaction. Conversely, when the Receiver formed an opinion that differed from the Giver’s (for example, 20 in the lower row), we regard the Giver’s opinion as relatively ‘unhelpful’.

Figure 1

Illustration of the analysis. (a) Definition of the helpfulness of a Giver’s opinion. When the Giver’s opinion accurately predicts a Receiver’s (future) preference (upper right), we define the opinion as ‘helpful’ for the Receiver. Conversely, when the Giver’s opinion inaccurately predicts the Receiver’s future choice (lower right), we consider the opinion ‘not helpful (unhelpful)’. (b) Computation of the efficacy (helpfulness) of a Giver’s opinion. We took the Giver’s Own, Estimated, and Blended opinions and calculated the MSE (mean squared error) between each of them and a Receiver’s future satisfaction (the Receiver’s Own opinion). A smaller MSE indicates that the Giver’s opinion is more helpful. In the analysis, all participants except the Giver served as Receivers, and we repeated the computation with every participant serving as the Giver.

Note that this assumption is similar to those adopted in previous studies3,4,7,8,19. As mentioned in the Introduction, we simulated the opinion giving on a computer using the evaluation data from Studies 1 and 2. The simulations eliminated the possibility that receiving the Giver’s opinion influenced the Receiver’s opinion formation.

For the detailed analysis, we mainly used the theoretical framework of the existing wisdom-of-crowd literature on matters of taste (in particular, Müller-Trede et al.3), which enabled us to investigate the efficacy of the proposed method quantitatively.

Figure 1b illustrates the detailed analysis. A Giver and a Receiver were independently selected from the participants whose behavioural data had been obtained, and we examined the helpfulness of the Giver’s three opinions (Own, Estimated, and Blended). As shown in Fig. 1b, we employed the mean squared error (MSE) as the index of helpfulness: for each stimulus, we computed the squared difference between the Giver’s opinion and the Receiver’s Own opinion, and we averaged these values across all stimuli. A smaller MSE indicates that the Giver’s opinion is more helpful.

Using the simulation procedure, we examined all Giver–Receiver combinations, as sketched below: (i) for a given Giver, all other participants served as Receivers, and we computed the MSE for each; (ii) the average MSE across all Receivers was assigned to that Giver; (iii) we repeated steps (i) and (ii) with every participant serving as the Giver.
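The procedure can be sketched as follows; the matrix names (`own`, `estimated`) and the participant-by-stimulus data layout are our assumptions, not the authors’ code.

```r
# Sketch of the Giver-Receiver simulation, assuming hypothetical
# participant-by-stimulus rating matrices 'own' and 'estimated' (0-100)
blended <- (own + estimated) / 2
n <- nrow(own)   # number of participants

# MSE between a Giver's ratings and a Receiver's Own ratings, across stimuli
mse <- function(giver, receiver_own) mean((giver - receiver_own)^2)

# Steps (i)-(iii): one MSE value per Giver, averaged over all Receivers
giver_mse <- function(ratings) {
  sapply(seq_len(n), function(g) {
    receivers <- setdiff(seq_len(n), g)
    mean(sapply(receivers, function(r) mse(ratings[g, ], own[r, ])))
  })
}

mse_own     <- giver_mse(own)
mse_blended <- giver_mse(blended)
wilcox.test(mse_own, mse_blended, paired = TRUE)   # cf. the Study 1 test
```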

Main results

Figure 2 shows the results of this analysis. Notably, the Blended opinion yielded a lower MSE than the Own opinion in both studies (Study 1: Wilcoxon signed-rank test, p < 0.001, Cliff’s delta = 0.37; Study 2: paired t-test, p < 0.001, Cohen’s d = 1.36). That is, using our method, a Giver could improve the accuracy of their opinions. Thus, our main hypothesis was supported.

Figure 2

Results of the main analysis. The figures show the MSE (mean squared error) for each Giver as a violin plot with a boxplot. Note that all boxplots indicate 95% confidence intervals for convenience. All bootstrapping in this study used 1,000 resamples with replacement. As hypothesised, the Blended opinion recorded a lower MSE than the Own opinion (ps < 0.001).

As mentioned in the Introduction, the two studies differed in stimulus category. The results indicated that they also differed in data structure, as shown in Table 1. In Study 1, the average rating values were below the scale midpoint (i.e. 50) for all opinion types, whereas in Study 2 they hovered around the midpoint. Additionally, as shown in Fig. 3, none of the opinion types followed a normal distribution in Study 1, while in Study 2 they all did (Kolmogorov–Smirnov test; Study 1: ps < 0.001; Study 2: ps > 0.1). Taken together, our method was effective across different categories and data structures.

Table 1 Data structure across the two studies. This table shows 95% confidence intervals (CIs) for the rating values, computed by bootstrapping. In Study 1, the average rating values for all opinion types were below the scale midpoint (50), while in Study 2 they were around the midpoint.
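The reported bootstrap CIs could be computed along the following lines (a sketch; `ratings` stands for a hypothetical vector of one opinion type’s values, and the percentile method is our assumption, as the exact bootstrap variant is not specified):

```r
# Sketch: 95% percentile-bootstrap CI for a mean rating, using 1,000
# resamples with replacement as reported ('ratings' is hypothetical)
set.seed(1)
boot_means <- replicate(1000, mean(sample(ratings, replace = TRUE)))
quantile(boot_means, probs = c(0.025, 0.975))   # lower and upper 95% CI bounds
```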
Figure 3

Data structure in Study 1 (a) and Study 2 (b). Each line represents a probability density function. In Study 1, none of the opinion types followed a normal distribution (Kolmogorov–Smirnov test: ps < 0.001), while in Study 2, they all did (ps > 0.1).
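A normality check of this kind can be sketched with `ks.test`; estimating the normal parameters from the sample is our assumption about how the reported test was specified.

```r
# Sketch: Kolmogorov-Smirnov normality check for one opinion type.
# 'ratings' is a hypothetical vector; fitting the reference normal with the
# sample mean and SD is our assumption, not a detail given in the text.
ks.test(ratings, "pnorm", mean = mean(ratings), sd = sd(ratings))
```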

Next, we examine how effective our method was across these different data structures. Figure 4 presents typical examples of the results. In Study 1 (Fig. 4a), the rating values of the Own opinion clustered near 0, but a few ratings were very large (for example, 100). In those cases, the MSE of the Own opinion became quite large. Conversely, fewer Blended ratings were 0; the Blended ratings tended to remain distributed between 0 and 75. As a result, the Blended opinion recorded a lower MSE than the Own opinion. In Study 2 (Fig. 4b), the mean rating value of the Own opinion was similar to that of the Blended opinion (both around 50), but the Own ratings were more widely dispersed than the Blended ratings. That is, there were cases in which the rating values of a Giver and a Receiver differed greatly (for example, the Giver rated 0 and the Receiver 100, or vice versa). Such cases were relatively rarer for the Blended opinion, as its distribution was narrower. Thus, our method was effective across different data structures.

Figure 4

Typical examples of the results in Study 1 (a) and Study 2 (b). Each histogram indicates the number of participants (frequency). The titles represent the stimulus numbers (see Supplementary Tables S1 and S2).

Notably, the results also indicated that the Blended opinion had a significantly lower MSE than the Estimated opinion in Study 1 (p < 0.001; Cliff’s delta = 0.52). Although we did not find such an effect in Study 2 (p = 0.55, Cohen’s d = 0.18), the findings there also favoured the Blended opinion: we counted the participants whose MSE was lower for the Blended (or Estimated) opinion than for the Own opinion, and more participants improved under Blended than under Estimated (55 versus 44, out of 56 participants; Fisher’s exact test: p < 0.005). Considering these results together with those of Study 1, the Blended opinion improves the Giver’s opinion more effectively than the Estimated opinion.
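Our reading of this count comparison can be sketched with `fisher.test`; the 2 × 2 layout below is an assumption about how the reported test was set up.

```r
# Sketch of our reading of the reported test: improved vs. not-improved
# participant counts (out of 56) for the Blended and Estimated opinions
counts <- matrix(c(55,  1,    # Blended:   55 improved over Own, 1 did not
                   44, 12),   # Estimated: 44 improved over Own, 12 did not
                 nrow = 2, byrow = TRUE,
                 dimnames = list(c("Blended", "Estimated"),
                                 c("lower MSE", "not lower")))
fisher.test(counts)   # p < 0.005, as reported
```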

How our method performed: analysis by decomposing MSE

How could our method reduce the error of the Own opinion? It is well known3,41,42 that the MSE can be theoretically decomposed into different sources of error. Performing this decomposition allowed us to observe comprehensively how our method achieved its effect. Several decompositions of the MSE have been suggested in the literature3,41,42; we adopted the one proposed by Müller-Trede et al.3 because it consists of psychologically meaningful factors. Their MSE decomposition is represented as follows:

$$\mathrm{MSE} = \mathrm{Bias} + \mathrm{Variability\ bias} + \mathrm{Linear\ correspondence} \tag{1}$$

First, bias captures the difference between the Giver’s and the Receiver’s mean rating values: it is larger when the Giver’s mean rating differs more from the Receiver’s. Second, variability bias indicates how far the variability of the Giver’s opinions deviates from the optimal degree of regression to the mean. Here, variability refers to the standard deviation of the Giver’s rating values across stimuli, and the optimal degree of regression to the mean is the variability of the Receiver’s opinions multiplied by the Giver–Receiver correlation coefficient across the stimuli (see the mathematical description in ‘Methods’). Third, linear correspondence denotes the extent to which the Giver–Receiver relationship deviates from a perfectly linear relation.

Table 2 shows the results of the MSE decomposition. Remarkably, the Blended opinion recorded lower variability bias than the Own opinion in both studies (95% CIs). We did not find such differences for bias or linear correspondence (see also Supplementary Fig. S1). These results indicate that our method was effective because it improved the regression to the mean. Notably, the results are in line with the findings on the wisdom of the crowd for matters of taste (Müller-Trede et al.3). From this perspective, our method can be regarded as exploiting the wisdom-of-crowd effect for matters of taste at the within-person level. In the Discussion section, we address this study’s contribution to the wisdom-of-crowd literature.

Table 2 Results of the MSE decomposition (95% CIs; in Study 1, none of the opinion types followed a normal distribution, Kolmogorov–Smirnov test: ps < 0.001). The Blended opinion had a smaller variability bias than the Own opinion in both studies. In addition, in Study 1, the Blended opinion had a smaller bias and variability bias than the Estimated opinion.

In Study 1, the Blended opinion showed both lower bias and lower variability bias than the Estimated opinion. This indicates that the rating values of the Estimated opinion were consistently farther from the Receivers’ Own opinions than the Blended values were (most often higher; see also Table 1). This pattern was not found in Study 2.

Additional analysis: when our method is more (or less) effective

As an additional analysis, we investigated the conditions under which our method performed better (or worse). We focused particularly on two factors: individual differences and taste discrimination.

Individual differences

People’s tastes vary widely3,19,43,44: some people have tastes that differ considerably from the general public’s, while others have ordinary ones. Here, we examined how individual differences in taste typicality influenced the efficacy of our method.

Figure 5 illustrates our analysis. (1) To quantify the typicality of a Giver’s tastes, we calculated the absolute distance between the Giver’s Own opinion and the average of all participants’ Own opinions (called the ‘distance from average’); a small distance from average indicates high typicality of the Giver’s taste. (2) We then analysed the reduction of the MSE, calculated by subtracting the MSE obtained when the Giver’s opinion was Blended from that obtained when it was Own; the larger the reduction, the better the performance of our method. (3) We conducted this analysis across all stimuli and examined the relationship between the distance from average and the reduction of the MSE by calculating the correlation coefficient between them.

Figure 5

Illustration of the analysis in the ‘Individual differences’ section. (1) We first calculated the ‘distance from average’, which indicates the taste typicality of a Giver: the absolute distance between the Giver’s Own opinion and the average of all Own opinions. (2) We then analysed the ‘reduction of the MSE (mean squared error)’ achieved by using our method, across all stimuli; it was calculated by subtracting the MSE when the Giver’s opinion was the Blended opinion from the MSE when it was the Own opinion. (3) Finally, we examined the relationship between the distance from average and the reduction of the MSE by calculating the correlation coefficient between them. All participants except the Giver served as Receivers, and we repeated the procedure with every participant serving as the Giver.

The remaining procedure was the same as in the final paragraph of the ‘Analysis’ section: for each Giver, we performed this analysis across the Receivers (all participants except the Giver), assigned the average reduction of the MSE to that Giver, and repeated the procedure for all participants.
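A sketch of this computation, reusing the hypothetical objects from the earlier simulation sketch (`own`, `n`, `mse_own`, `mse_blended`):

```r
# Sketch of the individual-differences analysis (hypothetical data layout)
grand_avg <- colMeans(own)   # average Own opinion for each stimulus

# (1) Distance from average: mean absolute distance of each Giver's Own
#     opinions from the stimulus-wise averages (small value = typical taste)
dist_from_avg <- sapply(seq_len(n), function(g) mean(abs(own[g, ] - grand_avg)))

# (2) Reduction of the MSE: MSE(Own) minus MSE(Blended), per Giver
reduction <- mse_own - mse_blended

# (3) Relationship between typicality and the benefit of the method
cor.test(dist_from_avg, reduction, method = "spearman")   # rho, cf. Study 1
```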

Figure 6 shows the results of this analysis. Each point represents one Giver; the x-axis shows the distance from average, and the y-axis the reduction of the MSE. In both studies, we found a significant positive relationship between them (Study 1: rho = 0.42, p < 0.001; Study 2: r = 0.67, p < 0.001). That is, our method was more effective for Givers with atypical tastes (for further analysis, see Section S2 of the Supplementary Information).

Figure 6

Results of the ‘Individual differences’ analysis in Study 1 (a) and Study 2 (b). The black lines represent regression lines. The larger the distance from average, the larger the reduction of the MSE.

Taste discrimination

When assessing items in daily life, our evaluations differ in how distinct they feel. For some items, we can make clear-cut judgements about whether we like them (for example, pop and metal music), while for others we can make only vague judgements (for example, ambient and experimental music). Previous studies1,3 indicate that this distinctiveness of judgements (called ‘taste discrimination’) plays a critical role in opinion giving. Notably, Müller-Trede et al.3 show that taste discrimination affects the helpfulness of opinions in our context. They first provide a theoretical model of its effects, defining taste discrimination in terms of the signal-to-noise ratio of judgements; they then confirm its influence empirically in behavioural research, using participants’ familiarity with a stimulus as an index of taste discrimination.

We therefore investigated the effect of taste discrimination on the effectiveness of our method. In this section, we used only the behavioural data from Study 2, in which participants reported both the Own opinion and Familiarity, as in the study of Müller-Trede et al.3. We also introduced ‘Difficulty’ as a new index of taste discrimination: participants directly reported how challenging they found it to state the Own opinion (see ‘Methods’).

We conducted mixed-effects analyses with the reduction of the MSE as the dependent variable and Familiarity, Difficulty, and their interaction as independent variables (Table 3). The effect of Difficulty was significant (F(1, 1123.7) = 48.32, p < 0.001), whereas neither Familiarity nor the interaction had an effect (Familiarity: F(1, 427.1) = 0.13, p = 0.72; interaction: F(1, 1311.3) = 0.90, p = 0.34). Thus, in our experimental settings, only Difficulty influenced the effectiveness of our method: the lower the Difficulty, the higher the efficacy (reduction of the MSE = −3.23 × Difficulty + intercept (= 288.21)).

Table 3 Results of the GLMM (generalised linear mixed model). * indicates the interaction term.
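The mixed-effects model can be sketched with lme4/lmerTest, which the ‘Methods’ section names as the analysis tools; the data frame `d` and its column names are our assumptions.

```r
# Sketch of the mixed-effects analysis ('d' is a hypothetical long-format
# data frame with one row per participant-stimulus pair)
library(lmerTest)   # loads lme4 and adds Satterthwaite F- and p-values

m <- lmer(mse_reduction ~ familiarity * difficulty +
            (1 | participant) + (1 | stimulus), data = d)
anova(m)     # F-tests for Familiarity, Difficulty, and their interaction
summary(m)   # fixed-effect slopes, e.g. the reported -3.23 for Difficulty
```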

Next, we examined how lower Difficulty enhanced our method. To do so, we performed separate mixed-effects analyses for the Own and Blended opinions (Tables 4 and 5, respectively). These analyses used the same independent variables as before (Familiarity, Difficulty, and their interaction) but a different dependent variable: the MSE itself (not the reduction of the MSE).

Table 4 Results of the additional GLMM for Own opinion.
Table 5 Results of the additional GLMM for the Blended opinion.

The results showed that the effect of Difficulty was significant (ps < 0.001), whereas neither Familiarity nor the interaction was significant (all ps > 0.1), for both the Own and the Blended opinion. The MSEs of both opinions were inversely related to Difficulty: as Difficulty increased, the MSE decreased. Importantly, the Own opinion had a steeper slope than the Blended opinion: Own = −6.48 × Difficulty + intercept (= 1034.49), and Blended = −3.03 × Difficulty + intercept (= 739.58). That is, as Difficulty increased, the Own opinion more rapidly became an accurate prediction. Consequently, the merit of using our method was relatively low when people found it challenging to state their own opinions.

Discussion

This study proposed a method for improving an individual’s prediction of others’ future satisfaction. The method requires the individual to evaluate an item twice, from different viewpoints: once stating their own preference and once estimating public opinion26,35,36,37,38,39,40. Using two behavioural studies and computer simulations, we comprehensively examined the proposed method. We first confirmed its effectiveness: by averaging the two opinions, an individual could improve their predictions (concerning optimal weightings of the opinions, see Section S4 of the Supplementary Information). We then analysed the method mathematically3 to determine how it performed so effectively. Moreover, we identified multiple factors that influence its efficacy.

As mentioned in the Introduction, previous studies2,3,19 demonstrate that the wisdom-of-crowd effect can emerge for matters of taste within a group. In contrast, this is the first study to show that the wisdom-of-crowd effect for matters of taste can emerge even within a person (a related study4 treats performance evaluation as a kind of wisdom of the crowd for matters of taste and extends it to the within-person level). Specifically, the MSE-decomposition analysis indicated that the same mechanism drives the wisdom-of-crowd effect for matters of taste both within a group and within a person: it works mainly by reducing variability bias (in other words, by improving the regression to the mean).

A remaining question is just how effective our method was. To answer this, we conducted an additional analysis comparing our method (i.e. one individual’s Blended opinion) with the combination of two individuals’ Own opinions (see Section S5 of the Supplementary Information). Our method recorded relatively high efficacy for a wisdom-of-the-inner-crowd method: the Blended opinion was worth the aggregate of 1.92 independent individuals in Study 2 and 1.36 in Study 1. For matters of fact, most previous studies20,23,27 report approximately 1.1–1.3 individuals. This high efficacy may therefore be characteristic of matters of taste, although it may also stem from the specific settings of our method (i.e. estimating public opinion).

Academically, our study can also be related to differential-information theories45,46 and social sampling36,37,47,48, which suggest that when people estimate public opinion, they base their estimates on themselves or on a similar social circle. In our context, then, would people with atypical tastes have a poorer sense of what the average is? To address this question, we conducted an additional analysis examining the relationship between the distance from average and prediction accuracy for the average (i.e. how far a Giver’s Estimated opinion lies from the average of all people’s Own opinions). The answer was yes: in both studies, people with more atypical tastes had a poorer sense of the average (Study 1: r = 0.47, p < 0.001; Study 2: r = 0.58, p < 0.001; see also Supplementary Fig. S5).

This study also fits within the advice-taking paradigm49. Note, however, that in this study the advice-taking step was performed automatically: our method systematically averaged a Giver’s Own and Estimated opinions before directing the result to a Receiver. Existing findings50 on advice-taking show that a Receiver does not usually average two opinions spontaneously (e.g. a Receiver may adopt only one of the two opinions). It therefore remains a challenge to investigate whether a Receiver would naturally average the Giver’s two opinions, as our method does.

In terms of practical contributions, this study may inform the design of online review interfaces. It is well known that items on online review sites often receive very few reviews; for example, half of the items on Amazon.com receive only one review51. In such circumstances, the helpfulness of reviews could improve if reviewers used our method. Reviewers would, of course, need incentives to do so. At the least, however, this paper suggests a way to gather opinions that are more helpful than people’s own opinions alone.

Our method could also contribute to research on recommender systems. Previous studies19,52 of recommender systems show that an individual can leverage the experiences of similar others. Conversely, because the Estimated opinion differs from the Own opinion, an individual applying our method might learn from people with dissimilar tastes.

One limitation of this study concerns the efficacy of the method. As mentioned above, our method recorded relatively high efficacy for a wisdom-of-the-inner-crowd method; however, the Blended opinion could not outperform the combined opinion of two individuals (i.e. two Own opinions) in either study. More sophisticated methods should therefore be investigated in future work. One example is increasing the number of Estimated opinions (e.g. to four) and combining the method with other wisdom-of-the-inner-crowd techniques (e.g. introducing a timespan between responses20).

Another promising direction for future studies concerns ‘Difficulty’. Based on an existing theoretical framework3, we asked participants to report difficulty levels and found that the higher the difficulty, the less effective our method was. How did these results arise? One possible explanation is that when an individual found it difficult to rate an item, they might also have found it highly difficult to project the average for other users, making the Estimated opinion wide of the mark (e.g. 100 in Study 1) and producing the observed results. In future work, we aim to test this explanation; for simplicity, we plan to ask participants directly about the difficulty of producing an Estimated opinion.

Another promising future research direction is benchmarking the efficacy of our method: specifically, comparing it with two ‘Own opinions’ or two ‘Estimated opinions’. As a first step, we plan to examine two ‘Own opinions’ using open-access data from a previous study53.

Overall, we consider that this study highlights the generality of the wisdom-of-crowd phenomenon: the wisdom-of-crowd effect can be exploited without objective criteria and without multiple people.

Methods

(1) Details of the experiment in Study 1.

We recruited 543 Japanese adults (273 females and 270 males; Mage = 45.23 years, SDage = 11.01 years) through a web research company. All participants provided informed consent prior to enrolment. The experimental protocol was approved by the University of Tokyo Research Ethics Committee and conducted in accordance with the latest version of the Declaration of Helsinki. As an incentive, participants received cash-equivalent points that could be used for online shopping in Japan.

We set five conditions for this research. In each condition, five different paintings were used as stimuli, and their contents varied among the conditions; the stimuli thus comprised 25 paintings in total (5 conditions × 5 paintings; Supplementary Table S3). In selecting the paintings, we followed the method of a previous study54 and included paintings of various styles, such as Gothic, Renaissance, Surrealist, and Modern art. In the experiment, participants were randomly assigned to one of the five conditions and asked to evaluate the paintings. Specifically, they were asked two questions: ‘How much would you like to hang this picture on your wall?’55 (Own opinion) and ‘How much would the average person like to hang this picture on their wall?’ (Estimated opinion). The order of the paintings was randomised for each participant. For convenience, we first analysed each condition separately and then combined the results across conditions.

(2) Details of the experiment in Study 2.

We recruited 56 Japanese undergraduate and graduate students (22 females and 34 males; Mage = 19.61, SDage = 1.40) for the second experiment. All participants provided informed consent prior to enrolment. The experimental protocol was approved by the University of Tokyo Research Ethics Committee and conducted in accordance with the latest version of the Declaration of Helsinki. On completing the experiment, participants received a flat fee of 1,000 Japanese yen (approximately US$9.17 at the exchange rate at the time of the experiment).

The experimental settings were the same as those of Müller-Trede et al.3. This study had a single condition: all participants followed the same procedure and evaluated identical stimuli. They were asked two questions: ‘How much do you like the musical piece?’ (Own opinion) and ‘How much would the average person like the musical piece?’ (Estimated opinion).

We selected songs from various music genres, such as classical, folk, hip-hop, and ethnic music. Specifically, we selected two songs each from twelve musicians (for example, two songs by Oasis from the same album ‘What’s the Story Morning Glory?’; see Supplementary Table S4); the stimuli thus consisted of 24 songs. One minute of each song was presented to participants, and the order of the musical pieces was randomised for each participant.

Furthermore, this research involved two additional questions: Difficulty in answering the Own opinion (‘How difficult was it for you to state your preference?’) and Familiarity with the song’s genre (‘How familiar are you with the genre of this musical piece?’). Participants responded to these questions on a scale ranging from 0 to 100.

(3) A formula of MSE decomposition.

As mentioned in the Results section, the MSE decomposition proposed by Müller‐Trede et al.3 is represented as Eq. (1).

Mathematically, in our context, the MSE decomposition is represented as follows:

$$\mathrm{MSE}_{Giver,Receiver} = (M_{Giver} - M_{Receiver})^{2} + (\sigma_{Giver} - \rho_{Giver,Receiver}\,\sigma_{Receiver})^{2} + (1 - \rho_{Giver,Receiver}^{2})\,\sigma_{Receiver}^{2} \tag{2}$$

Here, M is the mean rating value, σ is its standard deviation, and ρ is the correlation between the Giver’s and the Receiver’s ratings. On the right-hand side of the equation, the first, second, and third terms correspond to bias, variability bias, and linear correspondence, respectively.
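Equation (2) can be implemented directly. The sketch below uses toy rating vectors of our own and checks that the three terms sum exactly to the MSE; note that population standard deviations (n denominator) are required for the identity to hold with sample data.

```r
# Sketch: the decomposition of Eq. (2) for one Giver-Receiver pair.
# 'g' and 'r' are toy rating vectors across stimuli (hypothetical values).
psd <- function(x) sqrt(mean((x - mean(x))^2))   # population SD (n denominator)

decompose_mse <- function(g, r) {
  rho <- cor(g, r)
  c(bias                  = (mean(g) - mean(r))^2,
    variability_bias      = (psd(g) - rho * psd(r))^2,
    linear_correspondence = (1 - rho^2) * psd(r)^2)
}

g <- c(10, 40, 80, 20, 60); r <- c(30, 50, 90, 10, 70)
sum(decompose_mse(g, r))   # 160: the three terms...
mean((g - r)^2)            # ...sum exactly to the MSE
```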

(4) Assumptions in our analysis.

The analysis assumed that the difference in taste between Givers and Receivers could be measured on a discrete scale. Many studies3,4,8,56,57,58 exploring preference prediction have made similar assumptions.

(5) Mixed-effects analysis.

We performed all mixed-effects analyses using the R packages lme4 and lmerTest59. In particular, we selected the best model and computed all statistical values by applying the step() function to the full model, which included random intercepts for participants and stimuli.
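In outline, this model selection could look as follows; the data frame `d` and its column names are hypothetical, carried over from the earlier sketch.

```r
# Sketch: model selection with lmerTest's step(), starting from the full
# model with random intercepts for participants and stimuli
library(lmerTest)
full <- lmer(mse_reduction ~ familiarity * difficulty +
               (1 | participant) + (1 | stimulus), data = d)
s <- step(full)        # backward elimination of random, then fixed, effects
final <- get_model(s)  # extract the selected model
summary(final)
```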