Subjective experience of difficulty depends on multiple cues

Desender, Kobe; Van Opstal, Filip; Van den Bussche, Eva

doi:10.1038/srep44222

Article
Open access
Published: 13 March 2017

Scientific Reports volume 7, Article number: 44222 (2017)

Abstract

Human cognition is characterized by subjective experiences that go along with our actions, but the nature and stability of these experiences remain largely unclear. In the current report, the subjective experience of difficulty is studied and it is proposed that this experience is constructed by integrating information from multiple cues. Such an account can explain the tight relationship between primary task performance and subjective difficulty, while allowing for dissociations between the two to occur. Confirming this hypothesis, response conflict, reaction time and response repetition were identified as variables that contribute to the experience of difficulty. Trials that were congruent, fast or required the same response as the previous trial were more frequently rated as easy than trials that were incongruent, slow or required a different response than the previous trial. Furthermore, in line with theoretical accounts that relate metacognition to learning, a three-day training procedure showed that the influence of these variables on subjective difficulty judgments can be changed. Results of the current study are discussed in relation to work on meta-memory and to recent theoretical advancements in the understanding of subjective confidence.

Introduction

A defining characteristic of human cognition is that our actions are accompanied by subjective experiences. For example, when pronouncing difficult words such as Worcestershire sauce, there is more to it than just the cognitive processes allowing its pronunciation. You will also have the experience that this particular word was difficult to pronounce and you will experience a sense of confidence in the correctness of your pronunciation. These subjective experiences are termed metacognitive since they are a reflection on other cognitive processes taking place.

Many different fields of research have adopted the term “metacognition” when studying how humans reflect on their behavior. For example, when deciding on the nature of a noisy perceptual input, we typically experience a sense of confidence in the accuracy of our decision¹,², or we become aware of errors in the decision process³. Likewise, when learning novel information, we sense whether we have effectively learned the newly acquired information⁴, and whether we will know the answer during recall⁵. Recently, the subjective experience of response conflict has attracted considerable attention⁶,⁷,⁸,⁹,¹⁰. Whenever deciding between two options, we have a subjective experience of difficulty associated with the resulting response. Some responses feel very easy to carry out, whereas others are experienced as more difficult. In experimental tasks, these experiences of difficulty are often studied by inducing conflict between potential actions. For example, in a priming task, participants might be asked to rapidly categorize a target arrow as pointing to the left or to the right. Shortly preceding this target, a prime arrow is flashed that either triggers the same response as the target (i.e., a congruent trial) or a different response, hence inducing conflict between the responses (i.e., an incongruent trial). Incongruent trials are more frequently rated as difficult compared to congruent trials. Thus, the conflict between potential responses is experienced as subjectively difficult⁷. This finding has been observed even with masked primes that were entirely invisible⁸,⁹,¹¹, indicating that these difficulty ratings reflect genuine metacognitive experiences, rather than visual awareness of the conflict between prime and target.

Although the influence of response conflict on subjective difficulty has been documented, the nature and stability of the experience of difficulty remain unclear. Intuitively, one might expect that response conflict results in increased reaction times (RTs) and error rates, and that the subjective experience of difficulty results from this performance decrement. However, there is some recent evidence speaking against this possibility. In a previous study, the influence of response conflict on the experience of difficulty was still present in a subset of data in which congruent and incongruent trials were matched in terms of RTs¹¹. This raises the intriguing possibility that a subjective experience of difficulty is directly based on the presence of response conflict. Based on repeated past experiences, participants might have learned that response conflict leads to actual errors in responding. Given that errors are experienced as aversive¹², after repeated pairing, the presence of response conflict in itself might be experienced as aversive¹³. Thus, over time, participants might learn that the mere presence of response conflict is a good indicator of performance. After having learned this relation, response conflict could then be used as a cue for the construction of the subjective experience of difficulty, independent of task performance.

Importantly, from the assumption that response conflict can be learned to be an indicator for difficulty, it follows that any variable that is a good indicator for primary task performance, could act as a cue for the construction of subjective difficulty. Apart from response conflict, another variable that is clearly indicative of performance is reaction time (RT). Participants might have learned that this is a good proxy for task difficulty¹⁴. Finally, a third factor that will be considered is the possibility that subjective difficulty depends on expectations. In behavioral tasks, humans are inherently biased to expect responses to repeat over consecutive trials¹⁵. In serial two-choice RT tasks, for example, it has been observed that responses for stimulus repetitions are faster than responses for stimulus alternations¹⁶. Given that this variable is indicative of performance, trials with response repetitions might be experienced as subjectively easier than trials with response alternations. In sum, if subjective difficulty is based on cues that are indicative of performance, it should be possible to provide empirical evidence that these three variables affect judgments of difficulty.

While the preceding discussion concerns the construction of subjective difficulty, its stability is equally unclear. If subjective difficulty results from sampling information from multiple cues, to what extent is the relative contribution of these cues fixed? Theoretical work in the broader field of metacognition has already stressed the role of learning in the construction of metacognition¹⁷,¹⁸,¹⁹. For example, Pasquali and colleagues¹⁷ presented simulations of a neural network that learned to wager on its own responses. By learning which representations are indicative of good performance, the network was successful in evaluating its own accuracy. In this framework, metacognition arises because the brain continuously learns about its own activity²⁰. Converging empirical work has documented that subjective certainty in a decision can be altered by means of training²¹,²²,²³. After training, participants are more confident in their decisions²³, and better at discriminating their own errors from correct responses²². This theoretical emphasis on learning is in line with the current hypothesis that the brain first has to learn which variables are good indicators of performance, before they become cues that are used to construct subjective difficulty. Therefore, a second prediction of the current study is that the construction of subjective difficulty can be influenced by training participants to rely more on certain cues at the expense of others. By training participants that a certain cue is a highly reliable indicator of performance, it should gain more weight in the construction of subjective difficulty.

In a first experiment, we test whether subjective difficulty judgments are constructed by integrating information from multiple cues. This is done by examining whether the three variables described above, namely response conflict, RT and response repetition, affect subjective difficulty judgments. A second experiment investigates whether the relative contribution of cues to subjective difficulty can be changed by training. Participants were trained to rely more on response conflict (Experiment 2a) or on RTs (Experiment 2b) as the main cue informing their subjective difficulty judgments.

Experiment 1

Participants

Thirty-one participants, 14 men, participated for monetary compensation (£ 15). Mean age was 24.3 years (SD = 5.2, range 19–42). All participants were right-handed and reported normal or corrected-to-normal vision. All experimental protocols were approved by the local ethics committee of the Vrije Universiteit Brussel. All methods of all experiments were performed in accordance with the relevant guidelines and regulations. In accordance with the approved guidelines, written informed consent was obtained from each participant prior to the experimental session. Non-overlapping results from this dataset have been published elsewhere¹¹.

Stimuli and apparatus

All stimuli were presented in white on a black background on a 15 inch CRT monitor, synchronized with a vertical refresh rate of 60 Hz. Experimental stimuli were prime arrows (1.5° wide and 0.7° high) and target arrows (3.3° wide and 1.4° high), that could point to the left or right (see Fig. 1). Because the prime arrows fitted perfectly within the contours of the target arrow (i.e., metacontrast masking²⁴), primes were rendered invisible. Responses were collected using a standard QWERTY keyboard.

Experimental procedure

Participants completed a masked priming experiment in which they additionally reported their metacognitive experience associated with each response. Each experimental trial started with a fixation cross for 1000 ms that was followed by a prime arrow for 34 ms, a blank screen for 34 ms, a target arrow for 116 ms, and finally a blank screen again. Participants were asked to respond as fast and accurately as possible to the direction of the target, by pressing “d” in response to a left-pointing target arrow and “k” in response to a right-pointing target arrow. They responded with the middle finger of each hand. If a response to the target was registered within 3000 ms, a blank screen was presented for 516 ms, followed by a screen asking participants about their subjective experience of difficulty: “How much difficulty did you experience when responding to the arrow?”. They could answer either by pressing the “o” key with the ring finger of their right hand (“Rather more difficulty”) or by pressing the “m” key with the index finger of their right hand (“Rather less difficulty”). There was no time limit to answer this question. The inter-trial interval was 800 ms.
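The reported durations line up with whole numbers of refresh frames on the 60 Hz display described under Stimuli and apparatus. The sketch below converts the trial timeline into frame counts, assuming each event was shown for the nearest whole number of frames (an assumption; the text reports only milliseconds). The names `TRIAL_TIMELINE_MS` and `to_frames` are illustrative and do not come from the original study.

```python
REFRESH_HZ = 60  # vertical refresh rate reported for the CRT monitor

# Event durations in milliseconds, as reported in the procedure.
TRIAL_TIMELINE_MS = {
    "fixation": 1000,
    "prime": 34,
    "blank_after_prime": 34,
    "target": 116,
    "blank_before_rating": 516,
    "inter_trial_interval": 800,
}


def to_frames(duration_ms, refresh_hz=REFRESH_HZ):
    """Return the nearest whole number of refresh frames for a duration.

    Assumption: events are locked to the vertical refresh, so every
    duration is a multiple of one frame (about 16.7 ms at 60 Hz).
    """
    return round(duration_ms * refresh_hz / 1000)


frames = {event: to_frames(ms) for event, ms in TRIAL_TIMELINE_MS.items()}

for event, n_frames in frames.items():
    nominal_ms = n_frames * 1000 / REFRESH_HZ
    print(f"{event}: {n_frames} frames ({nominal_ms:.1f} ms)")
```

Under this assumption the 34 ms prime corresponds to two frames and the 116 ms target to seven frames.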

Each participant started with 20 practice trials in which the metacognitive question was omitted. Subsequently, the experimenter explained that participants had to rate their experience associated with a trial after each response. The experimenter encouraged participants to use all information available to them (e.g., difficulty, error-tendency, response fluency) to answer this question. Participants were informed that there would be an equal number of “more difficult” and “less difficult” trials, and they were encouraged to keep a balance between these responses. Participants then received 20 additional practice trials with the metacognitive question. After these two training phases, each participant performed eight blocks of 80 trials each. In each block, half of the trials were congruent (i.e., prime and target pointing in the same direction), and half were incongruent (i.e., prime and target pointing in opposite directions).

To ensure that primes were genuinely invisible, participants performed a detection task after the main experiment. In this task, they were instructed to categorize the direction of the prime arrows, instead of the target arrows. During the detection task, targets were neutral, with heads pointing in both directions, to prevent participants from responding to the target. It has been shown that these neutral targets provide a more sensitive test of prime visibility, compared to targets that are congruent or incongruent with the primes²⁵. After the targets, a blank screen was presented for 516 ms, followed by a question about the prime direction. The detection task comprised 100 trials.

Data analysis

The main aim of Experiment 1 was to examine how the variables response conflict, RT and response repetition affect primary task performance and subjective difficulty. Because RTs cannot be used as a predictor in an analysis where they already serve as the dependent variable, RT was omitted as a predictor from the analyses of task performance. Analyses were done by fitting mixed effects models to our data. The most important advantage of this method is that it allows analyzing the effect of RTs at the trial level. In traditional repeated measures approaches, one is obliged to create averages based on an arbitrary partitioning of the data (e.g., quartiles or tertiles), thereby losing a wealth of information. Furthermore, mixed models are generally more powerful than traditional approaches to data analysis, and better at handling unbalanced data (which is unavoidable given the subjective nature of our task) and at analyzing categorical outcome variables²⁶. In addition, error variance caused by between-subject differences can be accounted for by adding random slopes to the model. Although a full random effects structure has been suggested to be optimal²⁷, this often results in overparameterized models that fail to converge²⁸. Therefore, a model building strategy was used here. Random slopes were added for a variable only when this increased the model fit, as assessed by model comparison. To analyze RTs in responding to the target arrow, a linear mixed model approach was used, for which F statistics are reported and the degrees of freedom were estimated by Satterthwaite’s approximation, as implemented in the R library lmerTest²⁹. To analyze metacognitive responses, a logistic linear mixed model approach was used. Significance of each variable and the interactions between variables in explaining the metacognitive response was assessed by computing χ² statistics.

Model fitting was done in R³⁰, using the lme4 package³¹. Interpretation of interaction effects was done by applying contrasts using the multcomp package³², or by computing fitted values using the effects package³³.

Results

Primary task performance

To analyze reaction times, a linear mixed effects model was fitted predicting RTs on correct trials (96.7%), using equation (1), where X represents the fixed effects structure that is defined in equation (2), and Z the random effects structure defined in equation (3), for which the b is calculated for each participant i. The variables congruency (congruent or incongruent) and response repetition (repetition or alternation) were both factors with two levels.
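For orientation, a linear mixed model of the kind described here is conventionally written as follows. This is the standard textbook form, offered as a sketch rather than a reproduction of the published equations: the symbols $y_i$, $\beta$, $\varepsilon_i$, $\Sigma$ and $\sigma^2$ are assumptions, and the interaction term in the fixed effects is inferred from the fact that an interaction between congruency and response repetition is tested. The random effects structure $Z_i$ is left unspecified because the text states only that it was selected by model comparison.

```latex
\begin{align*}
  y_i &= X_i \beta + Z_i b_i + \varepsilon_i,
    \qquad b_i \sim \mathcal{N}(0, \Sigma),
    \quad \varepsilon_i \sim \mathcal{N}(0, \sigma^2 I) \\
  X_i \beta &= \beta_0
    + \beta_1\,\mathrm{congruency}
    + \beta_2\,\mathrm{repetition}
    + \beta_3\,(\mathrm{congruency} \times \mathrm{repetition})
\end{align*}
```

Here $y_i$ is the vector of RTs for participant $i$, $\beta$ holds the fixed effects shared by all participants, and $b_i$ holds the participant-specific deviations.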

To deal with outliers, trials in which the RT deviated more than 3 SDs from their condition-specific mean were excluded from further analysis (i.e., 2.0%). Results showed that RTs were significantly faster on congruent trials (M = 457 ms) than on incongruent trials (M = 525 ms), F(1,30) = 123.70, p < 0.001. There was also an effect of response repetition, F(1,30) = 5.03, p = 0.032, reflecting faster RTs on response repetitions (M = 484 ms) compared to response alternations (M = 495 ms). There was no interaction between both, F < 1.
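The exclusion rule can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes a condition is one congruency by response repetition cell and that the sample standard deviation is used, and it does not settle whether the means were computed per participant or across participants, which the text leaves open. The dictionary keys and the function name are hypothetical.

```python
from collections import defaultdict
from statistics import mean, stdev


def exclude_rt_outliers(trials, n_sd=3.0):
    """Keep trials whose RT lies within n_sd SDs of their condition mean.

    `trials` is a list of dicts with the hypothetical keys "congruency",
    "repetition" and "rt" (in ms). A condition is assumed to be one
    congruency x repetition cell.
    """
    rts_by_condition = defaultdict(list)
    for trial in trials:
        key = (trial["congruency"], trial["repetition"])
        rts_by_condition[key].append(trial["rt"])

    # Mean and sample SD per condition; cells with fewer than two
    # trials have no defined SD and are left untouched.
    limits = {
        key: (mean(rts), stdev(rts))
        for key, rts in rts_by_condition.items()
        if len(rts) >= 2
    }

    kept = []
    for trial in trials:
        key = (trial["congruency"], trial["repetition"])
        if key not in limits:
            kept.append(trial)
            continue
        condition_mean, condition_sd = limits[key]
        if abs(trial["rt"] - condition_mean) <= n_sd * condition_sd:
            kept.append(trial)
    return kept
```

Because the cut-off is computed within each cell, a slow incongruent trial is judged against other incongruent trials rather than against the faster congruent ones.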

To analyze the error rates, a generalized mixed model as shown in equation (1) using a logistic link function was fitted to all data predicting accuracy (wrong or correct), using the same fixed and random effects structure as specified in equations (2) and (3), respectively. More errors were made on incongruent trials (M = 3.2%) than on congruent trials (M = 0.8%), χ²(1) = 35.37, p < 0.001. There was no main effect of response repetition, p = 0.45, nor an interaction between both, p = 0.58.

Subjective difficulty

To examine the influence of the three variables on subjective difficulty judgments, a generalized mixed model as shown in equation (1) using a logistic link function was fitted on correct trials predicting subjective difficulty judgments (easy or difficult), using the fixed and random effects structures specified in equations (4) and (5), respectively. When the continuous predictor RT was entered as raw values, the fitted model was nearly unidentifiable; to deal with this, RTs were mean-centered and then scaled by the standard deviation, separately for each participant. To visually represent the findings, the proportion of ‘easy’ judgments is plotted as a function of the three independent variables in Fig. 2. To enhance visual interpretability, the data were divided into three bins, separately for each subject and each level of congruency.
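The within-participant standardization of RTs can be sketched as below. This is an illustration under stated assumptions rather than the original analysis code: the sample standard deviation is assumed (the text does not say which estimator was used), and the keys `"participant"`, `"rt"` and `"rt_z"` are hypothetical names.

```python
from collections import defaultdict
from statistics import mean, stdev


def standardize_rt_within_participant(trials):
    """Return copies of the trials with an added "rt_z" field.

    RTs are mean-centered and divided by the SD, separately for each
    participant. Each participant needs at least two trials with
    non-identical RTs for the SD to be defined and non-zero.
    """
    rts_by_participant = defaultdict(list)
    for trial in trials:
        rts_by_participant[trial["participant"]].append(trial["rt"])

    stats = {
        participant: (mean(rts), stdev(rts))
        for participant, rts in rts_by_participant.items()
    }

    standardized = []
    for trial in trials:
        participant_mean, participant_sd = stats[trial["participant"]]
        rt_z = (trial["rt"] - participant_mean) / participant_sd
        standardized.append({**trial, "rt_z": rt_z})
    return standardized
```

After this step a value of 1 means "one standard deviation slower than this participant's own average", which puts the RT predictor on a comparable scale across participants and close to the scale of the two-level factors.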

**Figure 2: Results from Experiment 1.**

The analysis showed a main effect of congruency, χ²(1) = 19.33, p < 0.001, with a higher proportion of ‘easy’ judgments on congruent (M = 85.3%) compared to incongruent trials (M = 56.6%). In Fig. 2, this main effect can be derived from the fact that the black lines, reflecting congruent trials, lie above the grey lines, reflecting incongruent trials. There was also a main effect of RT, χ²(1) = 45.74, p < 0.001, with increasing RTs leading to a decrease in the proportion of ‘easy’ judgments. This effect is reflected in the negative slopes in Fig. 2. Finally, there was a main effect of response repetition, χ²(1) = 16.61, p < 0.001, with a higher proportion of ‘easy’ judgments for response repetitions (M = 77.0%) compared to response alternations (M = 70.1%). Although not numerically as strong as the previous two effects, response repetitions have overall higher values on the y-axis (Fig. 2, left panel) compared to response alternations (Fig. 2, right panel). There was also a significant interaction between congruency and RT, χ²(1) = 6.39, p = 0.011, indicating that the effect of congruency on subjective difficulty is slightly larger when RTs are slower. No other effects were significant (all p’s > 0.13).

Because the instructions in Experiment 1 encouraged participants to use all information that was available to them, it is important to demonstrate that the current findings are not contingent on the exact instructions provided. Therefore, in the Supplementary Materials, we report the reanalysis of four additional studies that varied in small ways from Experiment 1, but that nevertheless produced highly similar results. In order to evaluate the effect of the instructions, the results of a replication study (reported first in the Supplementary Materials) are of most interest here. In that study, participants were instructed only to decide on each trial whether their response felt rather easy or rather difficult, without any reference to which cue they had to use. Notwithstanding this difference in instructions and some other marginal differences, the results replicated those of Experiment 1: congruency, reaction time and response repetition all reliably affected subjective difficulty judgments (see Supplementary Materials for full details).

Prime visibility

To ensure that the effect of congruency on subjective difficulty does not simply reflect the visual resemblance between prime and target, we used the data of the detection task to compute d’ as an index of prime visibility. Left-pointing primes were treated as signal, right-pointing primes as noise. A left response to a left-pointing prime was considered a hit; the same response to a right-pointing prime was considered a false alarm. Hit and false alarm proportions were computed by dividing the total number of hits and false alarms by the number of signals. Results showed that d’ did not differ from chance level performance (i.e., zero), d’ = 0.10, t(30) = 1.12, p = 0.24, suggesting that primes were truly invisible.
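The d’ computation for one participant can be sketched as below, using the standard signal detection formula d’ = z(hit rate) − z(false alarm rate). The sketch divides false alarms by the number of noise trials, which gives the same result as dividing by the number of signals whenever left- and right-pointing primes are equally frequent (the text does not report the split). The half-trial correction for rates of exactly 0 or 1 is a common convention assumed here, not something the text describes, and the function name is hypothetical.

```python
from statistics import NormalDist


def d_prime(hits, n_signal, false_alarms, n_noise):
    """Compute d' = z(hit rate) - z(false alarm rate).

    Counts of 0 or of the full number of trials are nudged by half a
    trial so that the inverse normal CDF stays finite (assumed
    correction; the source does not say how such cases were handled).
    """
    def rate(count, total):
        clamped = min(max(count, 0.5), total - 0.5)
        return clamped / total

    z = NormalDist().inv_cdf
    return z(rate(hits, n_signal)) - z(rate(false_alarms, n_noise))


# Hypothetical participant: 100 detection trials, assumed 50 left-pointing
# (signal) and 50 right-pointing (noise) primes.
example = d_prime(hits=27, n_signal=50, false_alarms=25, n_noise=50)
print(f"d' = {example:.2f}")
```

A participant who responds "left" equally often to both prime directions obtains d’ = 0, which is the chance level against which the group mean is tested.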

Experiments 2a and 2b

Experiment 1 showed that subjective difficulty judgments depend on multiple cues. First, trials were more frequently judged to be easy when they were congruent (compared to incongruent). Second, the proportion of ‘easy’ judgments linearly increased with decreasing RTs. Third, trials were more frequently judged to be easy when the response was a repetition of the previous trial (compared to an alternation). In Experiments 2a and 2b, it was tested whether the influence of these cues on the experience of difficulty can be changed by means of training. Participants were trained to rely more on response conflict (Experiment 2a) or RT (Experiment 2b) when providing their subjective difficulty judgment. Because response repetition is a very explicit cue once attention is directed towards it, it was not included in this metacognitive training. For this reason, and because the model did not converge when response repetition was entered as an additional factor into equation (8), this variable was excluded from further analysis.