Introduction

Knowledge exchange activities between science and policy are driven by a need to address practical issues1. Several studies have highlighted barriers to effective translation from scientific evidence to policy2,3. In particular, scientists and policymakers often have different motivations and goals, which limit their collaboration4. Whilst the prospects for bringing these two communities’ motivations and goals into complete alignment are poor, scientists might reasonably gain a greater understanding of the motivations and goals of policymakers—and how the evidence they generate feeds into, or helps to achieve, them. Examining the questions which policymakers pose to scientists is instrumental to achieving this greater understanding. The ‘style’ of a question—the structure of the information sought (see below)—provides valuable indicators of what the asker is motivated to know, and what they might use that knowledge for5,6. Indeed, science-policy exchanges can often involve framing policy issues through a particular style of inquiry that articulates policymakers’ goals and the means of achieving them7. In light of this, the aim of this study is to examine the styles—and subject matters—of questions which policymakers pose to scientists, in order to expose any underlying patterns in these evidence requests. An understanding of these underlying patterns can potentially aid the integration of scientific evidence into policymaking through co-production8. Co-production requires a mutual understanding between scientists and policymakers9, which, in turn, requires clarity regarding any subject matter under discussion and the structure of the information sought. Furthermore, the integration of scientific evidence into policymaking is aided by tailoring evidence to the structure of the information sought by policymakers. To this end, the discussion will contain some advice for evidence tailoring.

Before considering the substantive lessons to be drawn from the literature on questions, a terminological confusion must be addressed. Within this domain, no universal agreement has yet been reached regarding the meanings of certain relevant terms. This can cause confusion, as the same term may be used to refer to distinct features of questions and/or answers. For example, Pomerantz5 uses the term ‘content’ to refer solely to the subject matter that a question/answer concerns. By contrast—as noted by Pomerantz5—Graesser, McMahen, and Johnson10 use the term ‘content’ to refer both to a question’s subject matter and its style. Where earlier studies are used to evidence the arguments in this paper, such terminological issues are disregarded in favour of the underlying point being made.

To begin, it is useful to consider two contrasting features of questions: subject matter and style. The subject matter of a sincere question indicates the information that the inquirer is interested in attaining, thereby indicating the kind of content which would be appropriate for a sincere answer5. For example, sincerely asking “what is net zero?” implies that one wants to know about the net-zero emissions goal. The style of a question—the structure of the information sought—indicates the understanding of the inquirer11, and what kind of answer is expected5,12,13. Asking a sincere question implies that the inquirer has enough of an understanding of the issue from which to build the question and interpret the answer, but does not know enough to make seeking the answer superfluous11. For instance, sincerely asking “what is net zero?” requires that the inquirer has at least heard of the term ‘net zero’ but does not have a complete understanding of its referent. Furthermore, the style of this question indicates that sincere answers should be structured as definitions. By contrast, sincerely asking “what do we need to do to achieve net zero?” requires that the inquirer has a basic understanding of what net zero is but not a complete understanding of how to bring it about. Moreover, the style of this question indicates that sincere answers should outline the procedure(s) which will bring about net zero.

Earlier work in psychology10,14,15, linguistics14, and information science5 provided the foundation for the types of analyses that have been used to develop taxonomies of questions. Importantly, question stems—such as “why…?”, “how…?”, “when…?”, and “what…?”—have not been the standard basis on which taxonomies are developed, because they are typically polysemous12,17. The ambiguity of question stems makes their application highly context-specific, which is why question classification systems have generally focused on question styles12.

The most practical approach to taxonomizing questions is to classify them according to their style. Lehnert13 originated this approach. Graesser, Person, and Huber12 later generated a simpler taxonomy of questions by style (the ‘Taxonomy of Question Styles’); Graesser, McMahen, and Johnson10 subsequently grouped these styles by the length of the expected answer (see Table 1). For example, “what does X mean?” was given as part of the abstract characterization (“abstract specification”) of the ‘definition’ style of question. Definition questions invite answers which specify the details—usually as long descriptions—that characterize a phenomenon or event. In contrast, “what caused some event to occur?” was given as the abstract characterization of the ‘causal antecedent’ style of question. Causal antecedent questions invite answers which outline the factors that brought about an event. Taxonomies of questions can be used to investigate applied scientific problems. In order to improve outcomes in a variety of domains, such taxonomies are used to understand how agents approach the task of structuring a problem or dilemma, what types of solutions they are expecting, and how their inquiries could be improved6.

Table 1 Two taxonomies of question styles.
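
To make the structure of such taxonomies concrete, the following minimal sketch (in Python) represents a handful of question styles as a simple lookup pairing each style with its abstract specification and the length of answer it invites. The entries are limited to styles named in this paper, and the representation is purely illustrative rather than a reproduction of the original taxonomies.

```python
# Illustrative sketch only: a minimal data structure pairing question styles
# with their abstract specifications and the length of answer they invite.
# Entries are limited to styles named in the text; this is not the full taxonomy.
from dataclasses import dataclass

@dataclass(frozen=True)
class QuestionStyle:
    abstract_specification: str
    expected_answer_length: str  # "short" or "long"

TAXONOMY = {
    "verification": QuestionStyle("Did X occur?", "short"),
    "definition": QuestionStyle("What does X mean?", "long"),
    "causal antecedent": QuestionStyle("What caused some event to occur?", "long"),
}

def expected_answer_length(style: str) -> str:
    """Look up the length of answer a given question style invites."""
    return TAXONOMY[style].expected_answer_length
```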

The Taxonomy of Question Styles (see Table 1) has proved fairly popular. It has been successfully applied within the education sector6,10,18,19. It has also been used as a foundation of, and supplement for, arguments made by other education researchers20,21,22. Moreover, it has played an applied, foundational, and/or supplemental role in studies analyzing web search strategies23,24,25, consumer health-related inquiries26, interpersonal exchanges27, and interview settings28.

Through an analysis of the frequency of questions generated, this taxonomy has been used to determine which types of questions are most likely to appear in a particular domain. Such information feeds into proposals regarding what improvements are necessary to support an effective evidence exchange process. In the education domain—where the Taxonomy of Question Styles has been used most often—it has aided in identifying the types of inquiries made by students, so that they can then be encouraged to formulate different styles of questions which enable a more substantive understanding of a topic6. For instance, efforts have often focused on shifting students away from verification-style questions (“did X occur?”) towards analytical questions—such as causal antecedent-style questions (“why did X occur?”)—to develop deeper understanding.

The theoretical underpinning of work analyzing the quality of questions has largely been informed by the ‘Graesser-Person-Huber (GPH) Scheme’6,12. It proposes that there are three dimensions on which a question should be assessed. Firstly, style (“content”): the structure of the information sought. Secondly, question-generation mechanism: the psychological processes—goals, plans, and knowledge—which bring about a question. The GPH Scheme lists four question-generation mechanisms: reducing, or correcting, a knowledge deficit; monitoring common ground; social coordination of action; and control of conversation and attention. The scheme holds that these categories are orthogonal to the style categories, since—in theory—a style of question might be motivated by any question-generation mechanism. For example, an inquirer might ask “what are the consequences of academic freedom?” to address a deficit in their knowledge. Alternatively, the same question might be asked to monitor the extent to which they share common ground with the responder. The GPH Scheme’s final dimension of assessment is ‘degree of specification’: the extent to which the information sought is made clear. A highly specific question is clear regarding what information is sought, whereas an under-specified question requires the responder to make inferences about which details are relevant to the inquirer.

Within cognitive psychology, the effective generation of questions has been associated with problem-solving ability, as well as with the learning of complex material6,29,30,31,32. Within social psychology, asking good questions has been shown to improve interpersonal exchanges33 and to increase one’s likability34,35. Many of the efforts to improve cognitive functions (e.g. problem solving, critical thinking, memory, and text comprehension) by improving questioning are based on two factors. Firstly, increasing the specificity of the question to ensure that the responder has the best chance of providing answers that are directly applicable. Secondly, encouraging ‘deep-reasoning questions’: those which invite a causal analysis6. In essence, this involves considering the cause-effect relationships between variables in order to examine the underlying structures that enable inferences to be made about what brings about observable outcomes36,37.

To date, there has been no empirical work examining the styles of questions that policymakers pose to scientific experts—including the types of questions that are asked, and the frequency with which they are asked. Once this is understood, it can be used to improve science-policy exchanges. Improvements can be made to the articulation of policy questions so that the value of the answers provided is maximized. Furthermore, scientists might find it easier to adapt their communication in order to focus on the evidence that policy audiences want from them. To address this gap, the present study analyzed policy questions that have been compiled by the Centre for Science and Policy (CSaP), at the University of Cambridge. CSaP is a knowledge brokerage which creates opportunities for public policymakers and academics—primarily scientists—to learn from each other. This is achieved through CSaP’s Policy Fellowships, as well as workshops, seminars, conferences, and professional development activities.

Applicants to CSaP’s main ‘Policy Fellowship Programme’ initially submit 3–6 questions which indicate the main policy problems they will explore throughout their Fellowship—along with a justification of their influence on public policy and their aims and objectives concerning the Fellowship. A panel of academics and civil servants review the applications and assess the candidates regarding how influential their role is, their intellectual capacity (to get the most out of 25 one-hour meetings with academics throughout the Fellowship), the extent to which their questions will interest academics, and the relevance of the contributions that might stem from addressing their questions. During an initial meeting with successful applicants, Fellows are provided with any feedback from the judging panel, including any suggested changes to their proposed questions. The Fellows are then asked to submit their finalized questions—if different from those submitted in their applications. Fellows spend five days in Cambridge for one-to-one meetings with academics. In general, Fellows visit Cambridge twice for this purpose, submitting 4–5 questions per trip (though sometimes the same questions are submitted for both trips). As a result of this process, CSaP has accumulated a database of policy questions submitted by over 400 Policy Fellows over 10 years.

The database was used to examine two properties of policy questions: (1) Which styles of questions are most frequently posed to expert scientists? (2) Is there a relationship between the subject matter and style of questions posed to expert scientists? By answering these questions, it is possible to build up a profile of what evidence policymakers invite scientific experts to provide, as well as what that evidence is applied to.

Methods

At the time the analysis was conducted, there were a total of 4319 questions for the period 05–09–2011 to 23–09–2021. These were generated from a total of 443 different Policy Fellowships taken up at CSaP. The questions were then cleaned. In particular, this involved removing statements and duplicate questions (posed by the same Policy Fellow over consecutive trips to Cambridge). Once this filtering had been applied, a total of 2927 questions from 409 Policy Fellows were submitted for analysis. Each question was recorded with the following details: (1) a unique number to identify it (1–2927), (2) whether the Policy Fellow was from a public or private sector organization (public, private), (3) the year that the question was submitted, and (4) the word length of the question.
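
As a rough illustration of this cleaning step, the sketch below assumes the raw questions sit in a pandas DataFrame with hypothetical column names (fellow_id, question_text); the rule used here to drop statements (the absence of a question mark) is an assumption for illustration, not necessarily the criterion applied to the CSaP database.

```python
# Minimal sketch of the cleaning and metadata step, under the assumptions
# stated above; column names and the statement-detection rule are hypothetical.
import pandas as pd

def clean_questions(raw: pd.DataFrame) -> pd.DataFrame:
    df = raw.copy()
    df["question_text"] = df["question_text"].str.strip()

    # Assumption: entries without a question mark are treated as statements.
    df = df[df["question_text"].str.contains(r"\?", regex=True, na=False)]

    # Remove duplicate questions posed by the same Fellow across trips.
    df = df.drop_duplicates(subset=["fellow_id", "question_text"])

    # Metadata recorded for each question: unique ID and word length.
    df = df.reset_index(drop=True)
    df["question_id"] = df.index + 1
    df["word_length"] = df["question_text"].str.split().str.len()
    return df
```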

To ensure an appropriate classification system was applied to the questions, the questions were classified using an iterative approach38. First, the Taxonomy of Question Styles was applied to all 2927 questions. This initial attempt to classify the policy questions served two purposes: to identify categories from the original taxonomy that are applicable to policy questions, and to identify new categories where needed. From this, a second taxonomy was developed, which included categories from the Taxonomy of Question Styles as well as some new ones.

The development of this revised taxonomy started from the principle that the question style categories deployed in a policy-specific taxonomy need to be useful to policymakers. Usefulness was inferred from each question style’s frequency in the coding of the questions. As shown in Table 2, question styles with a low frequency were not carried over from the Taxonomy of Question Styles to the revised taxonomy. (Where possible, the revised taxonomy subsumed questions from these omitted categories into other style categories.) In addition, the revised taxonomy included several categories of question styles not present in the original taxonomy, to reflect the kinds of questions that occurred frequently—such as those inviting forecasts.

Table 2 Frequency (%) of questions per style category (* indicates categories that were omitted in the development of the revised taxonomy).

This ‘Revised Taxonomy of Question Styles’ (see Table 3) was then used to classify all 2927 questions, and two independent coders were used to validate the taxonomy. Each coded a subset of questions (n = 1224), the results of which are presented in Table 4. Applying a stringent process for agreement, with only exact matches recorded, both coders agreed on 47.55% of the questions (n = 582). However, the coders identified the issue that the differences between some of the Revised Taxonomy of Question Styles’ categories are superficial. For example, instrumental/procedural and enablement have superficially different subjects and predicates, yet both drive at the same idea: they seek to identify things which can be used to achieve some goal. This violates the principle of ‘qualitative parsimony’39: categories should not be inflated beyond necessity. Taking this into account, along with the related point that some of the categories were broad enough to span much of the specification of others, the next step was to determine how many of the coded questions revealed matches based on feasible overlaps. This identified an additional 331 matches between coders, which increased the total level of agreement to 74.59%. Categories with superficial differences were merged to achieve mutual exclusivity. The resulting ‘Taxonomy of Policy Questions’ (see Table 1) was then used to analyze the 2927 questions that are reported in the results section.

Table 3 The Revised Taxonomy of Question Styles.
Table 4 Frequency (%) of questions when classified according to the Revised Taxonomy of Question Styles.
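
As an illustration of how the two-step agreement figures can be computed, the sketch below takes the two coders’ labels, counts exact matches, and then re-counts after mapping superficially distinct categories onto a shared label; the merge map shown (treating enablement as equivalent to instrumental/procedural) is only an example of the feasible overlaps described above.

```python
# Illustrative sketch of the two-step inter-coder agreement calculation.
# The merge map is an example of the 'feasible overlaps' described in the text.
from typing import Dict, List, Tuple

def agreement(coder_a: List[str], coder_b: List[str],
              merge_map: Dict[str, str]) -> Tuple[float, float]:
    """Return (exact agreement, agreement after merging overlapping categories)."""
    assert len(coder_a) == len(coder_b)
    n = len(coder_a)

    exact = sum(a == b for a, b in zip(coder_a, coder_b))

    def merged(label: str) -> str:
        # Map superficially distinct categories onto a shared label.
        return merge_map.get(label, label)

    feasible = sum(merged(a) == merged(b) for a, b in zip(coder_a, coder_b))
    return exact / n, feasible / n

# Example merge: enablement treated as equivalent to instrumental/procedural.
example_merge = {"enablement": "instrumental/procedural"}
```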

The aim of the content analysis was to examine whether certain subjects lent themselves to particular question styles. This involved looking at the domains of the organizations to which the Policy Fellows belonged, in order to narrow down the subjects that informed the content analysis. There were seven common types of policy subject: Artificial Intelligence (AI), Economics and Finance, Education, Environment, Defence and Security, Health, and Technology/Manufacturing. From this, several associated terms were identified for each subject38. Each question was coded as “1” for a given subject if that subject—or any of its associated terms—appeared at least once in the question. For some questions, multiple associated terms were found. To avoid skewing the data in such cases, the question was still coded as “1” to reflect that it was associated with the subject, regardless of how many associated terms were present. Trends in subjects over time were not analyzed because the policy interests/positions of the Fellows were not controlled for; consequently, in some years the data are skewed towards particular subjects by virtue of the interests/positions of the Fellows at the time.
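
A simplified version of this coding rule is sketched below; the subjects come from the list above, but the associated terms shown are hypothetical stand-ins for those identified in the study.

```python
# Sketch of the binary subject-coding rule: a question scores 1 for a subject if
# any associated term appears at least once, regardless of how many terms match.
# The term lists below are hypothetical placeholders.
from typing import Dict, List

SUBJECT_TERMS: Dict[str, List[str]] = {
    "AI": ["artificial intelligence", "machine learning", "algorithm"],
    "Health": ["health", "nhs", "disease"],
    "Environment": ["climate", "net zero", "biodiversity"],
    # ... remaining subjects and their associated terms
}

def code_subjects(question: str) -> Dict[str, int]:
    text = question.lower()
    return {
        subject: int(any(term in text for term in terms))
        for subject, terms in SUBJECT_TERMS.items()
    }
```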

Results

A general point concerning the statistical analysis of the dataset is that—due to the nature of the dataset—the analyses were run to gather a general impression of the pattern of findings rather than to draw firm conclusions. Inferential statistics were used with caution, given that in many cases the data violated basic assumptions of the tests (e.g. independence).

To begin, while there is an uneven distribution of policymakers across private (20%) and public sector (80%) organizations, a simple analysis indicated that there is no significant difference between the two groups in the frequency of question classes, χ2 (6, N = 2927) = 10.16, p = 0.12, Cramer’s V = 0.06. Given this, the two groups of policymakers were collapsed for the remainder of the analyses. Generally, there were more questions that invited unbounded answers (76%) than bounded answers (24%), χ2 (1, N = 2927) = 890.96, p < 0.001. A further analysis indicated significant differences in the distribution of questions across the seven subordinate categories, χ2 (6, N = 2927) = 110.54, p < 0.001, with the most frequently generated type of question being instrumental/procedural (see Table 5).

Table 5 Summary: frequencies (%) and mean word length (SD) of questions according to superordinate and subordinate categories of the Taxonomy of Policy Questions.
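
For readers who wish to reproduce this style of analysis on their own data, the sketch below shows how such chi-square tests and Cramer’s V can be computed with scipy; the contingency table is filled with hypothetical counts, and the bounded/unbounded counts are approximate reconstructions from the percentages reported above rather than the exact figures.

```python
# Sketch of the chi-square analyses described above. The 2 x 7 sector-by-class
# table uses hypothetical counts; the bounded/unbounded counts are approximate
# reconstructions from the reported percentages (24% / 76% of N = 2927).
import numpy as np
from scipy.stats import chi2_contingency, chisquare

# Sector (public, private) x seven question classes: hypothetical counts.
sector_by_class = np.array([
    [380, 560, 310, 300, 250, 210, 330],   # public
    [100, 140,  75,  70,  60,  50,  92],   # private
])
chi2, p, dof, expected = chi2_contingency(sector_by_class)

# Cramer's V for the sector x class association.
n = sector_by_class.sum()
cramers_v = np.sqrt(chi2 / (n * (min(sector_by_class.shape) - 1)))

# Goodness-of-fit test of bounded vs unbounded questions against an equal split.
bounded, unbounded = 702, 2225
chi2_gof, p_gof = chisquare([bounded, unbounded])
```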

Bounded questions (M = 22.53, SD = 15.66) are longer than unbounded questions (M = 20.42, SD = 11.49), t (2925) = 3.68, p < 0.001, d = 0.15, BF = 0.05, but the effect size and Bayes factor indicate that this is a weak difference. Looking at the average word counts for each of the seven classes of questions, the example/explaining class appears to be the outlier (M = 14.93, SD = 9.70). Applying the Bonferroni correction, and comparing against the other six question classes, example/explaining questions were significantly shorter than causal analysis questions, t (975) = 8.27, p < 0.005, d = 0.53, BF13 = 1.11, instrumental/procedural questions, t (1371) = 11.27, p < 0.005, d = 0.65, BF25 = 1.23, explaining/asserting value judgments, t (753) = 9.08, p < 0.005, d = 0.67, BF16 = 2.41, verification/qualification, t (877) = 8.67, p < 0.005, d = 0.58, BF15 = 5.16, forecasting, t (595) = 6.15, p < 0.005, d = 0.57, BFz = 1.94, and comparison questions, t (535) = 6.04, p < 0.005, d = 0.67, BF7 = 3.45. Overall, the word length of a question does not appear to be indicative of the type of answer it invites.
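
A sketch of how these pairwise comparisons can be run is given below, assuming the word counts for each question class are held in a dictionary of arrays; it covers the t-tests, a Bonferroni-corrected alpha, and Cohen’s d, but not the Bayes factors, which would require an additional package.

```python
# Sketch of the Bonferroni-corrected pairwise word-length comparisons.
# `lengths` is assumed to map each question class to an array of word counts;
# Bayes factors (reported above) are not computed here.
import numpy as np
from scipy.stats import ttest_ind

def cohens_d(x: np.ndarray, y: np.ndarray) -> float:
    """Cohen's d using a pooled standard deviation."""
    nx, ny = len(x), len(y)
    pooled_var = ((nx - 1) * x.var(ddof=1) + (ny - 1) * y.var(ddof=1)) / (nx + ny - 2)
    return float((x.mean() - y.mean()) / np.sqrt(pooled_var))

def compare_against_baseline(lengths: dict, baseline: str = "example/explaining",
                             alpha: float = 0.05) -> dict:
    """Student's t-tests of one class against every other, Bonferroni-corrected."""
    others = [c for c in lengths if c != baseline]
    corrected_alpha = alpha / len(others)   # Bonferroni correction
    results = {}
    for other in others:
        x, y = np.asarray(lengths[baseline]), np.asarray(lengths[other])
        t, p = ttest_ind(x, y)              # equal variances assumed
        results[other] = {"t": t, "p": p, "d": cohens_d(x, y),
                          "significant": p < corrected_alpha}
    return results
```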

There were two main ways in which the content of the questions was examined. The first was the frequency with which the seven key subjects appeared in the questions. The content analysis identified at least one of the seven key subjects in 61% of the questions (n = 1786/2927). Of these, 951 questions contained exactly one subject (n = 951/2927, 32.49%); among them, the most commonly occurring subjects were AI (n = 222/951, 23.34%) and Technology/Manufacturing (n = 205/951, 21.56%). A total of 595 questions had a combination of two subjects present, with the most common pairing being Environment and Economics/Finance (n = 180/595, 30.25%). A total of 193 questions had three subjects in combination, with the most common triple being AI, Environment, and Economics/Finance (n = 32/193, 16.58%). A total of 42 questions had four subjects in combination, with the most common quadruplet being AI, Technology/Manufacturing, Environment, and Economics/Finance (n = 7/42, 16.67%). A total of four questions contained five subjects and one question contained six subjects. None contained all seven subjects.

The second way in which the content of the questions was examined was how often the various question styles appeared within the set of questions for each subject. Since multiple subjects sometimes appeared within the same question, independence was violated, which prevented any categorical inferential analyses. Nonetheless, it was possible to gain an overall impression of the most common class of question in which different subjects appeared. All questions that were coded by subject (n = 1786) were classified into the seven different classes of questions, and this was then repeated for questions in which only one subject appeared (n = 951)38. Classifying the questions by subject and by style on these two sets provided a basis for determining the consistency of any patterns detected. Looking across both classification methods, the most common question style for all seven main subjects was instrumental/procedural (average 34%). This may not be a surprise given the base rate of this class of question. Where subjects differed was in the second most common question class in which they appeared. When classifying all questions coded by subject, the second most common question class for six of the subjects was causal analysis (average 20%), the exception being AI, for which it was verification/qualification (16.67%). When classifying the single-subject questions, the second most common class was causal analysis (average 21%) for AI, Environment, and Defence/Security. For Economics/Finance, Education, and Technology/Manufacturing the second most common class was example/explanation (average 19%), and for Health it was verification/qualification (23.14%). Overall, the indication from the examination of content by question class is that all seven subjects most commonly appeared in instrumental/procedural questions; thereafter, the subjects commonly appeared in other unbounded question styles (e.g. causal analysis, example/explanation), with few appearing commonly in bounded question styles (i.e. verification/qualification).

Discussion

From a database of 2927 policy questions, classified according to a taxonomy that has its roots in research on the psychology of questions, we find that: (1) the two most frequent question styles invite answers that address causal-analytic and instrumental/procedural matters; and (2) regardless of the policy subject, the most common style of answer that policymakers invited was one informing instrumental/procedural questions. This indicates that the questions policymakers commonly present in exchanges with scientists are deep-reasoning questions, asked not just to reduce or correct knowledge deficits, but to elicit knowledge for specific purposes—such as informing what policy interventions could be taken.

This has clear prescriptive implications for scientists who wish to participate in the co-production of policy—and specifically the integration of scientific evidence into policymaking. By tailoring their evidence to these most common policy question styles, scientists might reasonably hope to maximize their chances of success. The abstract specifications of these styles can be used for this purpose. For example, scientists might ask themselves: is there an obvious policy goal that this research might help to achieve? However, it may still be necessary to adapt this tailoring to the specific interests of the policymakers they engage with.

The fact that the two most frequently generated classes of questions were causal-analytic (e.g. understanding mechanisms) and instrumental/procedural (e.g. interventions) reveals important information regarding the main motivations and interests of policymakers. Inviting answers that expose cause-effect relationships between variables is also key to examining the underlying structures that enable inferences to be made about what brings about observable outcomes36,37. This aligns closely with work in cognitive psychology on causal reasoning, which has consistently shown how causal-analytic representations can affect decision-making37,40,41, problem solving42,43, moral reasoning44,45, perception46,47, interpretation of statistical information48,49, and evidential reasoning50,51. Recently, the application of causal-analytic approaches has been extended to policymaking52,53,54,55,56. This work suggests that, in order to interpret the effects of a policy intervention, what is first needed is to decompose the context in which the intervention is introduced into its causal factors (i.e. the variables that will support, as well as inhibit, the efficacy of the intervention). Achieving this requires formulating questions that concern the mechanisms which can bring about change in a desired direction, and what outcomes need to be measured to determine the causal link between the intervention and the outcome. Thus, there is a clear relationship between causal analysis and instrumental/procedural/enablement reasoning: the former is a means to the latter. Scientists wishing to engage with policymakers might keep this framework for interpreting the effects of a policy intervention in mind.

As the results show, policymakers do invite answers that are causal-analytic in nature, but such questions are half as popular as those which invite answers informing how to achieve specific outcomes (i.e. instrumental/procedural questions). This finding can be contextualized in light of the relationship between causal analysis and procedural reasoning: the most popular questions posed by policymakers—within both the public and private sectors—were those whose answers inform how to achieve specific outcomes, whether directly, or by providing a causal analysis which is instrumental to this process. Given the importance of causal analysis in determining the potential success of policy interventions, scientists might consider framing their answers in causal-analytic terms. Furthermore, depending on the reception of their instrumental/procedural questions, policymakers may consider increasing the number of causal analysis-style questions they ask.

Policymakers’ preference for asking instrumental/procedural questions is also relevant to several of the academic literatures on policymaking. It is consistent with several characterizations of ‘evidence-based policy’. This concept has been characterized in a strictly means-end way and, more broadly, as the complex interaction of evidence and individual, professional, and political goals57,58. Both characterizations allow some role for policymakers’ preference for instrumental/procedural questions. The result also provides some backing for the claim that the scientific and policy communities are divided. In its weaker form, this simply amounts to the claim that scientists and policymakers have different goals and motivations4. In its stronger form, it amounts to the claim that scientists and policymakers constitute two distinct communities that are poorly connected, motivated by different incentives, operate under different rules, and suffer from communication problems59. In either form, this claim might explain the poor fit between researchers’ assembling and packaging of information and policymakers’ practical needs60. However, the stronger claim (the ‘two communities’ theory) has faced robust criticism. In particular, it is hard to provide a specific characterization of the theory’s titular communities which is consistent with the data61,62. Thus, perhaps this result might be more fruitfully associated with the weaker claim, as part of a nuanced account of science-policy interaction. Finally, as the previous paragraph indicates, this result could be of use in the policy studies project of developing ‘bridging instruments’: tools which help to bring academics and policymakers closer together61.

Other insights from the analysis of the questions are that more unbounded (long, open-ended) questions were generated than bounded (short, closed) questions. Given that there were fewer classes of bounded question styles than unbounded question styles, one might think it inevitable that fewer bounded questions would be identified. However, previous work provides evidence that, independent of the number of question categories corresponding to short versus long answer types, some domains generate more short answer types than long answer types. Domains in which this seems to hold include education6, interviewing28, and health inquiries26. Thus, the fact that more unbounded than bounded question styles were generated may reflect the properties of the domain in which the questions were asked—the policy domain—rather than the taxonomic structure used to classify the questions. In the main, policymakers tended to ask questions directed towards detailed answers. Often this meant steering away from questions that constrain answers to providing an estimate about a future outcome (forecasting), verifying a particular understanding of an issue (verification/qualification), or outlining the strengths/weaknesses and costs/benefits of a particular issue (comparison). The lesson for scientists wishing to participate in the co-production of policy is that answers less constrained than forecasts, verifications/qualifications, and/or comparisons might be required to address policymakers’ needs.

A final point to consider concerns important barriers that limit potential science-policy exchanges, thereby hindering the co-production of policy. Since co-production requires a mutual understanding between scientists and policymakers9, a lack of understanding of the needs and goals of one’s audience is an important barrier which must be surmounted. Splitting this task into understanding the subject matter under discussion and the structure of the information sought might aid its completion. Moreover, treating the process as iterative—whereby the answer to the question posed is the first step in a dialogue which establishes mutual ground on which to then revisit the question and how it can be addressed—is also important. Another potential barrier is that scientists have concerns about the possible blurring of lines between providing expertise and advocating4,63,64. The findings from this study suggest that this concern may be warranted, given that the most common type of answer which policymakers invited from scientists was one involving suggestions for interventions (e.g. methods of measurement, plans of action, types of instruments) that serve particular goals. While addressing questions of this type may lead to more impact for the scientific knowledge provided, it may also draw scientists into making recommendations, rather than presenting policymakers with factors to consider.