Abstract
How does scientific knowledge grow? This question has occupied a central place in the philosophy of science, stimulating heated debates but yielding no clear consensus. Many explanations can be understood in terms of whether and how they view the expansion of knowledge as proceeding through the accretion of scientific concepts into larger conceptual structures. Here we examine these views empirically by analysing 2,605,224 papers spanning five decades from both the social sciences (Web of Science) and the physical sciences (American Physical Society). Using natural language processing techniques, we create semantic networks of concepts, wherein noun phrases become linked when used in the same paper abstract. We then detect the core/periphery structures of these networks, wherein core concepts are densely connected sets of highly central nodes and periphery concepts are sparsely connected nodes that are highly connected to the core. For both the social and physical sciences, we observe increasingly rigid conceptual cores accompanied by the proliferation of periphery concepts. Subsequently, we examine the relationship between conceptual structure and the growth of scientific knowledge, finding that scientific works are more innovative in fields with cores that have higher conceptual churn and with larger cores. Furthermore, scientific consensus is associated with reduced conceptual churn and fewer conceptual cores. Overall, our findings suggest that while the organization of scientific concepts is important for the growth of knowledge, the mechanisms vary across time.
This is a preview of subscription content, access via your institution
Access options
Access Nature and 54 other Nature Portfolio journals
Get Nature+, our best-value online-access subscription
$29.99 / 30 days
cancel any time
Subscribe to this journal
Receive 12 digital issues and online access to articles
$119.00 per year
only $9.92 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
Data availability
The WoS data and the APS data are available from the Web of Science and the American Physical Society, respectively, but restrictions apply to the availability of these data, which were used under licence for the current study and so are not publicly available. If you are interested in accessing the WoS data, you can request access to the API through Clarivate, which requires an additional subscription or permission (https://clarivate.com/products/scientific-and-academic-research/research-discovery-and-workflow-solutions/webofscience-platform/web-of-science-core-collection/). For access to the APS data, you can request permission directly from their website (https://journals.aps.org/datasets/).
Code availability
The Python v.3 and Stata v.18 code we used to analyse and visualize the data for the current study are publicly available via Zenodo at https://doi.org/10.5281/zenodo.11533199 (ref. 49).
References
Price, D. J. d. S. Science since Babylon (Yale Univ. Press, 1961).
Price, D. J. d. S. Little Science, Big Science (Columbia Univ. Press, 1963).
Bornmann, L., Devarakonda, S., Tekles, A. & Chacko, G. Are disruption index indicators convergently valid? The comparison of several indicator variants with assessments by peers. Quant. Sci. Stud. 1, 1242–1259 (2020).
Milojević, S. Quantifying the cognitive extent of science. J. Informetr. 9, 962–973 (2015).
Tabah, A. N. Literature dynamics: studies on growth, diffusion, and epidemics. Annu. Rev. Inf. Sci. Technol. 34, 249–286 (1999).
Kuhn, T. S. The Structure of Scientific Revolutions (Univ. Chicago Press, 1962).
Lakatos, I. & Musgrave, A. Criticism and the Growth of Knowledge: Proceedings of the International Colloquium in the Philosophy of Science, London, 1965 Vol. 4 (Cambridge Univ. Press, 1970).
Laudan, L. Progress and Its Problems: Toward a Theory of Scientific Growth (Univ. California Press, 1978).
Popper, K. R. Conjectures and Refutations: The Growth of Scientific Knowledge (Routledge & Kegan Paul, 2002).
Cole, S. Why sociology doesn’t make progress like the natural sciences. Sociol. Forum 9, 133–154 (1994).
Cole, S. Disciplinary knowledge revisited: the social construction of sociology. Am. Sociol. 37, 41–56 (2006).
Gonzalez, W. J. Prediction and Novel Facts in the Methodology of Scientific Research Programs 103–124 (Springer International, 2015).
Chu, J. S. G. & Evans, J. A. Slowed canonical progress in large fields of science. Proc. Natl Acad. Sci. USA 118, e2021636118 (2021).
Newman, M. E. J. Scientific collaboration networks. ii. Shortest paths, weighted networks, and centrality. Phys. Rev. E 64, 016132 (2001).
Latour, B. Science in Action: How to Follow Scientists and Engineers through Society (Harvard Univ. Press, 1987).
Lakatos, I., Worrall, J., Currie, G. & Currie, P. The Methodology of Scientific Research Programmes: Philosophical Papers Vol. 1 (Cambridge Univ. Press, 1978).
Kojaku, S. & Masuda, N. Finding multiple core–periphery pairs in networks. Phys. Rev. E 96, 052313 (2017).
Borgatti, S. P. & Everett, M. G. Models of core/periphery structures. Soc. Netw. 21, 375–395 (2000).
Funk, R. J. & Owen-Smith, J. A dynamic network measure of technological change. Manage. Sci. 63, 791–817 (2017).
Mulkay, M. J., Gilbert, G. N. & Woolgar, S. Problem areas and research networks in science. Sociology 9, 187–203 (1975).
Wimsatt, W. C. Reductionism and its heuristics: making methodological reductionism honest. Synthese 151, 445–475 (2006).
Wu, L., Wang, D. & Evans, J. A. Large teams develop and small teams disrupt science and technology. Nature 566, 378–382 (2019).
Shwed, U. & Bearman, P. S. The temporal structure of scientific consensus formation. Am. Sociol. Rev. 75, 817–840 (2010).
Mayo, L. C., McCue, S. W. & Moroney, T. J. Gravity-driven fingering simulations for a thin liquid film flowing down the outside of a vertical cylinder. Phys. Rev. E 87, 053018 (2013).
Jones, B. F. The burden of knowledge and the ‘death of the Renaissance Man’: is innovation getting harder? Rev. Econ. Stud. 76, 283–317 (2009).
Gordon, R. J. The Rise and Fall of American Growth: The U.S. Standard of Living since the Civil War revised edn (Princeton Univ. Press, 2016).
Bhattacharya, J. & Packalen, M. Stagnation and Scientific Incentives Working Paper No. 26752 (National Bureau of Economic Research, 2020).
Fink, T., Reeves, M., Palma, R. & Farr, R. S. Serendipity and strategy in rapid innovation. Nat. Commun. 8, 2002 (2017).
Tria, F., Loreto, V., Servedio, V. & Strogatz, S. The dynamics of correlated novelties. Sci. Rep. 4, 5890 (2014).
Bloom, N., Jones, C. I., Van Reenen, J. & Webb, M. Are ideas getting harder to find? Am. Econ. Rev. 110, 1104–1144 (2020).
Horgan, J. The End of Science: Facing the Limits of Knowledge in the Twilight of the Scientific Age (Basic Books, 2015).
Jones, B. F. & Weinberg, B. A. Age dynamics in scientific creativity. Proc. Natl Acad. Sci. USA 108, 18910–18914 (2011).
Duncker, K. On problem solving. Psychol. Monogr. 58, i–113 (1945).
Jansson, D. G. & Smith, S. M. Design fixation. Des. Stud. 12, 3–11 (1991).
Maier, N. R. F. Reasoning in humans: II. The solution of a problem and its appearance in consciousness. J. Compar. Psychol. 12, 181–194 (1931).
Smith, S. M., Ward, T. B. & Schumacher, J. S. Constraining effects of examples in a creative generation task. Mem. Cogn. 21, 837–845 (1993).
Cole, S. Making Science: Between Nature and Society (Harvard Univ. Press, 1995).
Van Rossum, G. & Drake, F. L. Python 3 Reference Manual. (CreateSpace, 2009).
MariaDB Foundation. MariaDB. https://mariadb.com/ (2023).
Mongeon, P. & Paul-Hus, A. The journal coverage of Web of Science and Scopus: a comparative analysis. Scientometrics 106, 213–228 (2016).
Tennant, J. P. Web of Science and Scopus are not global databases of knowledge. Eur. Sci. Ed. 46, e51987 (2020).
Christianson, N. H., Sizemore Blevins, A. & Bassett, D. S. Architecture and evolution of semantic networks in mathematics texts. Proc. R. Soc. A 476, 20190741 (2020).
Dworkin, J. D., Shinohara, R. T. & Bassett, D. S. The emergent integrated network structure of scientific research. PLoS ONE 14, e0216146 (2019).
Rule, A., Cointet, J.-P. & Bearman, P. S. Lexical shifts, substantive changes, and continuity in State of the Union discourse, 1790–2014. Proc. Natl Acad. Sci. USA 112, 10837–10844 (2015).
Honnibal, M., Montani, I., Van Landeghem, S. & Boyd, A. spaCy: industrial-strength natural language processing in Python. Zenodo https://zenodo.org/records/10009823 (2020).
DeWilde, B. textacy documentation (Chartbeat, Inc., 2021).
Hofstra, B. et al. The diversity–innovation paradox in science. Proc. Natl Acad. Sci. USA 117, 9284–9291 (2020).
Kojaku, S. & Masuda, N. Core–periphery structure requires something else in the network. New J. Phys. 20, 043012 (2018).
Kedrick, K., Levitskaya, E. & Funk, R. J. Conceptual structure and the growth of scientific knowledge. Zenodo https://doi.org/10.5281/zenodo.11533199 (2024).
Davis, R. L. Quantum turbulence. Phys. Rev. Lett. 64, 2519–2522 (1990).
Acknowledgements
We thank the National Science Foundation for financial support of work related to this project (grants no. 1829168 to R.J.F and no. 1932596 to R.J.F). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. We also thank D. Hirschman, M. Park and Y. J. Kim for feedback on an earlier version of this work, and T. Gebhart for many helpful conversations and assistance with data and computation. Our work was presented as a poster at the 2nd Annual International Conference on the Science of Science and Innovation, as a poster at the 43rd Annual Meeting of the Cognitive Science Society, as a lightning talk at Networks 2021: A Joint Sunbelt and NetSci Conference, and as a poster at the 3rd North American Social Networks Conference.
Author information
Authors and Affiliations
Contributions
The study was conceptualized and designed by K.K., E.L. and R.J.F. The data analysis was conducted by K.K. and R.J.F. The manuscript was initially drafted by K.K., E.L. and R.J.F., with subsequent revisions made by K.K. and R.J.F.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Human Behaviour thanks Sadamori Kojaku, Marc Santolini and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Concepts extracted from the text of an abstract.
This figure shows an example abstract from the APS data; the highlighted text indicates single-word and multi-word noun phrases identified as concepts using our extraction algorithm. Reproduced with permission from ref. 50, American Physical Society.
Supplementary information
Supplementary Information
Supplementary Figs. 1–9 and Tables 1–3.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Kedrick, K., Levitskaya, E. & Funk, R.J. Conceptual structure and the growth of scientific knowledge. Nat Hum Behav (2024). https://doi.org/10.1038/s41562-024-01957-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41562-024-01957-x