Abstract
Background
Pediatric research is a diverse field that is constantly growing. Current machine learning advancements have prompted a technique termed text-mining. In text-mining, information is extracted from texts using algorithms. This technique can be applied to analyze trends and to investigate the dynamics in a research field. We aimed to use text-mining to provide a high-level analysis of pediatric literature over the past two decades.
Methods
We retrieved all available MEDLINE/PubMed annual data sets until December 31, 2018. Included studies were categorized into topics using text-mining.
Results
Two hundred and twenty-five journals were categorized as Pediatrics, Perinatology, and Child Health based on Scimago ranking for medicine journals. We included 201,141 pediatric papers published between 1999 and 2018. The most frequently cited publications were clinical guidelines and meta-analyses. We found that there is a shift in the trend of topics. Epidemiological studies are gaining more publications while other topics are relatively decreasing.
Conclusions
The topics in pediatric literature have shifted in the past two decades, reflecting changing trends in the field. Text-mining enables analysis of trends in publications and can serve as a high-level academic tool.
Impact
-
Text-mining enables analysis of trends in publications and can serve as a high-level academic tool.
-
This is the first study using text-mining techniques to analyze pediatric publications.
-
Our findings indicate that text-mining techniques enable better understanding of trends in publications and should be implemented when analyzing research.
This is a preview of subscription content
Access options
Subscribe to Journal
Get full journal access for 1 year
137,91 €
only 10,61 € per issue
Tax calculation will be finalised during checkout.
Buy article
Get time limited or full article access on ReadCube.
$32.00
All prices are NET prices.





Change history
21 July 2021
A Correction to this paper has been published: https://doi.org/10.1038/s41390-021-01644-x
References
Bergman, A. B. 50 years of pediatrics: 1948-1998. the journal in 1947 and 1997: a dramatic change. Pediatrics 102, 186–190 (1998).
Ozuah, P. O. Residency research requirement as a predictor of future publication productivity. J. Pediatr. 155, 1–2 (2009).
Alvira, C. M. et al. Enhancing the development and retention of physician-scientists in academic pediatrics: strategies for success. J. Pediatr. 200, 277–284 (2018).
Singh S. P., Swagata, K., Sudhir, S. M. & Singh V. P. The application of text mining algorithms in summarizing trends in anti-epileptic drug research. Int. J. Stat. Probability https://doi.org/10.5539/ijsp.v7n4p11 (2018).
Thuraisingham, B. M. Data Mining: Technologies, Techniques, Tools, and Trends (CRC Press, 1999).
Alfalqi, K. & Alghamdi, R. A survey of topic modeling in text mining. Int. J. Adv. Comput. Sci. Appl. https://doi.org/10.14569/IJACSA.2015.060121 (2015).
Hao, T. A bibliometric analysis of text mining in medical research. Soft Comput. 22, 7875–7892 (2018).
Song, M. Detecting the knowledge structure of bioinformatics by mining full-text collections. Scientometrics 96, 183–201 (2013).
Zhang, Y. et al. Trends in diatom research since 1991 based on topic modeling. Microorganisms https://doi.org/10.3390/microorganisms7080213 (2019).
Wang, S. H. et al. Text mining for identifying topics in the literatures about adolescent substance use and depression. BMC Public Health 16, 279–016 (2016).
N.I.H. of U.S. National Library of Medicine, download MEDLINE/PubMed data. www.nlm.nih.gov/databases/download/pubmed_medline.html (2020).
SCImago, (n.d.). SJR — SCImago journal & country rank. http://www.scimagojr.com (2020).
Blei, D. M. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003).
Keating, E. M. et al. Global disparities between pediatric publications and disease burden from 2006 to 2015. Glob. Pediatr. Health https://doi.org/10.1177/2333794X19831298 (2019).
Acknowledgements
We would like to thank MS. Amy Zhong at Icahn School of Medicine at Mount Sinai, New York for her expert assistance with the design of the tables and figure. No external funding was used for this research.
Author information
Affiliations
Contributions
S.L.-M., E.K., and Y.B. contributed to conception and design and acquisition of data. Y.B., I.B., S.S., S.F., R.S., and D.L.-E. contributed to interpretation of data. S.L.-M., E.K., and D.L.-E. drafted the article, revised it critically for important intellectual content, and approved the version to be published. I.B., S.F., and R.S. revised the article critically for important intellectual content.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Consent
No patient consent was needed in this study.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: The captions for Figures 1, 3, and 5 have been corrected.
Supplementary information
Rights and permissions
About this article
Cite this article
Levy-Mendelovich, S., Barbash, Y., Budnik, I. et al. Pediatric literature trends: high-level analysis using text-mining. Pediatr Res 90, 212–215 (2021). https://doi.org/10.1038/s41390-021-01415-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41390-021-01415-8