Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Pediatric literature trends: high-level analysis using text-mining

A Correction to this article was published on 21 July 2021

This article has been updated

Abstract

Background

Pediatric research is a diverse field that is constantly growing. Current machine learning advancements have prompted a technique termed text-mining. In text-mining, information is extracted from texts using algorithms. This technique can be applied to analyze trends and to investigate the dynamics in a research field. We aimed to use text-mining to provide a high-level analysis of pediatric literature over the past two decades.

Methods

We retrieved all available MEDLINE/PubMed annual data sets until December 31, 2018. Included studies were categorized into topics using text-mining.

Results

Two hundred and twenty-five journals were categorized as Pediatrics, Perinatology, and Child Health based on Scimago ranking for medicine journals. We included 201,141 pediatric papers published between 1999 and 2018. The most frequently cited publications were clinical guidelines and meta-analyses. We found that there is a shift in the trend of topics. Epidemiological studies are gaining more publications while other topics are relatively decreasing.

Conclusions

The topics in pediatric literature have shifted in the past two decades, reflecting changing trends in the field. Text-mining enables analysis of trends in publications and can serve as a high-level academic tool.

Impact

  • Text-mining enables analysis of trends in publications and can serve as a high-level academic tool.

  • This is the first study using text-mining techniques to analyze pediatric publications.

  • Our findings indicate that text-mining techniques enable better understanding of trends in publications and should be implemented when analyzing research.

This is a preview of subscription content

Access options

Buy article

Get time limited or full article access on ReadCube.

$32.00

All prices are NET prices.

Fig. 1: Citations/publications ratio per country.
Fig. 2: Number of articles per type of article.
Fig. 3: Citations/publications ratio per article type.
Fig. 4: Number of publications per topic over the 2 decades.
Fig. 5: Citations/publications ratio per topic.

Change history

References

  1. Bergman, A. B. 50 years of pediatrics: 1948-1998. the journal in 1947 and 1997: a dramatic change. Pediatrics 102, 186–190 (1998).

    CAS  PubMed  Google Scholar 

  2. Ozuah, P. O. Residency research requirement as a predictor of future publication productivity. J. Pediatr. 155, 1–2 (2009).

    Article  Google Scholar 

  3. Alvira, C. M. et al. Enhancing the development and retention of physician-scientists in academic pediatrics: strategies for success. J. Pediatr. 200, 277–284 (2018).

    Article  Google Scholar 

  4. Singh S. P., Swagata, K., Sudhir, S. M. & Singh V. P. The application of text mining algorithms in summarizing trends in anti-epileptic drug research. Int. J. Stat. Probability https://doi.org/10.5539/ijsp.v7n4p11 (2018).

  5. Thuraisingham, B. M. Data Mining: Technologies, Techniques, Tools, and Trends (CRC Press, 1999).

    Google Scholar 

  6. Alfalqi, K. & Alghamdi, R. A survey of topic modeling in text mining. Int. J. Adv. Comput. Sci. Appl. https://doi.org/10.14569/IJACSA.2015.060121 (2015).

  7. Hao, T. A bibliometric analysis of text mining in medical research. Soft Comput. 22, 7875–7892 (2018).

    Article  Google Scholar 

  8. Song, M. Detecting the knowledge structure of bioinformatics by mining full-text collections. Scientometrics 96, 183–201 (2013).

    Article  Google Scholar 

  9. Zhang, Y. et al. Trends in diatom research since 1991 based on topic modeling. Microorganisms https://doi.org/10.3390/microorganisms7080213 (2019).

  10. Wang, S. H. et al. Text mining for identifying topics in the literatures about adolescent substance use and depression. BMC Public Health 16, 279–016 (2016).

    Article  Google Scholar 

  11. N.I.H. of U.S. National Library of Medicine, download MEDLINE/PubMed data. www.nlm.nih.gov/databases/download/pubmed_medline.html (2020).

  12. SCImago, (n.d.). SJR — SCImago journal & country rank. http://www.scimagojr.com (2020).

  13. Blei, D. M. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003).

    Google Scholar 

  14. Keating, E. M. et al. Global disparities between pediatric publications and disease burden from 2006 to 2015. Glob. Pediatr. Health https://doi.org/10.1177/2333794X19831298 (2019).

Download references

Acknowledgements

We would like to thank MS. Amy Zhong at Icahn School of Medicine at Mount Sinai, New York for her expert assistance with the design of the tables and figure. No external funding was used for this research.

Author information

Affiliations

Authors

Contributions

S.L.-M., E.K., and Y.B. contributed to conception and design and acquisition of data. Y.B., I.B., S.S., S.F., R.S., and D.L.-E. contributed to interpretation of data. S.L.-M., E.K., and D.L.-E. drafted the article, revised it critically for important intellectual content, and approved the version to be published. I.B., S.F., and R.S. revised the article critically for important intellectual content.

Corresponding author

Correspondence to Sarina Levy-Mendelovich.

Ethics declarations

Competing interests

The authors declare no competing interests.

Consent

No patient consent was needed in this study.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The original online version of this article was revised: The captions for Figures 1, 3, and 5 have been corrected.

Supplementary information

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Levy-Mendelovich, S., Barbash, Y., Budnik, I. et al. Pediatric literature trends: high-level analysis using text-mining. Pediatr Res 90, 212–215 (2021). https://doi.org/10.1038/s41390-021-01415-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1038/s41390-021-01415-8

Search

Quick links