Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence

Liang, Huiying; Tsui, Brian Y.; Ni, Hao; Valentim, Carolina C. S.; Baxter, Sally L.; Liu, Guangjian; Cai, Wenjia; Kermany, Daniel S.; Sun, Xin; Chen, Jiancong; He, Liya; Zhu, Jie; Tian, Pin; Shao, Hua; Zheng, Lianghong; Hou, Rui; Hewett, Sierra; Li, Gen; Liang, Ping; Zang, Xuan; Zhang, Zhiqi; Pan, Liyan; Cai, Huimin; Ling, Rujuan; Li, Shuhua; Cui, Yongwang; Tang, Shusheng; Ye, Hong; Huang, Xiaoyan; He, Waner; Liang, Wenqing; Zhang, Qing; Jiang, Jianmin; Yu, Wei; Gao, Jianqun; Ou, Wanxing; Deng, Yingmin; Hou, Qiaozhen; Wang, Bei; Yao, Cuichan; Liang, Yan; Zhang, Shu; Duan, Yaou; Zhang, Runze; Gibson, Sarah; Zhang, Charlotte L.; Li, Oulan; Zhang, Edward D.; Karin, Gabriel; Nguyen, Nathan; Wu, Xiaokang; Wen, Cindy; Xu, Jie; Xu, Wenqin; Wang, Bochu; Wang, Winston; Li, Jing; Pizzato, Bianca; Bao, Caroline; Xiang, Daoman; He, Wanting; He, Suiqin; Zhou, Yugui; Haw, Weldon; Goldbaum, Michael; Tremoulet, Adriana; Hsu, Chun-Nan; Carter, Hannah; Zhu, Long; Zhang, Kang; Xia, Huimin

doi:10.1038/s41591-018-0335-9

Letter
Published: 11 February 2019

Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence

Huiying Liang¹^na1,
Brian Y. Tsui ORCID: orcid.org/0000-0001-8017-5895²^na1,
Hao Ni³^na1,
Carolina C. S. Valentim⁴^na1,
Sally L. Baxter ORCID: orcid.org/0000-0002-5271-7690²^na1,
Guangjian Liu¹^na1,
Wenjia Cai ORCID: orcid.org/0000-0003-2398-1449²,
Daniel S. Kermany^1,2,
Xin Sun¹,
Jiancong Chen²,
Liya He¹,
Jie Zhu¹,
Pin Tian²,
Hua Shao²,
Lianghong Zheng^5,6,
Rui Hou^5,6,
Sierra Hewett^1,2,
Gen Li^1,2,
Ping Liang³,
Xuan Zang³,
Zhiqi Zhang³,
Liyan Pan¹,
Huimin Cai^5,6,
Rujuan Ling¹,
Shuhua Li¹,
Yongwang Cui¹,
Shusheng Tang¹,
Hong Ye¹,
Xiaoyan Huang¹,
Waner He¹,
Wenqing Liang¹,
Qing Zhang¹,
Jianmin Jiang¹,
Wei Yu¹,
Jianqun Gao¹,
Wanxing Ou¹,
Yingmin Deng¹,
Qiaozhen Hou¹,
Bei Wang¹,
Cuichan Yao¹,
Yan Liang¹,
Shu Zhang¹,
Yaou Duan²,
Runze Zhang²,
Sarah Gibson²,
Charlotte L. Zhang²,
Oulan Li²,
Edward D. Zhang²,
Gabriel Karin²,
Nathan Nguyen²,
Xiaokang Wu^1,2,
Cindy Wen²,
Jie Xu²,
Wenqin Xu²,
Bochu Wang²,
Winston Wang²,
Jing Li^1,2,
Bianca Pizzato²,
Caroline Bao²,
Daoman Xiang¹,
Wanting He^1,2,
Suiqin He²,
Yugui Zhou^1,2,
Weldon Haw^2,7,
Michael Goldbaum²,
Adriana Tremoulet²,
Chun-Nan Hsu ORCID: orcid.org/0000-0002-5240-4707²,
Hannah Carter²,
Long Zhu³,
Kang Zhang ORCID: orcid.org/0000-0002-4549-1697^1,2,7 &
…
Huimin Xia ORCID: orcid.org/0000-0002-3714-3764¹

Nature Medicine volume 25, pages 433–438 (2019)Cite this article

37k Accesses
341 Citations
757 Altmetric
Metrics details

Subjects

Abstract

Artificial intelligence (AI)-based methods have emerged as powerful tools to transform medical care. Although machine learning classifiers (MLCs) have already demonstrated strong performance in image-based diagnoses, analysis of diverse and massive electronic health record (EHR) data remains challenging. Here, we show that MLCs can query EHRs in a manner similar to the hypothetico-deductive reasoning used by physicians and unearth associations that previous statistical methods have not found. Our model applies an automated natural language processing system using deep learning techniques to extract clinically relevant information from EHRs. In total, 101.6 million data points from 1,362,559 pediatric patient visits presenting to a major referral center were analyzed to train and validate the framework. Our model demonstrates high diagnostic accuracy across multiple organ systems and is comparable to experienced pediatricians in diagnosing common childhood diseases. Our study provides a proof of concept for implementing an AI-based system as a means to aid physicians in tackling large amounts of data, augmenting diagnostic evaluations, and to provide clinical decision support in cases of diagnostic uncertainty or complexity. Although this impact may be most evident in areas where healthcare providers are in relative shortage, the benefits of such an AI system are likely to be universal.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Workflow diagram of our AI pediatric diagnosis framework.**

**Fig. 2: Hierarchy of the diagnostic framework in a large pediatric cohort.**

Causal machine learning for predicting treatment outcomes

Article 19 April 2024

Generative models improve fairness of medical classifiers under distribution shifts

Article Open access 10 April 2024

Segment anything in medical images

Article Open access 22 January 2024

Data availability

We have made available the Jupyter notebook that we used in constructing and validating the hierarchical logistic regression models: https://s3.cn-north-1.amazonaws.com.cn/ped.emr/Data/hierachical_logistic_regression.ipynb. To protect patient confidentiality, we have deposited de-identified aggregated patient data in a secured and patient confidentiality compliant cloud in China in concordance with data security regulations. Data access can be requested by writing to the corresponding authors. All data access requests will be reviewed and (if successful) granted by the Data Access Committee.

References

Hu, J., Perer, A. & Wang, F. Data Driven Analytics for Personalized Healthcare. (Springer Internatonal Publishing, Switzerland, Healthcare Information Management Systems: Cases, Strategies, and Solutions, 2016).
Nezhad, M.Z., Zhu, D.X., Sadati, N., Yang, K. & Levy, P. SUBIC: A supervised bi-clustering approach for precision medicine. 2017 16th Ieee International Conference on Machine Learning and Applications (Icmla). Preprint at https://arxiv.org/pdf/1709.09929.pdf (2017).
Hornberger, J. Electronic health records: a guide for clinicians and administrators. JAMA 301, 110–110 (2009).
Article CAS Google Scholar
Esteva, A. et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 542, 115–118 (2017).
Article CAS Google Scholar
Kermany, D. S. et al. Identifying medical diagnoses and treatable diseases by image-based deep learning. Cell 172, 1122–1131 (2018).
Article CAS Google Scholar
Gulshan, V. et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA 316, 2402–2410 (2016).
Article Google Scholar
Erickson, B. J., Korfiatis, P., Akkus, Z. & Kline, T. L. Machine learning for medical imaging. Radiographics 37, 505–515 (2017).
Article Google Scholar
Wang, F., Zhang, P., Qian, B., Wang, X. & Davidson, I. Clinical risk prediction with multilinear sparse logistic regression. In Proc. 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.145–154 (2014).
Turchin, A. et al. Using regular expressions to abstract blood pressure and treatment intensification information from the text of physician notes. J. Am. Med. Inform. Assoc. 13, 691–695 (2006).
Article Google Scholar
Halevy, A., Norvig, P. & Pereira, F. The unreasonable effectiveness of data. IEEE Intelligent Systems 24, 8–12 (2009).
Article Google Scholar
Banko, M. & Brill, E. Scaling to very very large corpora for natural language disambiguation. In Proc. 39th Annual Meeting Association for Computational Linguistics. 26–33 (Association for Computational Linguistics, Stroudsburg, 2001).
Tsui, B. Y., et al. Creating a scalable deep learning based named entity recognition model for biomedical textual data by repurposing biosample free-text annotations. Preprint at https://www.biorxiv.org/content/biorxiv/early/2018/09/12/414136.full.pdf (2018).
Rajkomar, A. et al. Scalable and accurate deep learning with electronic health records. NPJ Digital Medicine 1, 18 (2018).
Article Google Scholar
Wilkinson, M. D. et al. Comment: the fair guiding principles for scientific data management and stewardship. Sci. Data 3, 160018 (2016).
Article Google Scholar
Liang, Y., Chen, Z., Huang, X. & Zeng, L. Analysis of the disease spectrum of hospitalized children in guangdong province. Chin. Med. J. (Engl) 1, 414–418 (2013).
Google Scholar
WHO. International Statistical Classification of Diseases and Related Health Problems. (World Health Organization, 2004).
English–Chinese Medical Dictionary (英汉医学大词典) (Shanghai Scientific and Technical Publishers (上海科学技术出版社), 2015).
Lindberg, D. A. B., Humphreys, B. L. & Mccray, A. T. The unified medical language system. Methods Inf. Med. 32, 281–291 (1993).
Article CAS Google Scholar
Tweedie, F. J., Singh, S. & Holmes, D. I. Neural network applications in stylometry: the federalist papers. Computers and the Humanities 30, 1–10 (1996).
Article Google Scholar
Luong, M.-T., Pham, H. & Manning, C. D. Effective approaches to attention-based neural machine translation. Preprint at https://arxiv.org/abs/1508.04025 (2015).
Lipton, Z.C., Kale, D.C. & Wetzel, R.C. Phenotyping of clinical time series with LSTM recurrent neural networks. Preprint at https://arxiv.org/pdf/1510.07641.pdf (2015).
Peng, X.B., Andrychowicz, M., Zaremba, W. & Abbeel, P. Sim-to-real transfer of robotic control with dynamics randomization. IEEE International Conference on Robotics and Automation (ICRA) 3803–3810 (2018).
Graves, A. & Schmidhuber, J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks 18, 602–610 (2005).
Article Google Scholar
Yeung, K.Y. & Ruzzo, W.L. Details of the adjusted rand index and clustering algorithms supplement to the paper ‘an empirical study on Principal Component Analysis for clustering gene expression data. Available at http://faculty.washington.edu/kayee/pca/supp.pdf (2011).

Download references

Acknowledgements

This study was funded by the National Key Research and Development Program of China (2017YFC1104600 to H.L.), National Natural Science Foundation of China (81771629 to H.X. and 81700882 to J.X.), Guangzhou Women and Children’s Medical Center, Guangzhou Regenerative Medicine and Health Guangdong Laboratory (Innovation and Startup Talents Program 2018GZR031001 to L.Z. and R.H.).

Author information

These authors contributed equally: Huiying Liang, Brian Tsui, Hao Ni, Carolina C. S. Valentim, Sally L. Baxter, Guangjian Liu.

Authors and Affiliations

Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, China
Huiying Liang, Guangjian Liu, Daniel S. Kermany, Xin Sun, Liya He, Jie Zhu, Sierra Hewett, Gen Li, Liyan Pan, Rujuan Ling, Shuhua Li, Yongwang Cui, Shusheng Tang, Hong Ye, Xiaoyan Huang, Waner He, Wenqing Liang, Qing Zhang, Jianmin Jiang, Wei Yu, Jianqun Gao, Wanxing Ou, Yingmin Deng, Qiaozhen Hou, Bei Wang, Cuichan Yao, Yan Liang, Shu Zhang, Xiaokang Wu, Jing Li, Daoman Xiang, Wanting He, Yugui Zhou, Kang Zhang & Huimin Xia
Institute for Genomic Medicine, Institute of Engineering in Medicine, and Shiley Eye Institute, University of California, San Diego, La Jolla, CA, USA
Brian Y. Tsui, Sally L. Baxter, Wenjia Cai, Daniel S. Kermany, Jiancong Chen, Pin Tian, Hua Shao, Sierra Hewett, Gen Li, Yaou Duan, Runze Zhang, Sarah Gibson, Charlotte L. Zhang, Oulan Li, Edward D. Zhang, Gabriel Karin, Nathan Nguyen, Xiaokang Wu, Cindy Wen, Jie Xu, Wenqin Xu, Bochu Wang, Winston Wang, Jing Li, Bianca Pizzato, Caroline Bao, Wanting He, Suiqin He, Yugui Zhou, Weldon Haw, Michael Goldbaum, Adriana Tremoulet, Chun-Nan Hsu, Hannah Carter & Kang Zhang
Hangzhou YITU Healthcare Technology Co. Ltd, Hangzhou, China
Hao Ni, Ping Liang, Xuan Zang, Zhiqi Zhang & Long Zhu
Department of Thoracic Surgery/Oncology, First Affiliated Hospital of Guangzhou Medical University, China State Key Laboratory and National Clinical Research Center for Respiratory Disease, Guangzhou, China
Carolina C. S. Valentim
Guangzhou Kangrui Co. Ltd, Guangzhou, China
Lianghong Zheng, Rui Hou & Huimin Cai
Guangzhou Regenerative Medicine and Health Guangdong Laboratory, Guangzhou, China
Lianghong Zheng, Rui Hou & Huimin Cai
Veterans Administration Healthcare System, San Diego, CA, USA
Weldon Haw & Kang Zhang

Authors

Huiying Liang
View author publications
You can also search for this author in PubMed Google Scholar
Brian Y. Tsui
View author publications
You can also search for this author in PubMed Google Scholar
Hao Ni
View author publications
You can also search for this author in PubMed Google Scholar
Carolina C. S. Valentim
View author publications
You can also search for this author in PubMed Google Scholar
Sally L. Baxter
View author publications
You can also search for this author in PubMed Google Scholar
Guangjian Liu
View author publications
You can also search for this author in PubMed Google Scholar
Wenjia Cai
View author publications
You can also search for this author in PubMed Google Scholar
Daniel S. Kermany
View author publications
You can also search for this author in PubMed Google Scholar
Xin Sun
View author publications
You can also search for this author in PubMed Google Scholar
Jiancong Chen
View author publications
You can also search for this author in PubMed Google Scholar
Liya He
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Pin Tian
View author publications
You can also search for this author in PubMed Google Scholar
Hua Shao
View author publications
You can also search for this author in PubMed Google Scholar
Lianghong Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Rui Hou
View author publications
You can also search for this author in PubMed Google Scholar
Sierra Hewett
View author publications
You can also search for this author in PubMed Google Scholar
Gen Li
View author publications
You can also search for this author in PubMed Google Scholar
Ping Liang
View author publications
You can also search for this author in PubMed Google Scholar
Xuan Zang
View author publications
You can also search for this author in PubMed Google Scholar
Zhiqi Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Liyan Pan
View author publications
You can also search for this author in PubMed Google Scholar
Huimin Cai
View author publications
You can also search for this author in PubMed Google Scholar
Rujuan Ling
View author publications
You can also search for this author in PubMed Google Scholar
Shuhua Li
View author publications
You can also search for this author in PubMed Google Scholar
Yongwang Cui
View author publications
You can also search for this author in PubMed Google Scholar
Shusheng Tang
View author publications
You can also search for this author in PubMed Google Scholar
Hong Ye
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyan Huang
View author publications
You can also search for this author in PubMed Google Scholar
Waner He
View author publications
You can also search for this author in PubMed Google Scholar
Wenqing Liang
View author publications
You can also search for this author in PubMed Google Scholar
Qing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jianmin Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Wei Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jianqun Gao
View author publications
You can also search for this author in PubMed Google Scholar
Wanxing Ou
View author publications
You can also search for this author in PubMed Google Scholar
Yingmin Deng
View author publications
You can also search for this author in PubMed Google Scholar
Qiaozhen Hou
View author publications
You can also search for this author in PubMed Google Scholar
Bei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Cuichan Yao
View author publications
You can also search for this author in PubMed Google Scholar
Yan Liang
View author publications
You can also search for this author in PubMed Google Scholar
Shu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yaou Duan
View author publications
You can also search for this author in PubMed Google Scholar
Runze Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Gibson
View author publications
You can also search for this author in PubMed Google Scholar
Charlotte L. Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Oulan Li
View author publications
You can also search for this author in PubMed Google Scholar
Edward D. Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Gabriel Karin
View author publications
You can also search for this author in PubMed Google Scholar
Nathan Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaokang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Cindy Wen
View author publications
You can also search for this author in PubMed Google Scholar
Jie Xu
View author publications
You can also search for this author in PubMed Google Scholar
Wenqin Xu
View author publications
You can also search for this author in PubMed Google Scholar
Bochu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Winston Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Li
View author publications
You can also search for this author in PubMed Google Scholar
Bianca Pizzato
View author publications
You can also search for this author in PubMed Google Scholar
Caroline Bao
View author publications
You can also search for this author in PubMed Google Scholar
Daoman Xiang
View author publications
You can also search for this author in PubMed Google Scholar
Wanting He
View author publications
You can also search for this author in PubMed Google Scholar
Suiqin He
View author publications
You can also search for this author in PubMed Google Scholar
Yugui Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Weldon Haw
View author publications
You can also search for this author in PubMed Google Scholar
Michael Goldbaum
View author publications
You can also search for this author in PubMed Google Scholar
Adriana Tremoulet
View author publications
You can also search for this author in PubMed Google Scholar
Chun-Nan Hsu
View author publications
You can also search for this author in PubMed Google Scholar
Hannah Carter
View author publications
You can also search for this author in PubMed Google Scholar
Long Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Kang Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Huimin Xia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.L., B.T., H.N., W.C., S.L.B., G. Liu, D.S.K., X. S., C.C.S.V., P.T., H.S., J.C., L. H., J.Z., L.Z., R.H., S.H., G. Li, P.L., X.Z., Z.Z., L.P., H.C., R.L., S.L., Y.C., S.T., H.Y., X.H., W. He, W.L., Q.Z., J.J., W.Y., J.G., W.O., Y. Deng, Q.H., B. Wang, C.Y., Y.L., S.Z., Y. Duan, R.Z., S.G., C.L.Z., O.L., E.D.Z., G.K., X.W., C.W., N.N., J.X., W.X., B. Wang, W.W., J.L., B.P., C.B., D.X., W. He, S.H., Y.Z., W. Haw, M.G., A.T., C.-N.H., H.C., L.Z., H.X. and K.Z. collected and analyzed the data. X.H. and K.Z. conceived the project. K.Z., S.L.B., B.T., H.L., and H.X. wrote the manuscript. All authors discussed the results and reviewed the manuscript.

Corresponding authors

Correspondence to Kang Zhang or Huimin Xia.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data 1 Unsupervised clustering of NLP extracted textual features from pediatric diseases.

The diagnostic system analyzed the EHRs in the absence of a defined classification system. This grouping structure reflects the detection of trends in clinical features without pre-defined labeling or human input. The clustered blocks are marked with the boxes with grey lines.

Extended Data 2 Design of the natural language processing information extraction model.

Segmented sentences from the raw text of the EHR were embedded using word2vec. The LSTM model then generated the structured records in a query–answer format. This schematic illustrates the process using the free-text ‘lesion in the upper left lobe of patient’s lung’ as an example.

Supplementary Information

Reporting Summary

Supplementary Tables

Supplementary Tables 1–9

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liang, H., Tsui, B.Y., Ni, H. et al. Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence. Nat Med 25, 433–438 (2019). https://doi.org/10.1038/s41591-018-0335-9

Download citation

Received: 18 July 2018
Accepted: 07 December 2018
Published: 11 February 2019
Issue Date: March 2019
DOI: https://doi.org/10.1038/s41591-018-0335-9

This article is cited by

Hospitalization, case fatality, comorbidities, and isolated pathogens of adult inpatients with pneumonia from 2013 to 2022: a real-world study in Guangzhou, China
- Yun Li
- Zhufeng Wang
- Jinping Zheng
BMC Infectious Diseases (2024)
Performance and clinical utility of a new supervised machine-learning pipeline in detecting rare ciliopathy patients based on deep phenotyping from electronic health records and semantic similarity
- Carole Faviez
- Marc Vincent
- Anita Burgun
Orphanet Journal of Rare Diseases (2024)
Predicting which patients with cancer will see a psychiatrist or counsellor from their initial oncology consultation document using natural language processing
- John-Jose Nunez
- Bonnie Leung
- Alan T. Bates
Communications Medicine (2024)
Artificial intelligence in the diagnosis of dental diseases on panoramic radiographs: a preliminary study
- Junhua Zhu
- Zhi Chen
- Yuanna Zheng
BMC Oral Health (2023)
Machine learning-based models to predict one-year mortality among Chinese older patients with coronary artery disease combined with impaired glucose tolerance or diabetes mellitus
- Yan Li
- Lixun Guan
- Shihui Fu
Cardiovascular Diabetology (2023)