A comparison of performance between a deep learning model with residents for localization and classification of intracranial hemorrhage

Angkurawaranon, Salita; Sanorsieng, Nonn; Unsrisong, Kittisak; Inkeaw, Papangkorn; Sripan, Patumrat; Khumrin, Piyapong; Angkurawaranon, Chaisiri; Vaniyapong, Tanat; Chitapanarux, Imjai

doi:10.1038/s41598-023-37114-z

Download PDF

Article
Open access
Published: 20 June 2023

A comparison of performance between a deep learning model with residents for localization and classification of intracranial hemorrhage

Salita Angkurawaranon^1,2,
Nonn Sanorsieng¹,
Kittisak Unsrisong¹,
Papangkorn Inkeaw³,
Patumrat Sripan⁴,
Piyapong Khumrin⁵,
Chaisiri Angkurawaranon ORCID: orcid.org/0000-0003-4206-9164^2,5,
Tanat Vaniyapong⁶ &
…
Imjai Chitapanarux¹

Scientific Reports volume 13, Article number: 9975 (2023) Cite this article

1450 Accesses
1 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Intracranial hemorrhage (ICH) from traumatic brain injury (TBI) requires prompt radiological investigation and recognition by physicians. Computed tomography (CT) scanning is the investigation of choice for TBI and has become increasingly utilized under the shortage of trained radiology personnel. It is anticipated that deep learning models will be a promising solution for the generation of timely and accurate radiology reports. Our study examines the diagnostic performance of a deep learning model and compares the performance of that with detection, localization and classification of traumatic ICHs involving radiology, emergency medicine, and neurosurgery residents. Our results demonstrate that the high level of accuracy achieved by the deep learning model, (0.89), outperforms the residents with regard to sensitivity (0.82) but still lacks behind in specificity (0.90). Overall, our study suggests that the deep learning model may serve as a potential screening tool aiding the interpretation of head CT scans among traumatic brain injury patients.

Segment anything in medical images

Article Open access 22 January 2024

Towards a general-purpose foundation model for computational pathology

Article 19 March 2024

AI in health and medicine

Article 20 January 2022

Introduction

Among common neurological problems, traumatic brain injury (TBI) is one of the most prevalent and poses one of the most important burdens on public health¹. A head computed tomography (CT) scan, an effective non-invasive modality, is almost always the first-line investigation of acute TBI, owing to the widespread availability for the procedure and also the short acquisition time. CT scans have the ability to detect intracranial hemorrhage (ICH), mass effect, and associated complications. As a result, patients requiring emergency neurosurgical intervention can be identified rapidly².

Due to the emergency nature of trauma, doctors need to obtain and interpret CT scans as quickly as possible. This is especially important in the case of head injuries, where timely treatment can avoid cognitive and physical disability. Emergency physicians and neurosurgeons must decide whether to plan operative or conservative treatment for the patient. The potential subtypes of ICH that may necessitate surgical intervention include intraparenchymal hemorrhage (IPH), subdural hemorrhage (SDH), and epidural hemorrhage (EDH)³. Rapid trauma response systems, including the availability of CT scans and adequate personnel, are required to prevent the possibly long-lasting effects of secondary brain injury and enhance patient outcomes⁴.

However, the number of trained radiologists or even radiology trainees available to interpret the CT scans has often been limited, resulting in significant delays in analyzing and reporting results^5,6. Thus, in resource-limited settings, treatment planning before the formal radiology report may result in misinterpretation and inappropriate clinical management.^7,8.

A promising solution to tackle this problem is the usage of Artificial Intelligence (AI). Many studies have used deep learning methods to assist in the diagnosis of diseased, oncologic, and traumatic patients^9,10,11,12. An automated ICH detection and classification tool may assist residents or clinicians when medical radiology experts are not immediately available^13,14. Deep learning models have also been implemented to detect ICH and even assess mass effects from ICH in both retrospective and prospective studies^15,16,17. The majority of the studies^{17,18,19,20,21} focused on the evaluation of diagnostic accuracy for the identification of ICH and classification into each ICH subtype using algorithms, but, to date, the accuracy of the detection of ICH into specific intracranial locations has not been well evaluated.

Recently, the authors of this study developed a deep learning model for segmenting SDH, EDH, and IPH²². The model proposed outperformed segmentation performance with a higher dice score when compared to reports in previously published literature²². In this study, we aim to compare the performance of the proposed deep learning segmentation model²² with that of radiology, emergency, and neurosurgery residents. We primarily focus on the detection of IPH, SDH, and EDH, as these subtypes are usually the ones being evaluated in identifying and selecting TBI patients for neurosurgical intervention³.

Material and methods

Development of the deep learning model

The deep learning model we investigated in this study was proposed in our prior published study²². The model is a variation of the DeepMedic²³ model that has the ability to segment SDH, EDH, and IPH on a CT scan. Its architecture consists of four parallel pathways that process the input at different resolutions and two fully connected layers. It obtains a 2-channel voxel extracted from the subdural and bone windows of a brain CT scan as the input. All voxels in a CT scan were processed by the model to generate segmentation results with the value of each pixel as the class label of the hemorrhage type where the pixel was located. After the segmentation results were produced by the deep learning model, the regions of SDH and EDH with their major axis of less than 5 mm were considered as noise and removed from the segmentation result. The final results were drawn on the input CT scan by assigning different colors to hemorrhage types. The model is shared via https://github.com/RadiologyCMU/Hemorrhage-DeepMedic. In this study, the deep learning model was performed without any fine-tuning. We also adopted the same data pre-processing process used in our previous work. In addition, a new dataset was used as the test dataset, which differed from the dataset used to train the model in²². The samples in the test dataset were randomly selected from patients who were not in the training dataset. Patients who received multiple scans, we have chosen only the first scan.

Study cohort

Non-contrast head CT scans of adult patients aged 15 years or older suspected of head injury/trauma as the initial presentation during emergency department (ED) visits at Maharaj Nakorn Chiang Mai Hospital from January 1, 2014, to December 31, 2014, were included. Exclusion criteria included: (1) Follow-up CT studies of patients with known recent TBI; (2) Studies of patients with recent neurosurgical intervention; (3) Severe artifacts degrading study quality such as motion artifacts and metallic artifacts.

Brain CT scans were acquired with CT equipment from either of two manufacturers (Toshiba Aquilion 16 or Siemens SOMATOM Definition). Each slide was stored as a 512 × 512 pixels DICOM image. The typical image resolution of x and y is 0.4473 mm per pixel.

The number of image slices per patient may vary between 80 and 115, depending on factors such as the size of the patient's head, with a fixed separation distance between slices of 1.5 mm. With these criteria, a total of 300 head CT studies from different subjects were finally included, 166 studies were categorized under the intracranial hemorrhage (ICH) group, while 134 studies were classified under the non-ICH group. After thorough review and annotation from consensus of two experienced neuroradiologists out of the 300 head CT studies, 171 were identified as lesions of IPH (27.01%), 356 lesions of SDH (56.24%), and 106 lesions of EDH (16.75%). Detailed data describing locations of lesions are shown in Table 1.

Table 1 Type and distribution of ICHs. Abbreviations: ICH, intracranial hemorrhage; IPH, intraparenchymal hemorrhage; EDH, epidural hemorrhage; SDH, subdural hemorrhage.

Full size table

Classification of ICH by deep learning model

The deep learning model was then used to identify negative and positive studies. In the positive studies, the subtypes of ICH of concern were classified and segmented. The deep learning model segmented the area of ICH with different color maps at specific locations indicating different subtypes and locations of ICH. Correct segmentation means coloring true ICH in the expected locations, as in Fig. 1. The true location of ICH but incorrect color regarding ICH subtypes were considered false interpretations.

Classification of ICH by residents

Four radiology residents (three junior radiology residents and one senior radiology resident) and four non-radiology residents (two senior emergency medicine residents and two senior neurosurgery residents) were recruited for the study. These three areas of specialist were chosen as the residents attached to these areas of expertise were most likely to be those making the initial CT interpretation at the emergency department. Blinded to the original CT results, all eight residents were independently required to interpret all head CT scans solely from the CT series consisting of a 1.5-mm thick slice in the axial plane. The residents could manually adjust the width and level of the window for each scan during interpretation.

A record form was created consisting of multiple-choice checkboxes regarding the location of each ICH subtype. The locations for IPH were: (1) Right cerebral hemisphere; (2) Left cerebral hemisphere; (3) Right cerebellar hemisphere; (4) Left cerebellar hemisphere; (5) Right deep nuclei; (6) Left deep nuclei; (7) Midbrain; (8) Pons. Deep nuclei encompassed either the caudate nucleus, lentiform nucleus, or thalamus. In the case of SDH, the locations could be: (1) Right cerebral convexity; (2) Left cerebral convexity; (3) Falx cerebri; (4) Tentorium cerebelli; (5) Right cerebellar convexity; (6) Left cerebellar convexity. For EDH, the locations included: (1) Right cerebral convexity; (2) Left cerebral convexity; (3) Right cerebellar convexity; (4) Left cerebellar convexity; (5) Vertex. If no ICH locations were identified in IPH, SDH, nor EDH these were classed as negative-ICH. A flowchart methodology of this study is available in the Fig. 2.

Statistical analysis

In most cases, there were more than one ICH subtype and/or multiple lesions of the same subtype. The algorithm or trainee residents would have to correctly identify all ICH subtypes and a correct location was considered as “detected”. If any ICH remained undetected or was mis-identified either by subtype or location, it was counted as “missed”. The imaging of ICH subtypes in certain locations is presented in Fig. 3. We evaluated the performance of the algorithm and trainees using statistical metrics, including accuracy, sensitivity, and specificity, using the Python package scikit-learn. A significant difference was considered when p < 0.05.

Informed consent and ethical approval

This study was approved by the Research Ethics Committee of the Faculty of Medicine, Chiang Mai University (No.423/2021). Informed consent was obtained from all participants. All methods were performed in accordance with the relevant guidelines and regulations.

Results

We evaluated the performance of the deep learning model and compared it to that of the eight training residents in the classification and localization of ICHs based on the individual locations occupied by specific types of ICH. The accuracy, sensitivity, and specificity of the algorithm and residents are displayed in Table 2. In terms of ICH detection and localization, the model achieved an accuracy of 0.89 with a sensitivity, and specificity of 0.82 and 0.90, respectively. Overall, four radiology residents achieved accuracy, sensitivity, and specificity of 0.96 ± 0.00, 0.74 ± 0.04, and 0.99 ± 0.01, whereas four non-radiology residents scored values of 0.94 ± 0.01, 0.61 ± 0.08, and 0.99 ± 0.00, respectively.

Table 2 Location-level performance of the algorithm, three junior radiology residents, a senior radiologist, two emergency residents, and two neurosurgery residents presented as accuracy, sensitivity, and specificity. Abbreviations: ICH, intracranial hemorrhage; IPH, intraparenchymal hemorrhage; EDH, epidural hemorrhage; SDH, subdural hemorrhage.

Full size table

Regarding the detection and localization of each ICH subtype, the deep learning model was the most sensitive in detecting SDH (sensitivity = 0.85). The sensitivities for detecting IPH and EDH were 0.83 and 0.72, respectively. The radiology residents performed similarly well to the model in the sensitivity of each ICH subtype. The sensitivity for IPH was 0.80, 0.71 for EDH, and 0.73 for SDH. The neurosurgery and emergency medicine residents had lower performance scores in detecting each ICH subtype, with a sensitivity of 0.71 for IPH detection, 0.63 for EDH, and 0.54 for SDH. The model had an overall higher sensitivity for ICH detection than the average performance of training residents across all ICH subtypes (p < 0.05).

In several cases, the deep learning model detected subtle SDH when this could not be detected by any of the residents. Most of these cases were studies containing thin SDHs either along the tentorium cerebelli or the cerebral convexities. Some of these cases had various hemorrhagic subtypes resulting in small hemorrhages being overlooked. Three subtle EDHs were missed by radiology residents, and five subtle EDHs were missed by non-radiology residents. However, all of these EDHs were picked up by the algorithm. These cases are illustrated in Figs. 4 and 5. Only two cases of small EDHs were missed by the deep learning but were able to be detected by all residents (Fig. 6).

The specificities for ICH detection and subtypes of ICH by the deep learning and residents were relatively high. The overall specificity for ICH detection by the algorithm was 0.90, while the specificities for ICH detection by radiology and non-radiology residents were both 0.99. Specificity values for each subtype of ICH varied between 0.82 and 0.90 in the case of the deep learning, while the specificity values varied between 0.97 and 0.99 for the residents for each ICH subtype (Table 2). There were many cases in which the deep learning model overdiagnosed, usually involving basal ganglia calcification, beam hardening artifacts, dense cortical veins, and dural venous sinuses being interpreted as hemorrhage. Some of these studies are presented in Fig. 7.

Discussion

Our results demonstrate non-inferior diagnostic accuracies in ICH detection of the deep learning model compared to residents (p > 0.05). In terms of sensitivity, the model yielded noticeably higher overall sensitivity values for ICH detection across nearly all subtypes compared to the residents (p < 0.05). Nonetheless, specificity of the model still falls behind that of the residents.

In our study, we determined the accuracy of the algorithm by evaluating the frequency of correct and incorrect ICH detection from each location. The deep learning model achieved high accuracy of overall ICH detection, and IPH, EDH, and SDH detection (0.89, 0.93, 0.91, and 0.82). The reported diagnostic performance of deep learning models in prior literature were variable, with accuracy values ranging from 0.70 to 0.94¹³. Similar studies based on convolutional neural networks yielded approximate accuracy values from 0.81 to 0.90^20,24. The variations in accuracy are likely due to the classification methods among different models and ways of measurement for ICH segmentation which might not allow direct comparison between studies.

Few studies have compared the diagnostic performance between algorithms and trainee residents. Ye et al. demonstrated superior performance of the deep learning neural network. They concluded that their algorithm was fast and accurate, indicating its potential role in assisting junior radiology residents in reducing misinterpretation of head CT scans¹⁵. However, they primarily focused on ICH detection and classification, while our study also stresses the importance of identifying both the subtype and correct location.

Although our deep learning model demonstrated high specificity, it still lags behind that of the residents. From a review of the lesion segmentation by the algorithm, many non-ICH findings had been misinterpreted as ICH. This was possibly because identification of ICH required the region be of higher attenuation or higher Hounsfield unit (HU) than surrounding normal brain parenchyma. Other findings with high HU, such as basal ganglia calcification, beam hardening artifacts, dense cortical veins, and dural venous sinuses, were misclassified as hemorrhage. This is concordant with prior results in that common AI overcalls are calcification and beam-hardening artifacts^25,26. While radiologists and clinicians gain experience in identifying actual ICHs, AI analysis software must also be trained to recognize these ICH mimickers.

In terms of sensitivity, the deep learning model yielded noticeably higher overall sensitivity values for ICH detection across nearly all subtypes compared to the residents (p < 0.05). Moreover, the deep learning model could detect subtle hemorrhages missed by residents, such as thin SDH and small EDH. Waite et al. suggest that in their study of interpretative errors in radiology perceptual errors account for 60%-80% of radiological errors²⁷. Since perception and detection are the initial phases in image interpretation, an error in this phase can abruptly terminate the diagnostic process and result in a mistaken (false-negative) diagnosis. Perceptual errors in radiology have been linked to a variety of causes, including fatigue of the interpreter and the increased pace of interpretations²⁸. The error rate could also change depending on the time of day the interpretation was given, and long and overnight shifts are associated with increased rates of inaccuracy, according to previous studies^29,30. More crucially, in approximately 1% of cases, the diagnostic error results in incorrect or inadequate patient management^31,32. This reinforces the vital role of a deep learning model as a potential screening tool in emergency cases requiring rapid ICH detection or as an assistant for trainee residents in generating emergency CT reports. Similar work has been done using an active automated tool to detect acute intracranial conditions, including hemorrhage, reprioritizing study, and notifying radiologists if ICH was identified as present. Thus, these resulted in significant reductions in diagnostic waiting time with high sensitivity with regard to detection^33,34.

Different sensitive and specificity tradeoff is likely to be different depending on their role in the care pathway. If primary role of the operator, whether it be the radiologist, emergency physician or AI, is to screen/inform or refer for appropriate interventions, due to the serious and urgent nature of TBI and importance of timely surgical intervention for some subtypes of TBI, it may be better to set a high sensitivity and allow more false positives (lower positive predictive value). As a reference, the American College of Surgeons Committee on Trauma has set the national benchmark for field triage at ≥ 95% for sensitivity and ≥ 50% specificity³⁵. However, if the primary role of the operator, whether it be the radiologist, neurosurgeon or AI, is to decide the definitive treatment on whether or not to perform neurosurgery, it would be better to set a high specificity.

Following this argument, outcomes of this study suggest that the deep learning model may be a useful screening tool for the detection and localization of ICH from CT scans in the cases of traumatic brain injury in the majority of emergency departments (ED), in particular where there may not be 24-hour coverage of trained radiologists. However, confirmation by trained personnel is required before definitive surgical treatment plans can be made. This is particularly the case in low- and middle-income settings when emergency physicians and neurosurgeons frequently evaluate emergency computed tomography (CT) scans without support from trained radiologists.

There are three main limitations to our study. First, the performance evaluation is done by scoring the detected ICH according to crude locations, which sometimes represent a wide and less specific area within the cranium. For example, hematoma in the cerebral hemisphere could be in either the frontal, parietal, temporal, or occipital lobes. Multiple separated hematomas may be present in different locations within the same hemisphere. If only one hematoma is segmented or identified in the setting of multiple discrete hematomas confined within the recording area (e.g., a hemisphere), this will be erroneously considered as all hematomas being detected when the remaining hematomas have been missed. However, unlike a previous study¹⁵, ours is one of the few that has attempted to match the type of ICH and their precise locations with performance comparisons to training residents rather than simply detecting ICH subtype alone. The second limitation is the sample size; with only 300 cases of head CT studies, the study may not have the same level of statistical power in defining the exact diagnostic performance of the deep learning model in comparison to some other studies in the literature^16,17,36. In addition, all the subjects were limited to 15 or more years of age. In future studies, it would be useful if the data set were expanded to include all age groups. Lastly, we did not validate the deep learning model with the external dataset (e.g. from another hospital or geographical area). Samples in the test dataset were collected from the same hospital and CT machines used for the training dataset. This limitation needs to be addressed in future validation.

Conclusion

In conclusion, our study is one of the first to validate the efficacy of the role of the deep learning model for ICH detection and localization by comparing the level of diagnostic accuracy with radiology, emergency medicine, and neurosurgery residents. Based on the results, our study highlighted the potential use of AI as a useful intracranial hemorrhage screening tool in traumatic brain injury patients. However, its slightly lower specificity and tendency to misinterpret some benign lesions with high attenuation into hemorrhage remain an issue to be addressed. Further model training with a larger data set and a larger sample size is expected to improve the overall capability of our deep learning model in a real clinical setting.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Maas, A. I. R. et al. Traumatic brain injury: Progress and challenges in prevention, clinical care, and research. Lancet Neurol. 21(11), 1004–1060 (2022).
Article PubMed Google Scholar
Fink, K. R. Imaging of head trauma. Semin. Roentgenol. 51(3), 143–151 (2016).
Article PubMed Google Scholar
Kvint, S., Gutierrez, A., Blue, R. & Petrov, D. Surgical management of trauma-related intracranial hemorrhage-a review. Curr. Neurol. Neurosci Rep. 20(12), 63 (2020).
Article PubMed Google Scholar
Naidoo, D. Traumatic brain injury: The South African landscape. South Afr. Med. Journal 103(9), 613–614 (2013).
Article Google Scholar
Boland, G.W., Guimaraes, A.S. & Mueller, P.R. The radiologist's conundrum: Benefits and costs of increasing CT capacity and utilization. European radiology 19(1):9–11 (2009); discussion 2.
Waganekar, A., Sadasivan, J., Prabhu, A. S. & Harichandrakumar, K. T. Computed tomography profile and its utilization in head injury patients in emergency department: A prospective observational study. J. Emerg. Trauma Shock 11(1), 25–30 (2018).
Article PubMed PubMed Central Google Scholar
Alfaro, D., Levitt, M. A., English, D. K., Williams, V. & Eisenberg, R. Accuracy of interpretation of cranial computed tomography scans in an emergency medicine residency program. Ann. Emerg. Med. 25(2), 169–174 (1995).
Article CAS PubMed Google Scholar
Al-Reesi, A., Stiell, I. G., Al-Zadjali, N. & Cwinn, A. A. Comparison of CT head interpretation between emergency physicians and neuroradiologists. Eur. J. Emerg. Med. 17(5), 280–282 (2010).
Article PubMed Google Scholar
Liu, X. et al. A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: A systematic review and meta-analysis. The Lancet Digit. Health 1(6), e271–e297 (2019).
Article PubMed Google Scholar
Nir, G. et al. Comparison of artificial intelligence techniques to evaluate performance of a classifier for automatic grading of prostate cancer from digitized histopathologic images. JAMA Netw. Open 2(3), e190442 (2019).
Article PubMed PubMed Central Google Scholar
Wong, R. K., Tandan, V., De Silva, S. & Figueredo, A. Pre-operative radiotherapy and curative surgery for the management of localized rectal carcinoma. Cochrane Database Syst. Rev. https://doi.org/10.1002/14651858.CD002102.pub2 (2007).
Article PubMed PubMed Central Google Scholar
Ting, D. S. W. et al. Development and validation of a deep learning system for diabetic retinopathy and related eye diseases using retinal images from multiethnic populations with diabetes. JAMA 318(22), 2211–2223 (2017).
Article PubMed PubMed Central Google Scholar
Hssayeni, M. D. et al. Intracranial hemorrhage segmentation using a deep convolutional model. Data 5(1), 14 (2020).
Article Google Scholar
Xiao, F., Liao, C. C., Huang, K. C., Chiang, I. J. & Wong, J. M. Automated assessment of midline shift in head injury patients. Clin. Neurol. Neurosurg. 112(9), 785–790 (2010).
Article PubMed Google Scholar
Ye, H. et al. Precise diagnosis of intracranial hemorrhage and subtypes using a three-dimensional joint convolutional and recurrent neural network. Eur. Radiol. 29(11), 6191–6201 (2019).
Article PubMed PubMed Central Google Scholar
Arbabshirani, M. R. et al. Advanced machine learning in action: Identification of intracranial hemorrhage on computed tomography scans of the head with clinical workflow integration. npj Digit. Med. 1(1), 9 (2018).
Article PubMed PubMed Central Google Scholar
Majumdar, A., Brattain, L., Telfer, B., Farris, C., & Scalera, J. (eds) Detecting intracranial hemorrhage with deep learning. In 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); 2018 18–21 July 2018.
Phaphuangwittayakul, A. et al. An optimal deep learning framework for multi-type hemorrhagic lesions detection and quantification in head CT images for traumatic brain injury. Appl. Intell. 52(7), 7320–7338 (2022).
Article Google Scholar
Lee, H. et al. An explainable deep-learning algorithm for the detection of acute intracranial haemorrhage from small datasets. Nat. Biomed. Eng. 3(3), 173–182 (2019).
Article PubMed Google Scholar
Cho, J. et al. Improving sensitivity on identification and delineation of intracranial hemorrhage lesion using cascaded deep learning models. J. Digit. Imaging 32(3), 450–461 (2019).
Article PubMed PubMed Central Google Scholar
Chang, P. D. et al. Hybrid 3D/2D convolutional neural network for hemorrhage evaluation on head CT. AJNR Am. J. Neuroradiol. 39(9), 1609–1616 (2018).
Article CAS PubMed PubMed Central Google Scholar
Inkeaw, P. et al. Automatic hemorrhage segmentation on head CT scan for traumatic brain injury using 3D deep learning model. Comput. Biol. Med. 146, 105530 (2022).
Article PubMed Google Scholar
Kamnitsas, K. et al. Efficient multi-scale 3D CNN with fully connected CRF for accurate brain lesion segmentation. Med. Image Anal. 36, 61–78 (2017).
Article PubMed Google Scholar
Lee, J. Y., Kim, J. S., Kim, T. Y. & Kim, Y. S. Detection and classification of intracranial haemorrhage on CT images using a novel deep-learning algorithm. Sci. Rep. 10(1), 20546 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kundisch, A. et al. Deep learning algorithm in detecting intracranial hemorrhages on emergency computed tomographies. PLoS ONE 16(11), e0260560 (2021).
Article CAS PubMed PubMed Central Google Scholar
Rao, B. et al. Utility of artificial intelligence tool as a prospective radiology peer reviewer—Detection of unreported intracranial hemorrhage. Acad. Radiol. 28(1), 85–93 (2021).
Article PubMed Google Scholar
Waite, S. et al. Interpretive error in radiology. Am. J. Roentgenol. 208(4), 739–749 (2017).
Article Google Scholar
Bruno, M. A., Walker, E. A. & Abujudeh, H. H. Understanding and confronting our mistakes: The epidemiology of error in radiology and strategies for error reduction. Radiographics 35(6), 1668–1676 (2015).
Article PubMed Google Scholar
Hanna, T. N., Lamoureux, C., Krupinski, E. A., Weber, S. & Johnson, J. O. Effect of shift, schedule, and volume on interpretive accuracy: A retrospective analysis of 2.9 million radiologic examinations. Radiology 287(1), 205–212 (2018).
Article PubMed Google Scholar
Ruutiainen, A. T., Durand, D. J., Scanlon, M. H. & Itri, J. N. Increased error rates in preliminary reports issued by radiology residents working more than 10 consecutive hours overnight. Acad. Radiol. 20(3), 305–311 (2013).
Article PubMed Google Scholar
Miyakoshi, A., Nguyen, Q. T., Cohen, W. A., Talner, L. B. & Anzai, Y. Accuracy of preliminary interpretation of neurologic CT examinations by on-call radiology residents and assessment of patient outcomes at a level I trauma center. J. Am. Coll. Radiol. 6(12), 864–870 (2009).
Article PubMed Google Scholar
Briggs, G. M., Flynn, P. A., Worthington, M., Rennie, I. & McKinstry, C. S. The role of specialist neuroradiology second opinion reporting: Is there added value?. Clin. Radiol. 63(7), 791–795 (2008).
Article CAS PubMed Google Scholar
O’Neill, T. J. et al. Active reprioritization of the reading worklist using artificial intelligence has a beneficial effect on the turnaround time for interpretation of head CT with intracranial hemorrhage. Radiol. Artif. Intell. 3(2), e200024 (2021).
Article PubMed Google Scholar
Prevedello, L. M. et al. Automated critical test findings identification and online notification system using artificial intelligence in imaging. Radiology 285(3), 923–931 (2017).
Article PubMed Google Scholar
Newgard, C. D. et al. The trade-offs in field trauma triage: A multiregion assessment of accuracy metrics and volume shifts associated with different triage strategies. J. Trauma Acute Care Surg. 74(5), 1298–1306 (2013).
PubMed PubMed Central Google Scholar
Jnawali, K., Arbabshirani, M.R., Rao, N. & Patel, A.A. (eds) Deep 3D Convolution Neural Network for CT Brain Hemorrhage Classification. Medical Imaging 2018: Computer-Aided Diagnosis; 2018 February 01, (2018).

Download references

Acknowledgements

This research was supported by the Program Management Unit for Human Resources and Institutional Development, Research, and Innovation, NXPO [grant number B04G640072]. This work was partially supported by Chiang Mai University [grant number 8/2565].

Author information

Authors and Affiliations

Department of Radiology, Maharaj Nakorn Chiang Mai Hospital, Faculty of Medicine, Chiang Mai University, Chiang Mai, 50200, Thailand
Salita Angkurawaranon, Nonn Sanorsieng, Kittisak Unsrisong & Imjai Chitapanarux
Global Health and Chronic Conditions Research Group, Chiang Mai, 50200, Thailand
Salita Angkurawaranon & Chaisiri Angkurawaranon
Department of Computer Science, Faculty of Science, Chiang Mai University, Chiang Mai, 50200, Thailand
Papangkorn Inkeaw
Research Institute for Health Sciences, Chiang Mai University, Chiang Mai, 50200, Thailand
Patumrat Sripan
Department of Family Medicine, Faculty of Medicine, Chiang Mai University, Chiang Mai, 50200, Thailand
Piyapong Khumrin & Chaisiri Angkurawaranon
Neurosurgery Division, Department of Surgery, Faculty of Medicine, Chiang Mai University, Chiang Mai, 50200, Thailand
Tanat Vaniyapong

Authors

Salita Angkurawaranon
View author publications
You can also search for this author in PubMed Google Scholar
Nonn Sanorsieng
View author publications
You can also search for this author in PubMed Google Scholar
Kittisak Unsrisong
View author publications
You can also search for this author in PubMed Google Scholar
Papangkorn Inkeaw
View author publications
You can also search for this author in PubMed Google Scholar
Patumrat Sripan
View author publications
You can also search for this author in PubMed Google Scholar
Piyapong Khumrin
View author publications
You can also search for this author in PubMed Google Scholar
Chaisiri Angkurawaranon
View author publications
You can also search for this author in PubMed Google Scholar
Tanat Vaniyapong
View author publications
You can also search for this author in PubMed Google Scholar
Imjai Chitapanarux
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.A. contributed to Conceptualization, Methodology, Investigation, Resources, Data Curation, Writing—Original Draft, Visualization. N.S. contributed to Resources, Writing—Review & Editing. K.U. contributed to Resources, Writing—Review & Editing. P.I. contributed to Conceptualization, Software, Formal Analysis, Writing—Original Draft, Visualization. P.K. contributed to Software, Resources, Writing—Review & Editing. P.S. contributed to formal analysis, Writing—Review & Editing. C.A. contributed to Methodology, Formal Analysis, Writing—Review & Editing. T.V. contributed to investigation, writing- review&editing. I.C. contributed to Conceptualization, Writing—Review & Editing, Project Administration, Funding Acquisition.

Corresponding author

Correspondence to Imjai Chitapanarux.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Angkurawaranon, S., Sanorsieng, N., Unsrisong, K. et al. A comparison of performance between a deep learning model with residents for localization and classification of intracranial hemorrhage. Sci Rep 13, 9975 (2023). https://doi.org/10.1038/s41598-023-37114-z

Download citation

Received: 07 February 2023
Accepted: 15 June 2023
Published: 20 June 2023
DOI: https://doi.org/10.1038/s41598-023-37114-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.