Introduction

There has been growing interest in applying machine learning to electrocardiograms (ECGs). For example, variations of wavelet analysis and local binary patterns have been used to extract features from ECGs, and support vector machines (SVM), k-nearest neighbours (kNN), and state-of-the-art deep neural networks have been explored for arrhythmia diagnosis1,2,3,4,5. Convolutional neural networks (CNN) have also been used to predict the likelihood of paroxysmal atrial fibrillation (AF) from sinus rhythm ECGs6,7,8, to screen for left ventricular systolic dysfunction and identify incident heart failure9,10,11,12,13,14, to screen for hypertrophic cardiomyopathy15,16,17, and to enable early diagnosis of valvular diseases such as aortic stenosis and mitral regurgitation18,19,20. The application of machine learning requires large volumes of ECGs in an electronic format; in clinical practice, however, ECGs are often printed on paper and not available in a digitised format. Accessing and utilising large volumes of paper ECGs that have not been saved electronically can be particularly challenging. Although data repositories containing ECG data are increasingly available, the accessibility of ECGs for machine learning applications would be greatly increased by an automated digitisation tool that can rapidly convert large volumes of historical paper-based ECGs into digital signals.

A number of attempts have been made to develop 12-lead ECG digitisation tools21,22,23. For example, ECGscan21 was the first such application to be commercialised, but it requires significant user input to identify the regions of the ECG that require digitisation. Similarly, other digitisation tools22,23,24 require manual input to ensure that the ECG leads are correctly identified by the end-user. Others have developed digitisation tools that work directly on segmented single-lead ECG images25,26,27. There have been efforts to develop automated digitisation tools that require no manual input, but those algorithms can only digitise ECGs with leads printed in a specific configuration28. Another approach applied a pre-set binary mask to obtain the region of interest, though its generalisation is limited to a single, specific layout of ECG signals29. In addition, ECG digitisation tools have been developed for the diagnosis and monitoring of cardiac disease30. However, no single method is applicable to all paper ECG configurations without manual intervention. Moreover, some existing methods are validated using ECG parameters such as the PR, QRS, RR, and QT intervals, or heart rate25,26,27, rather than by direct comparison with the original digital version of the ECG. There is therefore an unmet need for a user-friendly, accurate, generalisable, and fully-automated ECG digitisation tool that can be applied to paper ECGs with different configurations.

To address these limitations, we sought to develop an open-access, fully-automated algorithm that can digitise 12-lead ECGs printed in any standard configuration and requires no user input. We incorporate this functionality into a user-friendly interface, and we envisage that our tool will enable large numbers of ECGs to be readily digitised for machine learning purposes.

Methods

Figure 1 outlines our automated ECG digitisation algorithm; the corresponding pseudocode is given in Algorithms 1–7 in the Supplementary information. The paper ECG image was first pre-processed to remove any redacted regions and gridlines, and then transformed into a binary image, which enabled the ECG baselines to be detected. Once the ECG baselines were detected, vertical anchor points were used to locate the upper and lower boundary of each ECG lead signal. This step also allowed the algorithm to determine the layout of the ECG leads (i.e., the number of rows) on the printed ECG. Next, using lead name detection, the horizontal anchor points of each lead (the left- and right-hand boundaries of the ECG signals, signifying their start and end, respectively) were used to crop and extract the signals in each lead of the 12-lead ECG. Finally, the signals in each lead were digitised individually. We have developed an open-access online tool that allows users to upload scanned ECGs and extract the digital signals (http://ecg-digitisation.hh.med.ic.ac.uk:8050/); running speed details of the website are given in the Supplementary information. Each of these steps is described in greater detail below.

Data source for development

Our online ECG digitisation tool was developed using 12-lead ECGs recorded in patients presenting to Imperial College London NHS Trust. These ECGs were originally printed on paper and were provided to the research team as anonymised scanned versions in Portable Document Format (PDF), and subsequently reformatted into 250 dpi Portable Network Graphics (PNG) files. These ECGs were typically in the conventional 3 \(\times\) 4 lead configuration with a lead II rhythm strip. This database contained only paper ECGs, without digital ECG ground truth data.

For validation, we used anonymised 12-lead ECGs from Beth Israel Deaconess Medical Centre (BIDMC), Boston MA, USA, as PNG files in 3 \(\times\) 4, 12 \(\times\) 1 and 3 \(\times\) 1 lead configurations to validate our digitisation tool. This second database contained both ECG images and digital ECG ground truth data. All ECGs used in the development and testing of our digitisation tool were calibrated to 1 mV = 10 mm and recorded at a paper speed of 25 mm/s.

Both Imperial College and BIDMC provided ethical review for this project. All methods were carried out in accordance with relevant guidelines and regulations. Ethical approval for the collection of data used in this study was granted by the Health Research Authority London Research Ethics Committee (Hampstead) (protocol number 20HH5967, REC reference 20/HRA/2467, sponsor Imperial College London). Informed consent was obtained from all subjects and/or their legal guardian(s). This study conforms to the Declaration of Helsinki.

Step I: Determining ECG baseline and lead configuration

Pre-processing

In the database used for development, all ECGs contained a header of black pixels where patient information had been redacted, which could adversely affect digitisation of the ECG traces. For this reason, the redacted area of each ECG was automatically removed before digitisation. Because the redacted region was entirely black, the average pixel intensity of each of its rows was zero, whereas rows in the regions of interest had a positive average intensity. This enabled the redacted region to be reliably identified and removed prior to the digitisation of the ECG signals.
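The row-intensity criterion can be sketched as follows; `remove_redacted_rows` is an illustrative helper rather than the published implementation, and assumes a greyscale array in which redacted rows are entirely zero:

```python
import numpy as np

def remove_redacted_rows(img):
    """Drop fully black rows (mean intensity 0), i.e. the redaction bar.

    `img` is a 2-D greyscale array; rows belonging to the redacted header
    are entirely black, so their row means are exactly zero, while rows
    containing ECG content have a positive mean intensity.
    """
    row_means = img.mean(axis=1)
    return img[row_means > 0]
```

In practice the redacted bar occupies the top of the image, so this per-row test removes it without touching the ECG traces below.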

Figure 1
figure 1

Overview of the automated ECG digitisation algorithm: Step I: The 12-lead ECG image is pre-processed to remove redacted portions of the ECG and the ECG grid. The ECG baselines are then determined to obtain the ECG configuration, aided by vertical anchor points. Step II: After determining horizontal and vertical anchor points and lead configuration, the 12-lead signals are cropped. Step III: ECG signal extraction from the single lead ECG images. Step IV: User interface design using dashboard tool.

ECGs are routinely printed on paper containing gridlines, which were removed prior to the digitisation process. Given that the gridlines contained red pixels, the red channel of the image was set to 1 and the image was transformed to grey-scale. A threshold of 0.94 was then used to differentiate pixels belonging to the ECG signal from those belonging to gridlines: pixels with intensity \(>0.94\) were discarded, while those \(\le 0.94\) were taken as indicative of an ECG signal or lead name. In this way, the ECG traces and the lead name information were retained in the binary image while the background and gridlines were eliminated. The processed binary image is shown in Fig. 2A,B.
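A minimal sketch of this pre-processing step is shown below. The exact grey-scale conversion is not specified in the text, so standard ITU-R BT.601 luminosity weights are assumed here, and `to_binary_trace` is a hypothetical name:

```python
import numpy as np

def to_binary_trace(rgb):
    """Suppress red gridlines, then threshold at 0.94 to keep only trace ink.

    `rgb` is an H x W x 3 float array in [0, 1].  Setting the red channel
    to 1 pushes the light red gridlines towards white in the grey-scale
    image, so the 0.94 threshold discards them along with the background.
    """
    img = rgb.copy()
    img[..., 0] = 1.0  # saturate the red channel: gridlines become near-white
    grey = 0.299 * img[..., 0] + 0.587 * img[..., 1] + 0.114 * img[..., 2]
    return grey <= 0.94  # True where ECG trace or lead-name pixels remain
```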

ECG baseline detection and ECG configuration determination

After pre-processing, the first stage of the automated digitisation process required the algorithm to detect the signal baselines and the number of rows of ECG signals, from which the ECG configuration was determined. We considered ECG baselines to be the horizontal lines with the highest intensities of ECG signal pixels along the horizontal axis.

The Hough transform31 is a coordinate transformation that converts images from Cartesian to polar coordinates, and is widely used for feature extraction in computer vision. Here, we applied the Hough transform to identify the ECG baselines. To constrain the number of plausible solutions, two constraints were imposed to avoid inaccurate identification of the baseline. First, given that the ECG baseline is expected to be near horizontal, only lines between \(-\)2.5\(^{\circ }\) and +2.5\(^{\circ }\) around the x-axis were considered. Second, given that the baseline is expected to extend almost across the entire image, any lines shorter than 80% of the width of the printed ECG were discarded. In instances where there were spaces between ECG lead waveforms, lines were merged if the inter-lead space was no greater than 15% of the total width of the image. This ensured that the ECG signals of adjacent leads remained independent and were not combined in the digitisation process. This method also determined the number of baselines on the printed ECG and, in conjunction with the vertical anchor point detection described below, provided information on the lead configuration.
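As a simplified stand-in for the Hough-based detection, the 80%-width constraint and the merging of adjacent candidate lines can be illustrated with a row-projection sketch; the \(\pm\)2.5\(^{\circ }\) angle tolerance and the 15% gap rule of the full method are not reproduced here, and `detect_baselines` is an illustrative name:

```python
import numpy as np

def detect_baselines(binary, min_frac=0.8):
    """Simplified baseline detector on a binary (True = signal pixel) image.

    A row qualifies as a baseline candidate when its signal pixels cover at
    least `min_frac` of the image width (the 80% length constraint); runs of
    adjacent qualifying rows are then merged into a single baseline each.
    """
    h, w = binary.shape
    rows = np.flatnonzero(binary.sum(axis=1) >= min_frac * w)
    baselines, group = [], []
    for r in rows:
        if group and r - group[-1] > 2:  # gap between runs: close the group
            baselines.append(int(np.mean(group)))
            group = []
        group.append(r)
    if group:
        baselines.append(int(np.mean(group)))
    return baselines
```

The number of baselines returned corresponds to the number of lead rows, which is what the algorithm uses to infer the printed configuration.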

Figure 2
figure 2

Cropping of individual ECG signal images for each lead: (A) The original 12-lead ECG scan with patient-identifiable information redacted; (B) Baseline detection is used to determine the vertical distance between leads; (C) Lead name detection is used to determine the horizontal distance between leads; (D) Cropping to obtain each lead’s ECG signal. The width of the crop is the distance from the end point of the lead name to the starting point of the adjacent lead name, while the height of the crop is 1.4 times the vertical distance between leads, with the detected baseline in the middle.

Step II: Automated anchor point detection

Vertical anchor point detection

Just as baseline detection located the ECG signals vertically within the image, vertical anchor points were used to determine the upper and lower boundaries of the signals in each ECG lead, thereby identifying the signals to be digitised. The vertical cropping length is presented in Fig. 2B. The upper and lower boundaries were defined as 0.7 times the vertical distance between two neighbouring ECG baselines, above and below each ECG baseline, respectively.
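Under the assumption of evenly spaced lead rows, the 0.7-factor cropping rule can be expressed as a small helper (`vertical_crop_bounds` is illustrative, not the tool’s code); note that the resulting crop height is 1.4 times the inter-baseline spacing, consistent with the Fig. 2 caption:

```python
def vertical_crop_bounds(baselines, factor=0.7):
    """Upper/lower crop boundary (in pixel rows) for each lead row.

    Boundaries sit `factor` times the inter-baseline spacing above and
    below each baseline, so the total crop height is 2 * factor = 1.4x
    the spacing.  Evenly spaced lead rows are assumed for simplicity.
    """
    spacing = baselines[1] - baselines[0]
    half = factor * spacing
    return [(b - half, b + half) for b in baselines]
```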

Horizontal anchor point detection

Horizontal anchor points were used to determine the left- and right-hand boundaries of the ECG signals to be digitised, that signified their start and end, respectively. The lead name and the start of the subsequent ECG signal in the horizontal plane constituted the start and end of the ECG signal to be digitised. The maximum horizontal distance encompassing the ECG signal in other leads in the same ECG was used to define the right-hand boundary for leads on the far right of the image that had no right-hand boundary.

Our text recognition model was unable to detect lead names when these were in close proximity to the ECG baseline. In these instances, the ECG baselines were removed to enable the digitisation tool to identify the lead names. Additionally, morphological dilation and erosion were applied to the image to make the lead names more distinguishable from the surrounding signals, enabling the text recognition model to identify them more easily. Dilation is an iterative region-growing operation that thickens lines, and erosion is an iterative region-shrinking operation that thins them, making objects of interest more readily identifiable by automated processes. All objects of interest in the image were then filtered to exclude those with a width-height ratio \(>5\) and those with a width or height \(<5\) pixels or \(>500\) pixels.
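The size and aspect-ratio filter can be sketched as follows; `keep_text_candidates` is a hypothetical helper, and the direction of the width-height ratio test is an assumption:

```python
def keep_text_candidates(boxes):
    """Filter connected-component bounding boxes before text recognition.

    `boxes` are (width, height) tuples in pixels.  Components with a
    width-height ratio > 5, or with any side < 5 px or > 500 px, are
    discarded as unlikely to be lead-name text.
    """
    kept = []
    for w, h in boxes:
        if w / h > 5:                       # too elongated: likely a trace
            continue
        if min(w, h) < 5 or max(w, h) > 500:  # too small (noise) or too large
            continue
        kept.append((w, h))
    return kept
```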

Thereafter, a trained text character recognition deep learning model32 was used to detect lead names amongst the remaining filtered objects. The input to the model comprised the 12-lead ECG binary image and 12 ground truth lead name text strings (‘I’, ‘II’, ‘III’, ‘avr’, ‘avl’, ‘avf’, ‘v1’, ‘v2’, ‘v3’, ‘v4’, ‘v5’, ‘v6’). The output comprised any text detected by the model, the corresponding bounding box, and a confidence score. A confidence-score threshold was set such that a detection was accepted as a lead name only when one of the 12 text strings was identified with a score exceeding the threshold. In this way, the lead name objects, together with their position, height, and width, were identified for use as horizontal anchor points. The process of obtaining the horizontal distance from lead name detection is presented in Fig. 2C. In instances where detection of some lead names was unsuccessful, the corresponding horizontal anchor points were inferred from the distances between the lead names that were successfully identified in the same ECG. After the horizontal and vertical anchor points had been identified, the ECG segments for each lead were cropped, as shown in Fig. 2D.

Step III: Single lead ECG extraction

Extraction of the ECG signal from each cropped image required removal of “salt-and-pepper” noise, which comprises sparse white and black pixels, as well as of any partial ECG signals from other leads. The latter is particularly relevant for large-amplitude ECG traces that encroach on the cropped images of neighbouring leads, as shown in Fig. 3. To do this, we first used image dilation to connect any discontinuities in the ECG signal of interest, while avoiding spurious connections with noise or neighbouring signals. Thereafter, we considered the largest detectable object in the image to be the ECG signal of interest and all other objects to be artefacts. This process is illustrated in Fig. 3, which demonstrates that the method retains the signal of interest and removes the other objects contained within the cropped image.
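The largest-object selection can be sketched with a connected-component labelling routine such as `scipy.ndimage.label`; the preceding dilation step is omitted here, and `keep_largest_object` is an illustrative name:

```python
import numpy as np
from scipy import ndimage

def keep_largest_object(binary):
    """Retain only the largest connected component of a binary crop.

    Smaller components (salt-and-pepper noise, encroaching traces from
    neighbouring leads) are treated as artefacts and removed.
    """
    labels, n = ndimage.label(binary)
    if n == 0:
        return binary
    sizes = np.bincount(labels.ravel())[1:]  # component sizes; label 0 = background
    return labels == (1 + int(np.argmax(sizes)))
```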

The next step involved converting the extracted ECG binary image into a one-dimensional digital ECG signal. The ECG signal in the binary image comprises a set of pixels with x (time) and y (voltage) coordinates, calibrated at 25 mm/s and 10 mm/mV. At any given point in time (x-axis), several pixels may make up the corresponding amplitude. Given that the digital ECG signal can have only a single y-coordinate for each x-coordinate, we used the median amplitude pixel (y-axis) in the binary image to reconstruct the digital ECG signal. This generated a digital ECG signal with x and y coordinates in pixel units. To ascribe time and voltage values to the digital ECG signal, we determined the time and voltage resolutions using the rhythm (or longest signal) strip in each ECG. Given that a standard 12-lead ECG has a duration of 10 s, the time resolution was calculated as 10 s divided by the number of pixels along the x-axis. The voltage-time resolution ratio is fixed by the standard calibration at 0.1 mV/40 ms = 0.0025 mV/ms, which enabled the voltage resolution to be determined by multiplying the time resolution by this ratio. In this way, the time of each sample of the digital ECG signal was calculated as the number of pixels along the x-axis multiplied by the time resolution, and the amplitude as the number of pixels along the y-axis multiplied by the voltage resolution.
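A sketch of this pixel-to-signal conversion is given below. For simplicity, amplitudes are referenced to the bottom of the crop rather than to the detected baseline, and `pixels_to_signal` is a hypothetical helper:

```python
import numpy as np

def pixels_to_signal(binary, duration_s=10.0):
    """Convert a cleaned single-lead binary image into (time, voltage) arrays.

    For each image column (time sample), the median row index of the signal
    pixels is taken as the amplitude.  Time resolution is the ECG duration
    divided by the image width in pixels; voltage resolution follows from
    the standard 0.0025 mV/ms ratio (10 mm/mV at 25 mm/s).
    """
    h, w = binary.shape
    time_res_ms = duration_s * 1000.0 / w   # ms per pixel along x
    volt_res_mv = time_res_ms * 0.0025      # mV per pixel along y
    t, v = [], []
    for x in range(w):
        ys = np.flatnonzero(binary[:, x])
        if ys.size == 0:
            continue                        # gap in the trace: no sample
        y = np.median(ys)
        t.append(x * time_res_ms)
        v.append((h - 1 - y) * volt_res_mv)  # image y grows downwards
    return np.array(t), np.array(v)
```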

Figure 3
figure 3

The cleaning process of a cropped ECG image. Following the cropping of the region of interest, a dilation process connects possible breaking points horizontally to obtain the full ECG signal. Thereafter, the labelling process identifies the largest object as the signal of interest. Finally, artefacts within the cropped image are removed, retaining only the signal of interest.

Step IV: Dashboard online tool development

We developed the online tool with Python Dash (Plotly). The following steps provide step-by-step instructions for the end-user. First, the user scans and uploads an ECG image; users are reminded to fully redact and anonymise all confidential or patient-identifiable data beforehand. The image is read by the Python function “cv2.imread”, so any image format supported by “cv2.imread” can be used. After uploading, the image is displayed with a fixed height of 600 pixels (px). Next, a dropdown bar provides options to visualise each digitised ECG signal, with the option of changing the resolution by magnifying or minimising the image. The digitised ECG can be downloaded as a spreadsheet containing 13 columns, with the first column providing the time axis and the remaining 12 columns the ECG signal voltages.

Statistical analyses

We validated our tool using Pearson’s correlation and the root mean squared error (RMSE) to quantify the agreement between the ground truth ECG signals and the digitised ECG signals generated by our digitisation tool. Validation was conducted on the independent database obtained from BIDMC. Pearson’s correlation and RMSE were computed using Python (“scipy.stats.pearsonr” for Pearson’s correlation and “sklearn.metrics.mean_squared_error” for RMSE). \(P<0.001\) was considered significant.
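The validation metrics can be computed per lead as follows; for illustration the RMSE is computed directly with NumPy, which is equivalent to `sklearn.metrics.mean_squared_error` followed by a square root, and `validate_lead` is an illustrative name:

```python
import numpy as np
from scipy.stats import pearsonr

def validate_lead(truth, digitised):
    """Pearson correlation (with p-value) and RMSE between a ground-truth
    digital lead and its digitised counterpart."""
    r, p = pearsonr(truth, digitised)
    diff = np.asarray(truth) - np.asarray(digitised)
    rmse = float(np.sqrt(np.mean(diff ** 2)))
    return r, p, rmse
```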

Results

We validated our digitisation tool using three independent validation tests. Because the digitisation tool was developed using a database of paper ECGs without digital ground truth, ECG parameters (QRS duration and the PR, QT, and RR intervals) were the only available means of validation on that database. To obtain a more accurate validation, we therefore used an external ECG database from BIDMC containing digital ECGs.

Figure 4
figure 4

Validation of ECG digitisation tool. Text in red boxes represents input data. Digitised signals generated by our digitisation tool are indicated in blue dashed boxes (A) Validation 1 and 2: comparison of digitised ECG traces with ground truth digital signals; (B) Validation 3: comparison of digitised ECG traces from digital ECGs that were printed, scanned and re-digitised, with ground truth digital signals.

Validation 1: 3 \(\times\) 4 ECGs

This validation was performed with acquired digital and printed ECGs (Fig. 4A). In total, 930 standard 3 \(\times\) 4 ECG images were validated. Lead name detection failed in seven of these images, which are shown in Supplementary Fig. S1. The average correlation and RMSE for the remaining 923 3 \(\times\) 4 ECG images are shown in Table 1; the average correlation ranged from 90 to 97% across the leads. To eliminate the effect of overlapping lead signals, a cardiologist selected 515 of the 923 images in which the lead signals did not overlap. The correlation and RMSE between these 515 digitised ECGs and the ground truth ECG signals in the 3 \(\times\) 4 configuration are shown in Table 2. The average correlation was consistently \(>99\%\) across all leads (\(p<0.001\)), and the average RMSE was consistently 0.04 mV (\(p<0.001\)). Examples of this validation are presented in Fig. 5, in which the red line represents the ground truth and the blue line the digitised result.

Validation 2: 12 \(\times\) 1 and 3 \(\times\) 1 ECGs

Next, we performed validation on 310 ECGs in the 12 \(\times\) 1 and 91 ECGs in the 3 \(\times\) 1 lead configuration (Fig. 4A). Lead name detection failed in two 12 \(\times\) 1 and four 3 \(\times\) 1 ECG images (Supplementary Figs. S2 and S3). The average correlation and RMSE for the remaining 308 12 \(\times\) 1 and 87 3 \(\times\) 1 ECG images are shown in Tables 3 and 4. The average correlation dropped to 60–70% in some leads owing to severe overlapping of ECG signals in the 12 \(\times\) 1 configuration, while the average correlation for the 3 \(\times\) 1 ECG signals reached 80–90%. As before, to exclude images with overlapping signals, a cardiologist selected 45 of the 308 12 \(\times\) 1 images and 51 of the 87 3 \(\times\) 1 images for validation. The correlations between the digitised ECGs and the ground truth signals for these 45 12 \(\times\) 1 and 51 3 \(\times\) 1 ECGs are shown in Tables 5 and 6, respectively; they consistently exceeded 97% in all leads (\(p<0.001\)), and the average RMSE was consistently 0.04 mV (\(p<0.001\)). Examples of digitised and ground truth ECG traces are shown in Fig. 5.

Validation 3: ECG images and prints

Finally, we validated our digitisation tool against 45 images of printed ECGs in the 3 \(\times\) 4 configuration (Fig. 4B). For this validation, we printed each ECG image and re-scanned it to generate an ECG in PDF format, which was then transformed into a PNG image to which our digitisation tool was applied. Digitisation was unsuccessful in one ECG in which the lead names could not be detected, although digitisation of its equivalent digital copy was successful, suggesting that the printed ECG was of poor resolution. The correlations for the remaining 44 scanned ECGs are shown in Table 7. The average correlation between the digitised and validation ECGs was 96% across all leads (\(p<0.001\)), and the average RMSE was consistently 0.05 mV (\(p<0.001\)). These results demonstrate that our digitisation tool generalises to both ECG images and scans of paper ECGs.

Table 1 Correlation and root mean squared error (RMSE) statistics of the digitised results from 923 standard 3 by 4 ECG images and the ground truth digital ECG before image thresholding (validation 1).
Table 2 Correlation and root mean squared error (RMSE) statistics of the digitised results from 515 standard 3 by 4 ECG images and the ground truth digital ECGs (validation 1).
Figure 5
figure 5

Digitisation results and ground truth comparison: ground truth original digital ECG signals (left) and digitised signals from images (centre) shown together with the overlay of both traces (right), for multiple ECG configurations. Ground truth signal is shown as red and digitised signal is shown as blue, the overlay of comparison plot shows two coloured signals overlapped. The overlay shows excellent correlation between the ground truth and digitised signals.

Table 3 Correlation and root mean squared error (RMSE) statistics of the digitised results from selected 308 12 by 1 ECG images and the ground truth digital ECGs before image thresholding (validation 2).
Table 4 Correlation and root mean squared error (RMSE) statistics of the digitised results from 87 3 by 1 ECG images and the ground truth digital ECGs before image thresholding (validation 2).
Table 5 Correlation and root mean squared error (RMSE) statistics of the digitised results from selected 45 12 by 1 ECG images and the ground truth digital ECGs (validation 2).
Table 6 Correlation and root mean squared error (RMSE) statistics of the digitised results from selected 51 3 by 1 ECG images and the ground truth digital ECGs (validation 2).
Table 7 Correlation and root mean squared error (RMSE) statistics of the digitised results from 44 printed and scanned paper ECGs and the ground truth digital ECGs (validation 3).

Discussion

We have developed a robust and user-friendly online ECG digitisation interface that lends itself to the digitisation of large numbers of paper ECGs. Its main advantage is that it is fully-automated and can be readily applied to all printed ECGs irrespective of the lead configuration. Validation on an external database of digital ECGs showed a 99.0% average correlation and a 0.04 mV average RMSE on 8 ECG leads in a 3 by 4 configuration after excluding ECG images with lead signal overlap; without this thresholding, the tool achieved a 90–97% average correlation across the leads. In addition, we show that the software can digitise ECG signals from printed and scanned ECGs with leads arranged in a number of configurations. The average correlation for 12 by 1 ECG signals dropped to 60–70% in some leads owing to overlapping lead signals; however, the tool still achieved a 97% average correlation in the 12 by 1 and 3 by 1 configurations after excluding ECG images with overlapping signals.

The first step of the digitisation process required the algorithm to detect the lead configuration of the printed ECG using horizontal and vertical anchors to facilitate cropping of each lead in turn. Another digitisation tool28 uses a similar approach, applying a line detection algorithm for horizontal and vertical anchor point detection, but it only functions with ECGs printed in a 6 \(\times\) 2 configuration. Although our tool adopted a similar method for vertical anchor detection, we also applied a deep learning-based text recognition model for lead name detection to obtain horizontal anchor points. This has the advantage of allowing the software to extract data from any configuration of ECG. Although horizontal anchor points could be identified by dividing the ECG image in half, this approach would not be accurate for configurations in which the leads are not equidistant, and would only work for the 6 \(\times\) 2 configuration. Other digitisation tools require manual labelling of anchor points21,22,23,24,29 and are restricted in their application by ECG configuration. They are also user-dependent, requiring manual selection of each lead prior to the digitisation process. By contrast, our digitisation tool can be applied to ECGs of different configurations and requires no manual input prior to the digitisation process. We envisage that this will aid its application in clinical and non-clinical settings, enabling larger volumes of printed ECGs to be digitised in a shorter timescale.

Following lead detection and cropping of individual leads, our digitisation tool provides an efficient method for ECG signal extraction. Similar to other digitisation interfaces28, we apply connectivity algorithms to label and remove small objects. However, existing digitisation methods cannot remove all non-ECG artefacts or partial ECG signals from other leads, and this necessitates additional processes, such as iteratively selecting pixels from left to right across the image. Although this methodology enables ECG extraction, it can be complex and time-consuming. By contrast, we used a dynamic morphological method to connect any discontinuities in the ECG signal before identifying the largest labelled object as the ECG signal of interest. This effectively eliminates noise without the need for further computational processing.

Traditionally, many ECG digitisation tools have required manual segmentation, removal of gridlines, and further processing to extract digital signals. Ravichandran et al.22 and Lobodzinski et al.33 applied optical character recognition to scan and reference printed text against a pre-defined character template database, or to store demographic data. Beyond these traditional methods, others have used end-to-end deep learning techniques for ECG digitisation34. However, these techniques are limited in their generalisability to different ECG image databases, especially those with different configurations.

The motivation for developing our tool was to enable users to generate large volumes of digital ECGs from their paper, image, or scanned counterparts quickly and easily. We envisage that this will be particularly useful for individuals who wish to use ECGs in machine learning applications. Although this can be achieved without digitising ECGs, for example by working with paper ECGs or their images30, any outputs from such processes are inherently determined by the quality of the input. By contrast, our tool digitises paper ECGs with different configurations and thereby generates standardised inputs for machine learning algorithms.

Overall, our digitisation tool has the following advantages:

1. It is fully-automated without the need for manual user input of single lead signal segmentation.

2. Text-recognition-based lead name detection makes our digitisation tool generalisable on different configurations of ECG images, or paper-based ECG scans.

3. An efficient ECG extraction algorithm enables swift digitisation at the point of need.

4. Validation using Pearson’s correlation and RMSE between ground truth digital ECGs and digitised ECG waveforms provides a robust means of assessing an ECG digitisation tool.

Although our method accurately extracts ECG signals, there are conditions in which we expect that the tool may not perform as desired. The limitations are listed as below:

1. Our text recognition model was trained on generic images and therefore may not always recognise lead names on printed ECGs. For instance, the tool may not consistently distinguish between leads I, II, and III, particularly if these are obscured by large-voltage ECG signals. Lead name detection may also be inaccurate for ECGs that are pixelated and of low resolution (Supplementary Figs. S1–S3).

2. Similarly, signal extraction may not be accurate where ECG traces overlap, as shown in Table 3. We intend to apply deep neural networks (DNN) to address these limitations, which would obviate the need for manual annotation of leads and improve out-of-distribution detection.

Comparisons of our digitisation tool with other existing tools are summarised in Table 8. Our digitisation tool compares favourably with these, and notably can discriminate different lead configurations.

Table 8 Comparison of different ECG digitisation tools: existing digitisation tools are specific to certain ECG configurations or do not detect ECG anchor points automatically.

Conclusion

We have developed a validated, fully-automated, user-friendly online 12-lead ECG digitisation tool that demonstrates a high degree of accuracy and reliability across external validation datasets. It consists of multiple logic-based modules and a sophisticated text character recognition deep learning model, enabling its application to all common ECG configurations in different clinical settings. Furthermore, it can be used on printed and/or scanned ECGs, thereby enabling large-scale digitisation of paper ECGs without any user input.