Combining machine learning and nanopore construction creates an artificial intelligence nanopore for coronavirus detection

Taniguchi, Masateru; Minami, Shohei; Ono, Chikako; Hamajima, Rina; Morimura, Ayumi; Hamaguchi, Shigeto; Akeda, Yukihiro; Kanai, Yuta; Kobayashi, Takeshi; Kamitani, Wataru; Terada, Yutaka; Suzuki, Koichiro; Hatori, Nobuaki; Yamagishi, Yoshiaki; Washizu, Nobuei; Takei, Hiroyasu; Sakamoto, Osamu; Naono, Norihiko; Tatematsu, Kenji; Washio, Takashi; Matsuura, Yoshiharu; Tomono, Kazunori

doi:10.1038/s41467-021-24001-2

Download PDF

Article
Open access
Published: 17 June 2021

Combining machine learning and nanopore construction creates an artificial intelligence nanopore for coronavirus detection

Masateru Taniguchi ORCID: orcid.org/0000-0002-0338-8755¹,
Shohei Minami²,
Chikako Ono^2,3,
Rina Hamajima²,
Ayumi Morimura⁴,
Shigeto Hamaguchi^4,5,
Yukihiro Akeda^2,4,5,
Yuta Kanai²,
Takeshi Kobayashi²,
Wataru Kamitani⁶,
Yutaka Terada⁷,
Koichiro Suzuki⁸,
Nobuaki Hatori⁸,
Yoshiaki Yamagishi ORCID: orcid.org/0000-0002-9163-787X^4,5,9,
Nobuei Washizu¹⁰,
Hiroyasu Takei¹¹,
Osamu Sakamoto¹¹,
Norihiko Naono¹¹,
Kenji Tatematsu¹,
Takashi Washio¹,
Yoshiharu Matsuura ORCID: orcid.org/0000-0001-9091-8285^2,3 &
…
Kazunori Tomono^4,5

Nature Communications volume 12, Article number: 3726 (2021) Cite this article

16k Accesses
78 Citations
325 Altmetric
Metrics details

Subjects

Abstract

High-throughput, high-accuracy detection of emerging viruses allows for the control of disease outbreaks. Currently, reverse transcription-polymerase chain reaction (RT-PCR) is currently the most-widely used technology to diagnose the presence of SARS-CoV-2. However, RT-PCR requires the extraction of viral RNA from clinical specimens to obtain high sensitivity. Here, we report a method for detecting novel coronaviruses with high sensitivity by using nanopores together with artificial intelligence, a relatively simple procedure that does not require RNA extraction. Our final platform, which we call the artificially intelligent nanopore, consists of machine learning software on a server, a portable high-speed and high-precision current measuring instrument, and scalable, cost-effective semiconducting nanopore modules. We show that artificially intelligent nanopores are successful in accurately identifying four types of coronaviruses similar in size, HCoV-229E, SARS-CoV, MERS-CoV, and SARS-CoV-2. Detection of SARS-CoV-2 in saliva specimen is achieved with a sensitivity of 90% and specificity of 96% with a 5-minute measurement.

Multiplexed detection of viral antigen and RNA using nanopore sensing and encoded molecular probes

Article Open access 14 November 2023

Ren Ren, Shenglin Cai, … Joshua B. Edel

Simultaneous identification of viruses and viral variants with programmable DNA nanobait

Article Open access 16 January 2023

Filip Bošković, Jinbo Zhu, … Ulrich F. Keyser

Massively scaled-up testing for SARS-CoV-2 RNA via next-generation sequencing of pooled and barcoded nasal and saliva samples

Article 01 July 2021

Joshua S. Bloom, Laila Sathe, … Valerie A. Arboleda

Introduction

Human coronavirus, HCoV-229E, is one of the first coronavirus strains reported to be associated with nasal colds¹. In the past 20 years, other species of coronaviruses, namely, severe acute respiratory syndrome (SARS) and Middle East respiratory syndrome (MERS), have caused a pandemic in the form of severe respiratory illness². Recently, SARS-CoV-2, the seventh species of coronavirus, has spread all over the world, causing the outbreak of an acute respiratory disease^{3,4,5,6,7,8,9,10}. As well as vaccines and effective treatments, testing, and quarantine are needed to control the transmission of the virus. Currently, RT-PCR is the gold standard for SARS-CoV-2 testing that is based on the principle that the single-stranded RNA is present in this virus and the primer forms a double helix. Prior to the genetic test, the process of extraction and purification of viral RNA is time-consuming. The exposure increases the risk of the inspector contracting the virus. Therefore, there is a need for an inspection method with higher throughput¹¹.

Nanopores have through holes with diameter ranging from several nanometers to several hundreds of nanometers on the substrate^{12,13,14,15,16,17,18}. Low-aspect solid-state nanopores with nanopore thickness/diameter <1 are used for the detection of DNA¹⁹, viruses^19,20,21,22, and bacteria²³. When the virus is transported from the cis side to the trans side by electrophoretic force, the ionic current decreases (Fig. 1a, b). The ionic current versus time waveform obtained from the nanopore has information on the volume, structure, and surface charge of the target being analyzed²³. At the laboratory level, it has been demonstrated that by classifying the waveform data using artificial intelligence, a single virus can be directly identified with high accuracy that does not require the extraction of the genome^{20,23,24,25,26,27,28}. However, in order to obtain sufficient amount of virus learning data in clinical specimens and achieve highly precise detection with increased reproducibility, the manufacturing of nanopore devices with improved accuracy and better yield is a critical constraint. On the other hand, the ionic current obtained from the nanopore is in the order of several tens of pA. The current characteristics obtained from the nanopore largely depend not only on the electrical characteristics of the nanopore, but also on the electrical characteristics of the measuring device and the fluidic device that transports the specimen to the nanopore¹⁷. Therefore, the development of a dedicated measuring device and flow channel suitable for the nanopores has also been a major limitation in realizing a highly accurate diagnostic system. In the current study, we have developed an artificial intelligence-assisted nanopore-based device to accurately detect the viruses.

Results

AI-Nanopore platform

We developed a nanopore module (25 mm × 25 mm × 5 mm) in which a nanopore chip and a plastic channel were fused (Fig. 1c). Both sides of the silicon chip were chemically bonded to the plastic channel. The hydrophilic channels on the front surface (blue) and the back surface (red) revealed a crossbar structure and passed through the nanopores. From the specimen inlet with a diameter of 1 mm, 15 μl of buffer solution and specimen were pipetted into the front and back channels, respectively. Ag/AgCl electrodes were fabricated on the polymer substrate in each channel for stable current measurement with high reproducibility for a longer duration. A silicon chip (5 mm × 5 mm × 0.5 mm) on which a 50-nm thick SiN was deposited had nanopores about 300 nm in diameter comparable with the diameters of coronaviruses of about 80–120 nm (Fig. 1d, e)^1,29. Silicon chips were manufactured in units of 12-inch wafers by using microfabrication technology, and were cut into chips by dicing. Through this mass production process, nanopores were produced with high accuracy (diameter error ±10 nm) and high yield (90%). The diameter of the nanopore was flexible and was modified according to the size of the virus to be measured.

The performance of the developed measurement system is evaluated using standard polystyrene nanoparticles nearly uniform shape and average diameters of 200 and 220 nm that are diluted with 1× phosphate buffered saline (PBS). A buffer solution is placed in the trans channel of the nanopore module, nanoparticles of 15 μl diluted with the buffer solution is placed in the cis channel, and the module is placed in a dedicated cartridge. The cartridge is placed in a measuring instrument and the current–time waveform is measured for 0.1 V applied voltage. Nanoparticles have been measured using portable nanoSCOUTER^TM. The number of nanopore devices used and the corresponding waveforms obtained are shown in Supplementary Tables S1–S3, respectively.

Waveforms of the two nanoparticles are obtained with high reproducibility (Fig. 1f–h). The histograms of I_p and t_d obtained from the waveforms corresponding to the nanoparticles have regions that overlap with the histogram (Fig. 1i, j). This overlap hinders the high-precision identification of the two nanoparticles using a single waveform. In order to overcome this issue, machine learning is implemented based on the algorithm shown in Fig. 1k. The detailed machine learning algorithm is described in Supplementary Figs. S1–S5. In the data training phase of the machine learning algorithm, the ionic current–time waveform of each nanoparticle is automatically extracted by an in-house developed signal extraction software and the features of the two nanoparticles are created. In the case of nanoparticles, the features are independent parameters such as I_p, t_d, current vector, time vector, and a combination of these features (e.g., I_p + t_d). The current vector (I₁, I₂, ‧ ‧ ‧, I₁₀) and time vector (t₁, t₂, ‧ ‧ ‧, t₁₀) are obtained by dividing the waveform data into tenfolds along the time and current directions³⁰. All the features are merged and machine learning is performed to create a single classifier with the highest F-value (Supplementary Fig. S5). Precision in machine learning refers to the percentage of data that is expected to be positive and is actually positive, whereas recall refers to the percentage of data that is actually positive and is predicted to be positive. Since precision and recall are in a tradeoff relationship, an index that considers these two indexes together, i.e., the harmonic mean of the two indexes, is defined as the F-value. The F-value is defined by Equation (4) in Supplementary Fig. S5 and is calculated using the confusion matrix. The discrimination accuracy between the 200 and 220 nm nanoparticles using an ionic current–time waveform is 97% (Fig. 1l). This indicates that a single ionic current–time waveform is sufficient to precisely differentiate the nanoparticles. The classifier has been built using the measurement data obtained from the three modules that is unaffected by the manufacturing variations of the nanopores and the measurement environment (Supplementary Figs S6 and S7). The accuracy of 97% for a single waveform indicates that if two waveforms are acquired, a classification accuracy of ≥99.9% can be achieved.

Discriminating cultivated coronaviruses

When HCoV-229E viral sample of 100 pfu/μl is measured at −0.1 V, the ionic current–time waveforms are obtained at a frequency of 14.2 pulses/min (Fig. 2a, b). When measured at 0.1 V, one waveform is obtained in 15 min. To confirm that the virus passes through the nanopore at −0.1 V, RT-PCR measurement is performed on the solution in the trans channel. When the solution is extracted from the trans channel, it is expected that the virus will be adsorbed on the wall surface of the channel and the number of viruses that are extracted reduces. Hence, RT-PCR measurement is performed after acquiring about 1200 waveforms. The test results showed the viral presence when passed through the nanopore at −0.1 V. However, in the RT-PCR performed in the trans channel that is left standing for 6 h without applying a voltage, absence of virus is noted. This result has demonstrated that the efficient viral transmission through the nanopore is achieved by electrophoresis instead of diffusional movement.

**Fig. 2: Identification of cultured coronavirus.**

To investigate the detection limit of the nanopore platform, the ionic current–time waveform is measured at −0.1 V by varying the HCoV-229E concentration (Fig. 2c). The threshold of the viral concentration is set at 250 pfu/μl due to the difficulty in culturing it at high concentrations. When the number of waveforms obtained by measuring for 15 min is examined, an average of three waveforms could be obtained at 2.5 pfu/μl. However, for the same duration with a concentration of 0.25 pfu/μl, no waveform is obtained. It is therefore concluded that the detection limit of coronavirus in Dulbecco’s modified Eagle’s medium (DMEM) is 2.5 pfu/μl when 0.1 V is applied for 15 min using a nanopore with a diameter of 300 nm.

The ionic current–time waveforms of MERS-CoV, SARS-CoV, and SARS-CoV-2 are individually measured under the same experimental conditions as HCoV-229E (Fig. 2a, b). The number of nanopore modules used in the measurement and the number of waveforms obtained are shown in Supplementary Table S4. The histograms of I_p and t_d of the four viruses reveal a major overlap (Fig. 2d, e), which complicates the accurate identification of the viruses. Machine learning is performed to identify the four cultured viruses using the algorithms (Fig. 1k and Supplementary Figs S1–S5) and features that are used to identify the nanoparticles. A classification model is created by using random forest algorithm on the waveforms of the four coronaviruses. The F-value for the four types of viruses is 0.66 (Fig. 2f, g). When this value is greater than 0.25, the four viruses could be identified in one waveform. The results show that the artificial intelligent nanopores can identify coronaviruses with high accuracy. The identification accuracy of the combination of two viruses and that of three viruses showed higher F-values ≥0.76 and ≥0.69, respectively (Supplementary Figs S8 and S9). The discrimination results have also revealed that the MERS-CoV and HCoV-229E species are the most difficult to discriminate.

Diagnostics for clinical specimens

Due to the high viscosity of saliva, it was filtered through a 0.45-μm membrane filter and diluted with 1× PBS. When the ionic current–time waveforms of the PCR-positive specimens of saliva are measured for SARS-CoV-2, the waveform is obtained at both 0.1 and −0.1 V with ten times higher number of waveforms obtained at the former than the latter applied voltage (Fig. 3a). To confirm that the novel coronavirus passes through the nanopore at 0.1 V, RT-PCR is performed on the solution in the trans channel after obtaining about 3000 waveforms. RT-PCR measurements have demonstrated that the new coronavirus passed through the nanopore when 0.1 V is applied. This result indicates that the surface charges of the cultured virus and its counterpart in the clinical specimen are different³¹.

**Fig. 3: Learning and diagnosis of clinical specimens.**

Ionic current–time traces of PCR-positive and PCR-negative specimens of saliva both showed pulsed waveforms (Fig. 3a). The number of nanopore modules used and the corresponding waveforms obtained are shown in Supplementary Table S5. The histograms of I_p and t_d obtained by measuring saliva completely overlap (Fig. 3b, c). Therefore, it is not possible to accurately determine whether a specimen is positive or negative based on the histograms.

Alternatively, using machine learning, waveforms were extracted from the ionic current–time traces of PCR-positive and PCR-negative specimens to determine positive and negative of the new coronavirus (Fig. 3d, e and Supplementary Figs S10–S12). Features similar to the studies on nanoparticles and cultured viruses are generated for each sample. The waveform obtained from the PCR-negative specimen is a noise waveform. Assuming that the noise waveform is common to the PCR-negative specimen and the PCR-positive specimen, the waveform obtained by the PCR-positive specimen is composed of the noise waveform and the waveform of the new coronavirus. When the assembly including the positive unlabeled classification method^30,32 is used, the waveform group of the new coronavirus and the waveform group of the noise from the waveform group of the PCR-negative specimen are learned from the waveform group of the PCR-positive specimen. The positive ratio given by the ratio of the number of waveforms of the new coronavirus to the number of waveforms of the specimen is calculated. The positive ratio provides a threshold for determining whether a specimen is positive or negative. Comparing the waveform corresponding to the novel coronavirus and that pertaining to noise, it is learned whether a given waveform is a positive or negative waveform. This learning empowers the classifier that results in the highest discrimination accuracy and confidence corresponding to one waveform. Confidence is a measure of the level of accuracy with which a waveform is classified as positive or negative. In clinical specimens, random forest algorithm based classifier gives the highest accuracy. In the diagnosis of clinical specimens, the algorithm shown in Fig. 3e and Supplementary Fig. S12 is used to determine the positive or negative nature of a specimen. The confidence is calculated from the F-value computed by using the classifier obtained by machine learning. Subsequently, the positive and negative nature of a waveform is determined. The positive and negative of the specimen are determined based on the threshold value of the positive ratio obtained in the learning process. The detailed algorithm is shown in Supplementary Fig. S12.

In learning clinical specimens of saliva, 40 PCR-positive and 40 PCR-negative specimens that were stored under refrigerated condition have been used. The number of nanopore modules used and the corresponding waveforms obtained are shown in Supplementary Table S5. The accuracy of discrimination between PCR-positive and PCR-negative by one ion current–time waveform is obtained as F = 1.00. When this value is >0.50, it is possible to distinguish between the positive and negative specimens with one waveform. The sensitivity and specificity at the measurement time of n minutes (n = 1–5) are calculated using Equations 1–4 shown in Supplementary Fig. S5 based on the confusion matrix (Supplementary Fig. S13) at each measurement time. The sensitivity and specificity are time independent and are both 100% (Fig. 3f). In the true positive sample, all waveforms exhibited high positive confidence during the measurement time, while in the true negative sample, all waveforms displayed low positive confidence (Supplementary Fig. S14). The errata for all specimens obtained from the 5 min measurement along with the positive confidence and ratio are given in Supplementary Data 1.

Using the classifier obtained in the learning process, 50 PCR-positive samples and 50 PCR-negative samples are diagnosed independently of the training data. The number of used nanopore modules and the corresponding obtained waveforms are shown in Supplementary Table S6. The sensitivity and specificity at n minutes (n = 1–5) are calculated based on the confusion matrix (Supplementary Fig. S15). Both sensitivity and specificity increased with an increase in the measurement time (Fig. 3g). After 5 min, the sensitivity and specificity are established to be 90% and 96%, respectively. Thus, the sensitivity and specificity are lower than those in the learning process, which could be attributed to overfitting. The errata for all specimens obtained from the 5 min measurement along with the positive confidence and ratio are shown in Supplementary Data 2. Figure 3h–k demonstrates the time dependence of the waveform confidence during a 5-min measurement. In the scatter plot, the orange and green points correspond to the waveform, which is determined to be positive and negative, respectively. The time dependence of the confidence of the waveform in the true positive sample suggests that the waveform with a high positive confidence is continuously large during the measurement and that the positive waveform ratio is also large (Fig. 3h). Conversely, the dependence of the waveform confidence in the true negative sample indicates that the number of waveforms with high positive confidence is small and that the positive waveform ratio is also small (Fig. 3i). Among false positive specimens, there are waveforms with a high positive confidence and a relatively large positive waveform ratio (Fig. 3j). Conversely, false negative specimens exhibit a smaller number of waveforms with a high positive confidence and a smaller positive waveform ratio (Fig. 3k). The sensitivity of 90% gives a false negative rate of 10%; therefore, it could be used for screening tests aimed at diagnosing a large number of people with a high throughput.

Discussion

This measurement platform using machine learning consists of nanopore devices, measuring devices, and machine learning. Since nanopore devices are manufactured within the processing accuracy (diameter error ±10 nm), there is no similar platform. The measurements are not made on the same platform in this case, and it is expected that the nanopore device could affect the discriminant F-value. That is, machine learning might be able to distinguish between nanopore devices rather than nanoparticles and viruses. To show that machine learning does not differentiate between nanopore devices, we can use the same nanopores to measure the combination of SARS-CoV-2, SARS-CoV, MERS-CoV, and HCoV-229E. However, biosafety level 3 (BSL-3) facilities can handle SARS-CoV and SARS-CoV-2, while BSL-3 facilities can handle MERS-CoV, and BSL-2 facilities can handle HCoV-229E. From the viewpoint of preventing contamination by other viruses, it is impossible to conduct experiments that simultaneously handle these viruses. However, the F-values = 0.96–1.0 (Supplementary Fig. S6) that discriminates between nanoparticles with diameters of 200 and 220 nm using different nanopore devices are identical to the F-values = 0.96–0.99 (Supplementary Fig. S7) used to distinguish between these two nanoparticles using the same nanopore devices. As a result, the difference between the nanopore devices has no effect on the F-value within the range of the processing accuracy of the nanopore device used in this study.

The artificial intelligent nanopore was successful in the detection of virus with high accuracy. By changing the training data from cultured viruses to PCR-positive/-negative specimens, the artificial intelligent nanopore platform becomes a device capable of detecting both positive and negative specimen with high sensitivity at high throughput. By modifying the training data, the platform is a versatile virus diagnostic system. For instance, infections caused by influenza A virus, which usually spreads between autumn–winter every year, will show the similar symptoms as caused by SARS-CoV-2^33,34,35. When a person infected by the new coronavirus, based on the flu-like symptoms approaches for medical aid, the risk of infection to medical staffs and the spreads the infection increases if not diagnosed accurately. According to this study, machine learning of the cultured SARS-CoV-2 and influenza A virus (H1N1) showed an extremely high discriminator with an F-value of 0.90 (Supplementary Fig. S16). When the clinical specimens collected from patients infected with each virus are used as training data, the identification result of the cultured viruses will enable the development of a device that can diagnose both viruses with high accuracy.

Methods

Preparation of cultured viruses

African green monkey kidney Vero cells (ATCC^®CCL-81™) and human cervix adenocarcinoma HeLa cells (ATCC) were maintained and grown in DMEM (Nacalai Tesque, Kyoto, Japan) containing 5% fetal bovine serum (Thermo Fisher Scientific, MA, USA). SARS-CoV Frankfurt strain, MERS-CoV/EMC2012 strain, and SARS-CoV-2/Hu/DP/Kng/19-020 strain (GenBank Accession number LC528232) were propagated using Vero cells. HCoV-229E (ATCC VR740) was propagated in HeLa cells. Viral stocks of these coronaviruses were filtered using Millex-HV Syringe Filter Unit 0.45 μm (Merck, Darmstadt, Germany) and the filtrates were used for diagnostic method using nanopore and machine learning. Viral titers were calculated by the modified 50% tissue culture infectious dose (TCID₅₀) assay and plaque assay^36,37. The genome copy number of HCoV-229E was determined by quantitative PCR using Taqman probe. Influenza A (H1N1pdm09, California/7/2009) virus was added to MDCK cells (ECACC84121903) in DMEM, and after incubation for 6 h, trypsin was added at a final concentration of 2.5 µg/ml. The infected cells were incubated until a cytopathic effect was observed. The medium of the infected cells was centrifuged at 440 × g for 5 min, the supernatant of medium was collected, and filtered with a filter having a pore size of 0.45 µm (Millex-HV; Millipore Co.). All the experiments using influenza virus were approved by the institutional biosafety committee, and precautiously carried out in BSL-2 facilities.

Clinical specimens

Specimens were acquired as residual samples from clinical examination with the approval of Institutional Review Board (IRB), Osaka University Hospital, Osaka University, Japan. The need to obtain informed consent was waived by the IRB committee since the samples were residual from a clinical examination without using any identifiable information of the individuals or the application of any intervention, in accordance with the Ethical Guidelines for Medical and Health Research Involving Human Subjects. (Public Notice of the Ministry of Education, Culture, Sports, Science and Technology and the Ministry of Health, Labor and Welfare No. 3 of 2014).

Artificial intelligence nanopore platform

A nanopore module, nanopore measuring instrument, and Aipore-ONE^TM, artificial intelligence software developed and implemented by Aipore Inc. were used. HCoV-229E was measured at the BSL-2 facility. SARS-CoV, SARS-CoV-2, and MERS-CoV were measured at the BSL-3 facility at the Research Institute for Microbial Diseases, Osaka University. The experiments were confirmed by the Institutional Biosafety Committee. The nanoSCOUTER™ was used for all measurements. The bias voltage was 0.1 V. The current amplification factor was 10⁷, and it had the characteristic of F_c = 260 KHz and the sampling rate was set as 1 MHz.

PCR analysis

Viral RNA of HCoV-229E was purified using TRIzol LS Reagent (Thermo Fisher Scientific). Viral RNA copy numbers were quantified using QuantiTect Probe RT-PCR Kit (Qiagen GmbH, Hilden, Germany) and PCR thermal cycler Dice (Takara). Primers and probes (Eurofins Genomics, Ebersberg, Germany) specific to the viral RNA-dependent RNA polymerase (RdRp) are listed in Supplementary Table S7. For standard of genome copy number, the partial nucleotides of RdRp (position 998-1999 of HCoV-229E, GenBank Accession number MT438700) was amplified using forward and reverse primers listed in Supplementary Table S7 and cloned into pCAGGS plasmid using Gibson assembly kit (New England Biolabs, Germany).

Saliva samples were examined by SARS-CoV-2 Direct Detection RT-qPCR Kit (Takara Bio, Otsu, Shiga, Japan) using Roche Light Cycler 96 for the detection of SARS-CoV-2 according to the manufacturer instructions. Primers and probes recommended by the CDC for N1, N2, and RNase P targets in a multiplex reaction are listed in Supplementary Table S7. In SARS-CoV-2 positive samples, viral copy numbers were calculated based on the C_t value of RT-qPCR.

Data availability

Source data are provided with this paper. The measurement data of the current–time profile of the nanoparticles, four types of viruses, and clinical specimens obtained in this study are available on Zenodo (https://doi.org/10.5281/zenodo.4761714). We provide the temporary account that enables researchers to login the Aipore server through a web browser to reproduce the AI model on Aipore-One^TM, deposited in Zenodo (https://doi.org/10.5281/zenodo.4761714) as a README file. Source Data are provided with this paper.

References

Masters, P. S. The molecular biology of coronaviruses. Adv. Virus Res. 66, 193–292 (2006).
Article CAS Google Scholar
de Wit, E., van Doremalen, N., Falzarano, D. & Munster, V. J. SARS and MERS: recent insights into emerging coronaviruses. Nat. Rev. Microbiol. 14, 523–534 (2016).
Article Google Scholar
Chan, J. F. W. et al. A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. Lancet 395, 514–523 (2020).
Article CAS Google Scholar
Chen, N. S. et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study. Lancet 395, 507–513 (2020).
Article CAS Google Scholar
Holshue, M. L. et al. First case of 2019 novel coronavirus in the United States. N. Engl. J. Med. 382, 929–936 (2020).
Article CAS Google Scholar
Huang, C., Wang, Y. & Li, X. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395, 496–496 (2020).
Article Google Scholar
Li, Q. et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N. Engl. J. Med. 382, 1199–1207 (2020).
Article CAS Google Scholar
Wu, F. et al. A new coronavirus associated with human respiratory disease in China. Nature 579, 265–269 (2020).
Article ADS CAS Google Scholar
Zhou, P. et al. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 579, 270–273 (2020).
Article ADS CAS Google Scholar
Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. New. Engl. J. Med. 382, 727–733 (2020).
Article CAS Google Scholar
Cheng, M. P. et al. Diagnostic testing for severe acute respiratory syndrome–related coronavirus 2. Ann. Intern. Med. 172, 726–734 (2020).
Article Google Scholar
Branton, D. et al. The potential and challenges of nanopore sequencing. Nat. Biotechnol. 26, 1146–1153 (2008).
Article CAS Google Scholar
Dekker, C. Solid-state nanopores. Nat. Nanotechnol. 2, 209–215 (2007).
Article ADS CAS Google Scholar
Howorka, S. & Siwy, Z. Nanopore analytics: sensing of single molecules. Chem. Soc. Rev. 38, 2360–2384 (2009).
Article CAS Google Scholar
Venkatesan, B. M. & Bashir, R. Nanopore sensors for nucleic acid analysis. Nat. Nanotechnol. 6, 615–624 (2011).
Article ADS CAS Google Scholar
Voelkerding, K. V., Dames, S. A. & Durtschi, J. D. Next-generation sequencing: from basic research to diagnostics. Clin. Chem. 55, 641–658 (2009).
Article CAS Google Scholar
Wanunu, M. Nanopores: a journey towards DNA sequencing. Phys. Life Rev. 9, 125–158 (2012).
Article ADS Google Scholar
Rozevsky, Y. et al. Quantification of mRNA expression using single-molecule nanopore sensing. ACS Nano 14, 13964–13974 (2020).
Article CAS Google Scholar
McMullen, A., de Haan, H. W., Tang, J. X. & Stein, D. Stiff filamentous virus translocations through solid-state nanopores. Nat. Commun. 5, ARTN 4171 (2014).
Article ADS Google Scholar
Arima, A. et al. Selective detections of single-viruses using solid-state nanopores. Sci. Rep. 8, ARTN16305 (2018).
Article ADS Google Scholar
Darvish, A. et al. Mechanical characterization of HIV-1 with a solid-state nanopore sensor. Electrophoresis 40, 776–783 (2019).
Article CAS Google Scholar
Wu, H. W. et al. Translocation of rigid rod-shaped virus through various solid-state nanopores. Anal. Chem. 88, 2502–2510 (2016).
Article CAS Google Scholar
Tsutsui, M. et al. Discriminating single-bacterial shape using low-aspect-ratio pores. Sci. Rep. 7, ARTN17371 (2017).
Article ADS Google Scholar
Nivala, J., Mulroney, L., Li, G., Schreiber, J. & Akeson, M. Discrimination among protein variants using an unfoldase-coupled nanopore. ACS Nano 8, 12365–12375 (2014).
Article CAS Google Scholar
Henley, R. Y. et al. Electrophoretic deformation of individual transfer RNA molecules reveals their identity. Nano Lett. 16, 138–144 (2016).
Article ADS CAS Google Scholar
Im, J., Lindsay, S., Wang, X. & Zhang, P. M. Single molecule identification and quantification of glycosaminoglycans using solid-state nanopores. Acs Nano 13, 6308–6318 (2019).
Article CAS Google Scholar
Arima, A. et al. Digital pathology platform for respiratory tract infection diagnosis via multiplex single-particle detections. ACS Sen. 5, 3398–3403 (2020).
Article CAS Google Scholar
Meyer, N. et al. Machine learning to improve the sensing of biomolecules by conical track-etched nanopore. Biosensors 10, 140 (2020).
Article CAS Google Scholar
Goldsmith, C. S. & Miller, S. E. Modern uses of electron microscopy for detection of viruses. Clin. Microbiol. Rev. 22, 552–563 (2009).
Article Google Scholar
Taniguchi, M. et al. High-precision single-molecule identification based on single-molecule information within a noisy matrix. J. Phys. Chem. C. 123, 15867–15873 (2019).
Article CAS Google Scholar
Michen, B. & Graule, T. Isoelectric points of viruses. J. Appl. Microbiol. 109, 388–397 (2010).
Article CAS Google Scholar
Elkan, C. & Noto, K. Proc. of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 213–220 (ACM, Las Vegas, Nevada, USA, 2008).
Yang, J. et al. Prevalence of comorbidities and its effects in patients infected with SARS-CoV-2: a systematic review and meta-analysis. Int. J. Infect. Dis. 94, 91–95 (2020).
Article CAS Google Scholar
Bhatraju, P. K. et al. Covid-19 in critically ill patients in the Seattle region—case series. N. Engl. J. Med. 382, 2012–2022 (2020).
Article CAS Google Scholar
To, K. K. W. et al. Temporal profiles of viral load in posterior oropharyngeal saliva samples and serum antibody responses during infection by SARS-CoV-2: an observational cohort study. Lancet Infect. Dis. 20, 565–574 (2020).
Article CAS Google Scholar
Terada, Y., Kawachi, K., Matsuura, Y. & Kamitani, W. MERS coronavirus nsp1 participates in an efficient propagation through a specific interaction with viral RNA. Virology 511, 95–105 (2017).
Article CAS Google Scholar
Kawase, M., Shirato, K., Matsuyama, S. & Taguchi, F. Protease-mediated entry via the endosome of human coronavirus 229E. J. Virol. 83, 712–721 (2009).
Article CAS Google Scholar

Download references

Acknowledgements

This research was supported by AMED under the Grant Number JP20he0722002. We thank Dr Bart L. Haagmens (Erasmus Medical Center) for providing the MERS-CoV/EMC2012 strain through Dr Makoto Takeda (National Institute of Infectious Diseases), Dr John Ziebuhr (University of Würzburg) for providing the Frankfurt strain of SARS-CoV through Dr Fumihiro Taguchi, and Dr Tomohiko Takasaki (Kanagawa Prefectural Institute of Public Health) for providing the SARS-CoV-2/Hu/DP/Kng/19-020 strain.

Author information

Authors and Affiliations

The Institute of Scientific and Industrial Research, Osaka University, Ibaraki, Osaka, Japan
Masateru Taniguchi, Kenji Tatematsu & Takashi Washio
Research Institute for Microbial Diseases, Osaka University, Suita, Osaka, Japan
Shohei Minami, Chikako Ono, Rina Hamajima, Yukihiro Akeda, Yuta Kanai, Takeshi Kobayashi & Yoshiharu Matsuura
Center for Infectious Diseases Education and Research, Osaka University, Suita, Osaka, Japan
Chikako Ono & Yoshiharu Matsuura
Graduate School of Medicine, Osaka University, Suita, Osaka, Japan
Ayumi Morimura, Shigeto Hamaguchi, Yukihiro Akeda, Yoshiaki Yamagishi & Kazunori Tomono
Osaka University Hospital, Osaka University, Suita, Osaka, Japan
Shigeto Hamaguchi, Yukihiro Akeda, Yoshiaki Yamagishi & Kazunori Tomono
Graduate School of Medicine, Gunma University, Maebashi, Gunma, Japan
Wataru Kamitani
Center for Vaccine Research, University of Pittsburgh, Pittsburgh, PA, USA
Yutaka Terada
The Research Foundation for Microbial Diseases of Osaka University, Suita, Osaka, Japan
Koichiro Suzuki & Nobuaki Hatori
Medical Center for Translational and Clinical Research, Osaka University Hospital, Osaka University, Suita, Osaka, Japan
Yoshiaki Yamagishi
ADVANTEST Corporation, Kazo, Saitama, Japan
Nobuei Washizu
Aipore Inc., Shibuya, Tokyo, Japan
Hiroyasu Takei, Osamu Sakamoto & Norihiko Naono

Authors

Masateru Taniguchi
View author publications
You can also search for this author in PubMed Google Scholar
Shohei Minami
View author publications
You can also search for this author in PubMed Google Scholar
Chikako Ono
View author publications
You can also search for this author in PubMed Google Scholar
Rina Hamajima
View author publications
You can also search for this author in PubMed Google Scholar
Ayumi Morimura
View author publications
You can also search for this author in PubMed Google Scholar
Shigeto Hamaguchi
View author publications
You can also search for this author in PubMed Google Scholar
Yukihiro Akeda
View author publications
You can also search for this author in PubMed Google Scholar
Yuta Kanai
View author publications
You can also search for this author in PubMed Google Scholar
Takeshi Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Wataru Kamitani
View author publications
You can also search for this author in PubMed Google Scholar
Yutaka Terada
View author publications
You can also search for this author in PubMed Google Scholar
Koichiro Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Nobuaki Hatori
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiaki Yamagishi
View author publications
You can also search for this author in PubMed Google Scholar
Nobuei Washizu
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyasu Takei
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Sakamoto
View author publications
You can also search for this author in PubMed Google Scholar
Norihiko Naono
View author publications
You can also search for this author in PubMed Google Scholar
Kenji Tatematsu
View author publications
You can also search for this author in PubMed Google Scholar
Takashi Washio
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiharu Matsuura
View author publications
You can also search for this author in PubMed Google Scholar
Kazunori Tomono
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M. T. conceived the technology, supervised the project, and wrote the manuscript with input from co-authors. A. M., S. H., Y. A., Y. Y., K. S., N. H., and K. T. prepared clinical specimens and supervised the clinical specimen experiments. S. M., R. H., C. O., Y. K., T. K., W. K., Y. T., and Y. M. cultivated coronavirus and conducted experiments in BSL-3. K. T. cultured influenza virus. N. W. has developed a nanopore instrument. H. T. and N. N. conducted experiments on BSL-2 and measured clinical specimens. O. S. and T. W. developed the software for machine learning.

Corresponding authors

Correspondence to Masateru Taniguchi, Yoshiharu Matsuura or Kazunori Tomono.

Ethics declarations

Competing interests

M. T. and N. N. are the co-founders of Aipore Co., Ltd., and its Director and Chief Executive Officer, respectively. The authors other than M. T. and N. N. have no competing interests to declare.

Additional information

Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Peer Review File

Description of Additional Supplementary Files

Supplementary Data 1-2

Reporting-Summary

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Taniguchi, M., Minami, S., Ono, C. et al. Combining machine learning and nanopore construction creates an artificial intelligence nanopore for coronavirus detection. Nat Commun 12, 3726 (2021). https://doi.org/10.1038/s41467-021-24001-2

Download citation

Received: 20 October 2020
Accepted: 28 May 2021
Published: 17 June 2021
DOI: https://doi.org/10.1038/s41467-021-24001-2

This article is cited by

Using novel micropore technology combined with artificial intelligence to differentiate Staphylococcus aureus and Staphylococcus epidermidis
- Ayumi Morimura
- Masateru Taniguchi
- Shigeto Hamaguchi
Scientific Reports (2024)
Single-molecule RNA sizing enables quantitative analysis of alternative transcription termination
- Gerardo Patiño-Guillén
- Jovan Pešović
- Ulrich Felix Keyser
Nature Communications (2024)
Nanopore analysis of salvianolic acids in herbal medicines
- Pingping Fan
- Shanyu Zhang
- Shuo Huang
Nature Communications (2024)
Saliva-based detection of SARS-CoV-2: a bibliometric analysis of global research
- Chun Zhou
- Zhaopin Cai
- Zhigang Jin
Molecular and Cellular Biochemistry (2024)
Impact of nanotechnology on conventional and artificial intelligence-based biosensing strategies for the detection of viruses
- Murugan Ramalingam
- Abinaya Jaisankar
- Giovanna Marrazza
Discover Nano (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.