Development of a portable on-site applicable metagenomic data generation workflow for enhanced pathogen and antimicrobial resistance surveillance

Bloemen, Bram; Gand, Mathieu; Vanneste, Kevin; Marchal, Kathleen; Roosens, Nancy H. C.; De Keersmaecker, Sigrid C. J.

doi:10.1038/s41598-023-46771-z

Download PDF

Article
Open access
Published: 11 November 2023

Development of a portable on-site applicable metagenomic data generation workflow for enhanced pathogen and antimicrobial resistance surveillance

Bram Bloemen^1,2,
Mathieu Gand¹,
Kevin Vanneste¹,
Kathleen Marchal^2,3,
Nancy H. C. Roosens¹ &
…
Sigrid C. J. De Keersmaecker¹

Scientific Reports volume 13, Article number: 19656 (2023) Cite this article

2599 Accesses
1 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Rapid, accurate and comprehensive diagnostics are essential for outbreak prevention and pathogen surveillance. Real-time, on-site metagenomics on miniaturized devices, such as Oxford Nanopore Technologies MinION sequencing, could provide a promising approach. However, current sample preparation protocols often require substantial equipment and dedicated laboratories, limiting their use. In this study, we developed a rapid on-site applicable DNA extraction and library preparation approach for nanopore sequencing, using portable devices. The optimized method consists of a portable mechanical lysis approach followed by magnetic bead-based DNA purification and automated sequencing library preparation, and resulted in a throughput comparable to a current optimal, laboratory-based protocol using enzymatic digestion to lyse cells. By using spike-in reference communities, we compared the on-site method with other workflows, and demonstrated reliable taxonomic profiling, despite method-specific biases. We also demonstrated the added value of long-read sequencing by recovering reads containing full-length antimicrobial resistance genes, and attributing them to a host species based on the additional genomic information they contain. Our method may provide a rapid, widely-applicable approach for microbial detection and surveillance in a variety of on-site settings.

Rapid MinION profiling of preterm microbiota and antimicrobial-resistant pathogens

Article Open access 16 December 2019

A common protocol for the simultaneous processing of multiple clinically relevant bacterial species for whole genome sequencing

Article Open access 08 January 2021

Nanopore metagenomics enables rapid clinical diagnosis of bacterial lower respiratory infection

Article 24 June 2019

Introduction

Rapid, accurate and comprehensive diagnostics are key to identify outbreaks, and allow for improved pathogen surveillance to prevent further spread. Current pathogen detection methods have limitations that can restrict their use to dedicated laboratories with trained personnel. Culture-based approaches can be time-consuming and depend on the ability to cultivate the microbial species in question, while targeted molecular assays frequently require specialized equipment and prior knowledge. On the other hand, current point-of-care tests often only detect a limited range of predetermined pathogens¹. Furthermore, these methods provide limited additional information, or can fail to identify a pathogen altogether. In recent years, high-throughput next-generation sequencing technologies (NGS) have created the possibility to comprehensively analyze samples without culturing or prior knowledge, in an approach termed metagenomic sequencing (mNGS)^2,3. However, current short-read NGS technologies require specialized laboratories, skilled personnel, and high capital investments, while not providing information in real-time, impeding it to become a point-of-care test. Additionally, short reads provide a fractured view on the DNA content, limiting subsequent taxonomic or functional analyses⁴. Recently, the development of portable, real-time, long-read sequencing by Oxford Nanopore Technologies (ONT) has paved the way for rapid and comprehensive on-site microbial diagnostics and surveillance.

Several prior studies have demonstrated the added value of portable nanopore sequencing for diagnostic purposes. For example, Marin et al. recently published an on-farm 16S rDNA amplicon sequencing method using the portable Bento Bio Pro for DNA extraction and library preparation in combination with ONT MinION sequencing. The method was able to reliably detect Campylobacter in caecal samples from infected chickens in less than five hours, compared to more than 72 h required with conventional methods⁵. Additionally, Marcolungo et al. developed an amplicon sequencing approach to identify quarantine plant pathogens with high sensitivity⁶. Others have used in-situ nanopore amplicon sequencing for biodiversity assessment in a variety of remote settings^7,8,9. Although these amplicon-based approaches have proven successful in terms of sensitivity and swiftness, they are targeted methods which deliver limited additional information, and in the previously mentioned examples were used to identify a limited set of pathogens or target genes.

In contrast, shotgun mNGS allows for a more comprehensive analysis by sampling all genetic material instead of a limited range of marker genes. Furthermore, it can provide additional functional insights by detecting (full length) virulence or antimicrobial resistance genes (ARG) and attributing them to a microbial species, while increasing taxonomic resolution^4,10,11,12. In clinical settings, mNGS has already demonstrated added value by increased pathogen identification in lower respiratory tract, urinary and bloodstream infections when compared to culture-based or conventional tests^{13,14,15,16,17}. Other examples of clinical mNGS applications include the earliest published SARS-CoV-2 genomes by Zhu et al., or rapid detection of bacterial pathogens and their antimicrobial resistance by Serpa et al.^18,19. Apart from clinical diagnostics, mNGS has proven to be a capable tool to identify, characterize and trace the source of pathogens in agricultural, environmental and food safety contexts^20,21,22,23. Applying mNGS on-site could be invaluable to further decrease sample-to-detection times, to increase the comprehensiveness of microbial detection methods, and to allow for more wide-spread use. Currently, such on-site mNGS experiments have mainly been performed in various remote locations using a combination of portable laboratory equipment and the ONT MinION sequencing device^24,25,26,27. In these cases, sample transportation and preservation were not trivial, or were shown to impact taxonomic composition, highlighting the benefits of in-situ sample processing²⁷. However, no controls or standardized samples were used in these studies, hindering evaluation of their performance and comparison with other methods²⁸.

Even though nanopore sequencing itself can be performed on miniaturized devices, most current DNA extraction, library preparation, and data analysis methods still require non-portable devices and stable internet and electrical grid access, limiting potential use-cases. In recent years, several devices and methods have been developed to enable on-site metagenomic data generation. Examples include the Bento Bio⁵, SuperFastPrep-2²⁵ and the Claremont Bio OmnilyseX and DNAexpress kits^29,30,31. Additionally, the laptop-powered ONT VolTRAX allows for automated library preparation on a portable device. In this study we focused on the wet-lab aspect, and developed a method for portable on-site applicable mNGS by combining the Claremont Bio OmnilyseX, Bento Bio Pro and VolTRAX devices. When combined with potential future on-site bioinformatic analysis, this would provide a new tool for rapid in-situ diagnostics and surveillance. To optimize the data generation method, we used chicken fecal samples as a test-case, as the poultry gut microbiome has been demonstrated to contain a diverse repertoire of ARGs³². Additionally, chicken feces are known to be inhibitory towards molecular applications, making it a challenging proof-of-concept sample^33,34. Starting from a previously published protocol³¹, we developed gradual improvements to arrive to a final wet-lab protocol that generated high quality nanopore mNGS data and can be used outside of laboratories. Using spike-in defined mock communities (DMC), we compared taxonomic classification and resistome profiling performance to other sequencing workflows, including a laboratory-based protocol.

Results

Optimization of a rapid metagenomic DNA extraction method, applicable on-site

To reduce equipment requirements and sample processing times, the Claremont Bio DNAexpress kit was chosen to develop an on-site DNA extraction workflow³¹. The method consists of homogenizing and lysing the sample in a battery-powered Omnilyse X bead-beating tube (B) containing beads and lysis buffer, followed by the DNAexpress (D) column-based DNA purification protocol (Fig. 1). Various modifications were made to the protocol (abbreviated to BD, Table 1), and statistically compared to each other (supplementary Table S1). The protocol was initially followed as per manufacturer’s instructions, with varying bead-beating durations. However, BD yielded DNA of poor purity, with 260 nm/230 nm absorbance ratios (A260/230) being around 0.80 (Table 1), and no significant differences between bead-beating durations (p > 0.05, supplementary Table S1). Next, additional cleanup with AMPure XP beads was tested to improve DNA purity. Three strategies were assessed: a single round of bead cleanup with a bead/sample ratio of either (i) 0.4 or (ii) 0.8, or (iii) two rounds of cleanup with a bead/sample ratio of 0.4. All cleanup methods significantly improved the A260/230 (ranging from 1.47 to 1.73) and A260/280 (~ 1.90) ratios compared to no additional cleaning (p < 0.05, supplementary Table S1). However, the A260/230 remained substantially lower than 2, so the DNA extracts were considered impure³⁵. Additionally, AMPure cleanup resulted in significant loss of DNA in all cases (Table 1 and supplementary Table S1). Interestingly, the A260/230 ratio did not significantly increase with two rounds of cleanup (supplementary Table S1), indicating that method BD might insufficiently remove impurities that also bind to the AMPure XP beads. To address this, we developed method BQ by keeping the battery-powered Omnilyse X bead-beating (B) step but replacing the DNAexpress column purification process with the zymo Quick-DNA HMW magbead (Q) kit (Fig. 1), and found that this increased the A260/230 ratio to 1.91, and improved DNA yield by approximately 500 ng (Table 1). Finally, we assessed if reducing battery voltage in the bead-beating step could improve DNA integrity. Lowering the voltage from 6 to 1.5 V doubled average fragment lengths, increasing them from around 14 to 28 kbp, although this approximately halved DNA yield (Table 1). The final version of protocol BQ consisted of bead-beating using the Omnilyse X tube with 1.5 V, followed by Q purification and one round of AMPure XP bead purification with a 0.4 bead/sample ratio (Fig. 1). Furthermore, it returned comparable amounts of high purity DNA as the current laboratory-based EQ method (enzymatic (E) lysis followed by method Q purification, Fig. 1), although with shorter fragment sizes (28 kbp instead of 58 kbp)³¹.

Table 1 Summary of all tested methods and the quality metrics of the resulting DNA extracts.

Full size table

Performance assessment of different metagenomic workflows

Spike-in defined mock community to compare sequencing workflows

To assess the performance of the various metagenomic DNA extraction protocols, aliquots of a single pooled chicken fecal sample were spiked with a DMC consisting of the ZymoBIOMICS Gut Microbiome Standard (GMS), combined with the ZymoBIOMICS Spike-In Control I, which contains two species of marine origin that were not suspected to be present in the fecal background (Table 2)^36,37. Additionally, we report the ARG content in the DMC reference genomes as detected by ResFinder, and the lateral coverage (the proportion of the reference genome covered by sequencing reads) of each genome over time generated with the final on-site mNGS workflow, in a DMC-spiked chicken fecal sample (Table 2).

Table 2 Composition and characteristics of the defined mock community (DMC), along with lateral genome coverage throughout the sequencing run of the DMC spiked into a fecal sample and prepared with on-site method.

Full size table

Protocol impact on sequencing throughput and read length

To compare the performance of the on-site DNA extraction protocol to other workflows, we generated six nanopore sequencing libraries from the same fecal sample (Fig. 1, Table 3). Briefly, UEQL and EQL were generated with the laboratory-based DNA extraction method EQ combined with ligation sequencing (L), using an unspiked (U) or spiked fecal sample, respectively. EQR was also generated with method EQ, but with transposase-based rapid sequencing (R) instead of ligation sequencing. Library BDR was generated using on-site extraction method BD followed by the R sequencing kit. Next, BQR was generated with the optimal on-site DNA extraction method BQ performed on the Bento Bio Pro device, with a 1.5 V battery replacing the 6 V battery in BD. Finally, the complete, optimized on-site sequencing workflow (BQV) consisted of 1.5 V bead-beating lysis (B), Q purification performed on the Bento Bio Pro, followed by automated library preparation on the VolTRAX device, using the complete DNA extract in the transposase-based VolTRAX sequencing kit (V). In all cases, additional purification using a ratio of 0.4 volumes of AMPure XP beads to 1 volume of DNA extract was performed on the DNA extracts before continuing with library preparation. The aggregate sequencing output and statistics of the various protocols were compared (Table 3, Fig. 2). In general, the enzymatic lysis libraries (UEQL, EQL and EQR) outperformed the bead-beating libraries (BDR, BQR, BQV) in terms of sequencing throughput (Fig. 2b), with exception of BQV. They also demonstrated higher read length N50 (Fig. 2a). However, bead-beating methods were 1–2.5 h faster to execute (Table 3, Fig. 1). Between the enzymatic lysis libraries, EQL showed a broad distribution of read lengths, with low median but high N50, while EQR is characterized by higher median, but lower N50 (Fig. 2a). Interestingly, a rapid decrease in available pores was observed for on-site method BDR following flow cell loading (supplementary Fig. S1). This rapid drop-off in number of available pores did not occur for the other on-site methods (BQR and BQV), where the obtained DNA was of higher purity, increasing sequencing throughput. The reduced bead-beating intensity in BQR and BQV improved DNA fragment sizes, resulting in higher read lengths and throughput compared to BDR. Finally, the higher amount of input DNA in BQV, along with the additional automated magnetic bead cleanup in the VolTRAX library preparation, further increased read lengths and throughput, resulting in comparable throughput to the laboratory-based library EQR, although read N50 remained lower.

Table 3 Overview of sequencing workflows.

Full size table

Sequencing workflow impacts taxonomic classification

As we used a single pooled chicken fecal sample for all methods, spiked with the same DMC, this allowed us to identify any possible taxonomic bias introduced by the experimental workflow. We mapped the reads generated with each method to a database with the DMC reference genomes, and compared the observed relative abundance (RA) for each DMC species to its theoretical relative abundance (TRA) in the spike-in DMC (supplementary Table S2, Fig. 2). In the unspiked sample UEQL, containing no DMC, the species Bacteroides fragilis, Prevotella corporis, Candida albicans, Enterococcus faecalis and Clostridium perfringens were detected in high or similar amounts compared to the DMC, while most other DMC species were absent or observed in low abundance. Imtechella halotolerans and Allobacillus halotolerans were not observed, or observed at a very low level, respectively (Table S2). For the fecal sample spiked with DMC, Roseburia hominis and Bifidobacterium adolescentis were systematically underrepresented, but the bias was most pronounced for Bifidobacterium adolescentis in EQL and EQR, where the observed RAs were more than 100-fold diminished compared to the expected values (supplementary Table S2). For Lactobacillus fermentum, the RA was mainly diminished in EQR, but also in BDR, BQR and BQV. Method BDR showed reduced RA for DMC species down to 1.5% of TRA, while the species with TRA < 1.5% were detected more than expected. Further investigation of the mapped reads showed effects of the sequencing workflows on the read length distribution per species (supplementary Fig. S2). Briefly, enzymatic methods generated longer reads for almost all species, with exception of L. fermentum, and showed large variations in read length distributions between species. In contrast, bead-beating methods generally resulted in smaller read lengths, with length distributions between species showing less variation. Next, we additionally used a broad taxonomic database to further investigate the detected fecal microbiome background for all workflows. Compared to the unspiked background (UEQL), several background species such as Bifidobacterium gallinarum and members of the Lactobacillaceae family are underrepresented in enzymatic lysis workflows (EQL, EQR). For the bead-beating methods (BDR, BQR, BQV), this is not the case (supplementary Fig. S3).

The on-site workflow enables rapid taxonomic identification

Given a spiked-in taxon with known genome size and abundance, and the time at which each mapped read was sequenced, the sequencing run time required to reach a certain genome coverage can be determined. As such, the lateral coverage for each taxon in the DMC spiked into a fecal background at 1, 12, and 24 h was determined for the best performing on-site workflow, BQV (Table 2). Most DMC species with RA > 5% reached near-complete genome coverage within the first hour of sequencing, with exception of B. adolescentis and L. fermentum, which are both underrepresented in the overall sequencing data (Fig. 2). However, these species had been fully sequenced after 12 h, along with species with abundances between 1 and 5%. For the yeasts C. albicans and S. cerevisiae, 100% genome coverage is never reached. Similarly, the coverage for M. smithii did not reach saturation. Finally, S. enterica was not detected, while E. faecalis and C. perfringens, having RAs > 1000-times that of S. enterica in fecal background UEQL (Table S2), did not reach coverages above 10%. Next, by using a broad taxonomic database, on-site method BQV was found to outperform the other spiked sample runs in terms of number of uniquely identified species (Fig. 3a). Verifying whether this concerned true positive species was impossible due to the unknown fecal background composition. Ultimately, all methods converged to a number of ca. 60 species, except for method BDR which stabilized earlier at a lower level.

Resistome profiling using full-length resistance genes in single reads

To profile the ARG content, we sought to identify reads containing full-length ARGs by mapping them to the ResFinder database, with stringent filtering of 97% identity to the complete ARG template. The optimal on-site method, BQV, quickly picked up many types of ARGs, ultimately detecting a similar or higher number of unique ARGs as most other methods. The highest ARG diversity was found in the unspiked background UEQL (Fig. 3b). Next, we compared the resistance profiles between the different workflows. Among workflows performed on the spiked fecal sample, BQV had the highest ARG read count, followed by methods EQL, BQR, EQR and BDR (Fig. 3c). The enzymatic methods returned high read count proportions of ARGs present in the DMC, and generally lower proportions of ARG classes exclusive to the fecal background (UEQL) (supplementary Fig. S4). In contrast, the on-site workflows using bead-beating returned a combination of background and DMC ARG classes, with exception of method BDR, which returned a low overall number of ARG reads (Fig. 3d, supplementary Fig. S4). In summary, most ARGs detected in the fecal background (UEQL) are consistently detected by other workflows, albeit at different abundances. Compared to the other workflows, a substantially larger part of the ARG reads generated with method BQV contained the erm(B) gene, although it was detected in high RA in all cases.

Genomic context links resistance genes to their hosts

As long reads can provide additional information on the flanking regions of detected ARGs, we attempted to link ARGs to their microbial host species. To do so, we retrieved reads carrying full-length ARGs from their alignment to the broad taxonomic database. Optimal on-site method BQV generated the most of such combinations (Fig. 4, supplementary Figs. S4–S9). Across all experiments, more than half of the full-length ARG reads could not be linked to any host. According to the UEQL background, most of the identified ARG-hosts native to the fecal sample belong to the Lactobacillaceae family, with links to aminoglycoside and MLS-B resistance genes, followed by members of the Mordavella and Bacteroides genera with tetracycline resistance, including Bacteroides fragilis (a DMC member) which also contains beta-lactam and aminoglycoside ARGs. For the spiked samples, the enzymatic lysis workflows EQL and EQR mainly identified ARG-host links from the DMC members (supplementary Figs. S5 and S6) with high RA, with limited connections between ARGs and background species. Specifically, the aminoglycoside and MLS-B ARG carrying Lactobacillaceae from the background were strongly diminished. In contrast, the bead-beating workflows BQR and BQV demonstrated a higher diversity of ARG-host links (supplementary Figs. S8 and S9), primarily detecting background ARG-host combinations as well as several ARG-host combinations from the DMC. Additionally, these methods resulted in a higher number of ARGs being placed onto plasmids compared to the background. The DMC members P. corporis (absent from database), S. enterica and E. faecalis (< 0.01% TRA) were not detected as ARG hosts by any of the workflows. Interestingly, the DMC member R. hominis is reported as containing the ARGs aph(3’)-IIIIa and tet(W) in several of the datasets, while these genes were not detected in the reference genome of the DMC strain as supplied by the manufacturer. However, these ARGs were reported in the R. hominis genome present in the larger database (accession NZ_LR699011.1, supplementary Table S3).

Discussion

The development of real-time, high-throughput sequencing on portable devices has opened up the possibility of rapid, comprehensive and culture-free diagnostics and surveillance using mNGS. However, the DNA extraction and purification protocols preceding mNGS often still require centralized laboratories and high initial investments. Additionally, sample processing, DNA extraction and library preparation have been shown to affect the reliability of the results^27,38. In this study, we have developed an on-site applicable DNA extraction and library preparation method for nanopore sequencing that is comparable or superior to a current laboratory-based protocol^31,39. We also demonstrated the use of a spike-in defined mock community to compare workflows, and found large variations in taxonomic composition and ARG content, highlighting the importance of using appropriate DNA extraction and sequencing methods.

The optimized on-site applicable method (BQV) consists of a portable bead-beating cell lysis approach, combined with portable magnetic bead-based DNA purification. With the inclusion of an additional cleanup and size-selection step, the method consistently delivers high yields of high purity, high molecular weight DNA. In combination with the laptop-powered, portable VolTRAX device, high quality nanopore libraries can be obtained. As such, the final protocol allows for on-site mNGS library generation from complex samples in 3 h or less, by using portable devices for both cell lysis, DNA purification and library preparation (Fig. 1). Furthermore, the resulting library generated a similar sequencing throughput as current laboratory methods based on enzymatic lysis (EQL, EQR), while bypassing time-consuming incubation steps (Fig. 1)^31,39. Initially, the Claremont Bio DNAexpress method (BD, Fig. 1) was considered for DNA extraction because of its rapidness, portability, and prior results showing high DNA yields³¹. However, we observed it returned DNA libraries of poor quality, which was resolved by replacing the DNAexpress purification step with the Quick-DNA HMW MagBead kit. This modification improved purity, thereby reducing flow cell degradation, highlighting the importance of sufficient contaminant removal for mNGS applications^33,40. Additionally, reducing bead-beating voltage increased DNA fragment sizes, but lowered yield, indicating a trade-off between DNA yield and fragment lengths depending on bead-beating intensity. Similarly, enzymatic lysis methods generated higher read lengths than bead-beating methods, probably due to the lower mechanical stresses. We also assessed differences between the adapter ligation (L), and transposase adapter (R) library preparation kits. The differences we found are similar to previous findings by Tvedte et al.⁴¹. However, another study showed no clear effect of library preparation method on resulting read lengths⁴².

Overall, the choice of mNGS workflow substantially impacted the resulting resistome and taxonomic profiles, with our optimal on-site method (BQV) detecting most of the spiked DMC, along with high ARG and taxonomic diversity from the background sample. However, several of the spike-in DMC members were systematically over- or underrepresented, independent of the method used. On one hand, the presence of sample (background) strains that are closely related to spike-in strains likely results in their overrepresentation, as exemplified by some of our spike-in species which can be found in the chicken gut microbiome^38,43,44. This issue can be avoided by using spike-in species foreign to the sample, which we demonstrated by using A. halotolerans and I. halotolerans. In other cases, the use of a limited database with only spike-in genomes could have resulted in false positive mapping of fecal background reads to the DMC genomes, increasing their apparent abundance. Such errors could be a result of the applied read mapping approach, which only returns a single, best mapping template sequence, while a lowest common ancestor approach would be more accurate in ambiguous cases. On the other hand, underrepresentation of spike-in strains can occur either because they are present below the detection limit, or because of challenges in obtaining DNA of sufficient quality and quantity from those strains. As an example of the latter, B. adolescentis was severely depleted when using enzymatic lysis, which has also been observed for Bifidobacterium in previous studies^4,39,45. Additionally, L. fermentum was underrepresented in experiments using transposase-based sequencing methods (R or V), while showing short read lengths upon enzymatic lysis, as previously also found by Martin et al³⁹. The combination of enzymatic lysis with transposase library preparation (EQR) most severely diminished observed L. fermentum abundance, likely due to the aggravating effects of both methods on read lengths⁴⁶. These method-specific biases extended to the fecal background, where Lactobacillaceae family members and Bifidobacterium gallinarum were severely underrepresented after enzymatic lysis. In general, bead-beating methods such as in the final optimal on-site method BQV produced less severe biases and more uniform read lengths across organisms, at the cost of achievable read lengths. These findings indicate trade-offs between lysis efficiency, accurate taxonomic representation, and DNA integrity, likely due to varying cell wall degradation resistance across species^38,47. Similar method-specific batch effects have been demonstrated before, and could explain large discrepancies in taxonomic profiles between studies using different DNA extraction methods^38,43,48,49. Importantly, Lactobacillaceae have been reported to be among the dominant members of the chicken microbiome, and the inability to lyse and sequence them could therefore be problematic^{38,43,49,50,51}. Here, we additionally found method-specific effects on resistome profiles. Using the sequence information provided by long reads on ARG flanking regions, ARG host species could be inferred, allowing to trace back resistome differences to the method-specific taxonomic differences mentioned before. In the case of the chicken fecal samples analyzed here, variation in Lactobacillaceae detection across methods affected ARG profiles. However, a large part of the ARGs could not be attributed to any host organism. This could reflect limits in database contents, such as the absence of certain microbial strains or mobile elements, which are known to frequently carry ARGs^52,53,54. Additionally, it needs to be mentioned that ARG-host connections should be interpreted with caution. First, as ARGs can be highly identical to each other or to wild type genes, we used a 97% identity threshold to the full-length ARG template to reduce misclassification⁵⁴. Second, sequencing errors can reduce both ARG and taxonomic classification performance. Improved nanopore sequencing accuracy (i.e. with the R10.4.1 flow cell) could improve both ARG and taxonomic classification performance^55,56,57. Additionally, (integrative) mobile elements and the choice of database can confound the analysis, as was demonstrated for R. hominis in our spike-in DMC^52,53. Furthermore, ARGs are frequently located in mobile genetic elements, which are often absent from taxonomic databases as they are difficult to attribute to any particular host. To enhance the reliability of ARG-host attribution, future developments should therefore focus on refining taxonomic databases and improving classification algorithms. Adapting these computational methods for applicability in low resource settings might be needed to perform a complete sample to result workflow in situ.

In summary, we developed and optimized a rapid DNA-extraction and sequencing workflow (BQV), applicable for on-site nanopore mNGS in the context of pathogen diagnostics and surveillance, yielding DNA libraries of comparable quality to current laboratory-based protocols. Using a defined mock community spiked into fecal samples, we further assessed the quality of this method in terms of taxonomic and resistome profiling. By demonstrating rapid detection of both spike-in species and background diversity, we illustrate how detection times could be further reduced. Although we focused on chicken fecal samples here, we believe our method could also be applied to other complex sample types. More dilute samples could require an additional concentrating step, or the use of PCR-based library preparation. Overall, bead-beating was found to be the fastest lysis approach, while also giving a comprehensive view on sample taxonomic and resistome content. However, bead-beating intensity and duration should be optimized depending on the sample type being investigated, to find appropriate parameters that ensure effective cell lysis and DNA extraction, while maintaining long fragment sizes. If short-read mNGS would be used, the latter is less critical, although this would not provide results on-site and in real-time^33,38. Additionally, spike-in controls can allow for direct comparison across methods, but should contain microbial species foreign to the sample, and with varying cell wall composition. Finally, we demonstrate the added value of long-read metagenomic sequencing in identifying full-length ARGs and leveraging the additional sequence information to attribute them to a host. This paper focused on a workflow to generate real-time sequencing data by nanopore sequencing, applicable on-site. To analyze the data, a stable internet connectivity or connection to the electrical grid were still used. To obtain a full on-site applicable mNGS workflow covering data generation and analysis, complementary studies could focus on developing real-time computational pipelines, including adapted/downscaled databases, that can be performed on laptops or other portable devices.

Methods

Sample collection and spiking

Chicken fecal samples were collected and processed as follows: one spoonful of fecal material (≈ 1 g) was collected and stored in a DNA/RNA Shield™ Fecal Collection Tube R1101 containing 9 ml of DNA/RNA-shield (Zymo Research, Irvine, CA, USA), according to the manufacturer’s instructions. The sample was mixed well by vortexing extensively, and distributed in 1 ml aliquots. These were then centrifuged for 2 min at 5000g, after which the supernatant was stored separately. For comparison of metagenomic workflows, aliquots of 100 mg each were made from a single pooled fecal sample, and were recombined with 100 µL of the pooled supernatant from the previous step. Several of the aliquots were spiked with the spike-in DMC, consisting of 75 µl of ZymoBIOMICS Gut Microbiome Standard (D6331), along with 7.8 µl Zymo Spike-in control I (D6320) (Zymo Research, Irvine, CA, USA) (Table 2).

Development of on-site DNA extraction protocol

The Claremont Biosolutions DNAexpress method (method BD) was performed according to manufacturer’s instructions with the following adaptations: to the Omnilyse X™ bead-beating tube (Claremont Biosolutions, Upland, CA, USA), 2X Buffer Reagent 1 (Claremont Biosolutions, Upland, CA, USA), 20 µL of Proteinase K (20 mg/mL), 2.5 µL of 1 M DTT was added along with the sample. Bead-beating was performed for varying durations using the provided 6 V or 1.5 V batteries (Table 1). During the bead-beating, the tube was inverted several times for better homogenization. After lysis, the Proteinase K digestion was incubated for 30 min at room temperature. DNA was then purified with either the DNAexpress column according to manufacturer’s instructions (method BD), or with the Quick-DNA HMW magbead kit (Zymo Research, Irvine, CA, USA), with several modifications (method BQ): the lysed and proteinase-K digested sample was centrifuged at 6000g for 2 min, and the supernatant was transferred to a 1.5 mL tube. Thirty-three µL of MagBeads and 800 µL of Magbinding buffer (Zymo Research, Irvine, CA, USA) were added, after which the sample was put on a Hula mixer for 20 min. After the final washing step, the beads were dried at 55 °C for 7 min. Finally, the DNA was eluted at 55 °C for 10 min in a dry-bath. For some experiments, additional cleanup of the eluted DNA was performed using Agencourt AMPure XP beads (Beckman Coulter, Indianapolis, IN, USA) according to the following protocol: beads were added to the samples in varying ratios (Table 1), followed by 5 min incubation on a Hula mixer. The beads where then washed twice with freshly prepared 80% ethanol (Merck Millipore, Darmstadt, Germany). Finally, the beads were resuspended in 15 µL nuclease-free water (Thermo Fisher Scientific, Waltham, MA, USA), and incubated for 5 min at 55 °C to elute the DNA.

Statistical tests to compare DNA extraction protocols

Statistical comparisons of the DNA extraction protocols were performed in R (v4.2.2)⁵⁸, using the rstatix package (v0.7.1). Normality and equality of variance assumptions were tested using the Shapiro–Wilk and Bartlett’s tests, respectively. When these assumptions were met, ANOVA was used, followed by Tukey’s HSD test upon significance. In other cases, the Kruskal–Wallis test was used, followed by Dunn’s post-hoc test with Benjamini–Hochberg correction for multiple testing after a significant Kruskal–Wallis test.

Laboratory-based protocol

The laboratory-based extraction method consisted of enzymatic lysis and Quick-DNA HMW Magbead purification (EQ)³¹. Enzymatic lysis was performed according to the microbial lysis method in the Quick-DNA HMW MagBead Kit protocol (Zymo Research, Irvine, CA, USA) with the following modifications: Cell wall digestion was performed using 100 µL of Tris–HCl buffer and 20 µL of Metapolyzyme lytic enzyme mixture (Sigma-Aldrich, Saint Louis, MO, USA), with incubation at 37 °C for 1 h. After the lysis step, 20 µL of 10% SDS and 10 µL of Proteinase K were added, followed by incubation at 55 °C for 30 min. Magnetic bead purification was done with the Quick-DNA HMW magbead kit as described for method BQ. Further cleanup was done with Agencourt AMPure XP beads (Beckman Coulter, Indianapolis, IN, USA), carried out as described above, with bead to sample ratio of 0.4.

DNA quality and quantity

DNA purity, quantity, integrity and fragment lengths were measured as follows: purity was assessed by measuring the A260/280 and A260/230 ratios with the Nanodrop® 2000 (Thermo Fisher Scientific, Waltham, MA, USA) spectrophotometer. The quantity was measured using the Qubit™ dsDNA BR Assay kit on a Qubit™ 4 Fluorometer (Invitrogen by Thermo Fisher Scientific, Waltham, MA, USA). DNA integrity and fragment lengths were determined by capillary gel electrophoresis on the Tapestation 4200, using the genomic DNA screentapes and reagents (Agilent Technologies, Palo Alto, CA).

Nanopore sequencing

Various libraries were made from the spiked feces (Table 3). First, the laboratory-based DNA extraction protocol EQ was performed as described above, using a 0.4 ratio of AMPure XP beads (Beckman Coulter, Indianapolis, IN, USA) to sample in the final purification step. The obtained DNA extract was used in combination with the ONT ligation sequencing kit (SQK-LSK109, Oxford Nanopore Technologies, Oxford, UK) to make the EQL sequencing library, or with the rapid sequencing kit (SQK-RAD004, Oxford Nanopore Technologies, Oxford, UK) to generate the EQR library. Another library, UEQL, was generated from an unspiked fecal aliquot extracted with method EQ, purified using a 0.4 bead to sample ratio of AMPure XP beads (Beckman Coulter, Indianapolis, IN, USA) and was further prepared using the ligation sequencing kit (SQK-LSK109, Oxford Nanopore Technologies, Oxford, UK). Regarding the on-site DNA extraction methods, method BD was purified using a 0.4 bead to sample ratio of AMPure XP beads (Beckman Coulter, Indianapolis, IN, USA), followed by the rapid sequencing kit (SQK-RAD004, Oxford Nanopore Technologies, Oxford, UK) to generate library BDR. Method BQ was used to generate libraries BQR and BQV, by purifying the extracts using a 0.4 bead to sample ratio of AMPure XP beads (Beckman Coulter, Indianapolis, IN, USA), followed by either the rapid (SQK-RAD004) or VolTRAX (VSK-VSK004, Oxford Nanopore Technologies, Oxford, UK) sequencing kits, respectively. Additionally, bead-beating in method BQ was performed using the Omnilyse X tube (Claremont Biosolutions, Upland, CA, USA) powered by a 1.5 V battery, instead of the 6 V battery used for BD. All sequencing kits were used according to manufacturer’s instructions, with an amount of input DNA corresponding to kit requirements or the maximum amount of available DNA. In case of the Voltrax sequencing kit—all extracted DNA was used. During the ligation sequencing kit (SQK-LSK109, Oxford Nanopore Technologies, Oxford, UK), the Short Fragment Buffer was used during the adapter ligation and cleanup step. All heating and centrifugation steps in BQ were performed on the Bento Bio Pro (Bento Bioworks, London, UK), which is a portable device that includes a thermocycler and microcentrifuge. The BQV library was prepared automatically by the VolTRAX V2b device (Oxford Nanopore Technologies, Oxford, UK) by loading all obtained DNA along with the required VolTRAX sequencing kit reagents (VSK-VSK004, Oxford Nanopore Technologies, Oxford, UK), after which the device automatically carries out the required mixing, thermocycling and magnetic bead cleanup. Finally, all libraries were sequenced on a single R9.4.1 (FLO-MIN106, Oxford Nanopore Technologies, Oxford, UK) flow cell per library, for 72 h on a GridION Mk1 device (Oxford Nanopore Technologies, Oxford, UK).

Nanopore basecalling and QC

Guppy v5.0.7 was used for basecalling raw nanopore data, using the super accuracy (sup) basecalling model dna_r9.4.1_450bps_sup.cfg. The custom script GetFastqStats.py was then used to summarize read statistics per experiment. For further processing, reads were filtered using Nanofilt v2.8⁵⁹, removing reads with Q-score < 7 and length < 300.

Taxonomic analysis

KMA v1.4.4 was used to construct an indexed database from the DMC with the GMS and Spike-in control I reference genomes (provided by the manufacturer: https://zymoresearch.eu/collections/zymobiomics-microbial-community-standards) using the kma index command⁶⁰. Another broad in-house database was made with the same command, and contained all NCBI RefSeq genome entries with the “complete genome” assembly level (database accessed February 11, 2021⁶¹), and accession prefixes NC, NW, AC, and NZ of the following taxonomic groups: archaea, bacteria, fungi, protozoa, and viruses. Filtered reads were then mapped to these databases with the following options: -mem_mode -bc 0.7 -bcNano -ID 0.0 -ef -proxi 0.9 -na -nc -nf -1t1 -ca -sam. The res and mapstat summary files produced by KMA were aggregated and processed with the KMA_taxa_summary.py python script (v3.10.5) for taxonomic alignments. This script uses the Template_parsing.py script to parse template sequence names and groups the results on the species level. Template and query coverages and identity were aggregated by taking the means of these statistics across the different species templates, weighted by the bases mapped to those templates. Template sequences labelled as phage, virus or plasmid were not grouped together with their host species. Observed relative abundances for the DMC species were then calculated by dividing the amount of mapped bases to one species by the total mapped bases to the complete DMC database. The sam output generated by KMA were sorted and converted to bam format using samtools v1.9, and further processed with custom python scripts using the pysam module (v0.19.1): GetCoverageByTime.py, to summarize classification statistics over time and Readlevel_align_stats.py to calculate species-level sequencing statistics.

Resistome profiling

To identify the ARG composition of the DMC, the spike-in DMC reference genomes were mapped to the ResFinder database with KMA, using the same options as for taxonomic mapping, excluding the -1t1 option^60,62. Reads from the various workflows were mapped with the same method. The custom python script GetAMRlinksByTime.py was used to retrieve reads containing ARGs, requiring 97% identity to the full-length ResFinder database template. Additionally, this script retrieved the same reads from the alignment against the taxonomic databases, generating tabular outputs summarized by combination of ARG and taxonomic templates. To consider a reported ARG-host combination as likely true, a total of 50 kbp of matching bases between ARG reads and a taxonomic template was set as a cutoff value.

Data visualization

Summarized data were visualized in R (v4.2.2) with the ggplot2, ggpubr and circlize libraries^58,63,64,65. The custom script Plot_SeqRunStats.R was used to generate figures on sequencing summary statistics and taxonomic heatmaps, Species_ARG_ByTime.R generated figures on species and ARG detection over time, as well as ARG content. The custom script AMRlinkPlots generated the chord diagrams on ARG-host combinations.

Data availability

The sequencing data supporting the conclusions of this article is available in the NCBI Sequence Read Archive (SRA) repository, under the BioProject ID: PRJNA1011201.

Code availability

The custom scripts used to perform the analyses carried out in this study are available in the repository: https://github.com/brambloemen/FARMED_onsite.

References

Dincer, C., Bruch, R., Kling, A., Dittrich, P. S. & Urban, G. A. Multiplexed point-of-care testing—xPOCT. Trends Biotechnol. 35, 728–742 (2017).
Article CAS PubMed PubMed Central Google Scholar
Govender, K. N., Street, T. L., Sanderson, N. D. & Eyre, D. W. Metagenomic sequencing as a pathogen-agnostic clinical diagnostic tool for infectious diseases: A systematic review and meta-analysis of diagnostic test accuracy studies. J. Clin. Microbiol. 59, e02916-e2920 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ko, K. K. K., Chng, K. R. & Nagarajan, N. Metagenomics-enabled microbial surveillance. Nat. Microbiol. 7, 486–496 (2022).
Article CAS PubMed Google Scholar
Gehrig, J. L. et al. Finding the right fit: Evaluation of short-read and long-read sequencing approaches to maximize the utility of clinical microbiome data. Microb. Genom. 8, 000794 (2022).
PubMed PubMed Central Google Scholar
Marin, C. et al. Rapid oxford nanopore technologies MinION sequencing workflow for Campylobacter jejuni identification in broilers on site—a proof-of-concept study. Animals 12, 2065 (2022).
Article PubMed PubMed Central Google Scholar
Marcolungo, L. et al. Real-time on-site diagnosis of quarantine pathogens in plant tissues by nanopore-based sequencing. Pathogens 11, 199 (2022).
Article CAS PubMed PubMed Central Google Scholar
Chang, J. J. M., Ip, Y. C. A., Ng, C. S. L. & Huang, D. Takeaways from mobile DNA barcoding with BentoLab and MinION. Genes 11, 1121 (2020).
Article CAS PubMed PubMed Central Google Scholar
Carradec, Q. et al. A framework for in situ molecular characterization of coral holobionts using nanopore sequencing. Sci. Rep. 10, 15893 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Pomerantz, A. et al. Real-time DNA barcoding in a rainforest using nanopore sequencing: Opportunities for rapid biodiversity assessments and local capacity building. GigaScience 7, giy033 (2018).
Hillmann, B. et al. Evaluating the information content of shallow shotgun metagenomics. mSystems 3, e00069–18 (2018).
Durazzi, F. et al. Comparison between 16S rRNA and shotgun sequencing data for the taxonomic characterization of the gut microbiota. Sci. Rep. 11, 3030 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Campanaro, S., Treu, L., Kougias, P. G., Zhu, X. & Angelidaki, I. Taxonomy of anaerobic digestion microbiome reveals biases associated with the applied high throughput sequencing strategies. Sci. Rep. 8, 1926 (2018).
Article ADS PubMed PubMed Central Google Scholar
Edgeworth, J. D. Respiratory metagenomics: Route to routine service. Curr. Opin. Infect. Dis. 36, 115–123 (2023).
Article PubMed PubMed Central Google Scholar
Liu, M. et al. Detection of pathogens and antimicrobial resistance genes directly from urine samples in patients suspected of urinary tract infection by metagenomics nanopore sequencing: A large-scale multi-centre study. Clin. Transl. Med. 13, e824 (2023).
Article CAS PubMed PubMed Central Google Scholar
Zhou, Y., Shi, W., Wen, Y., Mao, E. & Ni, T. Comparison of pathogen detection consistency between metagenomic next-generation sequencing and blood culture in patients with suspected bloodstream infection. Sci. Rep. 13, 9460 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Liu, J., Zhang, Q., Dong, Y.-Q., Yin, J. & Qiu, Y.-Q. Diagnostic accuracy of metagenomic next-generation sequencing in diagnosing infectious diseases: A meta-analysis. Sci. Rep. 12, 21032 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Tsitsiklis, A. et al. Lower respiratory tract infections in children requiring mechanical ventilation: A multicentre prospective surveillance study incorporating airway metagenomics. Lancet Microbe 3, e284–e293 (2022).
Article PubMed PubMed Central Google Scholar
Zhu, N. et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 382, 727–733 (2020).
Article CAS PubMed PubMed Central Google Scholar
Serpa, P. H. et al. Metagenomic prediction of antimicrobial resistance in critically ill patients with lower respiratory tract infections. Genome Med. 14, 74 (2022).
Article CAS PubMed PubMed Central Google Scholar
Freeman, C. N. et al. Evaluating the potential of third generation metagenomic sequencing for the detection of BRD pathogens and genetic determinants of antimicrobial resistance in chronically ill feedlot cattle. BMC Vet. Res. 18, 211 (2022).
Article CAS PubMed PubMed Central Google Scholar
Johnson, M. A. et al. Investigating plant disease outbreaks with long-read metagenomics: sensitive detection and highly resolved phylogenetic reconstruction applied to Xylella fastidiosa. Microb. Genomics 8, 000822 (2022).
Article CAS Google Scholar
Talat, A., Blake, K. S., Dantas, G. & Khan, A. U. Metagenomic insight into microbiome and antibiotic resistance genes of high clinical concern in urban and rural hospital wastewater of Northern India Origin: A major reservoir of antimicrobial resistance. Microbiol. Spectrum 11, e04102-e4122 (2023).
Article Google Scholar
Buytaers, F. E. et al. Towards real-time and affordable strain-level metagenomics-based foodborne outbreak investigations using Oxford nanopore sequencing technologies. Front. Microbiol. 12, 3372 (2021).
Article Google Scholar
Gowers, G.-Oliver. F. et al. Entirely Off-grid and solar-powered DNA sequencing of microbial communities during an ice cap traverse expedition. Genes (Basel) 10, 902 (2019).
Maggiori, C., Raymond-Bouchard, I., Brennan, L., Touchette, D. & Whyte, L. MinION sequencing from sea ice cryoconites leads to de novo genome reconstruction from metagenomes. Sci. Rep. 11, 21041 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Latorre-Pérez, A. et al. A round trip to the desert: In situ nanopore sequencing informs targeted bioprospecting. Front. Microbiol. 12, 768240 (2021).
Article PubMed PubMed Central Google Scholar
Tennant, R. K. et al. In-situ sequencing reveals the effect of storage on lacustrine sediment microbiome demographics and functionality. Environ. Microbiome 17, 5 (2022).
Article CAS PubMed PubMed Central Google Scholar
Nicholls, S. M., Quick, J. C., Tang, S. & Loman, N. J. Ultra-deep, long-read nanopore sequencing of mock microbial community standards. GigaScience 8, giz043 (2019).
Deshpande, S. V. et al. Offline next generation metagenomics sequence analysis using MinION detection software (MINDS). Genes (Basel) 10, 578 (2019).
Patin, N. V. & Goodwin, K. D. Long-read sequencing improves recovery of picoeukaryotic genomes and zooplankton marker genes from marine Metagenomes. mSystems 7, e00595–22 (2022).
Gand, M., Bloemen, B., Vanneste, K., Roosens, N. H. C. & De Keersmaecker, S. C. J. Comparison of 6 DNA extraction methods for isolation of high yield of high molecular weight DNA suitable for shotgun metagenomics Nanopore sequencing to detect bacteria. BMC Genomics 24, 438 (2023).
Article CAS PubMed PubMed Central Google Scholar
Qiu, T. et al. Metagenomic assembly reveals hosts and mobility of common antibiotic resistome in animal manure and commercial compost. Environ. Microbiome 17, 42 (2022).
Article CAS PubMed PubMed Central Google Scholar
Pankoke, H. et al. Evaluation of commercially available DNA extraction kits for the analysis of the broiler chicken cecal microbiota. FEMS Microbiol. Lett. 368, 33 (2021).
Article Google Scholar
Rudi, K. et al. Direct Real-time PCR quantification of Campylobacter jejuni in chicken fecal and cecal samples by integrated cell concentration and DNA purification. Appl. Environ. Microbiol. 70, 790–797 (2004).
Article ADS CAS PubMed PubMed Central Google Scholar
Sambrook, J., Fritsch, E. F. & Maniatis, T. Molecular cloning: A laboratory manual. (1989).
Surendra, V., Bhawana, P., Suresh, K., Srinivas, T. N. R. & Anil Kumar, P. Imtechella halotolerans gen. nov., sp. nov., a member of the family Flavobacteriaceae isolated from estuarine water. Int. J. Syst. Evol. Microbiol. 62, 2624–2630 (2012).
Sheu, S.-Y., Arun, A. B., Jiang, S.-R., Young, C.-C. & Chen, W.-M. Allobacillus halotolerans gen. nov., sp. nov. isolated from shrimp paste. Int. J. Syst. Evol. Microbiol. 61, 1023–1027 (2011).
Fidler, G. et al. Tendentious effects of automated and manual metagenomic DNA purification protocols on broiler gut microbiome taxonomic profiling. Sci. Rep. 10, 3419 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Martin, S. et al. Nanopore adaptive sampling: A tool for enrichment of low abundance species in metagenomic samples. Genome Biol. 23, 11 (2022).
Article CAS PubMed PubMed Central Google Scholar
Schrader, C., Schielke, A., Ellerbroek, L. & Johne, R. PCR inhibitors—occurrence, properties and removal. J. Appl. Microbiol. 113, 1014–1026 (2012).
Article CAS PubMed Google Scholar
Tvedte, E. S. et al. Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes. G3 Genes Genomes Genet. 11, jkab083 (2021).
Wick, R. R., Judd, L. M., Wyres, K. L. & Holt, K. E. Y. Recovery of small plasmid sequences via Oxford Nanopore sequencing. Microb. Genom. 7, 000631 (2021).
CAS PubMed PubMed Central Google Scholar
Stanley, D., Geier, M. S., Chen, H., Hughes, R. J. & Moore, R. J. Comparison of fecal and cecal microbiotas reveals qualitative similarities but quantitative differences. BMC Microbiol. 15, 51 (2015).
Article PubMed PubMed Central Google Scholar
Robinson, K., Yang, Q., Stewart, S., Whitmore, M. A. & Zhang, G. Biogeography, succession, and origin of the chicken intestinal mycobiome. Microbiome 10, 55 (2022).
Article CAS PubMed PubMed Central Google Scholar
Maukonen, J., Simões, C. & Saarela, M. The currently used commercial DNA-extraction methods give different results of clostridial and actinobacterial populations derived from human fecal samples. FEMS Microbiol. Ecol. 79, 697–708 (2012).
Article CAS PubMed Google Scholar
Soares, L. M. M. et al. DNA read count calibration for single-molecule, long-read sequencing. Sci. Rep. 12, 17257 (2022).
Article ADS PubMed PubMed Central Google Scholar
Moss, E. L., Maghini, D. G. & Bhatt, A. S. Complete, closed bacterial genomes from microbiomes using nanopore sequencing. Nat. Biotechnol. 38, 701–707 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mohd Shaufi, M. A., Sieo, C. C., Chong, C. W., Gan, H. M. & Ho, Y. W. Deciphering chicken gut microbial dynamics based on high-throughput 16S rRNA metagenomics analyses. Gut Pathog. 7, 4 (2015).
Article PubMed PubMed Central Google Scholar
Rothrock, M. J. et al. A microbiomic analysis of a pasture-raised broiler flock elucidates foodborne pathogen ecology along the farm-to-fork continuum. Front. Vet. Sci. 6 (2019).
Feng, Y. et al. Metagenome-assembled genomes and gene catalog from the chicken gut microbiome aid in deciphering antibiotic resistomes. Commun. Biol. 4, 1–9 (2021).
Article ADS Google Scholar
Zhang, Y. et al. Improved microbial genomes and gene catalog of the chicken gut from metagenomic sequencing of high-fidelity long reads. GigaScience 11, giac116 (2022).
Lao, J. et al. ICEscreen: A tool to detect Firmicute ICEs and IMEs, isolated or enclosed in composite structures. NAR Genom. Bioinf. 4, lqac079 (2022).
Crits-Christoph, A., Hallowell, H. A., Koutouvalis, K. & Suez, J. Good microbes, bad genes? The dissemination of antimicrobial resistance in the human microbiome. Gut Microbes 14, 2055944 (2022).
Article PubMed PubMed Central Google Scholar
Gweon, H. S. et al. The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples. Environ. Microbiome 14, 7 (2019).
Article PubMed PubMed Central Google Scholar
Sereika, M. et al. Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing. Nat. Methods 19, 823–826 (2022).
Sanderson, N. D. et al. Comparison of R9.4.1/Kit10 and R10/Kit12 Oxford Nanopore flowcells and chemistries in bacterial genome reconstruction. Microb. Genom. 9, 000910 (2023).
Govender, K. N. & Eyre, D. W. Y. Benchmarking taxonomic classifiers with Illumina and Nanopore sequence data for clinical metagenomic diagnostic applications. Microb. Genom. 8, 000886 (2022).
CAS Google Scholar
R Core Team. R: A language and environment for statistical computing (R Foundation for Statistical Computing).
De Coster, W., D’Hert, S., Schultz, D. T., Cruts, M. & Van Broeckhoven, C. NanoPack: Visualizing and processing long-read sequencing data. Bioinformatics 34, 2666–2669 (2018).
Article PubMed PubMed Central Google Scholar
Clausen, P. T. L. C., Aarestrup, F. M. & Lund, O. Rapid and precise alignment of raw reads against redundant databases with KMA. BMC Bioinf. 19, 307 (2018).
Article Google Scholar
O’Leary, N. A. et al. Reference sequence (RefSeq) database at NCBI: Current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 44, D733–D745 (2016).
Article PubMed Google Scholar
Florensa, A. F., Kaas, R. S., Clausen, P. T. L. C., Aytan-Aktug, D. & Aarestrup, F. M. ResFinder—an open online resource for identification of antimicrobial resistance genes in next-generation sequencing data and prediction of phenotypes from genotypes. Microb. Genom. 8, (2022).
Wickham, H. ggplot2: Elegant graphics for data analysis (Springer-Verlag, 2016).
Book MATH Google Scholar
Kassambara, A. ggpubr: ‘ggplot2’ Based Publication Ready Plots. (2023).
Gu, Z., Gu, L., Eils, R., Schlesner, M. & Brors, B. circlize implements and enhances circular visualization in R. Bioinformatics 30, 2811–2812 (2014).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The research that yielded these results was funded by in‐kind contribution of Sciensano within the context of JRP12‐AMRSH5‐FARMED and by the EU’s Horizon 2020 Research and Innovation program under grant agreement No 773830: One Health European Joint Program. The authors would like to thank Stefan Hoffman for the assistance with the Oxford Nanopore Technologies GridION an VolTRAX devices and the FARMED consortium members for interesting discussions.

Author information

Authors and Affiliations

Transversal Activities in Applied Genomics, Sciensano, Rue Juliette Wytsman 14, 1050, Brussels, Belgium
Bram Bloemen, Mathieu Gand, Kevin Vanneste, Nancy H. C. Roosens & Sigrid C. J. De Keersmaecker
Department of Information Technology, IDLab, Ghent University, IMEC, 9052, Ghent, Belgium
Bram Bloemen & Kathleen Marchal
Department of Plant Biotechnology and Bioinformatics, Ghent University, 9052, Ghent, Belgium
Kathleen Marchal

Authors

Bram Bloemen
View author publications
You can also search for this author in PubMed Google Scholar
Mathieu Gand
View author publications
You can also search for this author in PubMed Google Scholar
Kevin Vanneste
View author publications
You can also search for this author in PubMed Google Scholar
Kathleen Marchal
View author publications
You can also search for this author in PubMed Google Scholar
Nancy H. C. Roosens
View author publications
You can also search for this author in PubMed Google Scholar
Sigrid C. J. De Keersmaecker
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.B, M.G. and S.C.J.D.K. conceived the design and methodology of the study. B.B. conducted the optimization of the DNA extraction protocol and nanopore sequencing, initial data processing, data analysis and data visualization. S.C.J.D.K. supervised the study. B.B., S.C.J.D.K. and M.G. interpreted the results. K.V. contributed to the maintenance of computational infrastructure to allow the bioinformatics analysis. N.H.C.R., K.V. and K.M. provided specialist feedback. B.B. and S.C.J.D.K. wrote and edited the initial manuscript draft. All authors reviewed the manuscript before submission.

Corresponding author

Correspondence to Sigrid C. J. De Keersmaecker.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bloemen, B., Gand, M., Vanneste, K. et al. Development of a portable on-site applicable metagenomic data generation workflow for enhanced pathogen and antimicrobial resistance surveillance. Sci Rep 13, 19656 (2023). https://doi.org/10.1038/s41598-023-46771-z

Download citation

Received: 14 September 2023
Accepted: 04 November 2023
Published: 11 November 2023
DOI: https://doi.org/10.1038/s41598-023-46771-z

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Rapid MinION profiling of preterm microbiota and antimicrobial-resistant pathogens

A common protocol for the simultaneous processing of multiple clinically relevant bacterial species for whole genome sequencing

Nanopore metagenomics enables rapid clinical diagnosis of bacterial lower respiratory infection

Introduction

Results

Optimization of a rapid metagenomic DNA extraction method, applicable on-site

Performance assessment of different metagenomic workflows

Spike-in defined mock community to compare sequencing workflows

Protocol impact on sequencing throughput and read length

Sequencing workflow impacts taxonomic classification

The on-site workflow enables rapid taxonomic identification

Resistome profiling using full-length resistance genes in single reads

Genomic context links resistance genes to their hosts

Discussion

Methods

Sample collection and spiking

Development of on-site DNA extraction protocol

Statistical tests to compare DNA extraction protocols

Laboratory-based protocol

DNA quality and quantity

Nanopore sequencing

Nanopore basecalling and QC

Taxonomic analysis

Resistome profiling

Data visualization

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links