Analytical evaluation of circulating tumor DNA sequencing assays

Li, Wenjin; Huang, Xiayu; Patel, Rajesh; Schleifman, Erica; Fu, Shijing; Shames, David S.; Zhang, Jingyu

doi:10.1038/s41598-024-54361-w

Download PDF

Article
Open access
Published: 29 February 2024

Analytical evaluation of circulating tumor DNA sequencing assays

Wenjin Li¹^na1,
Xiayu Huang¹^na1,
Rajesh Patel²^na1,
Erica Schleifman²,
Shijing Fu¹,
David S. Shames² &
…
Jingyu Zhang¹

Scientific Reports volume 14, Article number: 4973 (2024) Cite this article

854 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

In China, circulating tumor DNA analysis is widely used and numerous assays are available. Systematic evaluation to help users make informed selections is needed. Nine circulating tumor DNA assays, including one benchmark assay, were evaluated using 23 contrived reference samples. There were two sample types (cell-free DNA and plasma samples), three circulating tumor DNA inputs (low, < 20 ng; medium, 20–50 ng; high, > 50 ng), two variant allele frequency ranges (low, 0.1–0.5%; intermediate, 0.5–2.5%), and four variant types (single nucleotide, insertion/deletion, structural, and copy number). Sensitivity, specificity, reproducibility, and all processes from cell-free DNA extraction to bioinformatics analysis were assessed. The test assays were generally comparable or superior to the benchmark assay, demonstrating high analytical sensitivity. Variations in circulating tumor DNA extraction and quantification efficiency, sensitivity, and reproducibility were observed, particularly at lower inputs. These findings will guide circulating tumor DNA assay choice for research and clinical studies, allowing consideration of multiple technical parameters.

Evaluating the analytical validity of circulating tumor DNA sequencing assays for precision oncology

Article 12 April 2021

Comparison of commercially available whole-genome sequencing kits for variant detection in circulating cell-free DNA

Article Open access 10 April 2020

eVIDENCE: a practical variant filtering for low-frequency variants detection in cell-free DNA

Article Open access 22 October 2019

Introduction

Cell-free DNA (cfDNA) is short extracellular DNA fragments that can originate from any cell type located in body fluids^1,2. Apoptosis and necrosis are thought to be the major mechanisms for release of cfDNA from cells into the peripheral bloodstream^1,3,4. In the field of oncology, there is much interest in the detection of tumor-derived cfDNA, known as circulating tumor DNA (ctDNA). ctDNA can be distinguished from non-tumor cfDNA via the detection of novel mutations that are not found in normal cells^5,6; however, there may be challenges related to distinguishing true ctDNA alterations from cfDNA variants related to clonal hematopoiesis⁶. The main benefits of ctDNA diagnostic testing over tissue DNA are the simple requirement of a blood sample rather than a more invasive biopsy, the potential to detect ctDNA shedding from multiple lesions or metastatic sites versus detection only from the biopsied site, and that dynamic changes can be followed over time⁷. Currently, the main application of ctDNA is in precision oncology, where it has the potential to be used in the assessment of prognosis or relapse, detection of minimal residual disease, treatment selection, and monitoring of disease burden and/or resistance^8,9. In addition, companion diagnostic tests using a next-generation sequencing (NGS) platform with liquid biopsy samples have been approved by the FDA for multiple cancer types and biomarkers¹⁰.

A major challenge to the clinical application of ctDNA testing is assay sensitivity as, generally, < 1% of the total cfDNA in a patient is tumor-derived¹¹. Other challenges include consistency with cfDNA/ctDNA extraction and analysis approaches¹². In Stetson et al., four NGS gene panel assays were tested using plasma samples to detect mutations in ctDNA¹³. False-negative and false-positive (FP) variants were identified and there was substantial variability in reporting between the four assays¹³. They found that variants with an allele frequency (AF) < 1% had the most discordance, and technical discordance was responsible for 68% of all unique variants identified by the ctDNA assays. Another study comparing the performance of several NGS assays reported that there was substantial variability in sensitivity and specificity, with assay precision being highly dependent on the quantity of DNA input¹⁴. Accuracy was highest for variants with AFs ≥ 1% and a well-established bioinformatics analysis workflow was found to be important for assay performance¹⁴. In 2021, an evaluation of the analytical performance of five ctDNA assays, which included evaluations at each stage of the ctDNA sequencing workflow using synthetic DNA spike-in experiments and standardized cell line-derived reference samples, found that there was high sensitivity, precision, and reproducibility for the detection of variant alleles that occurred at a frequency of > 0.5%¹⁵. Below that threshold, detection was unreliable and varied widely¹⁵.

We designed and conducted a systematic study to evaluate ctDNA sequencing assays available from nine NGS vendors in China. Assay and vendor names were masked to enable publication of data. Our evaluation included designed reference samples (cfDNA and contrived plasma) to evaluate the technical performance of the platforms for their analytic validity in precision oncology in China. Compared with previously published evaluations, the present study focused on variants with both low (< 0.5%) and intermediate (0.5–2.5%) AFs, as low frequency variants have presented a challenge in previously published studies^15,16. Study design comparisons with recent ctDNA platform evaluation publications were summarized in Table S1. Most previous studies have limited their evaluations to either cfDNA or plasma samples, but not both, which is why we felt it pertinent to do so. This study evaluated large (> 1 Mb) and small (< 1 Mb) NGS panels and focused on four variant types: single nucleotide variants (SNVs), insertion or deletion variants (InDels), structural variants (SVs, also known as fusion), and copy number variants (CNVs). Vendors with a variety of panel designs and sizes with different limits of detection (LOD) were included in this analysis. These assays are utilized in a variety of ctDNA applications for precision oncology, and the data presented herein are expected to be valuable for informing on the trade-offs of each.

Results

Overall study design

The study design is shown in Fig. 1. In total, 23 reference samples comprising two sample types (diluted cfDNA and synthetic plasma) were employed in this study: 20 cfDNA samples and 3 plasma samples. The cfDNA samples included different dilutions (0% WT, 0.1%, 0.5%, 1%, and 2.5%), cfDNA amounts (10 ng, 30 ng and 50 ng), as well as three pairs of replicates (10 ng at 0.1% and 0.5%; 30 ng at 0.1%). The three plasma samples contained 150 ng of cfDNA with VAFs at 0.2% and were tested in triplicate. The same input amount was provided to each vendor at each VAF concentration. The number of samples and the total amount of cfDNA for each sample by manufacturer’s quantification with cfDNA input categorized as low (< 20 ng), medium (20–50 ng), or high (> 50 ng) after quantification are shown in Table S3. These samples (but not WT) contained 45 hotspot alterations in 25 genes, covering 24 SNVs, 9 InDels, 8 SVs, and 4 CNVs. The aim of this design was to evaluate the sensitivity of each assay at different VAF and cfDNA sample inputs for these four classes of alterations. WT samples were included to evaluate FPs for the specificity analysis and replicates were used to test intra-lab reproducibility.

Panel details for the nine assays selected are shown in Fig. 1; one assay was selected from each vendor using the criteria listed above. Information collected on the overlap of targeted genes and each the LOD of each assay (TableS4 ) informed the design of reference materials, including which variants were included and the dilution gradients created to maximize the robustness of the evaluation. Both large (panel size > 1 Mb; assays D, E, and I) and small NGS panels were included. Only assays that had undergone analytical and clinical validations were chosen, to allow the results to be placed in the context of use in clinical practice. Each assay provider applied their internal standard operating procedures from cfDNA extraction and quantification to downstream bioinformatic pipelines with variant call format files. Summary tables were reported and included cfDNA quantification, mean deduplication depth, and on-target rate. Key experimental details and operational parameters were also evaluated, including sequencer, variant caller, and deduplication method (Fig. 1 and Table S4).

cfDNA quantification, sequence depth, and on-target rate

A summary of sample quality control (QC) and sequence QC results for each assay are presented in Table 1. Extraction efficiency and sequencing QC were evaluated as they may impact assay sensitivity. We particularly looked at the potential impact from different ctDNA input and variant allele frequency perspective. A comparison of the ctDNA extraction efficiency and quantification for each assay is shown in the supplementary (Fig. S1A,B, TablesS5 , and S6). There is variation among the reported quantification results of both ctNDA and plasma samples regardless of different DNA input and variant allele frequency. And regarding ctDNA extraction from plasma samples, assay F underestimated the cfDNA quantity for the plasma samples, with a mean extraction efficiency of 16%. Since there are variations for final quantification among these vendors even though they were provided with the same amount of samples, it is very important for vendors to have good extraction efficiency and accurate quantification.

Table 1 Summary of extraction and sequence QC results.

Full size table

A comparison of sequencing depth (deduplicated mean depth comparison) is shown in Fig. S1C and Table S7. Sequencing depth varied widely among the assays. Assays A and G had a much higher overall sequence depth (> 10,000×) than the other assays, while assays D, E, and F had a relatively low sequencing depth (< 5000×), with D and E having a larger panel size. The effect of sample input on sequence depth was evaluated. Sequencing depth varied with different cfDNA inputs within the assay. At an input of > 20 ng of cfDNA, all assays reached their expected depth. All samples reached the minimum coverage requirement for analysis and were therefore included in the evaluation. In addition, assays E and F had less variability in deduplicated mean depth compared with the other assays. As expected, a lower cfDNA input tended to have a lower deduplicated mean depth. Adequate sample input is important to ensure high sequence depth in order to detect variants especially with low mutation frequency.

All assays reached an on-target rate of ≥ 50%, which is considered acceptable by vendors for most assays (Fig. S1D and Table S7). Similar to what was observed for sequencing depth, samples with low input tended to have a lower on-target rate. Assay G had higher variability in the on-target rate compared with the other assays, while assays C, E, and H had the lowest on-target rate heterogeneity.

Overall panel sensitivity and impact factors

The sensitivity of each assay was first evaluated by assessing the sensitivity of SNV detection at the required sample input, which was generally > 20 ng (TableS8). All assays except A and E reached a sensitivity of approximately 0.95 for SNV detection at VAF 0.5% (Fig. 2A). The overall sensitivity including all four variant types at different inputs was calculated using on-target known variants in the relevant panel as a reference (Fig. 2B and Table S9). Assays B, D, and G overall had higher sensitivity than the other six assays (including the benchmark assay I) and performed comparably. For all assays, known on-target variants were detected with higher sensitivity in both intermediate VAF (0.5–2.5%) ctDNA samples and synthetic plasma samples when compared with low VAF (0.1%) ctDNA samples. There was a substantial increase in sensitivity for ctDNA samples from VAF 0.1–0.5% for all assays. The increase in sensitivity from VAF 0.5–2.5% was minimal for assays B, D, and G; however, these assays showed high sensitivity (0.90–0.98) at intermediate VAFs (0.5–2.5%) compared with the benchmark assay (0.55–0.92). For all assays, the sensitivity was drastically reduced at VAF 0.1% and ranged from 0.00 to 0.69. Regarding the plasma samples, at VAF 0.2%, three assays (B, D, and G) showed high sensitivity (0.91–0.98), which was better than the benchmark assay (0.79). Some assays had higher or comparable sensitivity in the plasma samples with 0.2% VAF relative to VAF 0.5% cfDNA samples (B, G, and I).

Sensitivity comparisons according to different cfDNA inputs (< 20 ng, 20–50 ng, and > 50 ng) are shown in Fig. 3, Tables S10–S12). For all cfDNA inputs, the trend for sensitivity was similar. For all assays, the sensitivity was more robust with different cfDNA inputs at higher VAFs. There was a substantial increase in sensitivity for most assays, particularly at VAF 0.1%, when cfDNA input changed from low (Fig. 3A) to medium (Fig. 3B). For all assays, there was minimal improvement in sensitivity when cfDNA input changed from medium (Fig. 3B) to high (Fig. 3C). With the medium cfDNA input of 20–50 ng, five assays had a high sensitivity (0.86–0.98) at VAF 1% and a sensitivity above 0.95 at VAF 2.5%.

For the sensitivity comparison according to panel size, three assays (D, E, and I) with a panel size > 1 Mb and six assays (A, B, C, F, G, and H) with a panel size < 1 Mb were evaluated. Among the three > 1 Mb panels, assay D performed best and had comparable performance with the top < 1 Mb assays (B and G). Assay E had the lowest sequence depth but showed comparable performance with the benchmark assay I for cfDNA samples.

Sensitivity of different variant types

For the cfDNA samples, the sensitivity comparison for on-target SNVs, using on-target known variants in the panel as reference, showed a similar pattern as the overall sensitivity and is shown in Fig. 4A and Table S13. Assays B, D, and G showed overall higher sensitivity than the other six assays; however, the difference was not as notable as that for on-target known variants. Additionally, most of the assays had better or comparable sensitivity versus the benchmark assay at intermediate VAFs. All assays had high sensitivity above 0.9 at VAF 2.5%, and all assays except I had a high sensitivity (0.92–1) at VAF 1%. At VAF 0.5%, six of the assays had a high sensitivity (0.92–1). With VAF 0.1%, sensitivity for all assays decreased notably; however, two of the assays had a sensitivity above 0.75 (B and G). Using panel variants as reference, sensitivity comparisons for on-target InDel variants are shown in Fig. 4B. Assays B, D and G had superior performance compared with the other assays; sensitivity increased substantially from VAF 0.1–0.5%. There was little increase in sensitivity for most assays from VAF 0.5–2.5%. All assays except C, E, and F had a sensitivity > 0.85 at VAF 2.5%. Five assays had better sensitivity than the benchmark with VAF 2.5%. Assays A, B, D, and G had relatively higher sensitivity at VAF 1% (0.86–1) and VAF 0.5% (0.82–1). For all assays, the sensitivity was drastically reduced at VAF 0.1%; only assay B had a sensitivity > 0.75. Sensitivity comparisons for on-target CNVs when using panel variants as reference are shown in Fig. 4C. For cfDNA, all assays except the benchmark assay performed poorly in detecting CNVs, even at VAF 2.5%. At VAF 2.5%, the benchmark assay was the only one with all four CNVs detected; assays D and G detected three (at both VAF 2.5% and 1%). For the plasma samples, the benchmark assay was also the only one with all four CNVs detected at VAF 0.2%.

Sensitivity comparisons for on-target SVs when using panel variants as a reference were also evaluated among the different assays (Fig. 4D). Five assays (B, D, E, G, and H) had better overall performance than the other assays. Sensitivity increased substantially from VAF 0.1–0.5% for all assays and a minimal increase was observed for assays B, D, E, G, and H from VAF 0.5–2.5%. Most assays had better sensitivity than the benchmark assay at different dilutions. At VAF 2.5%, all assays except C and F had a sensitivity > 0.95; at VAF 1%, five assays (A, B, D, E, and G) had high sensitivity (0.92–1); and at VAF 0.5%, four assays (B, D, E, and G) had high sensitivity (0.88–1). Only one assay (G) had a sensitivity > 0.6 at VAF 0.1%. For the plasma samples, there were three assays with a high sensitivity (0.92–1) at VAF 0.2%; this was higher than the benchmark (0.29). Sensitivity results for on-target SNV, InDel, CNV, and SV when using the list of detected variants at VAF 2.5% as a reference were similar to those obtained when using panel variants as the reference (Fig. S2 and Fig. S3A–D, respectively).

Specificity assessment

A FP was defined as the number of the designed variants detected in WT cfDNA samples. Specificity analysis showed that an FP was only detected in assay F (TP53p.R175H); these results are summarized in Table 1.

Reproducibility evaluation

The design of the study to evaluate panel reproducibility included three cfDNA samples (sample 1, sample 2, and sample 3), which were tested in duplicate, and one plasma sample, which was tested in triplicate. For all assays, reproducibility was affected by both the quantity of cfDNA input and VAF (Fig. 5). All assays had high reproducibility with sample 1 (cfDNA < 20 ng, VAF 0.5%) with six showing a reproducibility > 0.85. For sample 2 (cfDNA 20–50 ng, VAF 0.1%), assays B, C, D, and G had higher reproducibility (0.67–0.79) relative to the other assays (0.21–0.42). All assays had poor reproducibility with sample 3 (cfDNA < 20 ng, VAF 0.1%), ranging from 0.12 to 0.56. Although the cfDNA input was the same for samples 1 and 3, the higher VAF in sample 1 greatly improved the reproducibility compared with sample 3. For the plasma samples, assays B, D, and G had the highest reproducibility (0.88–1).

VAF accuracy

The correlation coefficient between the expected and observed VAF is plotted in Fig. 6 and Table S14. For most assays, the correlation coefficient was high (> 0.80) at different ctDNA inputs; assays D and I were exceptions as the correlation coefficient improved substantially with medium to high cfDNA input. Assay B tended to have the highest correlation coefficient of the assays at each ctDNA input. With a low cfDNA input, assays A, B, and H had higher correlation coefficients than the others, while at an input of > 20 ng, assays B, D, and H had higher correlation coefficients than the others. Overall, the detection accuracy decreased with lower sample input.

Discussion

ctDNA assays are making a significant impact in drug development and patient care. Given the many emerging use cases in the clinical setting, covering a variety of applications, it is critical to consider technical and analytical characteristics when choosing an appropriate ctDNA sequencing assay. Along with the considerable global interest in developing ctDNA technologies, ctDNA sequencing assays are rapidly evolving in China. However, a systematic understanding and evaluation of the overall readiness of these new assays, compared with commercially available and globally accessible assays, is lacking. To this end, this study provides the first large-scale and wide-range assessment of nine ctDNA assays available in China, using customized SeraCare contrived samples.

To maximize the effectiveness of this evaluation using reference samples, the present study selected panels with the largest overlap with the variants contained in the reference samples. Additionally, panels with better validation for applications in real-world clinical practice were preferentially selected. The present evaluation enabled the assessment of sensitivity, specificity, reproducibility, and accuracy through the design and use of different variant types, a range of VAFs (0.1% [low] to 2.5% [medium]), different cfDNA inputs (< 20 ng, 20–50 ng, and > 50 ng), as well as replicate samples and different sample types. Using designed reference samples, the present study showed that ctDNA sequencing assays from leading NGS vendors in China were generally comparable to or better than the benchmark assay, demonstrating a high potential for clinical utility. Performance differences may have implications for assay utility at different testing points along the patient journey. The results of the present study provide a comprehensive overview of the NGS-based ctDNA assay landscape in China, which is expected to help inform clinicians and researchers on the selection and application of ctDNA technology in precision oncology.

Multiple studies have shown that low-frequency mutations commonly occur in early-stage cancer^17,18 and in post-surgical minimal residual disease with VAF < 0.1%^18,19. In addition, we observed that around 25% of patients had VAF < 0.5% for specific hotspot SNVs (unpublished data). This calls for more sensitive assays to detect mutations at low VAF, especially for early-stage cancers. In the present study, assays B, D, and G had the greatest sensitivity regardless of variant type, variant VAF, ctDNA input, or sample type. SNVs with VAF ≥ 0.5% (0.5%, 1.0%, and 2.5%) were detected with high sensitivity using seven of the nine participating assays. However, the detection of SNVs below VAF 0.5% varied widely among the assays, with five having sensitivity > 0.8 at 0.2%, and assays B and G having the greatest sensitivity (> 0.75) at 0.1%. It is noted that some samples had cfDNA concentrations that reached the lower limit for some of the assays evaluated (Table S4), which may have resulted in an underestimation of assay performance. This finding is in line with recently published results of ctDNA platform evaluations^15,16 and is, in general, similar to the publicly available information from several ctDNA sequencing providers that have been widely used in global clinical practice^20,21.

Non-SNVs, such as Indels, SVs, and CNVs, are also actionable cancer biomarkers^{22,23,24,25,26}, and as such, there is a need for effective liquid biopsy assays to detect these variant types. For non-SNV types, we observed that assay performance is generally worse than that for SNVs and varies dramatically among different assays. Among the three non-SNV types tested in this study, InDels were detected with high sensitivity (> 90%) by three, three, and four assays at VAF 0.5%, 1%, and 2.5%, respectively. They were detected at high sensitivity by three assays at VAF 0.2% and two assays at VAF 0.1% with sensitivity ≥ 0.7. SVs were detected with high sensitivity (> 0.90) for four, five, and seven participating assays at VAF 0.5%, 1.0%, and 2.5%, respectively. SVs were detected by only three assays at VAF 0.2% (sensitivity > 0.9) and by only one assay at VAF 0.1% (sensitivity approximately 0.60); this is generally lower than what has been reported for the Guardant360 assay¹⁵. For CNVs, our evaluation focused more on VAF 2.5%, as the copy number is diluted and may reach undetectable levels at lower VAFs in the other cfDNA samples. As a result, all assays except the benchmark assay performed poorly in detecting CNVs. One possible influencing factor could be the covered target region of the gene among the different assays²⁷. Additionally, it was reported that the sensitivity of ctDNA CNV detection increases with high ctDNA fraction, high copy number change (≥ 4/copy), and high sequence depth²⁷. The low CNV amplification design (average of 3) and limited sequence depth in most of the assays may have limited the analyses of CNV detection in our study. The above data suggest that improved performance is needed for most assays for the detection of non-SNVs.

In addition to VAFs and variant types, sample input was an influencing factor on detection sensitivity. Several cfDNA samples with low input at various VAFs were incorporated in the present study, and most assays were able to extract ~ 10 ng ctDNA from those samples. We found that low ctDNA input tended to have lower sequence depth (< 3000×) for most assays. As a result, the overall sensitivity for samples with low input was worse than that from higher input; the differences were more prominent at low VAF. Similarly, some assays had higher sensitivity with VAF 0.2% (plasma samples) than with VAF 0.5% (cfDNA samples), which may have been because plasma samples had an adequate amount of cfDNA input. When evaluating the impact of sequence depth and panel size on sensitivity, neither seemed to play a significant role independently. Assays A and G had an overall higher sequence depth than the other assays; however, assay A did not perform as well as assay G. Three assays (D, E, and I) with large panels (panel size > 1 Mb) were included in the study with assay D among the top performing assays.

To avoid over treatment in the clinic, a low FP rate is an important feature of assays to be used in clinical settings. In the present study, specificity was high for all assays; eight of the nine assays did not detect any FPs using the detection of any variant in the WT ctDNA sample as the criteria. Therefore, compared with sensitivity, specificity is less of a differentiating factor for assay performance; this is consistent with a similar study that was recently published that reported low FP rates²⁸. It should be noted that the specificity estimation focused on the FP results when defined as the detection of any designed variants in WT cfDNA samples rather than other background noise. This may overestimate the performance of specificity.

This study provides insight on performance related to VAF accuracy. Accuracy on VAF reporting is not strictly required when it comes to the clinical utility of cancer detection and molecular stratification in cases where determining the presence or absence of target variants is sufficient. Nevertheless, VAF precision is critical for therapeutic monitoring and post-treatment surveillance. For example, when a more accurate representation of the true amount of ctDNA was needed, absolute ctDNA levels (as determined by normalizing VAFs by plasma volume), were used to explore the predictive value in some solid tumors^29,30. Here, accurate VAF calling was found to be the underlying foundation. In hematological malignancies, this was demonstrated as the log-fold change in ctDNA from baseline to on treatment, which was used as a prognostic factor during first-line treatment of patients with diffuse large B-cell lymphoma and relied on the stringent detection of VAF³¹. In the present study, VAF accuracy was higher with increased sample input, emphasizing the importance of sufficient cfDNA to guarantee solid VAF readouts in clinical practice. Additionally, assays B, D, and H demonstrated more accurate VAF than the other assays, with a coefficient > 0.9 when there was medium-to-high cfDNA input where asays B, D, H showed relatively high deduped depth compared with other assays (Fig. S1). At low cfDNA input, it is interesting to be noted that assay A showed a generally good VAF accuray (0.86 in Table S13), which may be partially due to its relatively high sequencing depth at low input (samples marked red of box A in Figure C of Fig. S1). It is consistent with previous findings that fragment depth may affect the detection accuracy^15,20. These observations suggest that sample input and sequence depth account for the level of VAF accuracy.

Experimental factors of the different assays had some effect on assay performance. The observed fluctuations of reported cfDNA input reflected, to some degree, both the extraction efficiency and quantification procedures, which can play a part in assay performance given that inaccurate sample input estimation may further affect downstream library construction and thus potentially impact the optimized ctDNA assay workflow. The capability of handling different samples was also evaluated by including plasma samples, which exhibited a similar extraction efficiency to cfDNA samples. Additionally, we did not observe a prominent effect on technical performance from on-target rate, sequencer and extraction method, which was likely because all the assays were optimized to be robust and were analytically validated with these variables.

Aside from the technical aspects, operational features should also be taken into consideration when performing assay selection in clinical practice. All the vendors of the assays that were included in the present study were accredited by the College of American Pathologists and all the assays have finished the analytical validations. Turnaround time should also be given special consideration when ctDNA testing is used prospectively, as this allows clinicians to receive information on ctDNA status in time to aid in therapy selection or, in the case of clinical trials, patient selection or stratification. Moreover, the responsiveness across all the vendors are all highly efficient. Additionally, the price is comparable except assay I (data not shown), which offered high price, although the actual quotation may vary depending on sample volume and operational requirements in the clinical setting.

In conclusion, this study evaluated multiple aspects of performance related to ctDNA detection across nine NGS assays available in China using designed reference cfDNA samples. The results of this study provide a framework and guidance for researchers and clinicians when selecting a ctDNA assay for specific applications within precision oncology. The cfDNA extraction efficiency, sequence depth, on-target rate, mutation frequency, mutation type, sample input are the main technical factors influencing ctDNA detection performance, which include the sensitivity, specificity and accuracy. How to increase the ctDNA input material and improve the coverage depth are the most critical variables in clinical translation of ctDNA assays. As contrived samples may not reflect the complexity of patient samples, assay performance should be further evaluated and improved in the clinical setting using patient samples. Furthermore, innovative approaches that allow the examination of ctDNA fragment size and ctDNA methylation are expected to play a part in increasing the resolution of ctDNA assays, which need to be further developed and accessed.

Materials and methods

cfDNA samples

The cfDNA samples used in this study were obtained from SeraCare Life Sciences (Milford, MA, USA) and consisted of 45 cancer DNA variants (Table S2) spiked into a background of wildtype (WT) genomic DNA (derived from cell line GM 24385, Coriell Institute for Medical Research, Camden, NJ, USA), with variants at a target AF of 2.5 ± 0.75; at least 95% of variants were within the acceptable range. The ctDNA variants included 45 hotspot alterations in 25 genes with 24 SNVs, nine InDels, eight SVs, and four CNVs (3.2 ± 0.5 total copies per CNV). To create the samples, cfDNA was prepared to a length of approximately 170–180 base pairs using a proprietary SeraCare process (https://www.seracare.com/resources-and-education/seraseq-ngs-reference-materials/).

Plasma samples

For the synthetic plasma samples, 0.2% of the custom cfDNA reference material described above (SeraCare) was added at approximately 30 ng/mL in 5 mL contrived plasma (total extractable cfDNA of approximately 150 ng). The AF target of each variant within this library was a 0.2% (0.14–0.26%) AF mix; at least 95% of variants were within the acceptable range. The CNVs in these plasma samples had a target average total copy number of 3.2 ± 0.5 per CNV as confirmed by droplet digital polymerase chain reaction (PCR).

Assay selection

This study included nine NGS-based ctDNA assays available in China. Assays were selected based on their panel design, validation status, LOD, and the qualifications of the laboratories performing the testing (e.g., College of American Pathologists/Clinical Laboratory Improvement Amendments accreditation; Fig. 1). Assay I, developed by a US-based company and applied globally, was chosen as the benchmark assay in this study. All the assays applied probe hybridization capture for target sequencing library building. A total of 23 reference samples were tested by all the assays, including 20 cfDNA samples and three plasma samples. Samples were sequenced using sequencing platforms and pipelines at each vendor site.

Sample preparation

WT cfDNA and 2.5% cfDNA were mixed to generate samples at different variant AFs (VAFs). After verifying these VAFs (SeraCare), the 1.0%, 0.5%, and 0.1% AF mixed samples were produced at 1.75 ng/μL, 1.25 ng/μL, and 0.5 ng/μL by further diluting the 2.5% mutant cfDNA with WT cfDNA according to mass as quantified using the Qubit dsDNA High Sensitivity assay kit (ThermoFisher Scientific, Waltham, MA, USA) with three input amounts designed (10 ng, 30 ng, 50 ng). Each cfDNA mixture was separated into 44 μL aliquots; 4 μL was used for quantification by the vendor and 40 μL was used for cfDNA extraction. The 2.5% mutant cfDNA mix had an average total copy number of 3.2 per CNVs as determined by droplet digital PCR. Samples containing 1.0%, 0.5%, and 0.1% CNVs were generated by blending the 2.5% sample with WT cfDNA according to mass to reach the desired dilution.

cfDNA extraction and quantification

cfDNA was extracted using a QIAamp circulating nucleic acid kit (Qiagen, Germantown, MD, USA). cfDNA concentrations were verified using the Qubit dsDNA Broad Range assay kit (ThermoFisher Scientific) by SeraCare and cfDNA input was quantified and categorized as low (< 20 ng), medium (20–50 ng), or high (> 50 ng), which was used as input classification for comparison. Digital PCR testing was performed to verify the presence of variants prior to the shearing process for both the 2.5% mix and the plasma samples containing 0.2% reference material. Upon the receipt of plasma samples, participant vendors performed cfDNA extraction, quantification, and all other internal procedures.

Statistical analyses

All statistical analysis was performed using R Statistical Software version 4.0.3³².

Evaluation of sensitivity

Assay sensitivity was defined as the number of on-target known variants detected in a given sample divided by either the number of on-target known variants in the panel (presented here) or by the maximum on-target known variants detected at VAF 2.5% (presented in the supplement). It was calculated within pre-defined expected VAFs (0.1%, 0.2%, 0.5%, 1% and 2.5%) globally and within specific ctDNA inputs, or within specific variant types.

Evaluation of specificity

FPs were defined as the number of on-target known variants present in WT samples. FPs were adjusted for the relevant assay panel size.

Evaluation of reproducibility

When comparing replicates of the same sample within the participating assays, reproducibility was defined as the portion of variants detected in all replicates relative to the total number of variants detected in each of the replicates.

Evaluation of VAF accuracy

VAF accuracy was evaluated by the correlation of the observed and expected VAFs of all reported true-positive variants. Pearson’s correlation was calculated for each assay both globally and within each cfDNA input category.

Data availability

All data generated or analysed during this study are included in this published article (and its Supplementary Information files).

References

Thierry, A. R., El Messaoudi, S., Gahan, P. B., Anker, P. & Stroun, M. Origins, structures, and functions of circulating DNA in oncology. Cancer Metastasis Rev. 35, 347–376. https://doi.org/10.1007/s10555-016-9629-x (2016).
Article CAS PubMed PubMed Central Google Scholar
Keller, L., Belloum, Y., Wikman, H. & Pantel, K. Clinical relevance of blood-based ctDNA analysis: Mutation detection and beyond. Br. J. Cancer 124, 345–358. https://doi.org/10.1038/s41416-020-01047-5 (2021).
Article PubMed Google Scholar
Jahr, S. et al. DNA fragments in the blood plasma of cancer patients: Quantitations and evidence for their origin from apoptotic and necrotic cells. Cancer Res. 61, 1659–1665 (2001).
CAS PubMed Google Scholar
Stroun, M., Lyautey, J., Lederrey, C., Olson-Sand, A. & Anker, P. About the possible origin and mechanism of circulating DNA apoptosis and active DNA release. Clin. Chim. Acta 313, 139–142. https://doi.org/10.1016/s0009-8981(01)00665-9 (2001).
Article CAS PubMed Google Scholar
Diaz, L. A. Jr. & Bardelli, A. Liquid biopsies: Genotyping circulating tumor DNA. J. Clin. Oncol. 32, 579–586. https://doi.org/10.1200/JCO.2012.45.2011 (2014).
Article PubMed PubMed Central Google Scholar
Razavi, P. et al. High-intensity sequencing reveals the sources of plasma circulating cell-free DNA variants. Nat. Med. 25, 1928–1937. https://doi.org/10.1038/s41591-019-0652-7 (2019).
Article CAS PubMed PubMed Central Google Scholar
Adashek, J. J., Janku, F. & Kurzrock, R. Signed in blood: Circulating tumor DNA in cancer diagnosis, treatment and screening. Cancers (Basel). https://doi.org/10.3390/cancers13143600 (2021).
Article PubMed PubMed Central Google Scholar
Fiala, C. & Diamandis, E. P. Utility of circulating tumor DNA in cancer diagnostics with emphasis on early detection. BMC Med. 16, 166. https://doi.org/10.1186/s12916-018-1157-9 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kunnath, A. P. & Priyashini, T. Potential applications of circulating tumor DNA technology as a cancer diagnostic tool. Cureus 11, e4907. https://doi.org/10.7759/cureus.4907 (2019).
Article PubMed PubMed Central Google Scholar
FDA approves liquid biopsy NGS companion diagnostic test for multiple cancers and biomarkers. https://www.fda.gov/drugs/resources-information-approved-drugs/fda-approves-liquid-biopsy-ngs-companion-diagnostic-test-multiple-cancers-and-biomarkers (2020).
Diehl, F. et al. Circulating mutant DNA to assess tumor dynamics. Nat. Med. 14, 985–990. https://doi.org/10.1038/nm.1789 (2008).
Article CAS PubMed Google Scholar
Andersson, D., Kristiansson, H., Kubista, M. & Stahlberg, A. Ultrasensitive circulating tumor DNA analysis enables precision medicine: Experimental workflow considerations. Expert Rev. Mol. Diagn. 21, 299–310. https://doi.org/10.1080/14737159.2021.1889371 (2021).
Article CAS PubMed Google Scholar
Stetson, D. et al. Orthogonal comparison of four plasma NGS tests with tumor suggests technical factors are a major source of assay discordance. JCO Precis. Oncol. 3, 1–9. https://doi.org/10.1200/PO.18.00191 (2019).
Article PubMed Google Scholar
Weber, S. et al. Technical evaluation of commercial mutation analysis platforms and reference materials for liquid biopsy profiling. Cancers (Basel). https://doi.org/10.3390/cancers12061588 (2020).
Article PubMed PubMed Central Google Scholar
Deveson, I. W. et al. Evaluating the analytical validity of circulating tumor DNA sequencing assays for precision oncology. Nat. Biotechnol. 39, 1115–1128. https://doi.org/10.1038/s41587-021-00857-z (2021).
Article CAS PubMed PubMed Central Google Scholar
The Guardant360® assay. https://www.therapyselect.de/sites/default/files/downloads/guardant360/guardant360_specification-sheet_en.pdf.
Abbosh, C. et al. Phylogenetic ctDNA analysis depicts early-stage lung cancer evolution. Nature 545, 446–451. https://doi.org/10.1038/nature22364 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Bettegowda, C. et al. Detection of circulating tumor DNA in early- and late-stage human malignancies. Sci. Transl. Med. 6, 224ra224. https://doi.org/10.1126/scitranslmed.3007094 (2014).
Article CAS Google Scholar
Abbosh, C., Birkbak, N. J. & Swanton, C. Early stage NSCLC—Challenges to implementing ctDNA-based screening and MRD detection. Nat. Rev. Clin. Oncol. 15, 577–586. https://doi.org/10.1038/s41571-018-0058-3 (2018).
Article CAS PubMed Google Scholar
Yu, L. et al. Direct comparison of circulating tumor DNA sequencing assays with targeted large gene panels. PLoS One 17, e0266889. https://doi.org/10.1371/journal.pone.0266889 (2022).
Article CAS PubMed PubMed Central Google Scholar
FoundationOne® liquid CDX: Technical information. https://info.foundationmedicine.com/hubfs/FMI%20Labels/FoundationOne_Liquid_CDx_Label_Technical_Info.pdf.
Coleman, N. et al. Beyond epidermal growth factor receptor: MET amplification as a general resistance driver to targeted therapy in oncogene-driven non-small-cell lung cancer. ESMO Open 6, 100319. https://doi.org/10.1016/j.esmoop.2021.100319 (2021).
Article CAS PubMed PubMed Central Google Scholar
Peng, H. et al. CNV detection from circulating tumor DNA in late stage non-small cell lung cancer patients. Genes (Basel). https://doi.org/10.3390/genes10110926 (2019).
Article PubMed PubMed Central Google Scholar
Harada, G., Santini, F. C., Wilhelm, C. & Drilon, A. NTRK fusions in lung cancer: From biology to therapy. Lung Cancer 161, 108–113. https://doi.org/10.1016/j.lungcan.2021.09.005 (2021).
Article CAS PubMed Google Scholar
Nakamura, Y. et al. Circulating tumor DNA-guided treatment with pertuzumab plus trastuzumab for HER2-amplified metastatic colorectal cancer: A phase 2 trial. Nat. Med. 27, 1899–1903. https://doi.org/10.1038/s41591-021-01553-w (2021).
Article CAS PubMed PubMed Central Google Scholar
Liu, H., Zhang, B. & Sun, Z. Spectrum of EGFR aberrations and potential clinical implications: Insights from integrative pan-cancer analysis. Cancer Commun. (Lond). 40, 43–59. https://doi.org/10.1002/cac2.12005 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Molparia, B., Nichani, E. & Torkamani, A. Assessment of circulating copy number variant detection for cancer screening. PLoS One 12, e0180647. https://doi.org/10.1371/journal.pone.0180647 (2017).
Article CAS PubMed PubMed Central Google Scholar
Tie, J. et al. Circulating tumor DNA analysis detects minimal residual disease and predicts recurrence in patients with stage II colon cancer. Sci. Transl. Med. 8, 346–392. https://doi.org/10.1126/scitranslmed.aaf6219 (2016).
Article CAS Google Scholar
Powles, T. et al. ctDNA guiding adjuvant immunotherapy in urothelial carcinoma. Nature 595, 432–437. https://doi.org/10.1038/s41586-021-03642-9 (2021).
Article ADS CAS PubMed Google Scholar
Bratman, S. V. et al. Personalized circulating tumor DNA analysis as a predictive biomarker in solid tumor patients treated with pembrolizumab. Nat. Cancer 1, 873–881. https://doi.org/10.1038/s43018-020-0096-5 (2020).
Article CAS PubMed Google Scholar
Kurtz, D. M. et al. Circulating tumor DNA measurements as early outcome predictors in diffuse large B-cell lymphoma. J. Clin. Oncol. 36, 2845–2853. https://doi.org/10.1200/JCO.2018.78.5246 (2018).
Article CAS PubMed PubMed Central Google Scholar
Ihaka, R. & Gentleman, R. R: A language for data analysis and graphics. J. Comput. Graph. Stat. 5, 1299–1314. https://doi.org/10.2307/1390807 (2023).
Article Google Scholar

Download references

Acknowledgements

The authors thank Eric Peters, Craig Cummings, Christina Bormann Chung, Zoe June Assaf, Armande Ang Houle, Wendy Huang, Jill Ray, Jennifer Zhang, Eric Zhou, and Tony Guo for discussions throughout the project. The authors thank SeraCare Life Sciences Inc. for reference samples and relevant discussions. The authors thank all the ctDNA assay providers who participated in this study. The authors also thank Sarah Bubeck, PhD, of Edanz (www.edanz.com) for providing medical writing support, which was funded by F. Hoffmann–La Roche/Genentech in accordance with Good Publication Practice (GPP 2022) guidelines (https://www.ismpp.org/gpp-2022).

Author information

These authors contributed equally: Wenjin Li, Xiayu Huang and Rajesh Patel.

Authors and Affiliations

Oncology Biomarker Development, Roche (China) Holding Ltd, Pudong, Shanghai, China
Wenjin Li, Xiayu Huang, Shijing Fu & Jingyu Zhang
Oncology Biomarker Development, Genentech, Ltd, South San Francisco, USA
Rajesh Patel, Erica Schleifman & David S. Shames

Authors

Wenjin Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiayu Huang
View author publications
You can also search for this author in PubMed Google Scholar
Rajesh Patel
View author publications
You can also search for this author in PubMed Google Scholar
Erica Schleifman
View author publications
You can also search for this author in PubMed Google Scholar
Shijing Fu
View author publications
You can also search for this author in PubMed Google Scholar
David S. Shames
View author publications
You can also search for this author in PubMed Google Scholar
Jingyu Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception/design: D.S., J.Z., R.P., W.L., X.H. Provision of study materials: R.P., W.L., S.F. Collection and/or assembly of data: W.L., X.H., S.F. Data analysis and interpretation: X.H., W.L., E.S., J.Z., D.S. Manuscript Writing: W.L., X.H., J.Z., E.S., D.S. Final approval of manuscript: All the authors.

Corresponding authors

Correspondence to David S. Shames or Jingyu Zhang.

Ethics declarations

Competing interests

Wenjin Li, Xiayu Huang, Shijing Fu, and Jingyu Zhang are employees of Roche (China) Holding Ltd, and hold stock or other ownership interests in F. Hoffmann–La Roche. Rajesh Patel, Erica Schleifman, and David S. Shames are employees of Genentech, Ltd, and hold stock or other ownership interests in F. Hoffmann–La Roche.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Legends.

Supplementary Figure S1.

Supplementary Figure S2.

Supplementary Figure S3.

Supplementary Table S1.

Supplementary Table S2.

Supplementary Table S3.

Supplementary Table S4.

Supplementary Table S5.

Supplementary Table S16.

Supplementary Table S7.

Supplementary Table S8.

Supplementary Table S9.

Supplementary Table S10.

Supplementary Table S11.

Supplementary Table S12.

Supplementary Table S13.

Supplementary Table S14.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, W., Huang, X., Patel, R. et al. Analytical evaluation of circulating tumor DNA sequencing assays. Sci Rep 14, 4973 (2024). https://doi.org/10.1038/s41598-024-54361-w

Download citation

Received: 30 October 2023
Accepted: 12 February 2024
Published: 29 February 2024
DOI: https://doi.org/10.1038/s41598-024-54361-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.