Detection of clinically relevant copy number alterations in oral cancer progression using multiplexed droplet digital PCR

Hughesman, Curtis B.; Lu, X. J. David; Liu, Kelly Y. P.; Zhu, Yuqi; Towle, Rebecca M.; Haynes, Charles; Poh, Catherine F.

doi:10.1038/s41598-017-11201-4

Download PDF

Article
Open access
Published: 19 September 2017

Detection of clinically relevant copy number alterations in oral cancer progression using multiplexed droplet digital PCR

Curtis B. Hughesman^1,2,3,
X. J. David Lu^1,3,
Kelly Y. P. Liu^1,3,
Yuqi Zhu^1,3,
Rebecca M. Towle³,
Charles Haynes² &
…
Catherine F. Poh^1,3,4

Scientific Reports volume 7, Article number: 11855 (2017) Cite this article

2810 Accesses
7 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Copy number alterations (CNAs), a common genomic event during carcinogenesis, are known to affect a large fraction of the genome. Common recurrent gains or losses of specific chromosomal regions occur at frequencies that they may be considered distinctive features of tumoral cells. Here we introduce a novel multiplexed droplet digital PCR (ddPCR) assay capable of detecting recurrent CNAs that drive tumorigenesis of oral squamous cell carcinoma. Applied to DNA extracted from oral cell lines and clinical samples of various disease stages, we found good agreement between CNAs detected by our ddPCR assay with those previously reported using comparative genomic hybridization or single nucleotide polymorphism arrays. Furthermore, we demonstrate that the ability to target specific locations of the genome permits detection of clinically relevant oncogenic events such as small, submicroscopic homozygous deletions. Additional capabilities of the multiplexed ddPCR assay include the ability to infer ploidy level, quantify the change in copy number of target loci with high-level gains, and simultaneously assess the status and viral load for high-risk human papillomavirus types 16 and 18. This novel multiplexed ddPCR assay therefore may have clinical value in differentiating between benign oral lesions from those that are at risk of progressing to oral cancer.

Signatures of copy number alterations in human cancer

Article Open access 15 June 2022

Ensemble of nucleic acid absolute quantitation modules for copy number variation detection and RNA profiling

Article Open access 04 April 2022

High-Resolution Copy Number Patterns From Clinically Relevant FFPE Material

Article Open access 20 June 2019

Introduction

Copy number alterations (CNAs), somatic changes involving the gain or loss of genomic material, are common drivers for tumorigenesis^1,2. CNAs can range in size from microscopic alterations, including deletion of chromosomal segments or entire chromosomes, to submicroscopic alterations where up to a 500 kbp segment of genomic DNA is gained or lost³. The importance of CNAs in tumorigenesis has driven extensive studies, including those comprising The Cancer Genome Atlas (TCGA), a genomic repository that includes over 10,000 tumors for which CNAs and other oncogenetic data have been collected (http://www.broadinstitute.org/tcga/home). Sorted by cancer type, TCGA data reveal patterns of common recurrent CNAs that can be used to identify specific loci and larger chromosomal regions more susceptible to alteration in a given cancer^4,5. Reasons for higher alteration rates in specific genomic regions include the presence of fragile sites⁶ and/or selective pressures promoting oncogene activation or tumor suppressor gene (TSG) inactivation via copy number gains or losses, respectively⁴. For example, focal CNAs involving gain of CCND1 at 11q13.3 or loss of CDKN2A at 9p21.3 are observed in a number of cancers, including oral squamous cell carcinoma (OSCC)^7,8,9. But as noted above, specific cancers and their progression are also associated with broader gains or losses of genomic material. One such example in OSCC is a broad gain at the subtelomeric region of 3q that contains many putative oncogenes^8,9.

The detection of one or more common recurrent CNAs has therefore been considered as a means to assess predisposition toward tumorigenesis^10,11. Clinically, this concept could be particularly useful in characterizing tissue where histo-pathological assessment alone is not sufficient to reliably predict malignant potential¹². For example, detection of gains in one or more oncogenes in histologically normal tissue taken from an uninvolved margin of breast cancer has shown promise in defining risk of sporadic breast cancer¹¹, while loss of one or more TSGs correlates with risk of oral lesions progressing to OSCC¹⁰.

Currently, comparative genomic hybridization (CGH) arrays^13,14, single nucleotide polymorphism (SNP) arrays^15,16 and whole-genome next generation sequencing (WG-NGS)¹⁷ are most widely used for genome-wide detection of CNAs. Alternatively, established methods for detecting CNAs at specific targeted genomic loci include fluorescence in-situ hybridization (FISH)¹⁸, multiplex ligation-dependent probe amplification (MLPA)¹⁹, targeted NGS^20,21,22, and quantitative PCR (qPCR)²³. An emerging technique capable of quantitative multiplex detection of CNAs is digital PCR^21,22,23,24. Digital PCR has a number of specific attributes, including low cost and high sensitivity, that make it amenable and particularly attractive for routine clinical use²⁵.

We have previously described a general strategy for designing and analyzing multiplexed droplet digital PCR (ddPCR) experiments to accurately quantify CNAs in genomic DNA recovered from tissue samples²⁶. In this study, we expand on that work to develop a novel multiplexed ddPCR assay designed to detect CNAs and larger chromosomal alterations that have been frequently observed to occur and/or to drive progression from oral precancerous lesions to OSCC. The method targets loci associated with common recurrent CNAs in OSCC progression found in the chromosomal regions 3p, 3q, 4q, 5p, 7p, 8p, 8q, 9p, 11q, 18q, 20p and 20q. Primers targeting the E6 regions of human papillomavirus (HPV) types 16 and 18 are also included²⁷, allowing us to concurrently assess for status and viral load of high-risk HPV, an emerging risk factor associated with head and neck squamous cell carcinoma (HNSCC), especially at the oropharyngeal site^28,29,30. We demonstrate that this ddPCR-based assay detects CNAs at the targeted loci in a manner that is consistent with data from benchmark methods (CGH arrays^31,32,33,34, SNP arrays³⁵ and WG-NGS³⁶) for both oral cell lines and clinical samples of different histopathological stages. We demonstrate advantages of the multiplexed ddPCR assay relative to these benchmark methodologies used to detect CNAs genome-wide, including the ability to detect and quantify submicroscopic homozygous deletions (HDs). Further, we show that the new method can infer the average ploidy level, from which viral loads and high-level copy number gains can be determined. Finally, the ability of the method to quantify different patterns of CNAs in various histopathological stages is demonstrated through application to cell lines having normal, dysplasia and OSCC phenotypes, and to clinical samples classified as non-progressing and progressing low-grade dysplasia (LGD), high-grade dysplasia (HGD), and OSCC.

Results

CNAs detected by our ddPCR Assay in normal, dysplastic and OSCC cell lines

To evaluate the performance of our multiplexed ddPCR assay to detect CNAs, we first analyzed a panel of genomic DNA samples taken from immortalized cell lines representing normal, dysplasia and OSCC phenotypes. Using our previously described sample-specific clustering method²⁶, for each cell line tested, a significant number of 13 reference loci were found to be stable, ranging from all 13 loci for the normal and dysplasia cell lines, as well as for one of the OSCC (CAL 27) cell lines, to 7 loci for the SCC-25 OSCC cell line. For each sample, the subset of stable reference loci are used to define a CNA-neutral benchmark as described previously by our group²⁶. That in turn allows CNAs, expressed as ${R}_{i/b}^{Norm}$ values where R is the normalized ratio of copies of target loci i relative to the average copy number of the benchmark, to be calculated, in this case for each of the 24 target loci.

For the OKF4 E6E7 (normal phenotype) cell line, no significant CNAs were detected in the set of target loci, consistent with orthogonal CGH array data (Supplementary Figure S2A). CNAs identified in the dysplastic cell line POE9n tert included an HD at CDKN2A and a low-level gain at 5p (TERT) (Supplementary Figure S2B). Deletions at CDKN2A in POE9n tert have been reported³⁷, while gains at 5p are consistent with the previously reported increase in TERT expression for this cell line^34,37.

Most of the 24 target loci had statistically significant CNAs in the OSCC cell lines, ranging from 17 CNAs for CAL27 to 23 CNAs for SCC-4 (Fig. 1). Each of these OSCC cell lines has been previously analyzed using CGH or SNP arrays, providing benchmarks for evaluating the performance of our ddPCR assay. For each cell line, the correlation coefficient (R) between each pair of analysis methods was determined (Table 1), with combined R values for the 4 OSCC cell lines determined to be 0.92 (ddPCR versus CGH array), 0.95 (ddPCR versus SNP array), and 0.88 (CGH array versus SNP array). Importantly, good agreement in both gains and losses at the different target loci was observed between the ddPCR assay and either CGH array (Fig. 2A) or SNP array data sets (Fig. 2B). We have also observed our assay can detect and quantify key genetic events, including HDs, changes in ploidy level, and high-level gains, as well as HPV16 and 18.

Table 1 Comparison of oncogenic events and CNAs detected by ddPCR, CGH array or SNP array.

Full size table

Detection of HDs

HDs within fragile sites (e.g. FRA3B, FRA16D) and tumor suppressor genes (e.g., FAT1, CDKN2A) are known to occur in HNSCCs, including OSCC. Accordingly, three regions of HD were identified by our ddPCR assay in the OSCC cell lines, including deletion of exon 5 of FHIT within the FRA3B region (3p14.2), deletion of FAT1 located near the telomeric end of 4q, and deletion of all or part of CDKN2A at 9p21.3. Each of these HDs was orthogonally confirmed by SNP array data (Table 1) except for CDKN2A observed in the SCC-25 cell line, which had been previously confirmed by homozygous deletion scanning³⁸. None of these HDs were detected in the CGH arrays (Table 1).

With additional target loci used at either 3p14.2 or 9p21.1, our assay was able to determine the size of the HD regions (Table 2). HD at FRA3B was observed in three target loci in both the CAL27 and SCC-9 cell lines, with the size of the HD thereby determined to be between 12.8 and 551.1 kbp (Table 2). In the three cell lines (POE9n tert, SCC-25 and SCC-9) with known HDs at 9p21.3, we were able to quantify the size of HDs to be between 6.9 kbp and 285.3 kbp (Table 2), encompassing the target loci situated in exon and intron 1 of CDKN2A. The size of the HDs at 3p14.2 and 9p21.3 would classify them as being submicroscopic and as noted here were not detected by CGH arrays. Although submicroscopic HDs are generally observed by SNP arrays, very small HDs, such as the one at CDKN2A in the SCC-25 cell line, can go undetected.

Table 2 Mapped regions of HDs at FRA3B 3p14.2 and/or CDKN2A 9p21.1 in oral dysplasia and SCC cell lines.

Full size table

Inferring ploidy level

Our multiplexed ddPCR assay can measure absolute copy number ratios (R _i/b) to infer ploidy level. When applied the data analysis algorithm described in the Methods section for the 4 OSCC cell lines, a diploid status is detected at a 99% significance threshold for CAL27 and SCC-25, and a triploid status is detected for SCC-4 and SCC-9 (Fig. 3); these ploidy levels agree with the average genomic copy number determined by SNP array (Table 1).

Quantification of high-level copy number gains

Current array CGH and SNP array platforms can be used to detect high-level gains. With SNP arrays, however, accurate estimation of the copy number in regions with high-level gains is challenged by signal saturation effects created when PCR amplification of the sample is conducted. Some algorithms used to interpret raw SNP array data³⁹ therefore apply a predefined maximum copy number call, while large copy number call made by alternative algorithms that employ no predefined maximum carry relatively large uncertainties due to the saturation effects observed during DNA hybridization and scanning of array signals⁴⁰. Likewise, uncertainties in array-CGH derived log₂ values increase, especially for high-level gains.

Our ddPCR-based assay can determine the absolute concentration of amplifiable targets, even when present at relatively high concentrations⁴¹, to enable accurate quantification of high-level gains. In the SCC-25 cell line, target loci at 11q13.3 (CCND1i3 and CCND1x5) showed CNAs of R _i/b = 3.6 ± 0.6 and 3.7 ± 0.6 respectively, indicating a ~4 fold increase in genomic copies of CCND1, which is consistent with previous independent and orthogonal studies reporting elevated CCND1 copy number in this cell line^42,43.

Quantification of HPV 16/18 viral load and detection of CNAs in high-risk HPV-positive cell lines

High-risk strains 16 and 18 of human papillomavirus (HPV) are established prognostic biomarkers for OSCC, with HPV 16/18 infections of the oropharynx accounting for ~20% of all HNSCCs²⁹. Our multiplexed ddPCR assay therefore amplifies serotype-specific GP5+/GP6+ targets (~150 bp) within the E6 gene encoded on the long control region of high-risk HPV strains 16 and 18²⁷ in a manner that allows one to detect those virulent strains and estimate viral loads in transfected cells or tissue samples. For example, OKF4 E6E7, an immortalized cell line transfected with HPV16 E6E7, tests positive for HPV16, with a R _i/b = 0.6 (±0.1) (Figure S3). Likewise, two additional HPV positive cell lines, SiHa and HeLa, tested positive for HPV-16 and HPV-18, respectively. Using the algorithm described in the Methods section, the inferred ploidy level for both of these cell lines was triploid, consistent with both SNP array data (Table 1) and other studies^36,44. Multiplying the ploidy level by the copy number ratio R _i/b (where i = the HPV-16 or HPV-18 biomarker) allows for estimates of viral load, which for SiHa and HeLa were thereby determined to be 5 (±1) and 13 (±1) copies/cell, respectively (Figure S3), consistent with previously reported viral loads for these non-OSCC cancer cell lines^44,45. CNAs for the remaining target loci within the assay were also measured. For both SiHa (Figure S4) and HeLa (Figure S5), good agreement was also observed between the ddPCR assay results and corresponding CGH and SNP array data sets (Table 1). Moreover, CNAs within the HeLa cells have also been measured by WG-NGS³⁶ (Figure S5), and our ddPCR data correlate well (R = 0.98 (ddPCR versus WG-NGS)) with that benchmark method, confirming the ability of the multiplexed ddPCR method to quantify CNAs in accordance with the three primary genome-wide CNA-detection methods currently employed in research laboratories – CGH arrays, SNP arrays, and WG-NGS – while offering at low cost a higher throughput than can be realized with any of those benchmarks.

CNAs detected by our ddPCR assay in clinical tissues representing different histopathological stages

Accurate detection of CNAs in clinical samples is intrinsically challenged by tumor heterogeneity, potential degradation of genomic DNA, and other artifacts, especially within FFPE tissue blocks. Any added uncertainty in assay performance when applied to clinical samples was therefore evaluated by comparing assay results for normal tissue collected from non-diseased patients in the form of fresh blood, frozen tissue block or formalin-fixed paraffin embedded (FFPE) tissue block samples. CNAs were measured for each sample and compared both within and across sample types. DNA extracted from the blood and frozen tissue samples was found to be of high quality with minimal fragmentation. As expected for these normal tissue samples, no statistically significant CNAs were detected at any of the target loci. DNA extracted from FFPE tissue was significantly fragmented; our previously described algorithm²⁶ to correct for amplicon-length mediated bias in CNAs was therefore applied. No statistically significant CNAs were then detected for any of the target loci (Figure S6).

The performance of the ddPCR assay on clinical samples was further verified through application to DNA extracted from clinical FFPE tissue biopsy blocks previously studied by array CGH³². Five samples were selected representing different histopathological stages, including 2 LGDs (Oral 67 and 48), 1 HGD (Oral 44), and 2 OSCCs (Oral 75 and 91). In Oral 67, a non-progressing LGD, no recurrent gains were observed in any of the 24 target loci. However, the only statistically significant loss at the TERTx2 locus at 5p observed, consistent with the log₂ value obtained from array CGH data (Fig. 4A), was not generally associated with oral cancer progression risk (Fig. 5). In Oral 48, histopathologically classified as a progressing LGD, 17 of the 24 target loci showed statistically significant CNAs, including gains at 3q, 5p, 7p, 8q, 11q and 20p/q and losses at 4q, 8p, 9p and 18q (Fig. 4B). Collectively, these observed CNAs are consistent with CGH array data and with OSCC progression (Fig. 5). Significant CNAs were also detected in Oral 44 (HGD) and in the 2 OSCC samples (Oral 75 and 91) (Fig. 4C–E), with the ddPCR results again in good agreement with benchmark array CGH data for these samples (Fig. 4F and Table 1).

Discussion

Although CNAs are an important event involved in tumorigenesis, their detection in clinical cancer genetics labs is generally limited to few cytogenetic-based assays focusing on gains of a single gene, such as ERBB2 used to determine breast cancer patient’s suitability for targeted therapy⁴⁶. In principle, technologies that screen the entire genome for CNAs, such as array CGH, SNP arrays, and WG-NGS, can ameliorate this problem, but are at present too costly and time intensive for general clinical use. Our novel multiplexed ddPCR assay is able to detect common recurring CNAs in DNA isolated from either oral cell lines or clinical samples of varying disease stage. The CNAs recorded are accurate, as shown through comparison to CGH, SNP arrays, and WG-NGS, all of which are considered gold-standards for genome-wide detection of copy number gains and losses⁴⁷. These capabilities, when combined with the relatively low cost and fast turn-around time of the method, suggest that our multiplexed ddPCR assay possesses attributes needed for clinical translation.

The design of the ddPCR assay presented several challenges, including establishing a stable reference for CNA quantification, selection and precise positioning of target loci, and the ability to account and correct for fragmentation in DNA extracted from FFPE tissue²⁶. By successfully addressing these challenges, the multiplexed ddPCR assay is able to identify several types of oncogenic events associated with OSCC progression, including HDs and high-level gains. By design, the ddPCR assay precisely targets regions with common recurrent gains or losses in oral cancer progression^7,10,48. For example, the ddPCR assay detects recurrent HDs, such as at CDKN2A, that may go undetected by CGH array, and in some cases even SNP array, because of their submicroscopic size (<500 kbps). We have also demonstrated a means by which ddPCR data can be used to infer average ploidy level, which has not been done before. The ability to quantify high-level gains, which can be challenging to CGH or SNP arrays due to a combination of algorithm limitations and saturation of the signal^39,40, could be clinically important. For instance, CCND1 expression levels correlate to radio-sensitivity and therefore may be used to predict effectiveness of radiation therapy in OSCC⁴².

The novel multiplexed ddPCR assay described here demonstrate its ability to detect recurrent CNAs frequently gained or lost during oral carcinogenesis. While no significant CNAs in the 24 target loci were detected in either the normal oral cell line OKF4 E6E7 or the normal oral tissue and non-progressing LGD biopsies, a number of target loci were found to have significant gains or losses in the dysplastic and SCC oral cell lines, as well as in clinical oral biopsies classified as either progressing-LGD, HGD or SCC. In the dysplastic cell line POE9n tert, the observed pattern of CNAs is indicative of p16 (CDKN2A) inactivation and telomerase (TERT) activation; results of the assay are therefore consistent with an oncogenic pathway implicated in cellular immortalization and tumor progression³⁷. In the progressing-LGD, loss of CDKN2A was observed along with other CNAs known to correlate with oral carcinogenesis⁴⁸, including losses at FAT1 and CSMD1, gains at MYC and 20p/q, and high-level gains at EGFR. Taken together, these results provide initial evidence that the pattern of common recurrent CNAs detected by ddPCR may be used to differentiate benign or noncancerous tissue from precancerous or cancerous tissue.

Although our primary goal was to establish a clinically-amenable method to detect CNAs as a predictor of cancer progression by targeting loci where gains or losses are indicative of oncogene activation or TSG suppression, respectively, there is growing evidence that CNAs in these regions also have prognostic value. For example, gains at EGFR or CCND1, and/or loss of 18q have been associated with lower survival and disease progression in OSCC⁴⁹. Detection of high-risk HPV strains may also have prognostic value, as HNSCC patients with HPV-positive cancers have better outcomes and may have improved response to certain treatments^29,30. Additionally, the identification of HPV positive tissue in lymph nodes may be used to localize the site of origin when the primary tumor site is not clinically apparent²⁸.

In conclusion, the multiplexed ddPCR assay we have reported on here demonstrate its clinical value in detecting multiple target loci for common recurrent CNAs known to be associated with oral cancer progression. Further study of this assay using a larger cohort of progressing and non-progressing oral lesions is on-going to examine its ability to assess the oral cancer progression risk of histologically similar low-grade lesions.

Methods

Selection of reference loci to establish CNA-neutral benchmark

Thirteen reference loci (Fig. 5, Supplementary Table S1) were identified from chromosomal regions (1p, 1q, 2p, 2q, 6p, 10q, 12q, 16p, 16q, 19p, 22q) found to have low CNA frequency rates in TCGA data for HNSCC. Additionally, for one reference loci (CPT2), amplification templates of various lengths were designed to correct any bias to measured CNAs arising from significant fragmentation of the DNA isolated from a sample. The algorithm used to make those length-based corrections, as well as that used to define a CNA-neutral benchmark from ddPCR data for the 13 reference loci, has been described previously²⁶.

Selection of target loci for detection of common CNAs in OSCC

A total of 24 target loci (Fig. 5, Supplementary Table S1) were designed to query twelve chromosomal regions, including 3p, 3q, 4q, 5p, 7p, 8p, 8q, 9p, 11q, 18q, 20p, and 20q, where recurrent CNAs have been observed and correlated with oral cancer progression from oral premalignant lesions⁴⁸, and/or where TCGA data have revealed frequent gains (3q, 5p, 7p, 8q, 11q, and 20p/q), losses (3p, 4q, 8p and 18q) or both (9p)⁸. Chromosomal regions associated with focal CNAs are queried with proximal target loci (e.g. CDKN2Ai1 and CDKN2Ax3 for the CDKN2A gene at 9p21.3), while broad regions are queried with distal target loci (e.g. EPHB3x3 and TP63i2 for the sub-telomeric region of 3q).

Primers and probes used for ddPCR amplification of reference and target loci

HPLC purified probes dual labeled with 3′-IBHQ and either 5′-FAM or 5′-HEX, as well as all standard desalted primers, were purchased from IDT Inc. (Coralville, IA). Sequences for the primers and probes for reference and target loci are shown in Supplementary Table S1. The primers and probes used to detect and quantify the E6 region of HPV types 16 and 18 were previously described²⁷. All primers and probes were resuspended to 100 µM in IDTE (10 mM Tris, pH 8.0, 0.1 mM EDTA) buffer.

Cell lines

The cell lines OKF4 E6E7, POE9n tert, CAL27, SCC-4, SCC-9, SCC-25, SiHa, and HeLa were obtained from the B.C. Cancer Research Centre with their original sources and culture methods reported in previous CGH studies^31,33,34. The identity of the cell lines was confirmed by CGH arrays^31,33,34, SNP arrays³⁵ and/or WG-NGS³⁶.

Clinical samples

A total of 11 clinical samples were analyzed. Four normal blood samples from healthy volunteers, one non-diseased frozen, and six FFPE tissue samples (one normal, two LGL, one HGL, and 2 OSCC)³² were collected under approval from the UBC/BCCA Clinical Ethics Research Board (H15-01554).

DNA sample preparation

DNA extraction protocols used on buffy coat from blood samples, and on micro-dissected frozen and FFPE tissue samples were described previously²⁶. For cell lines, cultured cells were trypsinized and harvested, then washed with 1× PBS, centrifuged, and resuspended in lysis buffer. DNA was extracted from the digested tissue using a SQ blood DNA kit (D5032-00, Omega Bio-tek Inc., Norcross, GA) according to the manufacturer’s instructions. All the DNA samples were quantified using a Qubit 2.0 Fluorometer (ThermoFisher Scientific, Waltham, MA).

ddPCR experiments

All experiments were performed using a QX200 droplet generator and reader (Bio-Rad Inc., Hercules, CA). Supplementary Figure S1 shows the schematic representation for ddPCR. The multiplexed ddPCR assay uses a total of 16 unique 4-plex amplification reactions, performed in duplicate in a total of 32 reaction wells. The four reactions performed in each well are provided in Supplementary Table S2. Each 4-plex ddPCR 20 μl reaction was prepared with 10 μl of 2X ddPCR Supermix for probes (No dUTP) (Bio-Rad Inc.), forward and reverse primers, each at a final concentration (C _t) = 900 nM, and a combination of FAM and HEX-labeled probes at a C _t = 60 to 300 nM (Table S2) so as to obtain a staggered layout of target-positive droplet clusters in the ddPCR output²⁶. For each of the samples tested, a total of ~300 ng (~10 ng/well) of DNA was used. Further details of the protocol used in the ddPCR workflow have been provided previously²⁶.

Analysis of ddPCR output

Our algorithm for analyzing assay output to quantify CNAs has been previously described in detail²⁶. Briefly, droplet counts are converted to a copy number ratio, R _i/cr, for each reference or target loci i, together with the associated error (standard deviation) σ _Ri/cr in the value (where, i = the concentration of the reference or target locus, and cr = the concentration of the constant reference locus HFE2 that is present in all 32 of the 4-plex ddPCR reaction wells). As we have previously described²⁶, R _i/cr values, determined for three amplicons of unique lengths (89, 106 and 125 bp) at the i = CPT2 locus, are used to analyze if fragmentation of the DNA isolated from the sample is significant. For those samples determined to have statistically significant fragmentation, the R _i/cr values for reference and target loci are corrected based on their amplicon lengths as previously described²⁶. For the ddPCR assay reported here, the amplicon lengths for the set of reference (97 to 106 bp) and target loci (93 to 124 bp) have been specifically designed to be short and similar in size to minimize the fragmentation corrections required (as well as the uncertainty (σ _Ri/cr) associated with R _i/cr) as the relative amount of amplifiable DNA for larger amplicons can be appreciably lower in samples where fragmentation is significant²⁶. The R _i/cr values for reference loci are then analyzed by a k-means type cluster analysis to select those reference loci inferred to be CNA-neutral. Their R _i/cr values are averaged to establish a benchmark (b). Normalized R _i/b $({R}_{i/b}^{Norm})$ are then determined for each target locus. Statistically significant CNAs for each target locus are identified using the null hypothesis for a chosen confidence interval (CI). Here we selected a CI = 99%, ensuring that significant CNAs are identified with high confidence. Gains and losses are identified as ${R}_{i/b}^{Norm} > 0\,{\rm{and}}\, < 0$, respectively, with high-level gains and homozygous deletions (HD) identified as ${R}_{(i/b)}^{Norm} > 1\,{\rm{and}}=-1$, respectively.

Algorithm to infer ploidy level

In homozygous systems, ${R}_{i/b}^{Norm}$ values are expected to represent discrete changes in copy numbers at each target locus i. Thus, in a pure diploid cell line low-level CNAs ($-1.0 < {R}_{i/b}^{Norm} < 1.0$) should be identified by an ${R}_{i/b}^{Norm}=\pm 0.5$, while in a triploid cell line, low-level CNAs should carry a ${R}_{i/b}^{Norm}=\pm 0.33\,{\rm{or}}\,\pm 0.66$. Based on this, ${R}_{i/b}^{Norm}$ values determined in our ddPCR assay may be used to infer ploidy level. Briefly, for a given sample, measured target-loci ${R}_{i/b}^{Norm}$ values between −0.99 and 0.99 (representative of low-level gains in a sample) are compared to the expected discrete copy number changes for an assumed ploidy (R _p(L)), where L represents the ploidy level being queried (ranging from 1 to 5 genomic copies in the algorithm used here). In this analysis, we exclude high-level gains (due to larger values of σ_R), as well as HDs, as both are non-informative with respect to ploidy level (L). For each assumed L being queried by the algorithm, the total number of target loci (N _t(L)) for which

$${R}_{p(L)}={R}_{i/b}^{Norm}\pm z{\sigma }_{R}$$

(1)

is determined at the z for the chosen CI (99% in our analysis). Note that as the assumed ploidy level is increased, the value of R _p(L) will decrease (0.5 for diploid, 0.33 for triploid, 0.25 tetraploid), increasing the odds that the number of target loci with ${R}_{i/b}^{Norm}$ that match the chosen R _p(L) will increase. To account for this bias, we determine the average and associated error (N _t(L)* ± σ _N*) of the total number of target loci expected to match each ploidy level due to random chance using a Monte Carlo simulation; the details of that simulation are provided in the Supplementary Methods. The alternative hypothesis H: N _t(L) > N _t(L)* is then tested for each assumed ploidy level using the relation

$${z}_{L}=\frac{{N}_{t(L)}-{N}_{t(L)}\ast }{{\sigma }_{N}\ast }$$

(2)

The inferred ploidy level is given by the L that maximizes z _L, with the level of significance determined from the associated analysis.

Array CGH data

The set of three array CGH probes that overlay or lie adjacent to each target locus i were mapped using the UCSC Genome Browser NCBI35/hg17 assembly⁵⁰. The log₂ values for the array CGH data^31,32,33,34 for each of those probes were calculated and averaged. The resulting log₂ values were then rescaled to facilitate direct comparison with ddPCR-derived ${R}_{i/b}^{Norm}$ values as follows; HD and high-level gains, typically defined as regions with log₂ values of −0.8 and 0.8 respectively³³, were taken as log₂ value minima/maxima and used to rescale CGH data to corresponding range of ddPCR-determined ${R}_{i/b}^{Norm}$ values (−1.0 for HD and +1.0 for high-level gain).

SNP array data

SNP-array based copy number analyses of several of the cell lines investigated have been performed previously, with results archived by the COSMIC (Catalogue of Somatic Mutations in Cancer) cell line project⁵¹ and data from the Affymetrix SNP 6.0 array analyzed with PICNIC (Predict Integral Copy Numbers In Cancer) software⁵². Inferred copy number of chromosomal segments and the genome average copy number for each cell line are available through canSAR (cansar.icr.ac.uk)³⁵. To allow for comparison between ddPCR and CGH array data, the change (Δ) in copy number from SNP data was determined for the different target loci by subtracting the inferred ploidy level (integer rounded average genomic copy number) from the copy number assigned to the chromosomal segment representing the target locus.

WG-NGS data

WG-NGS data for HeLa cell line have been previously published³⁶. The ploidy levels of chromosomal regions of interest were approximated based on the freely accessible figures and supplemental files published.

References

Hanahan, D. & Weinberg, RobertA. Hallmarks of Cancer: The Next Generation. Cell 144, 646–674, doi:10.1016/j.cell.2011.02.013 (2011).
Article CAS PubMed Google Scholar
Pikor, L., Thu, K., Vucic, E. & Lam, W. The detection and implication of genome instability in cancer. Cancer and Metastasis Reviews 32, 341–352, doi:10.1007/s10555-013-9429-5 (2013).
Article CAS PubMed PubMed Central Google Scholar
Tang, Y. C. & Amon, A. Gene copy-number alterations: a cost-benefit analysis. Cell 152, 394–405, doi:10.1016/j.cell.2012.11.043 (2013).
Article CAS PubMed PubMed Central Google Scholar
Beroukhim, R. et al. The landscape of somatic copy-number alteration across human cancers. Nature 463, 899–905, doi:10.1038/nature08822 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Zack, T. I. et al. Pan-cancer patterns of somatic copy number alteration. Nature genetics 45, 1134–1140, doi:10.1038/ng.2760 (2013).
Article CAS PubMed PubMed Central Google Scholar
Smith, D. I., Zhu, Y., McAvoy, S. & Kuhn, R. Common fragile sites, extremely large genes, neural development and cancer. Cancer Letters 232, 48–57, doi:10.1016/j.canlet.2005.06.049 (2006).
Article CAS PubMed Google Scholar
Leemans, C. R., Braakhuis, B. J. M. & Brakenhoff, R. H. The molecular biology of head and neck cancer. Nature Reviews Cancer 11, 9–22, doi:10.1038/nrc2982 (2010).
Article PubMed Google Scholar
Cancer Genome Atlas, N. Comprehensive genomic characterization of head and neck squamous cell carcinomas. Nature 517, 576–582, doi:10.1038/nature14129 (2015).
Article ADS Google Scholar
Beck, T. N. & Golemis, E. A. Genomic insights into head and neck cancer. Cancers of the Head & Neck 1, 1, doi:10.1186/s41199-016-0003-z (2016).
Article Google Scholar
Zhang, L. et al. Loss of heterozygosity (LOH) profiles–validated risk predictors for progression to oral cancer. Cancer prevention research 5, 1081–1089, doi:10.1158/1940-6207.CAPR-12-0173 (2012).
Article PubMed PubMed Central Google Scholar
Forsberg, L. A. et al. Signatures of post-zygotic structural genetic aberrations in the cells of histologically normal breast tissue that can predispose to sporadic breast cancer. Genome research 25, 1521–1535, doi:10.1101/gr.187823.114 (2015).
Article CAS PubMed PubMed Central Google Scholar
Speight, P. M. et al. Screening for oral cancer—a perspective from the Global Oral Cancer Forum. Oral Surgery, Oral Medicine, Oral Pathology and Oral Radiology, doi:10.1016/j.oooo.2016.08.021 (2016).
Kallioniemi, A. et al. Comparative genomic hybridization for molecular cytogenetic analysis of solid tumors. Science 258, 818–821 (1992).
Article ADS CAS PubMed Google Scholar
Snijders, A. M. et al. Assembly of microarrays for genome-wide measurement of DNA copy number. Nature genetics 29, 263–264, doi:10.1038/ng754 (2001).
Article CAS PubMed Google Scholar
Zhao, X. et al. An Integrated View of Copy Number and Allelic Alterations in the Cancer Genome Using Single Nucleotide Polymorphism Arrays. Cancer Research 64, 3060–3071, doi:10.1158/0008-5472.can-03-3308 (2004).
Article CAS PubMed Google Scholar
Peiffer, D. A. et al. High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping. Genome research 16, 1136–1148, doi:10.1101/gr.5402306 (2006).
Article CAS PubMed PubMed Central Google Scholar
Scheinin, I. et al. DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly. Genome research 24, 2022–2032, doi:10.1101/gr.175141.114 (2014).
Article CAS PubMed PubMed Central Google Scholar
Hu, L. et al. Fluorescence in situ hybridization (FISH): an increasingly demanded tool for biomarker research and personalized medicine. Biomarker research 2, 3, doi:10.1186/2050-7771-2-3 (2014).
Article PubMed PubMed Central Google Scholar
Schouten, J. P. et al. Relative quantification of 40 nucleic acid sequences by multiplex ligation-dependent probe amplification. Nucleic acids research 30, e57 (2002).
Article PubMed PubMed Central Google Scholar
Lechner, M. et al. Targeted next-generation sequencing of head and neck squamous cell carcinoma identifies novel genetic alterations in HPV + and HPV- tumors. Genome Med 5, 49, doi:10.1186/gm453 (2013).
Article PubMed PubMed Central Google Scholar
Beltran, H. et al. Targeted next-generation sequencing of advanced prostate cancer identifies potential therapeutic targets and disease heterogeneity. Eur Urol 63, 920–926, doi:10.1016/j.eururo.2012.08.053 (2013).
Article CAS PubMed Google Scholar
Robbins, C. M. et al. Copy number and targeted mutational analysis reveals novel somatic events in metastatic prostate tumors. Genome research 21, 47–55, doi:10.1101/gr.107961.110 (2011).
Article CAS PubMed PubMed Central Google Scholar
D’Haene, B., Vandesompele, J. & Hellemans, J. Accurate and objective copy number profiling using real-time quantitative PCR. Methods 50, 262–270, doi:10.1016/j.ymeth.2009.12.007 (2010).
Article PubMed Google Scholar
Vogelstein, B. & Kinzler, K. W. Digital PCR. Proceedings of the National Academy of Sciences of the United States of America 96, 9236–9241 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Day, E., Dear, P. H. & McCaughan, F. Digital PCR strategies in the development and analysis of molecular biomarkers for personalized medicine. Methods 59, 101–107, doi:10.1016/j.ymeth.2012.08.001 (2013).
Article CAS PubMed Google Scholar
Hughesman, C. B. et al. A Robust Protocol for Using Multiplexed Droplet Digital PCR to Quantify Somatic Copy Number Alterations in Clinical Tissue Specimens. PLoS ONE 11, e0161274, doi:10.1371/journal.pone.0161274 (2016).
Article PubMed PubMed Central Google Scholar
Schmitz, M. et al. Quantitative multiplex PCR assay for the detection of the seven clinically most relevant high-risk HPV types. Journal of clinical virology: the official publication of the Pan American Society for Clinical Virology 44, 302–307, doi:10.1016/j.jcv.2009.01.006 (2009).
Article CAS Google Scholar
Westra, W. H. The pathology of HPV-related head and neck cancer: implications for the diagnostic pathologist. Seminars in diagnostic pathology 32, 42–53, doi:10.1053/j.semdp.2015.02.023 (2015).
Article PubMed Google Scholar
Maxwell, J. H., Grandis, J. R. & Ferris, R. L. HPV-Associated Head and Neck Cancer: Unique Features of Epidemiology and Clinical Management. Annual Review of Medicine 67, 91–101, doi:10.1146/annurev-med-051914-021907 (2016).
Article CAS PubMed Google Scholar
Vokes, E. E., Agrawal, N. & Seiwert, T. Y. HPV-Associated Head and Neck Cancer. Journal of the National Cancer Institute 107, doi:10.1093/jnci/djv344 (2015).
Lockwood, W. W., Coe, B. P., Williams, A. C., MacAulay, C. & Lam, W. L. Whole genome tiling path array CGH analysis of segmental copy number alterations in cervical cancer cell lines. International journal of cancer 120, 436–443, doi:10.1002/ijc.22335 (2007).
Article CAS PubMed Google Scholar
Garnis, C. et al. Genomic imbalances in precancerous tissues signal oral cancer risk. Molecular cancer 8, 50, doi:10.1186/1476-4598-8-50 (2009).
Article PubMed PubMed Central Google Scholar
Tsui, I. F. & Garnis, C. Integrative molecular characterization of head and neck cancer cell model genomes. Head & neck 32, 1143–1160, doi:10.1002/hed.21311 (2010).
Article Google Scholar
Dickman, C. T., Towle, R., Saini, R. & Garnis, C. Molecular characterization of immortalized normal and dysplastic oral cell lines. Journal of oral pathology & medicine: official publication of the International Association of Oral Pathologists and the American Academy of Oral Pathology 44, 329–336, doi:10.1111/jop.12236 (2015).
Article CAS Google Scholar
Tym, J. E. et al. canSAR: an updated cancer research and drug discovery knowledgebase. Nucleic acids research 44, D938–D943, doi:10.1093/nar/gkv1030 (2016).
Article CAS PubMed Google Scholar
Landry, J. J. M. et al. The Genomic and Transcriptomic Landscape of a HeLa Cell Line. G3: Genes|Genomes|Genetics 3, 1213–1224, doi:10.1534/g3.113.005777 (2013).
Article PubMed PubMed Central Google Scholar
Dickson, M. A. et al. Human Keratinocytes That Express hTERT and Also Bypass a p16INK4a-Enforced Mechanism That Limits Life Span Become Immortal yet Retain Normal Growth and Differentiation Characteristics. Molecular and Cellular Biology 20, 1436–1447, doi:10.1128/mcb.20.4.1436-1447.2000 (2000).
Article CAS PubMed PubMed Central Google Scholar
Pineau, P., Marchio, A., Cordina, E., Tiollais, P. & Dejean, A. Homozygous deletions scanning in tumor cell lines detects previously unsuspected loci. International journal of cancer 106, 216–223, doi:10.1002/ijc.11214 (2003).
Article CAS PubMed Google Scholar
Mosén-Ansorena, D., Aransay, A. M. & Rodríguez-Ezpeleta, N. Comparison of methods to detect copy number alterations in cancer using simulated and real genotyping data. BMC Bioinformatics 13, 1–12, doi:10.1186/1471-2105-13-192 (2012).
Article Google Scholar
Liu, Z., Li, A., Schulz, V., Chen, M. & Tuck, D. MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells. PLoS ONE 5, e10909, doi:10.1371/journal.pone.0010909 (2010).
Article ADS PubMed PubMed Central Google Scholar
Hindson, B. J. et al. High-throughput droplet digital PCR system for absolute quantitation of DNA copy number. Analytical chemistry 83, 8604–8610, doi:10.1021/ac202028g (2011).
Article CAS PubMed PubMed Central Google Scholar
Shintani, S., Mihara, M., Ueyama, Y., Matsumura, T. & Wong, D. T. W. Cyclin D1 overexpression associates with radiosensitivity in oral squamous cell carcinoma. International journal of cancer 96, 159–165, doi:10.1002/ijc.1014 (2001).
Article CAS PubMed Google Scholar
Liu, H. S. et al. Detection of copy number amplification of cyclin D1 (CCND1) and cortactin (CTTN) in oral carcinoma and oral brushed samples from areca chewers. Oral oncology 45, 1032–1036, doi:10.1016/j.oraloncology.2009.06.007 (2009).
Article CAS PubMed Google Scholar
Roberts, I. et al. Critical evaluation of HPV16 gene copy number quantification by SYBR green PCR. BMC Biotechnology 8, 57, doi:10.1186/1472-6750-8-57 (2008).
Article PubMed PubMed Central Google Scholar
Meissner, J. D. Nucleotide sequences and further characterization of human papillomavirus DNA present in the CaSki, SiHa and HeLa cervical carcinoma cell lines. Journal of General Virology 80, 1725–1733, doi:10.1099/0022-1317-80-7-1725 (1999).
Article CAS PubMed Google Scholar
Hurvitz, S. A., Hu, Y., O’Brien, N. & Finn, R. S. Current approaches and future directions in the treatment of HER2-positive breast cancer. Cancer treatment reviews 39, 219–229, doi:10.1016/j.ctrv.2012.04.008 (2013).
Article CAS PubMed Google Scholar
Michels, E., De Preter, K., Van Roy, N. & Speleman, F. Detection of DNA copy number alterations in cancer by array comparative genomic hybridization. Genet Med 9, 574–584 (2007).
Article CAS PubMed Google Scholar
Salahshourifar, I., Vincent-Chong, V. K., Kallarakkal, T. G. & Zain, R. B. Genomic DNA copy number alterations from precursor oral lesions to oral squamous cell carcinoma. Oral oncology 50, 404–412, doi:10.1016/j.oraloncology.2014.02.005 (2014).
Article CAS PubMed Google Scholar
Park, B. J., Chiosea, S. I. & Grandis, J. R. Molecular changes in the multistage pathogenesis of head and neck cancer. Cancer biomarkers: section A of Disease markers 9, 325–339, doi:10.3233/cbm-2011-0163 (2010).
Article PubMed Central Google Scholar
Kent, W. J. et al. The human genome browser at UCSC. Genome research 12, 996–1006, doi:10.1101/gr.229102 (2002).
Article CAS PubMed PubMed Central Google Scholar
Forbes, S. A. et al. COSMIC: exploring the world’s knowledge of somatic mutations in human cancer. Nucleic acids research 43, D805–D811, doi:10.1093/nar/gku1075 (2015).
Article CAS PubMed Google Scholar
Greenman, C. D. et al. PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data. Biostatistics 11, 164–175, doi:10.1093/biostatistics/kxp045 (2010).
Article PubMed Google Scholar

Download references

Acknowledgements

The authors thank Kareem Fakhfakh, Louise Lund, Ryan Brinkman, Benedikt Brink and Cindy Cui for their assistance in the research and helpful scientific discussion. We also would like to thank Cathie Garnis for the oral cell lines.

Author information

Authors and Affiliations

Department of Oral Medical and Biological Sciences, Faculty of Dentistry, University of British Columbia, Vancouver, British Columbia, V6T 1Z3, Canada
Curtis B. Hughesman, X. J. David Lu, Kelly Y. P. Liu, Yuqi Zhu & Catherine F. Poh
Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia, V6T 1Z4, Canada
Curtis B. Hughesman & Charles Haynes
Department of Integrative Oncology, British Columbia Cancer Research Centre, Vancouver, British Columbia, V5Z 1L3, Canada
Curtis B. Hughesman, X. J. David Lu, Kelly Y. P. Liu, Yuqi Zhu, Rebecca M. Towle & Catherine F. Poh
Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, V6T 2B5, Canada
Catherine F. Poh

Authors

Curtis B. Hughesman
View author publications
You can also search for this author in PubMed Google Scholar
X. J. David Lu
View author publications
You can also search for this author in PubMed Google Scholar
Kelly Y. P. Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yuqi Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca M. Towle
View author publications
You can also search for this author in PubMed Google Scholar
Charles Haynes
View author publications
You can also search for this author in PubMed Google Scholar
Catherine F. Poh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.B.H. contributed to assay and study design, analyzed the data and wrote the main manuscript. X.J.D.L. contributed to study design, analyzed the data and performed experiments. K.Y.P.L. contributed to study design. Y.Z. prepared samples for the experiments. R.M.T. assisted with data analysis. C.H. and C.F.P. contributed to assay and study design, edited the manuscript and supervised the experiments. All authors reviewed and approved the manuscript.

Corresponding authors

Correspondence to Charles Haynes or Catherine F. Poh.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplemental information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hughesman, C.B., Lu, X.J.D., Liu, K.Y.P. et al. Detection of clinically relevant copy number alterations in oral cancer progression using multiplexed droplet digital PCR. Sci Rep 7, 11855 (2017). https://doi.org/10.1038/s41598-017-11201-4

Download citation

Received: 09 January 2017
Accepted: 21 August 2017
Published: 19 September 2017
DOI: https://doi.org/10.1038/s41598-017-11201-4

This article is cited by

Digital PCR-Based Method for Detecting CDKN2A Loss in Brain Tumours
- Shlomo Tsuriel
- Victoria Hannes
- Dov Hershkovitz
Molecular Diagnosis & Therapy (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.