Abstract
Targeted cancer therapies have revolutionized treatment but their efficacies are limited by the development of resistance driven by clonal evolution within tumors. We developed “CAPTURE”, a single-cell barcoding approach to comprehensively trace clonal dynamics and capture live lineage-coupled resistant cells for in-depth multi-omics analysis and functional exploration. We demonstrate that heterogeneous clones, either preexisting or emerging from drug-tolerant persister cells, dominated resistance to vemurafenib in BRAFV600E melanoma. Further integrative studies uncovered diverse resistance mechanisms. This includes a previously unrecognized and clinically relevant mechanism, chromosome 18q21 gain, which leads to vulnerability of the cells to BCL2 inhibitor. We also identified targetable common dependencies of captured resistant clones, such as oxidative phosphorylation and E2F pathways. Our study provides new therapeutic insights into overcoming therapy resistance in BRAFV600E melanoma and presents a platform for exploring clonal evolution dynamics and vulnerabilities that can be applied to study treatment resistance in other cancers.
Similar content being viewed by others
Introduction
Despite the success of targeted therapies in cancer treatment, complete cures are difficult to achieve due to the emergence of resistance. Although emerging evidence suggests that both preexisting intratumor heterogeneity and ongoing diversification of reversible drug-tolerant persister during therapy could enable some tumor cells to survive treatment and relapse1,2, the ability to both monitor clonal evolution trajectories and identify underlying resistance mechanisms or vulnerabilities remains largely limited.
Melanoma is the most lethal form of skin cancer3. Nearly half of melanomas harbor mutations of the v-raf murine sarcoma viral oncogene homolog B (BRAF) gene, leading to activation of the mitogen-activated protein kinase (MAPK) signaling pathway4. The majority of these mutations occur at codon 600, resulting in a valine substitution that is most commonly with glutamine (BRAFV600E). Therapies targeting toward mutant BRAF provide substantial benefits to patients. Nevertheless, these responses are not durable and a number of mechanisms have been identified that lead to treatment resistance5. Despite this knowledge, there continues to exist an urgent need to identify new mechanisms of tumor cell resistance and to develop new therapeutic strategies to overcome this resistance.
A variety of approaches have been developed to study cellular heterogeneity, perform lineage tracing and conduct functional screening. Single-cell sequencing is powerful to study cellular heterogeneity by obtaining a snapshot of thousands of cells6,7,8; however, it remains challenging to track individual cells at a multi-omics scale with high precision and to isolate phenotype-associated subclones in heterogeneous cell populations for functional validation. Cellular DNA barcoding approaches have been reported to efficiently and cost-effectively label and track thousands to millions of cells at the single-cell level9,10,11,12,13. However, despite many powerful applications, these methods are incapable of capturing live clonal subpopulations from heterogeneous pools for further validation and study. Genome-scale gene perturbation screens, including CRISPR-based gene deletion or activation screens14,15, have emerged as a robust functional screening strategy for mapping potential phenotype-associated genes. These approaches provide genotype-phenotype associations, but do not inform the temporal evolution and mechanism by which genes are altered physiologically in the development of the phenotype (e.g., resistance). Understanding what these alterations may be and how and when they arise may provide important insights into therapeutic strategy and drug development. Hence, capturing live lineage-coupled subpopulations from heterogeneous samples for integrative analysis and functional study is critical for understanding clonal evolution regulatory events and vulnerabilities.
New techniques aimed to enable clonal capture have been reported recently16,17,18,19; however, these approaches were either CRISPRa and miniCMV based or wild-type Cas9 based, with semi-random guide RNA (gRNA) target sites, which limits the control on inter-barcode and across genome off-target effects and on-target efficiency (see Discussion). Here, we developed a unique approach that allows the full design of each barcode for optimal off- and on-target control without sacrificing high barcode complexity using a Cas9D10A and paired-gRNA targetable unique reporter (CAPTURE) single-cell barcoding library. We demonstrated an integrative application of CAPTURE to reveal clonal dynamics of BRAFV600E melanoma cells’ response to vemurafenib and identify private and common druggable vulnerabilities of resistant cells.
Results
Establishment of the CAPTURE barcoding system
Semi-random sequences were used by other gRNA-based targetable barcoding approaches16,17,18,19 to reach high complexity, however, the gRNA sequences were known to be critical for on-target activity and off-target effects20. To develop a targetable barcoding system with high complexity and fully-designed sequences, we proposed a barcode design (Supplementary Fig. S1a) based on the Cas9D10A nickase, which requires two gRNAs to induce a DNA double-strand break21,22. The use of two gRNAs per barcode allows us to achieve millions of unique barcodes combinatorically using only thousands of fully-designed high-quality gRNAs (Supplementary Fig. S1b) and minimizes potential off-target effects induced by wild-type Cas9. To improve the targetability of the design, we constructed an all-in-one lentiviral backbone (Supplementary Fig. S1c) to deliver Cas9D10A and two gRNAs with previously optimized gRNA structure23,24. We then set out to determine the targetability of our barcode design by testing the enhanced green fluorescent protein (eGFP) switching efficiency of a set of barcodes that were placed upstream and fused in-frame with eGFP (Supplementary Fig. S1a). The eGFP switching efficiency was determined by delivering Cas9D10A and 2 gRNAs targeting each corresponding barcode. Each barcode carried a pair of reverse-oriented gRNA target sites separated by a short offset (15-bp or 9-bp was used based on previous publications21,22) flanked by two reverse-oriented protospacer adjacent motifs (PAMs). We tested eight barcodes carrying six pairs of gRNA-targeting sites (Supplementary Fig. S1a) in the human embryonic kidney (HEK) 293T cells. After transduction at a low multiplicity of infection (MOI < 0.3) followed by antibiotic selection for 7 days, CRISPR-Cas9D10A and the corresponding paired gRNAs significantly silenced eGFP fluorescence (Supplementary Fig. S1d). Flow cytometry results (Supplementary Fig. S1e) demonstrated that all barcodes were successfully targeted to switch the fluorescent signal, and the 15-bp offset (Supplementary Fig. S1a, BC5-6 vs BC7-8) resulted in higher efficiency. These results supported the targetability of our barcode design by Cas9D10A and the corresponding paired gRNAs. Meanwhile, low-level, spontaneous eGFP silencing was also observed in the cells absent of targeting (Supplementary Fig. S1e, Cas9D10A only cells), suggesting that simple eGFP could not be a good reporter of targeting due to the background.
To obtain a reporter that avoids spontaneous silencing effects, we optimized the barcode structure by adding an upstream red fluorescent protein (RFP) in-frame with the fused barcode region and downstream of eGFP (Fig. 1a). With this design, the example workflow was as shown in Fig. 1b. Barcoded cells are eGFP and RFP double-positive (RFP+/eGFP+) and targeted cells are RFP positive but eGFP negative (RFP+/eGFP–). In contrast, spontaneously silenced cells are negative for both reporters and easily distinguished (Fig. 1b). We first confirmed the paired-gRNA requirement for eGFP targeting and validated the correction of spontaneous eGFP silencing by upstream RFP (Supplementary Figs. S1f–h). We then determined the sensitivity in isolating cells with specific barcodes using the optimized approach. Briefly, barcoded pools with differing percentages of barcode BC1-carrying cells, ranging from 0.1% to 100%, were subjected to isolation using BC1-targeting gRNAs followed by fluorescence-activated cell sorting (FACS). Flow cytometry analysis (Fig. 1c) demonstrated that the CAPTURE system was capable of isolating the targeted barcoded cells at a frequency as low as 0.1% using the optimized vector. A similar efficiency of isolation was validated by isolation of barcode BC2 (Fig. 1d). The purity of isolated subpopulations was determined by amplification and sequencing of barcodes and showed that isolation purity was at least 91.6% when barcode frequency was > 1%, and 48.7% when barcode frequency was 0.1%, indicating the ability to isolate barcoded cells at a frequency as low as 0.1% with the CAPTURE approach (Fig. 1d). Editing nucleotide distribution outcomes from deep sequencing of the barcode region in the isolated populations demonstrated deletions and indicated successful targeting (Fig. 1e and Supplementary Table S1). Together, these results demonstrate the efficiency and sensitivity of the optimized CAPTURE design.
Having established the optimized vector capable of capturing unique barcodes, we then set out to engineer a CAPTURE barcoding library with tens of millions of barcodes to permit unique tagging of individual cells in complex cell populations, such as tumors. To ensure that the barcodes are highly targetable, we designed 12,000 highly active gRNA targets (Supplementary Table S2) based on two criteria. First, the gRNAs excluded stop codons in the barcodes to avoid interrupting the RFP-eGFP fusion. Second, the gRNAs were selected for high on-target scores20, maximizing targetability and specificity. Based on the existence of a complementary pairing region at the 3′ ends between the top and bottom oligonucleotides, synthesized oligonucleotides were annealed, followed by PCR extension and amplification. The amplification products were assembled with the linearized backbone (Fig. 1f). Amplicon sequencing was then employed to determine library complexity and quality. From a depth of approximately 144 million sequencing reads (Fig. 1g), more than 21 million unique barcodes were detected. The library did not show significant bias with > 95% of the observed barcodes detected by < 15 sequencing reads. As expected, the library-transduced cells were both RFP and eGFP positive (Fig. 1h).
CAPTURE barcoding reveals the clonal evolution landscape of BRAFV600E inhibition in melanoma
We applied the CAPTURE approach in the human melanoma cell line A375 to study treatment resistance using vemurafenib (also known as PLX4032, or PLX hereafter), which is used in the treatment of BRAFV600E-mutant melanoma. We transduced 3 million A375 cells at low MOI (< 20% transduced) to uniquely barcode the cells. The cells were then subjected to blasticidin selection followed by FACS to remove the non-barcoded cells. The founder-barcoded cells were then expanded to ensure sufficient barcode coverage (> 100×) prior to treatment to minimize the stochastic loss of barcodes. Barcoded cells were divided into a parallel control (treated with DMSO only for 40 days) group, to assess baseline barcode distribution, and five replicate groups treated with PLX (maintained at 2 µM for 40 days) (Fig. 2a).
As anticipated, only a small subpopulation of barcoded cells, mainly those with visible morphology changes, survived the initial treatment and resistant cells emerged after prolonged treatment. After 40 days of treatment, many individual clones and local confluent areas were obviously presented, indicating the emergence of resistant cells. The emerging cells were expanded for several additional passages and split into two halves (one for barcode analyses, the other frozen viably for downstream analyses). Barcode distribution analysis showed that a subset of barcodes in each PLX treatment group was highly enriched when compared to the control group (Fig. 2b; Supplementary Fig. S2a and Table S3). Cumulative analysis (Fig. 2c) revealed that the top ten most abundant barcodes of each of the PLX treatment groups accounted for > 75% of the final resistant pools, indicating that a small number of clones (founder cells each with a unique barcode and their offspring cells (subpopulations)) dominated the resistant cell population. Interestingly, in all the replicates, the most abundant barcode clone accounted for > 40% of the final resistant pools. Several hundred additional barcodes showed relative enrichment after treatment, but existed at a low frequency in the resistant pool, suggesting competition among the resistant subpopulations. To identify enriched barcodes for each treatment replicate, generalized log2 fold change (GFOLD)25, an algorithm developed for assigning reliable statistics based on the posterior distribution of log2 fold change for single replicate count data, was employed (Supplementary Fig. S3a, b). Using a threshold of GFOLD > 4, we identified a mean number of enriched barcodes per replicate of 44 (ranging from 38 to 50 across the five replicates) and a total of 157 unique enriched barcodes from all replicates combined (Fig. 2d and Supplementary Table S3). Given the detected barcode complexity of 0.23 million, the 157 enriched barcodes indicate that ~0.07% of clones in the founder population contribute to PLX resistance. These data suggest that a small number of competitive clones in the overall population are the primary contributors to resistance.
Based on the similar concept of Clontracer9, analysis was performed to determine the likelihood of the origin of resistance. We compared the top 50 most enriched barcodes across the five PLX-treated replicates, and found that 34%–66% of the enriched barcodes were observed in a single replicate (Fig. 2d). Although we cannot completely exclude the possibility of a stochastic loss of some barcodes during the experiment, it is more likely that these clones arose from drug-tolerant persister cells by acquiring de novo resistance alterations (late-emerging). One representative example of this late-emerging clonal variation is the barcode bc2-1 labeled cell, which constituted 84.2% of the replicate 2 (R2) resistant pool but was not observed enriched in any other replicates. Only 3 enriched barcodes (bc51234, bc41235, and bc41235-1) were shared by all replicates, indicating that these clones were most likely carrying preexisting resistance because the barcode-complexity adjusted possibility of a resistant barcode enriched in all five replicates by chance (not preexisting) was extremely low (3.86 × 10−11, see Materials and methods for calculation). Although these clones may have been present in the original pretreatment pool, their contribution to resistance varied in different replicates. For example, barcode bc51234 accounted for 0.25%, 1.09%, 0.034%, 13.90%, and 41.74% of resistant cells in the five replicates, respectively. This indicated that resistant clones may fit differently regardless of the origin of the resistance. The late-emerging resistant clone with barcode bc2-1 outperformed the preexisting bc51234 in R2. Importantly, the high percentage (34%–66%) of unique enriched barcodes for each replicate indicates a degree of randomness of acquired resistance from drug-tolerant persister cells. Together, these data suggest that both preexisting drug-resistant cells and random, late-emerging resistant cells from drug-tolerant persisters contributed to PLX resistance in A375 (Fig. 2e) and the preexisting ones do not necessarily confer a growth advantage.
Capturing BRAFV600E inhibitor-resistant subclones
We subsequently used CAPTURE system to capture cells with defined enriched barcodes (Supplementary Table S4 and Fig. S2b) for downstream analysis and validation. Based on the interest in resistance mechanisms and the sensitivity of cell isolation, we focused on isolating resistant barcoded cells from BRAF inhibitor (BRAFi) treated replicates that were at least ~0.1%. Briefly, we made all-in-one constructs for delivering Cas9D10A and paired-gRNA targeting to each of the cells with defined enriched barcodes (Supplementary Table S4). Lentiviral particles encoding each pair of gRNAs were used to infect the resistant pool cells at MOI of ~0.3 followed by 10 days of puromycin selection. The captured subclones (sorted single-cell clones from subpopulations, each with a unique barcode) were named by the barcode ID, source ID, and clone ID (Fig. 3a). Cells were captured by FACS for the RFP+/eGFP– cells (Fig. 3b and Supplementary Fig. S4a) which were further visualized by fluorescent microscopy (Supplementary Fig. S4b).
To compare different subclones with the same barcode within a replicate, we analyzed two subclones with the same barcode for three of the barcodes studied. To compare preexisting subclones with the same barcode, we selected two of the three barcodes that were enriched in all the replicate pools and, for each, captured them from the top two pools with the greatest frequency of each of those barcodes. Captured subpopulations showed heterogeneous cellular characteristics, including morphology (Supplementary Fig. S4b) and proliferation rates (Fig. 3c). To determine whether the subpopulations still retained PLX resistance, colony formation assays were performed to assess the PLX sensitivity of the parental A375 and each of the captured subclones. The cells showed very different plating efficiencies and PLX responses (Fig. 3d and Supplementary Fig. S4c). Cell toxicity was assayed (Fig. 3e) to further validate the PLX sensitivity of the captured resistant clones. Moreover, different subclones with the same barcode (e.g., bc2-1-R2a and bc2-1-R2b) showed similar phenotypes, indicating the stability of the subpopulation. Subclones with the same barcode from different replicates (e.g., bc51234-R4 and bc51234-R5) also showed similar phenotypes, confirming that these subclones were preexisting in the population prior to treatment.
Integrated analysis of captured subclones identifies resistance-associated genetic alterations
We hypothesized that captured subclones would harbor genetic and/or epigenetic alterations associated with PLX resistance. All captured subclones were evaluated by whole-exome sequencing (WES), transcriptome sequencing (RNA-seq), and methylome profiling (Infinium MethylationEPIC DNA bead chip arrays, EPIC array) analysis. To assess heterogeneity among resistant subclones with identical barcodes, we compared subclones with identical barcodes from within the same replicate pool (bc2-1-R2a vs bc2-1-R2b, bc12-1-R1a vs bc12-1-R1b, bc435-1-R4a vs bc435-1-R4b) and subclones with the identical barcodes from different replicate pools (bc41235-R4 vs bc41235-R5 and bc51234-R4 vs bc51234-R5). Both the parental A375 and the parallel DMSO-A375 cells were included as control cells. Principal component analysis (PCA) and clustering analysis (Supplementary Fig. S5) incorporating WES, RNA-seq, and EPIC array data showed that subclones with the same barcode were clustered closer than those with different barcodes, indicating that the alterations were relatively stable and capturable within a specific barcoded subpopulation.
WES was employed to explore the genetic alterations of each PLX-resistant subpopulation. Copy number variations (CNVs) occur commonly in the human genome and are more likely to have larger phenotypic effects, therefore we first evaluated CNVs from WES data. We identified both shared (i.e., common among multiple barcode subpopulations) and private (i.e., limited to a particular barcode subpopulation) gains and losses of focal or sub-chromosomal regions (Supplementary Table S5 and Fig. S6). A regional gain on the chromosome (chr) 18q21, which contains the BCL2 gene (among others), was shared by bc51234-R4, bc51234-R5, bc41235-R4, bc41235-R5, bc45-R4, and bc5-2-R5. In addition, the gain of the EGFR gene located on chr 7p (bc51234-R4 and bc51234-R5) and gain of the ETS2 gene located on chr 21q (bc2-1-R2a and bc2-1-R2b) were identified. Consistently, these CNVs were congruent with RNA-seq results and validated by genome DNA qPCR (Fig. 4a; Supplementary Fig. S7a and Table S6). Increased mRNA and protein levels of BID and BIM, two BCL2 family members, were found in patients treated with another BRAFV600E inhibitor (PLX4720) and related to its resistance26. BCL2 family proteins regulate apoptosis induced by BRAFi and inhibition of the BCL2 family increase sensitivity to mutant BRAF inhibition27, indicating that our finding of chr 18q gain may serve as a novel mechanism of BCL2 upregulation and PLX resistance. Reversible and adaptive upregulation of EGFR has been recognized to confer PLX resistance in BRAF mutant melanoma28,29. ETS2 is a major downstream effector of the MAPK pathway, and overexpression or mutation of ETS2 has been shown to confer PLX resistance30,31. The mRNA expression of these genes was validated by RT-qPCR (Supplementary Fig. S7b).
Some of the subpopulations did not show significant CNVs (e.g., bc12-1, bc345-1, bc345-2, and bc534-2). Somatic alterations (SAs), including single nucleotide variations (SNVs) and small insertions/deletions, are another source of genetic variation that drive clonal evolution32. We identified PLX resistance-associated SAs from WES data and filtered for variants observed in the vehicle-treated A375 cells (Supplementary Table S7). Between 22 and 68 SAs were detected in each captured subpopulation, of which more than half were nonsynonymous mutations (Fig. 4b). We also investigated the SNV pattern within subpopulations and noted that all subpopulations, including both preexisting (e.g., bc51234 and bc41235) and late-emerging (e.g., bc2-1 and bc5-2) resistance, demonstrated similar mutational signatures (Supplementary Fig. S8), indicating that the SNV patterns are not driven by PLX. To nominate promising resistance-associated genes (Fig. 4c), we generated a prioritization score (see Materials and methods) for each alteration using a similar strategy33. The highest-ranking SAs (Supplementary Fig. S9) included those previously reported as PLX resistance-associated, such as NRASQ61K (barcode subpopulations bc345-1 and bc345-2) and NRASQ61R (barcode subpopulation bc12-1). NRASQ61K and NRASQ61R have been reported as important drivers of PLX resistance34,35,36,37, highlighting that the CAPTURE approach is capable of identifying endogenous resistance-associated gene mutations. Moreover, the bc345-1 and bc345-2 subpopulations were very similar in all omics analyses (Fig. 4c and Supplementary Fig. S5), indicating that they were offspring cells of a common ancestor clone with NRASQ61K that was preexisting in the parental cell line.
We also identified other SAs of genes that have been reported as associated with PLX resistance (Fig. 4c and Supplementary Fig. S9), including mutations of the MAP3K11, GPS1, GMIP, and NOTCH1 genes. MAP3K11 is a mixed lineage kinase that activates MEK independently of RAF to mediate resistance to RAF inhibitors38. GPS1 is a suppressor of MAPK signaling39, the principal pathway involving BRAF. GMIP regulates RhoA whose activation contributes to PLX resistance40,41. NOTCH1 is downregulated in melanoma cell lines with intrinsic and acquired resistance to BRAF inhibition42.
Integrated analysis of captured clones reveals resistance-associated epigenetic alterations
In addition to genetic alterations, heterogeneity of DNA methylation has also been demonstrated to contribute to clonal evolution43. We compared PLX-resistant captured clones to control cells (the parental A375 and the parallel DMSO-A375) by EPIC array and identified hundreds to thousands of differentially-methylated positions (DMPs) (Supplementary Fig. S5 and Table S8). Differentially-methylated genes (DMGs) in the resistant clones were further identified (Supplementary Table S9) by methylated CpG set enrichment analysis (mCSEA)44. The correlation between gene promoter DNA methylation and gene silencing has long been recognized45. We next identified a list of genes (Fig. 4d and Supplementary Table S10) whose promoter methylation status was correlated with gene expression regulation by integrating methylation data with RNA-seq data. Among the genes identified, the promotor of SOX10 was hypermethylated with the corresponding underexpression of the gene in the bc435-1 subpopulation (Supplementary Fig. S10a). Loss of SOX10 expression has been reported as a mechanism of adaptive resistance to mutant BRAF inhibition28,46 and our study further showed that this could result from promoter hypermethylation, consistent with the finding of SOX10 methylation and silencing47. Interestingly, another SRY family gene, SOX11, was hypermethylated and underexpressed in the bc41235 subpopulation (Supplementary Fig. S10b). The promoter of NEK9, whose silencing is associated with mutant BRAF inhibition48, was hypermethylated and the gene was underexpressed in bc345-1 and bc345-2 (Supplementary Fig. S10c).
In addition to downregulated targets, promoters of EPHA3 and SPP1 (Fig. 4e and Supplementary Fig. S10d) were hypomethylated and these genes were overexpressed in bc5-2 and bc534-2, respectively. EPHA3 is a member of the Ephrin family of receptor tyrosine kinases, members of which have been shown to have a role in escaping BRAF inhibition49. EPHA3 has also been shown to be regulated by DNA methylation50. EPHA2 is a mediator of vemurafenib resistance and a novel therapeutic target in melanoma51. SPP1 was found to stimulate preneoplastic cellular proliferation through activation of the MAPK pathway52.
Together, these data demonstrate that both the preexisting and late-emerging genetic and epigenetic alterations fuel clonal evolution. These alterations, and the clones that harbor them, can be isolated and identified using the CAPTURE approach, highlighting the application of this technology for integrative clonal characterization.
PLX-resistant clones with chr 18q gain are vulnerable to BCL2 inhibition
In addition to known mechanisms of PLX resistance (e.g., NRAS mutation and SOX10 downregulation), chr 18q gain is a previously unreported mechanism of PLX resistance that was observed in four of the barcoded subpopulations from our CAPTURE analysis. Among each of the replicate pools, barcoded subpopulations with chr 18q gain accounted for a mean of 22% of all resistant cells. Of all the genes at this locus, we focused on BCL2 due to the reported implication of BCL2 family proteins in PLX sensitivity26,27,53. To evaluate the effect of BCL2 inhibition, we treated cells with the BCL2 inhibitor ABT263 and performed a cell viability assay. The parental A375 and parallel control (DMSO-A375) cells were insensitive to ABT263 treatment with half-maximal inhibitory concentrations (IC50) of > 3 µM (Fig. 5a, b), consistent with a previous report53. Conversely, all subpopulations with chr 18q gain were sensitive to ABT263 treatment with a > 10-fold decrease in IC50 (Fig. 5a, b and Supplementary Table S11), indicating a strong dependency of these cells on BCL2 for survival. Interestingly, all the other PLX-resistant clones were also relatively sensitive to ABT263 compared to control cells (Fig. 5a, b), indicating that other PLX-resistant clones may also benefit from BCL2 inhibition, albeit with lower sensitivity than those with chr 18q gain. Similar results were confirmed by colony formation assays (Fig. 5c and Supplementary Fig. S11a). Moreover, inhibition of BCL2 by low dose (100 nM) ABT263 re-sensitized the chr 18q gain resistant subpopulations, but not the parent cells or other subpopulations without chr 18q gain, to BRAF inhibition (Supplementary Fig. S11b, c).
Clinically, a trend towards higher BCL2 mRNA expression in the BRAFi-resistant samples compared with pretreatment samples was observed (Fig. 5d) by analyzing a published cohort of BRAFi-treated melanoma patients54. Notably, the expression of BCL2 was similar to and positively correlated with other genes at chr 18q21.33 (but not randomly chosen genes from other genomic loci) (Fig. 5e and Supplementary Fig. S12a) in this BRAFi patients data set, indicating that genes of this chromosome locus had either undergone the same DNA copy number changes or common transcriptional control. The strong positive correlation of BCL2 expression with chr 18q21.33 genes was not observed in a non-BRAFi data set (Supplementary Fig. S12b), which ruled out the possibility of common transcriptional control of genes from this locus. Hence, chr 18q gain was at least partially involved in the upregulation of BCL2 and resistance of BRAFi-treated patients. Consistently, BCL2 expression of pretreatment samples was negatively correlated with the degree of tumor regression (Fig. 5f) and progression-free survival (PFS; Fig. 5g). In an additional published cohort55, the Bcl2 expression level was higher in most (13 out of 17) of the PLX-resistant samples when compared to the matching initial sensitive biopsies (Fig. 5h). Moreover, we performed western blot to examine Bcl2 protein expression in three additional BRAF-mutant melanoma cell lines that were selected for resistance by treating with vemurafenib for about 40 days (a similar treatment time to the A375 study); and higher Bcl2 expression was observed in two of the three cell lines (Fig. 5i). Taken together, these results reveal a previously unrecognized and clinically relevant mechanism, whereby BCL2 upregulation resulting from either preexisting or acquired chr 18q gain confers PLX resistance, which can be overcome by BCL2 inhibition. Importantly, these discoveries illustrate the potential utility of the CAPTURE approach to identify and test clonal vulnerabilities.
Identification of potential common transcriptional dependencies of BRAFi-resistant clones
Since diverse alterations can fuel clonal evolution along with the stochastic development of late-emerging resistance, it might be impossible to overcome resistance by targeting a single mechanism alone. Thus, we set out to explore potential common transcriptional programs shared by the captured resistant cells carrying different resistance mechanisms. Transcriptomic analysis (Fig. 6a and Supplementary Fig. S13a) identified a set of commonly differentially expressed genes (DEGs) shared by all the captured clones that account for ~70% of the resistant cells. A gene signature comprised of the 30 upregulated genes among resistant cells negatively correlated with PFS (Fig. 6b), indicating the potential relevance of upregulated genes to BRAFi response. Interestingly, among the top upregulated genes, CDYL2 was previously identified as a melanoma accelerator in a BRAFV600E model56. CDYL2 is potentially druggable57, as well as some other upregulated genes (e.g., PPP1R15A, CD274, and HDAC9), providing potential therapeutic opportunities. Furthermore, gene set enrichment analysis (GSEA) also identified commonly enriched pathways of captured resistant clones (Fig. 6c). Among these pathways, the oxidative phosphorylation pathway has been reported to be related to BRAFi response and resistance, leading to oxidative metabolism dependence of resistant cells in BRAFV600E melanoma58,59, although there was a discordance (whether the regulation was PGC1α dependent or not) between the two studies. Consistently, the captured resistant cells were more sensitive to an oxidative phosphorylation inhibitor compared to parental control cells (Fig. 6d). Intriguingly, our clonal resolution results further revealed that PGC1α was only upregulated in some of the clones (e.g., bc41235) but not in others (Fig. 6e), indicating that both PGC1α upregulation-dependent and -independent signals were involved in regulating oxidative metabolism, which may account for the discordance between the two studies58,59. Another commonly enriched pathway of the captured clones, the MYC target pathway, was previously reported as a major targetable pathway activated by diverse pathways that drive BRAFi resistance in BRAF-mutant melanoma60. In addition to these dependencies, our results revealed the E2 factor (E2F) target pathway as another common pathway enriched in the captured resistant clones (Fig. 6c). These clones were more sensitive to E2F inhibition by YKL-5-12461 (Fig. 6f, g and Supplementary Fig. S13b). Collectively, these findings demonstrate integrative usage of CAPTURE to identify potential common vulnerabilities and provide potential therapeutic opportunities for more broadly overcoming resistance in BRAFV600E melanoma.
Potential single-cell application and upgradability of CAPTURE
Although the enriched cells could be captured by CAPTURE as shown in the above studies, single-cell technologies remain powerful complementary approaches for studying the rare cells (limited by the throughput and sensitivity of cell isolation) in the population. However, single-cell sequencing coverage may be biased towards more abundant clones in the population studied. Therefore, we next set out to explore a potential indirect application of the CAPTURE to reduce this bias. In our application of CAPTURE to study BRAFi resistance, for example, bc2-1 accounted for 84% of resistant cells in our replicate 2 pool. To determine whether CAPTURE could be used to filter out dominant clones and improve coverage of rare clones for single-cell sequencing, three populations were analyzed, including (1) the DMSO-treated A375 cells (DMSO-A375); (2) the PLX-resistant replicate pool 2 cells (A375-R2); and (3) A375-R2 cells with the dominant bc2-1 barcoded cells removed by application of CAPTURE and sorting for RFP+/GFP+ cells (A375-R2-Δbc2-1). Populations were multiplexed by hashing with barcoded antibodies62 for 10× Genomics single-cell RNA sequencing. Approximately 17,908 reads were obtained per cell, which generated a median of 8016 unique molecular identifiers per cell, 2548 expressed genes per cell, and more than 22,000 total genes detected in the population.
Projection of cells in a t-distributed stochastic neighbor-embedding (t-SNE) analysis provided a visual representation of cell clustering based on transcriptomic profiles. The first two-dimensional t-SNE projection segregated cells into 20 major clusters of transcriptomes by unsupervised clustering (Fig. 7a), demonstrating intercellular heterogeneity. Cells from each sample were highlighted by the expression of corresponding hashing oligonucleotides in the t-SNE projections, respectively (Fig. 7b–d). Compared to DMSO-A375 cells, a highly enriched population was shown on the t-SNE plot for the PLX-resistant cells (A375-R2), consistent with the enrichment of the barcode bc2-1 population in this sample. Interestingly, this population included three clusters (clusters 2, 5, and 11), indicating heterogeneous subpopulations within the bc2-1 lineage and suggesting further clonal evolution. The bc2-1 cell population was effectively depleted (from 51.89% to 3.96%) by comparing these clusters of the A375-R2 and A375-R2-Δbc2-1 (Fig. 7c, d). Importantly, coverage of the remaining subpopulations within A375-R2-Δbc2-1 improved as demonstrated by the mutually exclusive pattern of t-SNE projections between A375-R2 and A375-R2-Δbc2-1. For example, coverage of subpopulations of clusters 0, 3, and 6 were all increased by ~5-fold after depletion of bc2-1 by CAPTURE (Fig. 7c, d). The presented data here support the CAPTURE barcoding system as an effective upstream tool in the application of single-cell technologies to improve the coverage of rare clones.
Another potential way to study rare cells is to increase the sensitivity of cell capture. Outlier events due to fluorescence anomalies have been an issue with flow cytometry63,64 and limit the sensitivity for rare cell sorting. Taking advantage of the frameshifting feature of CAPTURE, which allows simultaneously shift-off and shift-on markers, we next set out to explore its upgradability to a system that shifts one marker for background elimination, one for pre-enrichment, and a third marker for sorting. The reporter of CAPTURE version 1 (v1) was replaced by a polycistronic reporter linked using 2 A self-cleaving peptides to construct the v2 (Fig. 7e). On the basis of the barcode-controlled GFP-RFP fusion, we added a P2A peptide linked in-frame blasticidin selection marker (BSD), following which an HA-tagged puromycin selection marker (PuroR) was linked by a + 1 bp frameshifted T2A peptide. With this reporter, unique marker combinations could be used to differentiate different cell events at both barcoding and capturing stages (Fig. 7f). During barcoding, BSD selection could help to remove the imperfectly barcoded cells. During capturing, puromycin selection could be applied to enrich target cells to minimize the influence of outlier events during sorting. To ensure the performance, the codons of the reporter were optimized so that no stop codons would be induced by –1 frameshifting and no –1 ribosomal frameshifting sites would be present. We then tested the puromycin selection of CAPTURE-v2. Barcoded cells without CRISPR targeting were totally sensitive to puromycin, while a portion of barcoded cells with anti-barcode CRISPR targeting were resistant to puromycin (Fig. 7g). Consistently, barcoded cells with CRISPR targeting showed PuroR-HA expression, which was strengthened after puromycin selection and was undetectable in non-targeted barcoded cells (Fig. 7h). These results indicate that barcode targeting did induce frameshifting leading to PuroR-HA expression and puromycin resistance. Next, we determined the sensitivity by capturing cells carrying a barcode (BC2-v2) at different percentages (10%–0.001%). Flow cytometry analysis showed that puromycin selection indeed significantly enriched RFP+/GFP– cells at all percentages without decreasing performance at a low percentage (Fig. 7i). Moreover, amplicon sequencing of the pre-enriched and sorted cells confirmed the purity of captured cells was ~90% in all groups (Fig. 7j), without a decrease in lower percentage groups. These data indicate that by modifying the reporter, CAPTURE-v2 allows pre-enrichments for sorting, enabling the capture of rare cells comprising < 0.001% of the population.
Discussion
In this study, we identify a novel mechanism of BRAFi resistance and uncover private and common vulnerabilities of resistant cells in BRAFV600E melanoma using an integrative methodology for investigating drug resistance and clonal evolution. This method relies on a Cas9D10A and paired-gRNA targetable unique reporter (CAPTURE) single-cell barcoding approach. Studies of clonal evolution using barcoding libraries have been previously reported; however, our method makes significant improvements over prior approaches that facilitate the discovery of novel resistance mechanisms. For example, in contrast to the previously reported ClonTracer barcode system9, an important advance provided by CAPTURE is the ability to specifically capture phenotype-associated clones from heterogeneous pools and link them to their clonal trajectories. This permits application to a broad range of integrated investigations to identify drivers and/or vulnerabilities.
During our manuscript preparation and submission, several conceptually similar approaches have been published16,17,18,19, emphasizing the importance of lineage-specific isolation. Compared to all these approaches (Supplementary Fig. S14), CAPTURE is the only one with the capability to obtain tens of million fully-designed (not semi-random) barcodes using tens of thousands of precise synthetic oligos, due to the use of an off-target minimizing paired-gRNA barcode design and Cas9 nickase system21. Hence, CAPTURE has the optimal control of the overall library functionality by controlling the off-target effects (either inter-barcode or across the genome) and on-target efficiency of each gRNA used for library construction, which is critical for CRISPR-based technology20. Moreover, COLBERT19, CaTCH16, and B-GLI18 were all based on miniCMV, which was known for its leakiness17,65, especially when some of the semi-random sequence barcodes may also serve as binding elements of endogenous transcription factors that could be highly active during the clonal evolution (e.g., two GATA-1 sites and others were found in CaTCH BC1 by PROMO66 analysis) and lead to another source of potential off-target activation. CloneSifter17 utilized semi-random sequence gRNA barcodes targeted by wild-type Cas9 that was known for its off-target20, especially with semi-random gRNAs. Most of the sorting-based approaches17,18,19 have a sensitivity of ~0.1%, perhaps due to the background signal and flow outlier that are often observed63,64; surprisingly, CaTCH had a sensitivity of 0.001%, which may be because its barcoded cells “were sorted for low GFP (CaTCH reporter) expression” before the clonal isolation experiment. Although it did not have miniCMV-related leakiness, our CAPTURE version 1 (v1) was also sorting-based and showed sensitivity at ~0.1%. Taking advantage of the frameshifting capability, we also explored the upgradability of CAPTURE to simultaneously shift-off a fluorescent reporter and shift-on an antibiotic selection marker, which minimizes the background and flow-outlier noise by antibiotic pre-enrichment. This upgraded version, CAPTURE-v2 showed a sensitivity of < 0.001%. Moreover, most other approaches (CaTCH, B-GLI, and CloneSifter) require pre-transduction of Cas9-related components (dCas9-VPR or tetR-Cas9), which may introduce more artificial effects due to the selection of pre-transduced cells and more lentiviral integrations. COLBERT does not require pre-transduction because it utilizes transfection, which would limit its usage for hard-to-transfect cells. CAPTURE does not require pre-transduction or transfection. Due to the involvement of cell freezing and thawing, we would also like to note one potential common limitation of all these clonal isolation approaches, including CAPTURE, that could miss certain unstable non-genetic mechanisms, although they are usually less important because resistant cells with non-persist mechanisms might easily become sensitive again after a drug holiday. Overall, CAPTURE provides a distinctive barcode tracing and isolation approach that is highly specific, sensitive, and controllable.
Applying CAPTURE, we demonstrated that a small number of single-cell clones, from preexisting resistant clones or drug-tolerant persister cells, dominated resistance to vemurafenib in BRAFV600E melanoma. The emergence of resistance from persister cells was stochastic, making any strategy to target resistant cells a major challenge. These findings suggest that an ideal treatment strategy to delay or even prevent resistance should both target preexisting resistant cells and drug-tolerant persister cells, especially the cycling ones67. Integration of multi-omics analyses led to the identification of potential resistance-associated alterations, including SAs, CNVs, DEGs, and DMPs. For example, NRAS mutations (Q61R and Q61K) have been validated as a major BRAFi resistance mechanism34,35,36, highlighting the ability of CAPTURE to identify resistance drivers. Additionally, CAPTURE further provided evidence for the selection of preexisting NRASQ61K mutant subpopulations from the parental bulk in response to therapeutic stress. SOX10 downregulation has been known to play a critical role in PLX resistance28,46 and we used CAPTURE to reveal the clonal epigenetic (promoter methylation) adaption of SOX10. Moreover, chr 18q amplification arising from both preexisting and late-emerging evolution paths was identified as another major novel mechanism in several captured clones by upregulating BCL2 expression. Chromosomal instability, including chr 18 gain, was recently observed to be involved in anti-cancer therapies resistance68. Clinically, a reanalysis of a published cohort54 demonstrated the involvement of chr 18q gain in BRAFi resistance of patients. Chr 18q gain was also detected in cutaneous melanoma69,70, supporting the possibility of preexisting this alteration in patients. Importantly, we also demonstrated that resistant clones with 18q gain were vulnerable to and re-sensitized by BCL2 inhibition. These discoveries expose a vulnerability of a previously unrecognized but clinically relevant resistance in BRAFV600E melanoma.
Since various genetic and/or epigenetic alterations, either preexisting or acquired, can fuel clonal evolution and resistance, another consideration for more broadly overcoming resistance is whether these cells with diverse resistance exhibit convergent resistance, which may lead to common vulnerabilities of resistance by different mechanisms. Interestingly, we identified several targetable common dependencies of our captured cells. Our finding of oxidative phosphorylation and MYC pathways as common addictions are consistent with and provides additional evidence for previous observations58,59,60. Additionally, we identified the E2F pathway as another novel druggable vulnerability of BRAFi resistance, and our observation of potential targetable commonly upregulated genes may also provide additional therapeutical opportunities. The molecular mechanisms by which these pathways and genes are regulated by the diverse pathways of resistance in melanoma are therefore important areas for future study.
In addition to the direct applications to the captured clones, by sorting out the dominant subpopulations, CAPTURE can be used to enrich rare subpopulations and improve the detection of rare events by standard sequencing71 or the coverage of single-cell sequencing technologies, as evidenced by our single-cell RNA sequencing of a CAPTURE-filtered sample. Furthermore, future work may enable the combination of CAPTURE with 10× Genomics feature-barcoding technology to allow profiling of gene expression in conjunction with the CAPTURE barcode sequence from the same single cell that would enable capturing lineages with the transcriptome of interest.
In this study, we prioritized the SAs by integration of multi-omic data, alteration pathogenicity, clinical relevance, and CRISPR functional screen scores. In more complex systems with large numbers of candidate SAs, SAs identified by CAPTURE could be validated by saturation genome editing technology72. CAPTURE also lends itself to functional genomic screening14,15 of captured clones, particularly when there is little a priori information about the possible mechanism involved in the specific phenotype being studied. Indeed, CAPTURE allows specific identification of vulnerabilities to reverse the phenotype of interest of captured clonal populations. This functional approach is exemplified by the identification of chr 18q gain in captured PLX-resistant clones, which led to the discovery of a vulnerability to BCL2 inhibition. The editing outcomes of captured clones might be useful in the future as a secondary source of information about within-clone diversity. Although we have demonstrated the use of CAPTURE in studying the clonal response to drug treatment in vitro, we anticipate that CAPTURE will be applicable to a broad range of biological and technological settings and should provide insights into clonal dynamics and regulatory events in heterogeneous cell populations.
Materials and methods
Vector construction
The constructs used in this study were generated using standard molecular cloning technologies, including PCR, restriction enzyme digestion and ligation. Custom oligonucleotides (Supplementary Table S12) were purchased from Sigma-Aldrich. Sequences of the constructs were verified with Sanger sequencing. Lenti-CMV-BC-eGFP vector was modified from pHAGE-CMV-MCS-PGK-puro73 with multiple modifications. The puromycin (Puro) resistance gene was replaced with the blasticidin (BSD) resistance gene, which was amplified from Lenti-Cas9-2A-Blast (Addgene, #73310)74. Then, eGFP, amplified from EGFP-C1 (Clontech), was inserted. The CAPTURE vector (Lenti-EFL-RFP-BC-eGFP) was modified from Lenti-CMV-BC-eGFP with two additional modifications. CMV was replaced with EF1-α intron A promoter, which was amplified from pHAGE-EF1alphaL-hAAT-W (Addgene, #24527)75. RFP was amplified from pL-CRISPR.EFS.tRFP (Addgene, #57819)76. Individual barcodes were constructed by insertion of annealed oligonucleotides into MluI/BamHI restriction sites of Lenti-CMV-BC-eGFP or the CAPTURE vector. CAPTURE-v2 was constructed by replacing the reporter of CAPTURE-v1. The lenti-CRISPR-V2-Cas9D10A-U6-optimized-gRNA vector was modified from lenti-CRISPR v2 (Addgene, #52961)77 by introducing the D10A mutant of Cas9 through site-direct mutagenesis and replacing the original gRNA scaffold with an optimized version synthesized from Genscript. Each gRNA was cloned by inserting annealed oligonucleotides into the lenti-CRISPR-V2-Cas9D10A-U6-optimized-gRNA vector at the Esp3I restriction site. A second PCR-amplified U6-optimized-gRNA was inserted into EcoRI/NheI sites to make lenti-CRISPR-V2-Cas9D10A-2×U6-optimized-gRNA (Supplementary Fig. S1c).
Cell culture, transfection, lentivirus production, and transduction
HEK 293T (ATCC) cells were cultured in DMEM with 10% fetal bovine serum (FBS) (Sigma). A375 (ATCC) cells were cultured in RPMI-1640 medium supplemented with 10% FBS (Sigma). WM2664, SK-MEL-5, and A2058 cells were gifts from Dr. Michael Davies (the University of Texas MD Anderson Cancer Center) and were cultured under the same condition as A375 cells. LookOut® Mycoplasma PCR Detection Kit (Sigma) was used to confirm that our cells were negative for mycoplasma contamination. Transfections were performed using Lipofectamine 3000 Reagent (Thermo Fisher) according to the manufacturer’s instructions. The CAPTURE or CRISPR plasmids were co-transfected with psPAX2 (Addgene, #12260) and pMD2.G (Addgene, #12259) at a ratio of 2:2:1 into HEK 293T cells. The supernatant containing the lentivirus was harvested 72 h after transfection and filtered with a 0.45 µm filter (Millipore). Appropriate amounts of lentivirus were added to cells supplemented with 8 µg/mL polybrene (Millipore) to transduce the cells. To achieve an MOI < 0.3, the amount of lentivirus resulting in a transduction efficiency of < 30% was used. Forty-eight hours after transduction, cells were subjected to blasticidin (5 µg/mL) or puromycin (2 µg/mL) selection for 10 days.
Flow cytometry analysis of eGFP targeting and assessment of barcode capturing sensitivity
HEK 293T cells were transduced with barcodes (BC1–BC8 in Lenti-CMV-BC-eGFP vector) carrying lentivirus as described above to barcode the cells. Barcoded cells were then targeted by Cas9D10A and a pair of gRNAs corresponding to each of the barcodes. The eGFP targeting efficiency was determined by the Gallios flow cytometer (Beckman Coulter). Similarly, HEK 293T cells were barcoded with BC1 and BC2 contained within the optimized Lenti-EFL-RFP-BC-eGFP vector. BC1-barcoded cells were pooled with BC2-barcoded cells at the following percentages: 100%, 10%, 1%, and 0.1%. Similar experiments were performed for CAPTURE-v2 at ratios from 0.001% to 10%. Cas9D10A and paired-gRNA targeting the barcodes were delivered to the pools. Puromycin (1 µg/mL) selection was carried out for CAPTURE-v2 pre-enrichment. RFP+/eGFP– cells were sorted using MoFlo Astrios with BSLII hood (Beckman Coulter) or SY3200™ highly automated parallel sorting (HAPS) cell sorters. DNA was extracted from the sorted cells, followed by amplicon sequencing of the barcode region to confirm specific targeting by Cas9D10A. Cytometry data was analyzed using FlowJo V10; a representative gating strategy was shown in Supplementary Fig. S15.
Barcode amplicon sequencing and analysis
TruSeq-style primers (Supplementary Table S12) were synthesized to amplify the amplicon library for Illumina sequencing. The P5 primer pool consisted of the P5 primer sequence combined with a staggering region of various lengths (0–8 bp) to increase the sequence diversity of the resulting library. Each P7 primer included an index of 8 bases for multiplexing samples in a sequencing lane. DNA from the CAPTURE library or barcoded cells, which represent > 20-fold barcode coverage, was used as the template to amplify the barcode region using the P5 primer pool and P7 primer with a specific sample index. The PCR was performed using ExTaq (Clontech) with the following thermal cycling parameters: 95 °C 1 min, 26 cycles of (95 °C 10 s, 54 °C 30 s, 72 °C 30 s), 72 °C 5 min. The PCR products were size-selected using a 2% agarose gel and purified using the QIAquick Gel Extraction kit (Qiagen). A secondary purification was performed using a 1× SPRIselect reagent (GE Healthcare, B23317). The purified products were quantified using KAPA Library Quantification Kit (Roche). 10% PhiX was included when loading the libraries to an appropriate Illumina sequencer based on required coverage and library size. For counting the barcodes and their fractions in each sample, FASTQ files were generated from the sequencing runs. Reads were trimmed for quality with Trimmomatic version 0.3678 and then Unix Grep and Regex were used for those matching the CGTCCG(N20)GCCACCATGGTCGAC(N20)CGGTAG motif. To keep only the barcode sequence, flanking bases were trimmed using the UNIX cut command. The barcodes were mapped to the reference sequences using the UNIX awk command. A combination of UNIX sort and uniq commands were used to count the barcodes. After these steps, barcodes that were 100% matched to reference were counted. The barcode count matrix was imported into R (version 3.5.1) for counting the barcode fraction and downstream analyses. The GFOLD (generalized fold change)25 algorithm was employed to produce reliable statistics based on the posterior distribution of log2 fold change following the manual. The possibility of a resistant barcode enriched in any r number of replicates out of the total five replicates by chance was estimated using the formula C (5, r) * Pr * (1–P)5–r, where P was the possibility that one barcoded population was selected as resistance (~0.0007 in this study, based on the 157 enriched barcodes out of the total detected barcode complexity of 0.23 million). Hence, the possibility of a resistant barcode enriched in all five replicates by chance (not preexisting) was extremely low (1.68 × 10−16). Considering the barcode complexity of 0.23 million, the adjusted possibility of a resistant barcode enriched in all five replicates by chance (not preexisting) was still extremely low (3.86 × 10−11). The R package pheatmap was used to generate heatmaps. CRISPResso279 was employed to analyze editing outcomes of targeted barcodes from amplicon sequencing.
CAPTURE library construction
Oligonucleotides (Supplementary Table S2) were synthesized on a CustomArray 12 K array (CustomArray Inc) and the assembled oligonucleotide pool was amplified by PCR using Phusion HS Flex (NEB) and primers LibF and LibR (Supplementary Table S12) with the following thermal cycling parameters: 98 °C 30 s, 25 cycles of (98 °C 10 s, 55 °C 30 s, 72 °C 20 s), 72 °C 5 min. The PCR products were size-selected using a 2% agarose gel and purified using the QIAquick Gel Extraction kit (Qiagen). The CAPTURE backbone was linearized with MluI and BamHI digestion and treated with FastAP before gel purification. Linearized vector (200 ng) and inserts (50 ng) were assembled in a 50 µL reaction using Gibson Assembly Master Mix (NEB). The assembled product was desalted by drop dialysis using a Type-VS Millipore membrane (Millipore). The desalted product was electroporated using 20 reactions of Lucigen electrocompetent cells (Endura) following the manufacturer’s suggested parameters and protocol. The transformation reactions were pooled. Diluted products (104-, 105-, and 106-fold dilution) were plated for calculating transformation efficiency to ensure the coverage of complexity. The remainder of the transformation was seeded into four 250 mL Erlenmeyer flasks containing 80 mL Luria broth (LB) liquid media with 100 µg/mL carbenicillin and cultured at 30 °C overnight. The CAPTURE library plasmid was extracted using the ZymoPURE Plasmid Maxiprep Kit (Zymo Research). The CAPTURE library was sequenced by amplicon sequencing described above to evaluate the barcode distribution of the library.
Clonal tracking and capturing of A375 in response to PLX4032
A375 cells (~3 million) were transduced with CAPTURE carrying lentivirus at a transfection rate < 20% by monitoring the fluorescent signal to ensure MOI of < 0.3 as described above. A larger barcode library was used to transduce a much small number of cells, which is important in order to avoid having more than one founder cell with the same barcode. To remove the non-barcoded cells, the cells were then subjected to blasticidin (5 µg/mL) selection for 10 days, followed by sorting for RFP+/eGFP+ cells. The founder-barcoded cells were then expanded to ensure sufficient cell number coverage of each barcode at the start of treatment (> 100× coverage for each treatment) and to minimize the stochastic loss of barcodes. Barcoded cells were divided into a barcode baseline distribution control group (DMSO-A375), which was treated with DMSO in parallel with five replicate treated pools (R1 through R5), each treated with 2 µM PLX. Cells were refreshed with the vehicle- or drug-containing medium every two days and passaged as needed. Forty days after the initial treatment, the cells were harvested and split into two halves. One was subjected to barcode amplicon sequencing for tracking barcode distribution, the other was expanded and frozen for downstream use. Barcode amplicon sequencing and analysis were performed as described above. To capture cells with a barcode of interest, each pair of gRNAs targeting the barcode were prepared in the lentiviral backbone. Cells were transduced with barcodes targeting CRISPR/Cas9D10A as described above. RFP+/eGFP– cells were sorted into a 96-well plate at one cell per well. The fluorescent signal was confirmed by fluorescence microscopy.
Colony formation assay, cell viability assay, and cell proliferation assay
Colony formation assays were performed in 12-well plates by plating 5000 cells (A375-bc41235) and 1500 cells (all others). Treatment of cells with the appropriate drugs began the next day. Drugs were renewed every 2–3 days. Plates were stained with crystal violet (1% dissolved in 10% ethanol and 90% water) between 8 and 10 days after the start of the experiment. The crystal violet staining of the colonies was extracted with 10% acetic acid and quantified by measuring the absorbance of the extracted dye at 590 nm. For the cell viability assay, cells were plated in triplicate into 96-well plates at a density of 5000 cells per well. Sixteen hours after cell seeding, cells were treated with appropriate drugs at the indicated concentration. All the drugs were purchased from MedChemExpress (PLX4032 (HY-12057), ABT263 (HY-10087), IACS-010759 (HY-112037), and YKL-5-124 (HY-101257)). Vehicle (DMSO) was used as needed to equalize the solvent concentration. After 2 days of treatment, cell viability was measured using Cell Counting Kit-8 (CCK-8, Dojindo Molecular Technologies, Inc.) or Deepblue (Biolegend) according to the manufacturer’s instructions. For cell proliferation assays, 1000 cells were seeded in 96-well plates and CCK-8 was used to measure the cell proliferation on day 0 (start point), day 2, day 4, and day 6. Prism 9 (Graphpad) was used to analyze cell viability and proliferation data.
WES and analysis
Exome sequencing was performed on two control cells (the parental A375 and the vehicle-treated parallel DMSO-A375) and 15 captured PLX-resistant clones (bc12-1-R1a, bc12-1-R1b, bc2-1-R2a, bc2-1-R2b, bc345-1-R3, bc345-2-R3, bc45-R4, bc435-1-R4a, bc435-1-R4b, bc41235-R4, bc41235-R5, bc5-2-R5, bc534-2-R5, bc51234-R4, and bc51234-R5). Genomic DNA was extracted using AllPrep DNA/RNA Mini Kit (Qiagen). Exomes were captured using the Illumina Nextera Flex kit (45 Mb exonic content) according to the manufacturer’s instructions and sequenced on an Illumina NovaSeq6000 sequencer with paired-end 100-bp reads. The average coverage was between 50- and 98-fold for the samples. Sequencing reads were trimmed for adapters and quality with Trimmomatic version 0.3678, and aligned to the human genome (hg19) using BWA-MEM version 0.7.1780 with default parameters. Duplicate reads were removed using Sambamba version 0.6.881, and BAM files were further processed with GATK version 3.882 and LoFreq version 2.1.283 for realignment around indels and base quality score recalibration. SAs, including point mutations and small indels, were called by GATK HaplotypeCaller and LoFreq. CNV analysis was performed on recalibrated BAM files using Control-FREEC version 11.384 following the manual with all default WES setting and “sex=XX” to exclude chr Y from the analysis. With either control cells as “normal”, the MuTect2 variant caller from GATK was used to call clonal specific SAs and the Control-FREEC was used to call clonal CNVs. Very similar results were obtained using either control cells and the results using “DMSO-A375” as the control was presented (Supplementary Tables S5, 7). Mutational signatures of the samples were analyzed with MutationalPatterns 3.685 from Variant Call Format (VCF) files following the package vignette. Downstream analysis and visualization were performed in R (version 3.5.1). WES data has been submitted to Sequence Read Archive (SRA) under accession code PRJNA565471.
RNA-seq and analysis
Total RNA was extracted using AllPrep DNA/RNA Mini Kit (Qiagen), and RNA-seq libraries were prepared with the Illumina TruSeq Stranded mRNA kit according to the manufacturer’s instructions. Paired-end reads (PE50) were sequenced on an Illumina HiSeq4000 sequencer. Reads were trimmed for adapters and quality with Trimmomatic version 0.3678 and aligned to the human reference genome (hg19) with STAR version 2.586 using standard settings. A genes-to-samples count matrix was generated using featureCounts87 with standard settings. DEGs were analyzed with DESeq288. The top 30 upregulated genes (CNST, MYH16, SMG1P1, ADAMTS6, STARD13, FGF5, PPP1R15A, CRYBG3, RGMB, CD22, NUDT11, CD274, MME, PCDHA4, SHANK1, HIVEP2, SCHIP1, CDYL2, MGLL, ABCC9, GAREM1, ARHGAP22, ADAMTS18, ABL2, SERPINB7, HDAC9, DTNA, GEM, GPR176, and MIR222HG) between captured resistant clones and control cells (the parental A375 and the vehicle-treated parallel DMSO-A375) by DESeq2 analysis were chosen as signature to calculate resistant gene signature score (res_score). Transcriptional levels were quantified as Fragments Per Kilobase of exon per Million fragments mapped (FPKM). PCA was conducted to investigate the relationship between samples. GSEA was performed using GSEA89. RNA-seq data have been submitted to the Gene Expression Omnibus (GEO) repository (GSE137309). RNA-seq data and clinical information from a cohort of BRAFi-treated patients were kindly shared by the authors54. Single-sample GSEA (ssGSEA) based signature score was analyzed using corto90.
Genome-wide DNA methylation analysis
Genomic DNA was extracted using AllPrep DNA/RNA Mini Kit (Qiagen), and bisulfite conversion was performed using EZ DNA Methylation-Gold Kit (Zymo Research). Genome-wide DNA methylation analysis was performed using InfiniumEPIC DNA Methylation Bead Chip arrays (Illumina) to determine the methylation status at 853,307 CpG sites. Illumina intensity data (IDAT) files from the arrays were further processed using the ChAMP91 pipeline following the vignette. DNA methylation level was reported as β value, ranging from zero to one, where zero represents non-methylated and one represents fully methylated, for every CpG site. Differentially-methylated CpG sites (Supplementary Table S8) were identified when β difference > 0.4 and adjusted P value (Benjamini–Hochberg method, FDR) < 0.05. Methylation array data were deposited into GEO under accession code GSE137452.
Multi-omic integrative analysis
To determine the effect of CNVs, the CNV profiling results were integrated with RNA-seq data by checking whether the CNVs led to expected expression changes of genes located in the corresponding chromosomal regions. Non-expressed genes (the genes whose FPKM values were < 0.1 in all samples) were excluded, and OmicCircos92 was employed to visualize the mRNA levels and CNV profiles of genes located on corresponding chromosomal regions. BiomaRt93 was used to derive gene annotations.
To prioritize coding SNVs and small indels, the results were integrated with multiple lines of evidence to assign each alteration a priority score (PS) for generating a ranked list of mutations (Supplementary Table S7). To exclude mutations that are likely passengers, genes that were not expressed were scored “0”. The remaining mutations were evaluated based on a computation that integrated alternative allele frequency, the likelihood of deleterious alteration, CRISPR screen functional score, and clinical relevance. To score alternative allele frequency, the frequencies were multiplied by 100 so that the alternative score (AS) ranged from 0 to 100. To evaluate the likelihood of deleterious alteration, CADD94, SIFT95, and PROVEAN96 were used. Frameshift deletion, stop codon gain mutation, and splicing could not be scored by the tools and were assigned with the maximum score because these alterations may introduce larger effects. Other alterations that could not be scored were assigned with the median value. Scores from each computational tool were scaled to a range of 0–100. The average of the scores from these three tools (CADD, SIFT, PROVEAN) was used as the deleterious score (DS). To obtain a CRISPR screen functional score (CS), raw counts were obtained from a previous publication14. The data were analyzed using MAGeCK97, and mutations were scored by the gene’s log2 fold change, which was also scaled to a range of 0–100. To score the clinical relevance, the genetic landscape of clinical resistance was obtained from a prior publication37. A gene score (GS) of “100” was assigned to the mutation of any gene observed in the published landscape; otherwise, the gene was scored as “0”. Similarly, mutations that were reported in the landscape were assigned a mutation score (MS) of “100”; otherwise, the mutation was scored as “0”. Finally, the PS was calculated based on these scores, by taking the average of AS, DS, CS, GS, and MS. PS also ranged from 0 to 100, with a higher score indicating the higher possibility of the alteration being associated with resistance.
To identify gene promoter methylation that affected gene expression, the DNA methylation matrix was analyzed by mCSEA44 to identify DMG promoters in PLX resistance subpopulations compared with controls (FDR ≤ 0.25). The DMGs were then mapped with the RNA-seq DEG results (using q value < 0.05) as described above to obtain a list of genes potentially regulated by methylation (Supplementary Table S9).
Single-cell sequencing and analysis
PLX replicate pool 2 resistant A375 (A375-R2) cells were transduced with bc2-1 targeting lentivirus to turn off the eGFP fluorescence of the cell subpopulation carrying bc2-1. Replicate 2 resistant A375 cells without bc2-1 subpopulation (A375-R2-Δbc2-1) were collected by sorting for RFP+/eGFP+ cells using an SY3200 cell sorter (SONY). A375-R2, A375-R2-Δbc2-1, and DMSO-A375 cells were subjected to single-cell RNA sequencing.
Single-cell sequencing was performed as described previously62. Briefly, cells were labeled with hashtags (TotalSeq-A0007 for DMSO-A375, TotalSeq-A0132 for A375-R2 and TotalSeq-A0125 for A375-R2-Δbc2-1) from Biolegend. Single-cell samples were processed with Single Cell 3′ Reagent Kit V2 (10× Genomics) following the manufacturer’s protocol up to the cDNA amplification step. Additive primers (Supplementary Table S12) were added at the cDNA amplification step to increase the yield of hashtagged product. After amplification, RNA-derived cDNA and hashtag DNA were separated using a 0.6× SPRIselect reagent (GE Healthcare, B23317). RNA-derived cDNA was eluted from the beads to prepare a cDNA library according to the 10× Genomics protocol. Hashtag DNA was purified from supernatant to prepare the hashtag library by second amplification and purification. The cDNA and hashtag libraries were pooled and loaded onto an Illumina NovaSeq6000 sequencer. Single-cell RNA expression raw reads were de-multiplexed and mapped to the hg19 reference genome by 10× Genomics Cell Ranger version 3.0.0 using default parameters to generate an expression matrix and the hashtag raw reads were analyzed and generated a count matrix using CITE-seq-Count98. All downstream single-cell analyses were performed using Seurat99. The top 25 aligned correlated components were used as input for t-SNE dimension reduction and clustering analysis.
Western blot and qPCR
The whole-cell lysate was extracted using RIPA buffer, and a western blot was performed following standard protocol. Primary antibodies anti-HA-tag (Cat# 2367) and anti-Bcl-2 (Cat# 4223) were from Cell Signaling Technology, and anti-tubulin (Cat# 12004166) was from BioRad. The results were developed using ChemiDoc MP Imaging System (BioRad) and quantified using Image Lab (BioRad). PowerUp™ SYBR™ Green Master Mix (Thermo Fisher, A25741) was used for qPCR using an Applied Biosystems ViiA 7 real-time PCR system. Primers were included in Supplementary Table S12.
Statistical analysis
Survival curve, Pearson correlation analysis, violin plot, paired-sample boxplot, and the relative statistical analysis were performed using R (version 3.5.1). All other statistical analyses were performed using Prism 9 software (GraphPad). Data were presented as means ± standard deviation (SD) or standard error of the mean (SEM) with statistical significance determined by tests as indicated in figure legends.
Data availability
All RNA-seq and EPIC array data were deposited in the GEO repository under accession codes GSE137309 and GSE137452. All WES data were deposited to SRA under accession code PRJNA565471. Other data that support the findings of this research are available from the corresponding author upon reasonable request.
References
Marusyk, A., Janiszewska, M. & Polyak, K. Intratumor heterogeneity: the rosetta stone of therapy resistance. Cancer Cell 37, 471–484 (2020).
Rehman, S. K. et al. Colorectal cancer cells enter a diapause-like DTP state to survive chemotherapy. Cell 184, 226–242.e21 (2021).
Matthews, N. H., Li, W. Q., Qureshi, A. A., Weinstock, M. A. & Cho, E. In Cutaneous Melanoma: Etiology and Therapy. Ch 1 (eds Ward, W. H. & Farma, J. M.) Exon Publications (2017).
Rubinstein, J. C. et al. Incidence of the V600K mutation among melanoma patients with BRAF mutations, and potential therapeutic response to the specific BRAF inhibitor PLX4032. J. Transl. Med. 8, 67 (2010).
Proietti, I. et al. Mechanisms of acquired BRAF inhibitor resistance in melanoma: a systematic review. Cancers 12, 2801 (2020).
Ren, X., Kang, B. & Zhang, Z. Understanding tumor ecosystems by single-cell sequencing: promises and limitations. Genome Biol. 19, 211 (2018).
Potter, S. S. Single-cell RNA sequencing for the study of development, physiology and disease. Nat. Rev. Nephrol. 14, 479–492 (2018).
Patel, A. P. et al. Single-cell RNA-seq highlights intratumoral heterogeneity in primary glioblastoma. Science 344, 1396–1401 (2014).
Bhang, H. E. et al. Studying clonal dynamics in response to cancer therapy using high-complexity barcoding. Nat. Med. 21, 440–448 (2015).
Hata, A. N. et al. Tumor cells can follow distinct evolutionary paths to become resistant to epidermal growth factor receptor inhibition. Nat. Med. 22, 262–269 (2016).
Nguyen, L. V. et al. Barcoding reveals complex clonal dynamics of de novo transformed human mammary cells. Nature 528, 267–271 (2015).
Lan, X. et al. Fate mapping of human glioblastoma reveals an invariant stem cell hierarchy. Nature 549, 227–232 (2017).
Seth, S. et al. Pre-existing functional heterogeneity of tumorigenic compartment as the origin of chemoresistance in pancreatic tumors. Cell Rep. 26, 1518–1532.e9 (2019).
Shalem, O. et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science 343, 84–87 (2014).
Gilbert, L. A. et al. Genome-scale CRISPR-mediated control of gene repression and activation. Cell 159, 647–661 (2014).
Umkehrer, C. et al. Isolating live cell clones from barcoded populations using CRISPRa-inducible reporters. Nat. Biotechnol. 39, 174–178 (2021).
Feldman, D. et al. CloneSifter: enrichment of rare clones from heterogeneous cell populations. BMC Biol. 18, 177 (2020).
Akimov, Y., Bulanova, D., Abyzova, M., Wennerberg, K. & Aittokallio, T. DNA barcode-guided lentiviral CRISPRa tool to trace and isolate individual clonal lineages in heterogeneous cancer cell populations. bioRxiv https://doi.org/10.1101/622506 (2019).
Al’Khafaji, A. M., Deatherage, D. & Brock, A. Control of lineage-specific gene expression by functionalized gRNA barcodes. ACS Synth. Biol. 7, 2468–2474 (2018).
Doench, J. G. et al. Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9. Nat. Biotechnol. 34, 184–191 (2016).
Ran, F. A. et al. Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity. Cell 154, 1380–1389 (2013).
Shen, B. et al. Efficient genome modification by CRISPR-Cas9 nickase with minimal off-target effects. Nat. Methods 11, 399–402 (2014).
Chen, B. et al. Dynamic imaging of genomic loci in living human cells by an optimized CRISPR/Cas system. Cell 155, 1479–1491 (2013).
Dang, Y. et al. Optimizing sgRNA structure to improve CRISPR-Cas9 knockout efficiency. Genome Biol. 16, 280 (2015).
Feng, J. et al. GFOLD: a generalized fold change for ranking differentially expressed genes from RNA-seq data. Bioinformatics 28, 2782–2788 (2012).
Frederick, D. T. et al. Clinical profiling of BCL-2 family members in the setting of BRAF inhibition offers a rationale for targeting de novo resistance using BH3 mimetics. PLoS ONE 9, e101286 (2014).
Serasinghe, M. N. et al. Anti-apoptotic BCL-2 proteins govern cellular outcome following B-RAF(V600E) inhibition and can be targeted to reduce resistance. Oncogene 34, 857–867 (2015).
Sun, C. et al. Reversible and adaptive resistance to BRAF(V600E) inhibition in melanoma. Nature 508, 118–122 (2014).
Girotti, M. R. et al. Inhibiting EGF receptor or SRC family kinase signaling overcomes BRAF inhibitor resistance in melanoma. Cancer Discov. 3, 158–167 (2013).
Johannessen, C. M. et al. A melanocyte lineage program confers resistance to MAP kinase pathway inhibition. Nature 504, 138–142 (2013).
Wagle, N. et al. MAP kinase pathway alterations in BRAF-mutant melanoma patients with acquired resistance to combined RAF/MEK inhibition. Cancer Discov. 4, 61–68 (2014).
Brady, S. W. et al. Combating subclonal evolution of resistant cancer phenotypes. Nat. Commun. 8, 1231 (2017).
Behan, F. M. et al. Prioritization of cancer therapeutic targets using CRISPR-Cas9 screens. Nature 568, 511–516 (2019).
Romano, E. et al. Identification of multiple mechanisms of resistance to vemurafenib in a patient with BRAFV600E-mutated cutaneous melanoma successfully rechallenged after progression. Clin. Cancer Res. 19, 5749–5757 (2013).
Luebker, S. A. & Koepsell, S. A. Diverse mechanisms of BRAF inhibitor resistance in melanoma identified in clinical and preclinical studies. Front. Oncol. 9, 268 (2019).
Kakadia, S. et al. Mechanisms of resistance to BRAF and MEK inhibitors and clinical update of US Food and Drug Administration-approved targeted therapy in advanced melanoma. Onco. Targets Ther. 11, 7095–7107 (2018).
Van Allen, E. M. et al. The genetic landscape of clinical resistance to RAF inhibition in metastatic melanoma. Cancer Discov. 4, 94–109 (2014).
Marusiak, A. A. et al. Mixed lineage kinases activate MEK independently of RAF to mediate resistance to RAF inhibitors. Nat. Commun. 5, 3901 (2014).
Spain, B. H. et al. Two human cDNAs, including a homolog of Arabidopsis FUS6 (COP11), suppress G-protein- and mitogen-activated protein kinase-mediated signal transduction in yeast and mammalian cells. Mol. Cell Biol. 16, 6698–6706 (1996).
Johnson, J. L., Monfregola, J., Napolitano, G., Kiosses, W. B. & Catz, S. D. Vesicular trafficking through cortical actin during exocytosis is regulated by the Rab27a effector JFC1/Slp1 and the RhoA-GTPase-activating protein Gem-interacting protein. Mol. Biol. Cell 23, 1902–1916 (2012).
Misek, S. et al. Rho-mediated signaling promotes BRAF inhibitor resistance in de-differentiated melanoma cells. Oncogene 39, 1466–1483 (2020).
Mikheil, D. M. et al. Notch signaling activation induces cell death in MAPKi-resistant melanoma cells. Pigment Cell Melanoma Res. 32, 528–539 (2019).
Hao, J. J. et al. Spatial intratumoral heterogeneity and temporal clonal evolution in esophageal squamous cell carcinoma. Nat. Genet. 48, 1500–1507 (2016).
Martorell-Marugan, J., Gonzalez-Rumayor, V. & Carmona-Saez, P. mCSEA: detecting subtle differentially methylated regions. Bioinformatics 35, 3257–3262 (2019).
Greenberg, M. V. C. & Bourc’his, D. The diverse roles of DNA methylation in mammalian development and disease. Nat. Rev. Mol. Cell Biol. 20, 590–607 (2019).
Song, C. et al. Recurrent tumor cell-intrinsic and -extrinsic alterations during MAPKi-induced melanoma regression and early adaptation. Cancer Discov. 7, 1248–1265 (2017).
Jin, S. G., Xiong, W., Wu, X., Yang, L. & Pfeifer, G. P. The DNA methylation landscape of human melanoma. Genomics 106, 322–330 (2015).
Phadke, M. et al. Dabrafenib inhibits the growth of BRAF-WT cancers through CDK16 and NEK9 inhibition. Mol. Oncol. 12, 74–88 (2018).
Lo, R. S. Receptor tyrosine kinases in cancer escape from BRAF inhibitors. Cell Res. 22, 945–947 (2012).
Dottori, M., Down, M., Huttmann, A., Fitzpatrick, D. R. & Boyd, A. W. Cloning and characterization of EphA3 (Hek) gene promoter: DNA methylation regulates expression in hematopoietic tumor cells. Blood 94, 2477–2486 (1999).
Miao, B. et al. EPHA2 is a mediator of vemurafenib resistance and a novel therapeutic target in melanoma. Cancer Discov. 5, 274–287 (2015).
Luo, X., Ruhland, M. K., Pazolli, E., Lind, A. C. & Stewart, S. A. Osteopontin stimulates preneoplastic cellular proliferation through activation of the MAPK pathway. Mol. Cancer Res. 9, 1018–1029 (2011).
Lee, E. F. et al. BCL-XL and MCL-1 are the key BCL-2 family proteins in melanoma cell survival. Cell Death Dis. 10, 342 (2019).
Kwong, L. N. et al. Co-clinical assessment identifies patterns of BRAF inhibitor resistance in melanoma. J. Clin. Invest. 125, 1459–1470 (2015).
Long, J. E. et al. Therapeutic resistance and susceptibility is shaped by cooperative multi-compartment tumor adaptation. Cell Death Differ. 26, 2416–2429 (2019).
Fazio, M. et al. SATB2 induction of a neural crest mesenchyme-like program drives melanoma invasion and drug resistance. Elife 10, e64370 (2021).
Barnash, K. D. et al. Chromodomain ligand optimization via target-class directed combinatorial repurposing. ACS Chem. Biol. 11, 2475–2483 (2016).
Haq, R. et al. Oncogenic BRAF regulates oxidative metabolism via PGC1alpha and MITF. Cancer Cell 23, 302–315 (2013).
Corazao-Rozas, P. et al. Mitochondrial oxidative stress is the achille’s heel of melanoma cells resistant to Braf-mutant inhibitor. Oncotarget 4, 1986–1998 (2013).
Singleton, K. R. et al. Melanoma therapeutic strategies that select against resistance by exploiting MYC-driven evolutionary convergence. Cell Rep. 21, 2796–2812 (2017).
Olson, C. M. et al. Development of a selective CDK7 covalent inhibitor reveals predominant cell-cycle phenotype. Cell Chem. Biol. 26, 792–803.e10 (2019).
Stoeckius, M. et al. Cell hashing with barcoded antibodies enables multiplexing and doublet detection for single cell genomics. Genome Biol. 19, 224 (2018).
Fletez-Brant, K., Spidlen, J., Brinkman, R. R., Roederer, M. & Chattopadhyay, P. K. flowClean: automated identification and removal of fluorescence anomalies in flow cytometry data. Cytometry A 89, 461–471 (2016).
Meskas, J., Yokosawa, D., Wang, S., Segat, G. C. & Brinkman, R. flowCut — An R package for precise and accurate automated removal of outlier events and flagging of files based on time versus fluorescence analysis. Cytometry A https://doi.org/10.1002/cyto.a.24670 (2022).
Ede, C., Chen, X., Lin, M. Y. & Chen, Y. Y. Quantitative analyses of core promoters enable precise engineering of regulated gene expression in mammalian cells. ACS Synth. Biol. 5, 395–404 (2016).
Farre, D. et al. Identification of patterns in biological sequences at the ALGGEN server: PROMO and MALGEN. Nucleic Acids Res. 31, 3651–3653 (2003).
Oren, Y. et al. Cycling cancer persister cells arise from lineages with distinct programs. Nature 596, 576–582 (2021).
Lukow, D. A. et al. Chromosomal instability accelerates the evolution of resistance to anti-cancer therapies. Dev. Cell 56, 2427–2439.e4 (2021).
Bastian, B. C., LeBoit, P. E., Hamm, H., Brocker, E. B. & Pinkel, D. Chromosomal gains and losses in primary cutaneous melanomas detected by comparative genomic hybridization. Cancer Res. 58, 2170–2175 (1998).
Pirker, C. et al. Chromosomal imbalances in primary and metastatic melanomas: over-representation of essential telomerase genes. Melanoma Res. 13, 483–492 (2003).
Chin, E. L., da Silva, C. & Hegde, M. Assessment of clinical analytical sensitivity and specificity of next-generation sequencing for detection of simple and complex mutations. BMC Genet. 14, 6 (2013).
Findlay, G. M. et al. Accurate classification of BRCA1 variants with saturation genome editing. Nature 562, 217–222 (2018).
Zhang, Q. et al. DUB3 deubiquitinates and stabilizes NRF2 in chemotherapy resistance of colorectal cancer. Cell Death Differ. 26, 2300–2313 (2019).
Hart, T. et al. High-resolution CRISPR screens reveal fitness genes and genotype-specific cancer liabilities. Cell 163, 1515–1526 (2015).
Wilson, A. A. et al. Amelioration of emphysema in mice through lentiviral transduction of long-lived pulmonary alveolar macrophages. J. Clin. Invest. 120, 379–389 (2010).
Heckl, D. et al. Generation of mouse models of myeloid malignancy with combinatorial genetic lesions using CRISPR-Cas9 genome editing. Nat. Biotechnol. 32, 941–946 (2014).
Sanjana, N. E., Shalem, O. & Zhang, F. Improved vectors and genome-wide libraries for CRISPR screening. Nat. Methods 11, 783–784 (2014).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Clement, K. et al. CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nat. Biotechnol. 37, 224–226 (2019).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Tarasov, A., Vilella, A. J., Cuppen, E., Nijman, I. J. & Prins, P. Sambamba: fast processing of NGS alignment formats. Bioinformatics 31, 2032–2034 (2015).
McKenna, A. et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Wilm, A. et al. LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets. Nucleic Acids Res. 40, 11189–11201 (2012).
Boeva, V. et al. Control-FREEC: a tool for assessing copy number and allelic content using next-generation sequencing data. Bioinformatics 28, 423–425 (2012).
Blokzijl, F., Janssen, R., van Boxtel, R. & Cuppen, E. MutationalPatterns: comprehensive genome-wide analysis of mutational processes. Genome Med. 10, 33 (2018).
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. USA 102, 15545–15550 (2005).
Mercatelli, D., Lopez-Garcia, G. & Giorgi, F. M. corto: a lightweight R package for gene network inference and master regulator analysis. Bioinformatics 36, 3916–3917 (2020).
Tian, Y. et al. ChAMP: updated methylation analysis pipeline for Illumina BeadChips. Bioinformatics 33, 3982–3984 (2017).
Hu, Y. et al. OmicCircos: a simple-to-use R package for the circular visualization of multidimensional omics data. Cancer Inf. 13, 13–20 (2014).
Durinck, S., Spellman, P. T., Birney, E. & Huber, W. Mapping identifiers for the integration of genomic datasets with the R/Bioconductor package biomaRt. Nat. Protoc. 4, 1184–1191 (2009).
Rentzsch, P., Witten, D., Cooper, G. M., Shendure, J. & Kircher, M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 47, D886–D894 (2019).
Sim, N. L. et al. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 40, W452–W457 (2012).
Choi, Y. & Chan, A. P. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics 31, 2745–2747 (2015).
Li, W. et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 15, 554 (2014).
Roelli, P., Bimber, B., Flynn, B., Revale, S. & Gui, G. Hoohm/CITE-seq-Count: 1.4.2. https://doi.org/10.5281/zenodo.2590196 (2019).
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902.e21 (2019).
Acknowledgements
We thank Andrea Viale and Alessandro Carugo at the University of Texas MD Anderson Cancer Center (MDACC) for helpful discussion; Ying Hu at the National Institutes of Health/National Cancer Institute (NIH/NCI) for help in adjusting the Circos plot parameters. We thank Jerome Karp at NYU Langone Health for proofreading. Next-generation sequencing was performed by the NYU Langone Genome Technology Center, which is supported in part by grant P30CA016087 from NIH/NCI. Cell sorting/flow cytometry technologies were provided by NYU Langone’s Cytometry and Cell Sorting Laboratory (supported in part by grant P30CA016087) and the MDACC Flow Cytometry and Cellular Imaging Core Facility (supported by NCI grant P30CA16672). STR DNA fingerprinting was done by the Cancer Center Support Grant-funded Characterized Cell Line core (supported by NCI grant CA016672). This study was supported by R01 CA190121-01 (Z.-Y.Z., R.G.W.V., and E.P.S.) and by the DefeatGBM collaborative of the National Brain Tumor Association (Q.W. and E.P.S.).
Author information
Authors and Affiliations
Contributions
E.P.S., Z.-Y.Z., and R.G.W.V. conceived and designed the study; Z.-Y.Z. designed, performed, and analyzed most experiments; Y.D. performed and analyzed the EPIC array and provided assistance in other experiments; Z.-Y.Z. carried out most computational analyses of sequencing data with help from T.L., Y.D., Q.W., and J.Y.; R.E. provided administrative support and critical suggestions; A.S.M., H.Z., A.T., A.F., and G.F.D. provided critical suggestions; Z.-Y.Z. and E.P.S. wrote the manuscript with help from other authors.
Corresponding authors
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhang, ZY., Ding, Y., Ezhilarasan, R. et al. Lineage-coupled clonal capture identifies clonal evolution mechanisms and vulnerabilities of BRAFV600E inhibition resistance in melanoma. Cell Discov 8, 102 (2022). https://doi.org/10.1038/s41421-022-00462-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41421-022-00462-7