Prioritization of cancer therapeutic targets using CRISPR–Cas9 screens

Behan, Fiona M.; Iorio, Francesco; Picco, Gabriele; Gonçalves, Emanuel; Beaver, Charlotte M.; Migliardi, Giorgia; Santos, Rita; Rao, Yanhua; Sassi, Francesco; Pinnelli, Marika; Ansari, Rizwan; Harper, Sarah; Jackson, David Adam; McRae, Rebecca; Pooley, Rachel; Wilkinson, Piers; van der Meer, Dieudonne; Dow, David; Buser-Doepner, Carolyn; Bertotti, Andrea; Trusolino, Livio; Stronach, Euan A.; Saez-Rodriguez, Julio; Yusa, Kosuke; Garnett, Mathew J.

doi:10.1038/s41586-019-1103-9

Article
Published: 10 April 2019

Prioritization of cancer therapeutic targets using CRISPR–Cas9 screens

Fiona M. Behan^1,2^na1,
Francesco Iorio^1,2,3^na1,
Gabriele Picco¹^na1,
Emanuel Gonçalves¹,
Charlotte M. Beaver¹,
Giorgia Migliardi^4,5,
Rita Santos⁶,
Yanhua Rao⁷,
Francesco Sassi⁴,
Marika Pinnelli^4,5,
Rizwan Ansari¹,
Sarah Harper¹,
David Adam Jackson¹,
Rebecca McRae¹,
Rachel Pooley¹,
Piers Wilkinson¹,
Dieudonne van der Meer¹,
David Dow^2,6,
Carolyn Buser-Doepner^2,7,
Andrea Bertotti^4,5,
Livio Trusolino^4,5,
Euan A. Stronach^2,6,
Julio Saez-Rodriguez^2,3,8,9,10,
Kosuke Yusa^1,2^na2^nAff11 &
…
Mathew J. Garnett^1,2^na2

Nature volume 568, pages 511–516 (2019)Cite this article

121k Accesses
680 Citations
660 Altmetric
Metrics details

Subjects

Abstract

Functional genomics approaches can overcome limitations—such as the lack of identification of robust targets and poor clinical efficacy—that hamper cancer drug development. Here we performed genome-scale CRISPR–Cas9 screens in 324 human cancer cell lines from 30 cancer types and developed a data-driven framework to prioritize candidates for cancer therapeutics. We integrated cell fitness effects with genomic biomarkers and target tractability for drug development to systematically prioritize new targets in defined tissues and genotypes. We verified one of our most promising dependencies, the Werner syndrome ATP-dependent helicase, as a synthetic lethal target in tumours from multiple cancer types with microsatellite instability. Our analysis provides a resource of cancer dependencies, generates a framework to prioritize cancer drug targets and suggests specific new targets. The principles described in this study can inform the initial stages of drug development by contributing to a new, diverse and more effective portfolio of cancer drug targets.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Fig. 1: Target prioritization framework.**

**Fig. 2: Target prioritization and biomarker discovery.**

**Fig. 3: Priority targets and biomarker-linked dependencies.**

**Fig. 4: Cancer-type priority targets.**

**Fig. 5: WRN is a target in MSI cancer cells.**

Integrated cross-study datasets of genetic dependencies in cancer

Article Open access 12 March 2021

Revolutionizing DNA repair research and cancer therapy with CRISPR–Cas screens

Article 13 February 2023

Agreement between two large pan-cancer CRISPR-Cas9 gene dependency data sets

Article Open access 20 December 2019

Data availability

Data and analyses are included in the published article and supplementary data 1, 2 and 3 are available from FigShare (https://figshare.com/projects/CRISPRtargetID/60146). The gene fitness scores of the cell lines, raw counts of the sgRNA data, and processed data and results are available from the project Score web portal: https://score.depmap.sanger.ac.uk.

Code availability

Software code are available through GitHub at https://github.com/francescojm/CRISPRcleanR, https://github.com/francescojm/ADAM and https://github.com/francescojm/BAGELR.

References

Garraway, L. A. Genomics-driven oncology: framework for an emerging paradigm. J. Clin. Oncol. 31, 1806–1814 (2013).
Article PubMed Google Scholar
Zehir, A. et al. Mutational landscape of metastatic cancer revealed from prospective clinical sequencing of 10,000 patients. Nat. Med. 23, 703–713 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hay, M., Thomas, D. W., Craighead, J. L., Economides, C. & Rosenthal, J. Clinical development success rates for investigational drugs. Nat. Biotechnol. 32, 40–51 (2014).
Article CAS PubMed Google Scholar
Koike-Yusa, H., Li, Y., Tan, E.-P., Del Castillo Velasco-Herrera, M. & Yusa, K. Genome-wide recessive genetic screening in mammalian cells with a lentiviral CRISPR–guide RNA library. Nat. Biotechnol. 32, 267–273 (2014).
Article CAS PubMed Google Scholar
Meyers, R. M. et al. Computational correction of copy number effect improves specificity of CRISPR–Cas9 essentiality screens in cancer cells. Nat. Genet. 49, 1779–1784 (2017).
Article CAS PubMed PubMed Central Google Scholar
van der Meer, D. et al. Cell Model Passports—a hub for clinical, genetic and functional datasets of preclinical cancer models. Nucleic Acids Res. 47, D923–D929 (2019).
Article PubMed CAS Google Scholar
Iorio, F. et al. A landscape of pharmacogenomic interactions in cancer. Cell 166, 740–754 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hart, T. et al. High-resolution CRISPR screens reveal fitness genes and genotype-specific cancer liabilities. Cell 163, 1515–1526 (2015).
Article CAS PubMed Google Scholar
Hart, T. et al. Evaluation and design of genome-wide CRISPR/SpCas9 knockout screens. G3 (Bethesda) 7, 2719–2727 (2017).
Article CAS Google Scholar
Tzelepis, K. et al. A CRISPR dropout screen identifies genetic vulnerabilities and therapeutic targets in acute myeloid leukemia. Cell Rep. 17, 1193–1205 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wang, T., Wei, J. J., Sabatini, D. M. & Lander, E. S. Genetic screens in human cells using the CRISPR–Cas9 system. Science 343, 80–84 (2014).
Article ADS CAS PubMed Google Scholar
McDonald, E. R. III et al. Project DRIVE: a compendium of cancer dependencies and synthetic lethal relationships uncovered by large-scale, deep RNAi screening. Cell 170, 577–592 (2017).
Article CAS PubMed Google Scholar
Massacesi, C. et al. PI3K inhibitors as new cancer therapeutics: implications for clinical trial design. OncoTargets Ther. 9, 203–210 (2016).
Article CAS Google Scholar
Brown, K. K. et al. Approaches to target tractability assessment — a practical perspective. MedChemComm 9, 606–613 (2018).
Article CAS PubMed PubMed Central Google Scholar
Viswanathan, V. S. et al. Dependency of a therapy-resistant state of cancer cells on a lipid peroxidase pathway. Nature 547, 453–457 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chu, W. K. & Hickson, I. D. RecQ helicases: multifunctional genome caretakers. Nat. Rev. Cancer 9, 644–654 (2009).
Article CAS PubMed Google Scholar
Cortes-Ciriano, I., Lee, S., Park, W.-Y., Kim, T.-M. & Park, P. J. A molecular portrait of microsatellite instability across multiple cancers. Nat. Commun. 8, 15180 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Haugen, A. C. et al. Genetic instability caused by loss of MutS homologue 3 in human colorectal cancer. Cancer Res. 68, 8465–8472 (2008).
Article CAS PubMed PubMed Central Google Scholar
Perry, J. J. P. et al. WRN exonuclease structure and molecular mechanism imply an editing role in DNA end processing. Nat. Struct. Mol. Biol. 13, 414–422 (2006).
Article CAS PubMed Google Scholar
Kamath-Loeb, A. S., Welcsh, P., Waite, M., Adman, E. T. & Loeb, L. A. The enzymatic activities of the Werner syndrome protein are disabled by the amino acid polymorphism R834C. J. Biol. Chem. 279, 55499–55505 (2004).
Article CAS PubMed Google Scholar
Ketkar, A., Voehler, M., Mukiza, T. & Eoff, R. L. Residues in the RecQ C-terminal domain of the human Werner Syndrome helicase are involved in unwinding G-quadruplex DNA. J. Biol. Chem. 292, 3154–3163 (2017).
Article CAS PubMed PubMed Central Google Scholar
Chan, E. M. et al. WRN helicase is a synthetic lethal target in microsatellite unstable cancers. Nature https://doi.org/10.1038/s41586-019-1102-x (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Saydam, N. et al. Physical and functional interactions between Werner syndrome helicase and mismatch-repair initiation factors. Nucleic Acids Res. 35, 5706–5716 (2007).
Article CAS PubMed PubMed Central Google Scholar
Opresko, P. L., Sowd, G. & Wang, H. The Werner syndrome helicase/exonuclease processes mobile D-loops through branch migration and degradation. PLoS ONE 4, e4825 (2009).
Article ADS PubMed PubMed Central CAS Google Scholar
Myung, K., Datta, A., Chen, C. & Kolodner, R. D. SGS1, the Saccharomyces cerevisiae homologue of BLM and WRN, suppresses genome instability and homeologous recombination. Nat. Genet. 27, 113–116 (2001).
Article CAS PubMed Google Scholar
Le, D. T. et al. PD-1 blockade in tumors with mismatch-repair deficiency. N. Engl. J. Med. 372, 2509–2520 (2015).
Article CAS PubMed PubMed Central Google Scholar
Tsherniak, A. et al. Defining a cancer dependency map. Cell 170, 564–576 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wang, T. et al. Gene essentiality profiling reveals gene networks and synthetic lethal interactions with oncogenic Ras. Cell 168, 890–903 (2017).
Article CAS PubMed PubMed Central Google Scholar
Ballouz, S. & Gillis, J. AuPairWise: a method to estimate RNA-seq replicability through co-expression. PLOS Comput. Biol. 12, e1004868 (2016). Home (25 Doggett St)
Article ADS PubMed PubMed Central CAS Google Scholar
Hart, T. & Moffat, J. BAGEL: a computational framework for identifying essential genes from pooled library screens. BMC Bioinformatics 17, 164 (2016).
Article PubMed PubMed Central CAS Google Scholar
Yoshihama, M. et al. The human ribosomal protein genes: sequencing and comparative analysis of 73 genes. Genome Res. 12, 379–390 (2002).
Article CAS PubMed PubMed Central Google Scholar
Iorio, F. et al. Unsupervised correction of gene-independent cell responses to CRISPR–Cas9 targeting. BMC Genomics 19, 604 (2018).
Article PubMed PubMed Central CAS Google Scholar
Anders, S. & Huber, W. Differential expression analysis for sequence count data. Genome Biol. 11, R106 (2010).
Article CAS PubMed PubMed Central Google Scholar
Aguirre, A. J. et al. Genomic copy number dictates a gene-independent cell response to CRISPR/Cas9 targeting. Cancer Discov. 6, 914–929 (2016).
Article CAS PubMed PubMed Central Google Scholar
Li, W. et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 15, 554 (2014).
Article PubMed PubMed Central CAS Google Scholar
Subramanian, A. et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl Acad. Sci. USA 102, 15545–15550 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Durinck, S. et al. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics 21, 3439–3440 (2005).
Article CAS PubMed Google Scholar
Cerami, E. G. et al. Pathway Commons, a web resource for biological pathway data. Nucleic Acids Res. 39, D685–D690 (2011).
Article CAS PubMed Google Scholar
Iorio, F. et al. Pathway-based dissection of the genomic heterogeneity of cancer hallmarks’ acquisition with SLAPenrich. Sci. Rep. 8, 6713 (2018).
Article ADS PubMed PubMed Central CAS Google Scholar
GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article CAS Google Scholar
Cokelaer, T. et al. GDSCTools for mining pharmacogenomic interactions in cancer. Bioinformatics 34, 1226–1228 (2018).
Article CAS PubMed Google Scholar
Storey, J. D. & Tibshirani, R. Statistical significance for genomewide studies. Proc. Natl Acad. Sci. USA 100, 9440–9445 (2003).
Article ADS MathSciNet CAS PubMed MATH PubMed Central Google Scholar
Mi, H., Muruganujan, A. & Thomas, P. D. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res. 41, D377–D386 (2013).
Article CAS PubMed Google Scholar
Law, C. W., Chen, Y., Shi, W. & Smyth, G. K. voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15, R29 (2014).
Article PubMed PubMed Central CAS Google Scholar
Garcia-Alonso, L. et al. Transcription factor activities enhance markers of drug sensitivity in cancer. Cancer Res. 78, 769–780 (2018).
Article CAS PubMed Google Scholar
Ritchie, M. E. et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 43, e47 (2015).
Article PubMed PubMed Central CAS Google Scholar
Baralis, E., Bertotti, A., Fiori, A. & Grand, A. LAS: a software platform to support oncological data management. J. Med. Syst. 36, 81–90 (2012).
Article Google Scholar

Download references

Acknowledgements

We thank D. Adams, G. Vassiliou and L. Parts for comments on the manuscript, members of the M.J.G. laboratory and Sanger Institute facilities (Wellcome Trust grant 206194). Work was funded by Open Targets (OTAR015) to M.J.G., K.Y. and J.S.-R. The K.Y. laboratory is supported by Wellcome Trust (206194). The M.J.G. laboratory is supported by SU2C (SU2C-AACR-DT1213) and Wellcome Trust (102696 and 206194). Support was also received from AIRC 20697 (A.B.) and 18532 (L.T.); 5x1000 grant 21091 (A.B. and L.T.); ERC Consolidator Grant 724748 – BEAT (A.B.); FPRC-ONLUS, 5x1000 Ministero della Salute 2011 and 2014 (L.T.); and Transcan, TACTIC (L.T.).

Author information

Kosuke Yusa
Present address: Stem Cell Genetics, Institute for Frontier Life and Medical Sciences, Kyoto University, Kyoto, Japan
These authors contributed equally: Fiona M. Behan, Francesco Iorio, Gabriele Picco
These authors jointly supervised this work: Kosuke Yusa, Mathew J. Garnett

Authors and Affiliations

Wellcome Sanger Institute, Cambridge, UK
Fiona M. Behan, Francesco Iorio, Gabriele Picco, Emanuel Gonçalves, Charlotte M. Beaver, Rizwan Ansari, Sarah Harper, David Adam Jackson, Rebecca McRae, Rachel Pooley, Piers Wilkinson, Dieudonne van der Meer, Kosuke Yusa & Mathew J. Garnett
Open Targets, Cambridge, UK
Fiona M. Behan, Francesco Iorio, David Dow, Carolyn Buser-Doepner, Euan A. Stronach, Julio Saez-Rodriguez, Kosuke Yusa & Mathew J. Garnett
European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK
Francesco Iorio & Julio Saez-Rodriguez
Candiolo Cancer Institute-FPO, IRCCS, Turin, Italy
Giorgia Migliardi, Francesco Sassi, Marika Pinnelli, Andrea Bertotti & Livio Trusolino
Department of Oncology, University of Torino, Turin, Italy
Giorgia Migliardi, Marika Pinnelli, Andrea Bertotti & Livio Trusolino
GlaxoSmithKline Research and Development, Stevenage, UK
Rita Santos, David Dow & Euan A. Stronach
GlaxoSmithKline Research and Development, Collegeville, PA, USA
Yanhua Rao & Carolyn Buser-Doepner
Faculty of Medicine, Joint Research Centre for Computational Biomedicine, RWTH Aachen University, Aachen, Germany
Julio Saez-Rodriguez
Institute for Computational Biomedicine, Heidelberg University, Faculty of Medicine, Bioquant, Heidelberg, Germany
Julio Saez-Rodriguez
Heidelberg University Hospital, Heidelberg, Germany
Julio Saez-Rodriguez

Authors

Fiona M. Behan
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Iorio
View author publications
You can also search for this author in PubMed Google Scholar
Gabriele Picco
View author publications
You can also search for this author in PubMed Google Scholar
Emanuel Gonçalves
View author publications
You can also search for this author in PubMed Google Scholar
Charlotte M. Beaver
View author publications
You can also search for this author in PubMed Google Scholar
Giorgia Migliardi
View author publications
You can also search for this author in PubMed Google Scholar
Rita Santos
View author publications
You can also search for this author in PubMed Google Scholar
Yanhua Rao
View author publications
You can also search for this author in PubMed Google Scholar
Francesco Sassi
View author publications
You can also search for this author in PubMed Google Scholar
Marika Pinnelli
View author publications
You can also search for this author in PubMed Google Scholar
Rizwan Ansari
View author publications
You can also search for this author in PubMed Google Scholar
Sarah Harper
View author publications
You can also search for this author in PubMed Google Scholar
David Adam Jackson
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca McRae
View author publications
You can also search for this author in PubMed Google Scholar
Rachel Pooley
View author publications
You can also search for this author in PubMed Google Scholar
Piers Wilkinson
View author publications
You can also search for this author in PubMed Google Scholar
Dieudonne van der Meer
View author publications
You can also search for this author in PubMed Google Scholar
David Dow
View author publications
You can also search for this author in PubMed Google Scholar
Carolyn Buser-Doepner
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Bertotti
View author publications
You can also search for this author in PubMed Google Scholar
Livio Trusolino
View author publications
You can also search for this author in PubMed Google Scholar
Euan A. Stronach
View author publications
You can also search for this author in PubMed Google Scholar
Julio Saez-Rodriguez
View author publications
You can also search for this author in PubMed Google Scholar
Kosuke Yusa
View author publications
You can also search for this author in PubMed Google Scholar
Mathew J. Garnett
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.J.G., K.Y. and C.B.-D. conceived the project. F.M.B. led CRISPR–Cas9 screening, co-developed the project Score web portal, contributed to analysis strategy, performed validation analyses and verified WRN dependency. F.I. led computational analyses and figure preparation, and contributed to the project Score web portal. G.P. performed experiments to verify WRN dependency, carried out analyses and contributed to in vivo studies. E.G. contributed to computational analysis and figure preparation. D.v.d.M. contributed to the project Score web portal. G.M., F.S., M.P., A.B. and L.T. performed in vivo studies. C.M.B., R.A., D.A.J., R.M., R.P. and P.W. performed CRISPR–Cas9 screens. R.S. performed tractability analysis. Y.R. performed WRN rescue experiments. C.M.B., S.H., A.B., L.T., E.A.S., D.D. and J.S.-R. assisted with project supervision. F.M.B., F.I., E.G., G.P., K.Y. and M.J.G. wrote the manuscript. K.Y. and M.J.G. directed the project. J.S.-R., A.B., L.T., M.J.G. and K.Y. acquired funding. All authors approved the manuscript.

Corresponding authors

Correspondence to Kosuke Yusa or Mathew J. Garnett.

Ethics declarations

Competing interests

E.A.S., D.D., C.B.-D., R.S. and Y.R. are GlaxoSmithKline employees. Open Targets is a public–private initiative involving academia and industry. K.Y. and M.J.G. receive funding from AstraZeneca. M.J.G. performed consultancy for Sanofi. All other authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Project Score CRISPR–Cas9 screening pipeline, data quality control and analysis set.

a, CRISPR–Cas9 screening pipeline workflow, including quality control steps and go/no-go decisions. b, Genomic characterization of the CRISPR–Cas9-screened cell lines. c, Average Pearson’s correlation of replicate sgRNA counts (n = 86,875) for individual cell lines. d, Data quality control threshold based on the distributions of Pearson’s correlation values of sgRNA fold change values between replicates of the same cell line (in green) and all possible pairwise comparisons (in grey), considering the 838 highly informative sgRNAs (described in the Methods). e, Percentage of experiments passing the quality control filter defined in d. f, Pearson’s correlation values as described in d for the cell lines in the final analysis set. g, ROC and precision/recall curves were obtained after classifying predefined essential (n = 354) and non-essential (n = 747) genes based on gene-level rank positions calculated using depletion fold changes. The median areas under the curve across all cell lines are reported. h, Glass’s Δ scores quantifying the depletion effect size for genes that encode ribosomal proteins (n = 61) and a priori known essential (n = 354) genes for all cell lines. i, Cell lines in the final analysis set grouped by tissue (inner ring) and cancer-type (outer ring). j, Median gene-level depletion fold change (FC) values and interquartiles for reference gene sets defined in g and h for the 324 cell lines included in the analysis set. GEX, gene expression; METH, methylation; CNA, copy number alteration; WES, whole-exome DNA sequencing; AUROC, area under receiver operating characteristic; AUPR; area under precision/recall curve.

Extended Data Fig. 2 Assessment of technical confounders in CRISPR–Cas9 screening data and summary of fitness genes.

a, Absence of association between screening data quality and the number of replicates (as quantified by a Pearson’s correlation with respect to the number of replicates, n = 5 distinct values). Data quality was assessed using the fitness effect (the median fold change) of genes that encode ribosomal protein (n = 61) in each cell line as a reference. b, Absence of an association between data quality (quantified as in a) and average Pearson’s correlation between replicates of individual screened cell lines (n = 324). The P value refers to a two-sample Student’s t-test, the score on the right plot is a Pearson’s correlation. c, Weak correlation and significant association between sgRNA library transduction efficiency in cell lines (averaged for replicates) and data quality. d, Weak correlation and significant association between the Cas9 activity of a cell line (averaged for replicates) and data quality. e, Absence of an association between library coverage and data quality. In c–e, P values, R and sample sizes (n) are defined as for b. f, Number of fitness genes in each cell line (BAGEL FDR < 5%; median = 1,459). g, Number of cell lines with fixed intervals of numbers of fitness genes. h, Absence of correlation between number of significant fitness genes per cell line and number of replicates, R defined as for a. i, The effect of the version of the sgRNA screening library on the number of fitness genes identified. A new version of the library (v.1.1) with additional guides for a subset of genes yields moderately larger numbers of fitness genes; however, this is equally variable in both groups and confounded by the tissue of origin of the cell lines. P value is from a two-sample Student’s t-test. j, Reproducible calling of fitness genes in HT-29 across sgRNA libraries. Left, the number of fitness genes detected in each library. Right, scatter plots of depletion scores at the genome-wide level or considering only highly informative sgRNAs for each library. In both cases, P values from a Fisher’s exact test are below machine precision (<10⁻¹⁶). R indicates Pearson’s correlation; C indicates the percentage of genes called as significantly depleted with both libraries over those detected as significantly depleted with one library only. k, Pearson’s correlation between the number of fitness genes per cell line and Cas9 activity level and library transduction efficiency. l, Pearson’s correlation between the number of fitness genes per cell line and the average Pearson’s correlation of cell line replicates. m, n, Pearson’s correlation between the number of fitness genes per cell line and the ability to detect a defined essential genes. For all panels, each data point is a cell line coloured by cancer type (except g and j). Box-and-whisker plots show the median, interquartile range and 95 percentiles.

Extended Data Fig. 3 Computation of ovary-specific and pan-cancer core fitness genes with ADaM, and a summary of context-specific and core fitness genes.

a, Number of fitness genes in each cell line. b, Number of fitness genes in a fixed number (m) of cell lines. c, Distributions and cumulative distributions of number of fitness genes observed in m cell lines across 1,000 randomized versions of the depletion scores for ovary cell lines. d, True-positive rates (for which a priori known essential genes are counted as positive) when considering the genes that are depleted (fitness genes) in at least m cell lines (blue curve) as predictions and the deviance of the number of these genes from expectations (computed using the randomized data shown in c) for all possible values of m (red curve). The x coordinate (rounded by excess) of the intersection of these two curves estimates the minimal number of cell lines m∗ in which a gene should be significantly depleted in order to be predicted as a core fitness gene for a cancer type. e, Number of genes predicted to be cancer-type-specific core fitness genes for a fixed number (k) of cancer types. f, Distributions (top) and cumulative distributions (bottom) of the number of core fitness genes predicted for a fixed number of tissue types for 1,000 randomized versions of the cancer-type-specific core fitness profiles. g, True-positive rates (for which a priori known essential genes are counted as positive) when considering the genes that are core fitness genes for at least k cancer types (blue curve) as predictions and the deviance of the number of these genes from expectation (computed using the randomized data shown in f; red curve). The x coordinate estimates the minimal number of cancer types k∗ for which a gene should have been predicted as a cancer-type-specific core fitness gene in order to be classified as a pan-cancer core fitness gene. All box-and-whisker plots show the interquartile ranges and 95th percentiles, with centres indicating medians.

Extended Data Fig. 4 Characterization of ADaM pan-cancer core fitness genes.

a, The 553 pan-cancer core fitness genes in reference essential gene sets are shown^9,10. Respective recall and enrichment significance P values from a hypergeometric test when considering the whole set of genes targeted in the CRISPR–Cas9 screen as the background population (n = 17,995). The 132 newly identified core fitness genes fall outside of these reference gene sets. b, c, Pathways (b) and gene families (c) enriched in the 132 newly identified pan-cancer core fitness genes (Benjamini–Hochberg-adjusted hypergeometric test P < 0.05). d, Comparison of the ADaM core fitness genes with two previously reported reference sets^9,10 of essential genes in terms of number of genes, estimated precision and recall (the genes included in reference gene sets corresponding to cellular essential process were considered to be true-positive genes). e, FDRs of putative context-specific fitness genes at different thresholds of reliability (n = 7,393, 2,233, 426 and 82 putative context-specific fitness genes, respectively, for thresholds equal to 20, 50, 100 and 200 of log-likelihood of skewed t-distributions). f, Clustering of cancer types based on core fitness gene similarity (left) and numbers of cancer-type core-specific fitness genes exclusive to each cancer type (right). g, Basal expression of cancer-type specific core fitness genes (n, across tissues indicated in Fig. 1c) in matched normal tissues compared with all the other genes in the genome, across cancer types (as indicated by the different colours). Five genes were identified as core fitness genes in a single cancer type and are not expressed at the basal level (<5% quantile) in matched normal tissue (red points). Cancer types are coloured as shown in f. Box-and-whisker plots show interquartile ranges and 95th percentiles, with sample sizes indicated in f (right), centres indicate median values.

Extended Data Fig. 5 Pan-cancer and cancer-type-specific priority scores.

a, Criteria for the target prioritization scoring system. b, ANOVA results from differential dependency biomarker analyses with all 1,001 significant associations classified as pan-cancer or cancer-type-specific associations (inner circle), loss- or gain-of-fitness marker (middle circle) and whether the marker is a mutation, copy number gain or loss (outer circle). c, Distributions of pan-cancer (left) and cancer-type-specific (right) non-null target priority scores based on the therapeutic indication of approved or preclinical compounds. The significance threshold was based on the distribution of scores for targets with approved anticancer compounds (specific anticancer compounds for the cancer-type-specific priority score) versus scores for targets with no available anticancer compounds. d, Overlap between cancer-type-specific priority targets (for at least one cancer type) and pan-cancer priority targets. e, Example priority targets identified only in the pan-cancer context. Each symbol is an individual cell line coloured by cancer type and symbol shapes indicate a significant dependency (n = 324 cell lines).

Extended Data Fig. 6 Priority therapeutic targets in 10 cancer types and pan-cancer.

Each data point is a target with a priority score classified into tractability buckets and groups. The shapes represent the indication of the approved and/or preclinical compound to the corresponding target (other disease (square), anticancer (triangle) or specific to the cancer type considered (rhombus)); circles indicate the absence of a compound. Symbols within each data point indicate the strength of the genomic marker associated with differential dependency on the target (class A to C indicate strong to weak associations).

Extended Data Fig. 7 GPX4 fitness selectivity for cells undergoing epithelial–mesenchymal transitions, functional classification of priority targets and WRN differential fitness in other cancer types.

a, Differentially expressed genes in cell lines that are dependent on GPX4 (left) (n = 113, non-dependent versus dependent, moderated t-statistic FDR estimates). Epithelial–mesenchymal transition is the top differentially enriched cancer hallmark gene signature in GPX4-dependent cell lines (right). P values from single-sample gene set enrichment analyses were obtained by randomly permuting gene signatures 10,000 times and adjusted for multiple testing using the Benjamini–Hochberg FDR correction. b, Functional classification of priority targets in each tractability group using the PANTHER database. For clarity, kinases (a subset of transferases) and transcription factors are shown separately. Protein classes are indicated by colour. Statistical enrichment was calculated using a systematic hypergeometric test across protein families, following correction for multiple testing with the Benjamini–Hochberg method. Pie charts indicate the percentage of targets in each group classified according to protein families. c, WRN dependency in multiple cancer types. Each data point is a cell line showing the quantile-normalized WRN sgRNA fold change value stratified by MSI status. Box-and-whisker plots show interquartile ranges and 95th percentiles and centres indicate median values. Individual values are shown as dots. Statistical significance was calculated from the systematic ANOVA analysis for each cancer type for which the number of cell lines was greater than 10 (n = 14 for gastric carcinoma).

Extended Data Fig. 8 Verification of WRN as a target in MSI cancers.

a, WRN dependency using a co-competition assay in MSI (top row, n = 7) and MSS (bottom row, n = 7) cell lines from four cancer types. sgRNAs targeting essential (sgEss) and non-essential (sgNon) genes were used as controls. Bars represent mean co-competition score; lines represent maximum and minimum values; individual data points overlaid. b, Selective WRN dependency in MSI versus MSS cell lines was confirmed using clonogenic assays in four cancer types (images are representative of two independent experiments). c, A reduction in WRN protein levels with all WRN sgRNAs was confirmed by western blot (images are representative of two independent experiments). d, An association between WRN dependency and MSI status was confirmed by mining data from an independent study that used RNA interference, project DRIVE¹² (Student’s t-test, P = 0.004; n = 214). Each circle represents the WRN RNA-interference dependency score in a cancer cell line. Box-and-whisker plots represent median and 1.5× interquartile range. e, siRNA depletion of WRN inhibited proliferation of HCT116 cells. Data are mean ± s.d. of three independent experiments. The P value was determined using a non-parametric Student’s t-test. f, siRNA-mediated depletion of WRN was verified by western blot (images are representative of two independent experiments). For western blot source data, see Supplementary Fig. 1.

Extended Data Fig. 9 MLH1 knockout, MMR rescue experiments and modulation of WRN dependency.

a, A WRN co-competition assay in MSS SW620 cells with stable MLH1 knockout. Cells were cultured for 3 months before assessing WRN dependency. Data are mean ± s.e.m. of three independent experiments. b, Western blotting confirmed MLH1 and WRN knockout (images are representative of two independent experiments). c, MLH1 and MSH3 expression by western blot in HCT116 parental and isogenic cell lines complemented with chromosome 2 (Ch.2; negative control), Ch.3 (which contains MLH1), Ch.5 (which contains MSH3) and Ch.3 + Ch.5 (which contains both MLH1 and MSH3). Data are representative of two independent experiments. d, Effect of WRN knockout (WRN sgRNAs 1 and 4 (sgWRN1 and sgWRN4, respectively)) on viability after 7 days in HCT116 parental and isogenic cell lines. Data are mean ± s.d. of three independent experiments. e, Clonogenic assays (14 days) after WRN knockout in HCT116 parental and isogenic cell lines. Data are representative of three independent experiments. f, Reduction in WRN levels was confirmed by western blot. Data are representative of two independent experiments. Source data for all western blots are shown in Supplementary Fig. 1.

Extended Data Fig. 10 Functional rescue experiments and in vivo validation of WRN dependency in a MSI colorectal cancer cell line.

a, Expression of wild-type mouse Wrn rescued the viability effect of WRN knockout in MSI cell line SW48. MSS cell line SW620 was used a negative control. Box-and-whisker plots represent the median and 1.5× interquartile range. Data represent two independent biological replicates completed in technical triplicate. b, Western blots confirmed expression of Flag-tagged protein using all variants of the Wrn vector. Images are representative of experiments performed in triplicate. c, WRN knockout induced by doxycycline treatment in WRN sgRNA-expressing HCT116 (HCT116-WRN) cells measured by western blot for two separate clonal lines. Data are representative of two independent experiments. d, Growth curves of HCT116 parental, HCT116 sgNon (non-essential sgRNA) and WRN sgRNA-expressing HCT116 cells grown in the absence (black line) or presence of doxycycline (2 μg ml⁻¹; yellow line). Data are mean ± s.d. of 10 technical replicate wells for each condition (1 image per well) and representative of two independent experiments. e, Growth curves of WRN sgRNA-expressing HCT116 (clone b) subcutaneous tumours from mice treated with doxycycline (50 mg kg⁻¹; yellow line) or vehicle (grey line). Tumour growth suppression was observed (P = 0.03, two-way ANOVA comparing doxycycline versus vehicle). The number of mice in each cohort is indicated. Data are mean ± s.e.m. f, Representative KI-67 immunohistochemistry assessment of WRN sgRNA-expressing HCT116 (clone b) tumours explanted after one week of doxycycline treatment (left). Scale bar, 50 μm; 40× magnification. Quantification of KI-67 staining (right). Data are mean ± s.d. of 10 fields from three different samples (n = 30) and means were compared using a two-sided Welch’s t-test. Source data for all western blots are shown in Supplementary Fig. 1.

Source Data

Supplementary information

Supplementary Information

This file contains Supplementary Text and Data (see contents page for details) and a guide to the Supplementary Data available on Figshare.

Life Sciences Reporting Summary

Supplementary Figures

This file contains the uncropped blots from Extended Data Figures 8-10.

Supplementary Tables

This file contains Supplementary Tables 1-10 and a Supplementary Table Guide.

Source data

Source Data Fig. 5

Source Data Extended Data Fig. 10

Rights and permissions

Reprints and permissions

About this article

Cite this article

Behan, F.M., Iorio, F., Picco, G. et al. Prioritization of cancer therapeutic targets using CRISPR–Cas9 screens. Nature 568, 511–516 (2019). https://doi.org/10.1038/s41586-019-1103-9

Download citation

Received: 03 August 2018
Accepted: 08 March 2019
Published: 10 April 2019
Issue Date: 25 April 2019
DOI: https://doi.org/10.1038/s41586-019-1103-9

This article is cited by

Joint analysis of mutational and transcriptional landscapes in human cancer reveals key perturbations during cancer evolution
- Jae-Won Cho
- Jingyi Cao
- Martin Hemberg
Genome Biology (2024)
scSNV-seq: high-throughput phenotyping of single nucleotide variants by coupled single-cell genotyping and transcriptomics
- Sarah E. Cooper
- Matthew A. Coelho
- Andrew R. Bassett
Genome Biology (2024)
tRNA-derived small RNAs in human cancers: roles, mechanisms, and clinical application
- Manli Zhou
- Xiaoyun He
- Chunlin Ou
Molecular Cancer (2024)
Comprehensive review of CRISPR-based gene editing: mechanisms, challenges, and applications in cancer therapy
- Mohammad Chehelgerdi
- Matin Chehelgerdi
- Abbas Mokhtari-Farsani
Molecular Cancer (2024)
FANCJ promotes PARP1 activity during DNA replication that is essential in BRCA1 deficient cells
- Ke Cong
- Nathan MacGilvary
- Sharon B. Cantor
Nature Communications (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.