Pooled CRISPR screening with single-cell transcriptome readout

Datlinger, Paul; Rendeiro, André F; Schmidl, Christian; Krausgruber, Thomas; Traxler, Peter; Klughammer, Johanna; Schuster, Linda C; Kuchler, Amelie; Alpar, Donat; Bock, Christoph

doi:10.1038/nmeth.4177

Article
Published: 18 January 2017

Pooled CRISPR screening with single-cell transcriptome readout

Paul Datlinger¹,
André F Rendeiro ORCID: orcid.org/0000-0001-9362-5373¹^na1,
Christian Schmidl¹^na1,
Thomas Krausgruber¹,
Peter Traxler¹,
Johanna Klughammer¹,
Linda C Schuster¹,
Amelie Kuchler¹,
Donat Alpar¹ &
…
Christoph Bock ORCID: orcid.org/0000-0001-6091-3088^1,2,3

Nature Methods volume 14, pages 297–301 (2017)Cite this article

91k Accesses
513 Citations
155 Altmetric
Metrics details

Subjects

Abstract

CRISPR-based genetic screens are accelerating biological discovery, but current methods have inherent limitations. Widely used pooled screens are restricted to simple readouts including cell proliferation and sortable marker proteins. Arrayed screens allow for comprehensive molecular readouts such as transcriptome profiling, but at much lower throughput. Here we combine pooled CRISPR screening with single-cell RNA sequencing into a broadly applicable workflow, directly linking guide RNA expression to transcriptome responses in thousands of individual cells. Our method for CRISPR droplet sequencing (CROP-seq) enables pooled CRISPR screens with single-cell transcriptome resolution, which will facilitate high-throughput functional dissection of complex regulatory mechanisms and heterogeneous cell populations.

Access through your institution

Buy or subscribe

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

**Figure 1: CROP-seq enables pooled CRISPR screening with single-cell transcriptome readout.**

**Figure 2: CROP-seq analysis of T cell receptor signaling.**

High-content CRISPR screening

Article 10 February 2022

Ultra-high-throughput single-cell RNA sequencing and perturbation screening with combinatorial fluidic indexing

Article 31 May 2021

Targeted Perturb-seq enables genome-scale genetic screens in single cells

Article 01 June 2020

Accession codes

Primary accessions

Gene Expression Omnibus

GSE92872

References

Blomen, V.A. et al. Gene essentiality and synthetic lethality in haploid human cells. Science 350, 1092–1096 (2015).
Article CAS Google Scholar
Wang, T. et al. Identification and characterization of essential genes in the human genome. Science 350, 1096–1101 (2015).
Article CAS Google Scholar
Shalem, O. et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science 343, 84–87 (2014).
Article CAS Google Scholar
Marceau, C.D. et al. Genetic dissection of Flaviviridae host factors through genome-scale CRISPR screens. Nature 535, 159–163 (2016).
Article CAS Google Scholar
Lamb, J. The Connectivity Map: a new tool for biomedical research. Nat. Rev. Cancer 7, 54–60 (2007).
Article CAS Google Scholar
Gapp, B.V. et al. Parallel reverse genetic screening in mutant human cells using transcriptomics. Mol. Syst. Biol. 12, 879 (2016).
Article Google Scholar
Macosko, E.Z. et al. Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell 161, 1202–1214 (2015).
Article CAS Google Scholar
Sanjana, N.E., Shalem, O. & Zhang, F. Improved vectors and genome-wide libraries for CRISPR screening. Nat. Methods 11, 783–784 (2014).
Article CAS Google Scholar
Tiscornia, G., Singer, O. & Verma, I.M. Design and cloning of an shRNA into a lentiviral silencing vector: version A. CSH Protoc. 1, pdb.prot5009 (2008).
Google Scholar
de Kok, S. et al. Rapid and reliable DNA assembly via ligase cycling reaction. ACS Synth. Biol. 3, 97–106 (2014).
Article CAS Google Scholar
Guschin, D.Y. et al. A rapid and general assay for monitoring endogenous gene modification. Methods Mol. Biol. 649, 247–256 (2010).
Article CAS Google Scholar
Brownlie, R.J. & Zamoyska, R. T cell receptor signalling networks: branched, diversified and bounded. Nat. Rev. Immunol. 13, 257–269 (2013).
Article CAS Google Scholar
Datlinger, P. et al. Pooled CRISPR screening with single-cell transcriptome read-out. Preprint at http://biorxiv.org/content/early/2016/10/27/083774 (2016).
Adamson, B. et al. A multiplexed single-cell CRISPR screening platform enables systematic dissection of the unfolded protein response. Cell 167, 1867–1882.e21 (2016).
Article CAS Google Scholar
Dixit, A. et al. Perturb-Seq: dissecting molecular circuits with scalable single-cell RNA profiling of pooled genetic screens. Cell 167, 1853–1866.e17 (2016).
Article CAS Google Scholar
Jaitin, D.A. et al. Dissecting immune circuits by linking CRISPR-pooled screens with single-cell RNA-seq. Cell 167, 1883–1896.e15 (2016).
Article CAS Google Scholar
Zheng, G.X.Y. et al. Massively parallel digital transcriptional profiling of single cells. Preprint at http://biorxiv.org/content/early/2016/07/26/065912 (2016).
Bock, C., Farlik, M. & Sheffield, N.C. Multi-omics of single cells: strategies and applications. Trends Biotechnol. 34, 605–608 (2016).
Article CAS Google Scholar
Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).
Article CAS Google Scholar
Bolger, A.M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS Google Scholar
Langmead, B., Trapnell, C., Pop, M. & Salzberg, S.L. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 10, R25 (2009).
Article Google Scholar
Glaus, P., Honkela, A. & Rattray, M. Identifying differentially expressed transcripts from RNA-seq data with biological variation. Bioinformatics 28, 1721–1728 (2012).
Article CAS Google Scholar
Li, J. et al. Single-cell transcriptomes reveal characteristic features of human pancreatic islet cell types. EMBO Rep. 17, 178–187 (2016).
Article CAS Google Scholar
Treutlein, B. et al. Dissecting direct reprogramming from fibroblast to neuron using single-cell RNA-seq. Nature 534, 391–395 (2016).
Article Google Scholar
Kuleshov, M.V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 44, W90–W97 (2016).
Article CAS Google Scholar

Download references

Acknowledgements

We would like to thank J. Bigenzahn, A. Fauster, and M. Owusu (CeMM) for providing Cas9-expressing cell lines; M. Farlik for contributing to the Drop-seq setup; F. Müller and J. Menche for bioinformatic discussions; N. Winhofer for feedback on the illustrations; the Biomedical Sequencing Facility at CeMM for assistance with next-generation sequencing; and all members of the Bock lab for their help and advice. C.S. is supported by a Feodor Lynen Fellowship of the Alexander von Humboldt Foundation. C.B. is supported by a New Frontiers Group award of the Austrian Academy of Sciences and by an ERC Starting Grant (European Union's Horizon 2020 research and innovation programme, grant agreement no. 679146).

Author information

André F Rendeiro and Christian Schmidl: These authors contributed equally to this work.

Authors and Affiliations

CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
Paul Datlinger, André F Rendeiro, Christian Schmidl, Thomas Krausgruber, Peter Traxler, Johanna Klughammer, Linda C Schuster, Amelie Kuchler, Donat Alpar & Christoph Bock
Department of Laboratory Medicine, Medical University of Vienna, Vienna, Austria
Christoph Bock
Max Planck Institute for Informatics, Saarland Informatics Campus, Saarbrücken, Germany
Christoph Bock

Authors

Paul Datlinger
View author publications
You can also search for this author in PubMed Google Scholar
André F Rendeiro
View author publications
You can also search for this author in PubMed Google Scholar
Christian Schmidl
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Krausgruber
View author publications
You can also search for this author in PubMed Google Scholar
Peter Traxler
View author publications
You can also search for this author in PubMed Google Scholar
Johanna Klughammer
View author publications
You can also search for this author in PubMed Google Scholar
Linda C Schuster
View author publications
You can also search for this author in PubMed Google Scholar
Amelie Kuchler
View author publications
You can also search for this author in PubMed Google Scholar
Donat Alpar
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Bock
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.D., A.F.R., C.S., and C.B. conceptualized the project; P.D. designed and developed CROP-seq; P.D., C.S., P.T., and L.C.S. conducted CROP-seq experiments; D.A. optimized sequencing protocols; P.D., T.K., L.C.S., and A.K. performed the arrayed validation screen; A.F.R. and J.K. developed software; P.D., A.F.R., C.S., and T.K. analyzed data; P.D. and A.F.R. visualized data; P.D., A.F.R., C.S., and C.B. wrote the original draft; T.K., P.T., L.C.S., A.K., and D.A. reviewed the draft; and C.B. supervised the project.

Corresponding author

Correspondence to Christoph Bock.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Integrated supplementary information

Supplementary Figure 1 Cloning and validation of the CROPseq-Guide-Puro plasmid

a) As starting point for preparing the CROPseq-Guide-Puro plasmid, we amplified four PCR products (A, B, C, and D) from the LentiGuide-Puro plasmid with the indicated primer pairs. b) CROPseq-Guide-Puro was constructed from these four amplicons using the ligase cycling reaction (LCR). The assembly was directed by four overlapping bridge oligonucleotides to flip the order of parts C and D. This rearrangement places the hU6-gRNA cassette into the 3′ LTR, downstream of the EF-1a puromycin marker. c) To validate the duplication of the hU6-gRNA cassette during lentiviral integration, we performed PCRs with primers that bind to the hU6 promoter but face in opposite directions. Productive amplification can occur only when amplifying from a circular plasmid or following duplication of the cassette during viral integration. As templates, we used gDNA from LentiGuide-Puro transduced cells (lane 1, resulting in no amplification), a plasmid preparation of CROPseq-Guide-Puro (lane 2), or gDNA from CROPseq-Guide-Puro transduced cells (lane 3).

Supplementary Figure 2 Genome editing efficiencies of LentiGuide-Puro and CROPseq-Guide-Puro based on the T7 endonuclease assay

a) Clonally expanded HEK293T cell lines as well as a HEK293T bulk population were transduced with LentiGuide-Puro (LentiGuide) or CROPseq-Guide-Puro (CROP-seq) vectors containing a gRNA targeting the MBD1 locus (+) or targeting a different locus (−). Ge-nome editing efficiencies for MBD1 were measured using the T7 endonuclease assay, which indicated highly similar performance between the two vectors. HEK293T clone 5 did not show any genome editing and was not used for further experiments. b) Table summa-rizing genome editing efficiencies for four cell lines (HEK293T, K562, Jurkat, KBM7) and two gRNAs (MBD1, DNMT3B).

Supplementary Figure 3 Configuration and validation of the droplet-based assay for single-cell transcriptome profiling

a) Setup of the Drop-seq workflow used as part of CROP-seq. b) Bioanalyzer trace of a typical cDNA library prepared with CROP-seq. c) Electropherogram of a sequencing-ready CROP-seq library after tagmentation. d) Doublet estimates based on a HEK293T (human) / 3T3 (mouse) mixing experiment across all detected cells and transcripts (without filtering). e) Percent of detected genes aligning to the human and mouse transcriptomes (filtered for cells with more than 500 detected genes). f) PCR duplication rates based on unique molecular identifiers (UMIs) in the HEK293T (human) / 3T3 (mouse) mixing experiment. g) Distribution of the distance of read mapping positions to the 3′end of gene models (blue line) and their cumulative sum (red line). h) Detailed performance statistics for twelve CROP-seq experiments. Green and orange labels indicate different batches of Drop-seq beads, where batch 1 suffered from production problems affecting the cell barcodes, which have been bioinformatically corrected to improve the data quality of the affected samples.

Supplementary Figure 4 Validation of the T cell receptor gRNA library and gRNA dynamics

a) gRNA representation in the T cell receptor (TCR) gRNA library, assessed by amplicon sequencing of the plasmid pool (top) and the gDNA of Jurkat cells at day 10 post transduction with CROPseq-Guide-Puro (bottom), both displayed as cumulative distribution plots. The fold change between the 10^th and 90^th percentile is highlighted as a measure of library imbalance, which expectedly increases upon transduction. b) Abundance of each gRNA shown as a heatmap. c) Scatterplots of gRNA abundance from amplicon libraries at day 10 versus the original plasmid library. Frequencies of detected gRNAs have been normalized to the evaluated reads in each experiment.

Supplementary Figure 5 Similarities and differences in the transcriptome response at the gRNA and gene level

a) Mean and standard deviation of pairwise distances (L2-norm) between CROP-seq transcriptomes for pairs of gRNAs that target the same gene (orange) or different genes (blue). Statistical significance was assessed with the Mann-Whitney U test. b) Pairwise distances as in panel a, shown separately for gRNAs targeting specific genes and for naïve as well as anti-CD3/CD28 stimulated cells. The dotted line indicates the 99^th percentile of the distribution of distances between non-targeting gRNAs. c) Statistical significance of the transcriptome-wide effect induced by gRNAs targeting specific genes relative to cells with non-targeting gRNAs (based on the Mann-Whitney U test). The dotted line indicates a p-value of 0.01. d) As in panel c, but aggregated at the gene level by combining pvalues using Fisher's method. Insets show scatterplots for the expression of two example genes with low (left) and high (right) systematic effects on the transcriptome compared to cells expressing non-targeting control gRNAs in the same stimulation condition (y-axis)

Supplementary Figure 6 Unsupervised analysis of the transcriptome response to T cell receptor stimulation

a) Principal component analysis of CROP-seq transcriptomes for cells with an assigned gRNA. b) Principal component analysis of median gene expression aggregated across cells expressing the same gRNA. c) Principal component analysis of median gene expression aggregated across cells expressing gRNAs targeting the same gene. The first principal component provided best separation between naïve and anti-CD3/CD28 stimulated cells, and the genes on the 99^th percentile of loading contributions were selected as the CROP-seq derived TCR activation signature (n = 165 genes). d) Genes of the TCR activation signature and their absolute loading values for the first principal component in panel c. e) Principal component analysis as in panel a, with cells colored by the expression of three marker genes selected as part of the TCR activation signature. f) Enriched pathways and biological processes of genes in panel d, as identified by Enrichr. The combined enrichment score (p-value * z-score) is displayed for the top 8 terms of each gene set library.

Supplementary Figure 7 Positioning cells and target genes on a spectrum defined by naive and stimulated cell states

a) Hierarchical clustering of single-cell transcriptomes with unambiguously assigned gRNA (n = 5,798) based on all genes included in the TCR activation signature. Clustering for cells and genes used the Pearson correlation, and the z-score of expression is displayed along with the stimulation state for each cell (left column). b) Hierarchical clustering of median gene expression values aggregated across cells expressing gRNAs for the same target gene (n = 107). c) Analytical procedure for assigning each cell to a specific position on a spectrum defined by the CROP-seq transcriptomes of non-targeted cells in the naive and the anti-CD3/CD28 stimulated cell state. In a first step (left), a matrix of synthetic transcriptome signatures (Z) is built by linear combination of the transcriptomes in the two defining cell states (μ_A, μ_B). In a second step (right), the position of the Z matrix that shows the maximum Pearson correlation with the transcriptome of the cells (E matrix) is taken as the cell's position along the spectrum of cell states. d) Correlation (row-wise z-score) of single-cell transcriptomes with a matrix comprising synthetic mixtures of transcriptome profiles between the median of non-targeted cells in both conditions (values close to one reflect similarity with anti-CD3/CD28 stimulated cells). Additional data on the transcrip-tome quality for each cell (unique reads per cell) and the overall correlation performance are shown as columns and reveal no relation-ship with the inferred position of the corresponding cells. All cells were ordered by the inferred signature value (position with maxi-mum correlation), rather than being clustered as in panel a. e) Same as in panel c, but for cells grouped by gRNA target genes.

Supplementary Figure 8 Bulk RNA-seq analysis for the arrayed validation screen

a) Hierarchical clustering of the median expression of TCR activation signature genes, based on bulk RNA-seq data aggregated across gRNAs for the same target gene. Clustering of rows and columns used the Pearson correlation, and z-scores of expression are shown. b) Correlation (row-wise z-score) of bulk RNA-seq transcriptomes with a matrix comprising synthetic mixtures of transcriptome profiles between the median of non-targeted cells in both conditions (values close to one reflect similarity with anti-CD3/CD28 stimulated cells). Additional metrics are shown as columns (center) and reveal no relationship with the inferred position of the corresponding samples. The effect that perturbing each target gene had on the TCR activation signature based on the bulk RNA-seq data was assessed in comparison to the control group (barplot on the right). c) Correlation (top) of median expression levels based on CROP-seq aggregating across target genes (left) or gRNAs (right) compared to the respective bulk RNA-seq libraries across all TCR activation signature genes. The corresponding number of cells in each group is shown at the bottom. Empty values reflect lack of matching bulk RNAseq libraries due to failed samples in the bulk RNA-seq. d) Comparison of the signatures inferred from CROP-seq (x-axis) with those derived from bulk RNA-seq data (y-axis) across all shared gRNAs. e) Comparison of the relative impact of each gRNA (left) or target gene (right) on the TCR activation signature relative to the corresponding control for CROP-seq (x-axis) and bulk RNA-seq (y-axis).

Supplementary Figure 9 Flow cytometry analysis for the arrayed validation screen

a) Experimental strategy of the arrayed validation screen. For this screen, 48 CROPseq-Guide-Puro constructs were individually cloned, targeting 20 genes with two gRNAs each and including eight non-targeting controls. Lentivirus production and transductions were performed in 96-well plates, and cells were expanded for 10 days under puromycin and blasticidin selection. Cells were then split into two parts, serum starved for three hours, and subjected to either anti-CD3/CD28-stimulation or continuous starvation for another four hours. The resulting cell populations were validated by Sanger sequencing of the corresponding gRNAs. For validation of the CROPseq signature, bulk RNA-seq was performed using a 3′ enrichment protocol, yielding data similar to Drop-seq (n = 87 RNA-seq libraries). As a complementary single-cell and protein-based read-out, flow cytometry (n = 96 samples) was performed for surface markers enriched in the TCR induction signature derived from CROP-seq (CD69, CD82) or previously reported as markers of T cell activation (CD25, CD38, CD154, PD-1). b) Examples of marker expression changes for TCR pathway activators identified by CROP-seq (ZAP70, LCK, LAT). c) Scatterplots comparing protein levels for TCR induction markers to RNA expression values obtained by CROP-seq.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Datlinger, P., Rendeiro, A., Schmidl, C. et al. Pooled CRISPR screening with single-cell transcriptome readout. Nat Methods 14, 297–301 (2017). https://doi.org/10.1038/nmeth.4177

Download citation

Received: 11 October 2016
Accepted: 10 January 2017
Published: 18 January 2017
Issue Date: March 2017
DOI: https://doi.org/10.1038/nmeth.4177

This article is cited by

Decoding leukemia at the single-cell level: clonal architecture, classification, microenvironment, and drug resistance
- Jianche Liu
- Penglei Jiang
- Pengxu Qian
Experimental Hematology & Oncology (2024)
Genome-wide quantification of copy-number aberration impact on gene expression in ovarian high-grade serous carcinoma
- Sanaz Jamalzadeh
- Jun Dai
- Sampsa Hautaniemi
BMC Cancer (2024)
Mapping the functional impact of non-coding regulatory elements in primary T cells through single-cell CRISPR screens
- Celia Alda-Catalinas
- Ximena Ibarra-Soria
- Radu Rapiteanu
Genome Biology (2024)
Spatial enhancer activation influences inhibitory neuron identity during mouse embryonic development
- Elena Dvoretskova
- May C. Ho
- Christian Mayer
Nature Neuroscience (2024)
scPerturb: harmonized single-cell perturbation data
- Stefan Peidli
- Tessa D. Green
- Chris Sander
Nature Methods (2024)