Mutation rate balances the need to protect genome integrity with the advantage of evolutionary innovations. Microorganisms increase their mutation rate when stressed, perhaps addressing the growing need for evolutionary innovation. Such a strategy, however, is only beneficial under moderate stresses that allow cells to divide and realize their mutagenic potential. In contrast, severe stresses rapidly kill the majority of the population with the exception of a small minority of cells that are in a phenotypically distinct state termed persistence. Although persisters were discovered many decades ago, the stochastic event triggering persistence is poorly understood. We report that spontaneous DNA damage triggers persistence in Saccharomyces cerevisiae by activating the general stress response, providing protection against a range of harsh stress and drug environments. We further show that the persister subpopulation carries an increased load of genetic variants in the form of insertions, deletions or large structural variations, which are unrelated to their stress survival. This coupling of DNA damage to phenotypic persistence may increase genetic diversity specifically in severe stress conditions, where diversity is beneficial but the ability to generate de novo mutations is limited.
Phenotypic persisters are individual microbes that survive harsh treatments that kill the majority of their genetically identical sister cells1,
A molecular marker for yeast persisters
To uncover events that trigger cells to become persisters, we established a marker to detect persisters in the rapidly growing, unstressed population. Since increased stress survival has been linked to stochastic activation of stress genes15,16, we asked whether heat-shock protein 12 (Hsp12) may serve as such a marker. Hsp12 is strongly induced by a variety of environmental stresses17 and its levels fluctuate highly between individual cells, with a subset of cells reaching high levels characteristic of stress-exposed cells (Fig. 1b).
We examined whether high Hsp12 expression predicts stress survival by sorting single cells expressing either high (top 0.1%, termed ‘extreme’) or normal (the remaining 99.9%, termed ‘control’) Hsp12-GFP (green fluorescent protein) into 96-well plates (Fig. 1c), and monitoring their ability to survive and generate a colony in a range of harsh stresses. In unstressed conditions, cells expressing extreme Hsp12 levels survived less well than control cells. In sharp contrast, extreme cells better survived practically all stress exposures (Fig. 1d,e). To verify that extreme cells maintain their advantage under competitive conditions, we co-sorted differentially labelled extreme and control cells into the same well at different initial frequencies, and measured the relative fraction of their progenies at saturation. This was made possible by sorting Hsp12-mCherry extreme and control cells from GFP-labelled or GFP-negative co-cultured strains. As stress levels rose, extreme cells became increasingly successful in out-competing control cells (Fig. 1f). For example, extreme frequency increased by 100-fold, from ~0.1% to ~10%, after sorting into media containing 1.7 M NaCl. Cultures seeded from a sorted extreme cell lost their high Hsp12 levels (Fig. 1g) and showed normal stress sensitivity, confirming that these cells are indeed phenotypic persisters. We conclude that Hsp12 marks budding yeast persisters.
Prolonged DNA damage leads to yeast persisters
To better understand the phenotype of extreme Hsp12-expressing cells, we sorted the cells using fluorescence-activated cell sorting (FACS) and imaged them using microfluidics-coupled video microscopy. About half (59%) of the sorted cells were large budded, resembling DNA damaged cells that are checkpoint-arrested at metaphase18,19. To examine whether Hsp12 induction correlates with DNA damage, we fluorescently tagged Rad52, a nuclear protein that localizes to positions of DNA double-strand breaks (DSB)20,21, confirming that approximately half of the sorted extreme cells had a long-lived (>40 min) Rad52 focus, while almost no control cells had such a focus (Fig. 2a). When subjecting these sorted cells to the antifungal drug fluphenazine, the focus-containing, high Hsp12-GFP cells better survived (Fig. 2b). About half of the focus-containing surviving cells resumed normal growth following drug treatment, while the rest did not divide robustly (Fig. 2b and Supplementary Fig. 1a). Diploid cells grew better than haploids, possibly due to a higher capacity to repair DNA damage from the undamaged locus. Therefore, extreme cells portray morphology and markers associated with prolonged DNA damage.
To examine whether Hsp12 induction follows DNA damage, we imaged freely cycling cells carrying Hsp12-mCherry and Rad52-GFP (Supplementary Video 1). Appearance of a Rad52 focus was the first indication of DNA damage, followed by cell-cycle arrest and Hsp12 induction (Fig. 2c,d and Supplementary Fig. 1b). After ~3–5 h of arrest, cells re-entered the cell cycle, either losing the Rad52 focus and resuming normal division time (~60% of cells), or maintaining the focus and entering a second arrest (Fig. 2e and Supplementary Fig. 1b). Hsp12 began to dilute once cells resumed division (Fig. 2c and Supplementary Fig. 1b). Exposing these cells to a high dose of fluphenazine led to the rapid killing of most cells, with a minority of survivors, many of which were DNA-damaged, high Hsp12-expressing cells (Supplementary Fig. 2 and Supplementary Videos 2 and 3). This same dynamic was confirmed with a second marker of DNA damage, Rnr3 (refs 22,23), whose induction also preceded Hsp12 induction and predicted drug survival (Supplementary Fig. 3 and Supplementary Video 4).
Consistent with the idea that DNA damage triggers persistence, we find an increased fraction of persisters in strains showing elevated mutation rates, as indicated by higher fractions of extreme Hsp12-expressing cells with increased stress survival (Supplementary Fig. 4). To verify directly that DNA damage triggers persistence, we induced DSBs by expressing a GAL1-inducible HO (homothallic switching) endonuclease in cells with a single HO cleavage site. Inducing HO led to the appearance of Rad52-GFP foci, followed by a delayed (~2 h) induction of Hsp12-mCherry (Fig. 2f,g and Supplementary Fig. 5). Subjecting cells to fluphenazine confirmed that cells became persisters (Fig. 2h and Supplementary Videos 5 and 6). While ~99% of cells with low Hsp12 died within 60 min of exposure, only 25% of high Hsp12 cells died at that time, with survivors losing viability at a significantly lower rate (Fig. 2i). Nearly all survivors had Rad52 foci (Fig. 2h). We conclude that a severe DNA damage, such as DSB, triggers persistence in budding yeast.
Yeast persistence depends on checkpoint and ESR activation
The DNA damage checkpoint orchestrates the response to DNA damage. We asked whether Hsp12 induction depends on checkpoint activation. HO induction of DSBs in cells deleted of the checkpoint sensor mec1 does not arrest the cell cycle20,21 (Supplementary Fig. 6a), but did induce Rad52-GFP foci and Hsp12-mCherry expression (Fig. 3a). Flow cytometry analysis further indicated a higher fraction of extreme Hsp12 cells in the mec1-deleted strain, and in a strain deleted of the Rad53 checkpoint kinase, suggesting that this induction is not checkpoint dependent (Supplementary Fig. 6b,c). Notably, extreme cells in these checkpoint mutants did not survive drug exposure when analysed in the HO-induced system or as freely cycling cells (Fig. 3b and Supplementary Fig. 6d). This suggests that persisters better survive drug exposure because they are non-dividing, as implicated in bacteria2,
Hsp12 is part of the environmental stress response (ESR), whose induction by a given moderate stress protects cells from a range of severe stresses24. ESR induction in extreme cells could therefore explain their persistence. Indeed, extreme expression of Hsp12 is regulated in trans, and Hsp12 extreme expression reflects a broader ESR induction, as indicated by high co-expression of a subset of ESR genes (Fig. 4a and Supplementary Fig. 8a). Furthermore, deletion of the ESR regulators, Msn2/4 (ref. 17) strongly reduced the extreme subpopulation (Fig. 4b) and reduced stress survival in freely cycling populations (Fig. 4c). Stress persistence was also reduced in Msn2/4 DNA-damaged extreme cells, as verified by FACS based on an Msn2/4-independent DNA-damage marker (Fig. 4d and Supplementary Fig. 8b) and by HO induction of DSBs (Fig. 4e). Unexpectedly, the remaining drug survivors on Msn2/4 deletion still expressed relatively high levels of Hsp12 (Fig. 4d,e), suggesting activation through a parallel pathway such as Hsf1. We conclude that ESR activation in extreme cells contributes significantly to their increased stress survival.
Yeast persisters are enriched for mutations
Imprecise repair of DNA damage is a major source of genetic mutations. We therefore hypothesized that the prolonged DNA damage leading to persistence might culminate in mutations. Since in exponentially growing cells about half of the mutations arise in just the last generation, we predicted that persister cells will be enriched for random mutations, unrelated to their stress survival. To examine this, we sorted and grew a total of 412 extreme and 315 control cells for deep genomic sequencing in three independent experiments (Fig. 5a). Single and multi-nucleotide polymorphisms (SNPs and MNPs), insertions or deletions (1–45 bp, indels), and large structural variations (SVs) were identified using a bioinformatics pipeline based on established software, followed by filtering, manual curation and extensive verifications by Sanger sequencing (Methods and Supplementary Information).
Indels or SVs were found in 26 of the 412 extreme samples, compared with 6 out of the 315 control samples (Table 1). By Fisher’s exact test, this enrichment is significant (P = 0.0027). Analysed individually, we find a 3-fold enrichment of indels (P = 0.017), ~3.5-fold enrichment of SVs (P = 0.079) and 30% more SNPs (P = 0.064) in extreme versus control cells. Rates for all three mutation categories were higher in extreme cells in each of the three independent experiments we performed (Supplementary Information). All detected SVs were flanked by repetitive sequences (Fig. 5b and Supplementary Information).
The frequencies of mutations in our control samples are consistent with literature values (Table 1 and Supplementary Information)25,
The molecular underpinning of persistence has proven tricky to define, partly because there may be multiple sources that render cells stress persistent. We show here that in budding yeast, strong DNA damage triggers phenotypic persistence. We propose that this coupling increases the probability that an emerging indel or SV will become fixated in the population specifically in severe stress conditions, where being a persister provides a significant survival advantage. This could help cells to better adapt to novel unexpected environments that demand unpredictable genetic changes. Phenotypic persistence is typically viewed as a bet-hedging strategy, increasing population fitness under fluctuating environments. Our study suggests a complementary outcome of increasing genetic diversity under extreme stress conditions.
Strains were constructed on verified clones from the BY4741 GFP yeast collection28. Diploids were generated by mating the BY4741 strain with BY4742. Standard gene deletion and tagging procedures were used29. mec1 and rad53 mutants also carry the suppressor sml1 deletion.
All HO nuclease strains are derivatives of JKM179 (ref. 30), which contains an HO cleavage site at the endogenous mating locus, and an integrated HO nuclease under the control of the inducible GAL1 promoter.
Cells were grown in standard synthetic complete media with 2% glucose (SD). For FACS analysis and microscopy, serial dilutions were inoculated, and cultures that grew to log phase at 30 °C with a maximal optical density (OD) 0.2 were assayed the next day. We noted that the Hsp12 FACS profiles of cultures with a higher OD occasionally shifts to a higher mean. When this occurred during an experiment, the culture was replaced with a lower OD sample from the dilution series.
Liquid cultures of cells bearing the GAL-inducible HO system were grown in synthetic complete medium supplemented with 0.1% glucose and 2% raffinose. HO induction was carried out with medium containing 0.1% glucose, 2% raffinose and 2% galactose for 7 h before fluphenazine treatment. For cell cycle synchronization in G1, cells were treated with 5 μg ml−1 α-factor for 3 h and then challenged with fluphenazine.
FACS analysis, sorting, single-cell stress-survival experiments
Populations were monitored using BD FACSAria and BD FACSAria Fusion cell sorters. Gating was done for single cells based on the FSC-A and SSC-A counts, which are associated with cell size and geometry. GFP or mCherry levels for gated cells were then monitored versus their FSC-A level. In every experiment, 100,000 cells were analysed to obtain Hsp12 levels in the population, and the fluorescence of the 0.1% highest Hsp12 levels were used as the cutoff for extreme cells. For the mutator strains, a wild type culture analysed in parallel was used to determine the cutoff intensity for the ‘wt extremes’, and ‘internal extremes’ were calculated internally as the top 0.1% in the mutator population. Enrichment of ‘wt extreme’ cells in mutator strains was calculated as the percent of cells in the mutator population with Hsp12 levels higher than the extreme cell cutoff measured in the parallel wild type culture. When multiple wild type and mutator populations were measured the same day, the average of the wild type and mutator Hsp12 intensities were used.
Single cells were sorted into 96-well plates with SD media using the ‘single cell’ FACS sorting accuracy setting. Sorting efficiency was very high, as judged by colony growth in nearly all wells into which a single control cell was sorted (Fig. 1d,e). In over 90% of the wells that had growth, only a single colony was observed. The minority of cases where more than one colony grew from the single cell were ignored and subtracted from the total number of wells.
Cells were sorted row by row, each row consisting of half control cells (up to 99.9% Hsp12 levels) and half extreme cells. Cells were sorted into SD media and incubated at 30 °C for unstressed conditions, or sorted directly into SD supplemented with 0.29 mg ml−1 of water-dissolved fluphenazine dihydrochloride (Sigma F4765), 1.5 M NaCl or 7–10% EtOH. The cells were grown in the above medium, and colonies were counted after 3–4 d for unstressed conditions, and 6–7 d for stress conditions. Stress experiments were always carried out in parallel to unstressed conditions to control for sorting accuracy and efficiency.
For heat-shock pulses, plates were preheated in an incubator to 56 °C for 15 min and then placed on the preheated sorting platform. Cells were sorted into the preheated plates (4–5 min sorting time) and then returned to the 56 °C incubator for a total of 20 min from the beginning of the sorting. Following the 20 min heat shock, plates were incubated at 30 °C and colonies were counted after 3–4 d.
FACS competition experiments
The Hsp12-mcherry haploid (MATα) strain was crossed with BY4741 (GFP negative) or a BY4741 strain with the highly expressed constitutive Tdh3 gene C-terminally tagged with GFP (GFP positive). GFP positive and negative strains were inoculated and grown in the same tube for FACS analysis and sorting. Single-cell sorting precision was used to sort GFP positive, Hsp12-mCherry extreme cells, together with GFP negative, Hsp12-mCherry control cells. As a control, extreme cells were GFP positive and control cells were GFP negative in approximately half the experiments, and vice versa in the other half.
One hundred extreme cells and 100 control cells were sorted from the same culture into a single well of a 96-well plate for competition assays between equal amounts of initial extreme and control cells (Fig. 1g). The 200 cells were sorted directly into SD media or SD with the specified stress. The volume of sorted liquid is negligible (immeasurable) for 200 cells, leaving the final stress concentration unchanged. One hundred extreme cells and 10,000 or 100,000 control cells (for initial ratios of 1:100 or 1:1,000, respectively) were sorted into 15 ml tubes, after which the final volume was measured, and the stress was uniformly applied by adding NaCl to the indicated final concentration (Fig. 1g).
The sorted cells were incubated in a shaker at 30 °C until saturation. At least 10,000 cells were then FACS analysed, and GFP positive and negative cells were readily distinguished to measure the fraction of cells originating from extreme cells versus cells originating from control cells in the final population.
The CellASIC ONIX Microfluidic Platform was used as per manufacturer instructions. Y04C and Y04D microfluidic plates were utilized for haploid and diploid cells, respectively. SD was supplemented with 10% polyethylene glycol (SD-PEG) MW 3,350 to increase the viscosity and minimize cell movement from the microfluidic flow. For fluphenazine killing experiments, SD-PEG (or raffinose-based SC for HO experiments) supplemented with 1.5–1.7 mg ml−1 fluphenazine was pulsed in for 30–45 min, followed by fluphenazine-free media to confirm viability. Strains and treatments that were directly compared with each other were imaged in parallel chambers and conditions in the same microfluidic device.
Microscopy and image analysis
Growth of microcolonies was observed with widefield fluorescence time-lapse microscopy at 27 °C using an Olympus IX81-ZDC inverted microscope with a motorized stage (Prior). Image sets were acquired with a Hamamatsu Back-thinned ORCA-II-BT or QIMAGING Retiga R6 CCD (charge-coupled device) cameras. Fluorescent proteins were excited using EXFO X-Cite light source and Chroma 49002ET-GFP and 49008ET-mCherry filter sets. Images were acquired with 20 min intervals.
Analysis was performed using ImageJ and FIJI software and plugins, together with a macro for segmenting cells of microcolonies at the time point preceding the addition of fluphenazine. Segmentation was carried out on brightfield images of yeast slightly out of focus, so that their contour is highlighted in black. Briefly, the macro created boundaries by inverting the image, enhancing boundaries using a band-pass filter, and applying a threshold. Boundaries were then closed by two dilate and erode rounds, and segments were created by inverting the image and applying watershed. Inaccurate and unsegmented cells were then corrected and segmented manually.
Segmentation was used to analyse mean Hsp12 levels in GAL–HO induced cells directly before fluphenazine treatment. The segments were applied to the fluorescent images in which a rolling ball with a radius of 50 pixels was used to subtract the background. For measuring % survival and % foci in microcolonies after GAL–HO induction, nuclei were counted to calculate the number of cells, and foci were scored manually, only if present at least two consecutive time points (40 min). For cell survival analysis, cell death following addition of fluphenazine was readily detected by cell bursting. Cell viability of cells that did not burst was confirmed by subsequent growth in SD free of the drug, except for the experiment in which cell death was monitored as a function of time in continuous fluphenazine.
Analysis of growth dynamics of unperturbed cells that experienced a long-lasting Rad52 focus that lead to extreme Hsp12 induction were performed manually. Cells were segmented and tracked over many divisions before, during and after the Rad52 focus and Hsp12 induction, and their size, Hsp12-mCherry levels, presence of Rad52-GFP foci and anaphase events (using the Rad52-GFP nuclear signal) were scored.
Bulk fluphenazine survival assays
An overnight culture was diluted in SD to an OD of 0.05, grown to OD 0.25, and subjected to 0.5 mg ml−1 fluphenazine. At the indicated times, 1 ml samples were extracted, spun down at 17,000×g, 900 μl were removed and replenished with fresh SD. Tenfold serial dilutions were prepared, and 100 μl of the dilutions were plated in duplicate on SD plates and incubated at 30 °C for 2 d. Plates with 100–500 colonies were counted, their dilution factor was taken into account and the final % survival was calculated by using the number of colonies before drug addition (time 0) as total cell survival. There was no cell division during the course of the experiment as verified by counting cells using a hemocytometer at time points 0, 60, 120 and 270.
Diploids co-expression analysis
Diploid cells were mCherry-tagged at the Hsp12 locus and GFP tagged the indicated locus. To define the extent of co-expression, 100,000 cells were analysed by flow cytometry. The resulting two-dimensional (GFP and mCherry) single-cell distributions of fluorescence levels were analysed as follows. Fluorescence levels were binned by the cumulative distribution fractions so that the first bin (1,1) included all cells, the bin (2,2) included the 30% of the cells with highest expression in both GFP and mCherry, the bin (2,3) included the cells with 30% highest GPF and 10% highest mCherry, and so on. The fraction of cells within each bin f ij was then calculated. This fraction was then normalized by the expected fraction, assuming no correlation between GFP and mCherry expression. The generated matrix was then corrected for cell size, by repeating the procedure above, this time considering coordination between GFP (or mCherry) versus FSC-W.
Selecting extreme and control cells for sequencing
A total of three sequencing experiments were carried out. Cells were struck out on a YPD plate, and a small single colony was inoculated in 5 ml of liquid SD for a total of 46 h. The cell density of the sorted culture was measured using a hemocytometer, and the colony founder cell was calculated to undergo 25, 24 and 23 generations (for experiments 1, 2 and 3, respectively) with a doubling time of ~110 min until sorting began. This culture was FACS sorted into single control and extreme cells, which were grown to saturation in 1.5 ml. Note that if a mutation occurred after the sorted single cell divided, it would be present in only half the cells after one division, one quarter after two division, and so on, and the fraction of reads with mutations would decrease exponentially with divisions. Therefore, as we detect the mutations in most of the reads, the mutations must have occurred in the original sorted cell.
For library preparation, DNA was extracted by blending the cells in 300 μl lysis buffer (50 mM HEPEs pH 7.5, 140 mM NaCl, 1 mM EDTA, 1% TritonX-100, 0.1% sodium deoxycholate) with 0.5 mm zirconium oxide beads in a Bullet Blender 24 (Next Advance) for 1 min at level 8. Cleared lysate was sonicated for 20 min (0.5 s on, 0.5 s off) in a Bioeruptor plus (Diagenode) cooled water bath sonicator, resulting in an average DNA fragment size of ~300 bp. Lysates were RNAse treated for 1 h at 37 °C, and then proteinase K treated for an additional 2 h at 37 °C. From each sample, 10 μl of the lysate was taken and a multiplexed library was prepared for sequencing31. Libraries were sequenced in an Illumina HiSeq 2500 with 100 bp paired-end sequencing.
A total of 334 control cells, 425 extreme cells and 4 reference cells were whole-genome sequenced in three independent experiments. Of these, 730 samples (315 control cells, 412 extremes and 3 reference cells) met our minimal requirement of 10× coverage of 90% of the genome and did not exhibit strong fluctuations of the coverage. Seventy-five percent of the genome within the samples analysed had a mean coverage of 56× or higher. The raw data for all sequenced genomes are available at http://www.ebi.ac.uk/ena/data/view/PRJEB17891.
Genome alignment and variant calling
We set up a pipeline for determining genetic variants in yeast. More specifically, we aim to determine single nucleotide polymorphisms (SNPs), small insertion and deletions (indels), and structural variations (SVs). To that end we use the Illumina paired-end sequencing to sequence 100 bp and 125 bp, respectively, of both ends of each DNA fragment. After an initial quality control of the raw sequencing data (using FastQC, version 0.11.3, http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) we trim adapters using cutadapt, version 1.832 and redo the quality control with FastQC. Then we align the read pairs using bwa-mem (version 0.7.12)33 against an S288C (R64-1-1) reference sequence for Saccharomyces cerevisiae 34 and a reference sequence for Saccharomyces paradoxus (para2 from the Saccharomyces Genome Resequencing Project, ftp://ftp.sanger.ac.uk/pub/users/dmc/yeast/latest/misc.tgz). These alignments need further processing and thus we remove duplicates, sort, clean and index the alignments using the Picard tools (http://broadinstitute.github.io/picard/ version 2.4.1). We then check the read coverage of the reference genome using the Genome Analysis Toolkit (GATK, GenomeAnalysisTK-3.5) from the Broad Institute35 and removed samples with insufficient coverage (less than 90% of the genome with coverage above 10×) or with unexpected strong fluctuating coverage. Mutations that were present in most sequenced samples were removed, as they correspond to constitutive variations between the reference genome and the founder cell, or mutations that arose early in the lineage and divided, as opposed to during the extreme event that was sorted for. We note that the vast majority of mutations removed this way were false positive; however, we did note a few cases in which mutation shared between the samples appeared real, perhaps indicating some ancestral relationship or positive contribution to fitness.
Defining SNPs and indels
For the discovery of de novo SNPs and indels, we run the HaplotypeCaller with parameter settings (-ploidy 1, -maxAltAlleles 1, -stand_call_conf 10, -stand_emit_conf 10) from GATK36 on each sample, then collect all variants and rerun the HaplotypeCaller (HC) only on the predetermined loci ±150 bp (using the R packages GRanges37 and VariantAnnotation38 to collect the data) jointly on all samples (-ploidy 1, -maxAltAlleles 2, -stand_call_conf 10, -stand_emit_conf 10). We then scaled the minimal non-zero normalized, Phred-scaled genotype likelihoods (referred to as genotype quality) of each variant by the locus depth plus 0.25 × mean coverage and removed low coverage variants (local depth <10 and <20 specifically for telomeric regions due to their repetitive nature). For SNPs, we demanded a high depth-scaled genotype quality for the median of the reference calls and a depth-scaled genotype quality of at least 10. For indels, based on extensive Sanger validations, we set the threshold at a depth-scaled genotype quality of 10. Applying these stringent criteria retains high-quality variants with a presumably very low number of false positives, leads to a good performance with respect to all other quality metrics and ensures that at least ~60% of the reads covering the locus carry the alternative allele, the vast majority exhibiting more than 80% alternative reads (see Supplementary Information for details). We did not differentiate between repetitive and unique regions in our analysis; however, most of the loci with de novo variations had a mean mapping quality >50× indicating their location in a unique region of the genome. Finally, we used freebayes (version v0.9.21-7-g7dd41db39) to check whether we recover all of the SNPs determined by HC.
To determine further structural variations, we applied a split read approach using the gapped alignment of reads (PINDEL40, version 0.2.5b6) with parameter settings (--window_size 0.5, --max_range_index 4, --report_long_insertions true, --report_interchromosomal_events false), thus disabling the search for interchromosomal variants. A region from chromosome 12 exhibiting excessive coverage across samples was excluded from the search. The program reports on potential deletions, insertions, inversions and tandem duplications. As we are only interested in de novo mutations, we retain variants called in only one sample, and prefilter for those having on average a support of ≥4 unique reads per sample, with at least one left and right anchor read and a ratio of the number of supporting reads to the number of reference reads ≥0.1. A final filtering step keeps only those variants with more than 10 unique supporting reads. The maximum size of structural variations to be detected by this split read alignment is set to 8,092; however, only medium-size inserts up to a size of about 80 bp and 100 bp, respectively, can be detected as this is limited by the read size. To exploit the information from discordant mapped and split reads as reported by bwa mem we used LUMPY41 (version 0.2.13). To obtain potential candidates, we require that there is support by more than five reads (split read or paired end), the relative support (support scaled by mean coverage of each sample) is larger than 5% and the fraction of the relative support is larger than 25%. The last requirement makes sure that the variants are only called in a small number of samples. As opposed to all other analysis we used SAMBLASTER42 (version 0.1.22) to mark duplicates on untrimmed reads and to filter discordant and split alignments which are in turn fed into LUMPY.
In addition, we also applied a read-depth method analysing the number of read starts in windows of a size of 250 bp overlapping by 50 bp as determined by FREEC (version 7.043,44). We normalize observed read counts per sample and per locus to remove corresponding biases and then filter on these normalized counts as well as demanding a size of 500 bp (loss of coverage) and 750 bp (gain of coverage), respectively, and a mean mappability larger than 0.75 to find genomic ranges with a gain or loss of coverage. For the determination of regions with an increase of coverage, we furthermore require the mean number of counts to be larger or equal to 60 and a separation of the observed signal from the background, and remove regions exhibiting a reciprocal overlap between different samples ≥50%. This was only necessary in experiments 1 and 2. This approach allows us effectively to determine ranges exhibiting a loss or gain of coverage with a size larger than about 0.5 to 1 kbp. Also, here we removed regions from chr8 and chr12 exhibiting excessive coverage (see Supplementary Information for details).
Detected SVs were visualized by plotting the normalized number of read counts in overlapping 250 bp windows alongside the lower and upper quartile of the background distribution of counts scaled by the median counts of the corresponding chromosome (grey). Read number is normalized by the local median chromosome scaled read counts. Repetitive sequences and genes flanking the mutated locus are shown with colour indicating orientation.
Testing for chromosome aneuploidy
We tested for chromosome aneuploidy, excluding the region from chr12 exhibiting excessive coverage, using samtools45 (version 1.2) and normalized copy number profiles inferred from the FREEC data. Cell ploidy was verified by DNA staining FACS analysis46.
Defining mutation enrichment based on literature values
Considering a specific mutation type, occurring at a frequency p per nucleotide site, per division, the probability of not being mutated equals 1 – Ng p, where Ng is the genome size. Consequently, the probability s to observe one or more mutations acquired during the 25 cell divisions before the single-cell bottleneck, is calculated as s = 1 – (1 – Ng p)25. The probability of obtaining exactly Nm mutated samples from the total Ns characterized samples is given by the binomial distribution B(Ns
, s). We therefore perform a binomial test against the alternative hypothesis that the true probability to observe one or more mutations is greater than s based on literature estimates25,
All sequenced genomes are available at http://www.ebi.ac.uk/ena/data/view/PRJEB17891.
How to cite this article: Yaakov, G., Lerner, D., Bentele, K., Steinberger, J. & Barkai, N. Coupling phenotypic persistence to DNA damage increases genetic diversity in severe stress. Nat. Ecol. Evol. 1, 0016 (2017).
We thank our group members for discussions and comments. We thank the Weizmann Institute of Science Flow Cytometry unit for support and O. Golani for help with the cell segmentation macro. This work was supported by the Minerva Center, European Research Council and the Israel Science Foundation. K.B. was supported by the European Molecular Biology Organization long-term fellowship.
DNA damage precedes Hsp12 induction.
Emergence of drug-tolerant, DANN-damage-induced extreme cells in an unperturbed population A.
Emergence of drug-tolerant, DANN-damage-induced extreme cells in an unperturbed population B.
Emergence of drug-tolerant, DANN-damage-induced extreme cells in an unperturbed population C.
HO-induced DSBs trigger persistence A.
HO-induced DSBs trigger persistence B.