Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

DSBCapture: in situ capture and sequencing of DNA breaks


Double-strand DNA breaks (DSBs) continuously arise and cause mutations and chromosomal rearrangements. Here, we present DSBCapture, a sequencing-based method that captures DSBs in situ and directly maps these at single-nucleotide resolution, enabling the study of DSB origin. DSBCapture shows substantially increased sensitivity and data yield compared with other methods. Using DSBCapture, we uncovered a striking relationship between DSBs and elevated transcription within nucleosome-depleted chromatin.

Access options

Rent or Buy article

Get time limited or full article access on ReadCube.


All prices are NET prices.

Figure 1: DSBCapture methodology and comparison to BLESS.
Figure 2: Genomic location and epigenetic context of endogenous DSBs in NHEK cells.

Accession codes

Primary accessions

Gene Expression Omnibus

Referenced accessions

Gene Expression Omnibus


  1. 1

    Srivastava, M. & Raghavan, S.C. Chem. Biol. 22, 17–29 (2015).

    CAS  Article  Google Scholar 

  2. 2

    Jackson, S.P. & Bartek, J. Nature 461, 1071–1078 (2009).

    CAS  Article  Google Scholar 

  3. 3

    Rodriguez, R. et al. Nat. Chem. Biol. 8, 301–310 (2012).

    CAS  Article  Google Scholar 

  4. 4

    Tsai, S.Q. et al. Nat. Biotechnol. 33, 187–197 (2015).

    CAS  Article  Google Scholar 

  5. 5

    Crosetto, N. et al. Nat. Methods 10, 361–365 (2013).

    CAS  Article  Google Scholar 

  6. 6

    Marchuk, D., Drumm, M., Saulino, A. & Collins, F.S. Nucleic Acids Res. 19, 1154 (1991).

    CAS  Article  Google Scholar 

  7. 7

    Aird, D. et al. Genome Biol. 12, R18 (2011).

    CAS  Article  Google Scholar 

  8. 8

    Mitra, A., Skrzypczak, M., Ginalski, K. & Rowicka, M. PLoS One 10, e0120520 (2015).

    Article  Google Scholar 

  9. 9

    Aymard, F. et al. Nat. Struct. Mol. Biol. 21, 366–374 (2014).

    CAS  Article  Google Scholar 

  10. 10

    Biffi, G., Tannahill, D., McCafferty, J. & Balasubramanian, S. Nat. Chem. 5, 182–186 (2013).

    CAS  Article  Google Scholar 

  11. 11

    Ribeyre, C. et al. PLoS Genet. 5, e1000475 (2009).

    Article  Google Scholar 

  12. 12

    Paeschke, K., Capra, J.A. & Zakian, V.A. Cell 145, 678–691 (2011).

    CAS  Article  Google Scholar 

  13. 13

    Chambers, V.S. et al. Nat. Biotechnol. 33, 877–881 (2015).

    Article  Google Scholar 

  14. 14

    ENCODE Project Consortium. Nature 489, 57–74 (2012).

  15. 15

    Gursoy-Yuzugullu, O., Ayrapetov, M.K. & Price, B.D. Proc. Natl. Acad. Sci. USA 112, 7507–7512 (2015).

    CAS  Article  Google Scholar 

  16. 16

    Storch, K. et al. Cancer Res. 70, 3925–3934 (2010).

    CAS  Article  Google Scholar 

  17. 17

    Falk, M., Lukásová, E. & Kozubek, S. Biochim. Biophys. Acta 1783, 2398–2414 (2008).

    CAS  Article  Google Scholar 

  18. 18

    Fong, Y.W., Cattoglio, C. & Tjian, R. Mol. Cell 52, 291–302 (2013).

    CAS  Article  Google Scholar 

  19. 19

    Yang, F., Kemp, C.J. & Henikoff, S. Mutat. Res. 773, 9–15 (2015).

    CAS  Article  Google Scholar 

  20. 20

    Schwer, B. et al. Proc. Natl. Acad. Sci. USA 113, 2258–2263 (2016).

    CAS  Article  Google Scholar 

  21. 21

    Lensing, S.V. et al. Protocol Exchange (2016).

Download references


We thank G. Legube, LBCMCP, Center for Integrative Biology (CBI), Université de Toulouse, Toulouse, France for providing U2OS AID-DIvA cells. We thank the genomic core facility at the Cancer Research UK Cambridge Institute. R.H.-H. acknowledges EMBO for support (EMBO Long-Term Fellowship to R.H.-H.). We acknowledge support from the University of Cambridge and the Cancer Research UK program. The Balasubramanian laboratory is supported by core funding from Cancer Research UK (C14303/A17197 to S.B.) and by an ERC Advanced Grant (S.B.). S.B. is a senior investigator of the Wellcome Trust.

Author information




S.V.L. developed the DSBCapture method, conceived the study, conducted experiments, interpreted results and wrote the manuscript. G.M. conceived the study, performed bioinformatics analyses, interpreted results and wrote the manuscript. R.H.-H. contributed to the development of the DSBCapture method, conceived the study, conducted experiments, contributed to bioinformatics analyses, interpreted results and wrote the manuscript. E.Y.L. contributed to the development of the DSBCapture method, conceived the study and conducted experiments. D.T. conceived the study, interpreted results and wrote the manuscript. S.B. conceived the study, interpreted results and wrote the manuscript.

Corresponding author

Correspondence to Shankar Balasubramanian.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Integrated supplementary information

Supplementary Figure 1 DNA processing workflows, adapter sequences and controls.

(a) BLESS DNA processing workflow following the ligation of both proximal and distal adapters. DNA is digested by I-SceI, PCR amplified, digested by XhoI and subsequently subjected to Illumina library preparation, consisting of end repair, size selection (not shown), A-tailing, Illumina adapter ligation and PCR amplification. Large black arrows indicate the site at which sequencing is initiated; the first 11 bases sequenced are shown. (b) DSBCapture DNA processing workflow following the ligation of both modified P5 and P7 Illumina adapters. The DNA is PCR amplified, size selected (not shown), and sequenced. Large black arrow indicates the site at which sequencing is initiated: the first base sequenced identifies the in situ captured break site. (c) Sequences of the modified P5, modified P7 and control modified P5 Illumina adapters as well as DSBCapture PCR primers (forward (PCR F) and reverse (PCR R)). AD identifies the Illumina adapter barcode sequence, three example reverse primers are shown; further primers can be created by substituting the barcode sequence. B = biotin; P = phosphorylated; * = phosphorothioate bond. (d) Orientation of the DSBCapture library on the Illumina flow cell. The first sequencing primer has complementarity to the P5 Illumina adapter and therefore sequencing is initiated from the P5 end. The ligation of the modified P5 Illumina adapter to the DSB in situ enables direct sequencing of the break site in single-end sequencing. (e) Bioanalyser profiles of the DNA products from DSBCapture and BLESS NHEK libraries. DSBCapture: no product is present in the controls performed without T4 DNA ligase during the first (-T4.1) or second (-T4.2) ligation reactions, or in the control performed with the non-biotinylated control modified P5 lllumina adapter (C). A DSBCapture library is only generated when the complete procedure is carried out (+). BLESS: No product is present in the control performed without T4 DNA ligase during the first ligation reaction (-T4.1). The product of BLESS is shown before Illumina library preparation (I-SceI; diluted 1:10) and after Illumina library preparation (Lib).

Supplementary Figure 2 DSBs mapped by DSBCapture at EcoRV and AsiSI restriction sites.

(a) DSBs created by EcoRV cleavage in fixed nuclei, mapped by DSBCapture (n = 1). PCR duplicates have been removed. Data range is shown in square brackets and black boxes illustrate the genomic location of EcoRV sites. A 20 kb region and a 110 bp region are shown. Pink and purple lines: reads from the sense and antisense strand, respectively. As EcoRV is a blunt cutter, reads originate directly from the cleavage site. (b) AsiSI cleavage sites (black boxes) detected by DSBCapture (n = 1). Cleavage by AsiSI generates a 2 bp 3’ overhang; end processing removes this overhang generating the 2 bp gap in the center of the peak. A 2 kb and a 200 bp region are shown.

Supplementary Figure 3 Overlap of DSBs detected by DSBCapture and GUIDE-seq in U2OS cells.

(a) Venn diagram showing the overlap between the DSBCapture peaks and the 25 sites detected by GUIDE-seq4. (b) Genomic tracts showing the 9 DSBs detected by GUIDE-seq that are also detected by DSBCapture. Each panel shows a genomic view of 2,000 bp around the GUIDE-seq detected DSB hotspot. In each panel, from top to bottom: DSBCapture coverage (grey track); peaks detected in DSBCapture by peak calling (black track); GUIDE-seq sites (dark blue track); RefSeq gene track; reference genome sequence (hg19).

Supplementary Figure 4 Analysis of DSBs detected by DSBCapture and BLESS.

(a) Overlap of peaks between two biological replicate experiments for DSBCapture and BLESS. Peaks called in both replicates (high confidence peaks) were used for data analysis. 84,946 and 18,816 high confidence peaks were identified in DSBCapture and BLESS, respectively. (b) Overlap between the high confidence peaks (shown in a) from the BLESS and DSBCapture experiments. The vast majority (98.6 %) of the BLESS peaks are also identified by DSBCapture, whereas 78.2 % of DSBCapture peaks are unique to this method. (c) Venn diagram showing the overlap of DSBs detected as peaks by DSBCapture performed with 50 μg and 20 μg input material. 74,951 peaks are commonly identified by the two conditions (n = 1). (d) Fraction of DSBs with different GC content in the DSBCapture unique peaks divided by the fraction of peaks shared between BLESS and DSBCapture within the same GC content range (fold enrichment). A fold change greater than one represents an increase in DSBs with that particular GC content in the DSBCapture unique peaks. (e) Fold enrichment of DSBCapture peaks with OQs13, calculated as the number of DSBs overlapping to OQs at each indicated % GC sequence content category (x-axis labels) divided by random overlap. All = all 716,311 OQs, irrespectively of GC content; error bars: standard deviation of the fold enrichment over random.

Supplementary Figure 5 Correlation of DSBs with chromatin marks, genic regions and transcription.

(a) Genomic location of DSBs detected by DSBCapture with respect to histone marks H2A.Z, H3K4me3, H3K4me1, H3K27ac, H3K27me3 as well as DNase and POL2B. Two 100 kb genomic regions are shown; upper: a gene dense region on chromosome 19; lower: a region upstream of the EGFR gene. The data range is shown in square brackets, 5’UTRs are highlighted with red boxes. (b) Number and fold enrichment of DSBs at active and inactive enhancers. (c) Fold enrichment of DSBs in genic and intergenic regions over random. Values > 1 indicate that DSBs are preferentially found within that genomic location. Error bars: standard deviation of the fold enrichment over random. (d) Gene expression values measured as rpkm for genes with (left box) or without (right box) DSBs within ± 1 kb of the TSS. Boxes span from the 25th to the 75th percentile with the median marked by a solid bar. All whiskers extend from the 5th to the 95th percentile.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–5 and Supplementary Tables 1–4. (PDF 1441 kb)

Supplementary Software

DSBCapture_code_submitted (ZIP 2 kb)

Source data

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Lensing, S., Marsico, G., Hänsel-Hertsch, R. et al. DSBCapture: in situ capture and sequencing of DNA breaks. Nat Methods 13, 855–857 (2016).

Download citation

Further reading


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing