Sequence dependent UV damage of complete pools of oligonucleotides

Kufner, Corinna L.; Krebs, Stefan; Fischaleck, Marlis; Philippou-Massier, Julia; Blum, Helmut; Bucher, Dominik B.; Braun, Dieter; Zinth, Wolfgang; Mast, Christof B.

doi:10.1038/s41598-023-29833-0

Download PDF

Article
Open access
Published: 14 February 2023

Sequence dependent UV damage of complete pools of oligonucleotides

Corinna L. Kufner¹,
Stefan Krebs²,
Marlis Fischaleck²,
Julia Philippou-Massier²,
Helmut Blum²,
Dominik B. Bucher³,
Dieter Braun⁴,
Wolfgang Zinth⁵ &
…
Christof B. Mast⁴

Scientific Reports volume 13, Article number: 2638 (2023) Cite this article

1214 Accesses
2 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Understanding the sequence-dependent DNA damage formation requires probing a complete pool of sequences over a wide dose range of the damage-causing exposure. We used high throughput sequencing to simultaneously obtain the dose dependence and quantum yields for oligonucleotide damages for all possible 4096 DNA sequences with hexamer length. We exposed the DNA to ultraviolet radiation at 266 nm and doses of up to 500 absorbed photons per base. At the dimer level, our results confirm existing literature values of photodamage, whereas we now quantified the susceptibility of sequence motifs to UV irradiation up to previously inaccessible polymer lengths. This revealed the protective effect of the sequence context in preventing the formation of UV-lesions. For example, the rate to form dipyrimidine lesions is strongly reduced by nearby guanine bases. Our results provide a complete picture of the sensitivity of oligonucleotides to UV irradiation and allow us to predict their abundance in high-UV environments.

Specificity and Efficiency of the Uracil DNA Glycosylase-Mediated Strand Cleavage Surveyed on Large Sequence Libraries

Article Open access 28 November 2019

DNA damage detection in nucleosomes involves DNA register shifting

Article 29 May 2019

Base-resolution UV footprinting by sequencing reveals distinctive damage signatures for DNA-binding proteins

Article Open access 11 May 2023

Introduction

The genome plays a central role as a blueprint for all known life, therefore damage of the information carrier DNA and the related faulty readout by polymerases is a major threat. In severe cases, DNA modifications can result in cell death or in defective organisms. DNA damage may be caused by reactive compounds, for example by reactive oxygen species, or by energetic radiation. In the latter case, energetic particles can break the backbone of DNA strands. Softer radiation in the ultraviolet (UV) range can alter the structure of DNA bases, for example by sequence-specific reactions such as the formation of cyclobutane pyrimidine dimers (CPD). UV-radiation damage of DNA should not only be considered harmful as a threat to genetic integrity but is currently used in medical applications such as cancer treatment or sterilization.

For better protection against DNA damage and for optimizing medical efficacy, a detailed understanding of damage formation and its mode of action is necessary. Information on a molecular level has been obtained especially for short DNA strands by chemical and physical methods. The damage quantum yields and molecular structures of damages in short oligomers and single dimers have been investigated for several decades^1,2,3,4. Important information on the molecular mechanisms of damage formation was obtained. For instance, it has been shown that the CPD lesion is formed predominantly via the singlet channel within sub-picoseconds and molecular reasons for the frequency of damage formation have been discovered^{5,6,7,8,9,10,11}. The combination of experimental results supplied important knowledge on the damage of short DNA strands¹².

For long DNA strands e.g. genomic DNA, biological methods such as next-generation/high-throughput sequencing (NGS, HTS), combined with tailored enzyme assays, have provided significant insights¹³. Examples are Excision-seq¹⁴, CPD-seq¹⁵, HS-damage-seq¹⁶, (t)XR-seq¹⁷ or Adduct-seq¹⁸. Most of them involve a cascade of enzymatic preparation steps such as fragmentation, repair, ligation, cleavage or digestion. Lesions caused by UV radiation or reactive chemicals such as cisplatin, were successfully excised from the genomic DNA in a region around the damage and identified using specific endonucleases or additionally used lesion-specific antibodies. Enzymatic removal of the damage then allowed for next generation sequencing. The genome-wide characterization of damage formation thus made it possible to pursue previously unknown possibilities for building a platform for DNA damage-induced mutagenesis^{16,17,19,20,21,22,23}. Additionally, significant experience in tracking dynamic changes in sequence has been made with techniques such as kinetic sequencing, applied for example to measure the activity of ribozymes²⁴.

Here, we demonstrate an approach which determines UV dose dependence of damage formation in DNA simultaneously for all 4096 possible hexamer sequences within a single experiment, only utilizing a commercial sequencing library preparation after irradiation and subsequent data analysis. Despite its simplicity, our technique is able to record bi-exponentialities due to secondary reactions of damage states, resolves the reduction in the damage rate of di-pyrimidine lesions by nearby guanines, and provides previously unavailable damage rates for DNA lengths all the way to the hexamer. The damage is detected via the specific termination of polymerase function at the lesion site during the preparation step for next-generation sequencing (NGS). By using a suitable polymerase for damage detection, the biological relevance of the corresponding damage is thus implicitly taken into account. The relative change in frequency per sequence obtained from NGS reflects the damage process. Data analysis then reveals the sequence-dependent damaging for all possible sequences and allows to determine the respective damage quantum yields.

Results

Our procedure comprises five steps to obtain the full sequence and dose dependence of damage formation from UV-irradiation (see Fig. 1 and Supplementary Information SI-1):

1. DNA synthesis A complete pool of oligomers with randomly distributed DNA sequences is synthesized with multiple copies of each sequence. We combine a central part, which contains the randomized nucleotides with tail sequences at the 5’ as well as at the 3’ end to facilitate analysis. In the present example the randomized part has a length of eight nucleotides and both tails have the sequence ACAC.
2. Exposure to damage inducing process The randomer pool is exposed to UV-radiation (266 nm) to induce damaging photoreactions. At defined irradiation doses portions of the DNA-pool are extracted, to produce a set of samples with ascending exposure doses up to ca. 500 photons/base. The different samples are identically handled in the subsequent steps.
3. Discrimination These samples undergo a discrimination step, which allows only intact DNA-oligomers to be processed in the subsequent sequencing step. In the present example, the discrimination occurs via the library preparation, which attaches pre-defined adapter strands to the DNA and amplifies the products in a polymerase chain reaction. The polymerases used in this process stop at previously generated UV lesions. Consequently, only strands that allow the polymerases to replicate the complete strand can be successfully prepared for sequencing and are therefore recognized as intact.
4. Sequencing Only the undamaged oligomers can be successfully processed, which allows the determination of their abundance in the sequencing process. Strands, where a large fraction is damaged by the UV-irradiation will yield only small relative readout frequencies. Since the presented technique relies on the number of surviving oligomers, the sequencing procedure should be performed in a way to yield a sufficiently large number of reads for each sequence in the randomized pool.
5. Analysis and quantification The frequencies of each sequence are finally analyzed and evaluated for each dose level so that the obtained two-dimensional result space allows for quantitative statements about the sequence-dependent damage. The analysis step is used to eliminate interference by possible intrinsic sequence dependencies in the synthesis, discrimination, or sequencing steps.

Due to the high number of extractions at well-defined dose values, our approach allows the investigation of complex damage models for complete sequence spaces. In the presented case, the complex sequence space covers all possible 4096 hexamers. In doing so, we can either analyze the influence of the sequence context by analyzing the full strands, or by summing up all sequence frequencies of shorter sequences in the central hexamer, in order to investigate its damage properties at a higher dynamic range. Different types of DNA-damages may be addressed by an appropriate choice of the polymerase used in the discrimination step. The technique thus combines the advantages of high-throughput sequencing, the parallel measurement of damage for a complete pool of DNA sequences, with those of quantitative methods in the determination of damage processes and quantum yields.

Sensitivity to CPD-lesions

In a first test-experiment we studied the ability of our approach to detect the most prominent DNA-photolesion, the cyclobutane pyrimidine dimer (CPD) between two adjacent thymidines, in which the atoms C5 and C6 from one thymine form covalent bonds to the corresponding atoms of an adjacent thymine^8,9 (see Fig. 2a). Upon absorption of photons in the dominant UV-C absorption band of DNA around 260 nm the CPD-lesion is formed predominantly via the singlet channel^5,6,7,11 with a quantum yield in the range of ca. 2 · 10^–2^2,4,25. This CPD-lesion causes a distortion of single and double-stranded DNA^26,27, which leads to a termination of the reading process for some polymerases^28,29. The recognition of the CPD-lesion by the library preparation kit used in the discrimination step (see “Materials and methods”) was tested on a number of DNA-hexamers containing a central native TT sequence or a central CPD lesion (T=T). The application of Step 3, discrimination and Step 4, sequencing, yielded the absolute frequencies of the different hexamers as given in Fig. 2b. Starting from the same amount of DNA, typical absolute frequencies of \(3\cdot {10}^{5}\) were obtained for the intact oligomers. For all oligomers which contained T=T lesions, the absolute frequencies are strongly reduced. We found small frequencies in the range of several ten to a few hundred. Apparently, the presence of the T=T-lesion causes an approximately thousand-fold reduction of the absolute frequencies. This experiment clearly demonstrates that the procedures used in Step 3 and Step 4, namely in library preparation and NGS are able to recognize the CPD damage with high sensitivity. The molecular modifications upon formation of the T=T lesion prevent the execution of at least one step in the library preparation procedure and prohibit the readout of the original sequence³⁰. The similarity of the molecular changes induced by CPD-lesions for the different di-pyrimidine lesions supports the notion that also T=C, C=T and C=C CPD-lesion are recognized by the procedure used. Here, the sensitivity of the employed technique might be deduced from a comparison of the results obtained by the present technique with literature values (see below). This approach may also be used to address the sensitivity of the presented technique towards other DNA-damages, particularly as the majority of dimer lesions alter the molecular structure of the bases significantly^25,31,32,33. UV-induced strand breaks³⁴ are also recognized as the fragments are rejected in the library preparation and sequencing steps 3 and 4.

Figure 2c shows the raw sequenced reads from strands comprising the randomized octamer and ACAC tails that are irradiated by different doses of UV-radiation (Step 2). Subsequently discrimination (Step 3) and sequencing (Step 4) are performed, which yield the frequencies of the different 16-mers as raw data. These frequencies are plotted against the sequence number of the central octamer. The figure shows that the frequencies vary strongly with the sequence. For non-irradiated oligomers (D = 0, blue) the largest frequency (4075 counts) is found for AAAAAAAA while many oligomers have frequencies close to the average value of 98 counts. Nearly all the 65,536 possible sequences show non-vanishing frequencies. The insert of Fig. 2c illustrates a narrow selection of oligomer sequences for different dose values. It is evident that the frequencies decay with increasing dose and that this decay may depend on the sequence. However, the strong variations of the frequencies of the raw data demonstrate that elaborate normalization procedures are required to obtain consistent frequencies for the different oligomers. For this purpose, we used reference sequences (poly-G oligomers with weak dose dependence). Details of the procedures for normalization and analysis used in step 5 are presented in the Supporting Information SI-1 to SI-9, SI-15.

Sequence dependent damage

The following paragraph focuses on the dependence of the frequencies of different oligomers on the applied UV irradiation dose D_j. All presented data were obtained for ACAC-tailed random octamers of type ACAC-NNNNNNNN-ACAC after the application of Step 1 to Step 5 of the measurement procedure. Since the ACAC-tails have an influence on the frequencies of the directly neighboring randomers via nearest neighbor interactions, we will only consider the frequencies of the 4096 oligomers of the central hexamer obtained upon summing over the outmost random nucleotides neighboring the tail parts. The relative frequencies of oligomers of shorter lengths can be calculated using the methods presented in the Supplementary chapters SI-2 to SI-4. These relative frequencies refer to the given oligomer with randomized neighbors.

In Fig. 3, we compare the evaluation of polymers of different lengths. Figure 3a shows the relative frequencies (color coded) for hexamers with sequence numbers (for an example of the numbering see SI-t3 and SI-t8, 9) between i = 3500 (TCGGTA) and i = 4095 (TTTTTT) as a function of exposure dose. The displayed sequences comprise oligomers with small decrease of frequency with dose (see e.g. around i = 3820, for oligomers containing a leading GTGT tetramer). Sequences rich in TT dimers (see i = 4095 (TTTTTT) or i = 3584 (TCTTTT), have rapidly decreasing relative frequencies which decay to < 0.2 on a dose range of 20 absorbed photon/base. Figure 3b gives quantitative information for the dose dependences of the relative frequencies for the poly-A oligomers (symbols). The mono-exponential fit functions exp(− µD_j) (solid curves) describe well the dose dependences. The decay constants of the fit functions are μ_AAAAAA = 0.015 base/photon, μ_AAAA = 0.0098 base/photon and μ_AA = 0.0039 base/photon. A typical relative error of these decay constants is in the order of 20%. The inverse of these numbers indicates how many photons are absorbed in each base to decrease the relative frequency of the respective oligomer to 1/e. For the hexamer, this number is ca. 67 photons absorbed in each base. Directly related is the quantum yield ϕ, i.e. the inverse of the total number of absorbed photons required to produce one damage. Since also photons absorbed in bases directly adjacent to the oligomer may damage (see SI-9), the quantum yield becomes ϕ = μ_polymer/(i + 1) = 2.1 · 10⁻³ (-AAAAAA-), 1.96 · 10⁻³ (-AAAA-) and 1.3 · 10⁻³ (-AA-). As expected, less photons are required (higher quantum yield) to damage the longer oligomers since they contain a larger number of damageable base pairs.

The poly-T oligomers (see Fig. 3c) show a behavior clearly different from that of poly-A. The thymine sequences exhibit a much faster initial decay in the range of ten photons/base. It is striking that the oligomer frequencies do not decrease completely with the initial decay. There is a secondary slower component. While the relative amplitude of the fast decay is in the range of 0.5 for TT, it becomes much larger for the tetramer and the hexamer. From this qualitative inspection of the data, it becomes evident that the frequency of poly-T oligomers cannot be described by a mono-exponential function. As discussed in the Supplementary Information SI-8, deviations from the mono-exponentiality point to a more complex reaction scheme. For instance, the photolesion may reconvert by secondary UV-absorption to the initial state (repair process) and an additional (irreversible) decay process may exist³⁵. For the given precision of the experimental data, a bi-exponential fit can lead to large uncertainties in fit-parameters (amplitudes and decay constants). However, the initial slope of the relative frequency, which contains information on the damage quantum yield can often be obtained with reasonable precision.

For the initial decay of the frequencies of the poly-T oligomers we find a strong acceleration with increasing number of bases. The corresponding damage coefficients range from μ_TT = 0.034 base/photon, μ_TTTT = 0.077 base/photon to μ_TTTTTT = 0.11 base/photon. Thus, the damage quantum yield of the poly-T oligomers becomes: ϕ = μ_polymer/(i + 1) = 0.011 (-TT-), 0.0154 (-TTTT-) and 0.016 (-TTTTTT-). These numbers indicates that the damage process in poly-T is at least 10-times more efficient than in poly-A.

Figures 4 gives information on the UV-damage of tetramers. Figure 4a to c present the dose dependence for a selection of tetramers together with fit curves. In Fig. 4d a 2D-plot shows the relative frequencies of all tetramers, while Fig. 4e and f give the data for narrow selections of sequences. The overview plot of 4d reveals ranges with very different dose dependences. There are sequences with negligible decay (see e.g. the yellow stripe around i = 170, where GT, GC or GG dimers form the tetramers). Rapid decays show up in the plot via an early appearance of a blue range. This happens for tetramers containing exclusively pyrimidines with several thymines. Details of the dose dependences are well represented by the plots on the left part of Fig. 4. Figure 4a shows the results for oligomers which contain the bases T and G. For TTTT (see Fig. 3b) we observe the rapid decay due to efficient damage and the deviation from the mono-exponential function which points to secondary photoreactions. CPD lesions are likely candidates for the primary photolesions in thymine-oligomers^4,12, whereas 6–4 photoadduct formation is another process with a lower quantum yield^2,12. The existence of several products and of backreactions may lead to a biexponential decay. For one or several thymines being replaced by guanines the relative frequencies show a much slower initial decay and the biexponentiality is less pronounced. Oligomers without the possibility to form internal bipyrimidine lesions have slower decay and show nearly monoexponential curves, which is indicative of the formation of one dominant photoproduct. The behavior found for GTGT, GGGT, and GGGG is notable: For GTGT the decay of the relative frequencies with dose nearly vanishes, while for GGGT a weak initial decay has a decay constant μ_GGGT = 0.0017 base/photon. Interestingly, the poly-G oligomer with μ_GGGG = 0.00077 base/photon (value determined by the normalization procedure, see SI) lies in between.

Different combinations of the pyrimidine bases thymine and cytosine are included in the oligomers shown in Fig. 4b. The poly-C data reveal a weak bi-exponentiality where the initial decay has only a small amplitude. The dominant part of the damage occurs at higher doses. In the mixed oligomers the most efficient damages occur when at least one TT-dimer is present. However, combinations of C and T without TT or CC dimers also lead to considerably fast lesion formation. For a comparison of oligomers showing more or less pronounced bi-exponentialities we consider the dose values D_50% where 50% of the oligomers are damaged. These dose values decrease from D_50% = ca. 190 (CCCC), via 110 (CCCT), 64 (CTCT), 21 (CCTT), 13(CTTT) to 9.6 photon/base (TTTT).

In Fig. 4c we compare tetramers with a central TT sequence. When TT is flanked by C, pyrimidine dimer lesions become possible between TT, CT and TC and lead to a rapid decay of the relative frequencies. Only ca. 21 photons/base are required to damage 50% of the CTTC tetramers. When purine bases are flanking the central TT dimer, there is a considerable reduction of the damage efficiency. For ATTA D_50% becomes 53 photon/base. With flanking G the damage resistivity improves even more. For GTTG, we observe a much higher damage resistivity D_50% = 200 photon/base. The oligomers ATTG and GTTA have similar relative frequencies with D_50% = 87and 92 photon/base, respectively.

Figures 3 and 4 contain the information on the dose dependence for a limited number of tetramers. An overview of all tetramers is given in the Supporting Information in Table SI-t17, where the total damaging coefficients μ_olig and the related quantum yields are given together with the D_50% values for all 256 sequences of the tetramer pool. Discrepancies between D_50% and D_50%,mono = ln2/μ_olig occur for sequences with strong deviations from the simple mono-exponential decay.

One remark should be added regarding the precision of the deduced damage coefficients, which amounts to ca. 25% relative error. One source for these uncertainties is the statistical error of the relative frequencies of the fit procedure. In most cases the uncertainty is dominated by the limited precision in the determination of the molecular concentrations and thus of the number of absorbed photons per base.

Molecular damage and quantum yield

The experimental results on the dose dependence of the oligomer frequencies, e. g. the initial decay constants μ_olig give important qualitative insight into the role of the sequence for the initial steps in UV-damage formation. Quantitative information on the underlying molecular processes may be obtained when combining the observations with a molecular model for damage formation. Here, we use a simple model of the initial step of damage formation, which is based on dimeric damage processes and nearest neighbor interactions (see scheme in Fig. 5, insert and SI-9 for details). We assume the same excitation of all bases in the strand, where the excitation of one base X in a sequence WXY is equally shared with the two neighboring bases W and Y leading to excited dimer states (WX)* and (XY)* which cause damages of the respective dimers with the dimeric quantum yields φ_WX and φ_XY.

From the tetramer data we obtained a set of quantum yields with values for the strongly damaging di-pyrimidine dimers in agreement with literature. However, for the guanine containing dimers GT and GC negative values for the quantum efficiencies, e.g. μ_GT = − 5 10^–3, resulted which require an extension of the molecular model by processes which reduce the formation of di-pyrimidine dimer lesions when G is adjacent. One explanation could be that the formation of charge transfer states G⁺T⁻ or G⁺C⁻ between the strong electron donor G and a pyrimidine deactivates the damage^{7,9,38,39,40,41,42,43}. In a very recent study the role of charge transfer quenching has been studied for tetramers⁴⁴.

From this complete model one obtains the dimeric quantum yields given in Fig. 5, filled circles. Our method solely characterizes the number of undamaged sequences after irradiation and can therefore not analyze the molecular structure of the formed photolesions. However, a comparison to the most abundant photolesions in the literature reveals good agreement with our data: The most efficient damages occur for the pyrimidine dimers. The largest damage quantum yield is found for the TT dimer with \({\Phi }_{TT}\) = 0.021 ± 0.003. For CT (or TC) a somewhat smaller value is obtained, \({\Phi }_{CT}\) = 0.011 ± 0.002. For CC the quantum yield is again smaller, \({\Phi }_{CC}\) = 0.006 ± 0.001 and shows a considerable uncertainty. The purine dimer AA has a quantum yield around \({\Phi }_{AA}\) = 0.0019 ± 0.0003. These values are within the ranges found in the literature (see Fig. 5 open symbols). For the other purine containing dimers quantum yields are in the range of < 0.5 · 10^–3. In other words, they are zero within the present precision of the experiment. The analysis of the frequencies from the dimer pool leads, within the given uncertainties, to the same quantum yields. Since the frequencies in the hexamer pool are much smaller than for dimers and tetramers, larger uncertainties arise and prevent a reliable bi-exponential fit, the prerequisite for a successful analysis.

Discussion

In this work we measured the dependence of oligomer abundance on UV-exposure for the complete sequence space of DNA hexamers. Our measurements reveal the sensitivity of the employed technique for dimeric DNA lesions (see Fig. 2). For complete hexamer, tetramer and dimer pools the sequence dependent abundance was determined and results are presented in Figs. 3, 4 and 5 and SI-t 8 and SI-t 9. There are oligomers with high UV-resistivity, where even dose values of > 2000 absorbed photons per hexamer (or 500 photon/base) scarcely harm the oligomers. These oligomers often contain a large fraction of guanines and single pyrimidines (see Table SI-t 9) but no TT dimers. For oligomers with longer pyrimidine parts, especially poly-T parts, even small doses in the range of 10 photon/base are sufficient to damage most of the oligomers. The role of pyrimidine dimers for the DNA-damage becomes evident when the data are analyzed using the simple molecular model presented above. It reveals that the most important damage quantum yield in the range of 2% occurs for TT dimers. The quantum yields for CT (TC) or CC-dimers are somewhat smaller with ca. 1% and 0.5%, respectively. Despite our method not analyzing the molecular structure of the lesions, these values agree with the literature quantum yields for the most frequent photolesions, proving that the presented technique with the discrimination step using the Swift Library Kit (see “Materials and methods”) is able to detect the CPD lesion. Most of the purine containing dimers have quantum yields below our detection limit of ≤ 10^–3. These small values are again in agreement with damage quantum yields reported in the literature (TA⁴⁵, GC³¹, AA³⁶). Only the AA-dimer shows a detectable damage quantum yield of ca. 0.2% in the range of the literature values. Thus, the used library preparation kit is also able to unravel AA damage sites. When we inspect the presence of A in the hexamers showing strong irradiation damage (see Table SI-t9, lower part) we often find poly-A parts. Apparently longer poly-A parts can be taken as an indicator for increased susceptibility towards UV-irradiation damage.

The DNA in our experiments was mainly single-stranded. By a special design of the tail sequences and by suitable temperature and salt conditions it would be possible to increase the probability for double strand formation and to record the corresponding UV-sensitivity. Further interesting extensions of the presented technique could involve the study of damage induced by other irradiation wavelengths, by photosensitization or to investigate the action of denaturing and reactive compounds.

The sensitivity of the technique depends on the readout possibilities of the sequencing procedure, because it is based on the frequency of surviving oligomers. This requires high count numbers for all sequences of the oligomer pool, restricting the maximal strand length. In future experiments, longer sequences could be addressed by strategies combining randomized parts with parts of defined sequences, to limit to total length of the randomized parts. Furthermore, by using restricted sequence pools and thus larger read counts per sequence, the quantum yields of the aforementioned secondary damage processes could also be quantified with high sensitivity.

The integrity of the genetic information of organisms living today is maintained by sophisticated enzymes. Our approach allows to examine damages of single stranded DNA without the need for additional enzymatic repair processes. The resulting information on irradiation-induced damage represents situations with (i) UV-exposure by high-intensity irradiation, where severe damage is established prior to the start of repair, (ii) for organisms with defective repair or (iii) for short DNA strands of early (molecular) life⁴⁶, before the establishment of efficient enzymatic repair mechanisms. Because of the major role of RNA in the origins of life, the extension of our technique to RNA opens promising perspectives. Investigation of UV irradiation-induced damage of RNA strands would require an adaptation of the discrimination Step 3 by using reverse transcriptases. This modification appears to be straight forward since suitable RNA library preparation kits exist for NGS. The potential for the recognition of RNA damages remains to be investigated.

Conclusion

We quantified the sequence dependent irradiation damage in large, randomized pools of DNA in a highly parallel fashion. Our approach combines different standard processing steps, such as synthesis of oligomers with randomized sequences, UV-irradiation, discrimination against damaged DNA strands and the identification of the intact oligomers by NGS with appropriate analysis. Using DNA-strands with centrally randomized octamers, we could characterize the sensitivity towards UV irradiation of the complete sequence pool up to hexamers which consists of 4096 different sequences. The evaluation of the damage response by means of a simple model with dimeric damage formation was used to obtain the damage quantum yields of bi-pyrimidine dimers in agreement with the literature. However, the comparison of the dimeric model with the experimental data also revealed a protecting action of the sequence context. For example, the rate to form TT lesions is strongly reduced by a nearby guanine. In this work, the proposed technique was demonstrated on the radiation-induced lesion formation of single stranded DNA. Extensions of the method while keeping the underlying principles are possible to study other DNA or RNA structures or to record the response of the nucleotide strands upon exposure to damaging reagents.

Materials and methods

DNA 16-mers with random parts of the sequence ACACNNNNNNNNACAC were synthesized by Biomers, (Germany), dissolved in PBS-buffer and diluted to a base concentration of ca. 0.7 mM (concentration of bases, checked by UV-absorption, Shimadzu UV1800). Oligonucleotides containing the CPD-damage T=T were purchased from IBA GmbH, Germany. A starting volume of 3.45 ml of the 16-mer sample was irradiated at 266 nm pulses from a Nd-based laser system (AOT-YVO-25QSP/MOPA from Advanced Optical Technology, UK) in a fused silica cuvette (path length 10 mm) under stirring. Monitoring of the laser power allowed to determine the power absorbed in the sample (for details of the determination of the absorbed dose see SI-7, relative error in the determination ca. 16%). At each dose level 50 µl sample was taken from the cuvette and stored at low temperatures for further handling. Subsequently the samples were prepared for sequencing (Step 3) according to the provided protocol of the Swift Accel-NGS 1S DNA Library Kit (Swift Bioscience, US). The sequencing (Step 4) was performed using a Hi-Seq sequencer (Illumina, US) and the data was then filtered using the framing sequence ACAC with a quality score larger then 20 and analyzed (Step 5) using the methods given in the Supplementary Information SI-1 to SI-14. The main parts of data analysis involve normalization with respect to the unirradiated sample and to the oligomer containing poly-G in the central part. Further analysis steps involve the numerical fitting of the data by bi-exponential functions and the computation of the dimeric quantum yields via the molecular model described in SI-9. Further details on the methods used can be found in SI-1.

Data availability

The data supporting the findings of this study are available within the paper and its Supplementary Information. The datasets from sequencing that were used in the current study are available in the NCBI Sequence Read Archive (SRA), accession number PRJNA929909 (https://www.ncbi.nlm.nih.gov/sra/PRJNA929909). Additional information and files are available from the corresponding author upon reasonable request.

Code availability

The operations discussed in the Supplementary Information and Materials and methods for evaluating and analyzing the data have been implemented using standard methods in Labview and Igor Pro and are available on request at any time.

References

Cadet, J. et al. Effects of UV and visible radiation on DNA-final base damage. Biol. Chem. 378, 1275–1286 (1997).
CAS PubMed Google Scholar
Johns, H. E., Pearson, M. L., LeBlanc, J. C. & Helleiner, C. W. The ultraviolet photochemistry of thymidylyl-(3′5′)-thymidine. J. Mol. Biol. 9, 503-IN1 (1964).
Article Google Scholar
Swenson, P. A. & Setlow, R. B. Kinetics of dimer formation and photohydration in ultraviolet-irradiated polyuridylic acID. Photochem. Photobiol. 2, 419–434 (1963).
Article CAS Google Scholar
Sztumpf, E. & Shugar, D. Photochemistry of model oligo- and polynucleotides VI. Photodimerization and its reversal in thymine dinucleotide analogues. Biochim. Biophys. Acta (BBA)—Spec. Sect. Nucleic Acids Relat. Subj. 61, 555–566 (1962).
CAS Google Scholar
Liu, L., Pilles, B. M., Gontcharov, J., Bucher, D. B. & Zinth, W. Quantum yield of cyclobutane pyrimidine dimer formation via the triplet channel determined by photosensitization. J. Phys. Chem. B 120, 292–298 (2016).
Article CAS PubMed Google Scholar
Schreier, W. J. et al. Thymine dimerization in DNA is an ultrafast photoreaction. Science 315, 625–629 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Banyasz, A. et al. Electronic excited states responsible for dimer formation upon UV absorption directly by thymine strands: Joint experimental and theoretical study. J. Am. Chem. Soc. 134, 14834–14845 (2012).
Article CAS PubMed Google Scholar
Middleton, C. T. et al. DNA excited-state dynamics: From single bases to the double helix. Annu. Rev. Phys. Chem. 60, 217–239 (2009).
Article ADS CAS PubMed Google Scholar
Schreier, W. J., Gilch, P. & Zinth, W. Early events of DNA photodamage. Annu. Rev. Phys. Chem. 66, 497–519 (2015).
Article ADS CAS PubMed Google Scholar
Giussani, A. & Worth, G. A. On the intrinsically low quantum yields of pyrimidine DNA photodamages: Evaluating the reactivity of the corresponding minimum energy crossing points. J. Phys. Chem. Lett. 11, 4984–4989 (2020).
Article CAS PubMed Google Scholar
Schreier, W. J. et al. Thymine dimerization in DNA model systems: Cyclobutane photolesion is predominantly formed via the singlet channel. J. Am. Chem. Soc. 131, 5038–5039 (2009).
Article CAS PubMed Google Scholar
Law, Y. K., Azadi, J., Crespo-Hernández, C. E., Olmon, E. & Kohler, B. Predicting thymine dimerization yields from molecular dynamics simulations. Biophys. J. 94, 3590–3600 (2008).
Article ADS CAS PubMed PubMed Central Google Scholar
Premi, S. et al. Genomic sites hypersensitive to ultraviolet radiation. Proc. Natl. Acad. Sci. U. S. A. 116, 24196–24205 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Bryan, D. S., Ransom, M., Adane, B., York, K. & Hesselberth, J. R. High resolution mapping of modified DNA nucleobases using excision repair enzymes. Genome Res. 24, 1534–1542 (2014).
Article CAS PubMed PubMed Central Google Scholar
Mao, P. & Wyrick, J. J. Genome-wide mapping of UV-induced DNA damage with CPD-seq. In The Nucleus Vol. 2175 (ed. Hancock, R.) 79–94 (Springer US, 2020).
Chapter Google Scholar
Hu, J., Adebali, O., Adar, S. & Sancar, A. Dynamic maps of UV damage formation and repair for the human genome. Proc. Natl. Acad. Sci. U. S. A. 114, 6758–6763 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Li, W. et al. Human genome-wide repair map of DNA damage caused by the cigarette smoke carcinogen benzoapyrene. Proc. Natl. Acad. Sci. U. S. A. 114, 6752–6757 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Mingard, C., Wu, J., McKeague, M. & Sturla, S. J. Next-generation DNA damage sequencing. Chem. Soc. Rev. 49, 7354–7377 (2020).
Article CAS PubMed Google Scholar
Choi, J.-H. et al. Highly specific and sensitive method for measuring nucleotide excision repair kinetics of ultraviolet photoproducts in human cells. Nucleic Acids Res. 42, e29 (2014).
Article CAS PubMed Google Scholar
Hu, J., Adar, S., Selby, C. P., Lieb, J. D. & Sancar, A. Genome-wide analysis of human global and transcription-coupled excision repair of UV damage at single-nucleotide resolution. Genes Dev. 29, 948–960 (2015).
Article CAS PubMed PubMed Central Google Scholar
Hu, J. et al. Genome-wide mapping of nucleotide excision repair with XR-seq. Nat. Protoc. 14, 248–282 (2019).
Article CAS PubMed PubMed Central Google Scholar
Li, W., Adebali, O., Yang, Y., Selby, C. P. & Sancar, A. Single-nucleotide resolution dynamic repair maps of UV damage in Saccharomyces cerevisiae genome. Proc. Natl. Acad. Sci. U. S. A. 115, E3408–E3415 (2018).
ADS CAS PubMed PubMed Central Google Scholar
Li, W. & Sancar, A. Methodologies for detecting environmentally induced DNA damage and repair. Environ. Mol. Mutagen. 61, 664–679 (2020).
Article CAS PubMed PubMed Central Google Scholar
Shen, Y., Pressman, A., Janzen, E. & Chen, I. A. Kinetic sequencing (k -Seq) as a massively parallel assay for ribozyme kinetics: Utility and critical parameters. Nucleic Acids Res. 49, e67–e67 (2021).
Article CAS PubMed PubMed Central Google Scholar
Lemaire, D. G. & Ruzsicska, B. P. Quantum yields and secondary photoreactions of the photoproducts of dTpdT, dTpdC and dTpdU. Photochem. Photobiol. 57, 755–769 (1993).
Article CAS PubMed Google Scholar
Park, H. et al. Crystal structure of a DNA decamer containing a cis-syn thymine dimer. Proc. Natl. Acad. Sci. U. S. A. 99, 15965–15970 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
Taylor, J. S., Garrett, D. S., Brockie, I. R., Svoboda, D. L. & Telser, J. 1H NMR assignment and melting temperature study of cis-syn and trans-syn thymine dimer containing duplexes of d(CGTATTATGC).d(GCATAATACG). Biochemistry 29, 8858–8866 (1990).
Article CAS PubMed Google Scholar
Selby, C. P., Drapkin, R., Reinberg, D. & Sancar, A. RNA polymerase II stalled at a thymine dimer: Footprint and effect on excision repair. Nucleic Acids Res. 25, 787–793 (1997).
Article CAS PubMed PubMed Central Google Scholar
Mei Kwei, J. S. et al. Blockage of RNA polymerase II at a cyclobutane pyrimidine dimer and 6–4 photoproduct. Biochem. Biophys. Res. Commun. 320, 1133–1138 (2004).
Article PubMed Google Scholar
Khan, M. I. et al. DNA polymerase β of Leishmania donovani is important for infectivity and it protects the parasite against oxidative damage. Int. J. Biol. Macromol. 124, 291–303 (2019).
Article CAS PubMed Google Scholar
Münzel, M., Szeibert, C., Glas, A. F., Globisch, D. & Carell, T. Discovery and synthesis of new UV-induced intrastrand C(4–8)G and G(8–4)C photolesions. J. Am. Chem. Soc. 133, 5186–5189 (2011).
Article PubMed Google Scholar
Zhao, X. The structure of d(TpA), the major photoproduct of thymidylyl-(3’5’)- deoxyadenosine. Nucleic Acids Res. 24, 1554–1560 (1996).
Article CAS PubMed PubMed Central Google Scholar
Kumar, S. et al. Adenine photodimerization in deoxyadenylate sequences: Elucidation of the mechanism through structural studies of a major d(ApA) photoproduct. Nucleic Acids Res. 19, 2841–2847 (1991).
Article CAS PubMed PubMed Central Google Scholar
Görner, H. New trends in photobiology. J. Photochem. Photobiol. B 26, 117–139 (1994).
Article PubMed Google Scholar
Bucher, D. B., Kufner, C. L., Schlueter, A., Carell, T. & Zinth, W. UV-induced charge transfer states in DNA promote sequence selective self-repair. J. Am. Chem. Soc. 138, 186–190 (2016).
Article CAS PubMed Google Scholar
Pörschke, D. A specific photoreaction in polydeoxyadenylic acid. Proc. Natl. Acad. Sci. U. S. A. 70, 2683–2686 (1973).
Article ADS PubMed PubMed Central Google Scholar
Marguet, S. & Markovitsi, D. Time-resolved study of thymine dimer formation. J. Am. Chem. Soc. 127, 5780–5781 (2005).
Article CAS PubMed Google Scholar
Zhang, Y. et al. Efficient UV-induced charge separation and recombination in an 8-oxoguanine-containing dinucleotide. Proc. Natl. Acad. Sci. U. S. A. 111, 11612–11617 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Kufner, C. L., Zinth, W. & Bucher, D. B. UV-induced charge-transfer states in short guanosine-containing DNA oligonucleotides. ChemBioChem 21, 2306–2310 (2020).
Article CAS PubMed PubMed Central Google Scholar
Pilles, B. M. et al. Identification of charge separated states in thymine single strands. Chem. Commun. 50, 15623–15626 (2014).
Article CAS Google Scholar
Bucher, D. B., Pilles, B. M., Carell, T. & Zinth, W. Charge separation and charge delocalization identified in long-living states of photoexcited DNA. Proc. Natl. Acad. Sci. U. S. A. 111, 4369–4374 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Quinn, S. et al. Ultrafast IR spectroscopy of the short-lived transients formed by UV excitation of cytosine derivatives. Chem. Commun. 21, 2130 (2007).
Article Google Scholar
Doorley, G. W. et al. Tracking DNA excited states by picosecond-time-resolved infrared spectroscopy: Signature band for a charge-transfer excited state in stacked adenine-thymine systems. J. Phys. Chem. Lett. 4, 2739–2744 (2013).
Article CAS Google Scholar
Lu, C., Gutierrez-Bayona, N. E. & Taylor, J. S. The effect of flanking bases on direct and triplet sensitized cyclobutane pyrimidine dimer formation in DNA depends on the dipyrimidine, wavelength and the photosensitizer. Nucleic Acids Res. 49, 4266–4280 (2021).
Article CAS PubMed PubMed Central Google Scholar
Bose, S. N. & Davies, R. J. The photoreactivity of T-A sequences in oligodeoxyribonucleotides and DNA. Nucleic Acids Res. 12, 7903–7914 (1984).
Article CAS PubMed PubMed Central Google Scholar
Beckstead, A. A., Zhang, Y., de Vries, M. S. & Kohler, B. Life in the light: Nucleic acid photoproperties as a legacy of chemical evolution. Phys. Chem. Chem. Phys. 18, 24228–24238 (2016).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) through the Clusters of Excellence “Center of Integrated Protein Science Munich” (W.Z.) and Project-ID 364653263—TRR 235 (CRC235), Project P08 (C.B.M). Funding from the Simons Foundation (327125 to D.B.), Volkswagen Initiative ‘Life?—A Fresh Scientific Approach to the Basic Principles of Life’ (C.B.M., D.B.), ERC ADV 2018 Grant 834225 (EAVESDROP) (D.B.) and from ERC-2017-ADG from the European Research Council (D.B.) is gratefully acknowledged. Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany´s Excellence Strategy—EXC-2094—390783311 (D.B., C.B.M). D.B.B. acknowledges funding by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement No 948049). This work was supported by a grant from the Simons Foundation (SCOL 290360FY18 to Dimitar Sasselov). The work is supported by the Center for Nanoscience Munich (CeNS).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Harvard-Smithsonian Center for Astrophysics, Department of Astronomy, Harvard University, 60 Garden Street, Cambridge, MA, 02138, USA
Corinna L. Kufner
Laboratory for Functional Genome Analysis, Gene Center, Ludwig Maximilians University Munich, Feodor-Lynen-Straße 25, 81377, Munich, Germany
Stefan Krebs, Marlis Fischaleck, Julia Philippou-Massier & Helmut Blum
Chemistry Department, TUM School of Natural Sciences, Technical University of Munich, Lichtenbergstr. 4, 85748, Garching, Germany
Dominik B. Bucher
Systems Biophysics, Ludwig Maximilians University Munich, Amalienstr. 54, 80799, Munich, Germany
Dieter Braun & Christof B. Mast
Biomolecular Optics and Center for Integrated Protein Science, Ludwig Maximilians University Munich, Oettingenstrasse 67, 80538, Munich, Germany
Wolfgang Zinth

Authors

Corinna L. Kufner
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Krebs
View author publications
You can also search for this author in PubMed Google Scholar
Marlis Fischaleck
View author publications
You can also search for this author in PubMed Google Scholar
Julia Philippou-Massier
View author publications
You can also search for this author in PubMed Google Scholar
Helmut Blum
View author publications
You can also search for this author in PubMed Google Scholar
Dominik B. Bucher
View author publications
You can also search for this author in PubMed Google Scholar
Dieter Braun
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Zinth
View author publications
You can also search for this author in PubMed Google Scholar
Christof B. Mast
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.L.K., S.K., M.F., J.P.M., W.Z., C.B.M. performed the experiments. D.B., C.L.K., D.B.B., H.B., W.Z., C.B.M. conceived and designed the experiments. W.Z., C.B.M. analyzed the data. C.L.K., D.B.B., D.B., W.Z., C.B.M. wrote the paper. All authors discussed the results and commented on the manuscript.

Corresponding author

Correspondence to Christof B. Mast.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kufner, C.L., Krebs, S., Fischaleck, M. et al. Sequence dependent UV damage of complete pools of oligonucleotides. Sci Rep 13, 2638 (2023). https://doi.org/10.1038/s41598-023-29833-0

Download citation

Received: 21 October 2022
Accepted: 10 February 2023
Published: 14 February 2023
DOI: https://doi.org/10.1038/s41598-023-29833-0

This article is cited by

DNA damage and repair in the nucleosome: insights from computational methods
- Natacha Gillet
- Elise Dumont
- Emmanuelle Bignon
Biophysical Reviews (2024)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.