Uracil-DNA glycosylase efficiency is modulated by substrate rigidity

Orndorff, Paul B.; Poddar, Souvik; Owens, Aerial M.; Kumari, Nikita; Ugaz, Bryan T.; Amin, Samrat; Van Horn, Wade D.; van der Vaart, Arjan; Levitus, Marcia

doi:10.1038/s41598-023-30620-0


Article
Open access
Published: 08 March 2023


Paul B. Orndorff¹^na1,
Souvik Poddar^2,3^na1,
Aerial M. Owens^2,4^na1,
Nikita Kumari^2,3,
Bryan T. Ugaz^2,3,
Samrat Amin⁵,
Wade D. Van Horn^2,4,
Arjan van der Vaart¹ &
…
Marcia Levitus^2,3

Scientific Reports volume 13, Article number: 3915 (2023)


Abstract

Uracil DNA-glycosylase (UNG) is a DNA repair enzyme that removes the highly mutagenic uracil lesion from DNA using a base flipping mechanism. Although this enzyme has evolved to remove uracil from diverse sequence contexts, UNG excision efficiency depends on DNA sequence. To provide the molecular basis for rationalizing UNG substrate preferences, we used time-resolved fluorescence spectroscopy, NMR imino proton exchange measurements, and molecular dynamics simulations to measure UNG specificity constants (k_cat/K_M) and DNA flexibilities for DNA substrates containing central AUT, TUA, AUA, and TUT motifs. Our study shows that UNG efficiency is dictated by the intrinsic deformability around the lesion, establishes a direct relationship between substrate flexibility modes and UNG efficiency, and shows that bases immediately adjacent to the uracil are allosterically coupled and have the greatest impact on substrate flexibility and UNG activity. The finding that substrate flexibility controls UNG efficiency is likely significant for other repair enzymes and has major implications for the understanding of mutation hotspot genesis, molecular evolution, and base editing.


Introduction

Cellular repair pathways maintain genetic integrity against thousands of daily DNA lesions. The repair of small, non-helix-distorting lesions is initiated by specialized DNA glycosylases that catalyze the excision of the damaged base¹. DNA sequence effects have been identified for many glycosylases and alkyl-transferases that repair base mismatches, uracil bases, alkylated bases, and other lesions^2,3,4,5,6,7. However, the molecular features that give rise to preferences for particular DNA sequences have not yet been resolved. Here, we focus on understanding how DNA sequence impacts the repair of uracil by uracil-DNA glycosylase (UNG). Uracil is a highly mutagenic and common lesion in DNA that arises from dUTP misincorporation or from spontaneous cytosine deamination⁸. Unrepaired cytosine deamination results in U:G mismatches that give rise to G:C to A:T transition mutations⁹.

The crystal structure of the human UNG-DNA complex shows the uracil extruded from the duplex into the enzyme active site¹⁰. The first step in uracil recognition by UNG relies on trapping spontaneous extrahelical excursions of uracil^11,12,13, which occur at enhanced rates compared to thymine in the same sequence context¹⁴. Consistent with this observation, a thermodynamic study of the effect of thymine to uracil substitutions found reduced base stacking in U:A base pairs compared to T:A¹⁵. While these studies provide a molecular framework for understanding how UNG distinguishes between uracil and thymine, the molecular principles that determine UNG substrate preferences are still poorly understood. In E. coli, the efficiency of UNG varies more than 15-fold depending on the DNA context^4,5. A vicinal thymine 3′ of uracil generally results in poor excision, and substrates with high local GC content are generally poor substrates^16,17. Yet, the underlying principles that dictate UNG substrate preferences still remain elusive.

Preferential excision of uracil does not correlate with DNA melting temperatures; for instance, substrates containing uracil in TUA contexts are better UNG substrates than sequences with AUT contexts^4,5. The differences between TUA and AUT sequences are remarkable and are reminiscent of the asymmetries observed in the mechanical properties of undamaged DNA. TA steps are particularly flexible with regard to roll, slide, and twist, while AT steps are more rigid^18,19. DNA flexibility is indeed suspected to dictate the binding preferences of various non-specific DNA enzymes and proteins. For instance, DNA flexibility explains sequence preferences in phosphodiester backbone cleavage by endonuclease DNase 1²⁰, and the role of DNA sequence in nucleosome stability and dynamics²¹. In this work, we test the hypothesis that UNG activity is dictated by the intrinsic local deformability of the DNA sequence around the uracil. Structural studies provide a rationale for this hypothesis; the formation of the catalytically active UNG-DNA complex requires DNA hydrogen bond breakage and loss of stabilizing base-stacking interactions^10,22, and because these interactions are determined by the mechanical properties of DNA, the intrinsic deformability of the region surrounding the lesion is likely an important variable in understanding catalytic efficiency. A previous study that compared substrates containing uracil in AUA and GUG contexts hinted at a connection between substrate flexibility and UNG activity²³, but the small scale of the study (2 substrates) was not sufficient to establish a correlation between the two variables, nor to determine the molecular basis for substrate preference.

To illuminate the link between UNG repair efficiency and substrate flexibility, we determined UNG specificity constants (k_cat/K_M) for a variety of designed substrates and correlated the outcomes with the results of biophysical experiments and MD simulations that probe DNA substrate flexibility. Our results not only establish a clear link between the fundamental nature of UNG substrate flexibility, the contributions of distinct substrate dynamics, and the hierarchy of these motions in regulating the resulting UNG catalytic efficiency, but also identify and quantify allosteric coupling of bases flanking the uracil.

Materials and methods

Enzymes, oligonucleotides, and reagents

Uracil-DNA Glycosylase (UNG, MW = 25.7 kDa) was purchased from New England BioLabs (Catalog # M0280L) at a concentration of 5000 U/mL. Free 2-aminopurine riboside was purchased from AstaTech (PA, USA). Oligonucleotides containing canonical bases, uracil, or 2-aminopurine (2AP) were purchased from IDT (IA, USA) as desalted oligonucleotides. All DNA sequences are listed in Table 1 and are represented by a number and two letters. The number refers to the different DNA series (1–4) and the letters refer to the bases adjacent to the uracil. For instance, the four substrates in series 1 (1TA, 1TT, 1AA and 1AT) share the same overall sequence but differ in the bases adjacent to the uracil. Oligonucleotides for fluorescence experiments (kinetic assays, time-resolved fluorescence, and fluorescence quantum yields) were 39 nt in length (Table 1). Oligonucleotides for NMR experiments were shortened to the central 13 nucleotides to facilitate resonance assignments while still ensuring duplex formation. All oligonucleotides were solubilized in 1 × PBS buffer (20 mM sodium phosphate, 100 mM NaCl, 50 µM EDTA, pH 7.0). Concentrations were determined from measured absorbances at 260 nm using extinction coefficients provided by IDT.

Table 1 DNA sequences.


Sample preparation

Duplex DNA substrates used for the fluorescence-based experiments were prepared by annealing uracil-containing oligonucleotides with complementary strands containing 2AP opposite to the uracil. For the kinetic assays, a small excess of the 2AP-strand is preferable to an excess of the uracil-containing strand because the latter is also a substrate of UNG, and therefore its presence may affect the measured kinetic rates. For the kinetic assays, DNA substrates were prepared by annealing the strands at room temperature while monitoring the fluorescence intensity of 2AP in real-time. The uracil-containing strand was added to a known concentration of 2AP-containing strand, and the reduction in fluorescence intensity due to the formation of the duplex was measured until a small addition of the uracil strand did not result in a further decrease. The concentration of the resulting dsDNA substrate was calculated from the absorbance of the initial 2AP-containing strand and the volumes before and after adding the uracil strand. Traditional native polyacrylamide gel electrophoresis was used to confirm that the annealing procedure at room temperature was highly efficient and did not result in any measurable 2AP- or uracil-containing single strands. For the time-resolved and fluorescence quantum yield experiments, a slight excess of the U-strand is preferable to an excess of the 2AP-strand because 2AP in ssDNA is significantly brighter than 2AP in a duplex. Samples for these experiments were prepared as before and followed by the addition of a ~ 20% excess of the uracil-containing strand. In each case, we verified that further addition of the uracil-containing strand did not change the measured lifetimes or quantum yields. Duplexes for NMR experiments were prepared from complementary oligonucleotides mixed at 1:1 molar ratios that were heated to ~ 80 °C and annealed by cooling to room temperature for ~ 2 h.

UNG kinetic assay and data analysis

A continuous fluorescence kinetic assay was used to measure UNG activity on DNA substrates containing 2AP opposite to the uracil²⁴. 2AP is highly fluorescent when exposed to water but is highly quenched in dsDNA. The cleavage of the dU glycosidic bond by UNG results in an aldehydic abasic site opposite to 2AP, and this environmental change leads to a large fluorescence increase. The increase in fluorescence intensity can be used to calculate the reaction rate in a continuous kinetic assay. The kinetic parameters measured in this way are indistinguishable from the values obtained using a traditional radioactivity-based electrophoresis assay with non-fluorescent substrates²⁴. In addition, the perturbation introduced by the 2AP probe has been shown to have a negligible effect on both the measured dissociation and association kinetic constants of the UNG-DNA complex²⁵. 600 μL of duplex DNA was placed in a quartz micro cuvette (optical path length 10 × 2 mm) and fluorescence intensity (λ_ex = 310 nm, λ_em = 370 nm) was monitored continuously before and after addition of UNG. Fluorescence intensities were corrected in real-time for potential fluctuations in the incident intensity over the long measurement times. A small amount of stock enzyme (5,000 U/mL) was diluted 40-fold in 1 × PBS buffer containing 1 mg/mL BSA, and this dilution was stored at 4 °C for no longer than six hours and used for all kinetic experiments performed the same day. For the kinetic experiments, 2 μL of this dilution were added to the cuvette containing the DNA substrate for a final concentration of UNG in the assay mixture of 0.16 nM. The initial velocity (V₀, units of M·s^-1) was obtained from the initial slope of the measured intensity (F(t)) as ${V}_{0}=\frac{{[S]}_{0}}{\Delta F}{\left(\frac{dF}{dt}\right)}_{0}$, where [S]₀ is the concentration of DNA substrate (0.075–6 μM) and ΔF is the total change in fluorescence. A representative sample run and a sample calculation are shown in Fig. S1. 
Error bars in Figs. 1 and S7 represent 95% confidence intervals. To rule out systematic sources of error, for 1AT, 23 trials were performed at 0.3 μM concentration involving (1) at least four different purchased UNG stock solutions, (2) three different experimentalists, and (3) DNA substrates prepared from oligos purchased at different times. The V₀ versus [DNA] curves were fitted using the Michaelis–Menten equation in Origin Pro (Northampton, MA) using the Levenberg–Marquardt iteration algorithm and using the reciprocal of the variances as weights.
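
The initial-velocity expression above can be illustrated with a minimal Python sketch. The trace, substrate concentration, and ΔF below are hypothetical values chosen for illustration, not data from this study, and the function names `initial_velocity` and `michaelis_menten` are our own. The slope is estimated here with an unweighted least-squares line, which is an assumption about how the initial slope might be extracted.

```python
def initial_velocity(times_s, fluorescence, s0_molar, delta_f):
    """Return V0 in M/s from the least-squares slope of the early F(t) trace."""
    n = len(times_s)
    mean_t = sum(times_s) / n
    mean_f = sum(fluorescence) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    sxy = sum((t - mean_t) * (f - mean_f) for t, f in zip(times_s, fluorescence))
    slope = sxy / sxx  # (dF/dt)_0, fluorescence units per second
    return s0_molar / delta_f * slope


def michaelis_menten(s_molar, v_max, k_m):
    """Michaelis-Menten rate law used to model V0 versus [DNA]."""
    return v_max * s_molar / (k_m + s_molar)


# Hypothetical early-time trace (not data from the paper).
times = [0.0, 10.0, 20.0, 30.0, 40.0]         # s
signal = [100.0, 102.0, 104.0, 106.0, 108.0]  # arbitrary fluorescence units
v0 = initial_velocity(times, signal, s0_molar=0.3e-6, delta_f=400.0)
print(f"V0 = {v0:.2e} M/s")
```

In the low-substrate limit ([S] much smaller than K_M) the rate law reduces to V₀ ≈ (V_max/K_M)[S], which is why the initial slope of the V₀ versus [DNA] curve carries the specificity constant.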

Time-resolved fluorescence

Time-resolved fluorescence intensity measurements were performed using the time-correlated single-photon counting (TCSPC) technique. A mode-locked Ti:Sapphire laser (Mira 900, Coherent) pumped by a frequency-doubled Nd:YVO4 laser (44% from an 18 W Verdi, Coherent) was used as the excitation source. The 130 fs light pulses (at 800 nm with a repetition rate of 250 kHz) were generated by a regeneratively amplified Ti:S laser system (RegA 9000, Coherent Laser). The pulses were sent to an optical parametric amplifier (OPA) to generate the excitation light at 620 nm and then frequency-doubled to obtain excitation pulses at 310 nm. Fluorescence emission was collected at a 90° geometry setting and detected using a double-grating monochromator (Oriel Instruments) and a microchannel plate photomultiplier tube (Hamamatsu R3809U-51). Decays were measured at three emission wavelengths (380, 390, and 400 nm) for global analysis as described below. The polarization of the emission was 54.7° relative to that of the excitation (magic angle). A single-photon counting card (Becker & Hickl, SPC-830) using two time windows (3.3 ns and 25 ns) was used for data acquisition. Instrumental response functions (IRF) were determined for both time resolutions. The typical IRF had a FWHM of 40 ps, measured from the scattering of a Ludox sample at 310 nm. The three decays obtained at different emission wavelengths for each sample were fitted globally keeping the lifetimes as common parameters among the three data sets. This approach minimizes the problem of correlation between pre-exponential factors and lifetimes, which is common when fitting multi-exponential decays²⁶. The fitting parameters were obtained through iterative reconvolution of the model function $F\left(\lambda ,t\right)={F}_{0}\left(\lambda \right)\sum_{i=1}^{n}{\alpha }_{i}\left(\lambda \right){e}^{-t/{\tau }_{i}}$ with the measured IRF using an in-house software package (ASUFIT).
Here, λ represents the emission wavelength, and $\sum_{i=1}^{n}{\alpha }_{i}\left(\lambda \right)=1.$ Lifetimes as long as ~ 8 ns and as short as ~ 30 ps are expected²⁶, and in our experience, results are more robust and reproducible if the shorter components are determined from data measured with the highest time resolution achievable with the single-photon counting card (814 fs/channel, 2¹² channels = 3.3 ns total), while the longer lifetimes are obtained from data measured with a wider time window. The decays measured using 3.3 ns acquisition windows were fitted using three exponential terms. A fourth term did not improve the quality of the fit. A new fit was then conducted using the decays measured with a 25 ns acquisition window (6.1 ps/channel) using four exponential terms but fixing the two shortest lifetimes (τ₁ and τ₂) to the values obtained in the previous fit. In this way, the fitting parameters in the second fit were τ₃, τ₄ and ${\alpha }_{1-4}\left(\lambda \right).$ This procedure allows a more accurate determination of τ₁ and τ₂ (both below 0.5 ns) from measurements using 814 fs/channel resolution, and τ₃ and τ₄ (both over 1 ns) from measurements using 6.1 ps/channel resolution. Mean lifetimes were calculated for each sample as $\langle \tau \rangle = \sum_{i=1}^{4}{\alpha }_{i}{\tau }_{i}$ using the ${\alpha }_{i}$ values obtained in the second fit.
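
As a rough illustration of the multi-exponential model and the amplitude-weighted mean lifetime, the following Python sketch evaluates both for a hypothetical four-component decay. The amplitudes and lifetimes are invented for the example and are not fitted values from this work; the sketch also omits the convolution with the IRF that the actual analysis requires. The last lines simply check the channel arithmetic quoted in the text.

```python
import math


def decay(t_ns, alphas, taus_ns, f0=1.0):
    """Multi-exponential decay F(t) = F0 * sum_i alpha_i * exp(-t / tau_i)."""
    if abs(sum(alphas) - 1.0) > 1e-9:
        raise ValueError("amplitudes must be normalized to 1")
    return f0 * sum(a * math.exp(-t_ns / tau) for a, tau in zip(alphas, taus_ns))


def mean_lifetime(alphas, taus_ns):
    """Amplitude-weighted mean lifetime <tau> = sum_i alpha_i * tau_i."""
    return sum(a * tau for a, tau in zip(alphas, taus_ns))


# Hypothetical amplitudes and lifetimes (ns), for illustration only.
alphas = [0.50, 0.25, 0.15, 0.10]
taus = [0.05, 0.40, 2.00, 8.00]
tau_mean = mean_lifetime(alphas, taus)
print(f"<tau> = {tau_mean:.3f} ns")

# Channel arithmetic quoted in the text: 2**12 channels per window.
short_window_ns = 814e-6 * 2**12       # 814 fs/channel
long_channel_ps = 25.0e3 / 2**12       # 25 ns window
print(f"{short_window_ns:.2f} ns, {long_channel_ps:.1f} ps/channel")
```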

Fluorescence quantum yields

Fluorescence quantum yields (ϕ) were determined relative to a reference as ${\phi }_{S}={\phi }_{R}\frac{{I}_{S}}{{I}_{R}}\times \frac{1-{10}^{-{A}_{R}}}{1-{10}^{-{A}_{S}}}$, where the subscripts “S” and “R” refer to the sample and reference, respectively, I is the integrated emission intensity measured over the entire fluorescence band, and A is the absorbance at the excitation wavelength (315 nm)²⁷. Absorbances were kept below 0.05 to avoid inner filter effects. Experiments replacing the 2AP-containing strand with an adenine-containing strand were used to verify that the absorbance of the canonical bases at 315 nm was negligible compared to the absorbance measured with the 2AP-containing DNA. Steady-state emission fluorescence spectra were acquired on a PTI Quantamaster 4/2005SE spectrofluorimeter. Fluorescence spectra showed clear contributions from Raman scattering at 352 nm, and to account for these contributions a buffer sample was used as a blank and subtracted from the measurements containing 2AP-containing DNA. The free 2AP riboside is commonly used as a reference (${\phi }_{R}=0.68$ in water)²⁸, but fluorescence intensities in duplex DNA are reduced 100-fold or more due to quenching, and therefore the determination of ϕ in DNA involves measuring very small fluorescence intensities. To improve accuracy, we performed five independent determinations of ϕ for the sample with the highest quantum yield (1AT) using free 2AP riboside in water as a reference. The average of five determinations was ϕ_1AT = 0.0103 (standard deviation = 6.5 × 10^–4, 95% confidence interval = 8.1 × 10^–4), and this value was subsequently used as a reference for the ϕ determinations of all other 2AP-DNA samples. Values listed in Table S1 are averages of 4 independent determinations. All standard deviations are 3% or lower.
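
The relative quantum yield expression can be sketched in a few lines of Python. The intensities and absorbances passed to `relative_quantum_yield` are hypothetical. The second part reproduces the quoted 95% confidence interval from the quoted standard deviation under the assumption that the interval is a two-sided Student-t half-width for n = 5 (t ≈ 2.776 for 4 degrees of freedom); the text does not state how the interval was computed.

```python
import math


def relative_quantum_yield(phi_ref, i_sample, i_ref, a_sample, a_ref):
    """phi_S = phi_R * (I_S / I_R) * (1 - 10**-A_R) / (1 - 10**-A_S)."""
    absorbed_ref = 1.0 - 10.0 ** (-a_ref)
    absorbed_sample = 1.0 - 10.0 ** (-a_sample)
    return phi_ref * (i_sample / i_ref) * absorbed_ref / absorbed_sample


# Hypothetical integrated intensities and absorbances (illustration only).
phi = relative_quantum_yield(phi_ref=0.68, i_sample=1.0, i_ref=100.0,
                             a_sample=0.04, a_ref=0.04)
print(f"phi_S = {phi:.4f}")

# Assumed Student-t half-width for the 1AT reference (n = 5, SD = 6.5e-4).
t_crit = 2.776  # two-sided 95% critical value, 4 degrees of freedom
ci_half_width = t_crit * 6.5e-4 / math.sqrt(5)
print(f"95% CI half-width = {ci_half_width:.1e}")
```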

Fractional population of highly stacked species (α₀)

The fractional population of dark (highly stacked) 2AP probes in the duplex DNA substrates was calculated from the sample mean lifetime ($\langle \tau \rangle$) and fluorescence quantum yield (ϕ) as ${\alpha }_{0}=1-\frac{{\tau }_{2AP}}{\langle \tau \rangle }\frac{\phi }{{\phi }_{2AP}}$, with ${\phi }_{2AP}=0.68$ and ${\tau }_{2AP}=10.2$ ns^28,29. Errors were estimated as ${\Delta \alpha }_{0}={\left({\left(\frac{{\tau }_{2AP}}{{\phi }_{2AP}}\frac{1}{\langle \tau \rangle }\Delta\phi \right)}^{2}+{\left(\frac{{\tau }_{2AP}}{{\phi }_{2AP}}\frac{\phi }{{\langle \tau \rangle }^{2}}\Delta \langle \tau \rangle \right)}^{2}\right)}^{1/2}$. Values of $\Delta\phi$ were taken as the standard deviation of the quantum yield measurements for each sample. $\Delta \langle \tau \rangle$ values were estimated from the two repeats performed for each sample (Table S2). Because performing large numbers of repeats for individual samples is unfeasible due to time and cost, we calculated the percent deviation of each $\langle \tau \rangle$ determination (two values for each of the six measured samples) from the respective average for the same sample. We determined that, on average, $\langle \tau \rangle$ values are measured with a 2.5% precision. This value was then used to estimate $\Delta \langle \tau \rangle$ for each sample.
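
A minimal Python sketch of the α₀ calculation and its propagated error follows. The constants ϕ_2AP = 0.68 and τ_2AP = 10.2 ns are those given in the text; the sample mean lifetime, quantum yield, and their uncertainties are hypothetical inputs chosen so the arithmetic is easy to follow.

```python
import math

PHI_2AP = 0.68   # quantum yield of free 2AP riboside (from the text)
TAU_2AP = 10.2   # lifetime of free 2AP riboside in ns (from the text)


def alpha0(mean_tau_ns, phi):
    """Fraction of dark, highly stacked 2AP: 1 - (tau_2AP/<tau>) * (phi/phi_2AP)."""
    return 1.0 - (TAU_2AP / mean_tau_ns) * (phi / PHI_2AP)


def alpha0_error(mean_tau_ns, phi, d_tau_ns, d_phi):
    """Propagated uncertainty, combining both partial-derivative terms in quadrature."""
    scale = TAU_2AP / PHI_2AP
    term_phi = scale * d_phi / mean_tau_ns
    term_tau = scale * phi * d_tau_ns / mean_tau_ns ** 2
    return math.hypot(term_phi, term_tau)


# Hypothetical sample: <tau> = 1.0 ns with 2.5% precision, phi = 0.010 +/- 0.0003.
a0 = alpha0(1.0, 0.010)
da0 = alpha0_error(1.0, 0.010, d_tau_ns=0.025, d_phi=0.0003)
print(f"alpha0 = {a0:.3f} +/- {da0:.3f}")
```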

NMR-detected imino proton exchange rate measurements

NMR samples were prepared in 3 mm tubes with DNA concentrations ranging from 2 to 4 mM with 5% v/v deuterium oxide. NMR experiments were performed on a Bruker 850 MHz Avance III HD spectrometer equipped with a 5 mm TCI CryoProbe, and a Bruker 600 MHz Avance III HD spectrometer equipped with a Prodigy probe. NMR spectra were processed and analyzed using Bruker TopSpin 4.1, MestReNova 14.2, and Matlab 2019b.

Resonance assignments

Two-dimensional ¹H, ¹H—Nuclear Overhauser Effect Spectroscopy (NOESY) experiments were recorded at 20 °C utilizing water suppression by excitation sculpting. The resulting 2D spectra were used to assign imino protons for each duplex using traditional methods (Figs. S2, S3)^30,31. All assignments and subsequent experiments were collected at 20 °C.

Water inversion efficiency factor (E)

The water inversion efficiency factor was measured as previously described³², with a relaxation delay of 30 s, and is further described in the supplementary information (Fig. S4). Data processing and fitting were completed in Bruker TopSpin 4.1 and MestReNova 14.2.

Longitudinal relaxation rate of water (R_1w)

The relaxation rate of water (R_1w) was measured utilizing a previously described saturation-recovery method that is compatible with high Q cryoprobes³². Variable time delays ranged from 1 ms to 18 s. Data processing and fitting were completed in Bruker TopSpin 4.1 and Matlab 2019b for verification (Fig. S5). Determination of R_1w was completed using the TopSpin T1/T2 Module.

Imino proton longitudinal relaxation (R_1n) and exchange rates (k_ex)

The sum of the longitudinal imino proton relaxation rates (R₁) and the imino proton solvent exchange rate (k_ex) can be used to determine the longitudinal relaxation rate of each imino proton (R_1n). A pseudo-two-dimensional experiment was implemented to determine the longitudinal relaxation of imino protons (R_1n) and the imino proton exchange rates (k_ex) following established methods³³ utilizing a 24-point variable delay sequence ranging from 1 ms to 15 s. The respective spectra were processed in TopSpin 4.1, and the data were fit with MATLAB R2019b (MathWorks) using nonlinear least-squares fit, first fitting for the longitudinal relaxation of imino protons (R_1n) followed by the exchange rate of imino protons (k_ex). Representative fits of R_1n and k_ex are shown in Fig. S6. The k_ex was determined by fitting the individual peak areas to the equation:

$$\frac{A\left( t \right)}{A_{0}} = 1 - \frac{E \cdot k_{ex}}{R_{1w} - R_{1n}} \left( e^{-R_{1n} \cdot t} - e^{-R_{1w} \cdot t} \right)$$

(1)

where A(t) is the area of the peak at exchange time point t, A₀ is the peak intensity at equilibrium, E (determined experimentally) is the water inversion efficiency factor, R_1w (determined experimentally) is the water longitudinal relaxation rate, and R_1n (determined experimentally) is the sum of imino proton longitudinal relaxation rates and its solvent exchange rate. The reported errors were estimated from the fitting of k_ex (Table S3).
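
For orientation, Eq. (1) can be evaluated numerically as in the Python sketch below. It reads the difference of exponentials as a factor multiplying E·k_ex/(R_1w − R_1n), which is the reading under which A(t)/A₀ equals 1 at t = 0 and recovers to 1 at long times. All parameter values are hypothetical and serve only to show the shape of the curve; they are not values measured in this study.

```python
import math


def peak_ratio(t_s, k_ex, e_factor, r1w, r1n):
    """A(t)/A0 = 1 - E*k_ex/(R1w - R1n) * (exp(-R1n*t) - exp(-R1w*t))."""
    prefactor = e_factor * k_ex / (r1w - r1n)
    return 1.0 - prefactor * (math.exp(-r1n * t_s) - math.exp(-r1w * t_s))


# Hypothetical parameters (rates in s^-1), for illustration only.
params = dict(k_ex=2.0, e_factor=2.0, r1w=0.3, r1n=5.0)
for delay in (0.0, 0.1, 0.5, 2.0, 15.0):
    print(f"t = {delay:5.1f} s  A/A0 = {peak_ratio(delay, **params):.3f}")
```

With these inputs the ratio starts at 1, dips as inverted water magnetization is transferred to the imino proton, and returns toward 1 as water relaxes.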

Double mutant cycles and base-pair coupling analyses

The energetic impact of base substitutions was quantified from thermodynamic cycles that depict differences in transition state free energies: $\mathrm{\Delta \Delta }{G}_{1\to 2}^{\ddagger }=\Delta {G}_{2}^{\ddagger }-\Delta {G}_{1}^{\ddagger }$ , where indices 1 and 2 indicate two different DNA sequences³⁴. Two cycles were constructed. The first was based on NMR data, where $\Delta {G}^{\ddagger }$ represents the barrier for imino exchange, and $\mathrm{\Delta \Delta }{G}_{1\to 2}^{\ddagger }=-RT \mathrm{ln}\left[\frac{{\left({k}_{ex}\right)}_{2}}{{\left({k}_{ex}\right)}_{1}}\right]$ . Differences between the 2TA, 2TT, 2AA, and 2AT sequences were assessed. The second cycle was based on k_cat/K_M measurements, where $\Delta {G}^{\ddagger }$ represents the barrier for enzymatic uracil excision, and $\mathrm{\Delta \Delta }{G}_{1\to 2}^{\ddagger }=-RT \mathrm{ln}\left[\frac{{\left({k}_{cat}/{K}_{M}\right)}_{2}}{{\left({k}_{cat}/{K}_{M}\right)}_{1}}\right]$. In this cycle, differences between 2AT, 2TT, 2AA, 2TA, and 1AT, 1TT, 1AA, 1TA (see Table 1 for substrate nomenclature) were calculated. From these cycles, coupling free energies

$$\Delta G_{coup} = \Delta \Delta G_{AT \to TT}^{\ddag } - \Delta \Delta G_{AA \to TA}^{\ddag } = \Delta \Delta G_{AT \to AA}^{\ddag } - \Delta \Delta G_{TT \to TA}^{\ddag }$$

(2)

between the base pairs directly adjacent to uracil were assessed. ${\Delta G}_{coup}$ is zero when these base pairs are independent of each other, and nonzero when they are coupled and influence each other. Since the transition state free energies of the two cycles correspond to different processes, free energy differences and coupling free energies are expected to differ between the thermodynamic cycles.
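
The double mutant cycle of Eq. (2) can be sketched in Python as follows. The rate values are hypothetical, and the default temperature of 293.15 K corresponds to the 20 °C used for the NMR experiments; the temperature appropriate for the kinetic cycle is an input the user must supply. The sketch also shows that the two expressions in Eq. (2) are algebraically identical.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol K)


def ddg(rate_1, rate_2, temp_k=293.15):
    """Transition-state free energy difference, -RT ln(rate_2 / rate_1)."""
    return -R_KCAL * temp_k * math.log(rate_2 / rate_1)


def coupling_free_energy(rates, temp_k=293.15):
    """Return both forms of Eq. (2); rates maps 'AT', 'TT', 'AA', 'TA' to k_ex or k_cat/K_M."""
    form_1 = (ddg(rates["AT"], rates["TT"], temp_k)
              - ddg(rates["AA"], rates["TA"], temp_k))
    form_2 = (ddg(rates["AT"], rates["AA"], temp_k)
              - ddg(rates["TT"], rates["TA"], temp_k))
    return form_1, form_2


# Hypothetical rates: independent (multiplicative) effects give zero coupling.
independent = {"AT": 1.0, "TT": 2.0, "AA": 3.0, "TA": 6.0}
coupled = {"AT": 1.0, "TT": 2.0, "AA": 3.0, "TA": 12.0}
g_ind = coupling_free_energy(independent)
g_cpl = coupling_free_energy(coupled)
print(g_ind, g_cpl)
```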

MD simulations

The dsDNA sequences of Table 1 were built in the unbent BII conformation using 3DNA³⁵; U₆ was base-paired to A₁₉. Each strand was solvated in a rectangular TIP3P water box³⁶ of 100 mM NaCl with a solvent layer of 15 Å in each direction. After energy minimizations, each system was heated from 100 to 300 K over 2.5 ns with a 1 kcal/(mol·Å²) harmonic restraint on all DNA atoms. During heating, flat bottom distance restraints with a force constant of 1 kcal/(mol·Å²) were added to the hydrogen bonds between the bases. After heating, the harmonic restraints on the DNA atoms were gradually removed over 1.2 ns, while restraints on the hydrogen bonds remained in effect. The latter were subsequently removed over an additional 3 ns. The unrestrained systems were then equilibrated for 400 ns, followed by at least 600 ns of production simulations. Heating and restraints removal were performed with Langevin dynamics in AMBER³⁷, while the production runs were done with Langevin dynamics in OPENMM³⁸. The simulations were performed in NPT, periodic boundary conditions were in effect, SHAKE³⁹ was applied to all covalently bonded hydrogen atoms, and long-range electrostatic interactions were handled using the particle-mesh Ewald method⁴⁰. All simulations used the AMBER OL15 DNA force field⁴¹; deoxyribose parameters for U were taken from T deoxyribose. Convergence was assessed by monitoring cumulative averages of DNA bending and total winding angles. In addition, all trajectories were decorrelated using pymbar⁴², and all properties were calculated from 100 decorrelated frames per trajectory. If needed, simulations were extended for additional blocks of 500 ns until convergence. Simulations were run in triplicate for each strand.

Geometric analyses of the DNA base and step parameters were performed with 3DNA³⁵. DNA bending angles (ɸ) were calculated from tilt, roll, and twist base step angles using the MADBEND procedure^43,44. Bending persistence lengths (BPLs) were calculated from: $BPL = - L\frac{{\partial {\text{ln}}P\left(\upphi \right)}}{{\partial \left( {1 - \cos \left(\upphi \right)} \right)}}$⁴⁵, where L is the contour length, and P(ɸ) the probability of observing a particular bending angle. Contour lengths were calculated from the sum of the helical rise; to account for fraying, the terminal base steps were excluded from the ɸ and BPL analyses. Torsional persistence lengths (TPLs) were calculated from the variance of the total winding angle (${\Delta }_{\Omega }^{2}$): $TPL=L/{\Delta }_{\Omega }^{2}$⁴⁶. The total winding angle was calculated as the sum of the individual twist steps for each sequence. To account for fraying and base flipping, the two terminal base steps and those neighboring U₆ were excluded from the sum.
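
The two persistence-length definitions above can be sketched in Python as follows. This is an illustration under stated assumptions, not the analysis code used here: angles are taken in radians, the population variance is used for the winding angle, and the derivative in the BPL expression is estimated as the least-squares slope of ln P(ɸ) against 1 − cos(ɸ) over tabulated values. The input numbers are synthetic.

```python
import math
import statistics


def torsional_persistence_length(contour_length, winding_angles_rad):
    """TPL = L / variance of the total winding angle."""
    return contour_length / statistics.pvariance(winding_angles_rad)


def bending_persistence_length(contour_length, phis_rad, probabilities):
    """BPL = -L * d ln P(phi) / d(1 - cos phi), slope from a linear fit."""
    xs = [1.0 - math.cos(phi) for phi in phis_rad]
    ys = [math.log(p) for p in probabilities]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return -contour_length * sxy / sxx


# Synthetic distribution built to correspond to BPL = 500 for L = 100 (same length units).
contour = 100.0
phis = [0.1, 0.2, 0.3, 0.4, 0.5]
probs = [math.exp(-5.0 * (1.0 - math.cos(phi))) for phi in phis]
bpl = bending_persistence_length(contour, phis, probs)
tpl = torsional_persistence_length(contour, [9.5, 10.5])
print(f"BPL = {bpl:.1f}, TPL = {tpl:.1f}")
```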

Extrahelical flipping of uracil was assessed by monitoring the hydrogen bonding distance between HN₃ of U₆ and N₁ of its complementary A₁₉, and the flipping angle. This flipping angle was taken as the pseudo-dihedral angle between the center of mass (COM) of the base ring of U₆, the COM of the U₆ backbone, the COM of the backbone of residue 8, and the COM of the backbone of the base complementary to U₆^45,45. Based on visual inspections of the trajectories, U₆ was considered flipped out when the U₆–A₁₉ hydrogen bond distance exceeded 4.5 Å and the pseudo-dihedral angle was greater than 40° or less than −40°. Negative pseudo-dihedral angles correspond to major groove flipping, while positive angles correspond to minor groove flipping.
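
The flipping criterion described above amounts to a simple per-frame classifier, sketched here in Python. The thresholds (4.5 Å and ±40°) and the sign convention are taken from the text; the function name, the state labels, and the example frames are hypothetical.

```python
def classify_flip(hbond_distance_angstrom, pseudo_dihedral_deg,
                  distance_cutoff=4.5, angle_cutoff=40.0):
    """Label one frame as intrahelical or flipped out via the major or minor groove."""
    flipped = (hbond_distance_angstrom > distance_cutoff
               and abs(pseudo_dihedral_deg) > angle_cutoff)
    if not flipped:
        return "intrahelical"
    # Negative angles: major groove pathway; positive angles: minor groove pathway.
    return "major-groove flip" if pseudo_dihedral_deg < 0 else "minor-groove flip"


# Hypothetical frames: (U6-A19 distance in angstroms, pseudo-dihedral in degrees).
frames = [(1.9, 5.0), (5.2, -65.0), (6.0, 55.0), (5.0, 20.0)]
states = [classify_flip(d, a) for d, a in frames]
flipped_fraction = sum(s != "intrahelical" for s in states) / len(states)
print(states, flipped_fraction)
```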

Results

DNA substrates

Relative excision efficiencies for uracils embedded within a long (> 6 kbp) viral dsDNA have previously been reported⁵. Inspection of those results suggests that AUT sequences are generally poorer UNG substrates than TUA sequences despite similar overall AT/GC content. Based on these published data, we designed dsDNA substrates containing uracil in TUA or AUT contexts, and adenine opposite uracil (Table 1). An initial set of substrates (1TA, 2AT, 3AT and 4TA) were designed to span a wide range of uracil excision efficiencies. For uracils embedded in a viral dsDNA genome, substrates 1TA and 4TA represent the high and low end of removal efficiencies in TUA contexts (100% and 50%, respectively), while substrates 3AT and 2AT represent the high and low end in AUT contexts (35% and 10%, respectively)⁵. Substrates 1AT and 2TA were then designed to evaluate the impact of swapping the flanking A and T bases of the substrates with the best (1TA) and worst (2AT) removal efficiency in the original set. Lastly, substrates containing uracil in a AUA (1AA and 2AA) or a TUT (1TT and 2TT) context were designed to evaluate the contributions of the uracil-flanking nucleotides.

Kinetics of uracil removal by UNG

UNG activity was measured as described in Materials and Methods; an example is given in Fig. S1. Although 2AP introduces slight local conformational perturbations^48,49, its effects on enzyme binding and kinetics are marginal^25,26,50,51. Calculated Michaelis–Menten parameters (K_M and k_cat, Figs. 1 and S7) and specificity constants (k_cat/K_M) are listed in Table S4. K_M values fall within the range of previous measurements^23,52,53,54. For substrates 1TA, 2AT, 3AT, and 4TA, k_cat/K_M ratios parallel the percent uracil removal efficiencies reported previously⁵ in longer and more complex viral DNA: 1TA > 4TA > 3AT > 2AT (Fig. 1, inset). For substrates 1AT and 2TA, whose core motifs are identical to 1TA and 2AT except for the swapped flanking bases, k_cat/K_M ratios are higher in TUA than AUT contexts (i.e. 1TA > 1AT and 2TA > 2AT, Fig. 1 and Table S4). Competition experiments (Supporting Information) using UNG acting on two different uracil-containing substrates present in the same reaction mixture confirm that UNG’s specificity is greater in TUA than AUT contexts, and that substrate specificity is not solely determined by the flanking bases (see also Fig. 1, inset).

Fluorescence quantum yields and time-resolved fluorescence

Unlike the canonical DNA bases, 2-aminopurine (2AP) is highly fluorescent when free in solution and strongly quenched when incorporated into DNA²⁶. Inter-base interactions with its neighboring bases in duplex DNA give rise to a multiexponential decay that reflects the highly heterogeneous environment sensed by the probe. We used steady-state and time-resolved 2AP fluorescence to probe DNA dynamics around the uracil lesion. Consistent with previous reports^55,56,57,58, four exponential terms with lifetimes ranging from tens of picoseconds to nanoseconds were needed to fit the time-resolved TCSPC data (Table S2):

$$I\left( t \right) = I_{0} \sum\nolimits_{i = 1}^{4} {\alpha_{i} e^{{ - t/\tau_{i} }} } ,\quad \sum\nolimits_{i = 1}^{4} {\alpha_{i} = 1}$$

(3)

Lifetimes in the picosecond timescale have been reported for 2AP in dsDNA using ultrafast methods⁵⁹, indicating that a fraction of the emitting 2AP fluorophores has lifetimes shorter than the TCSPC resolution (~ 40 ps). The fractional population of the highly stacked probes that give rise to these ultrafast lifetimes (denoted by α₀) was determined from the average lifetimes (Table S2) and the measured fluorescence quantum yields (Table S1)²⁹. Results are shown in Table S5 and Fig. 2.

Seibert et al. reported fluorescence quantum yields and lifetimes of 2AP opposite a uracil in two different 19 bp DNA substrates: one containing uracil flanked by two As (AUA, high UNG efficiency) and one containing uracil flanked by two Gs (GUG, low UNG efficiency)²³. The AUA sequence had a shorter average lifetime (0.32 ns) than GUG (2.48 ns), which was interpreted as AUA being more flexible and therefore leading to more efficient dynamic quenching. Guanine, however, is an efficient quencher of 2AP, and 2AP lifetimes as low as 400 fs were measured for 2AP in the vicinity of G⁵⁹. The surprisingly long average lifetime reported for GUG (2.48 ns), therefore, likely reflects the fact that most of the 2AP population is quenched dynamically with lifetimes shorter than the resolution of the experiment (the shortest lifetimes reported were ~ 200 ps while the typical resolution of TCSPC measurements is about 40 ps). Had the authors measured the short lifetimes with high relative amplitudes expected for 2AP in the vicinity of G, the calculated $\langle \tau \rangle$ would have been significantly shorter and likely below the value measured for AUA. These arguments illustrate the problems with interpreting relative 2AP average lifetimes in terms of substrate flexibility. Because measured average lifetimes are sensitive to instrumental resolution, we favor the currently accepted view that the relative contributions of each lifetime (α_i), but not the lifetimes themselves, are useful measures of the degree of base stacking, and therefore reflect substrate flexibility²⁶.

The excited 2AP population is expected to be partitioned between several different local environments that lead to different quenching efficiencies. The normalized amplitudes in Eq. (3) (α_i) measure the fractional population of each conformation detectable by TCSPC, from more stacked (α₁) to more exposed to the solvated environment (α₄)²⁶. Here, we focus on the remaining population of 2AP molecules that cannot be detected by TCSPC (α₀). This population emits with lifetimes shorter than the resolution of the experiment (~ 40 ps) due to ultrafast quenching. All calculated α₀ values are quite high (α₀ > 0.4 for all sequences, Table S5), but values are higher for substrates containing uracil in an AUT context compared to TUA. A higher α₀ value indicates a higher fraction of highly stacked 2AP fluorophores, which we interpret as an indication of a less deformable substrate.

Substrates 1AT and 2TA were designed from parent substrates with high (1TA) and low (2AT) UNG efficiency to test the hypothesis that differences in substrate deformability determine preferential repair efficiencies. Swapping the flanking A and T while keeping all other bases constant affects stacking interactions in the vicinity of the uracil without changing the melting temperature. This swap resulted in a higher fraction of highly stacked 2AP probes for substrates with AUT contexts, i.e. α₀ (1AT) > α₀ (1TA) and α₀ (2AT) > α₀ (2TA). As noted above, k_cat/K_M values follow the opposite trend, suggesting a correlation between substrate deformability and uracil removal efficiency by UNG. A strongly negative correlation between k_cat/K_M and α₀ was indeed observed for all sequences (Fig. 2A), indicating that more flexible sequences have higher repair rates.

Molecular Dynamics (MD) Simulations

The flexibility of the various DNA duplexes (Table 1) was quantified by MD. Calculated bending persistence lengths showed that all TA sequences and 3AT were more flexible than undamaged DNA (which has a persistence length of ~ 500 Å)^60,61, while the 1AT, 2AT, 1TT, 2TT, 1AA, and 2AA sequences were similar to undamaged DNA in bending rigidity (Table S6). We observed a clear distinction between the TA and AT sequences, with the former having lower bending persistence lengths than the latter. The AA and TT sequences were the least bendable. The higher flexibility of the TA sequences was largely echoed in calculated torsional persistence lengths: the AT sequences generally had larger torsional persistence lengths than the TA sequences. The only exception was the 3AT sequence, which was torsionally more flexible than the stiffest TA sequence (2TA). Torsional persistence lengths of the AA and TT sequences were similar to the AT sequences. The standard deviation of the bending angle was highest for the TA sequences, indicating large bending flexibility of these sequences, and lowest for AA/TT, indicating higher rigidity (Table S6). Overall, calculated properties indicated the highest flexibility for the TA sequences and the lowest flexibility for the AA and TT sequences. Figure 3 shows that these three properties are strongly correlated with the α₀ values obtained from fluorescence measurements. The correlation coefficient (r) was 0.918 for the bending persistence length, 0.869 for the torsional persistence length, and − 0.939 for the standard deviation of the bending angle. Moreover, ranking of the sequences by flexibility was largely similar for these MD measures and α₀.
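For readers who want to reproduce this kind of comparison, the following is a minimal sketch of a Pearson correlation coefficient between a fluorescence observable and an MD-derived flexibility measure. The function name and both data series are placeholders invented for illustration; they are not the values in Tables S5 and S6, and the article does not state which software was used to compute r.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Placeholder values: a higher alpha_0 paired with a narrower bending-angle
# distribution, mimicking the sign of the trend described in the text.
alpha0_values = [0.45, 0.55, 0.60, 0.70]
bend_angle_sd_deg = [14.0, 12.5, 11.0, 9.0]

r = pearson_r(alpha0_values, bend_angle_sd_deg)
print(f"r = {r:.3f}")
```

A negative r here mirrors the qualitative statement that sequences with more stacked probes show smaller bending fluctuations; with only a handful of sequences, such coefficients should be read as indicative rather than precise.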

The sequences displayed markedly different local dynamics around the lesion. Figure S8 shows the shift, slide, and rise translational step parameters, and tilt, roll, and twist rotational parameters for the central A₅U₆A₇, A₅U₆T₇, T₅U₆A₇, and T₅U₆T₇ steps. These values were averaged over all trajectories and all sequences; reported standard deviations are a measure of the base step flexibilities. Interestingly, differences in UA and AU step flexibilities depend on context and do not completely mirror the behavior of TA and AT steps of undamaged DNA^18,19,21. For example, UA is particularly flexible in the TUA motif, but rigid in AUA. The TUA motif was by far the most flexible, displaying large flexibilities in all step parameters of both steps. Some asymmetries in the UA and TU steps of this motif were observed. The roll angle of its UA step was more flexible than that of its TU step, its TU step was more flexible than UA in slide and twist, while both steps had similar flexibilities for the other parameters. The second most flexible motif was AUT. Its UT step was more flexible in roll, twist, shift, and rise, its AU step was more flexible in slide, and its tilt flexibility was similar for both steps. In contrast, all steps of the AUA and TUT motifs displayed low flexibilities.

The main reason for the high flexibility of the central base steps of the TUA sequences was extra-helical base flipping of U₆ (Table 2), which was observed in all TUA sequences. Flipping was reversible and occurred throughout the simulations, but in 2TA and 4TA U₆ remained extra-helical for nearly the entirety of the simulations. Base flipping was also observed in the AUT sequences, particularly in the 3AT and 1AT sequences, but it was far less prominent than when uracil was in a TUA context. 1AA and 2AA displayed even less flipping than the AUT sequences, and flipping did not occur in 1TT and 2TT.

Table 2 Average and standard deviation of the time U6 is intra-helical over three MD replicas.


Flipping of U₆ was highly correlated to bending motions. The standard deviation of the flipping angle was highly correlated to the bending persistence length and the standard deviation of the bending angle, with correlation coefficients of − 0.972 and 0.897, respectively (Fig. 4). The correlation to the torsional motion was weaker, with a correlation coefficient of − 0.763 for the torsional persistence length. Flipping would start toward the major groove (negative flipping angles). The fully flipped U would either remain in the major groove or start interacting with the minor groove, thereby changing the flipping angle to positive values. Since flipping was prevalent in the TA sequences, this behavior led to large standard deviations over the TA replicas (Fig. 4).

NMR-detected imino proton exchange rates

We probed individual base pair imino proton exchange rates of UNG substrates with solution NMR spectroscopy. These rates are directly related to nucleic acid breathing motions, whereby the imino base pair protons in transiently open states can exchange with water (Fig. 5A)^33,62,63. The imino proton exchange rate (k_ex) provides insight into the base pair stability and duplex dynamics at the single base-pair level. As indicated by the MD simulations, we hypothesized that the k_ex of the target uracil (U₆, Fig. 5B) will be sequence-dependent, and that a higher k_ex for central base pairs reflects more suitable UNG substrates. The imino protons for each NMR duplex listed in Table 1 were assigned (Fig. S3), and individual base pair k_ex values were measured. The 1TA exchange rates were not characterized due to significant resonance overlap of uracil (U₆) with T₁₈ (Fig. S3) which impeded accurate deconvolution and analysis of the U₆ and T₁₈ resonances in R_1n and k_ex experiments. The UNG substrates from series one were therefore not included in the NMR analyses. The NMR data used for fitting the k_ex are deposited under BMRB Entry ID 51612.

Imino proton NMR spectra are only observed in the presence of base-pairing^64,65; due to fraying, terminal duplex base-pairs were therefore not observed. Though most of the surrounding exchange rates were comparable between sequences, the central uracil exhibited strikingly distinct k_ex values that varied by more than ten-fold across sequences (Fig. 5C). Differences in uracil imino exchange rates demonstrate a strong dependence on adjacent base pairs. Between 2AT and 2TA, the k_ex increases significantly as the result of a conserved change in sequence order of the surrounding base pairs. Given the significant differences in dynamics observed by fluorescence, MD simulations, and NMR when swapping between AT and TA bases flanking the central uracil, we measured imino proton exchange rates for 2TT and 2AA to elucidate whether the 5′ T or 3′ A relative to uracil dictates substrate flexibility and UNG efficiency. Interestingly, the sequences with the highest uracil imino exchange rates are those with adenine 3′ to U₆ (A₇). The 2TA duplex has the highest U₆ k_ex, 10.24 ± 0.75 s⁻¹, while 2AA is intermediate with a k_ex of 3.32 ± 0.31 s⁻¹ (Fig. 5C). Our experiments identify that the base pair at the 3′ side of uracil is most influential on the k_ex of U₆, demonstrated by the nearly 15-fold difference in k_ex between 2TA and 2TT (10.24 ± 0.75 s⁻¹ and 0.70 ± 0.08 s⁻¹, respectively). This could suggest a structural and/or energetic hindrance to UNG efficiency in repairing such sequences. A positive correlation between k_ex of U₆ and k_cat/K_M was observed (Fig. 2B), indicating that more flexible sequences have higher repair efficiencies.
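The fold difference quoted above can be checked directly from the two reported rates. The sketch below also attaches an approximate uncertainty to the ratio, assuming the two reported errors are independent and using first-order error propagation; that error treatment is an assumption of this sketch, not a procedure described in the article.

```python
import math


def ratio_with_error(a, sigma_a, b, sigma_b):
    """Return a/b and its first-order propagated uncertainty.

    Assumes the uncertainties of a and b are independent.
    """
    ratio = a / b
    relative = math.sqrt((sigma_a / a) ** 2 + (sigma_b / b) ** 2)
    return ratio, ratio * relative


# U6 imino exchange rates reported in the text (s^-1).
k_ex_2ta, err_2ta = 10.24, 0.75
k_ex_2tt, err_2tt = 0.70, 0.08

fold, fold_err = ratio_with_error(k_ex_2ta, err_2ta, k_ex_2tt, err_2tt)
print(f"k_ex(2TA) / k_ex(2TT) = {fold:.1f} +/- {fold_err:.1f}")
```

The ratio comes out close to 15, in line with the "nearly 15-fold" statement, with an uncertainty of roughly two units under the independence assumption.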

Thermodynamic cycle analysis

Given that the base identity 3′ to uracil most significantly contributes to substrate flexibility, we probed allosteric coupling between the 5′ and 3′ uracil flanking positions. Thermodynamic cycle analysis of the NMR data (k_ex) measured for substrate series two identifies that there is coupling between these positions with a ΔG_coup of 3.9 ± 0.4 kJ/mol (Eq. 2 and Fig. 6). Based on the finding that substrate dynamics govern UNG activity, we expected coupling between the 5′ and 3′ flanking uracil positions in enzyme kinetics as well. This was validated by a thermodynamic cycle of substrate series two using $\Delta G^{\ddagger}$ values from k_cat/K_M measurements, yielding a coupling energy of 1.8 ± 0.6 kJ/mol (Fig. 6). Thermodynamic theory dictates that this allosteric coupling should be preserved independent of the surrounding sequence, and to confirm this, a thermodynamic cycle was carried out with substrate series one. As expected, we obtained the same coupling energies in both cases (horizontal lines in Fig. 6).
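A double-mutant cycle of this kind can be sketched in a few lines. The version below assumes that each rate (k_ex or k_cat/K_M) maps to a relative free energy through ΔG = −RT ln k, and defines the coupling energy as the difference between the two parallel 3′ T → A edges of the cycle. Eq. 2 is not reproduced in this section, so the sign convention, the temperature of 298.15 K, the function name, and the example rates are all assumptions made for illustration.

```python
import math

R_KJ_PER_MOL_K = 8.314462618e-3  # gas constant in kJ/(mol K)


def coupling_energy(k_at, k_aa, k_tt, k_ta, temperature_k=298.15):
    """Coupling energy (kJ/mol) of a 5'/3' double-mutant cycle.

    Labels give the 5' and 3' neighbours of uracil (e.g. k_ta is the TUA rate).
    Assumed convention: coupling = ddG(AT->AA) - ddG(TT->TA),
    with ddG(X->Y) = -RT ln(k_Y / k_X).
    """
    rt = R_KJ_PER_MOL_K * temperature_k
    ddg_3prime_swap_5a = -rt * math.log(k_aa / k_at)  # 3' T->A with 5' A
    ddg_3prime_swap_5t = -rt * math.log(k_ta / k_tt)  # 3' T->A with 5' T
    return ddg_3prime_swap_5a - ddg_3prime_swap_5t


# Hypothetical rates: purely additive effects give zero coupling.
additive = coupling_energy(k_at=1.0, k_aa=3.0, k_tt=2.0, k_ta=6.0)

# Hypothetical rates: the 3' swap is twice as effective when 5' is T.
coupled = coupling_energy(k_at=1.0, k_aa=3.0, k_tt=2.0, k_ta=12.0)

print(f"additive: {additive:.2f} kJ/mol, coupled: {coupled:.2f} kJ/mol")
```

With this convention, a positive coupling energy means that the 3′ T → A substitution has a larger effect when the 5′ neighbour is T, and a value of zero means the two positions act independently.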

Discussion

The first uracil DNA glycosylase (UNG) was identified almost 50 years ago in E. coli⁶⁶, and has since been found across mammals, plants, bacteria, and viruses. While UNG is well characterized functionally and structurally, the observation that DNA substrate sequence significantly impacts its enzymatic efficiency has not been fully explained. To deconvolute the physical DNA substrate properties that impact UNG function, we employed a set of dsDNA UNG substrates that encompasses variable enzymatic efficiencies. UNG activity was correlated to substrate uracil dynamics by fluorescence, NMR, and MD. These studies identified both proximal and distal substrate features whose contributions impact UNG activity.

Uracil recognition by human and E. coli UNG relies on trapping extrahelical uracils that are spontaneously exposed by thermally induced base pair opening motions^12,13. NMR imino proton exchange measurements have shown that U:A base pairs open with rate constants that are about one order of magnitude faster than T:A base pairs in the same sequence context¹⁴, and this difference has been proposed to contribute to the mechanism by which UNG discriminates between thymine and uracil. Here, we report uracil imino exchange rates that vary nearly 15-fold across sequences. These results support the notion that the sequence context surrounding the uracil modulates its opening dynamics and ultimately regulates the rate of uracil removal. To establish a correlation between substrate flexibility and UNG activity, we determined the UNG Michaelis–Menten constants for ten designed DNA substrates containing uracil flanked by either A or T bases (Figs. 1, S7 and Table S4). This experimental design allowed us to focus on the effects of substrate flexibility without affecting the melting temperature and stability of the duplex. UNG specificity constants (k_cat/K_M) were calculated and used to evaluate relative UNG substrate preferences. We identify that the UNG specificity constant (k_cat/K_M) is generally smaller for substrates containing a thymine 3′ to the uracil. Replacing the 3′ thymine in substrates 1TT and 2TT with an adenine (TUT → TUA) results in a ca. threefold increase in k_cat/K_M. Swapping A and T around the uracil in substrates 1AT and 2TA (AUT → TUA) also results in an increase in k_cat/K_M.

The fact that UNG recognizes spontaneously exposed uracil suggests that sequence effects in k_cat/K_M are due to inherent differences in the flexibility of the DNA helix around the uracil. Accordingly, we observe an inverse correlation between k_cat/K_M and α₀, a fluorescence-derived observable that measures the degree of base stacking around the uracil (Fig. 2A). Values of α₀ are smaller for TUA sequences than for AUT, indicating that uracil is less dynamic in the second group. Similarly, a thymine 3′ to U results in lower exchange rate constants for the uracil imino proton (Table S3 and Fig. 2B), and we observed a correlation between k_cat/K_M and k_ex for the five substrates for which we were able to obtain k_ex constants by NMR (Fig. 2B). This further supports the notion that k_cat/K_M is greater for substrates containing the uracil in more flexible contexts. We note that there is no clear correlation between k_cat and α₀ (Fig. S9), indicating that the values of K_M, but not k_cat, are dictated by the mechanical properties of the substrate. This is consistent with current mechanistic knowledge that UNG k_cat is limited by product release^67,68.

MD data suggest that the fluorescence spectroscopy-derived α₀ values and the NMR-determined U₆-k_ex values measure different aspects of substrate motion. Values of α₀ correlate more strongly with DNA bending than uracil flipping (Figs. 3, 4), while U₆-k_ex values correlate more strongly to base flipping than DNA bending (Figs. 4, S10). Nevertheless, these motions are coupled (Fig. 4), and given the correlation of α₀ and U₆-k_ex with k_cat/K_M (Fig. 2), both motions contribute to UNG substrate recognition and uracil excision. The necessity of both motions is consistent with crystal structures, in which DNA in complex with UNG is both flipped and bent (by an average of 33°)^10,13,69,70,71,72,73. The stronger correlation of α₀ with k_cat/K_M (Fig. 2) is intriguing since it would indicate that excision is more strongly correlated to DNA bending than base flipping. The observation that K_M, but not k_cat, is governed by the mechanical properties of the substrate indicates that DNA bending favors the formation of the enzyme–substrate complex. Our MD data show that DNA bending correlates with base flipping, which is consistent with previous studies that show that DNA bending facilitates base flipping by pushing the system up in energy^74,75.

For a given substrate, changing one of the bases flanking the uracil affects k_cat/K_M in a way that depends on the identity of the other. For example, for substrate series one (i.e., 1AT, 1AA, 1TT, and 1TA) and two (i.e., 2AT, 2AA, 2TT, and 2TA), the effect of replacing an adenine 3′ to uracil with thymine is much greater when the base 5′ to uracil is T than A. Similarly, the effect of substituting an adenine 5′ to uracil by thymine is much greater when the base 3′ to uracil is T than A. The thermodynamic coupling between the bases flanking the uracil was analyzed by two types of double mutant cycles, one using data from solution NMR on the isolated substrates, and one from UNG activity (Fig. 6)³⁴. It is evident from both cycles that $\Delta\Delta G_{AT\to AA}^{\ddagger} \ne \Delta\Delta G_{TT\to TA}^{\ddagger}$ and $\Delta\Delta G_{AT\to TT}^{\ddagger} \ne \Delta\Delta G_{AA\to TA}^{\ddagger}$, indicating that substituting one of the bases adjacent to uracil affects the energy of the transition state in a manner that depends on the identity of the other one. In other words, the sites directly 5′ and 3′ of uracil are allosterically coupled. Substituting a thymine 3′ to uracil by adenine (vertical edges of the cube in Fig. 6) stabilizes the transition states, but in both cycles, effects are more significant when the base 5′ to uracil is thymine. Similarly, substituting a thymine 5′ to uracil by adenine stabilizes the transition state when the base 3′ to uracil is thymine, but destabilizes it when adenine is in this position. This allosteric coupling is quantified by the coupling energy, with a value of 3.9 ± 0.4 kJ/mol for the k_ex-based cycle with substrate series two, and values of 1.8 ± 0.6 kJ/mol and 1.8 ± 0.5 kJ/mol for the k_cat/K_M-based cycle with substrate series two and one, respectively.
While magnitudes of coupling necessarily differ between the two cycles, the coupling has the same sign in both cases, which supports the conclusion that UNG catalysis correlates with substrate dynamics. Coupling energies are higher for the imino proton exchange-based cycle compared to the k_cat/K_M-based cycle. Considering that the former measures coupling in the isolated substrates, these results point to the role of the enzyme in reducing the coupling energy, consistent with the fact that UNG must be capable of removing uracil in diverse sequence contexts.

For the k_cat/K_M-based cycle, coupling energies for substrate series one and two are identical, within error, indicating that the effect of a single change in a base adjacent to uracil is independent of the identity of the bases that differ between substrate series one and series two (i.e., $\Delta\Delta G_{1XY\to 1XX}^{\ddagger}=\Delta\Delta G_{2XY\to 2XX}^{\ddagger}$, where X and Y are A or T). This indicates that all the combined differences between substrates one and two have a constant effect on k_cat/K_M that is independent of the identity of the bases flanking the uracil. Consistent with this, all values of $\Delta G_{2XY\to 1XY}^{\ddagger}$ are the same for any combination of X and Y (lines connecting the thermodynamic squares of substrates 1 and 2 in Fig. 6). Although results show that sequence effects are not limited to the bases immediately surrounding the uracil lesion (k_cat/K_M values are 1.4-fold greater for substrate series one), for these sequences, effects appear to be additive to the effects of the bases flanking the uracil. Based on this analysis, we conclude that the bases adjacent to the uracil have the greatest impact on both substrate flexibility and UNG activity, and that the bases not immediately surrounding the uracil provide a secondary level of modulation.
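The additivity argument can be illustrated numerically: if every k_cat/K_M value in one series is a constant multiple of its counterpart in the other, the free-energy offset between series is the same for all four motifs and every edge within a cycle is unchanged. The sketch below uses the 1.4-fold factor quoted in the text; the series-two rates, the temperature of 298.15 K, and the helper name `ddg` are hypothetical choices made only for this example.

```python
import math

RT_KJ_PER_MOL = 8.314462618e-3 * 298.15  # assumed temperature of 298.15 K


def ddg(k_from, k_to):
    """Free-energy change (kJ/mol) for going from rate k_from to rate k_to."""
    return -RT_KJ_PER_MOL * math.log(k_to / k_from)


# Hypothetical specificity constants for one series (arbitrary units).
series_two = {"AT": 1.0, "AA": 2.0, "TT": 1.5, "TA": 6.0}

# A constant 1.4-fold scaling, as quoted in the text for series one.
series_one = {motif: 1.4 * k for motif, k in series_two.items()}

# Offset between the two series for each flanking motif.
offsets = {motif: ddg(series_two[motif], series_one[motif]) for motif in series_two}

# The same edge of the cycle evaluated within each series.
edge_two = ddg(series_two["TT"], series_two["TA"])
edge_one = ddg(series_one["TT"], series_one["TA"])

print({motif: round(value, 3) for motif, value in offsets.items()})
print(round(edge_two, 3), round(edge_one, 3))
```

Under these assumptions the offset is about −0.83 kJ/mol for every motif, and the within-series edges coincide, which is the numerical counterpart of a constant, additive contribution from the bases that differ between the two series.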

While our study focused on E. coli UNG, we anticipate the results to be broadly applicable because the catalytic cores of different UNGs are closely related. For example, the root-mean-square deviation between all Cα positions of human and E. coli UNG enzymes is just 0.9 Å^76,77. UNGs are also structurally similar (despite low sequence identity) to bacterial mismatch-specific uracil-DNA glycosylases (MUGs) and to eukaryotic thymine-DNA glycosylases (TDGs), which use a base-flipping mechanism for the recognition of uracil and thymine^78,79. We speculate that the MUG and TDG efficiencies will also be dictated in large part by substrate lesion DNA dynamics, though future studies are needed to test this hypothesis. Given the fundamental nature of genomic integrity, the implications of our studies are significant. That repair efficiencies, at least for UNG, are in large part dictated by DNA sequence deformability and flexibility could help explain the molecular mechanisms that underlie fundamental observations in the fields of oncogenetics and cancer hotspots⁸⁰, and evolutionary adaptation⁸¹. One particularly relevant context extends to the rapidly expanding field of base editing, where UNG and other glycosylases have been tethered to Cas9 nickases enabling precision DNA alterations with great potential for therapeutic intervention^82,83. Ultimately, our data show a clear correlation between UNG activity and substrate flexibility that can be used to make predictions about the functional attributes of substrates, and may help rationalize sequence effects in base editing and other fields.

Data availability

The data that support the findings of this paper are available from the corresponding authors upon reasonable request. The NMR data used for fitting the k_ex are deposited under BMRB Entry ID 51612.

References

McCullough, A. K., Dodson, M. L. & Lloyd, R. S. Initiation of base excision repair: Glycosylase mechanisms and structures. Annu. Rev. Biochem. 68, 255–285 (1999).
Jones, M., Wagner, R. & Radman, M. Repair of a mismatch is influenced by the base composition of the surrounding nucleotide-sequence. Genetics 115, 605–610 (1987).
SibghatUllah et al. Base analog and neighboring base effects on substrate specificity of recombinant human G:T mismatch-specific thymine DNA-glycosylase. Biochemistry 35, 12926–12932 (1996).
Eftedal, I., Guddal, P. H., Slupphaug, G., Volden, G. & Krokan, H. E. Consensus sequences for good and poor removal of uracil from double-stranded DNA by uracil-DNA glycosylase. Nucl. Acids Res. 21, 2095–2101 (1993).
Nilsen, H., Yazdankhah, S. P., Eftedal, I. & Krokan, H. E. Sequence specificity for removal of uracil from U·A pairs and U·G mismatches by uracil-DNA glycosylase from Escherichia coli, and correlation with mutational hotspots. FEBS Lett. 362, 205–209 (1995).
Dolan, M. E., Oplinger, M. & Pegg, A. E. Sequence specificity of guanine alkylation and repair. Carcinogenesis 9, 2139–2143 (1988).
Haseltine, W. A. et al. Cleavage of pyrimidine dimers in specific DNA-sequences by a pyrimidine dimer DNA-glycosylase of M-luteus. Nature 285, 634–641 (1980).
Krokan, H. E., Drablos, F. & Slupphaug, G. Uracil in DNA–occurrence, consequences and repair. Oncogene 21, 8935–8948 (2002).
Visnes, T. et al. Uracil in DNA and its processing by different DNA glycosylases. Philos. Trans. R. Soc. B 364, 563–568 (2009).
Parikh, S. S. et al. Base excision repair initiation revealed by crystal structures and binding kinetics of human uracil-DNA glycosylase with DNA. Embo J. 17, 5214–5226 (1998).
Fuxreiter, M., Luo, N., Jedlovszky, P., Simon, I. & Osman, R. Role of base flipping in specific recognition of damaged DNA by repair enzymes. J. Mol. Biol. 323, 823–834 (2002).
Cao, C. Y., Jiang, Y. L., Stivers, J. T. & Song, F. H. Dynamic opening of DNA during the enzymatic search for a damaged base. Nat. Struct. Mol. Biol. 11, 1230–1236 (2004).
Parker, J. B. et al. Enzymatic capture of an extrahelical thymine in the search for uracil in DNA. Nature 449, 433–437 (2007).
Parker, J. B. & Stivers, J. T. Dynamics of uracil and 5-fluorouracil in DNA. Biochemistry 50, 612–617 (2011).
Carr, C. E., Khutsishvili, I., Gold, B. & Marky, L. A. Thermodynamic stability of DNA duplexes comprising the simplest T –> dU substitutions. Biochemistry 57, 5666–5671 (2018).
Slupphaug, G. et al. Properties of a recombinant human uracil-DNA glycosylase from the UNG gene and evidence that UNG encodes the major uracil-DNA glycosylase. Biochemistry 34, 128–138 (1995).
Holz, K., Pavlic, A., Lietard, J. & Somoza, M. M. Specificity and efficiency of the uracil DNA glycosylase-mediated strand cleavage surveyed on large sequence libraries. Sci. Rep. 9, 17822 (2019).
ElHassan, M. A. & Calladine, C. R. Conformational characteristics of DNA: Empirical classifications and a hypothesis for the conformational behaviour of dinucleotide steps. Philos. Trans. R. Soc. A 355, 43–100 (1997).
Packer, M. J., Dauncey, M. P. & Hunter, C. A. Sequence-dependent DNA structure: dinucleotide conformational maps. J. Mol. Biol. 295, 71–83 (2000).
Heddi, B., Abi-Ghanem, J., Lavigne, M. & Hartmann, B. Sequence-dependent DNA flexibility mediates DNase I cleavage. J. Mol. Biol. 395, 123–133 (2010).
Widom, J. Role of DNA sequence in nucleosome stability and dynamics. Q. Rev. Biophys. 34, 269–324 (2001).
Stivers, J. T. & Jiang, Y. L. A mechanistic perspective on the chemistry of DNA repair glycosylases. Chem. Rev. 103, 2729–2760 (2003).
Seibert, E., Ross, J. B. & Osman, R. Role of DNA flexibility in sequence-dependent activity of uracil DNA glycosylase. Biochemistry 41, 10976–10984 (2002).
Stivers, J. T. 2-Aminopurine fluorescence studies of base stacking interactions at abasic sites in DNA: Metal-ion and base sequence effects. Nucl. Acids Res. 26, 3837–3844 (1998).
Stivers, J. T., Pankiewicz, K. W. & Watanabe, K. A. Kinetic mechanism of damage site recognition and uracil flipping by Escherichia coli uracil DNA glycosylase. Biochemistry 38, 952–963 (1999).
Jones, A. C. & Neely, R. K. 2-Aminopurine as a fluorescent probe of DNA conformation and the DNA-enzyme interface. Q. Rev. Biophys. 48, 244–279 (2015).
Levitus, M. Tutorial: Measurement of fluorescence spectra and determination of relative fluorescence quantum yields of transparent samples. Methods Appl. Fluores 8, 033001 (2020).
Ward, D. C., Reich, E. & Stryer, L. Fluorescence studies of nucleotides and polynucleotides .I. Formycin 2-aminopurine riboside 2,6-diaminopurine riboside and their derivatives. J. Biol. Chem. 244, 1228 (1969).
Avilov, S. V., Piemont, E., Shvadchak, V., de Rocquigny, H. & Mely, Y. Probing dynamics of HIV-1 nucleocapsid protein/target hexanucleotide complexes by 2-aminopurine. Nucl. Acids Res. 36, 885–896 (2008).
Fürtig, B., Richter, C., Wöhnert, J. & Schwalbe, H. NMR spectroscopy of RNA. ChemBioChem 4, 936–962 (2003).
Weiss, M. A., Patel, D. J., Sauer, R. T. & Karplus, M. Two-dimensional 1H NMR study of the lambda operator site OL1: A sequential assignment strategy and its application. Proc. Natl. Acad. Sci. U. S. A. 81, 130–134 (1984).
Szulik, M. W., Voehler, M. & Stone, M. P. NMR analysis of base-pair opening kinetics in DNA. Curr. Protoc Nucl. Acid Chem. 59, 72021–272018 (2014).
Anosova, I. et al. Structural insights into conformation differences between DNA/TNA and RNA/TNA chimeric duplexes. ChemBioChem 17, 1705–1708 (2016).
Carter, P. J., Winter, G., Wilkinson, A. J. & Fersht, A. R. The use of double mutants to detect structural changes in the active site of the tyrosyl-tRNA synthetase (Bacillus stearothermophilus). Cell 38, 835–840 (1984).
Lu, X.-J. & Olson, W. K. 3DNA: a versatile, integrated software system for the analysis, rebuilding and visualization of three-dimensional nucleic-acid structures. Nat. Protoc. 3, 1213–1227 (2008).
Jorgensen, W. L., Chandrasekhar, J., Madura, J. D., Impey, R. W. & Klein, M. L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 79, 926–935 (1983).
AMBER 2018 (University of California, San Francisco, 2018).
Eastman, P. et al. OpenMM 7: Rapid development of high performance algorithms for molecular dynamics. PLOS Comp. Biol. 13, e1005659 (2017).
Ryckaert, J.-P., Ciccotti, G. & Berendsen, H. Numerical-integration of cartesian equations of motion of a system with constraints – molecular-dynamics of N-alkanes. J. Comput. Phys. 23, 327–341 (1977).
Darden, T., York, D. & Pedersen, L. Particle mesh Ewald: An N⋅log( N ) method for Ewald sums in large systems. J. Chem. Phys. 98, 10089–10092 (1993).
Galindo-Murillo, R. et al. Assessing the Current State Of Amber Force Field Modifications for DNA. J. Chem. Theory Comput 12, 4114–4127 (2016).
Shirts, M. R. & Chodera, J. D. Statistically optimal analysis of samples from multiple equilibrium states. J. Chem. Phys. 129, 124105–124105 (2008).
Strahs, D. & Schlick, T. A-tract bending: Insights into experimental structures by computational models. J. Mol. Biol. 301, 643–663 (2000).
Ma, N. & van der Vaart, A. Anisotropy of B-DNA groove bending. J. Am. Chem. Soc. 138, 9951–9958 (2016).
Mazur, A. K. Wormlike chain theory and bending of short DNA. Phys. Rev. Lett. 98, 218102 (2007).
Mazur, A. K. Evaluation of elastic properties of atomistic DNA models. Biophys. J. 91, 4507–4518 (2006).
Banavali, N. K. & MacKerell, A. D. Free energy and structural pathways of base flipping in a DNA GCGC containing sequence. J. Mol. Biol. 319, 141–160 (2002).
Dallmann, A. et al. 2-aminopurine incorporation perturbs the dynamics and structure of DNA. Angew. Chem. Int. Ed. 49, 5989–5992 (2010).
Lycksell, P. O. et al. Base pair opening dynamics of a 2-aminopurine substituted Eco RI restriction sequence and its unsubstituted counterpart in oligonucleotides. Nucl. Acids Res. 15, 9011–9025 (1987).
Bellamy, S. R. & Baldwin, G. S. A kinetic analysis of substrate recognition by uracil-DNA glycosylase from herpes simplex virus type 1. Nucl. Acids Res. 29, 3857–3863 (2001).
McCullough, A. K., Dodson, M. L., Scharer, O. D. & Lloyd, R. S. The role of base flipping in damage recognition and catalysis by T4 endonuclease V. J. Biol. Chem. 272, 27210–27217 (1997).
Slupphaug, G. et al. A nucleotide-flipping mechanism from the structure of human uracil-DNA glycosylase bound to DNA. Nature 384, 87–92 (1996).
Krusong, K., Carpenter, E. P., Bellamy, S. R., Savva, R. & Baldwin, G. S. A comparative study of uracil-DNA glycosylases from human and herpes simplex virus type 1. J. Biol. Chem. 281, 4983–4992 (2006).
Kavli, B. et al. Excision of cytosine and thymine from DNA by mutants of human uracil-DNA glycosylase. Embo J. 15, 3442–3447 (1996).
Neely, R. K. et al. Time-resolved fluorescence of 2-aminopurine as a probe of base flipping in M.HhaI-DNA complexes. Nucl. Acids Res. 33, 6953–6960 (2005).
Guest, C. R., Hochstrasser, R. A., Sowers, L. C. & Millar, D. P. Dynamics of mismatched base pairs in DNA. Biochemistry 30, 3271–3279 (1991).
Rachofsky, E. L., Seibert, E., Stivers, J. T., Osman, R. & Ross, J. B. Conformation and dynamics of abasic sites in DNA investigated by time-resolved fluorescence of 2-aminopurine. Biochemistry 40, 957–967 (2001).
Nordlund, T. M. et al. Structure and dynamics of a fluorescent DNA oligomer containing the EcoRI recognition sequence: Fluorescence, molecular dynamics, and NMR studies. Biochemistry 28, 9095–9103 (1989).
Manoj, P., Min, C.-K., Aravindakumar, C. T. & Joo, T. Ultrafast charge transfer dynamics in 2-aminopurine modified double helical DNA. Chem. Phys. 352, 333–338 (2008).
Baumann, C. G., Smith, S. B., Bloomfield, V. A. & Bustamante, C. Ionic effects on the elasticity of single DNA molecules. Proc. Natl. Acad. Sci. U. S. A. 94, 6185–6190 (1997).
Ma, N. & van der Vaart, A. KCl dependence of B-DNA groove bending anisotropy. J. Phys. Chem. B 121, 5322–5330 (2017).
Guéron, M. & Leroy, J. L. [16] Studies of base pair kinetics by NMR measurement of proton exchange. In Methods in Enzymology Vol. 261 383–413 (Academic Press, 1995).
Guéron, M., Kochoyan, M. & Leroy, J.-L. A single mode of DNA base-pair opening drives imino proton exchange. Nature 328, 89–92 (1987).
Wemmer, D. E., Wand, A. J., Seeman, N. C. & Kallenbach, N. R. NMR analysis of DNA junctions: Imino proton NMR studies of individual arms and intact junction. Biochemistry 24, 5745–5749 (1985).
Patel, D. J., Pardi, A. & Itakura, K. DNA conformation, dynamics, and interactions in solution. Science 216, 581–590 (1982).
Lindahl, T. An N-glycosidase from Escherichia coli that releases free uracil from DNA containing deaminated cytosine residues. Proc. Natl. Acad. Sci. U. S. A. 71, 3649–3653 (1974).
Fitzgerald, M. E. & Drohat, A. C. Coordinating the initial steps of base excision repair: Apurinic/apyrimidinic endonuclease 1 actively stimulates thymine DNA glycosylase by disrupting the product complex. J. Biol. Chem. 283, 32680–32690 (2008).
Wong, I., Lundquist, A. J., Bernards, A. S. & Mosbaugh, D. W. Presteady-state analysis of a single catalytic turnover by Escherichia coli uracil-DNA glycosylase reveals a “pinch-pull-push” mechanism. J. Biol. Chem. 277, 19424–19432 (2002).
Slupphaug, G. et al. A nucleotide-flipping mechanism from the structure of human uracil–DNA glycosylase bound to DNA. Nature 384, 87–92 (1996).
Parikh, S. S. et al. Uracil-DNA glycosylase–DNA substrate and product structures: Conformational strain promotes catalytic efficiency by coupled stereoelectronic effects. Proc. Natl. Acad. Sci. U. S. A. 97, 5083–5088 (2000).
Bianchet, M. A. et al. Electrostatic guidance of glycosyl cation migration along the reaction coordinate of uracil DNA glycosylase. Biochemistry 42, 12455–12460 (2003).
Kosaka, H., Hoseki, J., Nakagawa, N., Kuramitsu, S. & Masui, R. Crystal structure of family 5 uracil-DNA glycosylase bound to DNA. J. Mol. Biol. 373, 839–850 (2007).
Pedersen, H., Johnson, K., McVey, C., Leiros, I. & Moe, E. Structure determination of uracil-DNA N-glycosylase from Deinococcus radiodurans in complex with DNA. Acta Crystallogr. D 71, 2137–2149 (2015).
Ramstein, J. & Lavery, R. Energetic coupling between DNA bending and base pair opening. Proc. Natl. Acad. Sci. U. S. A. 85, 7231–7235 (1988).
Ma, N. & van der Vaart, A. Free energy coupling between DNA bending and base flipping. J. Chem. Inf. Model. 57, 2020–2026 (2017).
Mol, C. D. et al. Crystal structure and mutational analysis of human uracil-DNA glycosylase: structural basis for specificity and catalysis. Cell 80, 869–878 (1995).
Xiao, G. Y. et al. Crystal structure of Escherichia coli uracil DNA glycosylase and its complexes with uracil and glycerol: Structure and glycosylase mechanism revisited. Proteins 35, 13–24 (1999).
Gallinari, P. & Jiricny, J. A new class of uracil-DNA glycosylases related to human thymine-DNA glycosylase. Nature 383, 735–738 (1996).
Schormann, N., Ricciardi, R. & Chattopadhyay, D. Uracil-DNA glycosylases-structural and functional perspectives on an essential family of DNA repair enzymes. Protein Sci. 23, 1667–1685 (2014).
Krokan, H. E., Drabløs, F. & Slupphaug, G. Uracil in DNA – occurrence, consequences and repair. Oncogene 21, 8935–8948 (2002).
Lewis, C. A., Crayle, J., Zhou, S., Swanstrom, R. & Wolfenden, R. Cytosine deamination and the precipitous decline of spontaneous mutation during Earth’s history. Proc. Natl. Acad. Sci. U. S. A. 113, 8194–8199 (2016).
Zhao, D. et al. Glycosylase base editors enable C-to-A and C-to-G base changes. Nat. Biotechnol. 39, 35–40 (2021).
Kurt, I. C. et al. CRISPR C-to-G base editors for inducing targeted DNA transversions in human cells. Nat. Biotechnol. 39, 41–46 (2021).

Acknowledgements

M.L. acknowledges use of the Ultrafast Laser Spectroscopy Facility at Arizona State University. W.V.H. acknowledges use of the Magnetic Resonance Research Center at Arizona State University. A.v.d.V. acknowledges use of Research Computing at the University of South Florida. This work was supported by the National Science Foundation [# 1918716 to ML and WVH, # 1919096 to AvdV]. Research reported in this publication was also supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Number R35GM141933 [WVH]. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Author information

These authors contributed equally: Paul B. Orndorff, Souvik Poddar and Aerial M. Owens.

Authors and Affiliations

Department of Chemistry, University of South Florida, Tampa, FL, 33620, USA
Paul B. Orndorff & Arjan van der Vaart
School of Molecular Sciences, Arizona State University, Tempe, AZ, 85287, USA
Souvik Poddar, Aerial M. Owens, Nikita Kumari, Bryan T. Ugaz, Wade D. Van Horn & Marcia Levitus
The Biodesign Institute Center for Single Molecule Biophysics, Arizona State University, Tempe, AZ, 85287, USA
Souvik Poddar, Nikita Kumari, Bryan T. Ugaz & Marcia Levitus
The Biodesign Institute Virginia G. Piper Center for Personalized Diagnostics, Arizona State University, Tempe, AZ, 85287, USA
Aerial M. Owens & Wade D. Van Horn
Magnetic Resonance Research Center, Arizona State University, Tempe, AZ, 85287, USA
Samrat Amin

Contributions

M.L., A.v.d.V. and W.V.H. designed and supervised the research and analyzed data; S.P., A.M.O., N.K. and B.T.U. performed experiments and analyzed data; P.B.O. carried out simulations and analyzed MD trajectories; W.V.H., A.M.O. and S.A. analyzed NMR data; M.L., A.v.d.V., W.V.H., S.P., A.M.O. and P.B.O. wrote and edited the paper. All authors have read and agreed to the published version of the manuscript.

Corresponding authors

Correspondence to Wade D. Van Horn, Arjan van der Vaart or Marcia Levitus.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Orndorff, P.B., Poddar, S., Owens, A.M. et al. Uracil-DNA glycosylase efficiency is modulated by substrate rigidity. Sci Rep 13, 3915 (2023). https://doi.org/10.1038/s41598-023-30620-0

Received: 21 December 2022
Accepted: 27 February 2023
Published: 08 March 2023
DOI: https://doi.org/10.1038/s41598-023-30620-0