Article | Open | Published:

# Mapping the sensing spots of aerolysin for single oligonucleotides analysis

Nature Communicationsvolume 9, Article number: 2823 (2018) | Download Citation

## Abstract

Nanopore sensing is a powerful single-molecule method for DNA and protein sequencing. Recent studies have demonstrated that aerolysin exhibits a high sensitivity for single-molecule detection. However, the lack of the atomic resolution structure of aerolysin pore has hindered the understanding of its sensing capabilities. Herein, we integrate nanopore experimental results and molecular simulations based on a recent pore structural model to precisely map the sensing spots of this toxin for ssDNA translocation. Rationally probing ssDNA length and composition upon pore translocation provides new important insights for molecular determinants of the aerolysin nanopore. Computational and experimental results reveal two critical sensing spots (R220, K238) generating two constriction points along the pore lumen. Taking advantage of the sensing spots, all four nucleobases, cytosine methylation and oxidation of guanine can be clearly identified in a mixture sample. The results provide evidence for the potential of aerolysin as a nanosensor for DNA sequencing.

## Introduction

Nanopore sensing has developed into a powerful tool for single-molecule analysis, especially for single-molecule DNA sequencing in a rapid, low-cost, and label-free way without the need of PCR amplification1. Measuring the fluctuations of ion current as four kinds of individual bases of DNA (A, T, C, and G) transport through a nanopore allows the real-time sequencing of single-strand DNA (ssDNA)2. So far, some membrane proteins and porins have been explored as a biological nanopore for DNA analysis, such as α-hemolysin3, MspA4, phi29 DNA-packaging nanomotor5, ClyA6, FhuA7, aerolysin8, and CsgG9,10. Compared with the commonly used α-hemolysin, whose channel constriction is nearly 1.4 nm, MspA has higher sensitivity in current resolution, since its funnel-like aperture is only ~1.2 nm in diameter at the narrowest point, with a translocation height of less than 0.6 nm11. Therefore, the architecture of the protein pore, and especially pore constriction, plays a crucial role in nanopore sensing ability.

Aerolysin, produced by Aeromonas sp., has been used as a biological nanopore to study the conformation and length of peptides12,13, dynamics of unfolded proteins14, enzyme activity15,16, and mass of PEGs17. Compared to α-hemolysin and MspA, aerolysin exhibited a higher sensitivity in current and duration for oligonucleotides analysis18. Recently, aerolysin has been rationally designed to control the selectivity and sensitivity for oligonucleotide sensing via site-directed mutagenesis19. Aerolysin shows an optimal single-nucleobase resolution and translocation rate of 2.0 ms nt−1 for oligonucleotides in wild-type conformation18 without the need of any additional complement, such as pore engineering20, DNA immobilization21, adapter incorporation22, or the use of enzymes2. Aerolysin could directly detect ssDNA as short as dinucleotides, and resolve individual short oligonucleotides that range from 2 to 10 bases in length with high sensitivity, achieving the real-time monitoring of the step-wise cleavage of oligonucleotides by Exo I23. In the future, taking advantage of the high spatial and temporal resolution of aerolysin, the operations of DNA analysis may be simplified and its accuracy could be significantly improved.

However, a high-resolution structure of the mature aerolysin pore is still missing to date, hindering a much deeper understanding of its capabilities for single-molecule analysis and the development of further applications. Nonetheless, aerolysin pore models have been recently proposed based on near-atomic resolution cryo-electron microscopy (EM) structures (3.9–4.5 Å) of its mutated prepore and quasi-pore states, along with a 7.9-Å resolution model of the wild-type pore state24. This structural analysis revealed a heptameric pore architecture featuring a very long (~10 nm) membrane spanning the β-barrel pore channel, which resembles the one found for anthrax toxin25 in dimensions, and is much longer than the α-hemolysin pore3. Moreover, aerolysin presents a new fold constituted by two concentric β-barrels at the top of the transmembrane pore, held together mainly by hydrophobic interactions, which are suggested to be responsible for the prion-like ultra-stability of this toxin24.

Herein, we combine the results of nanopore experiments along with molecular modeling and simulation to precisely map the sensing spots of this protein for oligonucleotides analysis. Our results revealed two clear sensing spots located at the two main constriction points across the pore lumen, located at amino- acid residues R220 and K238. We show that the first spot at position R220 is mainly responsible for specifically recognizing different nucleobases. Moreover, based on the construction features of the pore entry, we demonstrated the preference of DNA translocation entering from the 3′ end, consistently with other pore-forming toxins (PFTs) such as α-hemolysin26,27. Therefore, by rationally probing ssDNA length and composition upon pore translocation, we gained new important molecular insights that which could help the future design of aerolysin variants with improved capabilities for sequencing.

## Results

### Aerolysin pore is fully blocked by a 14-base oligonucleotide

To determine the blockade characteristics of the aerolysin nanopore, we performed single-channel experiments of polydeoxyadenines (dAn) with different lengths, from dA10 to dA20. The experimental schematic diagram is shown in Fig. 1a, and it is constituted by a lipid membrane that separates the chamber into two parts, cis and trans, with the aerolysin pore connecting trans and cis side on the membrane. An applied transmembrane potential pulls the negatively charged ssDNA from the cis to the trans side. According to the typical current traces of dAn (Fig. 1b), the level of blockade current gradually closed to 0 pA as the length of oligonucleotide strands increased, corresponding to a full decrease of residual current and an increase in the duration of DNA translocation (Fig. 1c–e). We found that these current values presented a strong dependence with the length of oligonucleotides for dAn shorter than 14 bases (n<14), for which the value of I/I0 decreased from 0.35 ± 0.01 to 0.05 ± 0.01 a.u. with n=10–14. (Here, I0 represents the value of open pore current, while I is the value of residual current; thus I/I0 indicates the current ratio of blockade. Mean values (± s.d.) in at least five separate experiments). In contrast, when the length of dAn is no shorter than 14, the I/I0 remained almost unchanged with a constant value of 0.07 ± 0.02 a.u. (mean values (± s.d.) in at least five separate experiments) (Fig. 1c and d). For all measured dAn, I/I0 followed a Gaussian distribution (Fig. 1d, inset), while the distribution of the translocation time was fitted by a falling exponential function (Fig. 1e, inset). Moreover, the statistical data at different applied voltages indicated that the translocation duration of each dAn significantly decreased with increased applied voltages (Supplementary Fig. 1), which demonstrated that dAn are mostly transported through the aerolysin nanopore. Notably, the duration of dA14 is surprisingly longer compared to other tested oligonucleotides (Fig. 1e), suggesting that the interaction between dA14 and the aerolysin nanopore is much stronger, likely due to the optimal spatial filling of dA14 within the ~10 nm long aerolysin channel.

To prove this hypothesis and better characterize the main interactions between oligonucleotides and aerolysin, which may explain the longer duration time observed for dA14, we performed 250 ns of molecular dynamics (MD) simulations of dA14 fully inserted in the pore lumen. The initial conformation was obtained by steered MD, as explained in the methods section. In Fig. 1f (and in Supplementary Table 1), the main interactions between the dA14 and aerolysin are summarized. Briefly, stable salt-bridge interactions are observed between the phosphate groups and the positive aerolysin amino acids, mainly with R282 and R220 at the pore entry, and K238 and K242 at the pore exit. Moreover, nucleobases are forming cation-π interactions with positively charged side chains, as well as hydrogen bonds and van der Waals contacts with several lumen residues (e.g., D216, N226, E254, E258, N262, and Q268). Altogether, these interactions are able to keep dA14 stable in its extended conformation within the two main positively charged rings defined by R220 and R282 at the top of the channel and K238, K242, and K244 at its bottom. This observation agrees with previous MD studies using the MspA pore, which revealed that the inclusion of positive side chains in the lumen resulted in an increase in translocation time28. The optimal steric and electrostatic match of dA14 within the lumen is likely at the origin of the observed increased duration time. However, by comparison, shorter (dA10) and longer (dA20) oligonucleotides did not show the same perfect match during MD simulations (Supplementary Fig. 2). For the longer dA20, the results showed that this oligonucleotide remains in fact more extended than the pore length, resulting in the accumulation of the remaining bases at the pore entry or exit depending on starting conditions. On the other hand, dA10 does not completely fill the pore in length, and the DNA tends to accumulate at the pore exit. Contrary to the optimal match of dA14, shorter or longer oligonucleotides may accelerate their exit from the pore by facilitating a faster restoration of ion current thanks to conformations that allow for a partially filled pore lumen (e.g., dA10) or solvated bases in the bulk (e.g., dA20).

### ssDNA shows a preferred translocation direction by 3′ end

The scatter plots of Fig. 1c show a unique direction of entry for dAn. In order to understand which this preferential direction was, we used a higher salt concentration (3.0 M), which produced two well-defined peaks in the I/I0 histograms of dA14, each corresponding to a different entry direction (Fig. 2a). One peak represents a lower blockade with a I/I0 value of 0.10, which contains 79.4% of the total number of events (defined as PI hereafter), while the other (PII hereafter), containing the remaining 20.6% of events, presents a higher average I/I0 value (0.13). These two peaks may represent two distinct translocation mechanisms, initiated either by the 5′ or by the 3′ end, similarly to that reported by previous studies of α-hemolysin30. To confirm this hypothesis and to further determine the assignment of these two peaks, a streptavidin-biotin system was designed to control the orientation of dA14 while entering the pore. Streptavidin (SA) from Streptomyces avidinii31 has a size of ~10 × 10 × 125 nm3, which prevents it from entering the aerolysin pore. The biotinylated dA14 could specifically bind to SA and thus allow dA14 to enter the pore only in one direction. To eliminate the influence of biotin for I/I0, the translocations of Biotin-5′-dA14-3′ and Biotin-3′-dA14-5′ were examined, respectively (Supplementary Fig. 3). Unlike the dA14, the I/I0 histograms of both SA-Biotin-5′-dA14-3′ and SA-Biotin 3′-dA14-5′ show only one peak, proving that SA immobilizes the free dA14 inside the pore and ensures one certain orientation (Fig. 2a). Moreover, considering the biotin influence, the corrected I/I0 value of SA-Biotin-5′-dA14-3′ (I/I03′ = 0.10) is equal to I/I0 of PI for dA14, which means that the peak with more than 79% of total events was induced by the dA14 entering the aerolysin pore from the 3′ end, while the corrected I/I0 value of SA-Biotin-3′-dA14-5′ (I/I05′ = 0.13) is consistent with the PII of dA14. Therefore, while single oligonucleotides are able to cross the aerolysin channel directed either by 3′ or 5′ end (similarly to other PFTs26,30), the dAn tested above show a significant preference for translocation via the 3′ end.

In order to understand why DNA would more favorably translocate by 3′ end, contrary to what was expected due to the extra negative charge present at the 5′ end, we performed Steered MD (SMD) of the DNA, pulling it inside the pore either by its 3′ or by its 5′ end. We could observe a relatively higher force needed for translocation by the 5′ end (Supplementary Fig. 4). As already proposed for DNA translocation through α-hemolysin26, this extra resistance is likely originated by the enantiomeric properties of the desoxi-D-ribose, which produces more steric hindrance for DNA bases as they move down the channel encountering large side chains (e.g., R220, R282) that render the lumen narrower (Fig. 2b). Therefore, we conclude that, at the low applied voltages (approx. 200 mV) and high ion concentration (1–3 M KCl), the extra negative charge at the 5′ end is not enough to counterbalance the additional energy barrier needed to translocate ssDNA by the 5′ end.

### Aerolysin nanopore presents two main sensing spots

Next, to precisely map the sensing spots of aerolysin for single-molecule detection, we designed a series of oligonucleotides containing an abasic site (marked by an “X”, which means that the nucleobase is substituted by a hydrogen atom32, Fig. 3a) at different positions of dA14. It should be noted that we used free ssDNA, which is different from previously reported methods that were using biotin–streptavidin complex or hairpin structure to immobilize ssDNA32,33. In comparison to them, our free ssDNA could precisely reflect the dynamic properties of the system. The sequencing of oligonucleotides is depicted in Fig. 3b and are named as dA14Xn (with n from 1 to 14), in which the position was numbered relative to the 3′ end of the oligonucleotide. The residual current data for each oligonucleotide can be found in Supplementary Figs. 418. All I/I0 histograms are fitted to a Gaussian function and all abasic oligonucleotides presented higher residual current values than dA14 (Fig. 3b, red circle with the error bars), which is consistent with previous studies of α-hemolysin32, as the absence of a nucleobase induced a smaller blockade of ion current. Notably, a maximum value of I/I0 occurred when X was located at positions 4 or 11, suggesting the existence at those positions of two sensing spots across the aerolysin pore.

Calculating the diameter of the aerolysin pore using PoreWalker34, we observed a direct correlation between the diameter of the lumen and the increase in ion flux for abasic ssDNA translocation. The two main constriction points, where the inner region of the β-barrel becomes <1 nm in diameter, and which are located around R220 and R238, coincide with the putative position of the abasic site in dA14X11 and dA14X4, respectively (Fig. 3b). Analyzing the MD simulation of dA14 fully inserted in the pore, we observed that the 11th base is in fact located at the heptameric ring formed by R220, and that the 4th base of dA14 is interacting with the ring formed by K238 (Fig. 3c). Therefore, the two main aerolysin sensing spots would correspond to amino acids R220 and K238. Interestingly, during MD, transient π-stacking interactions were observed around both constriction points, between the 11th and the 12th bases, or the 3th and 4th. Similar interactions have been previously described as related to ionic current blockade in the MspA pore35.

### Discrimination of single different nucleobases

Once these two sensing spots were identified, we aimed to study their ability for discriminating different nucleobases. To do so, we replaced the adenine nucleobase in positions 4 and 11, one at a time, to cytosine, thymine, and guanine (called hereafter dA14X4-C, dA14X4-T, dA14X4-G, dA14X11-C, dA14X11-T, and dA14X11-G, respectively). Unlike in α-hemolysin, the lack of a vestibule structure at the pore entry and the narrower pore diameter makes it in principle much harder for aerolysin to capture long ssDNA (>10 bases)36. Therefore, we used a higher salt concentration of 3.0 M KCl to increase the aerolysin capture rate, in order to obtain enough data for statistical analysis. We added the oligonucleotides one by one with the same concentration (3.0 μM) in the cis chamber and measured the residual currents. For the sensing spot at position 4, we observed an overlap in residual current values between both dA14X4-G and dA14X4-A, and also between dA14X4-T and dA14X4-C (Supplementary Fig. 19). On the contrary, for position 11, all the residual current histograms fitted to a Gaussian distribution and showed a perfect current separation (Fig. 4a). The four nucleobases presented a residual current ranked as G < A < T < C, which is consistent with the results obtained for single-nucleobase discriminations by the MoS2 nanopores37, and is well correlated to nucleobase size.

Moreover, taking advantages of the optimal current separation between dA14 and dA14X11-C/G, we further exploited the pore ability for detection of modified nucleobases in a mixture sample containing methylated cytosine (mC) and oxidized guanine (oG). Sensing position 11, defined by the constriction at R220, has indeed the ability to identify all four kinds of nucleobase (A, T, C, G, Fig. 4a) and two modified nucleobases (mC and oG, Fig. 4b). Finally, we studied the ability of aerolysin to identify different single nucleobases in a heteropolymeric ssDNA strand, using four kinds of oligonucleotides that only differ at position of 11 (Fig. 4c). In contrast to the results of dA14 homopolymeric analysis, the order of residual current of A and G changed for heteropolymeric DNA, which demonstrates that the flanking nucleobase significantly affect the streaming of blockade current levels.

The ionic current was calculated using a well-established method35,38 for an aerolysin pore system without ssDNA and for the dA14 system, giving results of 118.6 ± 1.4 and 0.7 ± 0.3 pA, respectively (mean values (± s.d.) in three separate simulations). Our approach, however, is known to overestimate the absolute current mainly due to the overestimation by the CHARMM force field of the bulk electrolyte conductivity by ~10–40%35,38,39,40. Given the approximate nature of our approach, the calculated current is thus reasonably well in line with the experimental values (125.0 and 5.0 pA) (Supplementary Fig. 20). Looking at the distribution of ions along the pore during MD simulations with dA14 (Fig. 4d), we observed large ion occupancy in between the two constriction points, while ions were more confined around R220/R282 and K238/K242, likely contributing to determine the specific ion current features of the aerolysin pore. Furthermore, while DNA translocates through the pore, the electrostatic potential in the lumen becomes more negative, particularly around the two constriction points (Fig. 4e), affecting the ion current; namely negative and larger chloride ions are more easily blocked when compared to potassium ions (Fig. 4d).

## Discussion

Residual current and duration time of translocation are the two most significant properties that can be used to discriminate different DNA sequences translocating across nanopores. Understanding these two characteristics at a molecular level is of critical importance for the design of new and more sensitive nanopores. Herein, we present an analysis of ssDNA translocation across aerolysin biological pores, which is based on both experimental and computational analysis, and allows us to unveil novel properties of aerolysin-oligonucleotide interplay.

Previous works about DNA nanopore sequencing using PFTs show the importance of the pore diameter for resolution. For example, the MspA nanopore (~1.2 nm diameter) is capable of higher current resolution than α-hemolysin (1.4 nm diameter) due to its smaller diameter funnel-like aperture. Herein, we show that the diameter of aerolysin channel is even smaller, with a value under 1 nm at its narrowest constrictions, which can be related to its observed high sensitivity for analytes, including oligonucleotides18 and poly(ethylene glycol) molecules17. Single-channel measurements in different lengths of dAn demonstrated that dA14 performed a full blockade of aerolysin nanopore, and computational studies suggested that the optimal steric and electrostatic match of dA14 length with the pore barrel extension is likely at the origin of the observed increased duration time of dA14 translocation. MD simulations showed relevant salt bridge interactions between both the 5′ and 3′ DNA phosphates and R282 and K244 residues at the pore entry and exit, respectively, which are likely responsible for capturing ssDNA for effective translocation. Comparing the pore lumen of aerolysin with the other commonly used PFT-based nanopore, α-hemolysin, and MspA, we observe a greater abundance of charged residues in aerolysin, mainly at the pore exit, which can favor translocation (Fig. 4e and Supplementary Fig. 21). As mutational experiments of charged amino acids in other PFTs have been successfully performed to precisely control the translocation capabilities21,41, aerolysin presents a significant potential to further improve its native sensing capabilities.

In particular, based on the long translocation time of dA14, longer than in any other pores, we could use this oligonucleotide construct as a quasi-immobilized probe, as similarly achieved by DNA-streptavidin based protocols used to study other faster-translocating nanopores33,42. Thus, experiments of the abasic site at different positions in dA14 were able to precisely map the sensing spots of aerolysin at single-nucleotide resolution. The complementary computational analysis allowed us to identify, as also expected from the initial structural analysis of the pore, R220 and K238 as the key sensing residues, providing valuable information to understand the high sensitivity of aerolysin for oligonucleotide detection. Our results show a good relationship between ion flux decrease and the pore diameter, suggesting that the main determinant for the residual current is produced by pore steric hindrance, as already proposed for other nanopores35,43,44. The narrowest constriction defined by R220 is most sensitive for the discrimination of all four types of nucleobases, as well as cytosine methylation and oxidation of guanine in a free ssDNA. In comparison to α-hemolysin and MspA, the sensing spot produced by a heptameric ring of large arginine residues is significantly narrower (less than 0.9 nm), allowing aerolysin to detect dinucleotides, which is not feasible for the two other pores mentioned above.

To achieve the high sensitivity, nanopore technology has been searching for nanopores with a translocation height as small as one nucleotide in order to decrease the sequencing noise. In addition, the first studied α-hemolysin pore, with a translocation height of 5.0 nm, has been lately replaced by MspA, with a height of less than 0.6 nm11. Notably, the aerolysin pore is constituted by a barrel longer than other PFTs, reaching nearly 10 nm. Our study reveals that the actual sensing height of aerolysin is indeed one amino acid at its narrowest constriction point (R220), which is associated with a longer pore lumen that could be the reason for the long translocating time, which is also necessary for an improved resolution. Moreover, the longer pore lumen, together with the existence of several charged residues mainly at the pore exit, opens up the opportunity to modulate aerolysin sensitivity by site-directed mutagenesis, creating nanopores with controlled specific functions. Interestingly, in MD simulations we observed the remarkable flexibility of the aerolysin barrel, which could pass from circular to elliptical conformations. This native flexibility may help to better translocate molecules along the pore and should be considered to understand and design the aerolysin pore variants.

Altogether, the data presented here allow setting the bases for performing sequence modifications in aerolysin, which will hopefully yield nanopores with improved sensitivity and resolution. While this manuscript was revised, a work showing the potential of aerolysin for peptide sequencing was published, which further strengthens the potential of this pore for future applications45. Moreover, nanopore techniques could be further developed as a single-molecule method to investigate the function or structure of membrane pore in real biological environments.

## Methods

### Materials

Details about the materials used in this study can be found in the Supplementary Methods.

### Single-channel recording experiments

1,2-Diphytanoyl-sn-glycero-3-phosphocholine powder (Avanti Polar Lipids, Alabaster, AL, USA) was dissolved in decane (Sigma-Aldrich, St. Louis, MO, USA) for a final concentration of 2.0 mg per 100 μl. Proaerolysin protein was incubated with Trypsin-EDTA (Sigma-Aldrich, St. Louis, MO, USA) for 10 h at room temperature and then stored at −20 °C. Lipid bilayer membranes were formed across an orifice with the dimeter of 50 μm in a Delrin bilayer cup (Warner Instruments, Hamden, CT, USA), which separated the recording apparatus into two chambers, cis and trans. After the addition of aerolysin monomer protein to the cis chamber, it could self-assemble to form a transmembrane heptameric protein pore. A single aerolysin nanopore generated a constant current of nearly 50.0 pA at baseline under the applied voltage of +100 mV in at a temperature of 24 ± 2 °C in 1.0 M KCl solution buffered with 10 mM Tris and 1.0 mM EDTA, titrated to pH 8.0. DNA oligomers were added to the cis side of the pore. Two matched Ag/AgCl electrodes were used to record the ionic currents. Then, the current traces were amplified and measured with a patch clamp amplifier (Axon 200B) equipped with a Digidata 1440A A/D converter (Molecular Devices, Sunnyvale, CA, USA). The signals were filtered at 5 kHz and acquired with Clampex 10.4 software (Molecular Devices, Sunnyvale, CA, USA) at a sampling rate of 100 kHz. The data were analyzed using MOSAIC46 and OriginLab 8.0 (OriginLab Corporation, Northampton, MA, USA) software. The DNA sequences used in this study are listed in Supplementary Table 2.

### Molecular modeling and simulations

The aerolysin pore was described using the equilibrated conformation defined by previous MD simulations of the aerolysin pore47. This original model was reduced by removing the membrane binding domains (i.e., residues 24–195, 301–408, and 425–447), and the last strand (residues 409–424) was linked to the previous one by creating a loop between amino acids A300 and Q409 using the software Modeler v9.1348. The simulations were thus performed on the membrane spanning β-barrel only, which remained stable as the full pore unit during all the MD simulations presented here. This same approximation has been already used for translocation studies in other PFTs28,49. A ~11 × 11 nm2 membrane bilayer was modeled by 1-palmitoyl-2-oleoylphosphatidylcholine (POPC) lipids using the charmm-gui server, following the membrane positioning as suggested by the PPM server50. A ssDNA was derived from a model of double-stranded DNA of 10, 14 or 20 adenosines, created with the 3D-DART web server51. A phosphate group at 5′ end position, with two negative charges, as predicted by the chemicalize server (https://chemicalize.com/), was parameterized with the CGenFF tool of the charmm-gui web server52. Afterwards, the ssDNA molecule was manually placed on the top of the barrel (extracellular side), pointing either its 5′ or 3′ terminus to the pore entry, depending on the studied direction of entry. This system was then solvated in a 1 M KCl water box with initial dimensions ~11 × 11 × 21.7 nm3.

All MD simulations were run using the GROMACS software version 4.853, with the charmm36 force field54, the SHAKE algorithm on all the bonds between hydrogen and heavy atoms, and Particle-Mesh Ewald, treating the electrostatic interactions in periodic boundary conditions. The system was first minimized using the steepest descent algorithm, and afterwards equilibrated using a similar protocol of that suggested by the charm-gui server55, but in 2 blocks of 6 steps each, totalizing 975 ps. In the first block of equilibration, the lipid restraints were gradually reduced to zero, while those on protein and DNA were maintained, and were only gradually and completely removed in the second block. Afterwards it was simulated for 10 ns without restraints. A more detailed description of this equilibration protocol can be found as Supplementary Table 3. The approximate dimensions of the box at the end of the equilibration were ~10.6 × 10.6 × 21.3 nm3.

For all further simulations (SMD and MD of selected snapshots), we chose an integration step of 2 fs. A temperature of 22 °C was controlled with the Nose-Hoover thermostat and the Parrinello-Rahman method was used for semi-isotropic pressure coupling. SMD was used to introduce the DNA in the aerolysin pore, using an umbrella biasing potential, based on direction-periodic geometry and using as reaction coordinate the z axis, with the aerolysin pore aligned to it. A harmonic biasing force (spring constant of 100 kJ mol−1 nm−2) was applied to the spring connecting the center of mass of either the 3′ or the 5′ nucleotide, depending on the direction of entry studied, and the center of mass of the α-carbons of the seven K244 residues, located at the pore exit, at a constant velocity of 0.004 nm ps−1. A biased voltage of 250 mV was applied. The stretching events during SMD, and its relaxation during MD, have been measured based on the change of the inter-strand distance, as already performed by others56 (Supplementary Fig. 22). A full relaxation of the DNA stretching produced by SMD is observed already in the first ns of unbiased MD simulations; thus it reaches stabilization of the initial configuration with a length of 73 Å, lower than the estimated contour length for dA14, given that inter-phosphate distances of ssDNA can range between 5.9 and 7 Å57,58.

For the study of dA14 fully inserted on the aerolysin pore, three snapshots were selected from three independent replicas of SMD, where the DNA is located with its 3′-end at the pore exit and its 5′-end at the pore entry. For the study of shorter (dA10) and longer (dA20) oligonucleotides, two SMD snapshots were selected for each DNA length, one with the 3′-end located at the pore exit, and another with the 5′-end located at the pore entry. Those selected snapshots were then equilibrated for 150 ps, using the former pulling protocol but with constant velocity equal to zero, to keep the pulling distance unchanged, and afterwards submitted to 100 ns of unrestrained MD simulation, applying a biased voltage of 250 mV. All replica of dA14 showed the same positioning and interactions, and therefore only one of them was further simulated until a total time of 250 ns. The protein remained very stable during simulations at this electric field without the need of applying position restraints. This is likely due to the prion-like stability conferred to the pore by the concentric double β-barrel conformation at the extracellular end (see Supplementary Fig. 23 for RMSD plots of the equilibration and MD simulation at applied voltage). In all simulations with applied electric field, the dimension of the box along the z axis was kept fixed. An image showing the location of the protein with respect to the lipid bilayer membrane and the extent of the solvent volume at the end of the equilibration has been added in Supplementary Fig. 23.

A snapshot from the MD of dA14 fully inserted in the pore was used for electrostatic calculations using the APBS-PDB2PQR web server59,60, with and without ssDNA at 1 M salt concentration. The ion current was calculated following a method already described by others38 using the following formula, where zi and qi are the z coordinate and the charge of atom i, respectively:

$$I(t) = \frac{1}{{\Delta tL_Z}}\mathop {\sum}\nolimits_{i = 1}^N {q_i} \left[ {z_i(t + \Delta t) - z_i(t)} \right]$$
(1)

This calculation was performed similarly for two systems, one with dA14 fully inserted in the pore, and the other with no DNA. The free-DNA system was simulated following the same procedure described above for the dA14 system, applying an electric field corresponding to a voltage of 250 mV. The systems were simulated for 250 ns, but only the last 200 ns were used for the calculations (as to allow DNA to adjust its position from that obtained by SMD). Our approach is known to overestimate the absolute current by 10–40%, mainly due to the overestimation of KCl bulk conductivity by the CHARMM force field35,38,39,40. Results are provided in Supplementary Fig. 20.

### Data availability

All relevant data included in the paper and its supplementary information are available on request.

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## References

1. 1.

Venkatesan, B. M. & Bashir, R. Nanopore sensors for nucleic acid analysis. Nat. Nanotechnol. 6, 615–624 (2011).

2. 2.

Laszlo, A. H. et al. Decoding long nanopore sequencing reads of natural DNA. Nat. Biotechnol. 32, 829–833 (2014).

3. 3.

Song, L. et al. Structure of staphylococcal α-Hemolysin, a heptameric transmembrane pore. Science 274, 1859–1865 (1996).

4. 4.

Butler, T. Z., Pavlenok, M., Derrington, I. M., Niederweis, M. & Gundlach, J. H. Single-molecule DNA detection with an engineered MspA protein nanopore. Proc. Natl Acad Sci. USA 105, 20647–20652 (2008).

5. 5.

Wendell, D. et al. Translocation of double-stranded DNA through membrane-adapted phi29 motor protein nanopores. Nat. Nanotechnol. 4, 765–772 (2009).

6. 6.

Soskine, M., Biesemans, A., De Maeyer, M. & Maglia, G. Tuning the size and properties of ClyA nanopores assisted by directed evolution. J. Am. Chem. Soc. 135, 13456–13463 (2013).

7. 7.

Mohammad, M. M. et al. Engineering a rigid protein tunnel for biomolecular detection. J. Am. Chem. Soc. 134, 9521–9531 (2012).

8. 8.

Cao, C., Yu, J., Wang, Y.-Q., Ying, Y.-L. & Long, Y.-T. Driven translocation of polynucleotides through an aerolysin nanopore. Anal. Chem. 88, 5046–5049 (2016).

9. 9.

Goyal, P. et al. Structural and mechanistic insights into the bacterial amyloid secretion channel CsgG. Nature 516, 250–253 (2014).

10. 10.

Brown, C. G. & Clarke, J. Nanopore development at Oxford Nanopore. Nat. Biotechnol. 34, 810–811 (2016).

11. 11.

Schneider, G. F. & Dekker, C. DNA sequencing with nanopores. Nat. Biotechnol. 30, 326–328 (2012).

12. 12.

Stefureac, R., Long, Y.-T., Kraatz, H. B., Howard, P. & Lee, J. S. Transport of α-helical peptides through α-hemolysin and aerolysin pores. Biochemistry 45, 9172–9179 (2006).

13. 13.

Li, S., Cao, C., Yang, J. & Long, Y.-T. Detection of peptides with different charges and lengths by using the aerolysin nanopore. ChemElectroChem 5, 1–5 (2018).

14. 14.

Pastoriza-Gallego, M. et al. Dynamics of unfolded protein transport through an aerolysin pore. J. Am. Chem. Soc. 133, 2923–2931 (2011).

15. 15.

Wang, Y. et al. Nanopore sensing of botulinum toxin type B by discriminating an enzymatically cleaved peptide from a synaptic protein synaptobrevin 2 derivative. ACS Appl. Mat. Interfaces 7, 184–192 (2015).

16. 16.

Fennouri, A. et al. Kinetics of enzymatic degradation of high molecular weight polysaccharides through a nanopore: experiments and data-modeling. Anal. Chem. 85, 8488–8492 (2013).

17. 17.

Baaken, G. et al. High-resolution size-discrimination of single nonionic synthetic polymers with a highly charged biological nanopore. ACS Nano 9, 6443–6449 (2015).

18. 18.

Cao, C. et al. Discrimination of oligonucleotides of different lengths with a wild-type aerolysin nanopore. Nat. Nanotechnol. 11, 713–718 (2016).

19. 19.

Wang, Y.-Q. et al. Rationally designed sensing selectivity and sensitivity of an aerolysin nanopore via site-directed mutagenesis. ACS Sens 3, 779–783 (2018).

20. 20.

Rincon-Restrepo, M., Mikhailova, E., Bayley, H. & Maglia, G. Controlled translocation of individual DNA molecules through protein nanopores with engineered molecular brakes. Nano. Lett. 11, 746–750 (2011).

21. 21.

Ayub, M., Stoddart, D. & Bayley, H. Nucleobase recognition by truncated α-hemolysin pores. ACS Nano 9, 7895–7903 (2015).

22. 22.

Clarke, J. et al. Continuous base identification for single-molecule nanopore DNA sequencing. Nat. Nanotechnol. 4, 265–270 (2009).

23. 23.

Cao, C., Liao, D.-F., Yu, J., Tian, H. & long, Y.-T. Construction of an aerolysin nanopore in a lipid bilayer for single-oligonucleotide analysis. Nat. Protoc. 12, 1901–1911 (2017).

24. 24.

Iacovache, I. et al. Cryo-EM structure of aerolysin variants reveals a novel protein fold and the pore-formation process. Nat. Commun. 7, 12062 (2016).

25. 25.

Jiang, J., Pentelute, B. L., Collier, R. J. & Zhou, Z. H. Atomic structure of anthrax protective antigen pore elucidates toxin translocation. Nature 521, 545–549 (2015).

26. 26.

Mathé, J., Aksimentiev, A., Nelson, D. R., Schulten, K. & Meller, A. Orientation discrimination of single-stranded DNA inside the α-hemolysin membrane channel. Proc. Natl Acad. Sci. USA 102, 12377–12382 (2005).

27. 27.

Butler, T. Z., Gundlach, J. H. & Troll, M. A. Determination of RNA orientation during translocation through a biological nanopore. Biophys. J. 90, 190–199 (2006).

28. 28.

Bhattacharya, S. et al. Molecular dynamics study of MspA arginine mutants predicts slow DNA translocations and ion current blockades indicative of DNA sequence. ACS Nano 6, 6960–6968 (2012).

29. 29.

Humphrey, W., Dalke, A. & Schulten, K. VMD: Visual molecular dynamics. J. Mol. Graph. 14, 33–38 (1996).

30. 30.

Wang, H., Dunning, J. E., Huang, A. P.-H., Nyamwanda, J. A. & Branton, D. DNA heterogeneity and phosphorylation unveiled by single-molecule electrophoresis. Proc. Natl Acad. Sci. USA 101, 13472–13477 (2004).

31. 31.

Weber, P., Ohlendorf, D., Wendoloski, J. & Salemme, F. Structural origins of high-affinity biotin binding to streptavidin. Science 243, 85–88 (1989).

32. 32.

Jin, Q. et al. Base-excision repair activity of uracil-DNA glycosylase monitored using the latch zone of α-hemolysin. J. Am. Chem. Soc. 135, 19347–19353 (2013).

33. 33.

Stoddart, D., Heron, A. J., Mikhailova, E., Maglia, G. & Bayley, H. Single-nucleotide discrimination in immobilized DNA oligonucleotides with a biological nanopore. Proc. Natl Acad. Sci. USA 106, 7702–7707 (2009).

34. 34.

Pellegrini-Calace, M., Maiwald, T. & Thornton, J. M. PoreWalker: a novel tool for the identification and characterization of channels in transmembrane proteins from their three-dimensional structure. PLoS. Comput. Biol. 5, e1000440 (2009).

35. 35.

Bhattacharya, S., Yoo, J. & Aksimentiev, A. Water mediates recognition of DNA sequence via ionic current blockade in a biological nanopore. ACS Nano 10, 4644–4651 (2016).

36. 36.

Wang, Y., Tian, K., Du, X., Shi, R.-C. & Gu, L.-Q. Remote activation of a nanopore for high-performance genetic detection using a pH taxis-mimicking mechanism. Anal. Chem. 89, 13039–13043 (2017).

37. 37.

Feng, J. et al. Identification of single nucleotides in MoS2 nanopores. Nat. Nanotechnol. 10, 1070–1076 (2015).

38. 38.

Aksimentiev, A. & Schulten, K. Imaging α-hemolysin with molecular dynamics: ionic conductance, osmotic permeability, and the electrostatic potential map. Biophys. J. 88, 3745–3761 (2005).

39. 39.

Wells, D. B. et al. Nanopore-Based Technology, 165–186 (Humana Press, Totowa, NJ, 2012).

40. 40.

Aksimentiev, A. Deciphering ionic current signatures of DNA transport through a nanopore. Nanoscale 2, 468–483 (2010).

41. 41.

Maglia, G., Restrepo, M. R., Mikhailova, E. & Bayley, H. Enhanced translocation of single DNA molecules through α-hemolysin nanopores by manipulation of internal charge. Proc. Natl Acad. Sci. USA 105, 19720–19725 (2008).

42. 42.

Stoddart, D. et al. Nucleobase recognition in ssDNA at the central constriction of the α-hemolysin pore. Nano. Lett. 10, 3633–3637 (2010).

43. 43.

Smeets, R. M. M. et al. Salt dependence of ion transport and DNA translocation through solid-state nanopores. Nano. Lett. 6, 89–95 (2006).

44. 44.

Reiner, J. E., Kasianowicz, J. J., Nablo, B. J. & Robertson, J. W. F. Theory for polymer analysis using nanopore-based single-molecule mass spectrometry. Proc. Natl Acad. Sci. USA 107, 12080–12085 (2010).

45. 45.

Piguet, F. et al. Identification of single amino acid differences in uniformly charged homopolymeric peptides with aerolysin nanopore. Nat. Commun. 9, 966 (2018).

46. 46.

Balijepalli, A. et al. Quantifying short-lived events in multistate ionic current measurements. ACS Nano 8, 1547–1553 (2014).

47. 47.

Cirauqui, N., Abriata, L. A., van der Goot, F. G. & Dal Peraro, M. Structural, physicochemical and dynamic features conserved within the aerolysin pore-forming toxin family. Sci. Rep. 7, 13932 (2017).

48. 48.

Eswar, N. Comparative protein structure modeling using Modeller. Curr. Protoc. Bioinformatics Chapter 5, Unit 5.6 (2006).

49. 49.

Bond, P. J., Guy, A. T., Heron, A. J., Bayley, H. & Khalid, S. Molecular dynamics simulations of DNA within a nanopore: Arginine−Phosphate tethering and a binding/sliding mechanism for translocation. Biochemistry 50, 3777–3783 (2011).

50. 50.

Lomize, M. A., Pogozheva, I. D., Joo, H., Mosberg, H. I. & Lomize, A. L. OPM database and PPM web server: resources for positioning of proteins in membranes. Nucleic Acids Res. 40, D370–D376 (2012).

51. 51.

van Dijk, M. & Bonvin, A. M. J. J. 3D-DART: a DNA structure modelling server. Nucleic Acids Res. 37, W235–W239 (2009).

52. 52.

Jo, S., Kim, T., Iyer, V. G. & Im, W. CHARMM-GUI: A web-based graphical user interface for CHARMM. J. Comput. Chem. 29, 1859–1865 (2008).

53. 53.

Pronk, S. et al. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics 29, 845–854 (2013).

54. 54.

Best, R. B. et al. Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone ϕ, ψ and side-chain χ1 and χ2 dihedral angles. J. Chem. Theory Comput. 8, 3257–3273 (2012).

55. 55.

Jo, S., Kim, T. & Im, W. Automated builder and database of protein/membrane complexes for molecular dynamics simulations. PLoS ONE 2, e880 (2007).

56. 56.

Marin-Gonzalez, A., Vilhena, J. G., Perez, R. & Moreno-Herrero, F. Understanding the mechanical response of double-stranded DNA and RNA under constant stretching forces using all-atom molecular dynamics. Proc. Natl Acad. Sci. USA 114, 7049–7054 (2017).

57. 57.

Olson, W. K. Configurational statistics of polynucleotide chains. A single virtual bond treatment. Macromolecules 8, 272–275 (1975).

58. 58.

Saenger, W. Principles of Nucleic Acid Structure (Springer-Verlag New York, New York, 1984).

59. 59.

Dolinsky, T. J. et al. PDB2PQR: expanding and upgrading automated preparation of biomolecular structures for molecular simulations. Nucleic Acids Res. 35, W522–W525 (2007).

60. 60.

Baker, N. A., Sept, D., Joseph, S., Holst, M. J. & McCammon, J. A. Electrostatics of nanosystems: Application to microtubules and the ribosome. Proc. Natl Acad. Sci. USA 98, 10037–10041 (2001).

## Acknowledgements

This research was supported by the National Natural Science Foundation of China (21421004, 21327807), the Program of Introducing Talents of Discipline to Universities (B16017), Innovation Program of Shanghai Municipal Education Commission (2017-01-07-00-02-E00023), the Fundamental Research Funds for the Central Universities (222201718001, 222201717003), the Swiss National Science Foundation (to M.D.P.), the European Union’s Horizon 2020 Research and innovation program under the Marie Skłodowska-Curie Grant Agreement No. 665667 (to C.C). N.C. is supported by a fellowship of the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) of Brazil. We thank WenPing Lyu for discussion of current calculations, and Matteo Degiacomi for discussion on aerolysin molecular simulations.

## Author information

### Affiliations

1. #### Key Laboratory for Advanced Materials & School of Chemistry and Molecular Engineering, East China University of Science and Technology, Shanghai, 200237, P.R. China

• Chan Cao
• , Meng-Yin Li
• , Ya-Qian Wang
• , He Tian
•  & Yi-Tao Long
2. #### Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), 1015, Lausanne, Switzerland

• Chan Cao
• , Nuria Cirauqui
•  & Matteo Dal Peraro
3. #### Department of Pharmaceutical Biotechnology, Universidade Federal do Rio de Janeiro, 21941-902, Rio de Janeiro, Brazil

• Nuria Cirauqui

### Contributions

C.C., H.T., and Y.T.L. conceived the idea; C.C., M.-Y.L., and Y.-Q.W. designed and performed the nanopore experiments, collected and analysis data; C.C. and N.C. performed the molecular dynamics simulations; C.C. and M.-Y.L. performed image processing; C.C., N.C., M.D.P, and Y.-T.L. interpreted the data; C.C., N.C., M.D.P., and M.-Y.L. wrote the paper; Y.-T.L. supervised the project; M.D.P. supervised molecular modeling and simulations for the project and all authors revised the paper.

### Competing interests

The authors declare no competing interests.

### Corresponding authors

Correspondence to Matteo Dal Peraro or Yi-Tao Long.