Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# FraC nanopores with adjustable diameter identify the mass of opposite-charge peptides with 44 dalton resolution

## Abstract

A high throughput single-molecule method for identifying peptides and sequencing proteins based on nanopores could reduce costs and increase speeds of sequencing, allow the fabrication of portable home-diagnostic devices, and permit the characterization of low abundance proteins and heterogeneity in post-translational modifications. Here we engineer the size of Fragaceatoxin C (FraC) biological nanopore to allow the analysis of a wide range of peptide lengths. Ionic blockades through engineered nanopores distinguish a variety of peptides, including two peptides differing only by the substitution of alanine with glutamate. We also find that at pH 3.8 the depth of the peptide current blockades scales with the mass of the peptides irrespectively of the chemical composition of the analyte. Hence, this work shows that FraC nanopores allow direct readout of the mass of single peptide in solution, which is a crucial step towards the developing of a real-time and single-molecule protein sequencing device.

## Introduction

Proteins regulate nearly all life processes. Currently, mass spectrometry is the method of choice for protein analysis, sequencing, and proteome characterization. In a typical experiment in bottom-up proteomics, proteins are extracted and proteolytically digested into peptides and separated by liquid chromatography. Peptide spectra are then collected using tandem mass spectrometry, within a cycle time of about 1 s1. Using this method, most of the proteins that have been expressed in an organism can be identified and quantified. However, proteins in biological samples are extremely heterogeneous, spanning several orders of magnitude in abundance. In addition, most eukaryote proteins contain a variegated and dynamic range of post-translational modifications (PTMs). Due to the fast amount of conceivable combinations, the identification and sequencing of proteins in such heterogeneous mixtures is challenging for conventional mass spectrometry2.

A high-throughput single-molecule technique could address these limitations. Although no single-molecule protein sequencer exists today, a few approaches have been proposed, mainly aimed at protein identification. For instance, it has been shown that if only cysteine and lysine residues are read in sequence, most of human proteins can be identified3. In a recent proof-of-concept experiment4, peptides with cysteine and a lysine residues were labeled with a fluorescence acceptor, whereas a ClpXP unfoldase/protease was labeled with a fluorescence donor. Then, single-molecule Förster resonance energy transfer was used to monitor the passage of the acceptor dyes near the donor dye as the linearized polypeptide was processively transported through the ClpXP chamber. In another recent method, millions of peptides with fluorescently labeled cysteine5,6,7, lysine, or phosphoserine residues were immobilized on a glass coverslip. Total internal reflection fluorescence microscopy was then used to monitor each molecule’s fluorescence following consecutive cycles of N-terminal amino acid removal using Edman degradation chemistry. The authors identified a variety of peptides and achieved single-molecule positional readout of the phosphorylated sites.

Nanopores might also be used for single-molecule protein analysis and sequencing. Stein and co-workers proposed to couple a nanopore to a mass spectrometer. The nanopore would linearize individual proteins, whereas the mass spectrometer would be used to identify peptides as they are sequentially cleaved8. In a more conventional nanopore approach, an external potential is applied across the nanopore and the resulting ionic current is used to recognize proteins or peptides traversing the nanopore. In an early experiment, inspired by DNA nanopore sequencing, a ClpXP enzyme complex was used to force the unfolding of a protein through a biological nanopore9. An independent study showed that nanopore currents are capable of recognizing modifications in individual amino acid within a linearized polypeptide strand10. However, despite these encouraging results enzymes that process proteins or polypeptides amino-acid-by-amino-acid are yet to be discovered.

In an alternative approach, a protease is placed atop of a nanopore to fragment a protein. Then the mass of individual peptides is identified by nanopore currents. This method would be similar to conventional protein sequencing using tandem mass spectrometry, with the additional advantage of being low-cost, portable, and single molecule. For this approach to be feasible, however, the signal rising from the peptide blockade must be directly correlated to the mass of the peptide. Previous work with PEG molecules11,12,13,14,15,16,17, oligosaccharides18, and homopolymeric peptides19,20,21 revealed that there might be a direct correlation between the depth of the current blockade and the molecular weight of polymers, providing the charge composition of the analyte is uniform22. In such circumstances, it has been shown that nanopores can resolve the signal of poly-arginine peptides from 10 to 5 amino acids, hence distinguishing peptides differing by one arginine in length (174 Da)19. Peptides in a biological sample, however, have a heterogenous chemical composition. Work with DNA23,24 and amino-acid enantiomers25 revealed that the chemical identity of molecules and the charge inside the nanopore26 have an unpredictable effect on the ionic current. On the other hand, additional work with peptides showed that the correlation between mass and ionic signal is retained with peptides27,28, providing they are either neutral or uniformly charged. Nonetheless, peptides with an overall charge that is opposite to the applied bias have not been systematically studied, most likely because they are not efficiently captured and analyzed at such potentials29,30,31,32. Finally, the diameter and geometry of biological nanopores cannot be easily adapted to study the array of sizes, shapes, and chemical composition of polypeptides in solution.

Recently, we have shown that octameric fragaceatoxin C (FraC, Fig. 1a) nanopores33 from the sea anemone Actinia fragacea can be used to study DNA34, proteins, and peptides35. The transmembrane region of FraC is unique compared with other nanopores used in biopolymer analysis as it is formed by α-helices that describe a sharp and narrow constriction at the trans exit of the nanopore. We showed that an electroosmotic flow across the nanopore can be engineered to capture polypeptides at a fixed potential despite their charge composition35. However, peptides smaller than 1.6 kDa in size translocated too fast across the nanopore to be sampled, indicating that nanopores with a smaller diameter should be used to detect peptides with lower molecular weight. In this work, we show that the diameter of FraC nanopores can be tuned, permitting the identification of a large range of peptides sizes. Using engineered nanopores, we also show that peptides differing by the substitution of one-amino acid (44 Da) can be identified. At selected pH conditions, the nanopore signal directly correlates to the mass of the peptide, including peptides with high content of acidic residues (i.e., negatively charged peptides at physiological pH). Therefore, this nanopore approach can be used to identify the mass of individual peptides in solution and, providing a protease is attached immediately above the nanopore, might allow the sequencing of proteins in real-time.

## Results

### Engineering the size of FraC nanopores

One of the main challenges in biological nanopores analysis is to obtain nanopores with different size and shape. Most biological nanopores are formed by multiple repeats of individual monomers. Hence, different nanopore sizes might be obtained by engineering the protein oligomeric composition36. We noticed that at pH 7.5 a small fraction of wild-type FraC (Wt-FraC) nanopores showed a lower conductance (1.26 ± 0.08 nS, −50 mV, type II Wt-FraC) compared with the dominant fraction (2.26 ± 0.08 nS, −50 mV, type I Wt-FraC), suggesting that FraC might be able to spontaneously assemble into nanopores with a smaller size. At pH 4.5, type I and type II FraC nanopores were also observed, however, a smaller nanopore conductance was identified alongside (0.42 ± 0.03 nS, type III Wt-FraC, −50 mV, Fig. 1b). Occasionally, nanopores with a yet smaller conductance were observed, however, their appearance was too rare for meaningful characterization. We noticed that the reconstitution of lower conductance nanopores depended on several purification conditions (Supplementary Figure 1 and 2). In particular, the occurrence of type II and type III nanopores increased when the oligomers were stored in solution for several weeks or when the concentration of monomeric Wt-FraC was reduced during oligomerization (Supplementary Figure 1 and 2). In an effort to enrich type II and type III FraC nanopores, we weakened the interaction between the nanopore and the lipid interface by substituting W112 and W116 at the lipid interface of FraC (Fig. 1a) with serine. We reasoned that a lower concentration of monomers, during oligomerization, would increase the population of lower molecular mass oligomers. Rewardingly, we found that at both pH 7.5 and pH 4.5 the proportion of type II and type III FraC nanopores increased dramatically. For example, W112S-W116S-FraC formed 60% of type II pore at pH 7.5, and 40% of type III pore at pH 4.5 (Fig. 1b, Supplementary Figure 3). The different nanopore types could also be separated by Ni-NTA affinity chromatography using an imidazole gradient (Supplementary Figure 2e-f). Finally, at pH 7.5, type II and type III FraC nanopores could also be obtained by replacing aspartic acid 109 (Supplementary Note 1, Supplementary Figure 2g-h, 3e-f) at the lipid interface with serine. Importantly, the reconstituted type II and type III nanopores did not show any particular gating (spontaneous opening and closing) or bilayer instability (e.g., the detachment of the nanopores from the lipid bilayer was never observed).

Among FraC nanopores of the same type, the lipid interface modifications brought by W112S and W116S substitutions did not alter the conductance and ion selectivity of the nanopores (Fig. 1c, Supplementary Figure 3 and 4, Supplementary Table 1), suggesting that the overall fold of the nanopores was unchanged by the surface modifications. When characterized in lipid bilayers, type I, type II, and type III nanopores showed a well-defined single conductance distribution, a steady open pore current (Figs. 1d, e) and comparable power spectra (Supplementary Figure 5). Interestingly, the nanopore types with a reduced conductance also showed an increased cation selectivity (2.0±0.1, 2.5±0.2, and 4.2±0.2 for type I, type II, type III W116S-FraC nanopores, respectively, at pH 4.5, Fig. 1f, Supplementary Table 1). The increased ion selectivity most likely reflects a larger overlap of the electrical double layer in the nanopores with a narrower constriction. These and several addition lines of evidence (Supplementary note 1, Supplementary Figure 6) strongly suggest that the three types of FraC nanopores represent nanopores with different protomeric compositions. Molecular modeling allowed predicting the diameter of type II (1.1 nm) and type III (0.84 nm) nanopores (Fig. 1g). These values corresponded well to the diameters estimated from their conductivity values (1.17 ± 0.04 and 0.71 ± 0.01 for type II and type III, Supplementary Figure 3). Notably, type III FraC, having a sub-nanometer constriction, is the biological nanopore with the smallest inner diameter known to date.

### Identification of single amino-acid substitutions with type II FraC nanopores

Type II FraC nanopores were used to sample a series of angiotensin peptides (Figs. 23, Table 1, Supplementary Figure 7), which regulate blood pressure and fluid balance. The peptides were added to the cis side of type II W116S-FraC nanopores and the magnitude of the ionic current associated with a peptide blockade (IB) was measured. The pH of the solution was set to 4.5, because at higher pH the capture of some peptides was either not observed or greatly reduced35. To characterize the peptide blockade, we used the percentage of excluded currents (Iex%), defined as [(IOIB)/IO] × 100, where IO represents the open pore current. Iex%, which relates to the ionic current that is lost during the transit of the peptide across the nanopore, and is expected to be proportional to the volume inside the nanopore excluded by the peptide. Angiotensin I (DRVYIHPFHL, 1296.5 Da), showed the deepest blockade (Iex% = 91.2 ± 0.2) and angiotensin IV (VYIHPF, 774.9 Da) the shallowest blockade (Iex% = 61.1 ± 4.0). The percent of excluded current of angiotensin II (DRVYIHPF, 1046.2 Da, Iex% = 82.1 ± 1.3) and angiotensin III (RVYIHPF, 931.1 Da, Iex% = 77.9±0.5) fell at intermediate values. When the four peptides were tested simultaneously, individual peptides could be discriminated (Fig. 2f).

The resolution limit of the nanopore sensor was challenged by sampling mixtures of angiotensin II and angiotensin A, which have an identical composition with the exception of the initial amino acid that is aspartate in angiotensin II and alanine in angiotensin A. These two peptides, differing by 44 Da, appeared as distinctive peaks in Iex% plots (Fig. 3). Smaller peptide differences, e.g., the 34 Da difference between phenylalanine and isoleucine in angiotensin III and Ile7 angiotensin III, were observed but not easily detected (Supplementary Figure 8), placing the resolution of our system at ~ 40 Da. It should be noticed that a more complex classification of peptides has been demonstrated elsewhere37,38,39, and would likely improve the sensitivity of discrimination. Smaller peptides such as angiotensin II 4–8 (YIHPF, 675.8 Da), endomorphin I (YPWF, 610.7 Da), or leucine enkephalin (YGGFL, 555.6 Da) translocated too quickly across type II W116S-FraC nanopores to be sampled, but they could be measured using type III W112S-W116S-FraC nanopores (Table 1, Supplementary Note 2, Supplementary Figure 9-11).

### A nanopore mass spectrometer for peptides

Although the ability of nanopores to distinguish between known analytes is useful, a more powerful application would be the identification of peptide masses directly from ionic current blockades without holding prior knowledge of the analyte identity. In nanopores, ionic current blockades are expected to be directly proportional to the volume excluded by the analyte inside the nanopore40. Hence, the current blockade of a peptides should reveal the volume of the peptide, which might approximate to its mass by the relation: volume (nm3) = 1.212 × 10−3 (nm3/Da) × MW (Da)41,42. In the effort to assess FraC nanopores as a peptide mass identifier, we tested additional peptides at pH 4.5 in 1 M KCl solutions using type I, type II, and type III FraC nanopores (Figs. 4a–c, Table 1, Supplementary Figure 7, 10, 12). We found that for most peptides there was a direct correlation between the excluded current and the volume/mass of the peptide. Although linear regression fitted the data well, if the expected values for an empty nanopore were to be included (i.e., Iex% is zero when no peptide is inside the nanopore), quadratic functions showed best fits for the data collected with type I and type II nanopores (Figs. a, b). By contrast, linear regressions could be used for the data measured with type III FraC nanopores (Fig. 4c). Interestingly, the extrapolated volumes for a fully occupied nanopore (3.5 nm3, 2.0 nm3, and 0.96 nm3 for type I, type II, and type III FraC, respectively), were similar to the volumes comprised between D10 and D17 residues of FraC (3.6 nm3, 1.8 nm3, and 1.0 nm3, respectively, Fig. 4), suggesting that the constriction (D10) and the amino acid one turn of the helix above it (D17) most likely define the sensing region within the nanopore.

Although the Iex% of most peptides fitted well to the empirical quadratic functions, two notable exceptions were c-Myc 410–419 (1203.3 Da) and neuropeptide-like protein 3 (NLP-3) (66–75, 1099.2 Da). These peptides were intentionally selected because they included several acidic residues (Table 1). c-Myc 410-419 and NPL-3 (added in cis) could be readily captured at negative applied potentials (trans), indicating that the cis to trans electroosmotic flow across the nanopore can overcome the electrostatic energy barrier opposing peptide capture. However, the dwell times were faster and the Iex% lower than peptides with similar mass (Table 1, Fig. 4b), suggesting that electrophoretic and electrostatic interactions between the pore and the peptides might prevent them from entering the sensing region of the nanopore.

Thus, we tested a range of pHs where the aspartate and glutamate side chains are expected to be protonated (Fig. 5a, Table 1). We found that only at pH 3.8, the signal corresponding to c-Myc 410–419 (1203.3 Da) fell between the signal of angiotensin I (1296.5 Da), and angiotensin II (1046.2 Da, Fig. 5a), suggesting that after losing its negative charges, c-Myc 410–419 peptide might access the recognition volume of FraC. Rewardingly, at pH 3.8 all the remaining peptides showed Iex% values that scaled with the masses of the peptides (Fig. 5b). Notably, at pH 3.8 the peptide signals showed relatively high variability and the conditions had to be carefully controlled (Supplementary Note 3). Most likely, this is because at pH 3.8 the charge density of the constriction (Fig. 1a) is strongly affected by small variations in pH.

### Peptide translocation across nanopores

It has been assumed43,44,45,46 and experimentally47 proven that the voltage dependence of the average dwell time (τoff) can report on the translocation of a molecule across a nanopore. Under a negative bias (trans) for positively charged peptides (added in cis), both electrophoretic and electroosmotic forces (from cis to trans) promote the entry and translocation35 across the nanopore (Supplementary Figure 13). For negatively charged peptides, such as c-Myc 410–419 at pH 4.5 (Fig. 5a), the electroosmotic driving force must be stronger than the opposing electrophoretic force. The voltage dependence of τoff was then examined for the most acidic peptide c-Myc 410–419 at different pH values (Fig. 5c). At pH 4.5, the peptide exhibited a maximum in τoff at –50 mV, suggesting that at low potentials c-Myc 410–419 returns to the cis chamber (<50 mV), and at higher potentials (>50 mV) c-Myc 410–419 exits to the trans chamber. At pH 3.8 and lower, we observed a decrease in τoff, albeit at higher potentials, indicating that at pH 3.8 c-Myc 410–419 crossed the membrane region of FraC to the trans chamber.

## Discussion

We have engineered the assembly of FraC to obtain three nanopores types with 1.6, 1.1, and 0.84 nm inner diameters. The nanopores can accommodate peptides ranging from 22 to 4 amino acids in length. Smaller peptides might be detected using further fine tuning of the transmembrane region of the nanopore, for example, by introducing amino acids with bulky side chains in the recognition volume of the nanopore. We also showed that the nanopores can discriminate differences between an alanine and a glutamate (44 Da) in a mixture of peptides. Furthermore, we found that at exactly pH 3.8 the ionic signal of the peptides depended on the mass of the analyte, whereas at higher pH values the current signal of negatively charged peptides was higher than expected from their mass alone. Most likely, a negatively charged recognition region is important for creating an electrostatic environment for peptide mass recognition. At the same time, the electrostatic interaction of the constriction with negatively charged analytes might prevent the correct positioning of the analyte within the reading frame of the nanopore. Hence, the next-generation nanopores might be fabricated using unnatural amino acids that hold a negative charge at a low pH range (e.g., sulfate or phosphate groups). Alternatively, peptides might be chemically modified (e.g., by esterification) to neutralize the negative charge.

Mass spectrometry is the workhorse of the proteomics field. At present, the nanopore system falls short from the resolution of commercial mass spectrometers. However, the technology is young and improvements are to be expected. It should also be noticed that a peptide mass analyzer device based on nanopores will have distinctive advantages compared with conventional mass spectrometers, the latter being expensive, extremely complex, and unwieldy. By contrast, nanopores can be integrated in portable and low-cost devices containing hundreds of thousands of individual sensors. In addition, the electrical nature of the signal allows sampling biological samples in real-time. Furthermore, since the nanopore reads individual molecules, the signal contains additional information not available for ensemble techniques. Finally, single-molecule detection, especially when coupled to high-throughput analysis, is amenable for detecting low abundance peptides and to unravel the chemical heterogeneity in PTMs, challenges that are hard to address with conventional mass spectrometry.

A nanopore peptide mass detector might also be integrated in real-time protein sequencing system, providing a number of requirements are met. First, a protease-unfoldase pair should be coupled directly above the nanopore sensor. The barrel-shaped ATP-dependent ClpXP protease appears to be an ideal candidate because it would encase the digested peptides preventing its release in solution. The coupling could be achieved by chemical attachment, by genetic fusion, or by introducing binding loops to the nanopore that interact with the peptidase. We have taken the latter approach to couple α-hemolysin nanopores with heptameric GroEL48. The cleaved peptides will be sequentially recognized and translocated across the nanopore. Here we have taken several steps showing this approach might be feasible. We demonstrated that the peptides entering the cis side of the nanopore have a high probability of exiting the nanopore to the trans chamber, which will prevent duplicate detection events. Furthermore, we showed that at pH 3.8 peptides are likely to be captured and their mass recognized by the nanopore at a fixed applied potential irrespectively of their chemical composition. If such low pH values will not be compatible with enzymatic activity, asymmetric solutions on both side of the nanopore can be used49,50,51. In such system, conditions in the cis side will be tuned to optimize the ATPase activity of the unfoldase-peptidase, whereas the pH and ionic strength of the trans side will be optimized to capture and recognize individual peptides.

## Methods

### Chemicals

Endothelin 1 (≥97%, CAS# 117399-94-7), endothelin 2 (≥97%, CAS# 123562-20-9), dynorphin A porcine (≥95%, CAS# 80448-90-4), angiotensin I (≥90%, CAS# 70937-97-2), angiotensin II (≥93%, CAS# 4474-91-3), c-Myc 410-419 (≥97%, # M2435), Asn1-Val5-Angiotensin II (≥97%, CAS# 20071-00-5), Ile7 Angiotensin III (≥95%, #A0911), leucine enkephalin (≥95%, #L9133), 5-methionine enkephalin (≥95%, CAS# 82362-17-2), endomorphin I (≥95%, CAS# 189388-22-5), pentane (≥99%, CAS# 109-66-0), hexadecane (99%, CAS# 544-76-3), Trizma®hydrochloride (≥99%, CAS# 1185-53-1), Trizma®base (≥99%, CAS# 77-86-1), potassium chloride (≥99%, CAS# 7447-40-7), N,N-dimethyldodecylamine N-oxide (LADO, ≥99%, CAS# 1643-20-5) were obtained from Sigma-Aldrich. Pre-angiotensinogen 1–14 (≥97%, # 002-45), angiotensin 1–9 (≥95%, # 002-02), angiotensin A (≥95%, # 002-36), angiotensin III (≥95%, # 002-31), angiotensin IV (≥95%, # 002-28) NLP-3 (66–75) (≥97%, # 076-36) were purchased from Phoenix Pharmaceuticals. Angiotensin 4–8 (≥95%) was synthesized by BIOMATIK. 1,2-Diphytanoyl-sn-glycero-3-phosphocholine (DPhPC, #850356P) and sphingomyelin (Porcine brain, # 860062) were purchased from Avanti Polar Lipids. Citric acid (99.6%, CAS# 77-92-9) was obtained from ACROS. n-Dodecyl β-d-maltoside (DDM, ≥99.5%, CAS# 69227-93-6) was bought from Glycon Biochemical EmbH. DNA primers were synthesized from Integrated DNA Technologies (IDT), enzymes from Thermo Scientific. All peptides were dissolved with Milli-Q water without further purification and stored in −20 °C freezer. pH 7.5 buffer containing 15 mM Tris in this study was prepared by dissolving 1.902 g Trizma® HCl and 0.354 g Trizma® base in 1 liter Milli-Q water (Millipore, Inc.).

### FraC mutation preparation

FraC mutants were prepared according to MEGAWHOP method54. In all, 25 µL REDTaq® ReadyMix™ was mixed with 4 µM primer (Supplementary Figure 2) containing the desired mutation with 50 ng plasmid (pT7-SC1 with Wt-FraC gene) as template and the final volume was brought to 50 µL with Milli-Q water. The PCR protocol was initiated by a 150-s denature step at 95 °C, followed by 30 cycles of denaturing (95 °C, 15 s), annealing (55 °C, 15 s), and extension (72 °C, 60 s). The PCR products (MEGA primer) were combined and purified using a QIAquick PCR purification kit with a final DNA concentration around 200 ng/µL. Then, a second PCR was performed using the MEGA primer for whole-plasmid amplification. In all, 2 µL of MEGA primer, 1 µL Phire II enzyme, 10 µL 5× Phire buffer, 1 µL 10 mM dNTPs, were mixed with PCR water to 50 µL final volume. PCR started with pre-incubated at 98 °C (30 s) and then 25 cycles of denaturing (98 °C, 5 s), extension (72 °C, 180 s). When the PCR was completed, 1 µL DpnI enzyme was added and the mixture kept at 37 °C for 1 h. Then, the temperature was raised to 65 °C for 1 min to inactivate the enzyme. Products were then transformed into E. cloni® 10G cells (Lucigen) competent cell by electroporation. Cells were grown on LB agar plates containing 100 µg/mL ampicillin and were grown overnight at 37 °C. Single clones were enriched and sent for sequencing.

### Sphingomyelin-DPhPC liposome preparation

In all, 20 mg sphingomyelin and 20 mg DPhPC (1,2-diphytanoyl-sn-glycero-3-phosphocholine) were dissolved in 4 mL pentane with 0.5% v/v ethanol and brought to a round flask. The solvent was then removed by rotation while being heated using a hair dryer. After evaporation, the flask was kept at ambient temperature for an additional 30 min. The lipid film was resuspended with 4 mL SDEX buffer (150 mM NaCl, 15 mM Tris, pH 7.5) and the solution immersed in a sonication bath for 5 min. Liposome suspensions were stored at −20 °C.

### FraC oligomerization

FraC oligomerization was triggered by incubation of FraC monomers with sphingomyelin-DPhPC liposomes. Frozen liposome were thawed and sonicated in a water bath for 1 min. FraC monomers were diluted to 1 mg/mL using SDEX buffer, and then 50 µL of FraC monomers were added to 50 µl of a 10 mg/mL liposome solution to obtain a mass ratio of 10:1 (liposome:protein). The lipoprotein solution was incubated at 37 °C for 30 min to allow oligomerization. Then, 10 µl of 5% (w/v, 0.5% final) N,N-Dimethyldodecylamine N-oxide (LDAO) was added to the lipoprotein solution to solubilize the liposomes. After clarification (typically 1 min), the solution was transferred to a 50 mL Falcon tube. Then, 10 mL of SDEX buffer containing 0.02% DDM and 100 µL of pre-washed Ni-NTA beads were added to the Falcon tube and mixed gently in a shaker for 1 h at room temperature. The beads were then spun down and loaded to a Micro Bio-spin column. In all, 10 mL wash buffer (150 mM NaCl, 15 mM Tris, 20 mM imidazole, 0.02% DDM, pH 7.5) was used to wash the beads and oligomers eluded with 100 µL elution buffer (typically 200 mM EDTA, 75 mM NaCl, 7.5 mM Tris pH 7.5, 0.02% DDM). The FraC oligomers were stored at 4 °C and the nanopores are stable for several months.

### W112S-W116S-FraC oligomer separation with His-Trap chromatography

In total, 200 µL of W112S-W116S-FraC monomers (3 mg/mL) were incubated with 300 µL of Sphingomyelin-DPhPC liposome (10 mg/mL) and kept at 4 °C for 48 h after which 0.5% LADO (final concentration) was added to solubilize the lipoprotein. Then the buffer was exchanged to 500 mM NaCl, 15 mM Tris, 0.01% DDM, 30 mM imidazole, pH 7.5 (binding buffer) using a PD SpinTrap G-25 column. W112S-W116S-FraC oligomers were then loaded to Histrap HP 1 mL column (General Electric) using an ÄKTA pure FPLC system (General Electric). The loaded oligomers were washed with 10 column volumes of 500 mM NaCl, 15 mM Tris, 0.01% DDM, 30 mM imidazole, pH 7.5, prior to applying an imidazole gradient (from 30 mM to 1 M imidazole, 500 mM NaCl, 15 mM Tris, 0.01% DDM, pH 7.5) over 30 column volumes. The protein concentration in flow was monitored with the absorbance at 280 nm and fractions were collected when the absorbance was higher than 5 mAu.

### Electrophysiology measurement and data analysis

Electrical recordings were performed using two silver/silver-chloride electrodes immerged into an electrophysiology chamber connected to an Axopatch 200B amplifier (Axon Instrument). The chamber was separated into two 500 µL compartments by a ~ 100 µm polytetrafluoroethylene Teflon aperture (Goodfellow Cambridge Limited). The aperture was pretreated with ~ 5 µL of hexadecane (10% v/v hexadecane in pentane) before loading the buffer. A bilayer was formed using 10 µL of 10 mg/mL DPhPC solution (in pentane), which was added into each compartment35,55. Ionic currents were digitized with a Digidata 1440 A/D converter (Axon Instrument). All peptides measurements were conducted with a 50 kHz sampling rate and a 10 kHz Bessel filter. Single-channel events were collected by applying the single-channel search function in Clampfit (Molecular Devices). Events shorter than 100 µs were ignored. IO values, referring to open pore current, were measured by using Gaussian fittings to event amplitude histograms. Percent of excluded current values (Iex%) were calculated by dividing the excluded current (IOIB) by open pore current (IO) and multiplied by 100. Dwell times and interevent times were measured by fitting single exponentials to histograms of cumulative distribution. Electrical recordings at pH 7.5 were performed using 1 M NaCl solutions and 15 mM Tris, recordings at pH 4.5 were performed using 1 M KCl solutions in 0.1 M citric acid and 180 mM Tris base.

### Ion permeability measurement

In order to measure reversal potentials, a single channel was obtained under symmetric conditions (840 mM KCl, 500 µL in each electrophysiology chamber) and the electrodes were balanced. The 400 µL of a buffered stock solution of 3.36 M KCl was then slowly added to cis chamber, whereas 400 µL of salt-free buffered solution was added to the trans chamber to obtain a total volume of 900 µL in both sides (trans:cis, 467 mM KCl:1960 mM KCl). After the equilibrium was reached, IV curves were collected from −30 to +30 mV. The resulting voltage at zero current is the reversal potential (Vr). The ion selectivity (PK+/PCl) was then calculated using the Goldman–Hodgkin–Katz equation52, Eq. (1), where $$[ {a_{{\mathrm{K}}^ + /{\mathrm{Cl}}^ - }} ]_{cis/trans}$$ is the activity of the K+ or Cl in the cis or trans compartment, R the gas constant, T the temperature and F the Faraday’s constant.

$$\frac{{P_{{\mathrm{{K }}}^{+}}}}{{P_{{\mathrm{{Cl}}}^{-}}}} = \frac{{\left[ {a_{{\mathrm{Cl}}^ {-}}} \right]_{trans} - \left[ {a_{{\mathrm{{Cl}}}^{-}}} \right]_{cis}e^{V_rF/RT}}}{{\left[ {a_{{\mathrm{{K}}}^{+}} }\right]_{trans}e^{V_rF/RT} - \left[ {a_{{\mathrm{{K}}}^{+}}} \right]_{cis}}}$$
(1)

The activity of ions was calculated by multiplying the molar concentration of the ion with the mean ion activity coefficients (0.649 for 500 mM KCl, and 0.573 for 2000 mM)56. Ag/AgCl electrodes were surrounded by 2.5% agarose bridges containing a 2.5 M NaCl solution.

### Molecular models of type I, II, and III FraC nanopores

The three-dimensional models with different multimeric order, ranging from five to nine monomers, were constructed with the symmetrical docking function of Rosetta57. A monomer without lipids was extracted from the crystal structure of FraC with lipids (PDB_ID 4tsy33). Symmetrical docking arranged this monomer around a central rotational axis ranging in order from 5 to 9. In total, Rosetta generated and scored 10,000 copies for each symmetry. In all cases, a multimeric organization with a symmetry similar to the crystal structure could be identified as a top scoring solution. However, in the pentameric assembly, the multimer interface was not fully satisfied as compared with the crystal structure, with large portions left exposed. The ninefold symmetric model, however, exhibited a significant drop in Rosetta score compared with the six-, seven-, and eightfold symmetric models indicating an unfavored assembly of the nonameric assembly with the six-, seven-, and eightfold assemblies as the most plausible. To create lipid-bound models, the crystal structure with lipids was superimposed on each monomer of the generated models, allowing the lipid coordinates to be transferred. The residues within 4.5 angstrom of the lipids were minimized with the Amber10 force field.

## Data availability

The authors declare that the data supporting the findings of this study are available within the article and its supplementary information files or from the corresponding authors upon reasonable request.

## References

1. 1.

Aebersold, R. & Mann, M. Mass-spectrometric exploration of proteome structure and function. Nature 537, 347–355 (2016).

2. 2.

Restrepo-Pérez, L., Joo, C. & Dekker, C. Paving the way to single-molecule protein sequencing. Nat. Nanotechnol. 13, 786–796 (2018).

3. 3.

Yao, Y., Docter, M., Ginkel, J. Van, Ridder, De, D. & Joo, C. Single-molecule protein sequencing through fingerprinting : computational assessment. Phys. Biol. https://doi.org/10.1088/1478-3975/12/5/055003 (2015).

4. 4.

Ginkel, J. Van et al. Single-molecule peptide fingerprinting. Proc. Natl. Acad. Sci. USA 115, 1–6 (2018).

5. 5.

Swaminathan, J., Boulgakov, A. A. & Marcotte, E. M. A theoretical justification for single molecule peptide sequencing. PLoS Comput. Biol. https://doi.org/10.1371/journal.pcbi.1004080 (2015).

6. 6.

Hernandez, E. T., Swaminathan, J., Marcotte, E. M. & Anslyn, E. V. Solution-phase and solid-phase sequential, selective modification of side chains in KDYWEC and KDYWE as models for usage in single-molecule protein sequencing. New J. Chem. 41, 462–469 (2017).

7. 7.

Swaminathan, J. et al. Highly parallel single-molecule identification of proteins in zeptomole-scale mixtures. Nat. Biotechnol. 36, 1076–1091 (2018).

8. 8.

Bush, J. et al. The nanopore mass spectrometer. Rev. Sci. Instrum. https://doi.org/10.1063/1.4986043 (2017).

9. 9.

Lieberman, K. R. et al. Processive replication of single DNA molecules in a nanopore catalyzed by phi29 DNA polymerase. J. Am. Chem. Soc. 132, 17961–17972 (2010).

10. 10.

Rosen, C. B., Rodriguez-larrea, D. & Bayley, H. Single-molecule site-specific detection of protein phosphorylation with a nanopore. Nat. Biotechnol. 32, 179–181 (2014).

11. 11.

Bezrukov, S. M., Vodyanoy, I., Brutyan, R. A. & Kasianowicz, J. J. Dynamics and free energy of polymers partitioning into a nanoscale pore. Macromolecules 29, 8517–8522 (1996).

12. 12.

Robertson, J. W. F. et al. Single-molecule mass spectrometry in solution using a solitary nanopore. Proc. Natl. Acad. Sci. USA 104, 8207–8211 (2007).

13. 13.

Baaken, G. et al. High-resolution size-discrimination of single nonionic synthetic polymers with a highly charged biological nanopore. ACS Nano 9, 6443–6449 (2015).

14. 14.

Aksoyoglu, M. A. et al. Size-dependent forced PEG partitioning into channels: VDAC, OmpC, and α-hemolysin. Proc. Natl. Acad. Sci. USA 113, 9003–9008 (2016).

15. 15.

Oukhaled, A. G., Biance, A. L., Pelta, J., Auvray, L. & Bacri, L. Transport of long neutral polymers in the semidilute regime through a protein nanopore. Phys. Rev. Lett. 108, 1–4 (2012).

16. 16.

Krasilnikov, O. V., Rodrigues, C. G. & Bezrukov, S. M. Single polymer molecules in a protein nanopore in the limit of a strong polymer-pore attraction. Phys. Rev. Lett. 97, 1–4 (2006).

17. 17.

Piguet, F. et al. High temperature extends the range of size discrimination of nonionic polymers by a biological nanopore. Sci. Rep. 6, 1–10 (2016).

18. 18.

Bacri, L. et al. Discrimination of neutral oligosaccharides through a nanopore. Biochem. Biophys. Res. Commun. 412, 561–564 (2011).

19. 19.

Piguet, F. et al. Identification of single amino acid differences in uniformly charged homopolymeric peptides with aerolysin nanopore. Nat. Commun. 9, 966 (2018).

20. 20.

Ji, Z., Kang, X., Wang, S. & Guo, P. Biomaterials Nano-channel of viral DNA packaging motor as single pore to di ff erentiate peptides with single amino acid di ff erence. Biomaterials 182, 227–233 (2018).

21. 21.

Zhao, Q., Jayawardhana, D. A., Wang, D. & Guan, X. Study of peptide transport through engineered protein channels. J. Phys. Chem. B 113, 3572–3578 (2009).

22. 22.

Reiner, J. E., Kasianowicz, J. J., Nablo, B. J. & Robertson, J. W. F. Theory for polymer analysis using nanopore-based single-molecule mass spectrometry. Proc. Natl. Acad. Sci. USA 107, 12080–12085 (2010).

23. 23.

Maglia, G., Restrepo, M. R., Mikhailova, E. & Bayley, H. Enhanced translocation of single DNA molecules through α-hemolysin nanopores by manipulation of internal charge. Proc. Natl. Acad. Sci. USA 105, 19720–19725 (2008).

24. 24.

Stoddart, D., Heron, A. J., Mikhailova, E., Maglia, G. & Bayley, H. Single-nucleotide discrimination in immobilized DNA oligonucleotides with a biological nanopore. Proc. Natl. Acad. Sci. USA 106, 7702–7707 (2009).

25. 25.

Boersma, A. J. & Bayley, H. Continuous stochastic detection of amino acid enantiomers with a protein nanopore. Angew. Chem. - Int. Ed. 51, 9606–9609 (2012).

26. 26.

Stoddart, D. et al. Nucleobase recognition in ssDNA at the central constriction of the α-hemolysin pore. Nano Lett. 10, 3633–3637 (2010).

27. 27.

Chavis, A. E. et al. Single molecule nanopore spectrometry for peptide detection. ACS Sens. 2, 1319–1328 (2017).

28. 28.

Robertson, J. W. F. & Reiner, J. E. The utility of nanopore technology for protein and peptide sensing. Proteomics 18, 1–36 (2018).

29. 29.

Li, S., Cao, C., Yang, J. & Long, Y.-T. Detection of peptides with different charges and lengths by uusing the aerolysin nanopore. ChemElectroChem 4, 1–5 (2018).

30. 30.

Asandei, A. et al. Electroosmotic trap against the electrophoretic force near a protein nanopore reveals peptide dynamics during capture and translocation. ACS Appl. Mater. Interfaces 8, 13166–13179 (2016).

31. 31.

Chinappi, M. & Cecconi, F. Protein sequencing via nanopore based devices: a nanofluidics perspective. J. Phys. Condens. Matter 30, 204002 (2018).

32. 32.

Luan, B. & Zhou, R. Single-file protein translocations through graphene–MoS 2 heterostructure nanopores. J. Phys. Chem. Lett. 9, 3409–3415 (2018).

33. 33.

Tanaka, K., Caaveiro, J. M. M., Morante, K., González-Manãs, J. M. & Tsumoto, K. Structural basis for self-assembly of a cytolytic pore lined by protein and lipid. Nat. Commun. 6, 4–6 (2015).

34. 34.

Wloka, C., Mutter, N. L., Soskine, M. & Maglia, G. Alpha-helical fragaceatoxin C nanopore engineered for double-stranded and single-stranded nucleic acid analysis. Angew. Chem. - Int. Ed. 55, 12494–12498 (2016).

35. 35.

Huang, G., Willems, K., Soskine, M., Wloka, C. & Maglia, G. Electro-osmotic capture and ionic discrimination of peptide and protein biomarkers with FraC nanopores. Nat. Commun. 8, 1–13 (2017).

36. 36.

Soskine, M., Biesemans, A., De Maeyer, M. & Maglia, G. Tuning the size and properties of ClyA nanopores assisted by directed evolution. J. Am. Chem. Soc. 135, 13456–13463 (2013).

37. 37.

Farimani, A. B. Identification of amino acids with sensitive nanoporous MoS 2 : towards machine learning-based prediction. npj 2D Mater. Appl. 2, 14 (2018).

38. 38.

Kennedy, E., Dong, Z., Tennant, C. & Timp, G. Reading the primary structure of a protein with 0.07 nm 3 resolution using a subnanometre-diameter pore. Nat. Nanotechnol. 11, 968–976 (2016).

39. 39.

Kolmogorov, M., Kennedy, E., Dong, Z., Timp, G. & Pevzner, A. Single-molecule protein identification by sub-nanopore sensors. PLoS Comput. Biol. https://doi.org/10.1371/journal.pcbi.1005356 (2017).

40. 40.

Bhattacharya, S., Yoo, J. & Aksimentiev, A. Water mediates recognition of DNA sequence via ionic current blockade in a biological nanopore. ACS Nano 10, 4644–4651 (2016).

41. 41.

Erickson, H. P. Size and shape of protein molecules at the nanometer level determined by sedimentation, gel filtration, and electron microscopy. Biol. Proced. Online 11, 32–51 (2009).

42. 42.

Harpaz, Y., Gerstein, M. & Chothia, C. Volume changes on protein folding. Structure 2, 641–649 (1994).

43. 43.

Clarke, J. et al. Continuous base identification for single-molecule nanopore DNA sequencing. Nat. Nanotechnol. 4, 265–270 (2009).

44. 44.

Wanunu, M., Sutin, J., McNally, B., Chow, A. & Meller, A. DNA translocation governed by interactions with solid-state nanopores. Biophys. J. 95, 4716–4725 (2008).

45. 45.

Rincon-Restrepo, M., Mikhailova, E., Bayley, H. & Maglia, G. Controlled translocation of individual DNA molecules through protein nanopores with engineered molecular brakes. Nano Lett. 11, 746–750 (2011).

46. 46.

Boukhet, M. et al. Probing driving forces in aerolysin and α-hemolysin biological nanopores: Electrophoresis: versus electroosmosis. Nanoscale 8, 18352–18359 (2016).

47. 47.

Biesemans, A., Soskine, M. & Maglia, G. A protein rotaxane controls the translocation of proteins across a ClyA nanopore. Nano Lett. 15, 6076–6081 (2015).

48. 48.

Ho, C. W. et al. Engineering a nanopore with co-chaperonin function. Sci. Adv. 1, 1–9 (2015).

49. 49.

Wanunu, M., Morrison, W., Rabin, Y., Grosberg, A. Y. & Meller, A. Electrostatic focusing of unlabelled DNA into nanoscale pores using a salt gradient. Nat. Nanotechnol. 5, 160–165 (2010).

50. 50.

Stoddart, D., Franceschini, L., Heron, A., Bayley, H. & Maglia, G. DNA stretching and optimization of nucleobase recognition in enzymatic nanopore sequencing. Nanotechnology 26, 10–16 (2015).

51. 51.

Nova, I. C. et al. Investigating asymmetric salt profiles for nanopore DNA sequencing with biological porin MspA. PLoS ONE 12, 1–14 (2017).

52. 52.

Gu, L.-Q. et al. Reversal of charge selectivity in transmembrane protein pores by using noncovalent molecular adapters. Proc. Natl. Acad. Sci. USA 97, 3959–3964 (2000).

53. 53.

Stryer, L. Biochemistry 4th edn (W. H. Freeman and Company, New York, 1995).

54. 54.

Miyazaki, K. MEGAWHOP cloning: a method of creating random mutagenesis libraries via megaprimer PCR of whole plasmids. Methodsn Enzymol. 498, 399–406 (2011).

55. 55.

Soskine, M., Biesemans, A. & Maglia, G. Single-molecule analyte recognition with ClyA nanopores equipped with internal protein adaptors. J. Am. Chem. Soc. 137, 5793–5797 (2015).

56. 56.

Lide, D. R. CRC handbook of chemistry and physics, 84th edition, 2003-2004. Handb. Chem. Phys. 53, 2616 (2003).

57. 57.

Andre, I., Bradley, P., Wang, C. & Baker, D. Prediction of the structure of symmetrical protein assemblies. Proc. Natl. Acad. Sci. USA 104, 17656–17661 (2007).

## Acknowledgements

This work is financially supported by ERC consolidator grant.

## Author information

Authors

### Contributions

G.H. and G.M. designed the experiments. G.M. supervised the project. G.H. performed the experiments and data analysis. A.V. made the molecular models. G.M. and G.H. wrote the manuscript.

### Corresponding author

Correspondence to Giovanni Maglia.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

Journal peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Huang, G., Voet, A. & Maglia, G. FraC nanopores with adjustable diameter identify the mass of opposite-charge peptides with 44 dalton resolution. Nat Commun 10, 835 (2019). https://doi.org/10.1038/s41467-019-08761-6

• Accepted:

• Published:

• ### Biological Nanopore Approach for Single‐Molecule Protein Sequencing

• Zheng‐Li Hu
• , Ming‐Zhu Huo
• , Yi‐Lun Ying
•  & Yi‐Tao Long

Angewandte Chemie International Edition (2021)

• ### Biological Nanopores: Engineering on Demand

• Ana Crnković
• , Marija Srnko
•  & Gregor Anderluh

Life (2021)

• ### Design, assembly, and characterization of membrane-spanning DNA nanopores

• Conor Lanphere
• , Daniel Offenbartl-Stiegert
• , Genevieve Pugh
• , Elena Georgiou
• , Yongzheng Xing
• , Jonathan R. Burns
•  & Stefan Howorka

Nature Protocols (2021)

• ### Designed alpha-helical barrels for charge-selective peptide translocation

• Smrithi Krishnan R.
• , Amina H. Shaji
• , K. Santhosh Kumar
• , Gayathri Mohan
•  & Kozhinjampara R. Mahendran

Chemical Science (2021)

• ### Biological Nanopore Approach for Single‐Molecule Protein Sequencing

• Zheng‐Li Hu
• , Ming‐Zhu Huo
• , Yi‐Lun Ying
•  & Yi‐Tao Long

Angewandte Chemie (2021)