Introduction

Cell-penetrating peptides(CPPs) have been extensively studied for several decades because of their capability to transport various cargoes into cells1,2. Multiple factors affect the transport mechanisms of CPPs, e.g., the concentration of CPPs and the properties of the membrane3,4. One of the most common difficulties in studying CPPs is that the translocation mechanisms of different CPPs are not the same, and most CPPs can have more than a single pathway depending on the experimental conditions5.

Molecular dynamics (MD) simulations have been a valuable tool for revealing the mechanical and functional properties of CPPs and their interactions with lipid bilayers6,7,8; however, the transport mechanism of CPPs and interactions with lipids are still unclear within MD simulation approaches. One of the issues in MD simulations is that it is difficult to observe the spontaneous translocation of CPPs within a few hundred ns time scale because it usually takes minutes or so for the translocation of CPP in experiments9. Therefore, people have been using biased simulations, such as the umbrella sampling10,11,12,13,14 and steered MD simulations15, to study CPPs and their interactions with lipid bilayers during translocation. The umbrella sampling is very popular for obtaining free energy barriers between CPPs and lipid bilayers. However, there could be artifacts in the umbrella sampling and thus its free energy analysis because the initial conformations are usually generated by biased simulations (e.g., SMD simulations).

Among various CPPs, arginine (R)-rich peptides have been extensively studied in experiments and simulations because of their effectiveness in translocation4,16. Strong interaction with negatively charged phospholipid heads is the primary mechanism of inserting R-rich peptides into the lipid bilayer. In the previous study, we implemented a weighted ensemble (WE) method17,18 in all-atom MD simulations of Arg\(_9\)(R9) with a DOPC/DOPG(4:1) model membrane19. The WE method is a very flexible path sampling technique and is easy to implement in any MD package. The WE method uses an ensemble of simulation trajectories. Each trajectory is independent and has a statistical weight. The progress (or reaction) coordinate is divided into multiple bins. Trajectories are periodically replicated in bins if there are too few trajectories, while they are pruned in bins if there are too many trajectories18. Zuckerman and Chong18 gave a detailed review. We used the WESTPA software20,21,22 that has been widely applied to various systems. One of the most significant advantages of using the WE method is that one can simulate a system without any biased potential. In our previous simulations19, we found the WE method was very effective for studying interactions between Arg\(_9\) and the model membrane. However, we couldn’t observe a spontaneous translocation of Arg\(_9\) across the membrane because Arg\(_9\) was stuck in the hydrophobic core of the model membrane and couldn’t move for a long time19.

In this study, the previous WE simulation19 was continued with a few different boundaries and bin sizes, and we finally observed a spontaneous translocation of Arg\(_9\) across the membrane. In the following sections, we show how a single Arg\(_9\) peptide can translocate across the model membrane using the WE method and how the orientation of Arg\(_9\) affects the translocation efficiency. We believe that the WE method will help study the translocation of various CPPs and their transport mechanisms.

Results

Translocation of a single Arg\(_9\) peptide was observed within a very short time scale

We observe that a single Arg\(_9\) translocates the model membrane during the WE simulations. In the previous WE simulation, the translocation of Arg\(_9\) was not observed because Arg\(_9\) was trapped in the hydrophobic core of the membrane for a long time19. In the current simulations, the size of each bin was decreased from 0.25 Å in the previous WE simulation19 to 0.10 Å, which could help Arg\(_9\) overcome free energy barriers along the translocation path. The progress coordinate was defined as the distance between the center of mass of one of Arg\(_9\)s and that of the phosphorus(P) atoms in the upper leaflet (see the “Methods” section). The other three Arg\(_9\) peptides were closely contacted with the upper leaflet during the simulations, and they didn’t show any translocation or any meaningful penetration.

Figure 1 shows the penetration depth of Arg\(_9\) vs. the simulation time. The total number of splitting-combining trajectories was 5080 in our WE simulations, and this number was converted to the simulation time using 5 ps as the time interval for splitting and combining trajectories (see the “Methods” section). We define the penetration depth as the distance between the center of mass of Arg\(_9\) and that of the upper leaflet’s phosphorus(P) atoms. Therefore, the negative sign denotes that Arg\(_9\) is located below the upper leaflet. The maximum penetration depth of Arg\(_9\) in the previous WE simulation was about \(-\,17.6\) Å 19, and now it can reach up to \(-\,45\) Å in the current simulations.

Figure 1
figure 1

The penetration depth vs. the simulation time.

As shown in the figure, the penetration depth changes rapidly within a very short time scale. Furthermore, the figure shows that Arg\(_9\) can translocate the model membrane quickly (\(\sim\) 25 ns) since the WE simulation started. Therefore, the WE method provides a very effective tool to overcome free energy barriers between Arg\(_9\) and the model membrane. Thus the WE method could be used to study interactions between membrane-active peptides(MAPs), including cell-penetrating peptides(CPPs) and antimicrobial peptides(AMPs), and various model membranes within MD simulation approaches.

Figure 2 presents snapshots of Arg\(_9\) during the WE simulations. They are snapshots at 7.5 ns, 15.0 ns, 22.5 ns, and the last (25.4 ns). In the figure, yellow shows Arg\(_9\), and gray is the lipid molecules in the model membrane (DOPC/DOPG(4:1)). The blue and red dots are phosphorus atoms of the upper and the lower leaflets, respectively. Water molecules, ions, and the other three Arg\(_9\)s are omitted for clarity. As shown in Fig. 2, the membrane is deformed when Arg\(_9\) penetrates the middle of the membrane. There are two interesting findings during the deformation of the model membrane: The first finding is that a part of the lipids in the upper leaflet moved along with Arg\(_9\). The last snapshot showed that these lipids went through the lower leaflet. Later, we will discuss how these lipids are transported and reorganized in the model membrane. The second finding is the disruption of the lower leaflet both at 22.5 ns and at the last (25.4 ns) shown in the figure. When Arg\(_9\) comes to the bottom of the membrane, the lower leaflet deforms drastically, widening of a water pore in the lower leaflet.

Figure 2
figure 2

Snapshots at (a) 7.5 ns, (b) 15.0 ns, (c) 22.5 ns, and (d) the last (25.4 ns). (yellow: Arg\(_9\), grey: lipids, blue & red: phosphorus atoms of the upper leaflet & lower leaflet, respectively).

During the translocation, one of the interesting quantities to analyze would be hydrogen bond formation between Arg\(_9\) and the lipids molecules. It has been known that the hydrogen bonding between CPPs and lipids (or water) is critical when CPPs contact the membrane at the beginning of the penetration and when they translocate across the membrane23,24,25,26. Figure S1 presents the number of hydrogen bonds during the translocation (from WE1 to WE5 simulations). The blue line denotes the number of hydrogen bonds between Arg\(_9\) and the lipid molecules (including the phosphate group), while the green line is between Arg\(_9\) and only the phosphate group in the lipids. The red line depicts the ones between Arg\(_9\) and water. Each point in the figure is an averaged value over 50 ps. The average number of hydrogen bonds between Arg\(_9\) and lipids and water is about 20, showing slight fluctuation during the translocation. This indicates that the strength of hydrogen bonding doesn’t affect the efficiency of the translocation. Although the hydrogen bonding between Arg\(_9\) and the membrane is vital at the initial penetration stage, our simulation suggests that another factor could be responsible for translocating Arg\(_9\) across the membrane. We think water can be another factor that makes the translocation. We will discuss this issue in the following section.

Another interesting quantity to analyze is the conformational change of Arg\(_9\) during the translocation. It has been known that Arg\(_9\) stayed at a random coil both in the solution and in the membrane environment27. Figure S2 shows the conformational change of four Arg\(_9\)s during the WE simulation, identified using the Timeline in VMD28. The red box corresponds to the only Arg\(_9\) that translocated the membrane. During the translocation (Arg\(_9\) in the red box), most arginine residues showed a turn(aqua) in the Timeline graph, and this structure remained until the end of the simulation. The other three Arg\(_9\)s showed a random coil(white) or a combination of both a coil and a turn, and this is because most parts of Arg\(_9\)s were exposed to water molecules in the upper solution. Our simulation suggests a distinct conformational change (from a random coil to a turn) when Arg\(_9\) translocates across the membrane. More translocated trajectories are needed to reveal the conformational changes of Arg\(_9\) and their roles for the translocation across the membrane.

Water molecules play a role in translocating Arg\(_9\) across the model membrane

The pore formation process has been well known based on MD simulations of one of the antimicrobial peptides, melittin29. According to their results, a P/L(peptide/lipid) ratio above a threshold (P/L = 1/64) and aggregation of peptides were necessary for making pores. Although we didn’t see the aggregation of Arg\(_9\)s, our simulation suggests that even a single Arg\(_9\) can induce a water pore.

When Arg\(_9\) approaches the middle of the model membrane, the number of water molecules coordinated with Arg\(_9\) increases, as shown in Fig. 3. A water pore is shown at 22.5 ns; however, most of the pore is blocked by Arg\(_9\). Only a few water molecules can translocate across the membrane. Figure 4 presents the total accumulated number of water molecules translocated across the membrane (up(exiting) & down(entering)) as a function of time.

Figure 3
figure 3

Water molecules near Arg\(_9\) at (a) 7.5 ns, (b) 15.0 ns, (c) 22.5 ns, and (d) the last (25.4 ns). (yellow: Arg\(_9\), white: water, blue & red : phosphorus atoms of the upper leaflet & lower leaflet, respectively). The water flow and the movement of Arg\(_9\) are correlated with each other.

Figure 4
figure 4

The total number of water molecules translocated across the membrane (the moving-up(exiting) & the moving-down(entering)) vs. the simulation time.

We counted the number of water molecules that translocate across the membrane from z = +10 Å  to z = − 10 Å(distances with respect to the membrane center)  as the moving-down (entering) water, and water molecules from z = − 10 Å  to z = + 10 Å as the moving-up(exiting) water. It is interesting to see an increase in the moving-down(entering) water molecules when Arg\(_9\) approaches the bottom of the membrane. On the other hand, the number of the moving-up(exiting) water molecules slightly changed during the last part of the simulation. Therefore, there is a strong correlation between the direction of water flow and the movement of Arg\(_9\). The rapid increase in the number of downward(entering) water molecules is closely related to the orientation of Arg\(_9\) to the membrane, as shown in the following section.

During the WE simulation, the downward(entering) water flux was about 5.9 (water molecules/ns), while the upward(exiting) water flux was 1.6. If we count only the number of water molecules during the last part of translocation after the water pore was made, the downward(entering) water flux was 44.8 (water molecules/ns), and the upward(exiting) water flux was 12.4. These numbers can be compared with the previous result (6.2 \(\sim\) 27.3), where the water pore was made by moving a single lipid to the center of the bilayer using the umbrella sampling30. It turns out that the water pore is very transient in our WE simulations, and it is closed soon after Arg\(_9\) is translocated to the bottom of the membrane. We will discuss more on the closure of the water pore in one of the following sections.

The orientation angle of Arg\(_9\) affects the translocation

As shown in Fig. 1, the penetration depth was increased rapidly during the last part of the translocation (22.5 ns \(\sim\)). In the previous section, we mentioned that rapid penetration is related to the water flow. In this section, we show that the orientation angle of an Arg\(_9\) peptide also affects the efficiency of the translocation.

We define the orientation angle of the peptide as follows: First, obtain the center of mass positions of the first three and the last three C\(\alpha\)’s of Arg\(_9\). Second, find a vector(\(\vec {v}_1\)) connecting those two center-of-mass positions. Last, calculate an angle between the vector \(\vec {v}_1\) and a unit vector parallel to the z-axis (e.g., \(\vec {v}_2\) = (0,0,1)). Thus, a smaller angle means that the peptide is almost normal to the model membrane. Note that the conformation of Arg\(_9\) is a random coil both in solution and in the membrane environment27, and our simulation shows a turn during the translocation (Fig. S2). Therefore, the calculated angle is not as precise as in the case of a straight helix; however, our calculation will give us an idea of how Arg\(_9\) is oriented during the translocation. Figure 5 shows the orientation angle of Arg\(_9\) during the translocation. During most simulation time, Arg\(_9\) is almost parallel to the surface of the model membrane (70\(^\circ\) \(\sim\) 90\(^\circ\)); however, the orientation is rapidly changed to a smaller angle (\(\sim\) 40\(^\circ\)) when the peptide reaches the bottom of the model membrane. Based on this figure, one can conjecture that Arg\(_9\) with a smaller orientation angle shows more efficient translocation across the model membrane. This efficiency should be related to free energy barriers between Arg\(_9\) and the model membrane.

Figure 5
figure 5

The penetration depth (blue line) and the orientation angle (red line) of Arg\(_9\) vs. the simulation time.

Figure 6 shows the potential of mean force(pmf) along the translocation path of Arg\(_9\) using the umbrella sampling. We have 49 windows and sampled 140 ns data at each window. We discarded the first 20 ns data and analyzed the rest of the data (120 ns) for the free energy calculation. In the figure legend, 20 ns means that we collected the data between 0 to 20 ns (after discarding the first 20 ns data), 40 means 0 to 40 ns data, and so on. The pmf plots show convergence, and the error bars in each plot are getting smaller when increasing the number of data points. The details of the umbrella sampling are given in the “Methods” section. When Arg\(_9\) approaches the bottom of the membrane (e.g., the WE4 & WE5 simulations), the slope of the free energy curve decreases compared with that in the hydrophobic core (e.g., the WE2 & WE3 simulations). During the WE4 & WE5 simulations, the penetration depth is drastically increased, and the number of transported water molecules increases. Our simulation confirms the previous simulations8,31 that the slope of the potential of mean force becomes smaller in the presence of the water pore. The total free energy barrier from the upper solution to the bottom of the lower leaflet is about 70 kcal/mol. Therefore, the translocation of single Arg\(_9\) across the DOPC/DOPG(4:1) membrane is energetically unfavorable even though the water molecules decrease the energy barrier.

Figure 6
figure 6

The potential of mean force(pmf) vs. the progress coordinate. 20 ns means that we collected the data between 0 to 20 ns (after discarding the first 20 ns data), 40 ns means 0 to 40 ns data, and so on.

Table S1 shows two additional simulations at [− 45.0 Å, − 33.0 Å], i.e., WE5a and WE5b simulations. We want to compare the translocation efficiency and the orientation angle of Arg\(_9\) in each simulation. In Fig. S3, we compare the penetration depth and the orientation angle from WE5, WE5a, and WE5b simulations. The solid blue and red lines correspond to the penetration depth and orientation angle from the WE5 simulation. The results from the WE5a and WE5b simulations were shown as the dashed and dashed-dot lines in the figure. When the penetration depth changes rapidly, the orientation angle shows similar behavior. Whenever Arg\(_9\) reaches the bottom of the membrane, the orientation angle becomes smaller (40 \(\sim\) 50\(^\circ\)). The upright position seems efficient for the translocation of Arg\(_9\) in our simulation. Our simulation suggests that the water flow and the orientation angle of Arg\(_9\) play a role in translocation. We observed only a single translocation; more translocated trajectories are needed to confirm our findings. Furthermore, we might need other observables to quantify the orientation of Arg\(_9\) during the translocation.

The water pore was very transient and lipid flip-flops were observed

When Arg\(_9\) was located at the bottom of the membrane, the WE5 simulation was stopped, and an additional conventional MD simulation (without using the WE method) was run at this position. The simulation method is the same as in the “Methods” section and in the previous work19,32. A total of 1 \(\mu\)s simulation was performed. During the simulation, Arg\(_9\) moved up and down for a while, and eventually, it moved to the bottom of the membrane and stayed there until the end of the simulation. The water pore was closed at 70 ns, and there was no translocation of water molecules after that. Figure S4 presents several snapshots during this simulation which show a closure of the water pore and the behavior of Arg\(_9\). The pore lifetime in our simulations (WE simulations + an additional conventional simulation) is less than 100 ns (Fig. S5), and the order of lifetime is very similar to previous results for other systems33,34.

Another interesting finding during the simulation was the translocation of lipids molecules. The phospholipids(PL) flip-flop (or PL translocation) has been well studied in numerous works11,35,36,37,38,39,40,41,42. There are two primary mechanisms for the lipid flip-flop: One is water pore-mediated. The other one is peptide-mediated. As shown in Fig. 2, a few lipids in the upper leaflet moved down along with Arg\(_9\). A total of six DOPG lipids moved down to the lower leaflet during the WE simulation, and two of them moved back to the upper leaflet when Arg\(_9\) moved up during the conventional simulation (Fig. S4). Our simulation shows that the lipids’ location was switched rapidly from the upper leaflet to the lower leaflet and vice versa. The total time of PL translocation (moved from the upper leaflet to the lower and then moved back to the upper leaflet) was about 75 ns. This time scale is due to the fast movement of Arg\(_9\) in our simulations. Our simulations suggest that the primary mechanism of the lipid translocation is a peptide-mediated process; however, the water flow also affects PL translocation because the peptide movement and the water flow are closely related, as shown in the WE simulations and the additional conventional simulation. We found that the translocated lipids were quickly reorganized (Fig. S4). After the translocation of lipids to the lower or the upper leaflet, the flip-flopped lipids were well-reorganized with the other lipids during the conventional simulation.

Discussion

Translocation of a single Arg\(_9\) was observed using the WE method within the MD simulations approach. Compared with the previous WE simulation19, which didn’t show any translocation of Arg\(_9\) across the membrane, the smaller bin size is one of the critical factors that made it possible to observe a spontaneous translocation across the model membrane. Finding an appropriate number of walkers(replicas) and a reasonable bin size in the WE simulations is difficult without trial and error. Note that an adaptive bin scheme was already developed for the WE simulation to change the bin size on-the-fly43. This scheme could be applied to the current work to make rapid translocation of Arg\(_9\) or any other CPPs to study their transport mechanisms. Our WE simulations showed that the translocation of a single Arg\(_9\) was energetically unfavorable due to higher free energy barriers (\(\sim\) 70 kcal/mol) between Arg\(_9\) and the model membrane along the translocation path.

It has been known that one of the essential factors in the translocation of CPPs is the concentration of peptides. Although we used four Arg\(_9\)s in our WE simulation, the concentration in a local area is insufficient to see collective behavior between Arg\(_9\)s. The progress coordinate in our WE simulation was defined as the distance between the upper leaflet and one of the Arg\(_9\)s, and thus the other three Arg\(_9\)s didn’t show any penetration into the model membrane during the whole simulation. The collective behavior between CPPs could enhance the efficiency of translocation. Although our simulation shows that the translocation of a single CPP is not favorable, the WE method can be used to study spontaneous translocation at low concentrations. One can implement a progress coordinate as a distance between the center of mass of the upper leaflet’s phosphorus atoms and more than two CPPs, e.g., two Arg\(_9\)s, three Arg\(_9\)s, or four Arg\(_9\)s at the same time. However, our experience in the WE simulations shows that it is not easy to observe simultaneous translocation of more than two CPPs due to much slower CPPs than a single CPP movement.

Another critical factor in our successful translocation is the orientation of Arg\(_9\) relative to the membrane. Our simulation showed that the orientation angle of Arg\(_9\) is essential to determining the energy barrier along the translocation path of Arg\(_9\). It is much easier for Arg\(_9\), which was oriented along the normal of the membrane surface (i.e., parallel to the z-axis), to translocate across the membrane, as shown in Fig. S3. It has been known that the peptides’ orientation depends on the concentration. For example, a two-state model has been suggested to orient adsorbed peptides: S-state & I-state44. Here, the S-state means that peptides are parallel to the membrane and observed at low P/Ls, while the I-state denotes that peptides are perpendicular to the membrane and observed above the threshold concentration. To stabilize the water pore, it is necessary to change the average peptide orientation from S to I11,44. The threshold value (P/L\(^{*}\)) is the peptide concentration when the energy levels of the S and I states are equal. Although Huang’s two-state model was initially developed for antimicrobial peptides, the orientation of Arg\(_9\) relative to the membrane in our simulations can also depend on the P/L ratio. The P/L ratio in our system is low (P/L \(\sim\) 0.042); however, this ratio is much greater than the critical thresholds in the literature11,45. Note that the threshold value (P/L\(^{*}\)) varies with the lipid composition of the membrane11,44,45. Arg\(_9\) stayed at the S state during most of the simulation time and turned into the I state during the last part of the simulation. Although we found the translocation of Arg\(_9\) across the model membrane within a very short time (a few tens ns), it is necessary to work on different lipid compositions and concentrations to see if there is any significant change in this time scale.

Although we observed a spontaneous translocation of a single Arg\(_9\) within a very short time scale, our results have some limitations. First, there is a considerable hysteresis in the PMF profile using the umbrella sampling due to asymmetric membrane distortions, as shown in Fig. 2. The opening of the water pore did not happen until the peptide reached the lower leaflet, and this makes the PMF plot asymmetric around the center of the membrane. We think that the orientation angle of Arg\(_9\) also contributes to the hysteresis in the PMF plot. A total of 140 ns of sampling in each window may not be enough to sample equilibrium configurations. Second, the external electric field was applied in all the simulations, including the WE simulations, and this electric field could contribute to the significant distortions of the membrane because our membrane has asymmetric distributions of DOPG lipids in both layers. It has been shown that the penetration of R8 within coarse-grained simulations depends on the external electric field46, and the authors showed that a higher electric field resulted in a short penetration time. The free energy calculations were also dependent on the electrostatic properties of the membrane47. Although our electric field strength (0.05 V/nm) is smaller than Wang et al.’s (0.17 \(\sim\) 0.20 V/nm)46, we can not neglect the electric field effect on our charged Arg\(_9\) and its translocation across the membrane. We may need another WE simulation without the external electric field to see the changes in the translocation of charged peptides across the charged membrane. Third, the translocation time scale reported here can not be compared with experiments. In our current simulation setup, it is challenging to compute transition probabilities between bins and thus a mean first passage time. Short time scales in our current work are only computer simulation time. Last, although we observed the translocation of Arg\(_9\) within a very short time scale, the umbrella sampling showed that the free energy barrier is still large for a single Arg\(_9\) to translocate across the membrane. The single reaction coordinate (the penetration depth in the z-direction) in our umbrella sampling may not be enough to identify the actual translocation paths of Arg\(_9\) and the free energy barriers along the paths. One can add an additional reaction coordinate (e.g., the orientation angle of Arg\(_9\)) in the umbrella sampling to see if there is any change in the free energy barrier.

Our WE simulations suggest that the WE method can effectively sample rare events, such as a spontaneous translocation of CPP within MD simulations approach. Furthermore, we expect the WE method to be applied to any CPPs to reveal interactions between CPPs and various membranes. Moreover, the WE method can provide insights into the transport mechanisms of various membrane-active peptides(MAPs).

Methods

Conventional MD simulations

All simulations were performed using the NAMD package48 and CHARMM36 force field49. In addition, we used a pre-equilibrated structure from the previous study32, equilibrated up to 1 \(\mu s\), that was also used in the previous WE simulations19. The detailed method regarding the WE simulations is provided in the following subsection. Here is a recap of the equilibrium simulation: Our system contains 4 Arg\(_9\), DOPC/DOPG(4:1) membrane, TIP3P water molecules, and ions generated by CHARMM-GUI50. It was well equilibrated before starting the WE simulations. The DOPC/DOPG(4:1) mixture consists of 76 DOPC and 19 DOPG lipids in each layer. K\(^+\) and Cl\(^-\) ions were added to each system to make a concentration of 150 mM. All Arg\(_9\) peptides were initially located in the upper solution and bound to the upper layer during the equilibration. The NPT simulations were performed at T = 310 K. Temperature and pressure were kept constant using Langevin dynamics. An external electric field(0.05 V/nm) was applied in the negative z-direction(from CPPs to the membrane) as suggested in the previous work7,51 and also in our earlier simulations19,32 to account for the transmembrane potential52. The particle-mesh Ewald(PME) algorithm was used to compute the electric forces, and the SHAKE algorithm allowed a 2 fs time step during the simulation. During the equilibration, CPPs were confined in the upper water box, so there was no interaction between CPPs and the lower leaflet of model membranes. Whenever a CPP was leaving the upper water box, a small force was applied to pull that CPP inside the box. All CPPs were well contacted with the lipid molecules after the 1 \(\mu\)s long equilibration, and then the WE simulations were performed using this equilibrated system.

Weighted ensemble (WE) simulations

We use the WESTPA (The Weighted Ensemble Simulation Toolkit with Parallelization and Analysis) software package (v2020.05, see https://github.com/westpa/westpa/wiki/Installing-WESTPA)20,21,22 to enable the simulation of rare events, for example, the translocation of CPPs across a model membrane. WESTPA is open-source, and its utility has been proven for many problems. All WE trajectories are unbiased and used to calculate conditional probabilities or transition rates18.

To use the WE method in MD simulations, we need to define a progress coordinate, the total number of bins, the number of walkers (child simulations or replicas) in each bin, and a time interval for splitting and combining trajectories20. We define the progress coordinate as a distance in the z-direction between the center of mass of phosphorus atoms in the upper leaflet and that of Arg\(_9\). After equilibration of the system, an initial distance between phosphorus atoms and Arg\(_9\) was measured, and a boundary was set using this initial position and the position of the center of the membrane, for example, [− 18 Å (the center of membrane), 3 Å (the initial distance)]. Each boundary can have different bin sizes (0.25 Å or 0.1 Å). For example, we set a smaller bin size when the peptide has difficulty overcoming the free energy barriers. The number of walkers (replicas) in each bin was 5. Due to the shortage of computing resources, we had to run several WE simulations to combine all the trajectories and observe the translocation of Arg\(_9\) (Table 1). In the current setup (for example, in WE2), replicas can move above the upper boundary (− 14  Å); however, replicas can not move below the lower boundary (− 25  Å). Therefore, when one of the replicas reached the lower boundary, the WE simulation was stopped automatically, and we continued the simulation at this position with a new boundary. The time interval for splitting-combining the trajectories (called an iteration in Table 1) during each WE simulation was 5 ps. The total iterations during all the WE simulations were 5080. The progress coordinate was calculated using MDAnalysis53.

Table 1 A list of WE simulations.

Umbrella sampling

We used umbrella sampling to obtain the PMF along the translocation path. We chose 49 frames among the total of 5080 frames(iterations). Note that the size of each window is not uniform. We used the colvar module for the umbrella sampling48. The distance in the z direction between the center of mass of \(C_\alpha\)s in Arg\(_9\) and that of phosphorus atoms was restrained using the harmonic restraints with a force constant (k = 10.0  kcal/mol/ Å\(^2\)). We sampled simulation data at each window for 140 ns. We discarded the initial 20 ns and analyzed the rest of the data (120 ns) using the WHAM(weighted histogram analysis method) code54.