In silico design and optimization of selective membranolytic anticancer peptides

Gabernet, Gisela; Gautschi, Damian; Müller, Alex T.; Neuhaus, Claudia S.; Armbrecht, Lucas; Dittrich, Petra S.; Hiss, Jan A.; Schneider, Gisbert

doi:10.1038/s41598-019-47568-9

Download PDF

Article
Open access
Published: 02 August 2019

In silico design and optimization of selective membranolytic anticancer peptides

Scientific Reports volume 9, Article number: 11282 (2019) Cite this article

6117 Accesses
39 Citations
6 Altmetric
Metrics details

Subjects

Abstract

Membranolytic anticancer peptides represent a potential strategy in the fight against cancer. However, our understanding of the underlying structure-activity relationships and the mechanisms driving their cell selectivity is still limited. We developed a computational approach as a step towards the rational design of potent and selective anticancer peptides. This machine learning model distinguishes between peptides with and without anticancer activity. This classifier was experimentally validated by synthesizing and testing a selection of 12 computationally generated peptides. In total, 83% of these predictions were correct. We then utilized an evolutionary molecular design algorithm to improve the peptide selectivity for cancer cells. This simulated molecular evolution process led to a five-fold selectivity increase with regard to human dermal microvascular endothelial cells and more than ten-fold improvement towards human erythrocytes. The results of the present study advocate for the applicability of machine learning models and evolutionary algorithms to design and optimize novel synthetic anticancer peptides with reduced hemolytic liability and increased cell-type selectivity.

Predicting cell-penetrating peptides using machine learning algorithms and navigating in their chemical space

Article Open access 07 April 2021

Evolutionary computational platform for the automatic discovery of nanocarriers for cancer treatment

Article Open access 21 September 2021

Structure-based discovery of novel P-glycoprotein inhibitors targeting the nucleotide binding domains

Article Open access 01 December 2023

Introduction

Cancer therapy faces the challenge of resistance to chemotherapeutics and receptor-targeted anticancer agents. Several cell resistance mechanisms, such as drug inactivation or efflux, target protein alteration, DNA damage repair and signaling cascade alteration have been identified^1,2. Moreover, the indiscriminate action of most chemotherapeutics towards all rapidly dividing cells causes a variety of severe side effects^3,4. Membranolytic anticancer peptides (ACPs) represent a new class of potential cancer therapeutics. Their receptor-independent mechanism of action may hinder the development of cellular resistance^3,4,5. Nevertheless, the underlying structure-activity relationship that explains the membranolytic properties of these peptides is not completely understood. Peptide amphipathicity, moderate overall hydrophobicity, and a positive net charge are known requirements for ACP activity^6,7,8,9. However, no simple combination of these properties has been found sufficient to fully explain the activity and selectivity of ACPs towards cancer cells¹⁰. Producing novel peptides lacking toxicity against nonneoplastic cells also remains challenging¹¹. Various machine learning methods have been successfully applied to guide the rational design of both ACPs^{12,13,14,15,16,17,18} and antimicrobial peptides (AMPs)^19,20, as well as other membrane-active peptides²¹. The lack of a systematic annotation of the selectivity of ACPs towards cancer cells in the literature and in peptide databases has hindered the development of predictive models that take selectivity into account. There is a need for innovative methods that do not require selectivity data for peptide optimization.

Simulated molecular evolution (SME) is a stochastic optimization algorithm pioneered in the 1990s for computational peptide design^22,23,24. SME belongs to the class of evolutionary algorithms, which also includes genetic algorithms, and enables the optimization of peptide properties that are encoded in a theoretical fitness function or in combination with an experimental fitness evaluation when structure-activity relationships cannot be determined a priori. We have recently applied this design concept to generate innovative membrane-targeting peptides^25,26. Here, we present a peptide design approach that is based on a novel ACP prediction model and on SME for the optimization of ACP selectivity for cancer cells. The predictive machine learning model led to the discovery of four novel synthetic ACPs with low-micromolar activity (1–20 µM) against A549 lung cancer and MCF7 breast cancer cells. One of these peptides was then subjected to SME. After the first iteration of the optimization process, we obtained a novel ACP that showed micromolar activities against a range of cancer cell types with significantly reduced activity towards human dermal microvascular cells (HDMEC) and human erythrocytes. The results of this study advocate for machine-learning models in combination with computational sequence generators for designing and optimizing functional peptides in silico.

Results and Discussion

ACP classifier model

We developed a machine learning model to classify peptides into ACPs and non-ACPs based on their amino acid sequence representations. The machine-learning classifier was trained on “positive” (ACPs, active) and “negative” (non-ACPs, inactive) peptides. We retrieved alpha-helical ACPs from the CancerPPD database²⁷ as positive examples (N = 339). For the negative class, we retrieved alpha-helices from nontransmembrane proteins in the PDB database²⁸ (N = 680). All amino acid sequences were represented numerically in a computer-readable form by the use of molecular descriptors. For this purpose, we utilized a combination of PEPCATS pharmacophore feature descriptors²⁹ and four global properties, namely, Eisenberg’s hydrophobicity, Eisenberg’s hydrophobic moment³⁰, charge density, and peptide length (number of residues). The PEPCATS descriptor represents the amino acid sequences as binary vectors indicating cross-correlated pharmacophore features of the individual amino acids (hydrophobic, aromatic, hydrogen-bond acceptor, hydrogen-bond donor, positively ionizable, negatively ionizable). The cross-correlation of pharmacophoric feature pairs is determined within a sliding sequence window encompassing seven residues. The resulting 151-dimensional descriptor vector was reduced to an 18-dimensional feature vector by covariance elimination and sequential feature elimination (Fig. S1, Supplementary Information). The dataset was split into a training set (2/3) and an independent test set (1/3) by stratified sampling, preserving the proportion between the positive and negative classes. Two machine learning algorithms were considered for model development: random forests³¹ and support vector machines (SVM)³². We optimized the SVM model’s hyperparameter by 10-fold cross-validation on the training data and chose a linear kernel for SVM training to enable straightforward feature interpretation. The performance of both classifiers exceeded 0.9 for both the training and the test data for all calculated metrics (Table 1). The SVM model was selected for further analysis due to the robustness of its decision function, which is determined solely by the support vectors and therefore unaltered by the addition of new data points that lie outside the decision margin³². Additionally, an analytical decision function as a linear combination of the model features can be extracted from linear support vector machines, whose weights indicate feature importance for the classification problem³².

Table 1 Performance of support vector machine and random forest models for ACP prediction.

Full size table

We then compared the performance on the test dataset for our SVM model to online available ACP prediction tools, specifically the AntiCP models 1 and 2¹³, the iACP model³³, and the MLACP model¹⁸. These ACP prediction models are also based on an SVM classifier but utilize different descriptors and training data (Table S1, Supplementary Information). The prediction performance of the four classifiers and our SVM model was assessed on the independent test dataset (Table 2). In this experiment, the performance of our SVM model on the independent test set was superior to all four publicly available ACP prediction models in terms of all performance metrics, except for precision. The MLACP model showed higher precision but lower Matthews correlation coefficient (MCC), accuracy and recall than the other models. Therefore, the MLACP model is better at avoiding false positives but retrieves a higher number of false negatives compared to the SVM model developed in this study.

Table 2 Comparison of the model performance (Test score) with other online available ACP prediction tools calculated by using the independent test set. The Matthews correlation coefficient (MCC), accuracy, precision and recall were used as metrics (Methods Eqs 1–4).

Full size table

Feature importance for ACP activity

We analyzed the feature weights of the SVM classifier to gain an understanding of important discriminatory features for distinguishing between ACPs and non-ACPs (Table 3, Fig. S2, Supplementary Information). Features were ranked by their absolute weight values as a measure of their relative importance for ACP classification. The global hydrophobicity (H), hydrophobic moment (µ_H) and the frequency of positively charged amino acid pairs separated by one residue (PPd2) were identified as important features of the classifier (weight values w = 1.65, w = 0.5 and w = 0.39, respectively). This finding is in accordance with previous reports on ACPs that highlight the relevance of the hydrophobicity, the hydrophobic moment and a net positive charge for anticancer activity^7,34. The peptide length was also identified as a discriminatory feature (w = 0.4), indicating that longer peptides were considered more likely to be active. Two features that take into account the frequency of amino acids with hydrogen-bond donor and acceptor groups (ADd0, DDd0) were identified as bearing the greatest absolute weights (w = −1.94 and w = 1.67, respectively), emphasizing their role in distinguishing ACPs from inactive peptides (Table 3).

Table 3 The 18 features obtained after covariance elimination and sequential feature selection.

Full size table

De novo design of ACPs

To make use of the SVM model for the in silico design of novel ACPs, we generated three virtual peptide libraries of 100,000 peptides each, based on different design principles (Fig. S3, Supplementary Information):

(1)
The Helical library contains peptides with the position-dependent amino acid distribution of alpha-helical ACPs¹¹.
(2)
The Amphipathic Arc library contains amphipathic peptides with differently sized hydrophobic arcs and a high probability of being cationic.
(3)
The Gradient library contains amphipathic peptides that possess a linear hydrophobic gradient.

We predicted the activity of the peptides from each library with our SVM model (Fig. S4, Supplementary Information). More than 80% of the peptides from the Amphipathic Arc and Gradient libraries and more than 60% of the peptides from the Helical library received an SVM score >0.5, indicating potential actives. In contrast, only 10% of peptides with random sequences were predicted to be active. The design principles, therefore, enriched the libraries with potentially active peptides in contrast with random peptide generation.

The similarity of the peptides in the training data was analyzed to consider the applicability domain of the SVM model³⁵; this domain is the chemical space in which the model predictions may be considered reliable. The SVM model was utilized to estimate the pseudo-probabilities (i.e., the probabilities predicted by the model) of the peptides to belong to the active and inactive classes. These scores were subsequently weighted by the similarity to the training data to obtain similarity-weighted scores that consider the model’s applicability domain (ϕ_ACP, ϕ_Neg, Eqs 5 and 6).

From each peptide library, we selected the two peptides with the highest ϕ_ACP and ϕ_Neg scores. None of the peptides were found in the training data or the CancerPPD database. No peptides were retrieved from the CancerPPD database with >95% similarity to the selected ones, as determined by the CD-HIT program³⁶. We finally synthesized the 12 peptides and determined their half-effective concentration (EC₅₀₎ values against the MCF7 and A549 cancer cell lines. For 10 of the 12 synthesized peptides, the predictions were correct (Table 4). All of the peptides predicted to be inactive did not kill more than 50% of the cancer cells at a concentration of 50 µM. Of the six peptides predicted to be active, two were determined to be false positives (inactive at 50 µM) (Figs S10 and S11, Supplementary Information). Of the four correctly predicted active peptides, three were active in a low-micromolar range against both of the tested cancer cell lines, and the fourth (Gradient2) showed activity solely against MCF7 cells (Table 4).

Table 4 Experimental validation of the SVM prediction model.

Full size table

The AmphiArc2 peptide, the shortest peptide of the low micromolar active peptides, has a high hydrophobic moment (µ_H = 0.87) and a 180° arc of hydrophobic residues in an idealized helical structure (Fig. 1a). As determined by circular dichroism (CD) spectroscopy, the AmphiArc2 peptide is unstructured in pure water but adopts an alpha-helical structure in a hydrophobic environment (in 50% v/v water:2,2-trifluoroethanol, TFE) (Fig. 1b). Helix formation in a hydrophobic, membrane-like environment has been shown to be a characteristic of certain alpha-helical AMPs and ACPs^37,38. To further investigate its membranolytic action, we observed the activity of AmphiArc2 on single MCF7 cells entrapped in a microfluidic chip. Video recordings showed morphological changes in the cell membrane and leakage of the cytosolic components as early as 30 seconds after initial contact with the peptide in the cells (Fig. 1c, Supplementary Information, Video SV1). After 95 seconds, the dye encapsulated in the cancer cell had leaked out, and the cell membrane showed deformations and blebbing.

After characterizing the anticancer activity of the AmphiArc2 peptide, we tested its cell-type selectivity. We determined its EC₅₀ value against the noncancer HDMEC primary cell line and half-effective hemolytic concentration (HC₅₀) against human erythrocytes (Fig. 1d). Both values were found to be in the same low-micromolar range as the EC₅₀ against cancer cell lines, indicating toxicity of this peptide against noncancer cells.

Selectivity optimization of a de novo designed ACP

We applied the SME algorithm to improve the selectivity of the AmphiArc2 peptide towards noncancer cells. SME contained a variation (mutation) and a selection operator (Fig. 2a). By variation, a series of offspring was generated from a parent sequence. The fittest offspring of a generation was selected and used as a parent in the next SME iteration. In this study, parents were selected among the offspring that maintained anticancer activity but showed enhanced selectivity for cancer cells (selection operator). The mutations in the sequence variation step were performed according to a normalized Gaussian probability distribution of pairwise amino acid similarity (d_ij) (Fig. 2b). As a similarity measure, we utilized the Grantham matrix, which takes into account the atom composition, the polarity and the molecular volume of the residues³⁹. The probability of substitution of residue i to residue j decreases with decreasing pairwise amino acid similarity. The degree of similarity of the offspring peptides to the parent sequence (offspring diversity) was controlled via the sigma (σ) parameter (Fig. 2b). A higher sigma value allowed the generation of sequences further away from the parent peptide (Fig. S5, Supplementary Information).

We performed a total of three SME iterations, starting from the AmphiArc2 peptide. In the first iteration, we generated 10 offspring peptides with σ = 0.1 (Fig. 2c). The mutations introduced by this sigma value were conservative amino acid changes that maintained the overall amphipathicity of the peptide. We synthesized and tested all ten offspring peptides of the three SME generations against the MCF7 and A549 cancer cell lines to determine their anticancer activity. For selectivity assessment, we tested their activity against the noncancer HDMEC primary cell line and measured their hemolytic activity on human erythrocytes (Fig. 2d).

The results obtained demonstrate that small conservative amino acid replacements affected the activity and selectivity of these ACPs while conserving their overall amphipathic helical structure in a lipophilic environment. Offspring n.2 (Off2) maintained the low-micromolar activity of the AmphiArc2 peptide against the A549 and MCF7 cancer cells but showed a 12-fold reduction of hemolytic activity against human erythrocytes and a ten-fold reduction of activity against HDMEC cells (Fig. 2d). Therefore, we selected Off2 as the parent for the next SME iteration (Fig. 3), in which ten new peptides were generated (Off2.1 to Off2.10).

The second generation of peptide variation did not achieve meaningful selectivity improvements with respect to HDMEC cells (Fig. S7, Supplementary Information). Five of the offspring peptides (Off2.1, Off2.3, Off2.4, Off2.9, Off2.10) were inactive. This loss of activity correlated with the introduction of a proline residue in the sequence (Fig. S7, Supplementary Information). Prolines affect alpha-helical conformation by introducing helix kinks and breaks⁴⁰. We corroborated this secondary structure disruption with circular dichroism analysis of Off2.1, Off2.3, Off2.4 and Off2.9 (Fig. S9, Supplementary Information).

In the third SME generation (Off2.2.1 – Off2.2.10), we actively omitted proline residues and reduced the sigma value from 0.1 to 0.06 to explore close analogs of Off2 and Off2.2 (Fig. S8, Supplementary Information). Off2.2.10 showed decreased activity towards the noncancer HDMEC primary cells (Fig. 3d). This increase in selectivity was accompanied by a decreased activity against both the A549 and MCF7 cell lines.

The most active, but nonselective, AmphiArc2 parent peptide and the most cancer-cell selective Off2.2.10 peptide possess several differences and commonalities in their physicochemical properties. Even though both peptides display a hydrophobic arc of 180°, the hydrophobic moment of Off2.2.10 (µ_H = 0.64) is lower than that of AmphiArc2 (µ_H = 0.87) (Fig. 3c). The parent peptide bears eight positive charges, while Off2.2.10 contains seven positively ionizable residues caused by the N-terminal K1Q mutation. This moderate reduction of both the hydrophobic moment and the net positive charge improved the peptide selectivity for cancer cells and reduced the risk of killing non-transformed cells. To further explore these sequence features, we analyzed the ratios of the EC₅₀ in the noncancer cells and in the cancer cell lines of all tested peptides. The more selective peptides (higher EC₅₀ ratio) are characterized by moderate hydrophobic moments and charge densities (Supplementary Information, Fig. S10), suggesting a guideline for optimizing the cancer-cell selectivity of ACPs. This observation is in accordance with reports stating that decreasing the hydrophobic moment of helical ACPs reduces both their hemolytic potential and anticancer activity^7,8,9.

NCI-60 cancer cell panel testing

The ACP candidates AmphiArc2 (parent), Off2 and Off2.2.10 were tested on the NCI-60 cancer cell panel⁴¹. The three tested peptides inhibited the growth of all the cancer cell lines in the NCI-60 panel at a low micromolar concentration (Table 5, Supplementary Information Table S3). This result corroborated the wide-spectrum effect of the anticancer peptides across a range of cancer types. Both the activity of Off2 and Off2.2.10 peptides on the cell lines tested were significantly lower than the anticancer activity of the AmphiArc2 peptide (p-value = 4.9 × 10⁻¹³ and 1.7 × 10⁻¹², respectively, Welch two sample t-test), suggesting that the initial increased cancer cell selectivity comes at a cost of an activity loss. No significant anticancer activity difference was found between Off2 and Off2.2.10 peptides (p-value = 0.66, Welch two sample t-test), indicating that the additionally improved anticancer selectivity does not affect the average anticancer activities of these two peptides.

Table 5 Cellular growth inhibition of 60 cell lines in the NCI-60 cancer cell test for the AmphiArc2 (Parent), Off2 and Off2.2.10 peptides.

Full size table

Conclusions

In this study, the combination of a machine learning model and the SME algorithm resulted in ACPs with low-micromolar potency against a wide variety of cancer cells (NCI-60 panel) and selectivity with respect to non-transformed cells (HDMEC) and human erythrocytes. The machine-learning classifier alone was able to identify active peptides but was insufficient to identify cancer cell selective peptides. Virtual screening of computationally designed peptide libraries with the implemented machine-learning classifier led to the discovery of four novel ACPs as the starting point for selectivity optimization by SME. In the first design-synthesize-test cycle, peptide hemolysis was reduced ten-fold, and after three cycles, peptide activity towards noncancer cells was reduced more than 20-fold while retaining anticancer activity compared to the parent peptide (AmphiArc2). The results of this study advocate for the SME method for experiment-guided peptide design and for exploration of the ACP structure-activity landscape. SME is applicable to all kinds of experimental readouts and provides an alternative to more conventional peptide optimization techniques, e.g., alanine scanning. At the same time, the results suggest that additionally increased cancer cell selectivity of membranolytic ACPs might come at the price of reduced peptide potency. This working hypothesis provides a basis for future study.

Methods

Machine learning model

Both machine learning models were constructed in Python v2.7 using the Scikit-Learn v0.18 library. For model training, the peptide dataset was split into 2/3 training and 1/3 testing subsets. Random forest classifier: the number of trees (“n_estimators”) was set to 500, and the number of features to be considered by each tree (“max_features”) was set to the squared root of all features (“sqrt”). SVM classifier: a linear kernel was employed and hyperparameter C was optimized by a ten-fold cross-validation in which the model is trained on 90% of the training data and validated on the remaining 10% in ten repetitions of training. The obtained mean of the 10 repetitions (cross-validation MCC score) was used to evaluate the performance of the models. The test scores were obtained with the independent test set.

Scoring metrics

The Matthews correlation coefficient (MCC, Eq. 1), accuracy (Eq. 2), precision (Eq. 3) and recall (Eq. 4) were calculated. TP, FP, TN and FN correspond to the number of true positives, false positives, true negatives and false negatives predicted by the model, respectively.

$$MCC=\frac{TP\times TN-FP\times FN}{\sqrt{(TP+FP)(TP+FN)(TN+FP)(TN+FN)}}$$

(1)

$$Accuracy=\frac{TN+TP}{TN+TP+FN+FP}$$

(2)

$$Precision=\frac{TP}{TP+FP}$$

(3)

$$Recall=\frac{TP}{TP+FN}$$

(4)

Data weighted scoring functions

To appropriately consider the applicability domain of the SVM classifier, the final scoring function for ACPs (ϕ_ACP, Eq. 5) and inactive (negative) peptides (ϕ_Neg, Eq. 6) considers both the pseudo-probability of the peptide to be an ACP (P_ACP) as predicted by the SVM model and the similarity of the predicted peptides to the training data (Sim. score). k-means clustering with k = 3 was performed with Python v2.7 and the Scikit-Learn v0.18 library package. The similarity score is calculated as the inverse of the Euclidean distance in descriptor space of the peptides to the three centroids.

$${\varphi }_{ACP}=\frac{{P}_{ACP}+Sim.\,score}{2}$$

(5)

$${\varphi }_{Neg}=\frac{(1-{P}_{ACP})+Sim.\,score}{2}$$

(6)

Virtual peptide libraries

Three virtual peptide libraries were generated according to three different design principles. For each library, the peptide length was restricted to a range of 11 to 30 amino acids, as peptides able to fold in an alpha-helix are typically inside this range⁴². Duplicate sequences were eliminated, and the similarity of the sequences was restricted with the CD-HIT³⁶ program to a threshold of 0.8 similarity. A total of 10⁶ peptides were selected from each of the libraries.

Helical library. The Helical library was generated with the position-dependent amino acid distributions of 62 anuran and hymenopteran alpha-helical ACPs¹¹ in amino acid positions 1–18 (exactly 5 helical turns). For longer peptides, the pattern was repeated. The method to generate this library is included in the modlAMP⁴³ Python package (modlamp.sequences.HelicesACP).
Amphipathic Arc library. The design principle of the Amphipathic Arc library was amphipathic peptide sequences, which would potentially be alpha-helical with a preference for positively charged amino acids in the polar phase of the helix and varying hydrophobic arcs in the range 100–260°. The method to generate this library was included in the python package modlAMP as the class AmphipathicArc (modlamp.sequences.AmphipathicArc).
Gradient library. The Gradient library was designed using the same procedure as the Amphipathic Arc library but with an additional hydrophobic gradient in the peptide structure from the N- to the C-terminus. For this, the amino acids in the C-terminal third of the peptide sequence were substituted with hydrophobic amino acids. In the modlAMP package, this was achieved by the method make_H_gradient in the modlamp.sequences.Amphipathic Arc class.

Simulated molecular evolution

The simulated molecular evolution (SME) algorithm is based on the (1, λ) evolution strategy⁴⁴ in which λ mutated sequences (offspring) are generated from a parent sequence^22,23,25. The offspring was scored according to a fitness function, which was defined as the experimentally determined peptide anticancer activity and selectivity with respect to non-transformed cells. The best offspring were selected as a parent for the following optimization iteration. The amino acid mutations were generated according to an amino acid similarity matrix that has been row-normalized (d_ij) to allow for a pseudo-probability calculation of the amino acid transitions (Eq. 7). Here, the Grantham amino-acid similarity matrix was utilized³⁹. The amino acids cysteine and methionine were excluded from the mutation matrix to avoid potential peptide cyclization and facilitate peptide synthesis.

$$P\,(i\to j)=exp(-\frac{{d}_{ij}^{2}}{2{\sigma }^{2}})/\sum _{j}\,exp(-\frac{{d}_{ij}^{2}}{2{\sigma }^{2}}).$$

(7)

where σ is a strategy parameter that controls the distance of the offspring sequences to the parent sequence and, thus, the sequence diversity among the offspring. The σ strategy parameter was set to 0.1 for the two initial SME iterations. Sequence diversity was characterized by the Shannon entropy⁴⁵ (H) of the residue distribution among the offspring (Eq. 8), where p_i corresponds to the frequency of amino acid i in a certain sequence position. The Shannon entropy values were normalized to [0, 1]. The simulated molecular evolution strategy and Shannon entropy calculation were programmed with Python v2.7.

$$H=\sum _{i=1}^{20}\,{p}_{i}lo{g}_{2}{p}_{i}.$$

(8)

References

Holohan, C., Van Schaeybroeck, S., Longley, D. B. & Johnston, P. G. Cancer drug resistance: an evolving paradigm. Nat. Rev. Cancer 13, 714–726 (2013).
Article CAS Google Scholar
Chatterjee, S., Damle, S. G. & Sharma, A. K. Mechanisms of resistance against cancer therapeutic drugs. Curr. Pharm. Biotechnol. 15, 1105–1112 (2014).
Article CAS Google Scholar
Papo, N. & Shai, Y. Host defense peptides as new weapons in cancer treatment. C. Cell. Mol. Life Sci. 62, 784–790 (2005).
Article CAS Google Scholar
Schweizer, F. Cationic amphiphilic peptides with cancer-selective toxicity. Eur. J. Pharmacol. 625, 190–194 (2009).
Article CAS Google Scholar
Mader, J. S. & Hoskin, D. W. Cationic antimicrobial peptides as novel cytotoxic agents for cancer treatment. Expert Opin. Investig. Drugs 15, 933–946 (2006).
Article CAS Google Scholar
Riedl, S. et al. In search of a novel target — phosphatidylserine exposed by non-apoptotic tumor cells and metastases of malignancies with poor treatment efficacy. Biochim. Biophys. Acta - Biomembr. 1808, 2638–2645 (2011).
Article CAS Google Scholar
Harris, F., Dennison, S. R., Singh, J. & Phoenix, D. A. On the selectivity and efficacy of defense peptides with respect to cancer cells. Med. Res. Rev. 33, 190–234 (2013).
Article CAS Google Scholar
Huang, Y., Wang, X., Wang, H., Liu, Y. & Chen, Y. Studies on mechanism of action of anticancer peptides by modulation of hydrophobicity within a defined structural framework. Mol. Cancer Ther. 10, 416–426 (2011).
Article CAS Google Scholar
Yang, Q.-Z. et al. Design of potent, non-toxic anticancer peptides based on the structure of the antimicrobial peptide, temporin-1CEa. Arch. Pharm. Res. 36, 1302–1310 (2013).
Article CAS Google Scholar
Dennison, S. R., Harris, F., Bhatt, T., Singh, J. & Phoenix, D. A. A theoretical analysis of secondary structural characteristics of anticancer peptides. Mol. Cell. Biochem. 333, 129–135 (2010).
Article CAS Google Scholar
Gabernet, G., Müller, A. T., Hiss, J. A. & Schneider, G. Membranolytic anticancer peptides. Med. Chem. Commun. 7, 2232–2245 (2016).
Article CAS Google Scholar
Lin, Y.-C. et al. Multidimensional design of anticancer peptides. Angew. Chem. Int. Ed. 54, 10370–10374 (2015).
Article CAS Google Scholar
Tyagi, A. et al. In silico models for designing and discovering novel anticancer peptides. Sci. Rep. 3, 2984 (2013).
Article Google Scholar
Chen, W., Ding, H., Feng, P., Lin, H. & Chou, K. iACP: a sequence-based tool for identifying anticancer peptides. Oncotarget 7, 16895–16909 (2016).
PubMed PubMed Central Google Scholar
Hajisharifi, Z., Piryaiee, M., Mohammad Beigi, M., Behbahani, M. & Mohabatkar, H. Predicting anticancer peptides with Chou’s pseudo amino acid composition and investigating their mutagenicity via Ames test. J. Theor. Biol. 341, 34–40 (2014).
Article CAS Google Scholar
Saravanan, V. & Lakshmi, P. T. V. ACPP: A web server for prediction and design of anti-cancer peptides. Int. J. Pept. Res. Ther. 21, 99–106 (2015).
Article Google Scholar
Grisoni, F. et al. Designing anticancer peptides by constructive machine learning. ChemMedChem 13, 1300–1302 (2018).
Article CAS Google Scholar
Manavalan, B. et al. MLACP: machine-learning-based prediction of anticancer peptides. Oncotarget 8, 77121–77136 (2017).
Article Google Scholar
Fjell, C. D. et al. Identification of novel antibacterial peptides by chemoinformatics and machine learning. J. Med. Chem. 52, 2006–2015 (2009).
Article CAS Google Scholar
Müller, A. T., Hiss, J. A. & Schneider, G. Recurrent neural network model for constructive peptide design. J. Chem. Inf. Model. 58, 472–479 (2018).
Article Google Scholar
Lee, E. Y., Wong, G. C. L. & Ferguson, A. L. Machine learning-enabled discovery and design of membrane-active peptides. Bioorg. Med. Chem, https://doi.org/10.1016/j.bmc.2017.07.012 (2017).
Article CAS Google Scholar
Schneider, G. & Wrede, P. The rational design of amino acid sequences by artificial neural networks and simulated molecular evolution: de novo design of an idealized leader peptidase cleavage site. Biophys. J. 66, 335–344 (1994).
Article ADS CAS Google Scholar
Schneider, G., Schuchhardt, J. & Wrede, P. Peptide design in machina: development of artificial mitochondrial protein precursor cleavage sites by simulated molecular evolution. Biophys. J. 68, 434–447 (1995).
Article ADS CAS Google Scholar
Schneider, G. et al. Peptide design by artificial neural networks and computer-based evolutionary search. Proc. Natl. Acad. Sci. USA 95, 12179–12184 (1998).
Article ADS CAS Google Scholar
Hiss, J. A., Stutz, K., Posselt, G., Weßler, S. & Schneider, G. Attractors in sequence space: peptide morphing by directed simulated evolution. Mol. Inf. 34, 709–714 (2015).
Article CAS Google Scholar
Stutz, K. et al. Peptide–membrane interaction between targeting and lysis. ACS Chem. Biol. 12, 2254–2259 (2017).
Article CAS Google Scholar
Tyagi, A. et al. CancerPPD: a database of anticancer peptides and proteins. Nucleic Acids Res. 43, 837–843 (2015).
Article Google Scholar
Berman, H. M. The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000).
Article ADS CAS Google Scholar
Koch, C. P. et al. Scrutinizing MHC-I binding peptides and their limits of variation. PLoS Comput. Biol. 9, e1003088 (2013).
Article CAS Google Scholar
Eisenberg, D., Weiss, R. M. & Terwilliger, T. C. The helical hydrophobic moment: a measure of the amphiphilicity of a helix. Nature 299, 371–374 (1982).
Article ADS CAS Google Scholar
Breiman, L. Random Forests. Mach. Learn. 45, 5–32 (2001).
Article Google Scholar
Cortes, C. & Vapnik, V. Support-vector networks. Mach. Learn. 20, 273–297 (1995).
MATH Google Scholar
Chen, Y. et al. Comparison of biophysical and biologic properties of alpha-helical enantiomeric antimicrobial peptides. Chem. Biol. Drug Des. 67, 162–173 (2006).
Article CAS Google Scholar
Riedl, S., Zweytick, D. & Lohner, K. Membrane-active host defense peptides – Challenges and perspectives for the development of novel anticancer drugs. Chem. Phys. Lipids 164, 766–781 (2011).
Article CAS Google Scholar
Schroeter, T. S. et al. Estimating the domain of applicability for machine learning QSAR models: A study on aqueous solubility of drug discovery molecules. J. Comput. Aided. Mol. Des. 21, 651–664 (2007).
Article ADS CAS Google Scholar
Li, W. & Godzik, A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Article CAS Google Scholar
Marion, D., Zasloff, M. & Bax, A. A two-dimensional NMR study of the antimicrobial peptide magainin 2. FEBS Lett. 227, 21–26 (1988).
Article CAS Google Scholar
Zelezetsky, I. & Tossi, A. Alpha-helical antimicrobial peptides—Using a sequence template to guide structure–activity relationship studies. Biochim. Biophys. Acta Biomembr. 1758, 1436–1449 (2006).
Article CAS Google Scholar
Grantham, R. Amino acid difference formula to help explain protein evolution. Science 185, 862–864 (1974).
Article ADS CAS Google Scholar
Nilsson, I. et al. Proline-induced disruption of a transmembrane α-helix in its natural environment. J. Mol. Biol. 284, 1165–1175 (1998).
Article CAS Google Scholar
Monks, A. et al. Feasibility of a high-flux anticancer drug screen using a diverse panel of cultured human tumor cell lines. J. Natl. Cancer Inst. 83, 757–766 (1991).
Article CAS Google Scholar
Manning, M. C., Illangasekare, M. & Woody, R. W. Circular dichroism studies of distorted alpha-helices, twisted beta-sheets, and beta turns. Biophys. Chem. 31, 77–86 (1988).
Article CAS Google Scholar
Müller, A. T., Gabernet, G., Hiss, J. A. & Schneider, G. modlAMP: Python for antimicrobial peptides. Bioinformatics, https://doi.org/10.1093/bioinformatics/btx285 (2017).
Article Google Scholar
Rechenberg, I. Evolutionsstrategie - Optimierung technischer Systeme nach Prinzipien der biologischen Evolution (Frommann-Holzboog, Stuttgart, 1973).
Asadi, M., Ebrahimi, N. & Soofi, E. S. Shannon entropy measures. In Wiley StatsRef: Statistics Reference Online 1–8, https://doi.org/10.1002/9781118445112.stat07920 (John Wiley & Sons, New York, 2017).

Download references

Acknowledgements

The authors thank Sarah Haller for technical support and Prof. Cornelia Halin and Prof. Stephanie Krämer for the use of the cell culture facilities. The Developmental Therapeutics Program of the National Cancer Institute and Dr. John A. Beutler kindly performed the NCI-60 cancer cell panel tests. We thank Dr. Francesca Grisoni and Alexander L. Button for constructive input and discussions. This work was financially supported by the Swiss National Science Foundation (Grant No. 2000021_157190 to G.S. and J.A.H.).

Author information

Authors and Affiliations

Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich, Switzerland
Gisela Gabernet, Damian Gautschi, Alex T. Müller, Claudia S. Neuhaus, Jan A. Hiss & Gisbert Schneider
Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
Lucas Armbrecht & Petra S. Dittrich

Authors

Gisela Gabernet
View author publications
You can also search for this author in PubMed Google Scholar
Damian Gautschi
View author publications
You can also search for this author in PubMed Google Scholar
Alex T. Müller
View author publications
You can also search for this author in PubMed Google Scholar
Claudia S. Neuhaus
View author publications
You can also search for this author in PubMed Google Scholar
Lucas Armbrecht
View author publications
You can also search for this author in PubMed Google Scholar
Petra S. Dittrich
View author publications
You can also search for this author in PubMed Google Scholar
Jan A. Hiss
View author publications
You can also search for this author in PubMed Google Scholar
Gisbert Schneider
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

G.G., D.G., A.T.M. and C.S.N. performed the peptide syntheses and activity assays. G.G. and L.A. performed the microfluidics assay. J.A.H., P.S.D. and G.S. designed and supervised the study. G.G., A.T.M. and G.S. programmed the software. All authors analyzed the data and contributed to the manuscript. G.G. and G.S. wrote the manuscript.

Corresponding author

Correspondence to Gisbert Schneider.

Ethics declarations

Competing Interests

G.S. declares a potential financial conflict of interest in his role as life-science industry consultant and cofounder of inSili.com GmbH, Zurich. No further competing interests are declared.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Video SV1

Supplementary Information

Supplementary Dataset 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gabernet, G., Gautschi, D., Müller, A.T. et al. In silico design and optimization of selective membranolytic anticancer peptides. Sci Rep 9, 11282 (2019). https://doi.org/10.1038/s41598-019-47568-9

Download citation

Received: 20 December 2018
Accepted: 17 June 2019
Published: 02 August 2019
DOI: https://doi.org/10.1038/s41598-019-47568-9

This article is cited by

A Review of the in Silico Design and Development Approaches of Ras-Specific Anticancer Therapeutics
- Parinaz Motiei
- Hamid Reza Heidari
- Elnaz Mehdizadeh Aghdam
International Journal of Peptide Research and Therapeutics (2023)
Anti-cancer Peptide Recognition Based on Grouped Sequence and Spatial Dimension Integrated Networks
- Hongfeng You
- Long Yu
- Weidong Wu
Interdisciplinary Sciences: Computational Life Sciences (2022)
Copper-binding anticancer peptides from the piscidin family: an expanded mechanism that encompasses physical and chemical bilayer disruption
- Fatih Comert
- Frank Heinrich
- Mihaela Mihailescu
Scientific Reports (2021)
Machine learning-guided discovery and design of non-hemolytic peptides
- Fabien Plisson
- Obed Ramírez-Sánchez
- Cristina Martínez-Hernández
Scientific Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results and Discussion

ACP classifier model

Feature importance for ACP activity

De novo design of ACPs

Selectivity optimization of a de novo designed ACP

NCI-60 cancer cell panel testing

Conclusions

Methods

Machine learning model

Scoring metrics

Data weighted scoring functions

Virtual peptide libraries

Simulated molecular evolution

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links