In Silico Models for Designing and Discovering Novel Anticancer Peptides

Tyagi, Atul; Kapoor, Pallavi; Kumar, Rahul; Chaudhary, Kumardeep; Gautam, Ankur; Raghava, G. P. S.

doi:10.1038/srep02984

Download PDF

Article
Open access
Published: 18 October 2013

In Silico Models for Designing and Discovering Novel Anticancer Peptides

Atul Tyagi¹^na1,
Pallavi Kapoor¹^na1,
Rahul Kumar¹^na1,
Kumardeep Chaudhary¹^na1,
Ankur Gautam¹^na1 &
…
G. P. S. Raghava¹^na1

Scientific Reports volume 3, Article number: 2984 (2013) Cite this article

11k Accesses
194 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Use of therapeutic peptides in cancer therapy has been receiving considerable attention in the recent years. Present study describes the development of computational models for predicting and discovering novel anticancer peptides. Preliminary analysis revealed that Cys, Gly, Ile, Lys and Trp are dominated at various positions in anticancer peptides. Support vector machine models were developed using amino acid composition and binary profiles as input features on main dataset that contains experimentally validated anticancer peptides and random peptides derived from SwissProt database. In addition, models were developed on alternate dataset that contains antimicrobial peptides instead of random peptides. Binary profiles-based model achieved maximum accuracy 91.44% with MCC 0.83. We have developed a webserver, which would be helpful in: (i) predicting minimum mutations required for improving anticancer potency; (ii) virtual screening of peptides for discovering novel anticancer peptides and (iii) scanning natural proteins for identification of anticancer peptides (http://crdd.osdd.net/raghava/anticp/).

PERCEPTION predicts patient response and resistance to treatment using single-cell transcriptomics of their tumors

Article 18 April 2024

Discovery of potent inhibitors of α-synuclein aggregation using structure-based iterative learning

Article Open access 17 April 2024

A small-molecule TNIK inhibitor targets fibrosis in preclinical and clinical models

Article Open access 08 March 2024

Introduction

Cancer with leading cause of deaths remains the matter of health concern for both developed and developing countries¹. Despite the advances in cancer treatments, mortality rate due to this deadly disease is still very high¹. Owing to the development of resistance by cancer cells towards current anti-cancer chemotherapeutic drugs, there is an urgent need to add new weapons in the anti-cancer drug arsenal to fight with this deadly disease. In the last decade, small peptides having anticancer properties have emerged as a potential alternative approach for cancer therapy². Peptide-based therapy has numerous advantages over small molecules that involve high specificity, low production cost, high tumor penetration, ease of synthesis and modification etc³.

Anticancer peptides (ACPs) are small (5–30 amino acids) peptides, often derived from antimicrobial peptides (AMPs) and are cationic in nature⁴. Previous studies have demonstrated that many cationic AMPs, which are toxic to bacteria but not to normal cells, show a broad spectrum cytotoxicity against various cancer cells⁵. Although ACP is a rapidly emerging field, their mechanism of action remains elusive. However, few studies have suggested that there are few differences between the cell membranes of cancer and normal cells and selective killing of cancer cells by certain ACPs could be due to these differences^4,5. In this context, electrostatic interactions between cationic amino acids of ACPs and anionic components of cancer cell membranes are suggested to be one of the major contributing factors in the selective killing of cancer cells by ACPs⁴. Also, high membrane fluidity and high cell-surface area^6,7 of cancer cells compared to untransformed cells lead to enhance the lytic activity of ACPs and binding of the increased number of ACPs, respectively. In addition, few ACPs induce apoptosis (program cell death) by disrupting mitochondrial membrane when delivered into the cancer cells⁸. Many peptide-based therapies to treat various tumor types are currently being evaluated in various phases of preclinical and clinical trials^9,10,11,12. The success of these peptides in clinics has open the door for ACPs to reach clinical settings.

Keeping in mind the immense therapeutic importance of ACPs, in the present study, we have made a systematic attempt to develop in silico methods for the prediction and designing of ACPs. Support vector machine (SVM) based models using various features of peptides like amino acid composition, dipeptide composition and binary profile pattern have been developed. In addition, models discriminating ACPs from AMPs have also been developed. Binary profile-based SVM model using NT10 dataset achieved maximum accuracy of 91.44% with MCC and AUC values 0.83 and 0.94 respectively. To assist scientific community, for the first time, a user-friendly webserver, AntiCP, has been developed to predict and design highly efficacious ACPs.

Results

Compositional analysis

We wanted to develop in silico models, which can differentiate ACPs from non-ACPs, as well as ACPs from AMPs. Therefore, first we sought to determine the frequency of occurrence of all 20 amino acids in these peptides. For this, percent average composition of amino acids in ACPs, non-ACPs (random peptides) and AMPs were calculated and compared. As shown in Figure 1, certain residues, including Gly, Lys, Cys, Phe, Ile and Trp were found to be abundant in ACPs compared to non-ACPs while Gly, Ala, Lys and Leu were abundant in AMPs compared to ACPs and non-ACPs. Since terminal residues play crucial roles in biological functions of peptides¹³, we computed and compared the percent average amino acid composition of N-terminal and C-terminal residues (split amino acid composition) in these peptides. As shown in Figure 2A and 2B, average amino acid compositions of terminal residues are more or less similar to whole amino acid composition. However, among N-terminal residues, only Cys was found to be in a higher proportion in ACPs compared to both AMPs and non-ACPs. In C-terminal residue analysis, Tyr and Trp were found to abundant in ACPs compared to both AMPs and non-ACPs (Figure 2B).

Residue preference

In order to understand residue preference at both termini of peptides, we computed sequence logos. The sequence logos of 10 N-terminal and 10 C-terminal residues are shown in Figure 3A and 3B. As shown, no exclusive preference of residues was observed except Gly at the first position at N-terminus. However, there are few residues like Leu, Lys, Ala and Phe at N-terminus and Val, Cys, Leu and Lys at C-terminus which are also preferred but relatively less preferred than Gly at various positions.

Support vector machine models

SVM models were developed on both realistic datasets (main datasets and alternate datasets) and balanced datasets (balanced dataset-1 and balanced dataset-2) using amino acid composition, dipeptide composition and binary profiles as input features.

SVM model based on amino acid composition

Since certain residues were found to be abundant over others in ACPs and AMPs, ACPs can be discriminated from non-ACPs and AMPs on the basis of their amino acid composition. Therefore, we have developed whole amino acid composition-based SVM models. The performance of whole composition-based SVM models has been shown in Table 1 and 2. The whole composition-based SVM model developed on balanced dataset-1 achieved maximum accuracy of 88.89% with MCC and AUC values 0.78 and 0.94 respectively (Table 1 and Figure 4A). In addition, SVM models based on split amino acid composition (NT5, CT5, NT5CT5, NT10, CT10 and NT10CT10) were also developed. The performance of these models is summarized in Table 1. Model developed with NT10CT10 dataset performed similar to whole composition-based model and achieved maximum accuracy of 88.4% with MCC and AUC values of 0.77 and 0.93, respectively (Table 1). The performance of models developed on main dataset was comparable to models developed on balanced dataset-1 (Table 1).

Table 1 The performance of amino acid composition-based models on main dataset

Full size table

Table 2 Performances of amino acid composition-based models on alternate dataset

Full size table

Similarly, SVM models on balanced dataset-2 and alternate dataset were also developed using amino acid composition as input features. The performances of these models are summarized in Table 2. The overall performances of the models developed with balanced dataset-2 were more or less similar to models developed with balanced dataset-1. The whole composition-based SVM model developed on balanced dataset-2 achieved maximum accuracy of 85.33% with MCC and AUC values 0.71 and 0.90 respectively. Similarly, models based on split amino acid composition were also developed (Table 2) and the model developed on NT10CT10 dataset achieved maximum accuracy of 87.73% with MCC and AUC values 0.75 and 0.92 respectively (Table 2). Amino acid composition based models developed on alternate dataset performed poorer than the models developed on balanced dataset-2 (Table 2).

Dipeptide composition-based SVM model

In many previous studies, SVM model based on dipeptide composition has been developed to discriminate different classes of peptides^14,15,16. Dipeptide composition is a simple feature and it encapsulates information of the amino acid fraction as well as local order of amino acids. Therefore, SVM models based on dipeptide composition have been constructed on all the datasets. Performances of dipeptide composition-based models are summarized in Table 3 and 4. Models developed on balanced dataset-1 achieved maximum accuracy of 87.78% with an MCC and AUC values 0.76 and 0.93 respectively (Table 3, Figure 4B). For balanced dataset-2, models developed on whole peptide and NT5CT5 datasets achieved maximum accuracy of 86.89% with MCC and AUC values 0.74 and 0.91 respectively.

Table 3 Performance of dipeptide composition-based models on main dataset

Full size table

Table 4 Performance of dipeptide composition-based model on alternate dataset

Full size table

Binary profile based SVM model

Since apart from composition, order of amino acid is also important feature, therefore, to implement information about frequency as well as the order of residues, we developed models based on binary profiles of peptides. We have used the following three approaches.

N-terminal (NT) approach

For balanced dataset-1, the accuracies of the models developed on NT5 and NT10 datasets were 80.89% and 83.95% with MCC 0.62, 0.68 and AUC 0.87, 0.91 respectively (Table 5). For balanced dataset-2, models developed on NT5 and NT10 datasets achieved maximum accuracies 88.44% and 91.44% with MCC 0.77 and 0.83 and AUC values 0.93 and 0.94 respectively (Table 6 and Figure 4C).

Table 5 Performance of binary profile-based model on main dataset

Full size table

Table 6 Performance of binary profile-based model on alternate dataset

Full size table

C-terminal (CT) approach

Similarly, models were developed using 5 and 10 C-terminal residues and performances are summarized in Table 5 and 6. For balanced dataset-1, model developed using 5 and 10 C-terminal residues (CT5 and CT10) achieved accuracies 74.67% and 79.75% with MCC 0.51, 0.60 and AUC 0.79, 0.84 respectively (Table 5). For balanced dataset-2, models developed on CT5 and CT10 datasets achieved maximum accuracies 78.22% and 78.7% with MCC 0.57 and 0.58 and AUC values 0.83 and 0.86 respectively (Table 6).

N + C-terminal (NTCT) approach

Similar strategy, as used in the N- and C-terminal approaches, was applied in this approach also. The comparative performances of SVM model based on N + C terminal residues are shown in Table 5 and 6. For balanced dataset-1, model developed on NT10CT10 datasets achieved maximum accuracy 84.94% with MCC 0.70 and AUC 0.91 (Table 5). For balanced dataset-2, model developed on NT10CT10 dataset achieved maximum accuracy 90.74% with MCC 0.82 and AUC 0.94 (Table 6).

Performance on independent dataset

In order to validate our models, we have evaluated the performances of our best models on an independent dataset. The amino acid composition-based model achieved accuracy 86% with MCC 0.72 while model based on binary profiles (NT10) achieved accuracy 89% with MCC 0.78. These results indicate that our models performed equally well on an independent dataset suggesting that our models are not over trained and may also work in real life. We evaluated the performance of both models (amino acid composition and binary (NT10) based models) using five-fold and ten-fold cross-validation and achieved similar results. In addition, we evaluated the performance of our models 100 times, each time training and testing set of peptides were reshuffled randomly. We computed average performance of these 100 models with standard error, which is summarized in the supplementary information. The average performance of our models indicates that even after repeating 100 times, models performed similarly. This evaluation further demonstrates the reliability of models developed in this study.

Implementation and description of webserver

In order to serve the scientific community, the best SVM-based models were implemented to build a webserver (AntiCP, Figure 5) using a CGI/Perl script. Various tools have been integrated to assist users to design and predict ACPs (Figure 5). Users may submit the peptide and the server will generate all the possible single substitution mutants of a given peptide. Besides generating mutants, server will also give prediction status as ACP or non-ACP. Along with this, server calculates key physico-chemical properties in a Tabular format. In addition, user can discover novel ACPs by screening multiple peptides at a time. For this, virtual screening tool has been integrated where user has to submit multiple peptide sequences in FASTA format. Another powerful tool is protein scan, which will be useful for the detection of putative ACP regions in the protein. Here, user may submit the protein sequence and overlapping peptides will be generated by the server, where all the peptides will be clickable. Sorting of results in ascending/descending order of their values is another attractive feature provided with the web server. AntiCP is freely accessible at http://crdd.osdd.net/raghava/anticp.

Discussion

The peptide-based therapeutics is gaining tremendous interest nowadays^2,3, which has been reflected in the papers published in the last five years. Many peptides-based strategies for targeting and delivering therapeutics to various tumor types have been used over the years² and few of them have successfully translated into the clinics. In this context, ACPs have also been emerged as promising candidates for cancer therapy⁴. Identification and development of novel ACPs in the wet lab is extremely time consuming and labor intensive approach. Therefore, development of in silico methods, which can predict ACPs prior to their synthesis is the need of the hour. Such prediction methods are not only helpful for biologists for designing effective ACPs, but also save money and time. The present study describes an in silico method for designing and predicting ACPs. For the development of SVM models, both positive and negative examples are required. Therefore, we have collected 225 experimentally validated ACPs from literature and from various databases^17,18,19. Since, experimentally validated non-ACPs were not reported in the literature, equal number of negative examples were generated randomly from SwissProt proteins and these peptides were assumed to be non-ACPs. This approach has been used in number of previous studies^15,16,20 where sufficient amount of negative examples were not available in the literature. As it was observed that most of the ACPs are derived from AMPs, we have collected AMPs without anti-cancer activities (no anti-cancer activities reported in the literature) and developed alternate dataset, which comprises ACPs as positive examples and AMPs without anti-cancer activities as negative examples. The models developed on this dataset discriminated ACPs from AMPs.

A preliminary analysis of amino acid composition has shown that certain residues are dominated in ACPs/AMPs. These differences in amino acid composition between ACPs/AMPs and non-ACPs prompted us to develop SVM models based on amino acid composition and dipeptide composition of peptides. The whole composition-based model performed reasonably well and model developed on balanced dataset-1 performed the best among the rest of the whole composition-based models. However, models developed on split amino acid compositions could not perform better than the whole composition-based models and it was expected as there was not significant difference observed in amino acid composition between ACPs, non-ACPs and AMPs at N- and C-terminal residues (Figure 1 and 2). We compared the performance of models developed on balanced and realistic datasets and got similar results.

Dipeptide composition is an attractive feature which encapsulates the information of fraction of amino acids as well as their local order. Therefore, we have developed SVM models using dipeptide composition. As shown in the result section, performance of dipeptide composition-based models performed comparable to amino acid composition-based model.

It is well known that peptide's function is strongly related to its residue order. Plethora of studies has suggested that the membrane interaction and insertion of membrane-active peptides (e.g. AMPs, cell penetrating peptides, ACPs, etc.) could be due to their conformation (e.g. helical, β stranded, etc.)^21,22, which can be associated to a particular order of amino acids or distribution of residues. Thus, apart from composition of amino acids, order of amino acids is also important feature and might be associated with anti-cancer properties of ACPs. Therefore, to incorporate the order information, binary profiles of the peptides were generated. Binary profiles encapsulate information of both composition and order of amino acids. In many previous studies, binary profiles based models have been used to discriminate various classes of peptide/proteins^15,16. In the present study, binary-based models performed reasonably well. In order to provide service to the scientific community, we have implemented best models in a webserver, AntiCP, which is freely available. We hope that our method will provide momentum in the discovery and designing of novel efficient ACPs.

Methods

Datasets

We have extracted 225 experimentally validated anticancer peptides from literature and databases like antimicrobial database (APD, http://aps.unmc.edu/AP/main.php)¹⁷, collection of antimicrobial peptides (CAMP, http://www.bicnirrh.res.in/antimicrobial)¹⁸ and database of anuran defense peptides (DADP, http://split4.pmfst.hr/dadp/)¹⁹. Majority of these peptides are AMPs with a broad spectrum anticancer activities. All these peptides were unique and considered as positive examples. Since there are very few experimentally proved non-anticancer peptides, we derived 2250 random peptides from SwissProt proteins. In this study, we assign these random peptides as non-ACPs (negative examples), though it is possible that some of these random peptides have anticancer properties. We also extracted AMPs from above databases like APD, CAMP, DADP for which no anticancer activity was reported in the literature and considered as non-ACPs. Following datasets were derived from the above data.

Main dataset

This dataset contains 225 experimentally validated anticancer (positive examples) and 2250 random or potential non-anticancer peptides (negative examples).

Alternate dataset

This dataset contains 225 experimentally validated anticancer peptides and 1372 non-anticancer (AMPs without anticancer activities, negative examples).

Balanced datasets

It is a well known fact that classification techniques, particularly machine learning techniques performed best on balanced datasets. Thus, we generated balanced datasets for both main and alternate datasets. Our main balanced dataset contains 225 anticancer and 225 non-anticancer or random peptides (randomly obtained 2250 SwissProt peptides). Similarly, alternate balanced dataset contains 225 anticancer and 225 non-anticancer or AMPs (randomly obtained from 1372 AMPs).

Independent dataset

For developing independent dataset, we collected 50 experimentally validated ACPs from literature and patents and an equal number of random peptides were generated from SwissProt proteins and considered as negative examples. None of the peptides in independent dataset is identical to peptides in training or testing dataset.

Support vector machine

In this study, we developed models for discriminating anticancer and non-anticancer peptides using a highly successful machine learning technique, support vector machine (SVM)²³. We developed SVM models using SVM^light Version 6.02 package. Various features, including amino acid composition, dipeptide composition and binary profile of pattern were used as input features.

Residue composition as input features

In order to develop SVM models based on machine learning techniques, one needs fixed length input features. Our dataset contains peptides of variable length; thus we have computed composition profile of peptides. In this study, we computed amino acid and dipeptide composition where information is encapsulated in a vector of 20 and 400 dimensions respectively. The calculation of amino acid and dipeptide composition was described previously^15,16.

Binary profile of patterns

Binary profiles is a key feature and has been used in a number of existing methods.

It encapsulates information of both composition and order of amino acid in peptides. Therefore, binary profiles for first 5 and 10 residues from N- and C-terminus were generated for each peptide, where each amino acid is represented by a vector of dimensions of 20 (e.g. Ala by 1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0) as described previously¹⁵.

Sequence logos

The sequence logos, which provides information about the position specific frequency of amino acids in peptide, were generated using the WebLogo software²⁴.

Performance measures

The performance of models were evaluated using threshold-dependent and threshold-independent parameters. Sensitivity (Sn), specificity (Sp), accuracy (Ac) and Matthew's correlation coefficient (MCC) were used as threshold-dependent parameters as previously described¹⁵. For threshold-independent parameter, ROC (Receiver Operating Characteristic) for all of the models were created in order to evaluate the performance of models.

Cross validation technique

The ten-fold cross validation technique was used to evaluate the performance of various SVM models. In this technique, sequences are randomly divided into ten sets, of which nine sets are used for training and the remaining tenth set for testing. The process is repeated ten times in such a way that each set is used once for testing. Final performance is obtained by averaging the performance of all the ten sets.

References

Siegel, R., Naishadham, D. & Jemal, A. Cancer statistics, 2013. CA Cancer J Clin 63, 11–30 (2013).
PubMed Google Scholar
Thundimadathil, J. Cancer treatment using peptides: current therapies and future prospects. J Amino Acids 2012, 967347 (2012).
PubMed PubMed Central Google Scholar
Vlieghe, P., Lisowski, V., Martinez, J. & Khrestchatisky, M. Synthetic therapeutic peptides: science and market. Drug Discov Today 15, 40–56 (2010).
CAS PubMed Google Scholar
Mader, J. S. & Hoskin, D. W. Cationic antimicrobial peptides as novel cytotoxic agents for cancer treatment. Expert Opin Investig Drugs 15, 933–946 (2006).
CAS PubMed Google Scholar
Hoskin, D. W. & Ramamoorthy, A. Studies on anticancer activities of antimicrobial peptides. Biochim Biophys Acta 1778, 357–375 (2008).
CAS PubMed Google Scholar
Kozlowska, K., Nowak, J., Kwiatkowski, B. & Cichorek, M. ESR study of plasmatic membrane of the transplantable melanoma cells in relation to their biological properties. Exp Toxicol Pathol 51, 89–92 (1999).
CAS PubMed Google Scholar
Sok, M., Sentjurc, M. & Schara, M. Membrane fluidity characteristics of human lung cancer. Cancer Lett 139, 215–220 (1999).
CAS PubMed Google Scholar
Ellerby, H. M. et al. Anti-cancer activity of targeted pro-apoptotic peptides. Nat Med 5, 1032–1038 (1999).
CAS PubMed Google Scholar
Hariharan, S. et al. Assessment of the biological and pharmacological effects of the alpha nu beta3 and alpha nu beta5 integrin receptor antagonist, cilengitide (EMD 121974), in patients with advanced solid tumors. Ann Oncol 18, 1400–1407 (2007).
CAS PubMed Google Scholar
Gregorc, V. et al. Phase I study of NGR-hTNF, a selective vascular targeting agent, in combination with cisplatin in refractory solid tumors. Clin Cancer Res 17, 1964–1972 (2011).
CAS PubMed Google Scholar
Khalili, P. et al. A non-RGD-based integrin binding peptide (ATN-161) blocks breast cancer growth and metastasis in vivo. Mol Cancer Ther 5, 2271–2280 (2006).
CAS PubMed Google Scholar
Deplanque, G. et al. Phase II trial of the antiangiogenic agent IM862 in metastatic renal cell carcinoma. Br J Cancer 91, 1645–1650 (2004).
CAS PubMed PubMed Central Google Scholar
Otvos, L., Jr Antibacterial peptides and proteins with multiple cellular targets. J Pept Sci 11, 697–706 (2005).
CAS PubMed Google Scholar
Petrilli, P. Classification of protein sequences by their dipeptide composition. Comput Appl Biosci 9, 205–209 (1993).
CAS PubMed Google Scholar
Gautam, A. et al. In silico approaches for designing highly effective cell penetrating peptides. J Transl Med 11, 74 (2013).
CAS PubMed PubMed Central Google Scholar
Sharma, A. et al. Computational approach for designing tumor homing peptides. Sci Rep 3, 1607 (2013).
PubMed PubMed Central Google Scholar
Wang, G., Li, X. & Wang, Z. APD2: the updated antimicrobial peptide database and its application in peptide design. Nucleic Acids Res 37, D933–937 (2009).
CAS PubMed Google Scholar
Thomas, S., Karnik, S., Barai, R. S., Jayaraman, V. K. & Idicula-Thomas, S. CAMP: a useful resource for research on antimicrobial peptides. Nucleic Acids Res 38, D774–780 (2010).
CAS PubMed Google Scholar
Novkovic, M., Simunic, J., Bojovic, V., Tossi, A. & Juretic, D. DADP: the database of anuran defense peptides. Bioinformatics 28, 1406–1407 (2012).
Article CAS Google Scholar
Sanders, W. S., Johnston, C. I., Bridges, S. M., Burgess, S. C. & Willeford, K. O. Prediction of cell penetrating peptides by support vector machines. PLoS Comput Biol 7, e1002101 (2011).
ADS CAS PubMed PubMed Central Google Scholar
Huang, Y. B., Wang, X. F., Wang, H. Y., Liu, Y. & Chen, Y. Studies on mechanism of action of anticancer peptides by modulation of hydrophobicity within a defined structural framework. Mol Cancer Ther 10, 416–426 (2011).
CAS PubMed Google Scholar
Eiriksdottir, E., Konate, K., Langel, U., Divita, G. & Deshayes, S. Secondary structure of cell-penetrating peptides controls membrane interaction and insertion. Biochim Biophys Acta 1798, 1119–1128 (2010).
CAS PubMed Google Scholar
Joachims, T. Making large-scale support vector machine learning practical, 169–184 (Scholkopf B., Burges C., & Smola A., Cambridge, MA: MIT Press 1999).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res 14, 1188–1190 (2004).
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

Authors are thankful to funding agencies Council of Scientific and Industrial Research (project Open Source Drug discovery and GENESIS BSC0121) and Department of Biotechnology (project BTISNET), Govt. of India for financial support.

Author information

Tyagi Atul and Kapoor Pallavi contributed equally to this work.

Authors and Affiliations

Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, 160036, India
Atul Tyagi, Pallavi Kapoor, Rahul Kumar, Kumardeep Chaudhary, Ankur Gautam & G. P. S. Raghava

Authors

Atul Tyagi
View author publications
You can also search for this author in PubMed Google Scholar
Pallavi Kapoor
View author publications
You can also search for this author in PubMed Google Scholar
Rahul Kumar
View author publications
You can also search for this author in PubMed Google Scholar
Kumardeep Chaudhary
View author publications
You can also search for this author in PubMed Google Scholar
Ankur Gautam
View author publications
You can also search for this author in PubMed Google Scholar
G. P. S. Raghava
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.T. and P.K. collected the data and created the datasets. A.T., R.K., P.K. and K.C. developed computer programs, implemented SVM and created the back end server. A.T., A.G., P.K. and K.C. developed the front end user interface. A.G. and P.K. analyzed the results and wrote the manuscript. G.P.S.R. conceived and coordinated the project, helped in the interpretation of data, refined the drafted manuscript and gave overall supervision to the project. All of the authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Supplementary Dataset 1

Supplementary Information

Supplementary Dataset 2

Supplementary Information

Supplementary Dataset 3

Supplementary Information

Supplementary Dataset 4

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/

Reprints and permissions

About this article

Cite this article

Tyagi, A., Kapoor, P., Kumar, R. et al. In Silico Models for Designing and Discovering Novel Anticancer Peptides. Sci Rep 3, 2984 (2013). https://doi.org/10.1038/srep02984

Download citation

Received: 15 May 2013
Accepted: 27 September 2013
Published: 18 October 2013
DOI: https://doi.org/10.1038/srep02984

This article is cited by

Derivation of a novel antimicrobial peptide from the Red Sea Brine Pools modified to enhance its anticancer activity against U2OS cells
- Mona Elradi
- Ahmed I. Ahmed
- Asma Amleh
BMC Biotechnology (2024)
Design and investigation of interactions of novel peptide conjugates of purine and pyrimidine derivatives with EGFR and its mutant T790M/L858R: an in silico and laboratory study
- Hannah L. Hunt
- Beatriz G. Goncalves
- Ipsita A. Banerjee
Molecular Diversity (2024)
Characterization of a novel peptide mined from the Red Sea brine pools and modified to enhance its anticancer activity
- Youssef T. Abdou
- Sheri M. Saleeb
- Asma Amleh
BMC Cancer (2023)
Novel cationic cryptides in Penaeus vannamei demonstrate antimicrobial and anti-cancer activities
- Amr Adel Ahmed Abd El-Aal
- Fairen Angelin Jayakumar
- Kavita Reginald
Scientific Reports (2023)
GCNCPR-ACPs: a novel graph convolution network method for ACPs prediction
- Xiujin Wu
- Wenhua Zeng
- Fan Lin
BMC Bioinformatics (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Compositional analysis

Residue preference

Support vector machine models

SVM model based on amino acid composition

Dipeptide composition-based SVM model

Binary profile based SVM model

N-terminal (NT) approach

C-terminal (CT) approach

N + C-terminal (NTCT) approach

Performance on independent dataset

Implementation and description of webserver

Discussion

Methods

Datasets

Main dataset

Alternate dataset

Balanced datasets

Independent dataset

Support vector machine

Residue composition as input features

Binary profile of patterns

Sequence logos

Performance measures

Cross validation technique

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links