Computational approach for designing tumor homing peptides

Sharma, Arun; Kapoor, Pallavi; Gautam, Ankur; Chaudhary, Kumardeep; Kumar, Rahul; Chauhan, Jagat Singh; Tyagi, Atul; Raghava, Gajendra P. S.

doi:10.1038/srep01607


Article
Open access
Published: 05 April 2013


Scientific Reports volume 3, Article number: 1607 (2013)


Abstract

Tumor homing peptides are small peptides that home specifically to tumor and tumor associated microenvironment i.e. tumor vasculature, after systemic delivery. Keeping in mind the huge therapeutic importance of these peptides, we have made an attempt to analyze and predict tumor homing peptides. It was observed that certain types of residues are preferred in tumor homing peptides. Therefore, we developed support vector machine based models for predicting tumor homing peptides using amino acid composition and binary profiles of peptides. Amino acid composition, dipeptide composition and binary profile-based models achieved a maximum accuracy of 86.56%, 82.03% and 84.19% respectively. These methods have been implemented in a user-friendly web server, TumorHPD. We anticipate that this method will be helpful to design novel tumor homing peptides. TumorHPD web server is freely accessible at http://crdd.osdd.net/raghava/tumorhpd/.


Introduction

Cancer is a major public health concern and remains a leading cause of mortality across the globe. This devastating disease affects both developed and developing countries. Despite considerable progress in understanding the molecular basis of cancer, the mortality rate is still high¹. Chemotherapy is the principal mode of current cancer treatment, but it is limited by significant toxicity and frequently acquired resistance². In the last decade, treatment options for cancer have shifted towards more specific targeted therapies^3,4. Many strategies have been exploited to target tumors. The most commonly used strategy is engineered antibodies or antibody fragments⁵. Though monoclonal antibodies are very selective, poor penetration inside tumors and high production cost hinder their usage as therapeutic agents⁶. The use of peptides for tumor targeting is now receiving much attention. In this context, tumor homing peptides (THPs) have become a very promising strategy to deliver therapeutics to the tumor site. In the last decade, much attention has been paid to targeting tumor cells or tumor vasculature using THPs⁷.

THPs are short peptides (3–15 amino acids) that specifically recognize and bind to tumor cells or tumor vasculature. Since the introduction of the tumor homing concept in 1998, a large number of THPs have been identified by in vitro and in vivo phage display technology. THPs have some common motifs, such as RGD and NGR, which specifically bind to a surface molecule on tumor cells or tumor vasculature. For example, the RGD peptide binds to α integrins⁸ and NGR binds to the receptor aminopeptidase N, which is present on the surface of tumor endothelial cells⁹. Due to their tumor homing capability, THPs are being used in cancer diagnosis and treatment. Many anti-cancer drugs and imaging agents have been targeted to the tumor site in mouse models once conjugated with THPs¹⁰. The results of such studies are very encouraging and a few THPs are already in clinical trials¹¹.

With such potential of THPs in cancer therapeutics, the computer aided prediction of THPs would be very beneficial in designing and developing novel THPs, thus saving time and labor of experimental biologists. To the best of authors' knowledge, no method has been developed for predicting/designing THPs. In the present study, a systematic attempt has been made to develop highly accurate support vector machine (SVM)-based models using various features of proteins/peptides like amino acid composition (AAC), dipeptide composition (DPC) and binary profile patterns (BPP). A user-friendly web server has also been developed to help the cancer biologists to predict and design THPs.

Results

Analysis of THPs

Compositional analysis

In order to find out overall dominant residues in THPs, we computed and compared percent amino acid composition of THPs and non-THPs in the main dataset. It was observed that certain types of residues like C, R, G, W, P, L and S are more abundant in THPs (Figure 1). In order to understand preference of residues at N- and C-terminals, we computed and compared percent AAC of N- and C-terminus residues of THPs and non-THPs. However, we did not find any significant difference in AAC in terminal residues (data not shown).

Preference of residues

In order to understand the preference for certain types of residues at different positions in THPs, we generated sequence logos. The sequence logos of the 10 N-terminal and 10 C-terminal residues of peptides are shown in Figures 2 and 3, respectively. As shown in Figure 2, certain residues are preferred at specific positions, e.g., C, A, S, G at the first position and G, R, P, E at the second position. Overall, THPs are dominated by certain types of residues, such as C, G, L and P, which are present at most of the positions. Similarly, certain residues are preferred at the C-terminus (Figure 3); for example, residues P, R, C, N and S are preferred at most of the positions.

AAC-based model

In the compositional analysis of THPs, it was observed that certain residues dominate over others. This suggests that THPs and non-THPs can be discriminated on the basis of their AAC. Based on this observation, we developed an SVM model on the main dataset. The performance of the AAC-based SVM model is shown in Table 1. The model developed on the main dataset achieved a maximum accuracy of 82.52% with an MCC and area under the curve (AUC) of 0.65 and 0.90, respectively. Similarly, SVM models were developed on subsets NT5, CT5, NTCT5, NT10, CT10 and NTCT10, and the performances of these models are summarized in Table 1. The model developed with the NTCT10 dataset achieved a slightly higher accuracy of 86.56% with MCC 0.70 and AUC 0.91.

Table 1 Performances of SVM models developed using amino acid composition of peptides


DPC-based model

DPC has been used previously to discriminate different classes of proteins¹². Dipeptide composition encapsulates the global information about amino acid fractions as well as the local order of amino acids. In principle, DPC is therefore a richer feature than AAC, so SVM models based on DPC were constructed on all the datasets. Performances of DPC-based models are summarized in Table 2. In practice, however, the DPC-based models performed worse than the AAC-based models. The DPC-based model developed with the main dataset achieved a maximum accuracy of 81.29% with MCC and AUC values of 0.63 and 0.90, respectively, which is lower than the accuracy of the corresponding AAC-based model (Table 2). The model developed with the NTCT10 dataset achieved a maximum accuracy of 82.03% with MCC and AUC values of 0.63 and 0.88, respectively.

Table 2 The performances of SVM models developed using dipeptide composition of peptides


BPP-based method

In THPs, certain residues are preferred at specific positions at the N- and C-terminus (Figures 2 and 3). Therefore, to capture information about the frequency as well as the order of residues, we developed a method using binary profile patterns (BPP) of peptides. In a binary pattern, a vector of dimension 20 represents a residue, so for N residues the input vector has dimension 20 × N. We used the following three approaches:

N-terminal approach: In this approach, we used subsets NT5 and NT10, which consist of the 5 and 10 N-terminal residues of THPs and non-THPs (see Methods). We extracted the 5 and 10 N-terminal residues from each peptide and generated binary profiles of dimension 5 × 20 and 10 × 20, respectively. These profiles were then used to develop SVM models. The accuracies of the models developed on the NT5 and NT10 datasets were 77.08% and 81.03%, with MCC of 0.54 and 0.62 and AUC of 0.84 and 0.89, respectively (Table 3).
C-terminal approach: We adopted the same strategy as for the N-terminal approach, except that the residues were taken from the C-terminus, using subsets CT5 and CT10. The performance of the BPP-based SVM models using 5 and 10 C-terminal residues was similar to that of the N-terminal approach. As shown in Table 3, we achieved maximum accuracies of 76.38% and 79.84% with MCC of 0.53 and 0.60 for 5 and 10 C-terminal residues of peptides, respectively.
N- and C-terminal approach: In order to check whether using the N- and C-terminal residues of the peptides together would enhance the accuracy of the method, we developed models on two subsets named NTCT5 and NTCT10. The first model was developed using the BPP of the first 5 residues from the N-terminus and the last 5 residues from the C-terminus. The second model was developed using the BPP of 10 residues from the N-terminus and 10 residues from the C-terminus. As shown in Table 3, we achieved a maximum accuracy of 84.19% with MCC 0.69 and AUC 0.91 for the NTCT10 subset.

Table 3 Performances of SVM models developed using binary profile of peptides

SVM model on peptides with length up to 10

Since most of the THPs have a length between 4 and 10 residues, we constructed a dataset (469 peptides) consisting of peptides having length up to 10. SVM models were developed using all the above features and terminal windows of size 5. Performances of all models are summarized in Table 4. A maximum accuracy of 81.88% with MCC of 0.65 and AUC of 0.88 was achieved with the binary profile of the NTCT5 dataset (Table 4).

Table 4 Performance of monopeptide, dipeptide and binary profiles-based SVM models on dataset of peptides having length up to 10


ROC Plot

In order to have a threshold-independent evaluation of our models, we generated receiver operating characteristic (ROC) curves for these models. The PASW statistical package was used for creating ROC plots with area under the curve (AUC). The AUC gives a single value with which to evaluate the performance of a method. The BPP-based method using the hybrid of N-terminal and C-terminal residues (window sizes 5 and 10) performed better than the AAC-based method. ROC plots are shown in Figure 4.

Performance on independent dataset

In order to validate our in silico methods, the performances of our best methods (whole composition, NTCT5, NTCT10 and NTCT5 (up to 10)) were evaluated on an independent dataset. All these models performed reasonably well, as shown in Table 5, indicating that they are useful in practice. The composition-based model achieved the highest accuracy of 83.73% among all these models.

Table 5 Performances on independent dataset


Implementation and utility of TumorHPD

TumorHPD not only provides a facility to predict THPs, but also offers the opportunity to design analogues with better tumor homing abilities. TumorHPD first generates all possible single substitution mutants of the original peptide; it then predicts whether the mutants and the original peptide are tumor homing or not. It also calculates an SVM score for each peptide, which is proportional to the reliability of the prediction. Along with the prediction, the server calculates important physicochemical properties (e.g. hydrophobicity, amphipathicity, charge, pI, etc.) and presents them in a sortable table. This feature helps the user to select better analogues based on desired physicochemical properties, as many peptide analogues may have a higher SVM score or better properties than the original peptide. In addition, users can generate all possible mutants (2nd round) of a selected analogue and may obtain even better peptide analogues with higher tumor homing abilities (based on SVM score). This cycle can be repeated until a peptide analogue with the desired properties (tumor homing and physicochemical) is obtained. Protein scanning is another tool, which allows the user to submit a protein sequence and scans it for putative THPs. Graphical display of the scanned results speeds up the identification of THP-specific regions in the protein. In addition, users can predict the secondary structures of their peptides using Psipred¹³. TumorHPD is accessible at http://crdd.osdd.net/raghava/tumorhpd/.
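The mutant generation and protein scanning steps described here can be illustrated with a minimal Python sketch. It is not the TumorHPD implementation: the function names, the alphabetical residue ordering and the window width are assumptions made for illustration, and the scoring of each candidate by a trained model is left out.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 natural amino acids, one-letter codes

def single_substitution_mutants(peptide):
    """Return every peptide that differs from `peptide` at exactly one position."""
    mutants = []
    for pos, original in enumerate(peptide):
        for residue in AMINO_ACIDS:
            if residue != original:
                mutants.append(peptide[:pos] + residue + peptide[pos + 1:])
    return mutants

def scan_windows(protein, width):
    """Yield (start, fragment) for every overlapping window of `width` residues."""
    for start in range(len(protein) - width + 1):
        yield start, protein[start:start + width]

# A peptide of length 5 has 5 positions x 19 alternative residues = 95 mutants.
mutants = single_substitution_mutants("CRGDK")
print(len(mutants))
```

Each mutant or window would then be converted to features and scored by the prediction model; a second round of design simply applies `single_substitution_mutants` to a selected analogue.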

Discussion

In the past, THPs have been successfully used as delivery vehicles to target imaging agents, drug molecules, oligonucleotides and inorganic nanoparticles to tumors^7,10. Most THPs have been identified by in vivo phage display technology, which is a time-consuming and laborious process. Therefore, an in silico method for predicting THPs will be very useful for biologists working in the field of peptide-based drug delivery. With this in mind, in the present study we made a systematic attempt to develop an in silico approach to predict and design THPs. The overall approach is summarized in Figure 5. We collected 651 THPs from the TumorHoPe database and analyzed them. THPs vary widely in length, ranging from 3 to 35 residues, and the majority of peptides have a length between 5 and 10 residues.

In the preliminary analysis of THPs, we observed that certain residues dominate over others and that certain residues are preferred at specific positions. Based on these observations, we developed models for discriminating THPs and non-THPs using machine learning techniques. We developed SVM models using AAC, DPC and BPP features. The DPC-based models performed worse than the AAC-based models. However, the BPP-based method performed well relative to the other methods. Since binary profiles incorporate information about both the frequency and the order of amino acids, they are a better feature than AAC alone. Among all the subsets, NTCT5 and NTCT10 achieved the maximum AUC of 0.88 and 0.91, respectively. The binary profile also performed best for peptides with lengths between 5 and 10 residues. Based on the above approaches, an online web service, TumorHPD, has been developed. To the best of our knowledge, TumorHPD is the first in silico method of its kind for the prediction of THPs, so there are no existing methods for comparison. We hope that the establishment of such methods will speed up the identification of novel THPs and thus facilitate better drug delivery systems for cancer.

Methods

Main dataset

Recently, our group collected and compiled experimentally validated THPs (peptides that bind or home to tumors) from the literature and developed a public database, TumorHoPe¹⁴. In this study, we obtained 651 THPs from TumorHoPe. These peptides are considered positive examples. In order to develop a classification method, we needed negative examples (i.e. peptides that do not bind to tumors, or non-THPs). Unfortunately, experimentally validated non-THPs have not been reported in the literature. In order to generate a negative dataset, we generated 651 random peptides from proteins obtained from SwissProt. These random peptides were considered non-THPs. It is possible that some of the random peptides have tumor homing properties, but the probability is very low. Using random peptides as negative examples is a standard procedure in situations where experimentally validated negative examples are not available^15,16. The final main dataset consists of 651 THPs (experimentally validated) and 651 non-THPs (random peptides).
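As an illustration of how random negative peptides could be drawn from protein sequences, the sketch below samples one fragment per requested length. The source does not say how fragment lengths or positions were chosen, so the length matching, the fixed seed and the function name `random_peptides` are all assumptions.

```python
import random

def random_peptides(proteins, lengths, seed=0):
    """Draw one random fragment per requested length from the given proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    peptides = []
    for length in lengths:
        # Only proteins long enough to contain a fragment of this length qualify.
        candidates = [p for p in proteins if len(p) >= length]
        if not candidates:
            raise ValueError(f"no protein is long enough for length {length}")
        protein = rng.choice(candidates)
        start = rng.randrange(len(protein) - length + 1)
        peptides.append(protein[start:start + length])
    return peptides

# Toy protein strings; a real run would read sequences from a SwissProt file.
proteins = ["MKTAYIAKQRQISFVKSHFSRQ", "ACDEFGHIKLMNPQ"]
negatives = random_peptides(proteins, lengths=[5, 9, 12])
```

Passing the lengths of the positive peptides as `lengths` would give a negative set with the same length distribution, which is one plausible design choice rather than a documented one.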

Small dataset

It was observed that most of the THPs have 10 or fewer residues. Therefore, we created a subset of the main dataset in which peptides (THPs or non-THPs) have a minimum of four and a maximum of ten residues. This small dataset contains 469 THPs and an equal number of non-THPs (random peptides).

Terminus datasets

In order to understand the role of N- and C-terminal residues of THPs, we created terminus datasets from the N- and C-terminal residues of peptides in the main dataset. The following terminus datasets were derived from the main dataset: (i) NT5 contains the first five residues (5 N-terminal residues) of peptides; (ii) CT5 contains the last five residues (5 C-terminal residues) of peptides; and (iii) NTCT5, in which the features (amino acid composition, dipeptide composition and binary profiles) of the first five and last five residues of peptides were generated and combined for developing models. Similarly, the NT10, CT10 and NTCT10 terminus datasets were derived from the main dataset by taking ten residues from one terminus or from both termini.
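A minimal sketch of how the terminal segments could be extracted is given below. The function name is hypothetical, and returning `None` for peptides shorter than the window is an assumption, since the source does not state how such peptides were handled.

```python
def terminal_segments(peptide, k):
    """Return (N-terminal, C-terminal) segments of k residues, or None if too short."""
    if len(peptide) < k:
        return None  # assumption: peptides shorter than k are left out of the subset
    return peptide[:k], peptide[-k:]

# Arbitrary example string of nine residues.
nt5, ct5 = terminal_segments("CGNKRTRGC", 5)
# nt5 is the first five residues ("CGNKR"), ct5 the last five ("RTRGC")
```

For an NTCT5-style representation, features would be computed separately on `nt5` and `ct5` and then concatenated.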

Sequence logos

In order to understand the frequency of different types of amino acids at different positions in THPs, we created sequence logos using the WebLogo software¹⁷. The height of each residue letter within a stack represents the relative frequency of that residue at the given position. The overall height of the stack is a measure of conservation at that position: the taller the stack, the lower the variability at that position.
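The quantities a sequence logo displays can be approximated with a short calculation. The sketch below computes, for one position, the residue frequencies and the information content in bits (log2(20) minus the Shannon entropy). It is a simplified stand-in for WebLogo, not a reimplementation: the function name is hypothetical and no small-sample correction is applied.

```python
import math
from collections import Counter

def column_information(sequences, position):
    """Return (residue frequencies, information content in bits) for one position."""
    # Sequences too short to reach this position are ignored.
    column = [s[position] for s in sequences if len(s) > position]
    total = len(column)
    freqs = {aa: count / total for aa, count in Counter(column).items()}
    entropy = -sum(f * math.log2(f) for f in freqs.values())
    # Maximum information for a 20-letter alphabet is log2(20) bits.
    return freqs, math.log2(20) - entropy

peptides = ["CGR", "CAR", "CGK", "CPR"]
freqs, bits = column_information(peptides, 1)
```

A fully conserved position reaches the maximum of log2(20) bits, while a position with many different residues scores close to zero.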

Support vector machine

SVM is a machine-learning tool based on the structural risk minimization principle of statistical learning theory. SVMs are a set of related supervised learning methods used for classification and regression. The user can choose and optimize a number of parameters and kernels (e.g. linear, polynomial, radial basis function and sigmoidal) or any user-defined kernel. In this study, we used the SVMlight Version 6.02 package¹⁸, which requires a fixed number of inputs for training, thus necessitating a strategy for encapsulating the global information about peptides of variable length in a fixed-length format. The fixed-length format was obtained from sequences of variable length using amino acid composition, dipeptide composition and binary profiles.
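SVMlight reads training examples as text lines of the form `<target> <index>:<value> ...`, with feature indices starting at 1 and listed in increasing order. The sketch below shows one way a fixed-length feature vector could be written in that format; the helper name, the four-decimal precision and the omission of zero-valued features are illustrative choices, not details taken from the study.

```python
def to_svmlight_line(label, features):
    """Format one example as an SVMlight line: '<target> <index>:<value> ...'."""
    # Indices are 1-based and increasing; zero-valued features may be omitted.
    parts = [f"{i}:{v:.4f}" for i, v in enumerate(features, start=1) if v != 0]
    return " ".join([f"{label:+d}"] + parts)

# +1 marks a positive (THP) example, -1 a negative (non-THP) example.
print(to_svmlight_line(+1, [0.5, 0, 0.25]))
```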

Amino acid composition (AAC)

It has been shown in previous studies that simple frequency of 20 amino acids in a protein sequence can be used to predict various functions of proteins like sub-cellular localization and classification of proteins¹⁹. In this study, we have used AAC of peptides for discriminating THPs and non-THPs. Thus, peptide information was encapsulated in a vector of 20 dimensions, using amino acid composition of the peptide. AAC is the fraction of each amino acid type within a peptide. The fractions of all 20 natural amino acids were calculated by using the following equation:

$$\mathrm{Comp}(i) = \frac{R_i}{N} \times 100$$

where Comp(i) is the percent composition of amino acid i, R_i is the number of residues of type i and N is the total number of residues in the peptide.
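The percent composition defined here (count of each residue type divided by peptide length, times 100) can be sketched in a few lines of Python. The alphabetical ordering of the 20 residues and the function name are assumptions; any fixed ordering works as long as it is used consistently.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # assumed fixed ordering of the 20 residues

def aac(peptide):
    """Return the 20-dimensional percent amino acid composition of a peptide."""
    n = len(peptide)
    return [100.0 * peptide.count(residue) / n for residue in AMINO_ACIDS]

vector = aac("CGGRC")
# C and G each make up 40% of this peptide and R makes up 20%.
```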

Dipeptide composition (DPC)

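DPC gives the composition of pairs of adjacent residues (e.g. Ala-Ala, Ala-Leu) present in a peptide and is used to transform peptides of variable length into fixed-length feature vectors. It gives a fixed pattern length of 400 (20 × 20) and encapsulates information about the fraction of amino acids as well as their local order. It is calculated using the following equation:

$$\text{fraction of dipeptide}(i) = \frac{\text{total number of dipeptide}(i)}{\text{total number of all possible dipeptides}}$$

where dipeptide(i) is one of the 400 dipeptides and the denominator is the number of overlapping dipeptides in the peptide (N − 1 for a peptide of N residues).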
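A minimal sketch of dipeptide composition follows. It counts overlapping adjacent pairs and divides by the number of pairs in the peptide (N − 1); whether the original implementation also scaled the values to percentages is not stated, so the sketch returns plain fractions. The residue ordering and names are assumptions.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # assumed fixed ordering of the 20 residues
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]  # 400 pairs

def dpc(peptide):
    """Return the 400-dimensional dipeptide composition (fractions) of a peptide."""
    total = len(peptide) - 1  # number of overlapping adjacent pairs
    if total < 1:
        raise ValueError("peptide must have at least two residues")
    counts = dict.fromkeys(DIPEPTIDES, 0)
    for i in range(total):
        pair = peptide[i:i + 2]
        if pair in counts:  # skip pairs containing non-standard residues
            counts[pair] += 1
    return [counts[d] / total for d in DIPEPTIDES]

vector = dpc("AAAL")  # pairs: AA, AA, AL
```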

Binary profile patterns (BPP)

BPP were generated for each peptide, where a vector of dimensions of 20 represents each amino acid (e.g. Ala by 1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0). A pattern of window length W was represented by a vector of dimensions 20 × W. We have created binary profile patterns for first 5 and 10 residues from N-terminus, similarly for last 5 and 10 residues from C-terminus of peptides in datasets. The BPP has been used in a number of existing methods^20,21,22.
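The binary profile can be sketched as a one-hot encoding of each residue, concatenated over the window. Placing Ala in the first position matches the example in the text; the ordering of the remaining residues (alphabetical by one-letter code) and the function name are assumptions.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # Ala first, as in the text; rest assumed

def binary_profile(segment):
    """Return the 20 x W binary profile of a segment of W residues as a flat list."""
    vector = []
    for residue in segment:
        one_hot = [0] * 20
        one_hot[AMINO_ACIDS.index(residue)] = 1  # raises ValueError for unknown residues
        vector.extend(one_hot)
    return vector

peptide = "CGNKRTRGC"  # arbitrary example string
# NTCT5-style profile: first five and last five residues, 2 x 5 x 20 = 200 values.
ntct5 = binary_profile(peptide[:5]) + binary_profile(peptide[-5:])
```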

Cross-validation technique

One of the major challenges in developing in silico models is to validate them using standard techniques. A well-known and commonly used validation technique is jack-knife or leave-one-out cross-validation, where one peptide is used for testing and the remaining peptides for training. This process is repeated in such a way that each peptide is used once for testing. This technique is CPU-time intensive, so in this study we used five-fold cross-validation. Here, all peptides are randomly divided into five sets, of which four are used for training and the remaining set for testing. The process is repeated five times in such a way that each set is used once for testing. The final performance is obtained by averaging the performance over the five sets.
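The five-fold procedure can be sketched as an index splitter. This is a generic illustration, not the authors' code: the function name, the fixed seed and the round-robin assignment of shuffled indices to folds are assumptions.

```python
import random

def five_fold_splits(n_items, k=5, seed=0):
    """Yield (train_indices, test_indices) so that each item is tested exactly once."""
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)  # random division into k sets
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test

# 1302 peptides in the main dataset (651 THPs + 651 non-THPs).
for train, test in five_fold_splits(1302):
    pass  # train a model on `train`, evaluate on `test`, then average the five results
```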

Performance measure

The performance of the models developed in this study was evaluated using threshold-dependent as well as threshold-independent parameters. As threshold-dependent parameters we used sensitivity (Sn), specificity (Sp), overall accuracy (Ac) and Matthews correlation coefficient (MCC), calculated using the following equations.

$$Sn = \frac{TP}{TP + FN} \times 100$$

$$Sp = \frac{TN}{TN + FP} \times 100$$

$$Ac = \frac{TP + TN}{TP + TN + FP + FN} \times 100$$

$$MCC = \frac{(TP \times TN) - (FP \times FN)}{\sqrt{(TP + FP)(TP + FN)(TN + FP)(TN + FN)}}$$

where TP and TN are correctly predicted positive and negative examples, respectively. Similarly, FP and FN are wrongly predicted positive and negative examples, respectively.
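The threshold-dependent measures can be computed directly from the four confusion-matrix counts using their standard definitions. In the sketch below the function name and the convention of returning an MCC of 0 when the denominator is zero are assumptions.

```python
import math

def performance(tp, tn, fp, fn):
    """Return sensitivity, specificity, accuracy (all in percent) and MCC."""
    sn = 100.0 * tp / (tp + fn)
    sp = 100.0 * tn / (tn + fp)
    ac = 100.0 * (tp + tn) / (tp + tn + fp + fn)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0  # convention for empty margins
    return {"Sn": sn, "Sp": sp, "Ac": ac, "MCC": mcc}

# Made-up counts for illustration only; not results from the study.
scores = performance(tp=40, tn=30, fp=20, fn=10)
```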

We created receiver-operating characteristic (ROC) for all of the models in order to evaluate performance of models using threshold-independent parameters. ROC plots with area under the curve (AUC) were created using PASW statistical package.
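The AUC can also be computed without a statistics package. The sketch below uses the rank interpretation of the AUC (the probability that a randomly chosen positive example scores higher than a randomly chosen negative one, counting ties as one half). This is a substitute illustration, not the PASW procedure used in the study, and the function name is hypothetical.

```python
def auc(pos_scores, neg_scores):
    """Return the area under the ROC curve from scores of positive and negative examples."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(pos_scores) * len(neg_scores))

# Made-up SVM scores for illustration only.
value = auc([0.9, 0.3], [0.5, 0.1])
```

The pairwise loop is quadratic in the number of examples, which is acceptable for datasets of the size described here.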

Independent dataset

In order to evaluate the performance of our methods, we created an independent dataset of 83 novel experimentally validated THPs and an equal number of random peptides (non-THPs), which were not included in the training, feature selection or parameter optimization of the models. Experimentally validated THPs were collected manually from recent research papers and patents, while random peptides were generated from proteins obtained from SwissProt as described for the main dataset.

References

1. Hanna, T. P. & Kangolle, A. C. Cancer control in developing countries: using health data and health services research to measure and improve access, quality and efficiency. BMC Int Health Hum Rights 10, 24 (2010).
2. Lee, C., Raffaghello, L. & Longo, V. D. Starvation, detoxification and multidrug resistance in cancer therapy. Drug Resist Updat 15, 114–22 (2012).
3. Flaherty, K. T., Hodi, F. S. & Fisher, D. E. From genes to drugs: targeted strategies for melanoma. Nat Rev Cancer 12, 349–61 (2012).
4. Higgins, M. J. & Baselga, J. Targeted therapies for breast cancer. J Clin Invest 121, 3797–803 (2011).
5. Scott, A. M., Wolchok, J. D. & Old, L. J. Antibody therapy of cancer. Nat Rev Cancer 12, 278–87 (2012).
6. Chames, P., Van Regenmortel, M., Weiss, E. & Baty, D. Therapeutic antibodies: successes, limitations and hopes for the future. Br J Pharmacol 157, 220–33 (2009).
7. Laakkonen, P. & Vuorinen, K. Homing peptides as targeted delivery vehicles. Integr Biol (Camb) 2, 326–37 (2010).
8. Zitzmann, S., Ehemann, V. & Schwab, M. Arginine-glycine-aspartic acid (RGD)-peptide binds to both tumor and tumor-endothelial cells in vivo. Cancer Res 62, 5139–43 (2002).
9. Pasqualini, R. et al. Aminopeptidase N is a receptor for tumor-homing peptides and a target for inhibiting angiogenesis. Cancer Res 60, 722–7 (2000).
10. Ruoslahti, E., Bhatia, S. N. & Sailor, M. J. Targeting of drugs and nanoparticles to tumors. J Cell Biol 188, 759–68 (2010).
11. Ruoslahti, E. Peptides as targeting elements and tissue penetration devices for nanoparticles. Adv Mater 24, 3747–56 (2012).
12. Petrilli, P. Classification of protein sequences by their dipeptide composition. Comput Appl Biosci 9, 205–9 (1993).
13. McGuffin, L. J., Bryson, K. & Jones, D. T. The PSIPRED protein structure prediction server. Bioinformatics 16, 404–5 (2000).
14. Kapoor, P. et al. TumorHoPe: a database of tumor homing peptides. PLoS One 7, e35187 (2012).
15. Sanders, W. S., Johnston, C. I., Bridges, S. M., Burgess, S. C. & Willeford, K. O. Prediction of cell penetrating peptides by support vector machines. PLoS Comput Biol 7, e1002101 (2011).
16. Wang, P. et al. Prediction of antimicrobial peptides based on sequence alignment and feature selection methods. PLoS One 6, e18476 (2011).
17. Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: a sequence logo generator. Genome Res 14, 1188–90 (2004).
18. Joachims, T. Making large-scale support vector machine learning practical. In Advances in kernel methods: support vector learning. Edited by: Scholkopf B, Burges C, Smola A. Cambridge, MA: MIT Press, 169–184 (1999).
19. Garg, A., Bhasin, M. & Raghava, G. P. Support vector machine-based method for subcellular localization of human proteins using amino acid compositions, their order and similarity search. J Biol Chem 280, 14427–32 (2005).
20. Xiao, X., Shao, S., Ding, Y., Huang, Z. & Chou, K. C. Using cellular automata images and pseudo amino acid composition to predict protein subcellular location. Amino Acids 30, 49–54 (2006).
21. Xiao, X., Wang, P. & Chou, K. C. GPCR-CA: A cellular automaton image approach for predicting G-protein-coupled receptor functional classes. J Comput Chem 30, 1414–23 (2009).
22. Lata, S., Sharma, B. K. & Raghava, G. P. Analysis and prediction of antibacterial peptides. BMC Bioinformatics 8, 263 (2007).

Acknowledgements

The authors are thankful to the funding agencies Council of Scientific and Industrial Research (project Open Source Drug Discovery and GENESIS BSC0121) and Department of Biotechnology (project BTISNET), Govt. of India.

Author information

Sharma Arun and Kapoor Pallavi contributed equally to this work.

Authors and Affiliations

Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, 160036, India
Arun Sharma, Pallavi Kapoor, Ankur Gautam, Kumardeep Chaudhary, Rahul Kumar, Jagat Singh Chauhan, Atul Tyagi & Gajendra P. S. Raghava


Contributions

A.S., P.K., A.G. and K.C. collected the data and created the datasets. A.S., A.T. and P.K. developed computer programs and implemented SVM. A.S. and J.S.C. created the back end server. A.S., P.K., A.G., R.K. and K.C. developed the front end user interface. A.G., P.K. and A.S. wrote the manuscript. G.P.S.R. conceived and coordinated the project, helped in the interpretation of data, refined the drafted manuscript and gave overall supervision to the project. All of the authors read and approved the final manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Rights and permissions

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/


About this article

Cite this article

Sharma, A., Kapoor, P., Gautam, A. et al. Computational approach for designing tumor homing peptides. Sci Rep 3, 1607 (2013). https://doi.org/10.1038/srep01607


Received: 23 November 2012
Accepted: 22 March 2013
Published: 05 April 2013
DOI: https://doi.org/10.1038/srep01607
