Abstract
Predicting the function of a protein from its sequence is typically addressed using sequence-similarity. Here we propose a motif-based approach, using supervised motif extraction from protein sequences belonging to one functional family. The resulting deterministic motifs form Common Peptides (CPs) that characterize this family, allow for data mining of its proteins and facilitate further partition of the family into clusters
Similar content being viewed by others
Article PDF
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Gottlieb, A., Weingart, U. & Horn, D. Data mining of protein families using common peptides. Nat Prec (2008). https://doi.org/10.1038/npre.2008.2189.1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/npre.2008.2189.1