KampoDB, database of predicted targets and functional annotations of natural medicines

Sawada, Ryusuke; Iwata, Michio; Umezaki, Masahito; Usui, Yoshihiko; Kobayashi, Toshikazu; Kubono, Takaki; Hayashi, Shusaku; Kadowaki, Makoto; Yamanishi, Yoshihiro

doi:10.1038/s41598-018-29516-1

Download PDF

Article
Open access
Published: 25 July 2018

KampoDB, database of predicted targets and functional annotations of natural medicines

Ryusuke Sawada¹,
Michio Iwata²,
Masahito Umezaki³^na1,
Yoshihiko Usui³,
Toshikazu Kobayashi³,
Takaki Kubono⁴,
Shusaku Hayashi⁴,
Makoto Kadowaki⁴ &
…
Yoshihiro Yamanishi^2,5

Scientific Reports volume 8, Article number: 11216 (2018) Cite this article

5389 Accesses
10 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Natural medicines (i.e., herbal medicines, traditional formulas) are useful for treatment of multifactorial and chronic diseases. Here, we present KampoDB (http://wakanmoview.inm.u-toyama.ac.jp/kampo/), a novel platform for the analysis of natural medicines, which provides various useful scientific resources on Japanese traditional formulas Kampo medicines, constituent herbal drugs, constituent compounds, and target proteins of these constituent compounds. Potential target proteins of these constituent compounds were predicted by docking simulations and machine learning methods based on large-scale omics data (e.g., genome, proteome, metabolome, interactome). The current version of KampoDB contains 42 Kampo medicines, 54 crude drugs, 1230 constituent compounds, 460 known target proteins, and 1369 potential target proteins, and has functional annotations for biological pathways and molecular functions. KampoDB is useful for mode-of-action analysis of natural medicines and prediction of new indications for a wide range of diseases.

Associating 197 Chinese herbal medicine with drug targets and diseases using the similarity ensemble approach

Article 17 September 2019

Shuo Gu & Lu-hua Lai

In silico activity and ADMET profiling of phytochemicals from Ethiopian indigenous aloes using pharmacophore models

Article Open access 23 December 2022

Lemessa Etana Bultum, Gemechu Bekele Tolossa, … Doheon Lee

Molecular docking as a tool for the discovery of molecular targets of nutraceuticals in diseases management

Article Open access 17 August 2023

P. C. Agu, C. A. Afiukwa, … P. M. Aja

Introduction

Traditional medicines are used clinically in many areas of the world, including in Japan (Kampo), China, Korea, India (Ayurveda) and Perso-Arabic countries (Yunani). Traditional medicines usually comprise mixtures of the crude extracts from several medicinal herbs, each of which contains multiple components. The World Health Organization took the initiative to promote the globalization of traditional medicine in 1972 by founding a Division of Traditional Medicine. Approximately 45 years later, traditional medicines are widely available and are commonly used in many parts of the world. Recently, there has been a dramatic worldwide increase in the number of patients suffering from complex diseases, such as lifestyle-related diseases, cardiovascular diseases, diabetes, and immune-mediated diseases. It can be difficult to cure these complex diseases effectively with Western medicines by using the “one disease, one target, one drug” approach, and there are growing expectations toward the “one disease, multiple targets, multiple drugs” approach with multi-effective drugs such as traditional medicines used in combination therapies with Western medicines.

Kampo medicine originated from ancient Chinese medicine but evolved independently over a long period of time (more than 1500 years) to become a style individual to Japan. Kampo formulas often differ from Chinese or Korean traditional formulas, although many of the same medicinal herbs are used for traditional medicines across eastern Asian countries. Kampo medicines are decoctions or dry powders that include pharmaceutical active ingredients extracted by boiling from a mixture of naturally derived medicinal herbs. They are generally factory-produced by pharmaceutical companies in Japan and provided in a ready-to-use form. To assure the quality of Kampo products, the Japanese Ministry of Health, Labour and Welfare published their “Guideline on Data Requirements for Ethical Kampo Formulation” in 1985, resulting in Kampo medicines becoming standardized with respect to the quality and quantity of their ingredients. The Ministry maintains oversight of Kampo medicines. Japanese traditional formulas Kampo medicines are prescribed in hospitals in Japan as either monotherapy or harmoniously combined therapy with standard western therapy and >80% of medical doctors prescribe Kampo medicines in Japan¹. Thus, Kampo medicines are established as a pivotal part of mainstream medicine in Japan and the cost of Kampo medicines is covered by the National Health Insurance. On the other hand, in the United States, National Institutes of Health (NIH) now supports clinical and basic research on the traditional medicine. In recent years, NIH support and the US Food and Drug Administration (FDA) guideline on investigating botanical drug products, including complex formulas containing many constituents, has fostered the development of botanical drugs in the United States^1,2. Presently, randomized, double-blind, placebo-controlled clinical trials of some Kampo medicines (e.g., Daikenchuto for bowel diseases) are underway for phase II or phase III studies for FDA approval in the United States.

However, the pharmacotherapy with Kampo medicines greatly depends on the empirical knowledge of medical doctors in practice, and there is insufficient scientific evidence explaining the underlying molecular mechanisms of Kampo medicines. The mechanisms of Kampo medicines are different from those of ordinary medicines. The efficacies of Kampo medicines stem from multiple compound–multiple target interactions. Figure 1 shows an illustration of the difference of the mode-of-action between ordinary medicines and Kampo medicines. It is, therefore, indispensable to establish fundamental technologies to comprehensively analyze the underlying mechanisms of every pharmacological action of multicomponent Kampo medicines in the human body as a complex system.

In recent biomedical science, clinical and molecular data for Kampo medicine-based pharmacotherapy have been accumulated, and a variety of omics data are becoming available in the genome, transcriptome, proteome, metabolome, phenome, and diseasome. These “big data” are useful resources for mode-of-action analysis of Kampo medicines; thus, there is a strong need to develop databases and associated tools for Kampo medicines. Many databases for Western medicines exist (e.g., DrugBank³, KEGG DRUG⁴, Matador⁵, SuperTarget⁵, ChEMBL⁶, Therapeutic Target Database⁷, BindingDB⁸, PubChem⁹, Comparative Toxicogenomics Database¹⁰). However, there is no integrated database of Kampo medicine-related chemical and biological data, and clinical research data and clinical findings. There is a wiki-system database of Kampo medicines and crude drugs¹¹, but it is mainly Kampo medicine-related pharmacognostical and chemical database and thereby cannot help to understand the mode-of-actions and further clinical applications of Kampo medicines.

Here, we present KampoDB (http://wakanmoview.inm.u-toyama.ac.jp/kampo/), a novel platform for the analysis of natural medicines, which provides various useful scientific resources on Kampo medicines, constituent herbal medicines, constituent compounds, and target proteins of these constituent compounds. Potential target proteins of these constituent compounds were predicted by docking simulations and machine learning methods based on large-scale omics data (e.g., genome, proteome, metabolome, interactome). Therefore, KampoDB is useful for understanding the mode-of-action of natural medicines in terms of biological pathways and molecular functions of target proteins, which can lead to new indications for a wide range of diseases. The present study aims to elucidate the underlying mechanisms of Kampo medicines, while predicting their target proteins and new indications, thereby repositioning Kampo medicines for their extensive application in clinical practice, with a view toward using them more effectively in clinical practice.

Results

Data collection

The current version of KampoDB contains 42 Kampo medicines, 54 crude drugs, 1230 constituent compounds, 460 known target proteins, and 1369 potential target proteins and has functional annotations for biological pathways and molecular functions. The molecular information on natural medicines in KampoDB was collected and digitized from scientific literature, molecular databases, and clinical reports. We collected the relationships between Kampo drugs and crude drugs (and also below layers) from the Traditional Medical & Pharmaceutical Database of the Institute of Natural Medicine, University of Toyama (http://wakankensaku.inm.u-toyama.ac.jp/). As the information was provided in Japanese, we translated it to English. The correlation between Kampo drugs and crude drugs was not based on the computational predictions. The mode-of-action was elucidated by applying the state-of-the-art computational methods (see the METHODS section for more details). KampoDB is compatible with other molecular biology databases (e.g., KEGG¹², ChEMBL⁶, UniProt¹³, KNApSAcK¹⁴) by using the same identifiers (compound IDs, protein IDs, disease IDs).

Inputs and outputs

KampoDB consists of three components: 1) natural medicines list, 2) functional analysis, and 3) target prediction. Figure 2 shows a diagrammatic representation of KampoDB. All of the resources are accessible via the website (http://wakanmoview.inm.u-toyama.ac.jp/kampo/).

In the “Natural medicines list” component, a user can input a natural medicine name (e.g., “kakkonto”) as a query. Kakkonto is one of the most frequently used Kampo medicines in Japan, because it is a highly effective and safe medicine against the common cold¹⁵, influenza¹⁶ and allergic rhinitis¹⁷ either as sole therapy or in combination with modern Western medicines. Kakkonto is composed of seven Japanese Pharmacopoeia standard medicinal herbs: Puerariae Radix, Cinnamomi Cortex, Zizyphi Fructus, Paeoniae Radix, Ephedrae Herba, Zingiberis Rhizoma and Glycyrrhizae Radix. The main bioactive compound in kakkonto is thought to be puerarin, which is an isoflavonoid derived from Puerariae Radix that exhibits many pharmacological properties, including such as anti-inflammation, vasodilation, neuroprotection, antioxidant and anticancer effects (Supplementary Fig. 1)¹⁸. Clicking on the search button, the user can obtain the corresponding information on Kampo medicines, crude drugs, constituent compounds and target proteins. Note that compound IDs correspond to KNApSAcK IDs¹⁴, and protein IDs correspond to KEGG GENES IDs¹². The user can see a global classification of Kampo medicines, crude drugs, constituent compounds, and target proteins in a hierarchical manner (Kampo medicines on the 1st layer; crude drugs on the 2nd layer; constituent compounds on the 3rd layer; target proteins on the 4th layer). Note that each Kampo medicine consists of multiple crude drugs, each crude drug consists of multiple compounds, and each constituent compound is supposed to interact with its target proteins. If the proteins are therapeutic targets of diseases, the corresponding diseases are shown.

In the “Functional analysis” component, the user can input natural medicine names. The output is the summary of the mode-of-action analysis of the corresponding natural medicines, which provides molecular function annotations of target proteins (e.g., molecular functions in Gene Ontology¹⁹) and biological pathway annotations (e.g., biological pathways in KEGG PATHWAY¹²). A visualization of the results at different layer levels enables the user to see the mode-of-action information in a hierarchical manner within a natural medicine classification. The user can select one option from the following four categories and click on the corresponding button: (1) Pathway: biological pathways in the KEGG PATHWAY, (2) Brite: protein classifications in KEGG BRITE, (3) Process: biological process terms in GO, and (4) Function: molecular function terms in GO. For example, in the case of “Pathway”, the output is the list of pathway names with high enrichment ratio scores and low p-values (See the METHODS section for more details).

In the “Target prediction” component, the user can see the results of newly predicted target proteins of major constituent compounds by performing docking simulations and machine learning techniques. The user can select a query compound by clicking on a compound name of interest in the list of the constituent compounds that are defined as standard compounds in the Japanese pharmacopoeia (see the METHODS section for more details). The outputs are the list of predicted human proteins for the query compound and associated information. In the docking simulation method, docking was performed for the constituent compounds with each human protein 3D structure. In the machine learning method, supervised classification with compound chemical structure similarity was performed for each human protein (see the METHODS section for more details).

Possible applications

An application of the “Natural medicines list” component in KampoDB is to view a hierarchical classification of natural medicines. Figure 3 shows an example of the output page of the query “kakkonto” (an example of Kampo medicines) as an input. The 2nd and 3rd layers show the crude drugs (e.g., “Ephedra herb”) constituting the Kampo medicine (“kakkonto” in this case) and the compounds (e.g., “Methylephedrine”) constituting the crude drug (“Ephedra herb” in this case), respectively. The 4th layer shows the target proteins (e.g., “ADRA1D”) that are known to interact with the constituent compound (“Methylephedrine” in this case). The output enables the user to investigate the hierarchical relationship between Kampo medicines, crude drugs, constituent compounds and target proteins.

An application of the “Functional analysis” component in KampoDB is to perform the mode-of-action analysis of natural medicines in terms of biological pathways and molecular ontologies. Figure 4 shows an example of the output page of the query “Methylephedrine” (an example of constituent compounds of “kakkonto”) as an input in the “Functional analysis” page. In the case of pathway enrichment analysis, biological pathways with high enrichment ratios and low p-values can be thought of as candidates for the associated pathways. For example, the “cGMP-PKG signaling pathway”, “Calcium signaling pathway”, and “Adrenergic signaling in cardiomyocytes” were detected as the pathways associated with the term “Methylephedrine.” This is a reasonable result because target proteins of methylephedrine (e.g., ADRA1D, ADRA1B, ADRA1A) are known to be involved in the adrenergic signaling process⁴.

An application of the “Target prediction” component in KampoDB is to predict unknown target proteins of the constituent compounds of natural medicines. Figure 5 shows an example of the output page of the query “shikonin” (a constituent compound of “Lithospermum erythrorhizon”) with the docking simulation option in the “Target prediction” component. The left panel in Fig. 5 shows the binding form of the predicted interaction between shikonin with FK506-binding protein (FKBP). The graphical picture enables the user to investigate the ligand binding sites on the protein 3D structure. The validity of the shikonin-FKBP interaction and its pharmacological effects were experimentally confirmed in a previous work²⁰.

Figure 6 shows an example of the output page of the query “Sinomenine” (a constituent compound of “boiogito”: see red rectangle in Supplementary Fig. 2) with the machine learning option in the “Target prediction” component. Boiogito is prescribed as a Kampo remedy for arthritis, nephrosis, edema, hyperhidrosis and obesity. Boiogito is composed of six Japanese Pharmacopoeia standard medicinal herbs: Sinomeni Caulis et Rhizoma, Astragali Radix, Atractylodis Lanceae Rhizoma, Zizyphi Fructus, Glycyrrhizae Radix and Zingiberis Rhizoma. Sinomenine, an ingredient extracted from the Sinomenium Stem, exerts anti-inflammatory effects through inhibiting lymphocyte proliferation²¹, and decreasing eicosanoid synthesis and nitric oxide production²². Furthermore, sinomenine ameliorates experimental arthritis in an animal model²³. The list of target candidate proteins and the associated information (e.g., molecular functions, biological pathways, applicable diseases) are shown with a ranking from the highest prediction score. Examples of predicted applicable diseases of sinomenine are adiposity and type II diabetes mellitus, implying that sinomenine is effective for treatment of adiposity and type II diabetes mellitus based on the target proteins: GAA, OPRM1, OPRD1, OPRK1. These observations are reasonable, because Kampo medicine “boiogito” that includes sinomenine as a constituent compound is known to be useful for adiposity. These results also suggest that GAA, OPRM1, OPRD1, and OPRK1 may play key roles in the pharmacological action of “boiogito”. This is how the method can be used for the mode-of-action analysis of Kampo medicines.

A case study

As a case study, we show here how KampoDB could be used with daikenchuto, one of the most frequently used Kampo medicines in Japan. Daikenchuto is beneficial for postoperative complications such as ileus and abdominal bloating. Although the mechanisms of daikenchuto are not fully understood, it has been reported that daikenchuto ameliorates these intestinal motility disorders via the release of serotonin and suppress the inflammation via the inhibition of cyclooxygenase-2 activity^24,25.

When “daikenchuto” was entered as a query in KampoDB, the “Functional analysis” component predicted “serotonergic synapse” and “arachidonic acid metabolism” as the associated pathways. It also predicted “Wnt signaling pathway”, “T cell receptor signaling pathway”, and “TNF signaling pathway” as candidates for target pathways associated with the mechanisms of daikenchuto. This suggests that daikenchuto derives its anti-inflammatory activity via arachidonic acid metabolism²⁶ and several other pathways. “T cell receptor signaling pathway” and “TNF signaling pathway” have been supported by previous reports^27,28 as the underlying mechanisms of daikenchuto. However, to the best of our knowledge, there is no report on the role of daikenchuto in the “Wnt signaling pathway”.

We previously showed that daikenchuto markedly alleviated dextran sulfate sodium (DSS)-induced experimental colitis in mice. Ulcerative colitis is a chronic inflammatory bowel disease (IBD) in which patients experience intermittent remission and relapse over decades. The long-term chronic inflammation elevates the risk of colitis-associated cancer (CAC) and can lead to CAC-related death. Therefore, CAC is regarded as the most serious complication of IBD. However, not all medicines effective against experimental colitis are necessarily effective against CAC. Indeed, it has been reported that an agonist for a prostaglandin E2 receptor subtype suppresses DSS-induced colitis and also prevents the development of colorectal carcinogenesis in a murine CAC model, whereas sulfasalazine, a prodrug of 5-aminosalicylic acid with efficacy against DSS-induced colitis, did not prevent colorectal tumor formation in a murine CAC model²⁹.

Using KampoDB, “Wnt signaling pathway”, “T cell receptor signaling pathway”, and “TNF signaling pathway” were predicted as candidates for target pathways associated with the underlying mechanisms of daikenchuto that contribute to the development of CAC. In particular, the contribution of the Wnt signaling pathway to the colorectal carcinogenesis is established³⁰. Recently, it was reported that the activation of Wnt/β-catenin signaling is essential for the early phase development of IBD-associated colorectal cancer^31,32. Additionally, Wnt signaling-initiated tumorigenesis has been reported in a murine CAC model³³.

Taking all together, these findings suggest that daikenchuto attenuates the development of chronic inflammation-associated cancer. It has the potential to be a new therapeutic strategy while repositioning the use of Kampo medicine. While testing this hypothesis, we found that the daikenchuto treatment indeed significantly suppressed the development of chronic colitis-associated colon cancer in a murine experimental model, as shown in Fig. 7.

Daikenchuto comprises three medicinal herbs: ginseng root, processed ginger, and Zanthoxylum peel (Supplementary Fig. 3). KampoDB was able to predict the possibility that the Wnt signaling pathway was a target of ginseng root and that the T cell receptor and TNF signaling pathways were underlying mechanisms of the anti-CAC effects of processed ginger and Zanthoxylum peel. These results suggest that the additive or synergistic actions of constitutive medicinal herbs contribute to the suppressive effect of daikenchuto on the development of CAC. Therefore, KampoDB can be useful for predicting new roles or aspects of traditional medicines, helping to clarify the underlying mechanisms of traditional medicines.

Discussion

KampoDB is the first platform for the analysis of natural medicines for mode-of-action analysis and repositioning of natural medicines in the world. The primary contribution of this study is to propose computational methods for the mode-of-action analysis and repositioning of Kampo medicines. In this study, we put great efforts on establishing a methodology for the computational prediction of target proteins and new indications of Kampo medicines. We established a useful web service that makes it easier for medical doctors to use Kampo medicines in clinical practice. The methods are expected to be useful for analyzing the complex systems of natural medicines. Thus, the technologies should contribute to innovation in the field of health science.

A related work of this study is a wiki-system of Kampo medicines and crude drugs¹¹ and the Kampo section of the KNApSAcK¹⁴ database that enables group search of medicinal plants, formula search by a medicinal plant, and medicinal plant search by a Kampo formula. However, these existing databases do not provide the information on potential target proteins, target pathways, and applicable diseases. Thus, they cannot help to understand the mode-of-actions and further clinical applications of Kampo medicines and crude drugs.

The performances of the target prediction and indication prediction depend heavily on the data representation of Kampo medicines, crude drugs, constituent compounds, and proteins. In this study, we used chemical structures of the constituent compounds and protein structures, but another approach would be to use other omics data. Recently, compound-induced transcriptome data (e.g., chemical treatment on human cell lines) and genetically-perturbed transcriptome data (e.g., gene knockdown, gene overexpression) have been utilized in various pharmaceutical applications. Similarity, the analysis of gene expression profiles by perturbations with Kampo medicines and crude drugs would be an interesting approach for target prediction and indication prediction. The inclusion of these gene expression data will be one of our important future works.

Traditional medicines have considerable advantages, such as the abundance of clinical experience gained over a long time, the diversity of chemical structures of the constituent compounds, and their biological activity in humans, providing an incomparable source of new drug leads for effective drug development. The results of the present study provided possible concepts and methodologies from traditional medicine that could help the discovery and development of new drugs.

We plan to maintain KampoDB by updating the molecular data on a regular basis and by analyzing the data using more sophisticated computational methods. For the “Natural medicine list” and “Functional analysis” components, we intend to incorporate the latest information from the literature and from other molecular databases. For the docking simulation analysis in the “Target prediction” component, we plan to perform docking simulations for missing compound–protein pairs as soon as the information on protein structures becomes available and to investigate the possibility of using other docking software, such as myPresto. For the machine learning analysis in the “Target prediction” component, we plan to use more sophisticated machine learning methods (e.g., deep learning, support vector machine, and logistic regression) to improve its accuracy in predicting target proteins and the applicable diseases. Currently, the prediction results for applicable diseases are presented at the level of the constituent compounds of the Kampo medicines and crude drugs, but we intend to develop integrative methods to show the prediction results for applicable diseases at the level of Kampo medicines and crude drugs themselves. In Japanese traditional medicines, various kinds of Kampo medicines, crude drugs, and constituent compounds exist. Our KampoDB is just the first version and does not cover all Kampo medicines and diseases. In our future versions, we will add more Kampo medicines, crude drugs, constituent compounds, and diseases.

Methods

Chemical structure representation

The chemical structures of constituent compounds were obtained from KNApSAcK¹⁴ and PubChem⁹ and were represented by their KEGG Chemical Function and Substructures (KCF-S) descriptors³⁴. Each compound was coded by a high-dimensional feature vector in which each element indicates the frequency of a feature defined by KEGG Chemical Function Substructures (KCF-S) (i.e., chemical substructures). The number of features was 475,692. We computed chemical structure similarity scores of compounds by using the generalized Jaccard correlation coefficient.

Compound–protein interactions

Known compound–protein interactions were acquired from public databases: ChEMBL⁶, MATADOR⁵, DrugBank³, the Psychoactive Drug Screening Program Ki, KEGG DRUG⁴, the Binding DB⁸, and the Therapeutic Target Database⁷. For the ChEMBL data, we selected only compound–protein interaction pairs that were clearly denoted as active interactions or had binding affinities of <30 μM (e.g., IC₅₀), which yielded 1,287,404 compound–protein interactions involving 519,061 compounds and 3,735 proteins. Compounds and proteins included in the chemical–protein interactome data are referred to as interactome compounds and interactome proteins, respectively.

Constituent compounds

Kampo formulas are recognized as official prescription drug and listed in the Japanese pharmacopoeia. We selected 80 compounds derived from constituent medicinal herbs of Kampo formulas that are most frequently used for the medical treatment in Japan. 80 compounds are listed as standard drugs for the crude drug analysis (medicinal herb analysis) in the Japanese pharmacopoeia.

Target prediction by docking simulation

We performed a target prediction by performing a docking simulation. Protein 3D structures were obtained from the PDB database³⁵ and SAHG³⁶. In this study, we used AutoDock, which is a suite of automated docking tools, to predict how compounds bind to a target protein³⁷. We performed a large-scale docking simulation for all possible pairs of the constituent compounds and about 40,000 human proteins. The predicted protein–ligand complexes were optimized and ranked according to the empirical scoring function, which estimates the binding free energy of the ligand receptor complex. We stored the calculated numerical results in the platform.

Target prediction by machine learning

We performed a target prediction by using our previously developed method, called TESS (target estimation based on similarity search)³⁸, to predict target proteins on the basis of compound chemical structures and large-scale chemical–protein interactome data in the framework of chemogenomics. We propose to apply the TESS algorithm to each constituent compound of Kampo medicine. In the TESS procedure, we calculated the similarity scores of compound chemical structures by the Jaccard index based on the KCF-S descriptors³⁴, which were used as prediction scores.

First, we compute pairwise similarity scores for all pairs between a query constituent compound and all of the interactome compounds in our chemical–protein interactome data. Second, from the interactome compounds known to interact with the k-th protein (k = 1, 2,…, p), we select an interactome compound with the highest similarity to the query constituent compound and use the corresponding similarity score as a prediction score to assess the possibility that the query compound interacts with the k-th protein. Third, we repeat this procedure for all p interactome proteins and assign the prediction scores to pairs between the query compound and all interactome proteins. Finally, high scoring compound–protein pairs are predicted as candidates for interaction pairs. Then, the predicted compound–protein pairs are grouped into Kampo medicines based on their constituent compounds. Figure 8 shows an illustration of the process. The details of the performance evaluation can be found in the “Performance evaluation” section in Supplementary Information.

Pathway/ontology enrichment analysis

We performed the functional enrichment analyses for natural medicine (e.g., Kampo medicines, crude drugs) by mapping a set of target proteins of the constituent compounds of each natural medicine to biological pathways or molecular ontology terms. There are four options: (1) Pathway: biological pathways in KEGG PATHWAY, (2) Brite: protein classifications in KEGG BRITE, (3) Process: biological process terms in GO, and (4) Function: molecular function terms in GO. Here, we focus on the explanation of the enrichment analysis for Pathway. Note that the same procedure can be performed not only for Pathway but also for other options (Brite, Process, and Function).

We used the 163 biological pathways in KEGG (except for Global and overview maps). The enrichment ratio was calculated as the ratio of the number of associated target proteins to the number of all proteins in each pathway. The p-value was calculated by performing a hypergeometric test^39,40. Let G_comp denote a set of target proteins of the constituent compounds of a natural medicine (e.g., Kampo medicines, crude drugs) of interest, and let G_path denote a set of target proteins in a pathway map. Further, let r = |G_comp|, k = |G_path|, z = |G_comp $\cap $ G_path|, and l equal the total number of genes in the entire dataset (l = 460). We assumed that z follows a hypergeometric distribution. The probability of observing an intersection of size z between G_path and G_comp is computed as follows:

$$p({G}_{path},{G}_{comp})=\sum _{i=z}^{min(k,r)}(\begin{array}{c}k\\ i\end{array})(\begin{array}{c}l-k\\ r-i\end{array})/(\begin{array}{c}l\\ r\end{array})\cdot $$

(1)

The resulting p-values were corrected by using the false discovery rate⁴¹. In this study, Kampo medicines were associated with all possible target proteins through their constituent compounds. Several proteins were overlapped between different pathways, and the activities of protein-coding genes were not considered, rendering the enrichment analysis likely to produce high values. To determine more specific pathways, the mapping of Kampo medicines-induced gene expression data onto biological pathway maps would be a solution; however, it was out of this paper’s scope.

Disease–target associations

The information on therapeutic target proteins for each disease was obtained from scientific literature and medical books. Drugs regulate therapeutic target proteins known to be useful for the treatment of each disease. Note that target proteins that are not known to be associated with diseases are not taken into consideration. In total, 2,062 disease–target associations involving 250 diseases and 462 therapeutic target proteins were obtained.

Indication prediction by target matching

We performed a prediction of drug indications (i.e., applicable diseases) of the query constituent compound based on its target proteins (including known target proteins and newly predicted target proteins by TESS) and the disease–target association set.

First, we take a target protein of the query constituent compound and look for the same target protein in the disease–target association set. Second, we select diseases associated with the matched target protein, and link the query constituent compound to the selected diseases via the matched target protein. The prediction scores are set to one if the matched target proteins are known targets of the query drug, while the prediction scores are set to the TESS score if the matched target proteins are newly predicted.

In vivo experiments with a CAC mouse model

Male BALB/c mice (8–10 weeks) were purchased from Japan SLC (Shizuoka, Japan). The mice were housed in the experimental animal facility at the University of Toyama and given free access to food and water. All experiments were performed in accordance with the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health and the University of Toyama. The Animal Experiment Committee at the University of Toyama approved all of the animal care procedures and experiments (authorization no. A2015INM-2). CAC model was induced as described previously⁴². The mice were administered azoxymethane intraperitoneally (10 mg/kg; Sigma-Aldrich, St. Louis, MO). After 5 days, the mice were administered 2% DSS (36-50 kDa; MP Biomedicals, Santa Ana, CA) in their drinking water for 5 days, followed by 16 days of regular water. This cycle was repeated three times. The body weight of each mouse was measured every other day, and its colonic mucosa was monitored using a mouse endoscopy system (AE-C1; AVS, Tokyo, Japan). On day 70 after the start of azoxymethane administration, the mouse colon was excised for macroscopic evaluation and histological and biological analyses. Visible tumors (>1 mm along the major axis) were counted in the mid to distal colon of each mouse.

References

Grayson, M. Traditional Asian medicine. Nature 480, S81, https://doi.org/10.1038/480S81a (2011).
Article ADS PubMed CAS Google Scholar
Li, X. M. & Brown, L. Efficacy and mechanisms of action of traditional Chinese medicines for treating asthma and allergy. J Allergy Clin Immunol 123, 297–306, quiz 307-298, https://doi.org/10.1016/j.jaci.2008.12.026 (2009).
Knox, C. et al. DrugBank 3.0: a comprehensive resource for ‘omics’ research on drugs. Nucleic Acids Res 39, D1035–1041, https://doi.org/10.1093/nar/gkq1126 (2011).
Article PubMed CAS Google Scholar
Kanehisa, M., Goto, S., Furumichi, M., Tanabe, M. & Hirakawa, M. KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic Acids Res 38, D355–360, https://doi.org/10.1093/nar/gkp896 (2010).
Article PubMed CAS Google Scholar
Gunther, S. et al. SuperTarget and Matador: resources for exploring drug-target relationships. Nucleic Acids Res 36, D919–922, https://doi.org/10.1093/nar/gkm862 (2008).
Article PubMed CAS Google Scholar
Gaulton, A. et al. ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40, D1100–1107, https://doi.org/10.1093/nar/gkr777 (2012).
Article PubMed CAS Google Scholar
Qin, C. et al. Therapeutic target database update 2014: a resource for targeted therapeutics. Nucleic Acids Res 42, D1118–1123, https://doi.org/10.1093/nar/gkt1129 (2014).
Article PubMed CAS Google Scholar
Liu, T., Lin, Y., Wen, X., Jorissen, R. N. & Gilson, M. K. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res 35, D198–201, https://doi.org/10.1093/nar/gkl999 (2007).
Article PubMed CAS Google Scholar
Kim, S. et al. PubChem Substance and Compound databases. Nucleic Acids Res 44, D1202–1213, https://doi.org/10.1093/nar/gkv951 (2016).
Article PubMed CAS Google Scholar
Davis, A. P. et al. The Comparative Toxicogenomics Database’s 10th year anniversary: update 2015. Nucleic Acids Res 43, D914–920, https://doi.org/10.1093/nar/gku935 (2015).
Article ADS PubMed CAS Google Scholar
Arita, M. et al. Database for crude drugs and Kampo medicine. Genome Inform 25, 1–11 (2011).
PubMed Google Scholar
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res 40, D109–114, https://doi.org/10.1093/nar/gkr988 (2012).
Article PubMed CAS Google Scholar
UniProt, C. UniProt: a hub for protein information. Nucleic Acids Res 43, D204–212, https://doi.org/10.1093/nar/gku989 (2015).
Article CAS Google Scholar
Afendi, F. M. et al. KNApSAcK family databases: integrated metabolite-plant species databases for multifaceted plant research. Plant Cell Physiol 53, e1, https://doi.org/10.1093/pcp/pcr165 (2012).
Article PubMed CAS Google Scholar
Terasawa, K. Evidence-based Reconstruction of Kampo Medicine: Part II-The Concept of Sho. Evid Based Complement Alternat Med 1, 119–123, https://doi.org/10.1093/ecam/neh022 (2004).
Article PubMed PubMed Central Google Scholar
Kurokawa, M., Tsurita, M., Brown, J., Fukuda, Y. & Shiraki, K. Effect of interleukin-12 level augmented by Kakkon-to, a herbal medicine, on the early stage of influenza infection in mice. Antiviral Res 56, 183–188 (2002).
Article PubMed CAS Google Scholar
Okubo, K. et al. Japanese guideline for allergic rhinitis. Allergol Int 60, 171–189, https://doi.org/10.2332/allergolint. 11-RAI-0334 (2011).
Zhou, Y. X., Zhang, H. & Peng, C. Puerarin: a review of pharmacological effects. Phytother Res 28, 961–975, https://doi.org/10.1002/ptr.5083 (2014).
Article PubMed CAS Google Scholar
Ashburner, M. et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25, 25–29, https://doi.org/10.1038/75556 (2000).
Article PubMed PubMed Central CAS Google Scholar
Wang, X. et al. Shikonin, a constituent of Lithospermum erythrorhizon exhibits anti-allergic effects by suppressing orphan nuclear receptor Nr4a family gene expression as a new prototype of calcineurin inhibitors in mast cells. Chem Biol Interact 224, 117–127, https://doi.org/10.1016/j.cbi.2014.10.021 (2014).
Article PubMed CAS Google Scholar
Wang, Q. & Li, X. K. Immunosuppressive and anti-inflammatory activities of sinomenine. Int Immunopharmacol 11, 373–376, https://doi.org/10.1016/j.intimp.2010.11.018 (2011).
Article PubMed CAS Google Scholar
Liu, L., Riese, J., Resch, K. & Kaever, V. Impairment of macrophage eicosanoid and nitric oxide production by an alkaloid from Sinomenium acutum. Arzneimittelforschung 44, 1223–1226 (1994).
PubMed CAS Google Scholar
Liu, L. et al. Amelioration of rat experimental arthritides by treatment with the alkaloid sinomenine. Int J Immunopharmacol 18, 529–543 (1996).
Article PubMed CAS Google Scholar
Hayakawa, T. et al. Effects of Dai-kenchu-to on intestinal obstruction following laparotomy. J Smooth Muscle Res 35, 47–54 (1999).
Article PubMed CAS Google Scholar
Kono, T. et al. Complementary and synergistic therapeutic effects of compounds found in Kampo medicine: analysis of daikenchuto. Front Pharmacol 6, 159, https://doi.org/10.3389/fphar.2015.00159 (2015).
Article PubMed PubMed Central CAS Google Scholar
Oshima, H. & Oshima, M. The inflammatory network in the gastrointestinal tumor microenvironment: lessons from mouse models. J Gastroenterol 47, 97–106, https://doi.org/10.1007/s00535-011-0523-6 (2012).
Article ADS PubMed CAS Google Scholar
Iwasa, T. et al. Feeding administration of Daikenchuto suppresses colitis induced by naive CD4+ T cell transfer into SCID mice. Dig Dis Sci 57, 2571–2579, https://doi.org/10.1007/s10620-012-2218-0 (2012).
Article PubMed CAS Google Scholar
Ueno, N. et al. TU-100 (Daikenchuto) and ginger ameliorate anti-CD3 antibody induced T cell-mediated murine enteritis: microbe-independent effects involving Akt and NF-kappaB suppression. PLoS One 9, e97456, https://doi.org/10.1371/journal.pone.0097456 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Watanabe, Y. et al. KAG-308, a newly-identified EP4-selective agonist shows efficacy for treating ulcerative colitis and can bring about lower risk of colorectal carcinogenesis by oral administration. Eur J Pharmacol 754, 179–189, https://doi.org/10.1016/j.ejphar.2015.02.021 (2015).
Article PubMed CAS Google Scholar
Rogler, G. Chronic ulcerative colitis and colorectal cancer. Cancer Lett 345, 235–241, https://doi.org/10.1016/j.canlet.2013.07.032 (2014).
Article PubMed CAS Google Scholar
Claessen, M. M. et al. WNT-pathway activation in IBD-associated colorectal carcinogenesis: potential biomarkers for colonic surveillance. Cell Oncol 32, 303–310, https://doi.org/10.3233/CLO-2009-0503 (2010).
Article PubMed PubMed Central CAS Google Scholar
Robles, A. I. et al. Whole-Exome Sequencing Analyses of Inflammatory Bowel Disease-Associated Colorectal Cancers. Gastroenterology 150, 931–943, https://doi.org/10.1053/j.gastro.2015.12.036 (2016).
Article PubMed PubMed Central CAS Google Scholar
Bollrath, J. et al. gp130-mediated Stat3 activation in enterocytes regulates cell survival and cell-cycle progression during colitis-associated tumorigenesis. Cancer Cell 15, 91–102, https://doi.org/10.1016/j.ccr.2009.01.002 (2009).
Article PubMed CAS Google Scholar
Kotera, M. et al. KCF-S: KEGG Chemical Function and Substructure for improved interpretability and prediction in chemical bioinformatics. BMC Syst Biol 7(Suppl 6), S2, https://doi.org/10.1186/1752-0509-7-S6-S2 (2013).
Article PubMed PubMed Central Google Scholar
Rose, P. W. et al. The RCSB Protein Data Bank: views of structural biology for basic and applied research and education. Nucleic Acids Res 43, D345–356, https://doi.org/10.1093/nar/gku1214 (2015).
Article PubMed CAS Google Scholar
Motono, C. et al. SAHG, a comprehensive database of predicted structures of all human proteins. Nucleic Acids Res 39, D487–493, https://doi.org/10.1093/nar/gkq1057 (2011).
Article PubMed CAS Google Scholar
Morris, G. M. et al. AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility. J Comput Chem 30, 2785–2791, https://doi.org/10.1002/jcc.21256 (2009).
Article PubMed PubMed Central CAS Google Scholar
Sawada, R., Iwata, H., Mizutani, S. & Yamanishi, Y. Target-Based Drug Repositioning Using Large-Scale Chemical-Protein Interactome Data. J Chem Inf Model 55, 2717–2730, https://doi.org/10.1021/acs.jcim.5b00330 (2015).
Article PubMed CAS Google Scholar
Hung, J. H. Gene Set/Pathway enrichment analysis. Methods Mol Biol 939, 201–213, https://doi.org/10.1007/978-1-62703-107-3_13 (2013).
Article PubMed CAS Google Scholar
Mizutani, S., Pauwels, E., Stoven, V., Goto, S. & Yamanishi, Y. Relating drug-protein interaction network with drug side effects. Bioinformatics 28, i522–i528, https://doi.org/10.1093/bioinformatics/bts383 (2012).
Article PubMed PubMed Central CAS Google Scholar
Zaykin, D. V., Young, S. S. & Westfall, P. H. Using the false discovery rate approach in the genetic dissection of complex traits: a response to Weller et al. Genetics 154, 1917–1918 (2000).
PubMed PubMed Central CAS Google Scholar
Hayashi, S. et al. Nicotine suppresses acute colitis and colonic tumorigenesis associated with chronic colitis in mice. Am J Physiol Gastrointest Liver Physiol 307, G968–978, https://doi.org/10.1152/ajpgi.00346.2013 (2014).
Article PubMed CAS Google Scholar

Download references

Acknowledgements

We thank Dr. Yoshifumi Fukunishi for fruitful discussion. This work is supported by JST PRESTO Grant Number JPMJPR15D8 to Y.Y., JSPS KAKENHI Grant Number 16H05276 to Y.Y. and 16H05276 to M.K., and a Grant-in-Aid for the Cooperative Research Project from Institute of Natural Medicine, University of Toyama in 2014 and 2015 to M.U., Y.Y. and M.K.

Author information

Masahito Umezaki is deceased.

Authors and Affiliations

Medical Institute of Bioregulation, Kyushu University, 3-1-1 Maidashi, Higashi-ku, Fukuoka, Fukuoka, 812-8582, Japan
Ryusuke Sawada
Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
Michio Iwata & Yoshihiro Yamanishi
Division of Chemo-Bioinformatics, Institute of Natural Medicine, University of Toyama, Toyama, 930-0194, Japan
Masahito Umezaki, Yoshihiko Usui & Toshikazu Kobayashi
Division of Gastrointestinal Pathophysiology, Institute of Natural Medicine, University of Toyama, Toyama, 930-0194, Japan
Takaki Kubono, Shusaku Hayashi & Makoto Kadowaki
PRESTO, Japan Science and Technology Agency, Kawaguchi, Saitama, 332-0012, Japan
Yoshihiro Yamanishi

Authors

Ryusuke Sawada
View author publications
You can also search for this author in PubMed Google Scholar
Michio Iwata
View author publications
You can also search for this author in PubMed Google Scholar
Masahito Umezaki
View author publications
You can also search for this author in PubMed Google Scholar
Yoshihiko Usui
View author publications
You can also search for this author in PubMed Google Scholar
Toshikazu Kobayashi
View author publications
You can also search for this author in PubMed Google Scholar
Takaki Kubono
View author publications
You can also search for this author in PubMed Google Scholar
Shusaku Hayashi
View author publications
You can also search for this author in PubMed Google Scholar
Makoto Kadowaki
View author publications
You can also search for this author in PubMed Google Scholar
Yoshihiro Yamanishi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.K. and Y.Y. designed research; R.S., M.I., M.U. and S.H. performed the experiments; Y.U., T.K. and T.K. contributed new analytic tools; M.K. and Y.Y. wrote the paper.

Corresponding author

Correspondence to Yoshihiro Yamanishi.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Sawada, R., Iwata, M., Umezaki, M. et al. KampoDB, database of predicted targets and functional annotations of natural medicines. Sci Rep 8, 11216 (2018). https://doi.org/10.1038/s41598-018-29516-1

Download citation

Received: 08 September 2017
Accepted: 12 July 2018
Published: 25 July 2018
DOI: https://doi.org/10.1038/s41598-018-29516-1

This article is cited by

Effects of maoto (ma-huang-tang) on host lipid mediator and transcriptome signature in influenza virus infection
- Akinori Nishi
- Noriko Kaifuchi
- Hiroaki Kitano
Scientific Reports (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.