Abstract
Many machine learning techniques provide a simple prediction for drug-drug interactions (DDIs). However, a systematically constructed database with pharmacokinetic (PK) DDI information does not exist, nor is there a machine learning model that numerically predicts PK fold change (FC) with it. Therefore, we propose a PK DDI prediction (PK-DDIP) model for quantitative DDI prediction with high accuracy, while constructing a highly reliable PK-DDI database. Reliable information of 3,627 PK DDIs was constructed from 3,587 drugs using 38,711 Food and Drug Administration (FDA) drug labels. This PK-DDIP model predicted the FC of the area under the time-concentration curve (AUC) within ± 0.5959. The prediction proportions within 0.8–1.25-fold, 0.67–1.5-fold, and 0.5–2-fold of the AUC were 75.77, 86.68, and 94.76%, respectively. Two external validations confirmed good prediction performance for newly updated FDA labels and FC from patients’. This model enables potential DDI evaluation before clinical trials, which will save time and cost.
Similar content being viewed by others
Introduction
A drug-drug interaction (DDI) occurs when the pharmacokinetics (PK) or pharmacodynamics (PD) of the victim drug is changed by a perpetrator drug previously taken or administered in combination. DDIs may lead to products’ withdrawal from the market. For instance, astemizole, a drug for the treatment of allergic symptoms, was withdrawn from the market due to the possibility of prolongation of the QT interval and arrhythmias when combined with cytochrome P450 3A4 (CYP3A4) inhibitors, including grapefruit juice and erythromycin1. Mibefradil, a treatment for hypertension and chronic angina, was withdrawn from the market due to bradycardia and rhabdomyolysis when combined with various cardiovascular drugs, such as beta-blockers or statins2. Likewise, DDIs have been studied as one of the causes of severe adverse reactions occurring in clinical settings3,4. Furthermore, the increasing trend of multi-drug prescriptions increases the possibility of side effects due to DDIs5.
However, despite this importance, numerous DDIs exist, but have not been identified. What is worse, approximately 10% of DDI pairs may have adverse reactions due to DDIs among all combinations of commercially available drugs6. This is because, first, the Food and Drug Administration (FDA) recommends that a clinical trial for DDIs be conducted when drugs affect only, or are affected by, a specific enzyme in an in-vitro study7. High costs and time-consuming clinical trials may be part of the reason for the limited number of known DDIs. Second, the mechanisms by which DDIs occur are very diverse, and each mechanism may be complex, so not all potential DDIs may be detected.
Various machine learning techniques have been developed to predict DDIs to overcome the lack of known DDI pairs. In previous studies8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42, many models have been developed to predict the presence or absence of DDIs, discovering DDI pairs that cause side effects, or classifying the types of DDIs using open source databases (DBs). However, there are clear limitations. First, most models have only provided a simple prediction for the existence or classification of DDIs. These models do not aid in complex clinical decisions, such as precise dose adjustment or alternative drug selection. Predictions about fold change of PK parameters are needed to help physicians and pharmacists, but, to date, there are no models that have been successful in predicting this. Second, a systematically constructed true-negative dataset does not exist. The DDI DB, such as DrugBank, widely used for DDIs prediction, contains information that ‘there is a DDI between drug A and B’, but does not contain information that ‘there is no DDI’. As a result, researchers inevitably have selected random sets of drug pairs thinking there were no interactions9,11,19,20,28,29,30,31,32,34,37,42,43. Certainly, absence of evidence is not evidence of absence. Using a model without good input makes it difficult to obtain reliable output. If the negative set is random, it is difficult to identify the exact cause when unexpected problematic output occurs.
Therefore, a sufficient amount of DDI information containing fold change of PK parameters was collected by hand search from FDA drug labels for high model performance and a reliable PK-DDI DB was constructed (Fig. 1). Using this data, a PK DDI prediction model (PK-DDIP model) is proposed that quantitatively predicts the fold change of drug PK parameters in DDIs. In addition, a standalone application, which provides predicted fold changes and reported fold changes of PK parameters, anatomical therapeutic chemical (ATC) code-based alternative drug choices, and single nucleotide polymorphism (SNP) action information was distributed.
Results
Construction of the PK-DDI DB
A total of 38,711 FDA drug label data for 3,587 prescription drugs was downloaded in an XML format from the DailyMed website. After applying standard operating procedures, 3,627 reliable DDI information as a training dataset for PK-DDIP model, including area under the time-concentration curve (AUC) fold change values, were selected (Supplementary Fig. 1). Of the 3,627 DDIs, 1,189 were positive (765: increase; 424: decrease), and 2,438 were negative. The median fold change value for the AUC was 1.82 (interquartile range (IQR) 1.45–2.8; min-max 1.26-190), 0.55 (IQR 0.31–0.69; min-max 0.03-0.8), and 1.0 (IQR 1.0–1.0; min-max 0.81-1.25) for categories of increase, decrease, and negative, respectively.
Model performance
Figure 2 is the evaluation of the PK-DDIP model against various criteria. Figure 2a shows the DDI class distribution of predicted fold changes and label fold changes of victim drug AUCs. The distribution of DDI class predicted by the model was generally very similar to the DDI class distribution of the label. However, the values predicted by the model were less distributed for a strong DDI class.
Figure 2b more explicitly shows which class the PK-DDIP model predicts. Among the seven classes, classes with a moderate increase, moderate decrease, negative predictions, and a weak increase were accurately predicted. Among them, the prediction accuracy of a negative prediction was 90.2%. In the case of the strong decrease class, although only 22.4% of the pairs were predicted as the correct class, 72.4% of the drug pairs were predicted as the moderate decrease class. However, similar to the problem in Fig. 2a, for the strong increase class, it was confirmed that only 1.2% of the strong increase classes could be correctly predicted, but 79.5% were predicted as the moderate increase class.
The model obtained a 5-fold cross-validation result of a root-mean-squared error (RMSE) of 0.5959 (Supplementary Table 1) and predicted the fold change within 0.5959 on average (Fig. 2c, Fig. 2d). Median percent error of the predicted fold change from the label was 7.9%, with an IQR of 2.4–23.9%. When the PK fold change value was at an extreme level such as the strong increase or strong decrease category, the percent error value tended to increase, whereas in negative or weak increase/decrease, the percent error value was small (Supplementary Table 2). However, this is only an average, the fold change range varies, and the fold change corresponding to a negative or weak class was significantly affected by even a tiny error. Therefore, numerical values were evaluated according to pre-specified DDI classification criteria. Figure 2e shows the accuracy according to five DDI classification criteria. Case 1-1 had the most rigorous evaluation where 75.77% of pairs entered the interval, followed by 86.68% for Case 1-2 and 94.76% for Case 1-3. The accuracy of predicting the exact class (Case 2) and achieving a combined task (Case 3) was 74.33% and 80.29%, respectively.
External validation of model 1: Comparison with newly updated FDA labels
Newly reported DDI information was collected and compared with prediction results from the PK-DDIP model (Table 1). Twenty-one drug pairs contained new information on the change in the AUC of victim drugs. Among the 21 drug pairs, seven of ten DDIs in newly approved drugs were closely predicted and evaluated as ‘good’. Six of 11 newly updated drug labels and the remaining five were evaluated as good, and moderate, respectively. All 21 drug pairs achieved at least a ‘moderate’ grade, which meant that the PK-DDIP model satisfactorily predicted the DDI direction and extent.
External validation of model 2: Comparison with real patients’ results
Data from 8,684 patients who used tacrolimus at a tertiary hospital were collected (Supplementary Fig. 2). After excluding patients without tacrolimus lab data, the 18 eligible cohorts included 2,143 patients. The value of correlation coefficient (R-squared) between predicted values and observed values was 76.9% (Fig. 3). The 11 of 18 DDI pairs were satisfactorily predicted except for the failure of the 7 pairs (atorvastatin, ciprofloxacin, esomeprazole, finasteride, fluconazole, fluvastatin, and rifampin). The significant differences between the two values were observed as follows. (Predicted value: observed value = 1.08:0.69 [atorvastatin], 1.72:0.9 [ciprofloxacin], 0.97:1.25 [esomeprazole], 1:1.44 [finasteride], 2.03:1.51 [fluconazole], 1.14:0.83 [fluvastatin], and 0.29:0.54 [rifampin])
Standalone application
A standalone application provided the following features (Fig. 4). Users may search by entering perpetrator and victim drugs. In a protein-protein interaction network, the application helps the user identify the fold change and polypeptide bond relationship between the perpetrator drug and the victim drug, which can also be shown graphically as a relational network. The application also provides SNP action information of the entered perpetrator drug and victim drug. The table provides information on the Korean allele frequency (wild type allele and variant allele) of SNPs related to the drugs. Furthermore, it contains information on the clinical impacts of the variant that are expected, thereby enabling clinicians to get useful insights into how often side effects from DDI will occur. The application also helps identify alternative drugs when the fold change is more significant than expected. This table provides the fold change for alternative drug pairs. The suggestion for an alternative pair of drugs was proposed as a result of selecting drugs with the same 4th ATC level of perpetrator/victim drug.
Discussion
A machine learning model was proposed for the quantitative prediction of DDIs with high accuracy while constructing a highly-reliable PK-DDI DB. The PK-DDIP model satisfactorily predicted PK parameter fold changes when a perpetrator and a victim drug were simultaneously administered. The in-sample mean-squared error was 0.2494 and the RMSE of 5-fold cross-validation was 0.5959. Traditionally, a drug interaction between two drugs was determined by conducting clinical trials or designing a physiologically based pharmacokinetic (PBPK) model. It was impossible to perform drug interaction tests for all possible drug pairs. Our machine learning approach to predict PK parameter fold change is not constrained to a given gene list and thus improves model flexibility. It is expected that this model will enable the evaluation of potential interactions before performing human-based trials, which will significantly save time and cost. Furthermore, it will be possible to use machine learning techniques in evaluating DDIs for new drug approval.
One of the strengths of this study is that reliable data sources were manually collected and utilized. There are several well-known DBs widely used for developing models to predict DDIs, including DrugBank44, Kyoto Encyclopedia of Genes and Genomes45, NDF-RT46, and BioSNAP47 data with positive DDIs used for classification prediction8,9,10,11,13,14,15,17,18,19,20,21,22,23,24,25,29,31,32,34,35,36,37,38,40,43,48. SemMedDB contains DDI information extracted from PubMed49. TWOSIDES DB provides PD DDI information that utilizes DDI side effects data from the FDA adverse event reporting system50. SemMedDB and TWOSIDES have also been commonly used for classification prediction12,15,16,20,23,27,28,30,33,41,42,48. For the DeepDDI DB presented by Ryu et al.,36 DDIs are classified into 86 types by processing DDI information provided by DrugBank. The DeepDDI DB has been widely used in the development of multi-classification task models20,28,30,39,41,42,51. Most DDI information in these DBs often contains a large amount of data that have not been tested in actual clinical trials, only providing speculative information based on each drugs’ metabolic information or mechanical characteristics. For this reason, there is no information that includes drug pairs without an interaction (a true negative dataset). The PK-DDI DB was constructed from reliable FDA labels, whose data was based on actual clinical trials. Furthermore, the PK-DDI DB contains negative DDI pairs where DDIs did not occur and is the DB containing PK parameter change information. We hope this freely available PK-DDI DB will be widely used for various DDIs prediction studies to increase model performance. A more accurate and predictive model may be implemented if PK-DDI DB updated information is continuously collected. Another strength is that this PK-DDIP model has been externally validated by comparison with newly updated FDA labels. Furthermore, the results were confirmed with the validation of real-world patients taking drugs.
Quantitative prediction of drug exposure has usually been performed by using a PBPK model52,53,54,55. However, the PBPK method has poor versatility because it predicts a method limited to a specific drug or enzymes. Additionally, it takes a lot of time and technical expertise to construct a specific PBPK-based DDI model. The PK-DDIP model performed quantitative DDI prediction for all drugs with better performance for all-available drugs. In PBPK model studies, Waters et al. have evaluated a CYP450-mediated DDI prediction model for therapies used in the oncology setting and obtain RMSE values of 0.24-4.67 (combined model) and 0.20-2.45 (liver only model)52. Another study developed an in-vivo mechanistic static model (IMSM) with an RMSE range of 1.48-5.6553. Tod et al. show that their IMSM model predicts the proportion of predictions within 0.67- to 1.5-fold (corresponding to the Case 1-2) and within 0.5- to 2-fold (corresponding to the Case 1-3) as 79% and 93%, respectively (for DDIs mediated by P-glycoprotein and CYP3A4 only)54. In another study, the proportions are 90% and 99%, respectively (for DDIs mediated by organic anion transporting polypeptides, breast cancer resistance protein, and CYP2C8 only)55. In the PK-DDIP model, the RMSE value was 0.596, similar or lower than that in the PBPK models. The PK-DDIP model showed prediction proportions of 86.7% and 94.8% in Case 1-2 and Case 1-3, respectively, and this result was not limited to specific DDI types.
Using the PK-DDI DB, quantitative AUC fold changes were predicted. Many studies have used a graph neural network to capture molecular structural features of drug pairs that generate DDIs20,30,42,51 and there has been an attempt to express the overall drug network by the occurrence of DDIs, and then extract the properties that cause interactions41. However, there is a fundamental problem with these attempts: they do not understand the underlying mechanisms of why DDIs occur. Although not used in this study, drugs have numerous properties other than their molecular structure; for example, properties, such as indication, adverse effects, molecular weight, and signaling pathway do not depend solely on the molecular structure. Therefore, it may be stated that a drug is an abstract concept that encompasses these characteristics and results. Unfortunately, there is no obvious data explaining why DDIs occur. Therefore, it is essential to use as many features as possible to preserve important information that might be missed, and at the same time, to include features that are expected to have the most significant impact on DDIs. In that respect, the relationship between the substrate, inhibitor, and inducer was analyzed in the description of fold changes in the meta dataset. Using this information, it was confirmed that interaction occurrence was very clearly divided according to the type of action and the AUC fold change was predicted with high accuracy.
However, there were some cases where an exception to this rule occurred, and the prediction failed. Rosuvastatin and fluconazole share cytochrome P450 2C9 and act as an enzyme-substrate and an enzyme inhibitor, respectively56,57. From the model, it would be expected that fluconazole would increase the rosuvastatin AUC fold change; the model predicted the AUC fold change to be 1.79. However, the rosuvastatin AUC fold change by fluconazole is 1.1 on the label58. In terms of quantitative modeling, it cannot be assumed that all enzymes cause an interaction, so it may be estimated that cytochrome P450 2C9 induces a wimpy interaction. Alternatively, cytochrome P450 2C9 mediates an interaction, but the effect is very insignificant, and it is speculated that some reaction between rosuvastatin and fluconazole inhibits the interaction. Another prediction error between ramelteon and fluvoxamine was caused by not having high enough expectations. These two drugs share a total of four CYP enzymes (1A2, 2C19, 2C9, and 3A4)59,60. Since the victim drug has all of these enzymes as a substrate and all of the perpetrator drugs have inhibitors, it was inferred that fluvoxamine would greatly raise the ramelteon AUC fold change. The model also expected a high ramelteon AUC fold change at 3.51. Nevertheless, the ramelteon AUC fold change by fluvoxamine is 190 on the label61, which is exceptionally high. Likewise, it was speculated that there were some other reactions or causes that need to be additionally considered. If many drug pairs share various enzymes or if an unknown influence is identified by adding various drug features to the features used for modeling, it will be possible to improve irregular pairs. However, this must be considered very carefully, as dimensionality is deeply bound by quantitative modeling.
It was not the goal for this model to be used for individual-level patient care. The PK-DDIP model was developed based on the PK-DDI DB from the FDA drug label. The PK fold change in the newly updated FDA label was predicted with high accuracy. However, the recommendation in the FDA drug label rely on a mean population recommendation, sometimes performed in healthy volunteers whom do not always reflect patient characteristics. Furthermore, tacrolimus is characterized by a wide range of inter-individual variability in its bioavailability62. Various clinical conditions have also been reported to affect tacrolimus PK, such as concomitant drugs, genotypes (e.g., SNPs), diet, and clinical values63,64. For this reason, the patients pool observed in a tertiary hospital clinical data warehouse (CDW) could be quite different from the ideal patient group, which might have caused the differences in predicted AUC fold change and the observed fold change for actual patients. Therefore, caution should be exercised when applying this model directly to a patient. Instead, we could reinforce therapeutic drug monitoring (TDM) of narrow therapeutic index (NTI) drugs such as tacrolimus after adding a perpetrator drug which is predicted to have a significant effect on the NTI drug.
Despite the several strengths, this study also has several important limitations. First, the DDI information was collected under specific controlled conditions in clinical trials. Different doses or usages usually result in different DDI results65. For instance, the effect of rifampin on the atorvastatin differs depending on the timing of administration and the duration of concurrent use66. Steady-state rifampin markedly decreases exposure when administration is separated, but slightly increases exposure when both drugs are given simultaneously; whereas, single-dose rifampin markedly increases exposure. However, it is not possible to predict different results when these conditions change; thus, the model only provides a rough average estimate. Second, the purpose of the study was to quantitatively predict the PK parameter fold change. The PK parameter fold change does not always result in a PD change or a side-effect occurrence. Therefore, another study might be needed to reveal the correlation between PK-PD. Lastly, the PK-DDIP model showed a poor performance in predicting extreme changes in AUC, reinforced TDM in the case of using a strong perpetrator drug is needed.
In conclusion, a PK-DDIP model was developed for the quantitative prediction of DDIs while constructing a PK-DDI DB. It is expected that many future studies will be conducted using the PK-DDIP model and PK-DDI DB. Further research is needed to elucidate the specific mechanisms of DDIs and improve model limitations.
Methods
DDI information data source
FDA drug labels were used to build a reliable DDI dataset as a training and validation dataset. The data were considered to be reliable in the following cases: 1) results of clinical trials performed on human subjects, 2) results using PBPK models. DDI information that has not been tested in actual clinical trials, only providing speculative information and not having any apparent fold change was not considered reliable. DDI information was collected using the DailyMed website (https://dailymed.nlm.nih.gov/dailymed/), which contains information on insert papers of drugs approved by the FDA. Information was extracted from 38,711 FDA drug labels (accessed date: May 2020), and the extracted items included: perpetrator drug, victim drug, and the AUC fold change. The perpetrator drug was a drug that caused a DDI, and a victim drug was affected by the interaction. If the perpetrator was reported to not affect the victim drug AUC, the AUC fold change value was 1. If the fold change had a ratio value other than 1, it meant that the ratio of the victim drug AUC with a perpetrator drug was divided by the one without the perpetrator drug. DDI information was manually obtained from sentences, figures, and tables in the clinical pharmacology and drug interaction sections in the drug label and collected by ten trained researchers. All data were reviewed twice by HJ and JS. The data were further processed based on principles discussed by the researchers to ensure data consistency.
Data preprocessing
To determine the fold change of PK parameters in DDIs required for learning, the following data pre-processing standard operation procedure was defined as below. a) Absent a fold change in AUC: Cases were excluded that suggested an interaction between a drug class and a specific drug without presenting an apparent fold change (e.g.: ‘Antacids reduce the AUC of drug A’). b) Various dose and dosing times: If there were different AUC fold change values according to different dosing times, the AUC fold change values with simultaneous dosing times were selected. If there were different values according to different drug doses, results using the standard dose were selected. However, if several doses were standard, the geometric mean of the reported AUC fold changes was used. c) Interaction among three or more drugs: If drug A and drug B acted as perpetrator drugs and drug C acted as a victim drug, the features of drug A and drug B were combined and a new drug entity was assigned.
Feature extraction
All drug features used in model development were obtained from the March 2021 version of DrugBank44. The most important feature was the binding relationship between the drug and the target/enzyme/carrier/transporter. This data was combined with information about polypeptide targets, enzymes, carriers, or transporters that ultimately were used to speculate which polypeptides might be involved in physiological effects and mechanisms of action. Polypeptides were identified using the Universal Protein Resource (UniProt) identifier. The gene name of a polypeptide, such as prostaglandin-endoperoxide synthase 1 (PTGS1), did not always have a one-to-one correspondence to the UniProt ID. If so, the gene name was manually matched. The second most important feature is its pharmacodynamics. The DrugBank defines the following 11 types of PD for drug pairs. We classified a drug pair as 11 types of PD action; otherwise, it was classified as negative PD (decrease adverse effects, decrease dynamics, decrease specific adverse effects, decrease specific effects, decrease therapeutic efficacy, increase adverse effects, increase dynamics, increase risk of hypersensitivity, increase specific adverse effects, increase specific effects, increase therapeutic efficacy).
Feature vector encoding
The structure of a general sentence describing DDI was reflected when expressing the relationship between perpetrator and victim drugs. For example, DDIs are often described as follows: ‘The metabolism of silodosin can be decreased when combined with luliconazole67. Silodosin undergoes extensive metabolism via oxidative pathways mediated by CYP3A4. Potent CYP3A4 inhibitors may interfere with silodosin metabolism, resulting in increased serum concentrations of the drug and an elevated risk for developing drug-related adverse effects. Co-administration of 8 mg of silodosin and 400 mg of ketoconazole led to a 3.2-fold increase in AUC of silodosin68.’ As demonstrated in the description, CYP3A4 mediated the DDI between the perpetrator drug luliconazole and the victim drug silodosin, leading to a moderate effect on the increase of silodosin AUC. Furthermore, there is a substantial interplay between PK and the PD69. There is a PD interaction between luliconazole and silodosin, where the resulting drug concentration of silodosin at the site of action ultimately contributes to the PD response. The label DDI data were arranged where the perpetrator drug and the victim drug were centered, and various polypeptides/PD interactions were radially linked. Assuming that DDIs occurred according to the relationship of polypeptides and PD interactions linked to each drug, the primary information was summarized as the following four types.
-
P-1 The linkage between each drug and polypeptide (e.g., related polypeptide in each perpetrator or victim drug)
-
P-2 Types of polypeptides belonging to the drug (e.g., target, enzyme, carrier, or transporter)
-
P-3 How the drugs affected each other’s polypeptides (e.g., inhibitor, inducer, or substrate)
-
P-4 Types of PD action
Therefore, a polypeptide-PD-drug-type (PPDT) tokenization was proposed that as much as possible expressed DDI flexibly while reflecting the causal relationship. PPDT tokenization combined polypeptide, PD interaction, drug, and type into one word. For example, silodosin has {ATP binding cassette subfamily B member 1 [ABCB1], ABCB4, aldo-keto reductase family 1 member A1 [AKR1A1], aldehyde dehydrogenase 2 family member [ALDH2], CYP3A4, and UDP glucuronosyltransferase family 2 member B7 [UGT2B7]} as substrates and {adrenoceptor alpha 1A [ADRA1A], ADRA1B, and ADRA1D} as inhibitors67. However, luliconazole has {CYP2C19, CYP3A4, and lanosterol 14-alpha-demethylase [ERG11]} as inhibitors70. Therefore, the DDI of silodosin and luliconazole would be 13 tokens, including “CYP3A4PDinh CYP2C19PDinh ERG11PDinh CYP3A4VDsub UGT2B7VDsub ALDH2VDsub ADRA1AVDinh ADRA1BVDinh ADRA1DVDinh ABCB1VDsub ABCB4VDsub AKR1A1VDsub decrease_dynamics” (PD, perpetrator drug; VD, victim drug; inh, inhibitor; sub, substrate; decrease_dynamics. pharmacodynamic interaction type). Each token contained information on the relationship between each drug and polypeptide (P-1). The second information type, the type of polypeptide belonging to each drug, was reflected (P-2). The third information type included how the drugs affected each other’s polypeptides (P-3). The last information type included whether there is PD action or not (P-4). The extracted DDI information was PPDT-tokenized to obtain one document with 3,627 sentences with 2,830 unique tokens. After that, each DDI was encoded into a 2,830-dimensional vector through normalization using Term Frequency–Inverse Document Frequency (tf-idf).
PK-DDIP model development
The AUC was selected as a proper PK parameter for prediction because various PK studies frequently rely on PK measures, such as the AUC to assess the extent of systemic exposure71. Therefore, the PK-DDIP model focused on predicting the effect of the perpetrator drug on the victim drug AUC. Of the extracted DDI information, 3627 pairs with feature information were used for training and tested. Log2 was used to transform the fold change to relax the distribution of values and then it was used for training. The regression bagged (bootstrap aggregation) trees method, which had the best performance, was used as the application model. The data characteristics needed to learn were as follows: First, it had a high dimension compared to the size of the data. Second, there were outliers on a specific range (strong increase/decrease). Small trees in bagged trees increased the level of decision-making. Therefore, an average performance improvement was expected because the abrupt change in variance by outliers did not significantly impair the tree bias. In addition, the lack of data was partially resolved through bootstrap aggregation, which repeatedly selected sample data. The tree consisted of 615 branches, and there were 308 nodes for which fold change values were determined. According to a decision rule, a branch was responsible for moving instances to the next level. If there was no branch in the node, the instance that arrived at the node had the fold change suggested by the node. If one DDI was predicted, it was decided as one fold change out of 308 values. The results of many decision trees were combined because individual decision trees tended to overfit. Therefore, the tree used bootstrap data samples to grow a decision tree from the ensemble. This process enabled an increase in the number of samples to 3627, which was small compared to 2830 dimensions. The number of trained learners in the ensemble was 30.
Predictive model evaluation
Model evaluation was performed by calculating the difference between the fold change value presented in the label and the predicted value as the RMSE. The model was developed to have the lowest RMSE value. Further evaluation of model performance was conducted in two ways. First, DDIs were classified into several classes according to strength and whether the model predicted it correctly was evaluated using the modified FDA’s classification criteria7 as follows:
-
A strong perpetrator drug increased the AUC of a victim drug ≥ 5-fold.
-
A moderate perpetrator drug increased the AUC of a victim drug by ≥ 2- to <5-fold.
-
A weak perpetrator drug increased the AUC of a victim drug by ≥ 1.25- to <2-fold.
-
A strong perpetrator drug decreased the AUC of a victim drug by ≥ 80 percent.
-
A moderate perpetrator drug decreased the AUC of a victim drug by ≥ 50 to <80 percent.
-
A weak perpetrator drug decreased the AUC of a victim drug by ≥ 20 to <50 percent.
Furthermore, the fold change between 0.8- and 1.25-fold, which did not belong to any of the criteria, was defined as negative. Another model evaluation examined whether the predicted value was located within the pre-defined range. The numerical values were evaluated according to the following criteria, where FClab is the label AUC fold change, FCpre is the predicted AUC fold change, Classlab is the label class, and Classpre is the predicted class.
-
Case 1-1: {0.8 × FClab ≤ FCpre} ∧ {1.25 × FClab ≥ FCpre}
-
Case 1-2: {0.67 × FClab ≤ FCpre} ∧{1.5 × FClab ≥ FCpre}
-
Case 1-3: {0.5 × FClab ≤ FCpre} ∧{2 × FClab ≥ FCpre}
-
Case 2: Classlab = Classpre
-
Case 3: Case 1-1 ∨ Case 2
The fold change uses the equation FC = (victim drug AUC in the presence of perpetrator)/(victim drug AUC in the absence of the test perpetrator). The FCs in decreased fold change of victim drug AUC are distributed between (0, 1), while the FCs in increased fold change of victim drug AUC are distributed between (1, ∞). Therefore, to compare the appropriate FC ranges, the values were compared by taking the inverse of the decreased fold change of victim drug AUC in DDIs in the (0, 1) interval. The interval range [0.8, 1.25] of Case 1-1 refers to the negative interval of the FDA, and the criteria [0.67, 1.5] and [0.5, 2] is used refer to the criteria in the PK DDI prediction study using a PBPK model55.
Predictive model external validation
External validation of the predictive model was performed in two ways. First, newly updated FDA labels containing the DDI information section were collected using the ‘Drug Safety-related Labeling Changes’ search platform (https://www.accessdata.fda.gov/scripts/cder/safetylabelingchanges/) (assessed date: Jul 2021). The date range was set after the learning data collection period (after May 2020). Among them, label information that reported AUC changes of victim drugs were extracted and compared to the results predicted by the PK-DDIP model. The prediction results were defined as ‘good’ if AUC fold changes of the PK-DDIP model prediction were between 0.8- and 1.25-fold change of the newly updated drug label. If the only classification of whether the AUC fold change was >1 (or <1) was successful (meaning that the PK-DDIP model correctly predicted increase/decrease in AUC of victim drug), then the prediction results were considered ‘moderate’.
Another validation of the PK-DDIP model was performed by examining whether the prediction results of the PK-DDIP model were clinically observed in patients. The main object of this retrospective real-world study was to determine if there was a change in the blood concentration of the victim drug before and after taking the perpetrator drug.
Tacrolimus was selected as a victim drug for validation. Tacrolimus is an immunosuppressant drug widely used in most organ transplants, and its concentration is measured at regular intervals under a TDM system in a tertiary hospital72. For the list of perpetrator drugs, 18 drugs were selected by the research team among drugs predicted to affect the tacrolimus blood concentration. The study population included all patients who had been treated with tacrolimus using a CDW at a tertiary hospital between 2001 and 2021. The tacrolimus trough blood concentration is generally used as a simplified marker of drug exposure, and this correlates well with the AUC73,74. Therefore, the measured tacrolimus trough blood level was collected just before and every 1–2 days after initiating perpetrator drugs for seven days75. Because the physician applied adjusted doses of tacrolimus in response to changed blood levels, the tacrolimus concentration/dose (C/D) ratio was calculated seven days after initiating perpetrator drugs when the tacrolimus blood levels had stabilized. The changed C/D ratio at day seven was divided by the one at day 0, and the obtained value was compared with the prediction value from the PK-DDIP model. Approval from the Institutional Review Board at Seoul National University hospital was obtained prior to collecting and analyzing the data (IRB No. 2107-233-1240). Written informed consent was not required for CDW-based studies using anonymized data.
Standalone application
The following additional information for an all-around understanding of the user’s DDI was provided in a standalone application. The application provides SNP actions from DrugBank and its allele frequency in Koreans obtained from KRGDB76 SNPs associated with drug activity or metabolism can affect pharmacological activity. The SNP information of each perpetrator or victim drug can be considered for dose escalation or therapy change when administering the drug to a patient. The application additionally recommends alternative drugs whose ATC 4th levels are same with perpetrator or victim drugs. The hierarchical structure of ATC can be found at https://www.whocc.no.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
The PK-DDI DB is freely available from Github repository: https://github.com/harryscpt/pk-ddip. The standalone application can be operated by connecting to https://pk-ddi.snu.ac.kr/en. Access to the standalone application website is possible after obtaining an access code through a request to the corresponding author, JO.
Code availability
The source code used to develop our PK-DDIP model is available from Github repository: https://github.com/harryscpt/pk-ddip.
References
Broadhurst, P. & Nathan, A. W. Cardiac arrest in a young woman with the long qt syndrome and concomitant astemizole ingestion. Br. Heart J. 70, 469–470 (1993).
Schmassmann-Suhijar, D., Bullingham, R., Gasser, R., Schmutz, J. & Haefeli, W. E. Rhabdomyolysis due to interaction of simvastatin with mibefradil. Lancet 351, 1929–1930 (1998).
Kongkaew, C., Noyce, P. R. & Ashcroft, D. M. Hospital admissions associated with adverse drug reactions: a systematic review of prospective observational studies. Ann. Pharmacother. 42, 1017–1025 (2008).
Leape, L. L. et al. Systems analysis of adverse drug events. ADE Prevention Study Group. JAMA 274, 35–43 (1995).
Kantor, E. D., Rehm, C. D., Haas, J. S., Chan, A. T. & Giovannucci, E. L. Trends in prescription drug use among adults in the United States from 1999-2012. JAMA 314, 1818–1831 (2015).
Liu, R. et al. Data-driven prediction of adverse drug reactions induced by drug-drug interactions. BMC Pharmacol. Toxicol. 18, 44 (2017).
FDA. Clinical drug interaction studies—Cytochrome P450 enzyme-and transporter-mediated drug interactions guidance for industry. Available at: fda.gov/media/134581/download. Accessed March 6 2020.
Vilar, S. et al. Drug—drug interaction through molecular structure similarity analysis. J. Am. Med. Inform. Assoc. 19, 1066–1074 (2012).
Huang, J. et al. Systematic prediction of pharmacodynamic drug-drug interactions through protein-protein-interaction network. PLoS Comput. Biol. 9, e1002998 (2013).
Cami, A., Manzi, S., Arnold, A. & Reis, B. Y. Pharmacointeraction network models predict unknown drug-drug interactions. PLoS One 8, e61468 (2013).
Cheng, F. & Zhao, Z. Machine learning-based prediction of drug–drug interactions by integrating drug phenotypic, therapeutic, chemical, and genomic properties. J. Am. Med. Inform. Assoc. 21, e278–e286 (2014).
Zhang, P., Wang, F., Hu, J. & Sorrentino, R. Label propagation prediction of drug-drug interactions based on clinical side effects. Sci. Rep. 5, 12339 (2015).
Park, K., Kim, D., Ha, S. & Lee, D. Predicting pharmacodynamic drug-drug interactions through signaling propagation interference on protein-protein interaction networks. PLoS One 10, e0140816 (2015).
Sridhar, D., Fakhraei, S. & Getoor, L. A probabilistic approach for collective similarity-based drug–drug interaction prediction. Bioinformatics 32, 3175–3182 (2016).
Zhang, W. et al. Predicting potential drug-drug interactions by integrating chemical, biological, phenotypic and network data. BMC Bioinforma. 18, 18 (2017).
Shi, J.-Y., Shang, X.-Q., Gao, K., Zhang, S.-W. & Yiu, S.-M. An integrated local classification model of predicting drug-drug interactions via Dempster-Shafer theory of evidence. Sci. Rep. 8, 11829 (2018).
Yu, H. et al. Predicting and understanding comprehensive drug-drug interactions via semi-nonnegative matrix factorization. BMC Syst. Biol. 12, 14 (2018).
Deepika, S. & Geetha, T. A meta-learning framework using representation learning to predict drug-drug interaction. J. Biomed. Inform. 84, 136–147 (2018).
Rohani, N. & Eslahchi, C. Drug-drug interaction predicting by neural network using integrated similarity. Sci. Rep. 9, 13645 (2019).
Chen, X., Liu, X. & Wu, J. Drug-drug interaction prediction with graph representation learning. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 354–361 (2019).
Purkayastha, S., Mondal, I., Sarkar, S., Goyal, P. & Pillai, J. K. Drug-drug interactions prediction based on drug embedding and graph auto-encoder. In 2019 IEEE 19th International Conference on Bioinformatics and Bioengineering (BIBE), 547–552 (2019).
Zhang, W. et al. SFLLN: a sparse feature learning ensemble method with linear neighborhood regularization for predicting drug–drug interactions. Inf. Sci. 497, 189–201 (2019).
Karim, M. R. et al. Drug-drug interaction prediction based on knowledge graph embeddings and convolutional-LSTM network. In Proceedings of the 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, 113–123 (2019).
Song, D. et al. Similarity-based machine learning support vector machine predictor of drug-drug interactions with improved accuracies. J. Clin. Pharm. Ther. 44, 268–275 (2019).
Liu, S., Huang, Z., Qiu, Y., Chen, Y.-P. P. & Zhang, W. Structural network embedding using multi-modal deep auto-encoders for predicting drug-drug interactions. In 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 445–450 (2019).
Zhang, C. & Zang, T. CNN-DDI: A novel deep learning method for predicting drug-drug interactions. In 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 1708–1713 (2020).
Zhu, J., Liu, Y. & Wen, C. MTMA: Multi-task multi-attribute learning for the prediction of adverse drug–drug interaction. Knowl.-Based Syst. 199, 105978 (2020).
Chen, X., Liu, X. & Wu, J. GCN-BMP: Investigating graph representation learning for DDI prediction task. Methods 179, 47–54 (2020).
Huang, K., Xiao, C., Hoang, T., Glass, L. & Sun, J. Caster: Predicting drug interactions with chemical substructure representation. Proc. AAAI Conf. Artif. Intell. 34, 702–709 (2020).
Wang, H., Lian, D., Zhang, Y., Qin, L. & Lin, X. GoGNN: Graph of graphs neural network for predicting structured entity interactions. Proc. Twenty-Ninth Int. Jt. Conf. Artif. Intell. 183, 1317–1323 (2021).
Seo, S., Lee, T., Kim, M.-H. & Yoon, Y. Prediction of side effects using comprehensive similarity measures. BioMed. Res. Int. 2020, 1357630 (2020).
Mei, S. & Zhang, K. A machine learning framework for predicting drug-drug interactions. Sci. Rep. 11, 17619 (2021).
Schwarz, K., Allam, A., Perez Gonzalez, N. A. & Krauthammer, M. AttentionDDI: Siamese attention-based deep learning method for drug–drug interaction predictions. BMC Bioinforma. 22, 412 (2021).
Zhang, J., Yang, X. & Qian, Y. Drug-drug interaction prediction with common structural patterns. In 2021 International Joint Conference on Neural Networks (IJCNN), 1–7 (2021).
Gottlieb, A., Stein, G. Y., Oron, Y., Ruppin, E. & Sharan, R. INDI: a computational framework for inferring drug interactions and their associated recommendations. Mol. Syst. Biol. 8, 592 (2012).
Ryu, J. Y., Kim, H. U. & Lee, S. Y. Deep learning improves prediction of drug–drug and drug–food interactions. Proc. Natl Acad. Sci. 115, E4304–E4311 (2018).
Celebi, R. et al. Evaluation of knowledge graph embedding approaches for drug-drug interaction prediction in realistic settings. BMC Bioinforma. 20, 726 (2019).
Lee, G., Park, C. & Ahn, J. Novel deep learning model for more accurate prediction of drug-drug interaction effects. BMC Bioinforma. 20, 415 (2019).
Hou, X., You, J. & Hu, P. Predicting drug-drug interactions using deep neural network. In Proceedings of the 2019 11th International Conference on Machine Learning and Computing, 168–172 (2019).
Deng, Y. et al. A multimodal deep learning framework for predicting drug–drug interaction events. Bioinformatics 36, 4316–4322 (2020).
Dai, Y., Guo, C., Guo, W. & Eickhoff, C. Drug–drug interaction prediction with Wasserstein adversarial autoencoder-based knowledge graph embeddings. Briefings Bioinformatics 22, bbaa256 (2021).
Wang, Y., Min, Y., Chen, X. & Wu, J. Multi-view graph contrastive representation learning for drug-drug interaction prediction. In Proceedings of the Web Conference 2021, 2921–2933 (2021).
Ferdousi, R., Safdari, R. & Omidi, Y. Computational prediction of drug-drug interactions based on drugs functional similarities. J. Biomed. Inform. 70, 54–64 (2017).
Wishart, D. S. et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 46, D1074–D1082 (2018).
Kanehisa, M., Goto, S., Furumichi, M., Tanabe, M. & Hirakawa, M. KEGG for representation and analysis of molecular networks involving diseases and drugs. Nucleic acids Res. 38, D355–D360 (2010).
Brown, S. H. et al. VA National Drug File Reference Terminology: a cross-institutional content coverage study. Stud. Health Technol. Inform. 107, 477–481 (2004).
Leskovec, J. & Sosicˇ, R. SNAP: A general-purpose network analysis and graph-mining library. ACM Trans. Intell. Syst. Technol. 8, 1 (2016).
Kastrin, A., Ferk, P. & Leskošek, B. Predicting potential drug-drug interactions on topological and semantic similarity features using statistical learning. PLoS One 13, e0196865 (2018).
Kilicoglu, H., Shin, D., Fiszman, M., Rosemblat, G. & Rindflesch, T. C. SemMedDB: a Pubmed-scale repository of biomedical semantic predications. Bioinformatics 28, 3158–3160 (2012).
Tatonetti, N. P., Ye, P. P., Daneshjou, R. & Altman, R. B. Data-driven prediction of drug effects and interactions. Sci. Transl. Med. 4, 125ra31 (2012).
Xu, N., Wang, P., Chen, L., Tao, J. & Zhao, J. MR-GNN: Multi-resolution and dual graph neural network for predicting structured entity interactions. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 3968–3974 (2019).
Waters, N. J. Evaluation of drug–drug interactions for oncology therapies: in vitro–in vivo extrapolation model-based risk assessment. Br. J. Clin. Pharmacol. 79, 946–958 (2015).
Tod, M., Pierrillas, P. B., Bourguignon, L. & Goutelle, S. Comparison of the static in vivo approach to a physiologically based pharmacokinetic approach for metabolic drug–drug interactions prediction. Int. J. Pharmacokinet. 1, 25–34 (2016).
Tod, M., Goutelle, S., Bleyzac, N. & Bourguignon, L. A generic model for quantitative prediction of interactions mediated by efflux transporters and cytochromes: Application to p-glycoprotein and cytochrome 3A4. Clin. Pharmacokinet. 58, 503–523 (2019).
Tod, M., Bourguignon, L., Bleyzac, N. & Goutelle, S. Quantitative prediction of interactions mediated by transporters and cytochromes: Application to organic anion transporting polypeptides, breast cancer resistance protein and cytochrome 2C8. Clin. Pharmacokinet. 59, 757–770 (2020).
DrugBank online. Rosuvastatin. https://go.drugbank.com/drugs/DB01098. (accessed date: March 2021).
DrugBank online. Fluconazole. https://go.drugbank.com/drugs/DB00196. (accessed date: March 2021).
DailyMed. Drug label: Rosuvastatin (CRESTOR). https://dailymed.nlm.nih.gov/dailymed/drugInfo.cfm?setid=325a5d0e-9a72-4015-9fcd-1655fb504cee. (accessed date: May 2020).
DrugBank online. Ramelteon. https://go.drugbank.com/drugs/DB00980. (accessed date: March 2021).
DrugBank online. Fluvoxamine. https://go.drugbank.com/drugs/DB00176. (accessed date: March 2021).
DailyMed. Drug label: Ramelteon (ROZEREM). https://dailymed.nlm.nih.gov/dailymed/drugInfo.cfm?setid=9de82310-70e8-47b9-b1fc-6c6848b99455. (accessed date: May 2020).
Lampen, A. et al. Metabolism of the immunosuppressant tacrolimus in the small intestine: Cytochrome P450, drug interactions, and interindividual. (1995).
Iwasaki, K. Metabolism of tacrolimus (FK506) and recent topics in clinical pharmacokinetics. Drug Metab. pharma-cokinetics 22, 328–335 (2007).
Passey, C. et al. Dosing equation for tacrolimus using genetic variants and clinical factors. Br. J. Clin. Pharmacol. 72, 948–957 (2011).
Tornio, A., Filppula, A. M., Niemi, M. & Backman, J. T. Clinical studies on drug–drug interactions involving metabolism and transport: Methodology, pitfalls, and interpretation. Clin. Pharmacol. Ther. 105, 1345–1361 (2019).
Baxter, K. & Preston, C. L. Stockley’s Drug Interactions, vol. 495 (Pharmaceutical Press, London, 2010).
Drugbank online. Silodosin. https://go.drugbank.com/drugs/DB06207. (accessed date: March 2021).
DailyMed. Drug label: Silodosin (RAPAFLO). https://dailymed.nlm.nih.gov/dailymed/drugInfo.cfm?setid=da0314f6-b1d6-44ee-8b1e-f763dbb8613f. (accessed date: May 2020).
Sandritter, T. L., McLaughlin, M., Artman, M. & Lowry, J. The interplay between pharmacokinetics and pharmacodynamics. Pediatr. Rev. 38, 195–206 (2017).
DrugBank online. Luliconazole. https://go.drugbank.com/drugs/DB08933. (accessed date: March 2021).
FDA. Guidance for industry: Bioavailability and bioequivalence studies submitted in NDAs or INDs—General considerations. Rockville, MD: Food Drug Adm. 2014 (2014).
Sohn, M. et al. Whole exome sequencing for the identification of CYP3A7 variants associated with tacrolimus concentrations in kidney transplant patients. Sci. Rep. 8, 18064 (2018).
Staatz, C. E. & Tett, S. E. Clinical pharmacokinetics and pharmacodynamics of tacrolimus in solid organ transplantation. Clin. Pharmacokinet. 43, 623–653 (2004).
Ihara, H. et al. Intra-and interindividual variation in the pharmacokinetics of tacrolimus (FK506) in kidney transplant recipients—importance of trough level as a practical indicator. Int. J. Urol. 2, 151–155 (1995).
Mori, T. et al. Drug interaction between voriconazole and calcineurin inhibitors in allogeneic hematopoietic stem cell transplant recipients. Bone Marrow Transplant. 44, 371–374 (2009).
Jung, K. S. et al. KRGDB: the large-scale variant database of 1722 Koreans based on whole genome sequencing. Database 2020, baaa030 (2020).
Acknowledgements
This research was supported by a grant (19182MFDS407) from Ministry of Food and Drug Safety in 2019-2021. We would like to express our gratitude to the 10 researchers (SM Kim, HY Na, JS Hwang, WG Kang, DH Kim, YJ Shin, WB Lee, JH Lee, YM Jeong, and YJ Hwang) who helped build the PK-DDI DB.
Author information
Authors and Affiliations
Contributions
H.J. and J.S. contributed to conception and design of the study, data acquisition, analysis and interpretation of results, drafted, and revised the manuscript. H.J. and J.S. contributed equally to this work as first authors. J.K. and I.K. contributed to conception and design of the study, interpretation of results, and revised the manuscript. H.L., B.M., and J.O. contributed to conception and design of the study, analysis and interpretation of results, and revised the manuscript. All authors approved the final version to be submitted.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Jang, H.Y., Song, J., Kim, J.H. et al. Machine learning-based quantitative prediction of drug exposure in drug-drug interactions using drug label information. npj Digit. Med. 5, 88 (2022). https://doi.org/10.1038/s41746-022-00639-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41746-022-00639-0
This article is cited by
-
Tribulations and future opportunities for artificial intelligence in precision medicine
Journal of Translational Medicine (2024)
-
Learning motif-based graphs for drug–drug interaction prediction via local–global self-attention
Nature Machine Intelligence (2024)
-
Pharmacokinetics–Pharmacodynamics Modeling for Evaluating Drug–Drug Interactions in Polypharmacy: Development and Challenges
Clinical Pharmacokinetics (2024)
-
Analysis and identification of drug similarity through drug side effects and indications data
BMC Medical Informatics and Decision Making (2023)
-
Recent trends and perspectives of artificial intelligence-based machine learning from discovery to manufacturing in biopharmaceutical industry
Journal of Pharmaceutical Investigation (2023)