Introduction

Warfarin is the most widely used oral anticoagulant worldwide. There were more than 25 million prescriptions for warfarin in 20101 and about 7 to 8 million warfarin treatment visits annually between 2009 and 2014 in the United States2. Despite its high efficacy, warfarin has a narrow therapeutic window and large interpatient variability with a 10- to 20-fold differences in the therapeutic dose required to achieve target International Normalized Ratio (INR). Because of these challenges associated with warfarin use, it is one of the leading causes in emergency department visits and the most often cited cause of drug-related mortality3.

Half of interpatient variability in dose requirement for warfarin could be explained by clinical factors, demographic variables and genetic variants. Of the genetic variants, polymorphisms in cytochrome p450, family 2, subfamily C, polypeptide 9 (CYP2C9), and vitamin K epoxide reductase complex, subunit1 (VKORC1) independently correlates with warfarin therapeutic dose4,5,6. It is estimated that polymorphisms in CYP2C9 and VKORC1 together explain approximately 30% (20ā€“25% for VKORC1; 5ā€“10% for CYP2C9) of the interpatient warfarin dose variance4,5,6,7. In view of the strong genetic effects on warfarin dose, the U.S. Food and Drug Administration (FDA) updated the warfarin product label to instruct how to predict individualized dose based on combined polymorphisms of CYP2C9 and VKORC18. Many pharmacogenetic algorithms integrating clinical, demographic and genetic variables have also been developed to predict the dose required in individual patients9,10,11,12. However, there is a lack of classification for warfarin responses in patients to reflect the genetic influence. In addition, clinical simple tools to identify warfarin sensitivity are needed for clinical use.

In this study, we proposed a classification of warfarin sensitivity based on combined polymorphisms of CYP2C9 and VKORC1. We also developed a clinical algorithm to predict the warfarin sensitivity in patients without laboratory tests for CYP2C9 and VKORC1 polymorphisms using a large and diverse data set with patients around the world. We then compared it with a fixed-dose strategy to determine whether the performance of the algorithm was significantly better. Finally, we validated the algorithm externally with an independent cohort to test how generalizable the algorithm is.

Methods

The Easton cohort

A total of 492 qualified patients with various cardiovascular diseases were enrolled. Among them, 138 patients were on warfarin therapy. This study protocol complied with the Declaration of Helsinki and was approved by Copernicus Group Institutional Review Boards. Written informed consent was obtained from every patient.

The international warfarin pharmacogenetic consortium (IWPC) cohort

IWPC Cohort has been reported previously9,13. Pooled data containing 6922 chronic warfarin users from four continents were downloaded from the PharmGKB website (http://www.pharmgkb.org/downloads/). This data set records detailed de-identified curated data on demographic factors, clinical features, and genetic variables such as CYP2C9 and VKORC1. Multivariate linear regression was employed to impute missing values for height and weight. Specifically, the height variable was modeled based on weight, race, and sex, while the weight variable was based on height, race, and sex. For missing values of the VKORC1 rs9923231, the imputation strategy has been described (TableĀ 1)9, which is based on linkage disequilibrium in VKORC1 and race5. Imputation of rs9923231 genotype was performed in 13.8% of Asians, 3.6% of blacks, and 20.0% of whites.

Table 1 Imputation of VKORC1 rs9923231 genotype.

Study design and sample

The design and selection criteria for the IWPC Study have been detailed elsewhere9. Patients were eligible for the present investigation if they had nonmissing data after imputation on clinical variables, such as warfarin stable dose, height, weight, age, race and use of amiodarone. In addition, we excluded those lacking of CYP2C9 and VKORC1 rs9923231 genotypes, low allele frequency of CYP2C9*5, *6, *11, *13 and *14, and an outlier subject with warfarin stable dose 315ā€‰mg/week. A total of 5542 subjects were included in this study. The study aimed to develop a clinical algorithm to predict the warfarin sensitivity using clinical variables. The study sample size was based on patients with nonmissing data on clinical and genetic covariates in IWPC and Easton cohorts.

Warfarin sensitivity genotyping

Genomic DNA was extracted from buccal swab leukocytes using QiangenQiaCube instrument and MagMAXā„¢ DNA Multi-sample kits and concentrated by magnetic bead-based method. VKORC1 allelic variants rs9923231 (āˆ’1639 Gā€‰>ā€‰A) in the promoter region were sequenced in 428 patients. CYP2C9*1, *2 and *3 were genotyped in 480 patients.

Outcome measurement

Warfarin stable dose (mg/week) and incidence of over-anticoagulation event (INRā€‰>ā€‰5) were used as the major outcome in the Easton cohort. Warfarin stable dose was defined as the dose led to an INR in the therapeutic range (2ā€“3) on at least 3 consecutive INR measurements.

Warfarin sensitivity

Warfarin sensitivity was defined by the combined profile of CYP2C9 *1, *2 and *3 and VKORC1 rs9923231 genotypes based on the FDA warfarin label (TableĀ 2)8. VKORC1 G/G; CYP2C9 *1/*1, VKORC1 G/G; CYP2C9 *1/*2 and VKORC1 A/G; CYP2C9 *1/*1 were three compound genotypes for warfarin normal responders. The rest 15 compound genotypes were deemed warfarin sensitive including sensitive and very sensitive groups (TableĀ 2), which requires reduced warfarin dose. Warfarin sensitivity (normal or sensitive) was a categorical variable inputted into binary logistic regression.

Table 2 Warfarin sensitivity based on genotypes and three ranges of recommended warfarin doses (mg/week) from the USA FDA drug label (COUMADIN, Reference ID: 3022954).

Model selection

Binary logistic regression was used to build a clinical model for predicting warfarin sensitivity. We randomly chose 80% of the eligible patients from IWPC Cohort (stratified according to warfarin sensitivity, for a total of 4433) as the derivation cohort (training data set) to develop logistical regression models. The remaining 20% of the patients (Nā€‰=ā€‰1109) were reserved as the validation cohort (testing data set) to calculate unbiased estimates of correct classification rates. The variables were initially identified based on reported pharmacogenetic dosing algorithm9, including warfarin stable dose, height, weight, race, age, enzyme inducer and use of amiodarone. In addition, fixed-dose approaches based on warfarin label were employed to predict warfarin sensitivity, in which warfarin sensitive was inferred when warfarin stable dose was less than intermediate dose between normal and sensitive responders. In this study, both 31ā€‰mg/week and 32ā€‰mg/week were tested as cutoff values.

Cross-Validation and Model Evaluation

The IWPC data set was randomly partitioned into ten equal parts (folds) for a 10-fold cross-validation (CV). Thus, in each iteration, a model was trained on all but one held-aside folds and then tested on the held-aside fold of the data. The iteration was repeated 10 times and each fold served as a test data set to evaluate the model performance. For model quality assessment, the pseudo r-squared measures, the areas under the ROC Curve (AUC), the overall prediction accuracy, the sensitivity and the specificity of the 10-fold CV were evaluated. Logistic regression models were compared with fixed-dose methods using McNemarā€™s Ļ‡2 test, in which the numbers of times each model had correct predictions were counted for each patient in a group when a discordant pair occurred between the two models.

Statistical analysis

For differences in continuous variables, warfarin stable dose, height, and weight between the derivation and validation cohorts (TableĀ 3) were compared with the Wilcoxon rank-sum test. Warfarin doses in whites, blacks and Asians among different sensitivity groups (TableĀ 4) were calculated with the Kruskal-Wallis test. For categorical variables, Fisher exact test was used in CYP2C9 allele frequencies, Ļ‡2 tests were used for VKORC1 rs9923231 genotype, age, and race. Proportions for the use of amiodarone were compared with the z-test. P valuesā€‰<ā€‰0.05 were considered to be statistically significant. All analyses were conducted with R (version 3.4.4).

Table 3 Demographic and clinical characteristics of the Derivation, Validation and Easton cohorts.
Table 4 Warfarin sensitivity in the IWPC and Easton cohorts.

Results

Basic characteristics of the study cohorts

The characteristics of the patients are shown in TableĀ 3. In the Easton cohort, of 138 patients on long-term warfarin therapy for thromboembolic disorders and other cardiovascular diseases, 106 patients were included for analyses with complete clinical and genotype data. All patients reached the therapeutic warfarin stable dose, defined as the dose of warfarin required to maintain an INR between 2 and 3. The median warfarin stable dose was 27.5ā€‰mg/week. The percentages of patient age less than 50, 50 to 80, and older than 80 were 1.9%, 52.8%, and 45.3%, respectively. Among them, 9.4% (10/106) of patients were concomitantly taking amiodarone.

There were 5542 patients with a median warfarin stable dose of 28.0ā€‰mg/week included for analyses in the IWPC cohort. Th INR for all patients fell within the target range of 1.7 to 3.3, with the majority maintained between 2 and 3. The proportions of patient age in the IWPC cohort less than 50, 50 to 80, and older than 80 were 16.8%, 71.0%, and 12.2%, respectively. The percentage of patients concomitantly taking amiodarone was 4.9% (269/5542).

Warfarin sensitivity profiles of the study cohorts

To better profile the warfarin responses in different individuals, we classified the warfarin sensitivity into 3 categories (normal, sensitive and very sensitive) base on combined polymorphisms in CYP2C9 and VKORC1 in the FDA warfarin label (TableĀ 2). Warfarin stable doses for different warfarin sensitive groups in the IWPC cohort are shown in Fig.Ā 1. There were 44.8% of warfarin sensitive patients overall. In the normal group, VKOCR1 G/G; CYP2C9 *1/*1 (24.6%) and VKORC1 A/G; CYP2C9 *1/*1 (25.4%) were the two most common genotypes, while VKOCR1 G/A; CYP2C9 *1/*2 (6.6%) and VKORC1 A/A; CYP2C9 *1/*1 (25.5%) in the sensitive group and VKOCR1 G/A; CYP2C9 *2/*3 (0.5%) and VKORC1 A/A; CYP2C9 *1/*3 (2.4%) in the very sensitive group were the two most common genotypes, respectively. As shown in TableĀ 4, the median warfarin stable doses (40ā€‰mg/week and 35ā€‰mg/week) for the Black in the normal and sensitive group were higher than those (36ā€‰mg/week and 25ā€‰mg/week) in White (Pā€‰<ā€‰0.001), whereas the median warfarin stable doses (28ā€‰mg/week and 21ā€‰mg/week) for the Asian in the normal and sensitive group were the least among the three races (Pā€‰<ā€‰0.001). In line with this, 95.1% of the Black in the IWPC cohort were normal warfarin responders, in contrast to 74.8% of the Asian being warfarin sensitive (Pā€‰<ā€‰0.001), indicating race is a contributing factor for warfarin sensitivity. In the Easton cohort, the median warfarin stable dose for the normal, sensitive and very sensitive groups were 35.0ā€‰mg/week, 18.5ā€‰mg/week, and 12.0ā€‰mg/week, respectively.

Figure 1
figure 1

Warfarin stable dose across different genotypes. N: normal; S: sensitive; VS: very sensitive.

Incidence of INR >5

It has been shown that the intensity of anticoagulant therapy is the most important risk factor for bleeding, which is the main complication of oral anticoagulant therapy and that hemorrhagic events increase exponentially as the INR increases >5.014. Therefore, the incidence of INR measurements >5 is often used as a surrogate marker for bleeding complications during warfarin therapy3. As shown in TableĀ 5, the average incidence of INR >5 in the sensitive and very sensitive combined group was 1.02, which was nearly 2-fold more frequent than that (0.36) in the normal group in the Easton cohort (Pā€‰=ā€‰0.03), although the median follow-up time was 44 months in the sensitive and very sensitive group compared with 32 months in the normal group (Pā€‰=ā€‰0.05), suggesting warfarin sensitive patients are more prone to bleeding complications.

Table 5 Times of over-anticoagulation events in the Easton cohort.

Comparison of the prediction models

To develop a parsimonious model to predict warfarin sensitivity, stepwise logistic regression was used to identify the important features. In logistic regression analyses, the variables warfarin stable dose, height, weight, race, age, and use of amiodarone were found to be significantly associated with warfarin sensitivity. Moreover, fixed-dose approaches based on the FDA warfarin label were also tested for clinical utility. The performance of the logistic regression and fixed-dose models in the derivation and validation cohorts is shown in TableĀ 6. The logistic regression model provided significantly better prediction than the fixed-dose approach. With a threshold of probability >0.4 for warfarin sensitivity, the accuracy, sensitivity, and specificity in the logistic regression were 0.781, 0.824, and 0.746 for the derivation cohort and 0.795, 0.831, and 0.766 for the validation cohort, while those using the fixed-dose method with a cutoff value of warfarin stable dose less than 31ā€‰mg/week were 0.717, 0.799, and 0.651 for the derivation cohort and 0.743, 0.819, and 0.676 for the validation cohort (Pā€‰<ā€‰0.001), respectively. Similarly, the accuracy, sensitivity, and specificity in the logistic regression with a threshold of probability (Pā€‰>ā€‰0.35) were 0.769, 0.862, and 0.693 for the derivation cohort and 0.784, 0.863, and 0.721 for the validation cohort, while those using the fixed-dose method with a cutoff value of warfarin stable dose less than 32ā€‰mg/week were 0.702, 0.837, and 0.593 for the derivation cohort and 0.743, 0.845, and 0.660 for the validation cohort (Pā€‰=ā€‰0.038), respectively. The AUC was used as an additional metrics for evaluating the logistic regression model. The closer AUC is to 1, the greater is the predictive ability of the model. In the logistic regression model, the AUC was 0.865 for the derivation cohort and 0.878 for the validation cohort. Moreover, the model performance was evaluated by 10-fold CV. The logistic regression model exhibited a consistent predictive power with an AUC of 0.867ā€‰Ā±ā€‰0.014 (TableĀ 6), indicating the strong robustness of the model. To investigate the importance of selected features in the clinical prediction model, the AUC were determined after excluding each variable in the model. The AUC without warfarin stable dose or race by CV were 0.766ā€‰Ā±ā€‰0.109 and 0.823ā€‰Ā±ā€‰0.025, respectively, indicating that warfarin stable dose plays a crucial role in the model.

Table 6 Model performance between the clinical algorithm and fixed-dose approach.

Final model and external validation

To fully utilize the IWPC data set, we pooled the derivation and validation cohorts and rederived a final logistic regression model using the same variables. The odds ratios and pseudo r-squared measures for regression are shown in TableĀ 7. The clinical refinement algorithm is shown in TableĀ 8. We next tested the final logistic regression model using the external Easton cohort (Fig.Ā 2). The AUC was 0.836 for the Easton cohort and 0.867 for the whole IWPC cohort. With the threshold of probabilityā€‰>ā€‰0.4 for warfarin sensitive, the sensitivity and specificity were 0.745 and 0.712 for the Easton cohort, 0.826 and 0.746 for the IWPC cohort, respectively.

Table 7 Regression Odds Ratios and Pseudo R-squared Measures.
Table 8 Logistic regression model for warfarin sensitivity.
Figure 2
figure 2

ROC curve analysis of the IWPC and Easton cohorts using the warfarin sensitivity algorithm.

Discussion

Due to a narrow therapeutic index and interpatient variability, it is of great importance to recognize the warfarin response individually. It has been shown in warfarin pharmacogenetics that genetic variants in CYP2C9 and VKORC1 explain approximately one third of the interpatient dose variance4,5,6 and are more strongly associated with warfarin stable dose than all other known patient factors. Given a strong genetic basis underlying warfarin stable dose, here we proposed a simple clinical classification for warfarin response in individual patients based on the FDA warfarin label to better represent and recapitulate the genetic variants. To better understand the distribution of warfarin sensitivity across different races, we examined the IWPC cohort and found that 95.1% of the Black were normal warfarin responders, while 74.8% of the Asian were warfarin sensitive. This discrepancy is attributing to minor allele frequencies of VKORC1 SNP rs9923231 and CYP2C9 SNP rs1799853 (*2) and rs1057910 (*3) between ethnicities, with more genetic variation among individuals of European descent than in individuals of African and Asian descent13. In addition, although the majority of the Black in IWPC cohort was normal warfarin responders, it is worth noting that rare CYP2C9 variant alleles (CYP2C9*5, *6, *8, and *11) with reduced enzymatic activity contribute to dose variability among African Americans15,16.

Bleeding is the main complication during warfarin therapy, which is well correlated with the incidence of INR measurements >5. Homozygosity for the VKORC1 āˆ’1639 Gā€‰>ā€‰A (A/A) has been reported to be associated with a significantly increased number of INR >5 and occurrence of bleeding events during the first month of therapy, compared to the G/G genotype17. Consistent with this finding, we found that the average incidence of INR >5 in the sensitive and very sensitive combined group was 1.02, compared to 0.36 in the normal group in the Easton cohort (Pā€‰=ā€‰0.03). Moreover, it has been reported that sensitive warfarin responders require pharmacogenomic-guided protocols to achieve well-controlled INR, while normal warfarin responders only need a fixed-dose or clinical protocol to achieve well-controlled INR18. Taken together, these data suggest warfarin sensitive responders are predisposed to out-of-range INR than warfarin normal responders. Therefore, it is reasonable to frequently monitor INR in warfarin sensitive patients.

Since warfarin sensitivity contributes to adverse events including bleeding, recognizing warfarin sensitivity is crucial to patient care. In this study, we developed a clinical model to predict warfarin sensitivity. We foresaw how it could be utilized in several clinical scenarios. First, patients who are at increased risk of bleeding complications, such as history of gastrointestinal tract bleeding or patients who suffer bleeding events while on warfarin therapy. In the case of bleeding, our algorithm may help to determine whether the underlying cause of bleeding was due to warfarin sensitivity as a result of genetic variants. Second, patients who schedule for warfarin withdrawal prior to an invasive procedure. It has been reported that patients with two CYP2C9 variant alleles (CYP2C9*2/*2 or CYP2C9*2/*3), the odds of having an INR of ā‰„1.5 before the planned day of surgery is 8.64 times greater (95% confidence interval 2.25ā€“33.25) than for other patients19. This algorithm could facilitate the patient management by reducing potential harm resulting from either discontinuing warfarin too early (thus predisposing the patient to thrombosis) or stopping it too late (thus at increased risk of perioperative bleeding). In general, if patients with predicted warfarin sensitive, we recommend increasing the frequency to monitor INR compared to those with normal warfarin response and determining genetic variants by warfarin sensitivity tests. Of note, personized genotype-guided warfarin dosing has demonstrated clinical benefits and superior clinical outcomes in major clinical trials20,21,22, future studies will investigate how warfarin sensitivity algorithm can be used along with pharmacogenetic dosing algorithms.

The clinical model we developed includes 6 variables, such as warfarin stable dose and race. The importance of selected features in the prediction model was determined after excluding each variable by the relative AUC changes. The warfarin stable dose was the most important variable to predict warfarin sensitivity. Moreover, compared to the fixed-dose strategy, i.e. warfarin stable dose, the clinical model provided significantly better prediction, especially for specificity. From the clinical point of view, many patients with warfarin stable doses, but unknown of warfarin sensitivity as discussed above in several clinical scenarios could be predicted more accurately by using the clinical model than just by warfarin stable doses, thereby improving clinical outcomes.

Several limitations are present in our study. First, missing genotypes of VKORC1 in the IWPC cohort for some patients were imputed based on linkage disequilibrium5. Missing values for height and weight were imputed using multivariate linear regression. Although these imputation strategies are generally reliable, some errors could have been introduced in the study. Second, as shown in Fig.Ā 1, warfarin resistance (>70ā€‰mg/week) existed in many patients in the IWPC cohort, especially in warfarin normal groups. The polymorphisms in VKORC1 and CYP2C9 associated with warfarin resistance were not explored23,24. Future studies need to identify additional genetic variants to differentiate warfarin normal and resistant patients. Third, additional potential variables affecting the prediction of warfarin sensitivity were not included in the model, such as comorbidities, additional drugā€“drug interactions, and patient behaviors, including diet, exercise, and compliance. Future studies are needed to determine how important these additional potential variables are for the prediction of warfarin sensitivity. Of note, with more variables integrated into the model, better performance is achieved at the risk of overfitting and reduced model simplicity.

In conclusion, according to the FDA warfarin label based on genetic variants in CYP2C9 and VKORC1, we established a clinical classification for warfarin sensitivity. In our Easton cohort, warfarin sensitive caused an increased incidence of out-of-range INR in patients than that of warfarin normal responders. In addition, using data from a large and diverse cohort of patients from IWPC, we developed and validated a clinical algorithm to predict warfarin sensitivity in individual patients. This clinical algorithm performed better than a fixed-dose approach. With the use of this algorithm, it is very likely to facilitate patient care associated with warfarin therapy in a number of clinical scenarios, thereby improving clinical outcomes.