Introduction

Proton-pump inhibitors (PPIs) are among the most commonly prescribed medications worldwide, with millions of people using them to manage gastroesophageal reflux disease and peptic ulcers. PPIs irreversibly inhibit the H+/K+-ATPase enzyme in the stomach, thereby decreasing acid secretion1. While PPIs are effective in treating these conditions, recent studies have raised concerns about potential adverse effects associated with their long-term use. These adverse effects include hypomagnesemia2,3, acute kidney injury (AKI)4,5, and acute tubular interstitial nephritis (ATIN)6,7,8. There is also evidence suggesting a correlation between PPI use and chronic kidney disease (CKD)9,10,11, although the precise mechanism is not well understood.

CKD is a global public health problem affecting millions of people worldwide, and it is associated with significant morbidity, mortality, and healthcare costs12,13,14,15. Previous epidemiologic studies have shown a potential link between PPI use and an increased risk of developing CKD. However, these studies have several limitations, including higher rates of comorbidities in the PPI group compared to the control group, lack of important baseline CKD-related information such as estimated glomerular filtration rate (eGFR) and concomitant medication, limited data on the duration of PPI use, and multiple medication switches during the observation period 9,10,11. Addressing these limitations is crucial to better understand the potential risks associated with long-term PPI use and inform clinical decision-making.

To address these limitations of observational studies, a distributed research network using a Common Data Model (CDM) has been developed to standardize heterogeneous data sources into a consistent format using the CDM-based vocabulary provided by the Observational Health Data Sciences and Informatics (OHDSI) organization16,17. This enables researchers to conduct clinical research using standardized, large-scale data.

This study evaluates the relationship between long-term PPI use and the risk of CKD using the National Health Insurance Service-National Sample Cohort (NHIS-NSC) and multicenter electronic health record (EHR) databases, both validated large-scale data sources that have been converted into the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) format. Specifically, we investigate the incidence of CKD in new users of acid suppression therapy, either PPIs or histamine H2 receptor antagonists (H2RAs), and compare the risk of CKD between these two groups. By addressing the limitations of previous studies and using standardized, large-scale databases, our study aims to provide a more robust understanding of the potential link between PPI use and CKD. This, in turn, may inform clinical decision-making and improve patient outcomes.

Results

Study flow and baseline characteristics

The NHIS-NSC CDM database included 1,125,700 subjects from 2002 to 2013, while the six-hospital CDM databases encompassed 10,083,608 subjects from 1999 to 2018. To balance the baseline characteristics between the PPI and H2RA groups, PSM was performed in these databases. Supplementary Table S1 provides additional information about each database and its specific study periods.

In the NHIS-NSC CDM database, 38,881 subjects were eligible for inclusion. After applying exclusion criteria and performing PSM with 12,949 covariates, 1,869 subjects were analyzed in each group. Similarly, in the six-hospital CDM databases, 68,433 subjects were initially included. After applying exclusion criteria and performing PSM, with the number of covariates ranging from 4,695 to 6,573, the final analysis was performed on 5,967 subjects in each group (Fig. 1). Most standardized mean differences were approximately 0.1 after PSM (Fig. 2). Supplementary Table S2 lists the critical covariates considered in PSM, and Supplementary Figure S1 presents the relative risks for the negative control outcomes used to assess systematic error. Supplementary Figure S2 shows the distribution of propensity scores for both study groups in each dataset, before and after matching.
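As an illustration of the balance metric referred to above, the standardized mean difference for a binary covariate can be computed as in the following minimal sketch. The function name and the example prevalences are hypothetical and are not taken from the study data; the sketch uses the common pooled-variance definition and makes no claim about how the analysis software computes it internally.

```python
import math


def smd_binary(p_target: float, p_comparator: float) -> float:
    """Standardized mean difference for a binary covariate.

    p_target and p_comparator are the covariate prevalences (0 to 1)
    in the two groups. The denominator is the pooled standard deviation.
    """
    pooled_var = (p_target * (1 - p_target)
                  + p_comparator * (1 - p_comparator)) / 2
    if pooled_var == 0:
        # Both groups are constant on this covariate; treat as balanced.
        return 0.0
    return (p_target - p_comparator) / math.sqrt(pooled_var)


# Hypothetical prevalences before and after matching.
before = smd_binary(0.30, 0.20)
after = smd_binary(0.25, 0.25)
print(f"SMD before matching: {before:.3f}, after matching: {after:.3f}")
```

An absolute value near or below 0.1 is conventionally read as adequate balance on that covariate.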

Figure 1

Diagram of cohort construction. PPI, proton pump inhibitor; H2RA, histamine 2 receptor antagonist.

Figure 2

Flowchart of study participants in the NHIS-NSC CDM database (a) and the six hospital-based CDM databases (b). NHIS-NSC CDM, National Health Insurance Service-National Sample Cohort Common Data Model; GN, glomerulonephritis; CKD, chronic kidney disease.

Before PSM, the PPI group in the NHIS-NSC CDM database had a higher Charlson comorbidity index score and a higher prevalence of gastroesophageal reflux disease, hyperlipidemia, and malignant neoplastic disease than the H2RA group. After PSM, baseline covariates were well-balanced between the two groups. Table 1 displays the baseline characteristics of the matched cohorts, including covariates with proportions exceeding 5% of the total patients after PSM. Supplementary Tables S4 to S9 present the baseline characteristics for each of the six hospital cohorts.

Table 1 Baseline characteristics of PPI and H2RA groups with ≥ 180 days of use in the NHIS-NSC CDM and six-hospital CDM databases before and after propensity score matching.

Supplementary Table S10 shows the Charlson comorbidity index scores in the NHIS-NSC CDM and six-hospital CDM databases. Overall, baseline characteristics were well-balanced between the PPI and H2RA groups in both databases after matching.

PPI use and the risk of CKD

The primary analysis in the NHIS-NSC CDM database compared the incidence of CKD 180 days after drug exposure between subjects who used PPIs for at least 180 days and those who used H2RAs for at least 180 days. During a median follow-up of 2.71 years in the PPI group and 2.68 years in the H2RA group, 29 patients in the PPI group and 38 patients in the H2RA group developed CKD. The incidence of CKD was comparable between the two groups (5.72/1000 person-years vs. 7.57/1000 person-years, respectively; HR, 0.68; 95% CI, 0.35–1.30, P = 0.26). In the secondary analysis, the incidence of CKD was also comparable between the two groups in the six-hospital CDM databases (Table 2). A meta-analysis using the results of the six-hospital CDM databases showed that PPI use was not associated with an increased risk of CKD compared to H2RA use (Fig. 3).

Table 2 Incidence rates and Cox proportional hazard ratios of chronic kidney disease comparing proton pump inhibitors and histamine-2 receptor antagonists use for ≥ 180 days: NHIS-NSC CDM and six-hospital CDM databases.

Figure 3

Covariate balance plot before and after propensity score matching in the NHIS-NSC CDM database (a) and across the six hospital-based CDM databases (b). NHIS-NSC CDM, National Health Insurance Service-National Sample Cohort Common Data Model; AUMC, Ajou University Medical Center; DCMC, Daegu Catholic Medical Center; KHMC, Kyung Hee University Medical Center; KWMC, Kangwon National University Hospital; PUNH, Pusan National University Hospital; WKUH, Wonkwang University Hospital.

Sensitivity analysis

To further assess the robustness of our results and examine the influence of PPI duration, we conducted sensitivity analyses using 1:4 propensity score matching and PPI use ≥ 365 days. The results were consistent with the primary analysis, showing no significant difference in the incidence of CKD between the PPI and H2RA groups. In particular, the analysis restricted to subjects with PPI use ≥ 365 days also found no increased risk of CKD compared to the H2RA group (Table 3).

Table 3 Sensitivity analysis of proton pump inhibitors and histamine-2 receptor antagonists groups in the NHIS-NSC CDM and six-hospital CDM databases.

Subgroup analysis

In the subgroup analysis for patients with diabetes mellitus (DM), both the NHIS-NSC CDM and six-hospital CDM databases showed no significant association between PPI use and CKD risk compared to H2RA use across different matching ratios and lag periods (Table 4).

Table 4 Subgroup analysis of chronic kidney disease risk comparing proton pump inhibitors and histamine-2 receptor antagonists use for ≥ 180 days in diabetic patients in the NHIS-NSC CDM and six-hospital CDM databases.

Discussion

This study analyzed the incidence of CKD between PPI and H2RA users using large-scale healthcare databases, including the NHIS-NSC CDM and six-hospital CDM databases. The study employed large-scale PSM to balance the baseline characteristics of the two groups, and the results showed that PPI use was not associated with an increased risk of CKD compared to H2RA use.

PPIs are known to cause AKI by disrupting kidney function through several mechanisms. PPIs can reduce magnesium levels, which can interfere with the activity of enzymes and transporters in the kidneys, potentially leading to AKI. PPIs can also affect proton transport in the kidneys, which may reduce kidney function and contribute to AKI. Furthermore, PPIs may increase the risk of ATIN, a condition that can cause AKI; several case reports have described ATIN in association with PPI use6,7,8. However, many patients with PPI-associated ATIN may not present with typical hypersensitivity reactions or undergo kidney biopsy, so ATIN may be underdiagnosed. Long-standing, unrecognized ATIN could progress to chronic tubulointerstitial nephritis and thereby contribute to CKD18, although this progression has not been directly demonstrated in clinical studies. The mechanism by which PPIs may contribute to the development of CKD therefore remains unclear. Together with the conflicting results of observational studies, this leaves the association between long-term PPI use and CKD uncertain.

In the broader medical literature, findings regarding the association between PPI use and CKD risk have been mixed. Some studies have highlighted potential risks associated with PPIs, while others found no such associations9,10,19,20. Our sensitivity analysis, presented in Table 3, suggests that patients with prolonged PPI use (≥ 365 days) might be less likely to develop CKD than those on H2RAs. However, this observation was not statistically significant, and we have not identified other studies that corroborate it. A possible explanation for this trend is unmeasured confounding. For instance, patients on long-term PPI therapy might access medical care more frequently, leading to closer overall health monitoring and potentially to early detection or prevention of CKD. Other factors, such as dietary habits, lifestyle choices, and unaccounted clinical variables, might also contribute to this observed trend.

Several epidemiologic studies have suggested that PPIs could increase the risk of CKD. However, these studies had limitations. For instance, the PPI group had higher rates of comorbidities than the control group, and important CKD-related information, such as baseline eGFR and concomitant medication, was not widely available for comparison between medication groups. Additionally, in studies comparing PPI use with H2RA use, it is unlikely that participants were well-matched for the severity of their gastrointestinal disorders, since PPIs are often prescribed as first-line therapy for more serious disorders such as Helicobacter pylori infection, gastroduodenal ulcers, and bleeding. The positive signal toward CKD progression in PPI users may therefore reflect a sicker group at baseline. Moreover, previous studies could not determine the quantity and duration of PPI prescription use, which increases the risk of confounding during group assignment. This limitation is noteworthy because it can lead to the creation of alternative definitions of study outcomes.

This study contributes to the literature examining the relationship between PPI use and CKD. It addressed the limitations of previous research by using large-scale healthcare databases and rigorous statistical methods to control for multiple confounding variables and for the duration of PPI prescription. Notably, our subgroup analysis of patients with DM consistently showed that PPIs did not elevate the risk of CKD, even among diabetic individuals. Given the size of our study and the importance of diabetes as a risk factor for CKD, this analysis offers useful insight into the safety profile of PPIs in a high-risk population. However, concerns may arise regarding the representativeness of the study population due to the use of six-hospital CDM databases, possibly leading to selection bias and limiting the generalizability of the findings. While the availability of eGFR data in the six-hospital CDM databases is a strength, eGFR data were not available in the NHIS-NSC CDM database, limiting the ability to diagnose CKD accurately in that dataset. Nevertheless, using both databases helped offset their respective weaknesses, providing a more comprehensive analysis of the association between PPI use and CKD.

Despite the strengths of this study, there are some limitations that should be acknowledged.

Firstly, while the study controlled for multiple confounding variables, some unmeasured factors could affect the association between PPI use and CKD. For example, the study did not consider lifestyle factors such as smoking or alcohol consumption, which could potentially influence the risk of CKD. Secondly, the study design was based on comparing the incidence of CKD between PPI and H2RA users, which assumes that H2RA use does not contribute to CKD risk. If H2RA use were to have an impact on CKD risk, the study design might not accurately capture the true effect of PPI use on CKD risk. This could potentially lead to an underestimation of the association between PPI use and CKD. Thirdly, our study population did not include patients with established CKD; therefore, the effect of PPI use on this specific population remains unclear. Fourthly, our reliance on data up to 2013 and 2018 might not encapsulate the most recent shifts in clinical practices, drug formulations, or patient demographics. Fifthly, there were inherent differences between the groups before matching from the six hospitals, and a significant number of patients were excluded post-matching. This exclusion might limit the generalizability of our findings to the entire cohort of PPI and H2RA users. We have taken measures to balance the groups using propensity score matching, but the results should be interpreted with this limitation in mind. Lastly, it is important to note that in Korea, there is no evidence suggesting a bias in prescribing PPIs over H2RAs for early CKD, which might make the concept of protopathic bias less relevant to our study's context.

In conclusion, this study found no significant association between PPI use and an increased risk of CKD compared to H2RA use. These findings do not support de-prescribing PPIs in patients who require continued therapy and are benefiting from it. However, there have been rare cases of ATIN associated with PPI use, potentially leading to CKD. Providers need to individualize care by weighing the benefits against the risks of ongoing medication use. Further research is needed to confirm these findings and investigate the potential mechanisms underlying the association between PPI use and CKD.

Materials and methods

Ethics statement

This study received ethical approval from the Institutional Review Board (IRB) at Kangdong Sacred Hospital, and the study was conducted in accordance with the principles of the Declaration of Helsinki. The requirement for written informed consent from study participants was waived by the IRB. The study was also conducted at six other hospitals that are affiliated with the Research Board Free Zone of the Korea CDM data network, which recognized the IRB approval of the research organizing center and did not require individual IRB approval for the study (Ref. no.2023-01-007).

Data sources

This study utilized national population-based and hospital-based cohorts in the OMOP-CDM format for analysis. The primary analysis was conducted on the NHIS-NSC database, which contains medical treatment history, insurance eligibility, health examination findings, and healthcare provider information of over one million individuals. The NHIS-NSC database is a representative, stratified random sample of 2.2% of the Korean population in 2002, and the individuals were followed for 11 years. The OMOP-CDM version of the NHIS-NSC was validated in several multinational OHDSI studies through common analytic R code21,22,23,24. The six hospital-based EHR databases were also transformed into the OMOP-CDM format and made accessible through the Federated E-health Big Data for Evidence Renovation Network in Korea (FEEDER-NET) (https://feedernet.com), supporting collaboration between OHDSI networks.

Study design and cohort definition

This study utilized a retrospective observational design to compare the incidence of CKD in new users of PPIs and new users of H2RAs. To minimize the risk of immortal time bias, only patients with at least 365 days of a continuous observational period before entering either cohort were considered eligible for the study. The index date, marking the beginning of the study, was determined as the first date a patient used PPIs or H2RAs (Fig. 1).

Patients were included in the study if they met the following criteria: (1) New users of PPIs or H2RAs for at least 180 days. New users were defined as those who did not use H2RAs within 365 days before starting PPI treatment and those who did not use PPIs within 365 days before starting H2RA treatment. (2) Aged 18 years or older. (3) Had an observation period of at least one year prior to cohort entry. (4) Had no previous history of glomerulonephritis (ICD-10 codes: N00-N08, M32.1, M32.14, M31.3, M31.31), kidney transplantation (ICD-10 codes: Z94.0, T86.1), or CKD (ICD-10 codes: N18.3-N18.5, N18.9).

The target cohort consisted of patients prescribed PPIs for a consecutive period of at least 180 days with no more than a 30-day gap between prescriptions. Medications in the target cohort included dexlansoprazole, esomeprazole, lansoprazole, omeprazole, pantoprazole, and rabeprazole. The comparative cohort comprised patients prescribed H2RAs for a consecutive period of at least 180 days with no more than a 30-day gap between prescriptions. The comparative cohort included ranitidine, nizatidine, famotidine, and cimetidine.
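The continuous-exposure rule described above (at least 180 consecutive days with no more than a 30-day gap between prescriptions) can be sketched as follows. This is a minimal illustration under stated assumptions, not the cohort logic actually executed in ATLAS: the function names are hypothetical, each prescription is represented as a (start date, days of supply) pair, and the gap is measured from the end of supply to the next start date.

```python
from datetime import date, timedelta


def build_eras(prescriptions, max_gap_days=30):
    """Merge prescriptions into continuous exposure eras.

    prescriptions: iterable of (start_date, days_supply) tuples.
    Consecutive prescriptions are merged when the gap between the end of
    one and the start of the next is at most max_gap_days.
    """
    eras = []
    for start, days_supply in sorted(prescriptions):
        end = start + timedelta(days=days_supply)
        if eras and (start - eras[-1][1]).days <= max_gap_days:
            # Extend the current era (overlapping fills count as a gap of <= 0).
            eras[-1] = (eras[-1][0], max(eras[-1][1], end))
        else:
            eras.append((start, end))
    return eras


def qualifies(prescriptions, min_days=180, max_gap_days=30):
    """True if any continuous era lasts at least min_days."""
    return any((end - start).days >= min_days
               for start, end in build_eras(prescriptions, max_gap_days))


# Hypothetical patient: seven consecutive 30-day fills (210 days in total).
fills = [(date(2010, 1, 1) + timedelta(days=30 * i), 30) for i in range(7)]
print(qualifies(fills))
```

Measuring the era from the first start date to the last end of supply is one of several reasonable conventions; a stricter definition could subtract the gap days.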

Patients were excluded from the study if they met any of the following criteria: (1) Had a diagnosis of CKD in the NHIS-NSC CDM database or an eGFR of less than 60 ml/min/1.73 m² in the six-hospital CDM databases. (2) Did not meet the minimum time at risk of 1 day.

Because eGFR data were available only in the six-hospital CDM databases and not in the NHIS-NSC CDM database, the eGFR-based exclusion (eGFR below 60 ml/min/1.73 m²) was applied only in the hospital databases. The inclusion and exclusion criteria for the study are depicted in Fig. 4.

Figure 4

Incidence Rates of New-Onset Chronic Kidney Disease for PPI and H2RA Users with ≥ 180 Days of Use: Meta-Analysis Forest Plot. PPI, proton pump inhibitor; H2RA, histamine 2 receptor antagonist. aIncidence rate per 1000 person-years.

Outcome

The outcome of this study was the occurrence of CKD, defined as an eGFR of less than 60 ml/min/1.73 m² or a diagnosis with ICD-10 codes N18.3-N18.5 or N18.9. In the NHIS-NSC CDM database, CKD was defined using ICD-10 codes, while in the six-hospital CDM databases, CKD was determined by an eGFR of less than 60 ml/min/1.73 m² on at least three occasions during the observation period. The second eGFR measurement had to be taken three months after the initial measurement of eGFR less than 60 ml/min/1.73 m². The date of CKD diagnosis was the date of the initial measurement.
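One possible reading of the eGFR-based outcome definition is sketched below. The function name is hypothetical, and the sketch makes two assumptions that the text does not settle: "three months" is taken as 90 days, and the confirmation requirement is taken to mean that at least one later low measurement falls 90 days or more after the first low measurement.

```python
from datetime import date


def ckd_onset_date(measurements, threshold=60.0, min_low=3, confirm_days=90):
    """Return the CKD onset date, or None if the definition is not met.

    measurements: iterable of (measurement_date, egfr) tuples.
    Requires at least min_low measurements below threshold, with at least
    one of them taken confirm_days or more after the first low measurement.
    The onset date is the date of the first low measurement.
    """
    low_dates = sorted(d for d, egfr in measurements if egfr < threshold)
    if len(low_dates) < min_low:
        return None
    first = low_dates[0]
    confirmed = any((d - first).days >= confirm_days for d in low_dates[1:])
    return first if confirmed else None


# Hypothetical patient with three low values, confirmed 104 days after the first.
labs = [
    (date(2015, 1, 1), 55.0),
    (date(2015, 2, 1), 75.0),
    (date(2015, 4, 15), 58.0),
    (date(2015, 8, 1), 52.0),
]
print(ckd_onset_date(labs))
```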

Statistical analysis

The study was conducted using the OHDSI CohortMethod R package (https://github.com/OHDSI/CohortMethod) and ATLAS version 2.7.6. To control for potential confounding, large-scale propensity score matching (PSM) was employed between the two groups. Covariates used in the matching process included age, sex, prior conditions and drugs observed during the long-term (within 365 days) and short-term (within 30 days) windows before study drug exposure, and the Romano adaptation of the Charlson comorbidity index25.

To address overfitting, propensity scores were estimated using logistic regression models with an L1 penalty (LASSO regularization). The L1 penalty shrinks some coefficients to zero, effectively performing feature selection and reducing the risk of overfitting. The hyperparameter for the L1 penalty was chosen using tenfold cross-validation26. Patients were matched one-to-one by greedy search with a caliper of 0.2 times the standard deviation of the propensity score distribution.
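The matching step can be illustrated with the following minimal sketch of one-to-one greedy nearest-neighbor matching within a caliper. It is not the CohortMethod implementation: the function name is hypothetical, the propensity scores are assumed to have been estimated already, the caliper is computed on the raw propensity score scale from the pooled scores, and target patients are processed in list order.

```python
import statistics


def greedy_match(target_ps, comparator_ps, caliper_sd=0.2):
    """One-to-one greedy matching on the propensity score.

    Returns a list of (target_index, comparator_index) pairs. Each
    comparator is used at most once, and a pair is formed only if the
    score difference is within caliper_sd pooled standard deviations.
    """
    caliper = caliper_sd * statistics.pstdev(list(target_ps) + list(comparator_ps))
    available = set(range(len(comparator_ps)))
    pairs = []
    for i, ps in enumerate(target_ps):
        if not available:
            break
        # Nearest unused comparator; ties are broken by the lower index.
        j = min(available, key=lambda k: (abs(comparator_ps[k] - ps), k))
        if abs(comparator_ps[j] - ps) <= caliper:
            pairs.append((i, j))
            available.remove(j)
    return pairs


# Hypothetical propensity scores.
ppi_scores = [0.30, 0.62, 0.90]
h2ra_scores = [0.31, 0.60, 0.10, 0.33]
print(greedy_match(ppi_scores, h2ra_scores))
```

In this example the third target patient has no comparator within the caliper and is left unmatched, which mirrors how matching reduces the analyzed sample relative to the eligible cohort.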

The primary analysis was conducted with 1:1 propensity score matching and a 180-day lag period. The 180-day lag period was chosen to minimize the potential for protopathic bias, wherein the initiation of the drug may be influenced by early symptoms of the outcome, in this case CKD. This lag period allows a more accurate assessment of the relationship between PPI or H2RA use and the development of CKD. Patients who switched between PPIs and H2RAs were censored at the time of switching to account for changes in treatment regimens. Cox regression was used to calculate the hazard ratio (HR) for CKD. Incidence rates were determined per 1000 person-years, and the cumulative incidence between the two groups was compared using the log-rank test. A two-sided P value of less than 0.05 was considered statistically significant. Empirical calibration of the P values was performed by fitting an empirical null distribution to the point estimates of the negative control outcomes27,28. These outcomes were assumed not to be associated with either the target or the comparative exposure, so the true relative risk between the target and comparative cohorts for each of them was assumed to be 1. A total of 87 negative control outcomes were selected; they are listed in Supplementary Table S3.
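The incidence rate calculation is simple enough to show as a short worked example. The function name is hypothetical, and the person-years used below (about 5,070 and 5,020) are back-calculated from the event counts and rates reported in the Results (29 and 38 events; 5.72 and 7.57 per 1000 person-years). They are approximations for illustration, not values reported by the study.

```python
def incidence_rate_per_1000(events: int, person_years: float) -> float:
    """Incidence rate expressed per 1000 person-years of follow-up."""
    if person_years <= 0:
        raise ValueError("person_years must be positive")
    return events / person_years * 1000


# Person-years are back-calculated approximations (see the note above).
ppi_rate = incidence_rate_per_1000(29, 5070)
h2ra_rate = incidence_rate_per_1000(38, 5020)
print(f"PPI: {ppi_rate:.2f}, H2RA: {h2ra_rate:.2f} per 1000 person-years")
```

The crude ratio of these two rates is not expected to equal the reported hazard ratio, which comes from a Cox model.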

After conducting the identical analytic process in each of the six-hospital CDM databases, the results were combined using a meta-analysis. Heterogeneity was assessed using the tau-squared (τ²) and I² statistics. When there was no significant heterogeneity between the results (P > 0.10, I² < 50%), a fixed-effects model was used to combine the results; otherwise, a random-effects model was used. Both the fixed-effects and random-effects models were reported as a sensitivity analysis. All analyses were performed using R statistical software (version 3.6.1; R Foundation for Statistical Computing).
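The pooling step can be sketched as follows. This is a minimal illustration of inverse-variance meta-analysis on the log hazard ratio scale, assuming the DerSimonian-Laird estimator for τ² (the text does not state which estimator was used). The function names and the per-site values in the usage example are hypothetical and are not taken from Table 2.

```python
import math


def se_from_ci(lower: float, upper: float, z: float = 1.96) -> float:
    """Standard error of log(HR) recovered from a 95% confidence interval."""
    return (math.log(upper) - math.log(lower)) / (2 * z)


def pool_hazard_ratios(hrs, ses):
    """Fixed- and random-effects pooling of hazard ratios.

    hrs: site-level hazard ratios; ses: standard errors of log(HR).
    Returns pooled HRs plus Cochran's Q, I-squared (%), and tau-squared.
    """
    y = [math.log(hr) for hr in hrs]
    w = [1 / se ** 2 for se in ses]
    fixed = sum(wi * yi for wi, yi in zip(w, y)) / sum(w)
    q = sum(wi * (yi - fixed) ** 2 for wi, yi in zip(w, y))
    df = len(y) - 1
    i2 = max(0.0, (q - df) / q) * 100 if q > 0 else 0.0
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0
    w_re = [1 / (se ** 2 + tau2) for se in ses]
    random = sum(wi * yi for wi, yi in zip(w_re, y)) / sum(w_re)
    return {
        "fixed_hr": math.exp(fixed),
        "random_hr": math.exp(random),
        "q": q,
        "i2": i2,
        "tau2": tau2,
    }


# Hypothetical site-level estimates (HR, 95% CI lower, 95% CI upper).
sites = [(0.80, 0.50, 1.28), (1.10, 0.70, 1.73), (0.95, 0.55, 1.64)]
result = pool_hazard_ratios(
    [hr for hr, _, _ in sites],
    [se_from_ci(lo, hi) for _, lo, hi in sites],
)
print({k: round(v, 3) for k, v in result.items()})
```

When τ² is estimated as zero, the random-effects weights reduce to the fixed-effects weights and the two pooled estimates coincide.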

Sensitivity analyses were also conducted with different propensity score matching ratios (1:1 and 1:4) and lag periods (180 and 365 days). An analysis was also performed for patients with consecutive prescription periods ≥ 365 days to investigate potential dose–response effects. Furthermore, a subgroup analysis specifically targeted patients diagnosed with DM.