Introduction

Antiplatelet medications are currently the core treatment for atherosclerotic cardiovascular disease (ASCVD) prevention1, but they carry an increased risk of gastrointestinal bleeding. Studies have shown a 1.8-fold increased risk of gastrointestinal bleeding during low-dose aspirin (LDA) treatment and a 7.4-fold increased risk during dual antiplatelet therapy2. Moreover, epidemiological data have demonstrated that approximately 15–20% of patients receiving antiplatelet therapy3,4 have preexisting peptic ulcers (PUs), and a history of PUs can increase the risk of gastrointestinal bleeding by more than 6-fold5. However, it is rare for people to undergo PU risk screening before taking antiplatelet drugs. In most cases, people need to undergo physical examinations to determine the necessity of taking antiplatelet drugs. Identifying a simple tool that could screen for PU risk during routine clinical evaluation would be valuable for preventing gastrointestinal bleeding after antiplatelet therapy.

Currently, endoscopy is the most reliable method for screening PUs. However, its limited tolerance, high cost, and incompatibility with dynamic observation render it unsuitable for routine clinical evaluation, and it cannot be widely promoted as a screening tool for PUs in clinical practice. Studies have shown that the main mechanism of gastrointestinal injury induced by antiplatelet medications is local mucosal damage and the inhibition of mucosal repair6. In recent years, many studies have identified biomarkers that can predict PUs, such as platelet count, white blood cell (WBC) count, pepsinogen I (PGI), and gastrin 17 (G-17)7,8,9. However, most related studies have focused on specific clinical and hematological markers, resulting in limited value in identifying PUs using a single marker. Moreover, comparisons between the markers are lacking, and some of the markers reflect the same underlying mechanism of PU occurrence. Therefore, it may be more rational to endeavor to extract and integrate these markers for PU screening.

Nomograms, a pictorial representation of a complex mathematical formula (Grimes, 2008), reflect various factors simultaneously to help patients visualize their probability of developing a disease10. Several prediction nomogram models have been developed to effectively predict bleeding in PUs11,12, but studies focusing on screening for PUs in the physically examined population are rare. Therefore, this study was designed to develop and validate a nomogram for individually predicting PUs in the physically examined population, providing a reliable clinical tool for the early identification of PUs. This study provides insights into safe personalized treatment for potential recipients of antiplatelet therapy.

Methods

Study design and study population

From April 2019 to May 2022, data were prospectively and continuously collected from participants who underwent upper gastrointestinal mucosal assessment as part of their physical examination at our hospital. The inclusion criteria for patients were as follows: (i) ≥ 18 years of age, (ii) no severe organ failure, and (iii) no indication of hospitalization for gastrointestinal symptoms. The exclusion criteria for patients were as follows: (i) had specific gastrointestinal diseases, including acute upper gastrointestinal bleeding and esophageal malignant tumors; (ii) exhibited intolerance during endoscopy; (iii) poor image quality preventing proper assessment; (iv) had a history of upper gastrointestinal surgery and an upper gastrointestinal mucosal assessment process that was prospectively formulated; and (v) no documented history of proton-pump inhibitor (PPI) usage or cessation of such medications for a duration exceeding one month. The process involved collecting blood samples for PG, gastrin 17 (G-17) and Helicobacter pylori (HP) antibody analysis and gastroscopic assessment of gastrointestinal mucosal morphology.

Patient involvement

Participants and their families were not involved in setting the research question or the outcome measures, but they were intimately involved in the design and implementation of the research. Participants and their families were also central to dissemination of the baseline information, thereby helping the study to proceed.

Ethics statement

This retrospective study was approved by the human ethics committee of Huzhou Nanxun District People's Hospital (No. 2017N016). All subjects provided written informed consent prior to the study, and the protocols were approved by the local ethics committee. All clinical investigations were conducted according to the principles expressed in the Declaration of Helsinki.

Clinical data extraction

The following data were extracted for each patient: age, sex, past medical history (hypertension, diabetes, and stroke), medication history (antiplatelet drugs, anticoagulants, and nonsteroidal anti-inflammatory drugs (except for aspirin), atherosclerotic cardiovascular disease (ASCVD) risk, and laboratory test results. All laboratory test results were acquired within 24 h before endoscopic examination (routine blood test results, liver function, coagulation function, electrolytes, blood lipids, PGI, PGII, G-17, HP antibody). The China-PAR13 model, which is specifically tailored for the Chinese population, was utilized to assess the risk of ASCVD. High-risk ASCVD was defined as having a 10-year ASCVD risk ≥ 10% or a lifetime risk (the cumulative probability of developing cardiovascular and cerebrovascular diseases from the current age until reaching 85 years) ≥ 32.8%. The ASCVD risk was evaluated through the following link: https://www.cvdrisk.com.cn/ASCVD/Eval.

Endoscopy procedure for mucosal assessment

The upper gastrointestinal mucosal assessment process was prospectively formulated. The physically examined patients underwent endoscopy examination after one night of fasting. Four endoscopists participated in this study and were responsible for reviewing the images to determine the presence of PUs (at least 3 years of experience in upper gastrointestinal endoscopy with more than 3000 examinations). All the examinations were carried out according to strict protocols. A total of 40 films of the whole stomach and 8 films of the whole duodenum were taken via endoscopy. Blood samples were collected for PG, gastrin 17 (G-17) and Helicobacter pylori (HP) antibody analysis, and gastroscopic assessment of gastrointestinal mucosal morphology was performed.

The assessment of PU

The upper gastrointestinal mucosal damage was assessed by the modified Lanza score (MLS)14. A score of 5 indicated that the diameter of the mucosal defect was > 5 mm, which was defined as a gastric ulcer (GU). Duodenal ulcers (DUs) were defined as mucosal defects > 5 mm in diameter. PUs were defined as GUs (MLS = 5 points) and/or DUs.

Sample size

According to the formula presented by van Smeden et al.15 (the formula is available at https://mvansmeden.shinyapps.io/BeyondEPV/), we set the mean absolute prediction error (MAPE) to be no larger than 0.050, the anticipated occurrence rate of PU to be 0.2 and at most 8 candidate predictor parameters of the final model. Thus, at least 600 participants were included in the development cohort (Fig. 1). We set the sample ratio between the development and validation sets at 2:1. Therefore, a total of at least 900 participants were enrolled in this analysis.

Figure 1
figure 1

Flow chart of the study process. LASSO, the least absolute shrinkage and selection operator; ROC, receiver operating characteristic; DCA, decision curve analysis.

Model-building procedure and temporal validation

Among the enrolled patients, patients assessed between April 2019 and April 2021 were classified in the development cohort, and those assessed between May 2021 and May 2022 were classified in the validation cohort.

For the development cohort, least absolute shrinkage and selection operator (LASSO) regression was used to screen for risk factors, and factors with P < 0.05 were subsequently included in multivariate logistic regression analysis to determine the predictive factors of PU. Thus, a nomogram and receiver operating characteristic (ROC) curves were established based on the identified predictive factors. Discrimination and calibration of the established nomogram were determined by the area under the curve (AUC) and calibration curve, respectively. Decision curve analysis (DCA) was also conducted to assess the clinical practicability of the nomogram. Finally, temporal validation of the nomogram was performed in the validation set, and the area under the curve (AUC), sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) were calculated. Given that antiplatelet medication was always prescribed in the high-risk ASCVD population, the established nomogram was further validated in the population with high-risk ASCVD in the validation set.

Statistical analysis

The quantitative variables in this study are presented as the means ± SDs or interquartile ranges, and the categorical variables are presented as proportions. Continuous variables were evaluated by t tests, categorical variables by Chi-square tests, and rank variables by independent U tests. All the statistical analyses were performed using SPSS version 26.0 (IBM, Armonk, New York) and R software, version 4.2.2, as well as MSTATA software (www.mstata.com).

Results

Participants

The flow chart is shown in Fig. 1. A total of 902 participants were included in this study, 12 of whom were excluded (2 due to acute upper gastrointestinal hemorrhage, 2 due to esophageal malignant tumors, and 8 due to inability to cooperate, resulting in incomplete imaging data). The age of the participants ranged from 18 to 87 years, with an average age of 53.8 ± 12.5 years. Among the participants, 468 (51.9%) individuals were male, 282 (31.3%) had hypertension, 56 (6.2%) had diabetes, 9 (1.0%) had a history of stroke, and 222 (24.6%) had high-risk ASCVD. According to the endoscopy results, there were 204 (22.6%) patients with peptic ulcers, including 89 (9.9%) with GU, 133 (14.7%) with DU, and 18 (2.0%) with both GU and DU. The baseline characteristics of all 902 participants are described in Table 1.

Table 1 Baseline characteristics of all subjects.

The development and validation sets were composed of 631 and 271 subjects, respectively, determined through time-based assignment. All the baseline characteristic data in the two sets are given in Table 2. A comparison of the baseline data showed that there was no significant difference between the development set and the validation set.

Table 2 Comparison of development and validation sets.

Predictor selection and identification in the development set

Twenty-five characteristics measured at baseline (Table 1) were included in the LASSO regression analysis. After LASSO regression was performed (Supplementary Figure I), the following seven characteristics were found to be significant predictors of PU: male sex, white blood cell (WBC) count, red blood cell (RBC) count, globulin (GLB), albumin (ALB), PGI and HP antibody positivity. As shown in Supplementary Figure II, the ROC analysis of the abovementioned variables yielded AUC values greater than 0.5 for male sex (AUC = 0.631), WBC (AUC = 0.654), RBC (AUC = 0.667), GLB (AUC = 0.694), ALB (AUC = 0.703), PGI (AUC = 0.759), and HP antibody positivity (AUC = 0.550). Further multivariate logistic regression analysis revealed that male sex, antiplatelet agent use, WBC count, RBC count, GLB, PGI and HP antibody positivity were independently associated with the presence of PUs (all P < 0.05; see Supplementary Table I).

Development of the nomogram for PU prediction

The risk of PUs was calculated using a multivariate logistic regression equation: ln(P/1−P) = 3.92103 + 0.85315*(male) + 0.14551*(WBC) −0.72291*(RBC) −0.07839*(GLB) −0.06550*(ALB) + 0.00788*(PGI) + 0.91716*(HP antibody-positive). In this equation, P represents the probability of a PU.

A predictive nomogram for PUs [hereafter referred to as the HAMPROW (HP antibody, Albumin, Male, PGI, RBC, globulin, WBC)] (Fig. 2) was developed based on the results of multivariate logistic regression, which is illustrated in Fig. 2A; the nomogram is available online (https://worldn.shinyapps.io/dynnomapp/) and presented in Fig. 2B. Factors were assigned points according to the absolute maximum beta value from the logistic regression model. Due to the differences in units, points were assigned differently for continuous predictors (RBC, WBC, ALB, GLB, and PGI) and categorical predictors (male, HP antibody-positive). The length of the line on the nomogram reflects the impact of these factors on PU, with a longer line indicating a greater impact. By summing all the points, a total score can be obtained, which helps determine the risk of PU by locating it on the HAMPROW score during subject presentation.

Figure 2
figure 2

The nomogram HAMPROW score. (A) To use the nomogram, draw an upward vertical line from each covariate to the points bar to calculate the number of points. Based on the sum of the covariate points, draw a downward vertical line from the total points line to calculate the probability of developing PU. (B) Online dynamic nomogram accessible athttps://worldn.shinyapps.io/dynnomapp/. GLB, globulin; ALB, albumin; PG I, pepsinogen I; HP, helicobacter pylori.

ROC curve analysis revealed that the predictive value of the HAMPROW score for PU in the development cohort was good, with an AUC of 0.854 (95% CI = 0.816–0.891) (Fig. 3A). The cutoff score of the HAMPROW was 146, with a sensitivity of 76.9% and specificity of 83% (Youden index = 0.599).

Figure 3
figure 3

Evaluation of validity, reliability, and decision curve of HAMPROW score. (A) ROC curve for the HAMPROW score based on the development set. The bias-corrected AUC is 0.854. (B) ROC curve from the validation set and the AUC is 0.833. (C) Calibration curves of the HAMPROW score for the development set. (D) Calibration curves of the HAMPROW score for the validation set (E) Decision curve analysis of the HAMPROW score of the development set. (F) Decision curve analysis of the HAMPROW score of the validation set. AUC, area under the curve; ROC, receiver operating characteristic.

Validation of the HAMPROW score

We further tested the predictive value of the HAMPROW score in the validation cohort (n = 271). According to the ROC curve analysis, the area under the curve (AUC) was 0.833 (95% CI = 0.771–0.895) (Fig. 3B), which was significantly greater than that of any factor in the HAMPROW (all P < 0.05) (Supplementary Figure III). Moreover, the area under the curve (AUC) of the HAMPROW score was not significantly different between the development and validation sets (P > 0.05). Additionally, calibration curves and decision curve analysis (DCA) were also plotted to show the clinical utility of the HAMPROW score (Fig. 3C–F). The nomogram exhibited a net benefit across a wide range of threshold probabilities for predicting PUs, regardless of the development set or the validation set. A cutoff value of 146 for the HAMPROW score had a sensitivity of 74.5% and specificity of 79.3%. The PPV was 46.9%, and the NPV was 92.7%.

In the population with high-risk ASCVD in the validation set (n = 63), the AUC of the HAMPROW score was 0.840 (95% CI = 0.723–0.957). A cutoff value of 146 for the HAMPROW score had a sensitivity of 82.4% and specificity of 71.1%. The PPV was 51.9%, and the NPV was 88.9% in this subgroup.

Discussion

In this study, we constructed a HAMPROW score for predicting PUs using seven indicators: HP antibody concentration, albumin concentration, male sex, PGI, RBC, globulin concentration, and WBC count. Our novel nomogram (HAMPROW) demonstrated excellent predictive performance in both the development and validation sets, providing a convenient and reliable method for early screening of the PU risk population.

Of the seven variables associated with the HAMPROW, consistent with previous findings, we proved that male sex16, a history of HP infection17, and RBC18 were key factors associated with the presence of PUs. In addition, this study further identified four other factors from blood tests that are involved in the occurrence of PU: excessive secretion of gastric acid, decreased repair ability of the gastrointestinal mucosa (i.e., PGI) and impaired gastrointestinal mucosal barrier function (i.e., WBC, ALB and GLB).

The serum PGI, initially discovered by Samloff IM19, has been found to accurately indicate gastric acid levels and serve as a reliable biomarker for the noninvasive assessment of gastric mucosal secretion7,20,21. Patients with PU exhibit elevated PGI levels22,23, while those with gastric atrophy exhibit significantly reduced PGI levels24,25, indicating its potential as a biomarker for both secretory function and the state of the gastric mucosa. Gastric acid can inflict direct damage on the upper gastrointestinal mucosa through chemical stimulation and induce secondary harm by disrupting mucosal defense and repair mechanisms26,27. Increased gastric acid secretion plays an important role in the development of upper gastrointestinal mucosal damage. Notably, antiplatelet medications can also increase gastric acid secretion, resulting in exacerbation of gastrointestinal mucosal damage. Studies have confirmed that increased gastric acid secretion following antiplatelet therapy is a crucial risk factor for upper gastrointestinal bleeding 28,29.

Our study also highlights the important screening potential of WBC, serum ALB, and GLB levels for detecting the presence of PUs. Although all three factors reflect compromised gastrointestinal mucosal barrier function, they were not mutually adjusted for in the multivariate analysis, suggesting that they play independent roles in promoting PU development. WBCs commonly serve as markers for assessing inflammatory status or blood concentration. Elevated WBC counts may indicate inflammation or immune response activation, potentially leading to mucosal barrier damage and facilitating PU development. GLB actively contributes to immune function and antibody production and is vital for maintaining a healthy immune system. Changes in GLB levels during PU development signify shifts in immune function, mirroring mucosal barrier changes. Conversely, the serum ALB concentration reflects patient nutritional status. Previous research has indicated that prolonged mucosal injury in the upper gastrointestinal tract leads to impaired nutrient absorption and decreased albumin levels30. Furthermore, low serum ALB concentrations have been identified as a predictor of unfavorable treatment outcomes in patients with PU, indicating heightened severity of gastric mucosal barrier damage among these individuals31.

Notably, NSAIDs are not represented in the HAMPROW score. Although previous studies have identified NSAIDs as potential risk factors for PU, our study, through both univariate and multivariate analyses, revealed no associations between PU and either antiplatelet drug use or non-aspirin NSAIDs. The primary contributing factor to this disparity is the relatively low prevalence of non-aspirin NSAID use among healthy individuals, with only 2.4% utilizing antiplatelet drugs (development set: 2.4%, validation set: 2.6%) and non-aspirin NSAID usage accounting for 1.7% (development set: 1.6%, validation set: 1.8%) of the overall population. Prior studies that emphasized NSAIDs as a risk factor focused on populations with existing PUs, potentially resulting in differences in inclusion criteria.

Although each variable can be used to predict PU, the HAMPROW score generated from these seven variables demonstrated superior effectiveness in predicting the occurrence of PU compared to that of individual variables alone. In our current study, the PPV of the HAMPROW was low (46.9%), implying that more than half of the individuals with a HAMPROW score higher than 146 are likely to have false-positives. This highlights the necessity for further screening to confirm the presence of PUs. The diminished PPV may be attributed to the relatively low incidence of PUs in the physical population (approximately 22.6%). However, the NPV of the score was high (92.7%). A HAMPROW score less than 146 signifies an extremely low risk of PU occurrence, indicating that antiplatelet therapy is safe for such patients. Additionally, the HAMPROW score showed a similar predictive ability in the ASCVD subgroup population as in the overall validation cohort (AUC validation set vs. AUC high-risk ASCVD population: 0.833 vs. 0.840). This result suggests that the HAMPROW score can be applied not only to the general population undergoing physical examination but also to those requiring antiplatelet therapy due to the high risk of ASCVD.

The strengths of our model, which support the validity of our findings, include the simplicity of a statistical model comprising a limited number of clinically relevant predictors and the consistency of the results with those of all considered studies. However, there are still some limitations in our study. First, the data used in this study were obtained from a single center in China, possibly limiting the generalizability of the nomogram to patients worldwide. Further studies using data from multiple centers are needed to validate our model externally. Second, additional research is needed to explore the clinical application of the identified biomarkers. It is important to investigate whether applying our predictive nomogram for risk stratification in patients receiving antiplatelet therapy with gastric mucosal protection can effectively reduce the incidence of gastrointestinal bleeding and adverse reactions. Third, some items related to the occurrence of PUs, such as alcohol consumption history, Helicobacter pylori eradication history, and gastrointestinal symptoms, were not recorded. This may have resulted in the omission of some PU-related risk factors. Future studies should address these issues to refine the selection of variables for collection.

In conclusion, we developed a quantifiable, user-friendly, and reliable predictive nomogram, the HAMPROW score, for PU screening in a population subjected to physical examination. Future research will focus on confirming the value of the HAMPROW score in guiding antiplatelet therapy through prospective validation methods.