Introduction

High-density lipoprotein cholesterol (HDL-C) is inversely associated with cardiovascular diseases or mortality1,2 and non-HDL lipid such as triglyceride and low density lipoprotein cholesterol (LDL-C) is positively associated with them1. A protective role of HDL-C in cancer has also been suggested, with epidemiological studies reporting an inverse association between HDL-C and subpopulations of breast3, endometrial4, lung, colorectal, and liver cancers5,6. However, some studies have found no significant association between HDL-C and several cancers such as breast and colorectal cancers5,7,8. Recent cohort studies have suggested that low HDL-C levels increased the risk of several individual cancers9,10 and a meta-analysis has shown a significant inverse association between HDL-C level and overall cancer11. The favorable impact of HDL-C on cardiovascular disease and cancer risk appears to be related to anti-atherosclerotic, anti-thrombotic, anti-inflammatory, anti-apoptotic, anti-oxidative, and immune-modulating effects12.

As mentioned above, baseline HDL-C levels are inversely associated with the risk of several cancers in cohort studies but have no significant association with some cancers. The normal HDL-C at the baseline is often maintained, but the HDL-C may decrease significantly in some persons. Low HDL-C will also be maintained in many cases, but may be improved to normal HDL-C in some people. The effects of these HDL-C changes on cancer development have not been reported. Here, we show the impact of longitudinal changes in HDL-C level on gastrointestinal cancer risk using a large database from the Korean National Health Insurance Service System (NHISS). Several cancer studies using the Korean NHISS have been published13,14. The definition of normal and low HLD-C was defined according to the Adult Treatment Panel III (ATP III)15. Furthermore, we conducted interaction analysis between HDL-C change and important cofactors in cancer risk and performed sub-group analyses to determine their associations by sex, smoking status, and age groups.

Results

Baseline demographic and laboratory findings

In 2010, 6.18 million underwent both national heath examination and any cancer screening. Cancers before 2010 were excluded and subjects who did not undergo gastric cancer screening. A total of 4.373 million underwent both national heath examination and gastric cancer screening. After the exclusion of subjects with any cancer diagnosed within 1year and those who died within 1year, 4.323 million persons were enrolled at baseline. After excluding non-participants for the national general health examination or absence of HDL-C value in 2014, 3.135 million individuals were eligible. After further excluding unknown sex type, 3,130,795 persons (1,387,648 men [44%] and mean age of 54 years) are eligible [model I] and followed up until 2021 (Fig. 1a). The baseline characteristics of the included and excluded individuals was provided in Supplementary Table 1. Low and normal HDL-C levels were defined according to the ATP III15. Baseline characteristics in the final eligible individuals by HDL-C change group were provided in Table 1. The total person year was 34,907,641 and 112,467 patients had newly detected gastrointestinal cancers up to 12 years. Among 3.131 million persons, 40696 gastric, 35,707 colorectal, 21,309 liver, 11,532 pancreatic, 4225 gallbladder, and 7051 biliary cancers were newly detected. Normal HDL-C level at the baseline was maintained in 84% (n = 2,026,087) at follow-up, and HDL-C decreased to low levels in 16% (385,774). Furthermore, low HDL-C level at baseline persisted in 55% (392,532) at follow-up, but HDL-C level increased to the normal range of HDL-C in 45% (326,402).

Fig. 1: Study overview.
figure 1

a Study flow. b Three different models according to exposure time. c Scheme of high-density lipoprotein cholesterol change category. HDL-C, high density lipoprotein cholesterol. NGHE National General Health Examination. According to the Adult Treatment Panel III (ATP III), normal HDL-C levels were defined as HDL-C ≥ 40 mg/dL in men and ≥50 mg/dL in women. Low HDL-C levels were defined as HDL-C levels of <40 mg/dL in men and <50 mg/dL in women. ∆HDL-C = [(HDL-C at follow-up) - (HDL-C at baseline)].

Table 1 Baseline Characteristics by HDL-C change

Effect of baseline HDL-C on gastrointestinal cancer risk

In the adjusted analysis, normal HDL-C at baseline was associated with a reduced risk of gastric and pancreatic cancer comparing to low HDL-C in overall population men, and women. Gallbladder-biliary cancer risk had no association with baseline HDL-C. A normal HDL-C at baseline reduced the risk of men’s liver cancer but had no association with women’s liver cancer risk. Both low (HDL-C < 30 mg; adjusted HR [aHR] = 1.69; 95% CI = 1.55–1.84) and high HDL-C (HDL-C ≥ 60 mg; aHR = 1.14; 95% CI = 1.09–1.18) was associated with an increased risk of liver cancer (Supplementary Table 2).

Impact of HDL-C change on gastrointestinal cancer risk

In the adjusted analysis for potential variables except triglyceride and LDL-C, the risk of gastric, colorectal, and liver cancers was higher in the persistently low HDL-C and normal-to-low HDL-C groups compared to the persistently normal HDL-C group and the risk of pancreatic and gallbladder cancer was higher in normal-to-low HDL-C groups compared to the persistently normal HDL-C group. In the adjusted analysis for all potential variables including triglyceride and LDL-C, these patterns persisted in most GI cancers (Table 2 and Fig. 2a). The persistently low HDL-C and normal-to-low HDL-C groups had a higher risk of gastric and biliary cancers compared to the persistently normal HDL-C group. The persistently low (aHR = 1.17; 95% CI = 1.12–1.23), normal-to-low (aHR = 1.26; 95% CI = 1.21–1.32), and low-to-normal (aHR = 1.07; 95% CI = 1.02–1.12) HDL-C groups had a higher risk of liver cancer compared to the persistently normal HDL-C group. Normal-to-low HDL-C groups had a higher risk of colorectal (aHR = 1.08; 95% CI = 1.05–1.12), pancreatic (aHR = 1.11; 95% CI = 1.05–1.17), and gallbladder (aHR = 1.12; 95% CI = 1.02–1.23) cancer compared to the persistently normal HDL-C group.

Table 2 Gastrointestinal cancer risk by HDL-C change (Model I)
Fig. 2: Adjusted hazard ratio by HDL-C change in two models.
figure 2

a Cancer risk by high density lipoprotein cholesterol (HDL-C) change after excluding any cancer developed within 1 years from first measurement of HDL-C (model I). b Cancer risk after excluding any cancer developed within 2 years from first measurement of HDL-C (model II). c Cancer risk after excluding any cancer developed up to second measurement of HDL-C (model III). X-axis is adjusted hazard ratio (aHR) and y-axis is HDL-C change group. Adjusted HRs and CIs are derived from Cox proportional regression analysis. The dashed blue lines illustrate an HR of 1. The reference is a persistently normal (Normal-to-Normal) group. The HR is indicated with the central bar and the error bars show the 95% confidence interval (CI). Red bar is statistical significant HR with CI. Low to Low (n = 392,532), Low to Normal (n = 326,402), Normal to Low (n = 385,774), Normal to Normal (n = 2,026,087) in model I. Low to Low (n = 389,731), Low to Normal (n = 324,102), Normal to Low (n = 382,820), Normal to Normal (n = 2,013,369) in model II. Low to Low (n = 380,934), Low to Normal (n = 698,472), Normal to Low (n = 373,488), Normal to Normal (n = 1,975,069) in model IIII. The exact hazard ratios and 95% confidence intervals are provided in Table 2 and Supplementary Tables 3, 4.

Sensitivity analysis

Our main results were robust to a range of sensitivity analyses for the impact of the exposure period in most cancers (Table 2, Supplementary Tables 34). The estimated effect sizes and CIs were in model II (Supplementary Table 3) similar to those in the model I, and the two graphs nearly overlapped in gastric, colorectal, liver, pancreatic, and gallbladder cancer risks (Fig. 2a, b), wherein HDL-C change had no association with biliary cancer risk in model II. In adjusted analysis after further exclusion of cancer up to 2nd measurement of HDL-C [model III], the effect of HDL-C change on gastric, colorectal, and liver cancer risks was similar with that in model I, whereas HDL-C change had no significant effect on the risk of pancreatic, gallbladder, and biliary cancer (Supplementary Table 4 and Fig. 2a, c).

Subgroup analysis by sex

Interaction analysis showed a significant interaction between sex and HDL change in the risk of gastrointestinal cancers (Supplementary Table 5). Therefore, we performed subgroup analysis by sex. In adjusted analysis, the impact of HDL-C change on individual cancer risk showed a sex discrepancy in most gastrointestinal cancers (Table 3). The persistently low HDL-C (aHR = 1.11; 95% CI = 1.06–1.16), low-to-normal (aHR = 1.07; 95% CI = 1.02–1.13), and normal-to-low HDL-C groups (aHR = 1.07; 95% CI = 1.02–1.12) showed an increased gastric cancer risk in women, wherein normal-to-low HDL-C groups increased gastric cancer risk in men. Normal-to-low HDL-C group was associated with an increased risk of colorectal and liver cancer in men, wherein persistent low HDL-C and normal-to-low HDL-C group was associated with an increased risk of colorectal and liver cancer in women. Normal-to-low HDL-C and low-to-normal HDL-C groups were associated with an increased risk of pancreatic cancer in men, wherein persistent low HDL-C and normal-to low HDL-C increased risk of pancreatic cancer in women. HDL-C change had no association with the risk of gallbladder and biliary cancer in both sexes.

Table 3 Sub group analysis by sex (adjusted analysis)

Subgroup analysis by smoking status

Interaction analysis showed a significant interaction between smoking status and HDL change in the risk of several gastrointestinal cancers. Therefore, we performed subgroup analysis by smoking status (Table 4). The persistently low and normal-to-low HDL-C groups showed increased gastric cancer risk in both never smokers and ever-smokers [past/current smokers]. The hazardous impact of persistently low HDL-C and change from normal to low HDL-C on liver cancer risk was more remarkable in ever-smokers. The persistently low HDL-C group and normal-to low HDL-C group slightly increased the risk of colorectal cancer in never smokers, whereas the normal-to-low HDL-C group increased colorectal cancer risk in ever-smokers.

Table 4 Effect of HDL-C change by smoking status (adjusted analysis)

Subgroup analysis by combination of sex and smoking

We conducted subgroup analysis using a combination of sex and smoking (Supplementary Table 6). Persistent low, low-to-normal, and normal-to low HDL-C change slightly increased the gastric cancer risk in never-smoking women and persistent low and normal-to low HDL-C change increased the gastric cancer risk in smoking men. These patterns were also observed in pancreatic cancer risk. In men, normal-to-low HDL-C group was associated with an increased risk of CRC in both never-smokers and ever-smokers, whereas in women persistent low and normal-to low HDL-C change increased the CRC risk in never-smokers.

Persistent low, low-to-normal, and normal-to low HDL-C change markedly increased the liver cancer risk in smoking men. Persistent low and normal-to low HDL-C change increased the liver cancer risk in never-smoking women and persistent low and low-to-normal HDL-C markedly increased the liver cancer risk in never-smoking women. However, the effect size (aHR) was markedly high in smokers than in never smokers in both sex.

Subgroup analysis by age groups

Interaction analysis showed a significant interaction between age groups and HDL change in the risk of gastrointestinal cancers. In adjusted analysis, the impact of HDL-C change on several gastrointestinal cancers showed an age discrepancy (Table 5). The effect of HDL-C change on the risk of gastric and gallbladder cancer was significant only in age ≥60 years but not significant in age <60 years. The effect of HDL-C change on the risk of liver and colorectal cancer was significant in both age groups. The effect of HDL-C change on the risk of pancreatic and biliary cancer was significant only in age <60 years, but not significant in age ≥60 years.

Table 5 Effect of HDL-C change by age (adjusted analysis)

Cancer risk by further categorization of HDL-C change

We assessed gastrointestinal cancer risk according to HDL-C change using ATP III and absolute HDL-C change (∆ HDL-C) to investigate further increase of HDL-C on the cancer risk among baseline normal HDL-C group (Supplementary Table 7 and Fig. 3). Among baseline normal group, an increase >15 mg/dL of HDL-C (aHR = 1.15; 95% CI = 1.08–1.21) as well as normal-to-low HDL-C group (aHR = 1.29; 95% CI = 1.24–1.35) had a higher liver cancer risk compared to the persistently normal group. Among baseline normal group, an increase >15 mg/dL of HDL-C reduced the CRC risk (aHR = 0.94; 95% CI = 1.90–0.99), whereas normal-to-low HDL-C group (aHR = 1.08; 95% CI = 1.05–1.12) had a higher CRC risk compared to the persistently normal group (Fig. 3).

Fig. 3: The effect of further categorization of HDL-C change on cancer risk.
figure 3

Adjusted hazard ratios (aHR) for gastric (a), colorectal (b), liver (c), pancreatic (d), gallbladder (e), and biliary cancers (f). ∆HDL-C = [(HDL-C at follow-up) − (HDL-C at baseline)]. X-axis is adjusted HR and y-axis is HDL-C change group. Adjusted HRs and CIs are derived from Cox proportional regression analysis. The dashed blue lines illustrate an HR of 1. The reference is a persistently normal (Normal-to-Normal) group. The HR is indicated with the central bar and the error bars show the 95% confidence interval (CI). Red bar is statistical significant HR with CI. Low to Increase (n = 125,709), Low to Low (n = 387,739), Low to Normal (n = 205,486), Normal to Increase (n = 194,998), Normal to Low (n = 385,774), and Normal to Normal (n = 1,831,089). The exact hazard ratios and 95% confidence intervals are provided in Supplementary Table 7. HDL-C high density lipoprotein cholesterol.

Effect of absolute HDL-C change on cancer risk

In unadjusted analysis, a decrement of HDL-C at follow-up increases the risk of gastric, colorectal, liver, pancreatic, gallbladder, and biliary cancers comparing to stable HDL-C group. In adjusted analysis, a decrement of HDL-C at follow-up (∆ HDL-C < −10 mg/dL) increases the risk of gastric, colorectal, liver, pancreatic, gallbladder, and biliary cancer (Supplementary Table 8 and Fig. 4). Interestingly, marked increase of HDL-C (∆ HDL-C ≥ 25 mg/dL) also associated with an increased risk of gastric (aHR = 1.07; 95% CI = 1.00–1.14) and liver cancer (aHR = 1.41; 95% CI = 1.3–1.53), whereas increase of HDL-C slightly reduced the risk of colorectal cancer risk (aHR = 0.94; 95% CI = 0.90–0.98).

Fig. 4: The effect of absolute HDL-C change on cancer risk.
figure 4

Adjusted hazard ratios (aHR) for gastric (a), colorectal (b), liver (c), pancreatic (d), gallbladder (e), and biliary cancers (f). ∆HDL-C = [(HDL-C at follow-up) − (HDL-C at baseline)]. X-axis is adjusted HR and y-axis is HDL-C change group. Adjusted HRs and CIs are derived from Cox proportional regression analysis. The dashed blue lines illustrate an HR of 1. The reference is a stable group. The HR is indicated with the central bar and the error bars show the 95% confidence interval (CI). Red bar is statistical significant HR with CI. ∆HDL < −10 (n = 561,153), ∆HDL = −10–−5 (n = 424,815), ∆HDL = −5–5 (n = 1,121,127), ∆HDL = 5–15 (n = 702993), ∆HDL = 15–25 (n = 234,723), ∆HDL ≥ 25 (n = 85,984). The exact hazard ratios and 95% confidence intervals are provided in Supplementary Table 8. HDL-C high-density lipoprotein cholesterol.

Discussion

To the best of our knowledge, this is the first study to investigate the effects of HDL-C level changes on gastrointestinal cancer development. In this large cohort study, persistently low HDL-C had a higher risk for gastric, liver, and biliary cancer compared to persistently normal HDL-C. HDL-C change from normal to low levels increased the risk of gastric, colorectal, liver, pancreatic, gallbladder, and biliary cancers. These results were robust to a range of sensitivity analyses for the impact of the exposure period in most cancers with similar estimated effect sizes. Furthermore, the effect of HDL-C change on the risk of individual gastrointestinal cancer were modified by sex and smoking status. Impact of HDL-C change on the gastric and gallbladder cancer risk was significant only in age ≥60 years, whereas the impact of HDL-C change on the pancreatic and biliary cancer risk was significant only in age <60 years. Effect of HDL-C change on the risk of liver and colorectal cancer was significant in both age groups. Decrement of absolute HDL-C level (∆HDL-C < −10 mg/dL) increased the risk of all individual digestive cancers, whereas a marked increment of HDL-C had a site-specific discrepancy.

Persistent exposure to low HDL-C and change from normal to low HDL-C increased the risk of gastric cancer and change from normal to low HDL-C increased the risk of colorectal cancer. Further increase of HDL-C level by >15 mg/dL among the baseline normal HDL-C group reduced colorectal cancer risk compared to the persistently normal group. The impact of longitudinal changes in HDL-C on gastrointestinal cancer risk has not been reported, even though an association between baseline HDL-C level and cancer risk has been reported (Supplementary Table 9). A low baseline HDL-C levels increased the risk of gastric cancer in a cohort study adjusted for H. pylori and demographic factors16 and HDL-C was inversely associated with gastric cancer among postmenopausal women17. A Finland study showed no significance between HDL-C and gastric cancer risk5. HDL-C was not associated with colorectal cancer in a UK cohort study8, whereas HDL-C was inversely associated with colorectal cancer in a US Women’s Health Study6 and in European cohort study18. This non-constant association between baseline HDL-C and cancer risk may be related to size and portion of study population and study design. However, in the current study, normal HDL-C level at the baseline was maintained in many case (84%) at follow-up, and HDL-C decreased to low levels in small portion (16%). And the effect of these two groups on gastrointestinal cancer risk was far different. Interestingly, low HDL-C persisted just in 55% at follow-up, but HDL-C level increased to the normal range of HDL-C in 45%. It seems that people who have found out that they have low HDL-C through screening are making efforts to reach normal HDL-C. And the persistent low HDL-C increased the risk of gastrointestinal cancer, but low-to-normal group did not increase the cancer risk. Therefore, baseline HDL-C-based cancer risk estimates performed in previous studies have limitations. Furthermore, our findings suggest that persons with low HDL-C at baseline need not be discouraged, and that improving the lipid profile with normal HDL-C can reduce cancer risk to a near extent to people with normal HDL-C at baseline.

The persistent exposure to low HDL-C (HR = 1.17; 95% CI = 1.12–1.23), change from normal to low HDL-C (aHR 1.26; 95% CI = 1.21–1.32), and change from low to normal HDL-C (HR = 1.07; 95% CI = 1.02–1.12) had a higher liver cancer risk compared to the persistently normal HDL-C group. HDL-C level increase by >15 mg/dL among the baseline normal HDL-C group (HR = 1.15; 95% CI = 1.08–1.21) as well as the normal to low HDL-C group (HR = 1.29; 95% CI = 1.24–1.35), increased liver cancer risk compared to the persistently normal group. Our findings suggest that very high HDL-C levels also increase the risk of liver cancer and emphasize that maintaining modest normal range of HDL-C levels is important to reduce the risk of liver cancer. The association between HDL-C level and liver cancer risk remains controversial. HDL-C levels were inversely associated with liver cancer risk in Finland5 and Swedish cohort studies10, whereas a Danish cohort study showed no association between HDL-C and liver-biliary cancer9. Our finding that maintenance of modest HDL-C level is important to minimize the liver cancer risk, may partly explain the controversial results of previous studies on the association between baseline HDL-C and liver cancer risk.

The change from normal to low HDL-C increased pancreatic and gallbladder cancer risk. Persistently low HDL-C and change from normal to low HDL-C increased the risk of biliary cancer. In analysis after excluding any cancer up to 2 years from baseline, these effects on pancreatic and gallbladder cancer risk were still significant but the effect on biliary cancer risk was not significant. Cohort studies on the association between HDL-C and these cancers have rarely been reported. Two cohort studies showed no association between pancreatic cancer and HDL-C5,9. A small case-control study suggested that low HDL-C was associated with gallbladder and bile duct cancers19.

Our main results were robust to a range of sensitivity analyses for the impact of exposure duration on most gastrointestinal cancers. The effects of HDL-C change on gastrointestinal cancers except biliary cancer were similar in model II (excluding any cancer within 2 years from baseline). The effects of HDL-C change on gastric, colorectal, and liver cancer risk was still significant but the effect on pancreatic, gallbladder, and biliary cancer risk was not significant in model III (excluding any cancer up to 2nd measurement of HDL-C).

Effect modifications by sex, smoking status, and age group were observed in subgroup analysis. Normal-to-low HDL-C change increased the risk of gastric, colorectal, liver, and pancreatic cancer in both men and women, whereas persistent low HDL-C increased the risk of those cancers only in women but not in men. In the subgroup analysis by smoking status, the hazardous effect of normal-to-low HDL-C change on gastric, colorectal, and liver cancer was significant in both never-smokers and ever-smokers, whereas persistent low HDL-C increased the risk of gastric and colorectal cancers only in never smokers and the risk of liver cancer only in ever-smokers. In subgroup analysis by combination of sex and smoking status (men-smoker, men-never smoker, women-smoker, women-never smoker), the effect of HDL-C change on women’s gastric cancer risk was significant only in never smokers and the effect of HDL-C change on men’s gastric cancer risk was significant only in smokers. These patterns were observed in pancreatic cancer. The hazardous effect of persistent low or normal-to-low HDL-C on liver cancer risk was observed in all 4 groups, but effect size was markedly high in smokers regardless of sex. The mechanism of this discrepant effect of HDL-C change according to sex and smoking status in several cancers is unknown. A plausible explanation is that the immunologic and inflammatory response to smoking further promotes a low HDL-C level–related cancer risk in current smokers. The related mechanisms need to be investigated in the future. Previous epidemiologic studies have reported sex-discrepant effects of BMI on cancer risk20. The interaction between BMI and smoking status on cancer risk has also been previously reported20,21. In a previous small study (259 cases of lung cancer in 14,547 members of atherosclerotic risk) using binary HDL-C, low HDL-C at baseline was associated with lung cancer only in past smokers (HR = 1.77; 95% CI = 1.05–2.97)22.

Interestingly, subgroup analysis by age group showed a discrepant effect on individual gastrointestinal cancers. The effect of HDL-C change on the risk of gastric and gallbladder cancer was significant only in age ≥60 years. The effect of HDL-C change on the risk of pancreatic and biliary cancer was significant only in age <60 years. The effect of HDL-C change on the risk of liver and colorectal cancer risk was significant in both age groups. The mechanism of site specific age discrepancy in the association between HDL-C change and digestive cancer risk is unknown. A different peak onset ages and site-specific biologic response to low HDL-C may partially contribute to this site specific age discrepancy in the association between HDL-C changes and gastrointestinal cancer risk.

We also investigated the effect of absolute HDL-C change on cancer risk. A decrement of absolute HDL-C level (∆HDL-C < −10 mg/dL) increased the risk of gastric, colorectal, pancreatic, gallbladder, and biliary cancers comparing to stable HDL-C group. An increment of HDL-C from baseline (∆ HDL-C ≥ 25 mg/dL) was also associated with an increased risk of gastric (7%) and liver cancer (41%), whereas an increment of HDL-C (∆ HDL-C = 5–25 mg/dL) slightly reduced the risk of colorectal cancer (6%). These results emphasize the importance of avoiding a decrement of HDL-C to prevent all digestive cancers and encourage an increment of HDL-C to prevent colorectal cancer.

The favorable effect of HDL-C on cancer appears to be related to its anti-inflammatory and anti-oxidative properties12,23. LDL-C in the tumor microenvironment can promote pro-oxidant and pro-inflammatory activities, whereas cholesterol and its metabolites can be removed from cancer cells by HDL-C24. HDL-associated apolipoproteins and ATP-binding cassette transporters might have anti-tumorigenic effects25,26. HDL-C also has anti-apoptotic effects27. This anti-apoptotic effect of HDL-C may partially explain the increased risk of liver cancer in the change from the normal to the high HDL-C group. Liver is a complex functional organ related to lipid metabolism and is one of organs of estrogen action28. Functional investigation for the link of HDL-C and liver cancer may clarify our epidemiologic findings. However, a previous study suggests that high cholesterol consumption by malignant cells to promote tumor growth might deplete plasma cholesterol and HDL-C levels29. Therefore, low HDL-C might be the epiphenomenon of cancer-related inflammation30,31. To avoid this issue, we enrolled a cancer-screened population and excluded patients diagnosed with any cancer before baseline and at baseline and excluded persons who diagnosed with 1 years and 2 years after baseline enrollment (Model I and Model II). And we further excluded those who were diagnosed with any cancer until the second measurement of HDL-C (model III).

Associations between gastrointestinal cancer risk and other lipids such as total cholesterol, LDL-C, triglyceride have also been studied5,6,8,10. A cohort study from Finland suggested that HDL-C was inversely associated with the risk of several site cancers but total cholesterol had no association with cancer risk during long term follow-up5. A US women cohort study showed that HDL-C was inversely associated with colorectal and lung cancer but LDL-C was not significantly associated with risk of total cancer or any site-specific cancers6. LDL and triglycerides are affected not only by long-term exercise, diet, and obesity, but also by short-term lipid-lowering drugs in case of LDL-C32, and by recent diet and alcohol consumption in case of triglycerides. On the other hand, HDL-C is influenced by long term exercise and diet pattern rather than short-term diet or lipid lowering drugs. This relative stability of HDL-C may be used as a marker for cancer risk assessment, so we focused on HDL-C. Actually our previous studies showed no association between gastric cancer and other lipids (total cholesterol, triglyceride, and LDL)16,33. The impact of HDL-C change on most gastrointestinal cancer in the adjusted analysis that did not consider LDL and TG was similar to the results in the adjusted analysis that included LDL-C and TG. However, the effect of HDL-C change on biliary tract cancer risk was significant in adjusted analyzes including LDL-C and TG, but not significant in adjusted analyzes excluding LDL-C and TG. These results suggested that HDL-C change affect gastrointestinal cancer risk regardless of other lipid level.

This study has considerable strengths. First, this is the first study to investigate the association between HDL-C changes and cancer risk of many gastrointestinal sites. A large database allowed us to investigate the effects of HDL-C changes on individual cancer risks. Furthermore, our results were robust to a range of sensitivity analyses for the impact of exposure duration on most cancers. Second, we showed a sex discrepancy in the association between HDL-C level change and individual digestive cancer risk. Furthermore, the hazardous effect of HDL-C changes on individual cancers differed according to smoking status. Subgroup analysis by combination of sex and smoking status clarified their association in the effect of HDL-C on cancer risk. Third, we also showed effect modification by age group in the impact of HDL-C change on digestive cancer risk. Fourth, we also explored that a decrement of absolute HDL-C level (∆HDL-C < −10 mg/dL) increased the risk of individual gastrointestinal cancers but marked increase of HDL-C had a site specific discrepant effect. Fifth, we used high-quality data from the NHISS, including direct measurement of lipids and BMI, detailed information on many covariates, and nearly complete sensing of cancer detection. Many previous studies have used data from the NHISS and validated their quality13,14. Finally, this national health examination cohort represents the general Korean population aged ≥40 years.

Nevertheless, this study has several limitations. The median follow-up period was 8.4 years. Although this duration may be relatively short to investigate cancer risk, the large cohort size (23,389,010 person years and enough number of events) may have partially compensated for this limitation. Second, we selected individuals who underwent two consecutive general health examinations, and follow-up eligible individuals comprised 63% of the baseline eligible individuals. Finally, no external validation was performed in this study. However, this study will promote similar research in other countries.

In conclusion, persistent exposure to low HDL-C and HDL-C level changes from normal to low levels increased overall and most digestive cancer risks compared to persistently normal HDL-C. These effects of HDL-C changes were robust to a range of sensitivity analyses. The risk of individual digestive cancer according to HDL-C change differs on the level of sex and smoking status. This study emphasizes the importance of maintaining normal HDL-C levels to prevent cancer development, especially in ever-smokers.

Methods

Data extraction and contents of data

This study was approved by the Institutional Review Board of Kyungpook National University Hospital, Chilgok (KNUHC 2017-12-022). In this large population-based cohort study, we used data from the Korean NHISS (REQ202204452-004), which cover approximately 98% of the Korean population aged ≥40 years every 2 years. We gathered the Base One Foundation Component Library (BFC), T20, the National General Health Examination (NGHE), and national cancer screening from NHISS.

We extracted sex and economic status from BFC; the disease codes of the International Classification of Disease, 10th revision (ICD 10) from T20 disease code; and demographic and laboratory data from the NGHE. Sex in this study is a biological sex registered in NHISS. The questionnaires included chronic diseases such as diabetes mellitus, hypertension, cerebrovascular disease, and ischemic heart disease, medication (anti-hypertensive drug, ant-diabetes drug, drug for heart disease, antiplatelet or anticoagulant drug, lipid lowering drug, etc), smoking status, alcohol consumption frequency, physical activity (frequency per week), and family history. Questionnaires also included liver-related variables (any liver disease history, chronic hepatitis B, chronic hepatitis C, and liver cirrhosis). Laboratory data included body mass index (BMI, weight/height2 [kg/m2]), blood pressure, lipid and fasting glucose levels. Weight and height were directly measured on the day of the health examination and blood samples were collected after 12 h of fasting. Fasting serum glucose and lipid levels were measured in units of milligram/deciliter (mg/dL) using fresh serum in each screening center. Korean national general health screening have provided glucose and lipids examination to all adults over than 40 years every 2 years during study periods. Therefore, missing rate of glucose and lipid is extremely low in the study period. Age was defined as the age at the time of national general health examination. Economic status was extracted as household income twentile. Smoking status was categorized with never, past, or current smoker. Alcohol consumption frequency was classified as none, 1/week, 2–3/week, 4–5/week, and ≥6/week. Moderate physical activity refers to “walking or exercising and feeling mild dyspnea for more than 30 min per day.”

Handling of missing data

  1. 1.

    Missing data for hypertension and diabetes mellitus in self-reported questionnaires was replaced using medication, blood pressure, and fasting glucose extracted from laboratory data in general health examination. Among missing data of hypertension or diabetes mellitus, use of antihypertensive or anti-diabetes drug was defined as presence of hypertension or diabetes. Among missing data of hypertension, systolic blood pressure ≥140 mmHg or diastolic blood pressure ≥90 mmHg was defined as hypertension. Among missing data of diabetes mellitus, fasting glucose ≥126 mg/dL was defined as presence of diabetes.

  2. 2.

    Missing data for ischemic heart disease (angina or myocardial infarction) and cerebrovascular disease was replaced using T20 disease code within 1 year.

    T20 disease code includes all ICD10 code registered whenever patients visit any medical institute. Ischemic heart disease includes I20–I25. Cerebrovascular disease includes I60–I69.

  3. 3.

    Other categorical data: Missing data was put as blank or unknown.

  4. 4.

    Continuous variables such as age, body mass index, and lipid: Missing data was put as blank. After reasonable correction of missing data, we handled the missing data based on the missing rate. If variables have low missing rate (<2%) among the final eligible population, we deleted missing data list-wise in the adjusted analysis. If variables have considerable missing rate, we set the missing data as unknown group in adjusted analysis.

Baseline enrollment and follow-up

From 2009 to 2017, national general health examination (NGHE) provided measurement of HDL-C to all Korean persons aged 40 years or older every two years. Among 6.18 million who underwent both NGHE and any cancer screening from January to December 2010, patients with any pre-existing cancer and persons who did not undergo gastric cancer screening were excluded. Among cancer free individuals who underwent both NGHE and National gastric cancer screening from January to December 2010 (4.373 million), we excluded persons with any C-codes or death within 12 months from index month (the month of health examination) (n = 49,991) (Fig. 1a). Death data from the National Statistical Office were also provided by the National Health Insurance Service. Nearly complete sensing of new cancer is possible through extracting C code from NHISS. In order to receive a special exemption for cancer, the cancer code should be registered with diagnostic evidence of cancer such as pathologic results at hospital. If any cancer code is registered, the patients pay just 5% for cancer related medical services. Therefore, cancer (C) code extracted from NHISS is highly reliable. Individuals who did not undergo NGHE in 2014 or absence of HDL-C values in 2014 (n = 1,188,176) and unknown sex type (n = 3991) were excluded [model I; n = 3,130,795, Fig. 1b]. All cancer codes (C codes) were extracted up to December 2021. Common gastrointestinal cancers included gastric (C16), liver (C22), colorectal (C18, C19, C20), pancreatic (C25), gallbladder (C23), and biliary (C24) cancers. Nearly complete sensing of new cancer is possible through extracting C code from NHISS. The NHISS provided raw data after eliminating personal identification information and this study did not affect the disease course. Therefore informed consent is waived in analysis using NHISS data under NHISS regulation.

Statistics and reproducibility

1) Determination of sample size

We did not estimate the sample size because the number of samples which could be used was fixed. However, we have provided calculations for the primary analysis with a specified power.

Framework

Persistent low HDL will increase the risk of individual cancer risk comparing to persistent normal HDL group. Change from normal HDL-C to low HDL-C will increase the risk of individual cancer risk comparing to persistent normal HDL group. We put Alpha error as 0.05 and Power as 80%.

Minimum sample size to evaluate the risk of colorectal cancer

In two independent groups, we put the colorectal cancer incidence of Group 1 and Group 2 is 0.6% and 0.5, respectively. The sample size is 171,724. In this study, HDL-C was categorized as four different groups. Therefore minimal sample size = 171,724 × 2 = 343,448.

2) HDL-C change group

HDL-C levels were extracted at baseline (2010) and follow-up (2014). Changes in HDL-C levels were classified into four groups: persistent normal (normal-to-normal), change from normal to low (normal-to-low), change from low to normal (low-to-normal), and persistent low (low-to-low) (Fig. 1c). According to the Adult Treatment Panel III (ATP III)15, normal HDL-C levels were defined as HDL-C ≥ 40 mg/dL in men and ≥50 mg/dL in women. Low HDL-C levels were defined as HDL-C < 40 mg/dL in men and <50 mg/dL in women. We also classified HDL-C change using continuous value. Change of HDL-C [∆HDL-C = (HDL-C at follow-up) − (HDL-C baseline)] were classified as decrease (∆HDL-C < −10 mg/dL, ∆HDL-C = −5–−10 mg/dL), stable (∆HDL-C = −5–5 mg/dL), increase (∆HDL-C = 5–15 mg/dL, ∆HDL-C = 15 ~ 25 mg/dL, and ∆HDL-C ≥ 25 mg/dL). We further categorized HDL-C change using ATP III and absolute HDL-C change [∆HDL-C = (HDL-C at follow-up) − (HDL-C baseline)] to investigate which further increment of HDL-C among baseline normal HDL-C can affect the risk of gastrointestinal cancer (Fig. 1c). Persistent normal group was further classified into persistent normal and normal-increase (baseline normal and increase ≥15 mg/dL at follow-up from baseline) groups. Baseline low group was categorized into persistent low, low-to- normal, and low-to-increase.

3) Statistical analysis

Statistics for the variables are presented as numbers (percentages) for categorical variables and as means (standard deviations) or medians (interquartile ranges) for continuous variables. First, we investigated the risk of gastrointestinal cancer according to baseline HDL-C. We calculated person-years from the date of baseline national general health examination to the following censoring events, whichever occurred first: occurrence of gastrointestinal cancer, death, or end of the study (December 31, 2021).

Cancer risk according to HDL-C change was measured with hazard ratios (HRs) and 95% confidence intervals (CIs) using Cox proportional regression analysis. The persistently normal group was set as the reference group. The association between covariates and cancer risk was assessed using the Cox regression analysis. We performed a multivariate analysis adjusted for significant confounders, such as age, sex, BMI, smoking status, alcohol intake, economic status, use of lipid lowering drugs, hypertension, diabetes, cerebrovascular disease, heart disease, low density lipoprotein cholesterol (LDL-C), triglyceride, and physical activity. Liver cancer analysis was additionally adjusted for liver factors such as liver disease, chronic hepatitis B, chronic hepatitis C, and liver cirrhosis. Among the final eligible population, the highest missing rate was for economic status (1.7%). Therefore, we deleted missing data list-wise in the adjusted analysis.

To conduct sensitivity analysis for the impact of exposure duration, we excluded cancers that developed within 2 year from the first measurement of HDL-C level and then analyzed the impact of HDL-C change on each cancer risk (model II). And persons who had been diagnosed with cancer up to second measurement of HDL-C (2014) were further excluded (model III) (Fig. 1b). We conducted an interaction analysis (joint test) between the well-known important cofactors (sex and smoking status) and HDL-C level changes in the cancer risk. And then, a subgroup analysis was performed based on sex and smoking status. All analyses were performed using the SAS software (version 9.4; SAS Institute, Cary, NC, USA). All statistical tests were two-sided, and values of P < 0.05 were considered statistically significant.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.