Introduction

Hypertension is a common non-communicable disease and the primary risk factor for cardiovascular and cerebrovascular diseases. It not only causes high disability and mortality but also imposes a heavy burden on patients, families and society1,2. According to the data released by the World Health Organization, high blood pressure can significantly increase the risk of heart, brain, kidney disease and other diseases, and it is the primary cause of premature death in the world3. The Global Burden of Disease Study 2019 estimated that there were approximately 10.85 million deaths due to hypertension worldwide in 2019, accounting for 31% of all deaths4. With the ageing of the population and changes in residents’ lifestyles, the number of people with hypertension in China continues to increase and it has become an important public health problem5. From 2012 to 2015, the crude prevalence rate of hypertension amongst residents aged 18 and over in China was 27.9% (the standardized rate was 23.2%), and the prevalence was on the rise compared with the past4. The deaths attributable to hypertension amongst Chinese residents increased from 1.22 million in 1990 to 2.599 million in 2019, the increase rate was 112.72%; the mortality attributable to hypertension increased from 103.25/105 in 1990 to 182.79/105 in 2019, the increase rate was 77.04%6.

With the improvement of people’s health needs, the medical model has changed to a biological-social-psychological model. Quality of life (QoL) has been gained more attention in medical field, and has increasingly become a focus for research and application in many fields. When evaluating the therapeutic effect of a disease, not only should biological indicators be used to evaluate the physical function, but also psychological and social indicators to evaluate overall function (i.e. QoL). QoL is a complex concept that is interpreted and defined differently within and between disciplines. In this paper, QoL is individuals’ perception of their position in life in the context of the culture and values systems in which they live and in relation to their goals, expectations, standards and concerns7. Compared with traditional indicators (such as cure rate and mortality), QoL is more suitable for the health evaluation of patients with chronic diseases for it reflect the new medical model and also “patient-centered” orientation. Given the fact that most patients with hypertension require long-term treatment, it becomes increasingly important to investigate the effect of treatment on hypertension patients’ QoL8. There are numerous studies on the QoL of patients with hypertension with a lot of them being on influencing factors such as age, sex, alcohol consumption9,10,11, only a few studies being on the instrument of QoL for hypertension. Although generic instruments for measuring QoL are generally used in both the general population and patients with hypertension, such as Brief Version of World Health Organization Quality of Life12, short- Form-3613, and European Quality of Life-5 Dimensions14, they do not capture the symptoms and side effects specific to hypertension. In contrast, disease-specific instruments such as the short form of hypertension quality of life Questionnaire (MINICHAL)15, Cambridge pulmonary hypertension outcome review (CAMPHOR)16 and Development of the pulmonary arterial hypertension symptoms and impact (PAH-SYMPACT)17 are focused on symptoms and signs that reflect the status of hypertension and more efficient than generic questionnaires. Thus, a more specific QoL measure, developed particularly to assess hypertension problems, would be useful in assessing QoL and evaluating whether treatment is successful or not.

To the best of the author’s knowledge, no scale for hypertension was developed based on the modular approach (a general module plus specific modules). In addition, no Chinese version of any of these instruments was available for use in patients with hypertension in China8. To fill this gap, the author developed a system of QoL Instruments for Chronic Disease (QLICD), which included a general module (QLICD-GM) and a disease-specific module for each disease considered. The QLICD (V1.0) contained one general module and nine disease-specific modules including hypertension (QLICD-HY), coronary heart disease (QLICD-CHD), chronic obstructive pulmonary diseases(QLICD-COPD), chronic pulmonary heart diseases(QLICD-CPHD), bronchia asthma (QLICD-BA), chronic gastritis (QLICD-CG), peptic ulcer (QLICD-PU), irritable bowel syndrome(QLICD-IBS), diabetes mellitus(QLICD-DM), which are widely used in some studies in China18. However, some problems were also found or reported in the clinical application of the QLICD (V1.0). For example, the structure of the scale was not completely reasonable, there was imprecision in reliability estimation and questions surrounding the validity of the test results. To solve these problems, the author developed the latest version of the system QLICD (V2.0), which contained 32 chronic disease-specific scales19. The present study developed the specific module for patients with hypertension, and then combined it with the general module that had been developed to form the hypertension instrument QLICD-HY (V2.0). In this paper, the author aimed to report on the development and validation of the QLICD-HY (V2.0) instrument.

Materials and methods

Development of the QLICD-HY (V2.0)

Development of the QLICD-HY (V2.0) was based on the construction of the general module (QLICD-GM) and the specific module development across several domains for hypertension patients.

Construction of the general module (QLICD-GM)

The method of the programmed decision was used for item selection, and a multi-step process was formed. Firstly, the author had to establish a research team (nominal and focus groups) to define OoL measurement concepts and propose an item pool. Secondly, the author had to screen items to form a primary scale. Thirdly, the author had to screen pre-survey items to form a test version scale. Finally, the author had to evaluate the scale and form the formal scale.

The nominal group consisted of 16 individuals, including seven researchers (two QoL researchers, two sociologists, two psychologists and one epidemiologist), six doctors, two nurses, and one medical manager for chronic diseases, which proposed the item pool by reviewing the literature and referring to existed QoL scales. The focus group was composed of 10 experts, including five researchers (two in QoL, one in epidemiology, one in sociology and one in psychology), which proposed the conceptual framework of QoL and refined items proposed by the nominal group. Both qualitative methods such as in-depth interviews, focus group discussions and quantitative methods such as variation, correlation and factor analyses were used in the item selection process. In the end, 28 items were selected to form the QLICD-GM, including three domains (nine items in the physical domain, 11 items in the psychological domain and eight items in the social domain), and each item was scored on a five-point Likert scale.

The scale was revealed to have a good validity20,21,22 based on data from 1672 individuals with 11 chronic diseases: 185 in hypertension, 163 in chronic gastritis, 170 in peptic ulcer, 124 in chronic obstructive pulmonary disease, 242 in diabetes, 140 in osteoarthritis, 100 in rheumatoid arthritis, 143 in systemic lupus erythematosus, 100 in stroke, 141 in prostatic hyperplasia and 164 in chronic renal failure. The primary steps were summarised in Fig. 1.

Figure 1
figure 1

Steps towards development and validation procedure of QLICD-HY (V2.0).

Construction of the hypertension-specific module

Using the similar multi-step procedure as the general module, 21 items were selected from the 32-item pool of the hypertension-specific module based on literature review, nominal group/focus group discussion and in-depth interviews.

After pre-test and two-stage screenings, the final specific module of 13 items (coded HY1-HY13) was formed including four facets of cerebrovascular symptoms (CES), cardiovascular symptoms (CAS), treatment side effects (TSE) and special effects on mentality and life (EML) (Fig. 1). In brief, the following three statistical procedures were used to screen the items based on pre-test data at this stage.

  1. A.

    Variation procedure. The standard deviation (SD) of the scores for each item was calculated and the items with smaller SD (< 1.10) were deleted.

  2. B.

    Correlation procedure. The Pearson’s correlation coefficients between the score of each item and the sum score of its own domain were computed and the items with smaller correlation coefficients (< 0.50) were deleted.

  3. C.

    Patients item importance rating procedure. The patients were asked to score the importance of each item using a 0–100 score system (0 = extremely unimportant and 100 = extremely important). The items with score < 65 were deleted.

Validation of the QLICD-HY (V2.0)

Data collection

The QLICD-HY (V2.0) was developed by combining the newly created hypertension-specific module with the general module QLICD-GM. It was used to investigate and assess hypertension patients in the field. The survey was conducted at the First Affiliated Hospital of Kunming Medical University in Kunming, Dalang hospital and Shilong Boai hospital in Dongguan, China. We enrolled participants with hypertension at any stage who were: (1) be able to read and write words with assistance and understand the content of the questionnaire; (2) volunteer to participate in the assessment and be able to provide written informed consent. The exclusion criteria comprised: (1) illiterate; (2) unconscious and unable to clearly express their feelings; (3) serious diseases. The study’s objective and significance were described to the participants by the investigators (doctors and medical post-graduate students), who received their informed consent. The survey institution’s ethics committee gave its approval to the research procedure and informed consent form.

At the time of admission into the hospital, all 370 respondents completed the questionnaires. A randomly selected subsample of 121 patients also took part in a follow-up evaluation after one or two days after being hospitalised to determine test–retest reliability, considering their QoL not change in nature because treatments has not been effected in these days.

In addition, after approximately a week of therapy, 321 patients who were accessible at the third scheduled evaluation time point completed the measures at discharge to gauge responsiveness. The investigators promptly verified each response to make sure it was complete. If missing values were discovered, the questionnaire was given back to the patients so they could fill out the blanks.

The scoring method of the QLICD-HY (V2.0)

On the basis of the collected data, the raw scores of items, domains, and the overall scale were determined. Each item on the QLICD-HY is scored on a five-point Likert scale: not at all, slightly, moderately, very much and extremely. The positively expressed items were assigned a score between 1 and 5, whilst the negatively stated items were reverse-coded. Each domain score was derived by aggregating the item scores inside the domain. The total score on the scale was the sum of the domain scores. All domain scores were linearly translated to a 0–100 scale for the purposes of comparison using the following formula: SS = (RS-Min) 100/R, where SS represents the standardised score, RS represents the raw score, Min represents the minimum score and R represents the range of scores.

Reliability

In the present study, internal consistent reliability and test–retest reliability were used to evaluate the scale’s reliability. Internal consistency was determined using Cronbach’s alpha coefficient for each domain, and test–retest reliability was determined using Pearson’s correlation coefficient r and intra-class correlation (ICC). A Cronbach’s alpha coefficient between 0.70 and 0.95 is viewed as sufficient evidence of internal consistency in scale development23. Sehunemann suggested that test–retest reliability between 0.73 and 0.95 was sufficient24.

Validity

Due to the lack of a consensus gold standard for evaluating the QOL of hypertension, the author decided to utilise the Chinese version of the SF-36 as the criteria for evaluating criterion-related validity. The criterion-related validity was assessed by correlating the corresponding domains of QLICD-HY and SF-36, and also the convergence validity and discriminant validity was tested by this multi-trait scaling analysis23 simultaneously. There are two validity criteria: (1) when the item-domain correlation is 0.40 or higher, it supports convergence validity; (2) discriminant validity is revealed when item-domain correlation is higher than that with other domains.

Construct validity was evaluated by calculating the Pearson correlation coefficient (r) among items and domains considering that these scores have been transformed to 0–100 quantitative data with normal distribution, as well as by confirmatory factor analysis using structural equation modeling. The author calculated Pearson’s correlation coefficient r between items and domains with a minimum threshold of 0.4023,25. Concerning structural equation modelling, CFI (comparative fit index)TLI (Tucker–Lewis index), RMSEA (root-mean-square error of approximation) and SRMR (standardized root mean square residual) are indices recommended as sensitive to model misspecification26, with the CFI and TLI with values greater than 0.90 and RMSEA, SRMR less than 0.08 reflecting a good fit of the model to the data.

Responsiveness

The responsiveness was evaluated by comparing the mean difference between the first and third evaluations (before and after treatments). The paired t-test was employed to evaluate responsiveness with the calculation of the standard responsiveness mean (SRM). The SRM was the ratio of the difference between before and after treatments to the standard deviation of the difference. According to Husted27, an SRM above 0.8 indicated good responsiveness, an SRM between 0.5 and 0.2 revealed moderate responsiveness, and an SRM below 0.2 indicated a poor response.

Ethics statement

The study protocol and the informed consent form were approved by the Institutional Review Board of Guangdong Medical University (PJ2013037) and performed in accordance with the World Medical Association Declaration of Helsinki for ethical principles for medical research involving human subjects. All procedures performed in studies involving human participants were in accordance with the ethical standards of the institution or practice at which the studies were conducted. The participants provided their written informed consent to participate in this research.

Results

Socio-demographic characteristics of the sample

The age range for 370 patients with hypertension was 29 to 88 years old, with a median age of 67 years and a mean age of 64.8 + 10.6 years. 160 (43.2%) were male, and 134 (63.3%) had finished primary school. The majority of participants (88.4%) were married, and 43 cases (11.6%) were widowed. Public insurance was the most common type of medical insurance amongst the study population (78.4%). Workers who performed manual labour in factories made up 34.1% of the workforce, followed by farmers with 21.1% (see Table 1).

Table 1 Socio-demographic characteristics of the sample (n = 370).

Reliability

Three approaches were used to examine the reliability of the scale: test–retest, ICC and internal consistency (see Table 2 for details). The test–retest correlation coefficients (r) for the four domains and 13 dimensions of QLICD-HY ranged from 0.49 to 0.92, with r = 0.90 for the overall scale and r = 0.77 for SOD as the lowest amongst the four exclusive domains. The differences in domain and facet scores between the first and second assessments were not statistically significant (P > 0.05). ICC results based on the criterion of the absolute agreement for a single measure were remarkably close to Pearson’s correlation coefficients r. Cronbach’s α coefficient for these four domains was between 0.70 and 0.88, which revealed good internal consistency reliability (Table 2).

Table 2 Reliability, floor and ceiling effects of the quality of life instrument QLICD-HY (V2.0).

Construct validity

Strong correlations between items and their own domains/facets subscales were found using correlation analyses (the most of correlation coefficients were higher than 0.5), while weak associations between items and other domains/facets were also found (see Table 3 in detail). For example, correlation coefficients between items of GPH1-GPH9 and physical function domain (in bold) are greater than those across domains. Similarly, correlation coefficients between items of GSO1–GSO8 and social function domain (in bold) are greater than those across domains. These associations were consistent with the conceptual theoretical constructs.

Table 3 Correlation coefficients r among items and domains of QLICD-HY (V2.0) (n = 370).

Structural equation modeling showed that the structure of the specific module of the QLICD-HY was consistent with the conceptual theoretical construct (four facets), with goodness of fit Chi-square \(\chi^{2}\) = 192.007 (P < 0.001), Tucker–Lewis index (TFI) = 0.941, comparative fit index (CFI) = 0.938, root mean square error of approximation (RMSEA) = 0.078, standardized root mean square residual (SRMR) = 0.066. See Table 4 and Fig. 2 in detail.

Table 4 Structure of the specific module of the QLICD-HY confirmed by SEM (n = 370).
Figure 2
figure 2

The structure of the specific module of QLICD-HY by structural equation modeling.

Criterion-related validity

The Chinese version of the SF-36 was employed to provide data for evaluating the QLICD-HY criterion-related validity. The correlation coefficients between the QLICD-HY and the SF-36 domain scores were expressed in Table 5. It can be seen from Table 5 that, overall, the correlations between the same and similar domains of the QLICD-HY and the SF-36 are higher than those between different and non-similar domains. For instance, the coefficients between physical domain of the QLICD- HY and the SF-36 was 0.50, and that between physical domain of the QLICD- HY and vitality of the SF-36 was 0.53, which were higher than all the other coefficients in this row. Obviously, the physical domain of the QLICD- HY was the same or similar to physical function and vitality of the SF-36. Additionally, the psychological domain of the QLICD-HY and mental health domain of the SF-36 had the largest correlation coefficient (0.67) in this row. These correlations confirm the criterion-related validity and also demonstrate the convergent and divergent validity to a reasonable degree.

Table 5 Correlation coefficients among domains scores of QLICD-HY (V2.0) and SF-36 (n = 370).

Responsiveness

According to Table 6, all domains of the scale were statistically significant (P < 0.05). SRM in various domains ranged from 0.22 to 0.85, with 0.77 and 0.85 for the specific module and overall scale, respectively. Except for psychological and social domains, other domains of SRM were all above 0.70. It could be noted that the QLICD-HY had good responsiveness.

Table 6 Responsiveness of the quality of life instrument QLICD-HY (V2.0) (n = 370).

Discussion

This research focuses on the development and validation of the QLICD-HY (V2.0), a new specific QoL instrument for patients with hypotension. The author developed the second version of QLICD-HY by combing the improved general module QLICD-GM and the developed specific module for hypertension. The general module QLICD-GM capture general QoL in patients with various chronic disease such as chronic obstructive pulmonary disease (COPD) and peptic ulcer, whilst disease-specific modules capture the aspects of QoL that distinguish the different diseases. The content was developed from focus group discussion, in-depth interviews with hypertension patients, pre-testing and psychometric evaluation. The final QLICD-HY (V2.0) was comprised of 41 items, including 28 items of QLICD-GM and 13 items of specific modules, making it concise and usable in research and clinical settings. Therefore, QLICD-HY is different from other QoL instruments for hypertension.

On the basis of the first version of QLICD-GM, the second version of QLICD-GM has undergone various enhancements to strengthen its readability and accessibility21. The evaluation of the QLICD-HY (V2.0) indicates that it is a reliable and valid disease-specific instrument. In this research, the internal consistency reliability and the test–retest reliability were used to determine the QLICD-HY reliability18, with the results indicating that it has a high degree of reliability. The findings of the test–retest reliability analysis reveal the stability of this instrument, which forms the basis for the evaluation of responsiveness to change.

To demonstrate validity, correlation analysis and confirmatory factor analysis by SEM were used, with the results confirming overall good validity. Correlation analysis revealed weak correlations between items and other domains/facets, whereas there were strong associations between items and their own domains/facets. The validity of the construct was also further confirmed by structural equation modeling, which revealed good fit from the data corresponded with the theoretical constructs of the instrument21. Comparable SF-36 domains suggest that the QLICD-HY (V2.0) has good criterion-related validity.

In general, responsiveness evaluation methods can be split into two categories: internal and external27,28. The present study focuses on internal responsiveness with the notion that a sensitive instrument should detect changes in response to therapies when testing post-treatment. In this research, the paired t-test was utilised to assess the mean response before and after therapy, along with indicators of responsiveness such as SRM. SRM is a good indicator for estimating effect size, with values of 0.20, 0.50, and 0.80 corresponding to small, moderate and large responsiveness, respectively27,28. As revealed in Table 6, all domains changed significantly after therapy, which appears to have good responsiveness.

The quality of life has strong cultural dependence; thus, it is necessary to develop a measuring scale with Chinese cultural characteristics. In China, despite the fact that several scales were constructed in accordance with Chinese culture and exhibited acceptable psychometric qualities29,30,31, there were still deficiencies. Initially, these scales were not established systematically using the modular approach (combination of the general module and specific modules), and therefore cannot be used to compare different diseases. Secondly, these scales lacked a unified standard. In addition, they did not adequately consider the influence of pharmacological side effects on patients, and the majority of them were solely utilised in traditional Chinese medicine. Therefore, a scientific, reasonable, reliable and appropriate QOL assessment scale was required for Chinese hypertension patients. In the present study, the QLICD-HY was created by combining the general module of a recognised and well-established system of QOL instruments for chronic diseases (QLICD) with a newly built hypotension-specific module. In comparison to other scales in China, the QLICD-HY not only demonstrates superior validity, reliability and responsiveness, but also the capacity to evaluate QOL across diseases using the general module, displaying both generic and specific qualities. From the structure of the scale, it is comprised of a moderate number of items with a clear hierarchical structure (items → facets → domains → overall) so that analysis of scores may be conducted not only at the domain and overall levels but also at the different facet levels (13 in total) to discover subtle changes. In addition, QLICD-HY has the characteristics of Chinese culture and adds entries to traditional Chinese culture in terms of appetite, sleep and family affection. Thus, QLICD-HY (V2.0) with Chinese cultural characteristics by means of the modular method is both comparable and targeted.

It is worth acknowledging that the present study was also subject to a few constraints. Firstly, the sample size in the present study is not sufficient for the general module’s factor analysis by SEM, and thus it was not carried out here. Secondly, the participants in the present study were chosen from hospitals’ inpatient populations. To determine whether the instrument is generalisable to other contexts and demographics, such as outpatients at a nearby clinic or community, further research is required.

Conclusion

In conclusion, combining the well-recognised general module of chronic diseases with the specific module of hypertension, the second edition of QLICD-HY was developed within the context of Chinese culture. It has good validity, reliability and responsiveness, and may be used to assess the QOL of Chinese hypertension patients. This instrument and its future improvements may have the potential to be applied in other countries to improve the measurement of QOL for hypertension patients if the findings are replicated in more general study groups.