Development of computer adaptive testing for measuring depression in patients with cancer

Kurisu, Ken; Hashimoto, Masayuki; Ishizawa, Tetsuro; Shibayama, Osamu; Inada, Shuji; Fujisawa, Daisuke; Inoguchi, Hironobu; Shimoda, Haruki; Inoue, Shinichiro; Ogawa, Asao; Akechi, Tatsuo; Shimizu, Ken; Uchitomi, Yosuke; Matsuyama, Yutaka; Yoshiuchi, Kazuhiro

doi:10.1038/s41598-022-12318-x

Download PDF

Article
Open access
Published: 17 May 2022

Development of computer adaptive testing for measuring depression in patients with cancer

Ken Kurisu¹,
Masayuki Hashimoto²,
Tetsuro Ishizawa¹,
Osamu Shibayama^1,3,
Shuji Inada^1,4,
Daisuke Fujisawa^5,6,
Hironobu Inoguchi⁷,
Haruki Shimoda^8,9,
Shinichiro Inoue¹⁰,
Asao Ogawa⁶,
Tatsuo Akechi^11,12,
Ken Shimizu¹³,
Yosuke Uchitomi¹⁴,
Yutaka Matsuyama¹⁵ &
…
Kazuhiro Yoshiuchi¹

Scientific Reports volume 12, Article number: 8247 (2022) Cite this article

1356 Accesses
12 Altmetric
Metrics details

Subjects

Abstract

The usefulness of depression scales for patients with cancer based on item response theory (IRT) and computer adaptive testing (CAT) has not yet been fully explored. This study thus aimed to develop an IRT-based tool for measuring depression in patients with cancer. We analyzed data from 393 patients with cancer from four tertiary centers in Japan who had not received psychiatric treatment. They answered 62 questions across five categories regarding their psychiatric status over the previous week. We selected 28 items that satisfied the assumptions of IRT, fitted a graded response model to these items, and performed CAT simulations. The CAT simulation used an average of 6.96 items and showed a Pearson’s correlation coefficient of 0.916 (95% confidence interval, 0.899–0.931) between the degree of depression estimated by simulation and that estimated using all 28 items. The measurement precision of CAT with only four items was superior to that of the estimation using the calibrated Patient Health Questionnaire-9. These results imply that this scale is useful and accurate for measuring depression in patients with cancer.

A case study of an individual participant data meta-analysis of diagnostic accuracy showed that prediction regions represented heterogeneity well

Article Open access 07 June 2023

Revisiting the theoretical and methodological foundations of depression measurement

Article 14 April 2022

The feasibility of a Bayesian network model to assess the probability of simultaneous symptoms in patients with advanced cancer

Article Open access 24 December 2022

Introduction

Depression frequently occurs in patients with cancer^1,2. Even mild levels of depression reportedly decrease the quality-adjusted life-year score³. Furthermore, patients with cancer have a higher risk of suicide than the general population^4,5,6. Several interventions, such as specialized palliative care, can reduce psychological symptoms^7,8, thus, requiring accurate scales for measuring depression. Clinicians commonly evaluate the condition through self-administered tools, such as the Patient Health Questionnaire-9 (PHQ-9)^9,10 and the Hospital Anxiety and Depression Scale (HADS)¹¹, both of which are based on classical test theory. These have several disadvantages, including sample dependency, inability to replace or add items, and requirement for participants to answer all items regardless of severity. Scales developed based on item response theory (IRT) and computer adaptive testing (CAT) can address these limitations¹². This is because IRT-based scales can reveal sample-independent subject traits, and CAT can optimize the way items are presented and reduce the associated burdens placed on patients. Several studies have applied CAT-based depression scales, which were developed in the Patient-Reported Outcomes Measurement Information System (PROMIS) project, to patients with cancer^13,14,15,16. However, the usefulness of CAT-based scales for measuring depression in patients with cancer has not yet been fully explored. Thus, this study aimed to develop a CAT-based scale for measuring depression in patients with cancer.

Methods

Ethical approval

All participants provided written informed consent. The institutional review board of the National Cancer Center Hospital (approval number: 2010-202) and all the participating sites approved the study. This study was in accordance with the ethical standards of the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Study design and participants

This multicenter prospective study was conducted at four tertiary centers in Japan (National Cancer Center Hospital, National Cancer Center Hospital East, Okayama University Hospital, and the University of Tokyo Hospital) between May 2011 and December 2012. The study included patients who (a) were aged ≥ 20 years, (b) had been diagnosed with any type of cancer, (c) had Eastern Cooperative Oncology Group performance status ≤ 2, and (d) were selected for or were already receiving anti-cancer treatments. Patients who (a) had received psychiatric treatments within the previous two months, or (b) were considered extremely sick to participate by their physicians-in-charge were excluded from the study. We recruited participants upon admission.

Data collection

We developed 62 items to measure depression. Table 1 shows examples of items translated from Japanese to English. Several psycho-oncologists independently drafted the items based on the diagnostic criteria and common symptoms of depression. Subsequently, they discussed and finalized these items. All items asked participants about their depressive mood over the preceding week, with each item answered on a 5-point scale (1 = none, 2 = rarely, 3 = sometimes, 4 = often, 5 = always).

Table 1 Examples of items translated from Japanese to English.

Full size table

To confirm concurrent validity, participants completed the PHQ-9, which is widely used to measure depression and has been validated in patients with cancer^9,10. The PHQ-9 total scores of 5–9, 10–14, 15–19, and 20–27 correspond to mild, moderate, moderately severe, and severe depression, respectively.

Overview of statistical analyses

Based on the analytic methods used in the PROMIS project¹⁷ and several studies on CAT^18,19,20, we conducted the following analyses: (1) descriptive statistics, (2) evaluation of the IRT assumptions, (3) fitting a graded response model (GRM) to the data, (4) evaluation of differential item functioning (DIF), (5) CAT simulations, and (6) calibration of the PHQ-9. All analyses were conducted using the open-source R software (version 4.1.1). Statistical significance was set at P < 0.05.

Descriptive statistics

Cronbach’s alpha was used to measure internal consistency (analyzed using the R package “psych,” version 2.1.9). Items with unanswered categories were excluded because their parameters could not be estimated, and those with an item-remainder correlation < 0.3 were also excluded due to violation of internal consistency²¹.

Evaluation of assumptions of the IRT model

We evaluated the assumptions of IRT including unidimensionality, local independence, and monotonicity¹⁷.

We tested unidimensionality by conducting principal component analysis (PCA), confirming that the proportion of variance of the first factor was ≥ 20% and the ratio of variance of the first factor to the second factor was ≥ 4^17,18. We excluded items with a low contribution to the first factor to satisfy these criteria.

Subsequently, we tested local independence by conducting a one-factor confirmatory factor analysis, producing a residual correlation matrix (analyzed using the R package “lavaan,” version 0.6–9). From the pairs of items with residual correlations > 0.2^17,18, we excluded the item with a lower contribution to the first factor of the PCA.

Finally, we tested monotonicity by developing a nonparametric IRT model (analyzed using the R package “mokken,” version 3.0.6), and excluded items with a scalability coefficient < 0.3¹⁸.

Graded response model

We fitted a GRM to the remaining items (analyzed using the R package “mirt,” version 1.35.1) to estimate discrimination and difficulty parameters for each item and latent factor θ (i.e., degree of depression) for each patient using maximum a posteriori (MAP). Subsequently, we excluded items that contained categories without maximum probability at any θ. We also examined fit statistics (S-X²) for each item, excluding those with a poor fit, as determined at an alpha level of 0.01¹⁷.

Evaluation of DIF

We evaluated DIF for age (≥ 65 or < 65) and sex (male or female) (analyzed using the “DIF” function in the R package “mirt,” version 1.35.1) and excluded items with an alpha level of 0.01^17,18.

CAT simulations

Following the item selection process, we recalculated Cronbach’s alpha, redeveloped a GRM, and recalculated discrimination and difficulty parameters for each item as well as θ for each patient (θ_true).

We used the resulting items and θ_true to perform CAT simulations (analyzed using the R package “catIrt,” version 0.5–0)¹⁹. At the beginning of the simulations, the estimated latent factor (θ_est) was set to zero, and the minimum number of items administrated was set to three. We conducted simulations using various combinations of latent factor estimators, item selection methods, and termination criteria.

Latent factor estimators were: (a) maximum likelihood estimation (MLE), (b) Bayesian modal estimation (BME), and (c) expected a priori estimation (EAP). Item selection methods were as follows: (a) unweighted Fisher information (UW-FI), and (b) pointwise Kullback–Leibler divergence (FP-KL). Termination criteria were: (a) standard error of measurement (SEM) threshold of 0.32 or (b) that of 0.50, while the simulations were also terminated upon reaching the maximum number of items.

We calculated Pearson’s correlation coefficients (PCCs) between θ_est and θ_true to measure the simulation accuracy, and PCCs between θ_est and the total score on the PHQ-9 to confirm concurrent validity.

Calibration of PHQ-9 to the IRT model

To compare the measurement precision of the scale with that of the PHQ-9, we calibrated the PHQ-9 to the GRM model (analyzed using the “fixedCalib” function in the R package “mirt,” version 1.35.1), and performed an estimation using the calibrated items²⁰. We plotted the Lowess curves of SEMs for the following: (a) CAT simulations with a fixed number of items and (b) estimation using the calibrated PHQ-9 items. Subsequently, we determined the minimum number of items required to surpass the measurement precision of the calibrated PHQ-9.

Results

Study participants

A total of 393 participants completed the questionnaires. The average score for all items was 1.44/5. The descriptive data are shown in Table 2. Among 289 patients who completed the PHQ-9, 77 (27%), 15 (5%), and 5 (2%) patients showed mild, moderate, and moderately severe to severe depression, respectively.

Table 2 Descriptive data of study participants.

Full size table

Item selection and parameter estimation

Cronbach’s alpha for all 62 items was 0.97. As shown in Fig. 1, 28 of these items were included in the GRM and CAT simulations. Cronbach’s alpha was 0.95 after the item selection. Most unanswered categories comprised those with higher scores (4 or 5).

The examples of parameters in the GRM are shown in Table 3 (see Supplementary Table 1 for the parameters of all the items). Overall, the discrimination parameters ranged from 1.53 to 3.32. The first and last difficulty parameters ranged from 0.09 to 1.60 and 2.55 to 4.33, respectively. The item with the highest discrimination parameter was “I feel depressed and have difficulty in daily life.” The items with the lowest difficulty parameter were “I often feel helpless” and “I feel hopeless for the future.” The items with the highest difficulty parameter were “I need help with my depression” and “Others don't understand me”.

Table 3 Examples of estimated parameters.

Full size table

CAT simulations

The results of the CAT simulations are presented in Table 4. When the termination criteria of the SEM threshold were set to 0.50, the most accurate simulation used the BME estimator and UW-FI item selection, achieving a PCC of 0.916 (95% confidence interval [CI], 0.899–0.931) using an average of 6.96 items. It also achieved a PCC with a total PHQ-9 score of 0.669 (95% CI, 0.600–0.728).

Table 4 Results of the computerized adaptive testing simulations.

Full size table

The Lowess curves for the SEMs of the CAT simulations are shown in Fig. 2. CAT using only four items had smaller SEMs at any θ_est than the estimation using the calibrated PHQ-9. The estimated parameters of PHQ-9 are listed in Supplementary Table 2.

Discussion

We developed a new scale for measuring depression in patients with cancer based on an IRT model and CAT simulations. The CAT simulations showed that a small number of items could accurately measure the degree of depression. The scale also showed a significant correlation with the PHQ-9 total score and achieved a smaller SEM than the calibrated PHQ-9 using only four items.

More than half of the items were excluded through the item selection process. The main reasons for this included unanswered categories, violations of local independence, and unsuitable category response curves. The existence of unanswered categories, which mainly comprised those with higher scores, may have resulted from the exclusion of patients undergoing psychiatric treatment. The existence of local dependence in many items might suggest duplication or redundancy in our item development. The unsuitable category response curves may have resulted from sample size insufficiency because the GRM requires more than 500 samples to estimate the parameters accurately²². The remaining 28 items exhibited a Cronbach’s alpha of 0.95, suggesting substantial internal consistency²³.

The exclusion of more than half of the items may also be attributable to the item selection using the unidimensional model. Instead, the bifactor model applied for larger item banks would be beneficial for developing CAT with more items. Gibbons et al. showed that such an analysis could result in the development of a CAT measuring depression/anxiety with hundreds of items^24,25. Such an analysis would also be necessary for our aim to develop CAT measuring depression in patients with cancer.

The parameters of several items may explain the characteristics of depression in patients with cancer. The discrimination parameter corresponds to the slope of the GRM and indicates the ability to discriminate subjects’ traits. The highest discriminative item was about the influence on daily life. Such an influence appears highly informative for assessing depression in patients with cancer. A previous study, which assessed depression in patients with cancer using IRT, showed that social withdrawal or decreased talkativeness is highly discriminative²⁶, which is similar to the result of the present study. However, other IRT-based studies on depression in patients with cancer did not include items that assessed the influence on daily lives^27,28,29. Thus, the importance of this item needs to be further examined.

The difficulty parameter indicates the traits of participants at which the probability of choosing either of the two adjacent categories is equal. Thus, items with high difficulty parameters were selected by participants with high severity, whereas items with low difficulty parameters were selected even by participants with low severity. The items about helplessness and hopelessness showed low difficulty parameters, suggesting that patients with cancer easily experience these symptoms. In contrast, the items about support and understanding from others showed high difficulty parameters, suggesting that these symptoms would be observed in highly depressive patients with cancer. These items were not included in previous IRT-based studies on depression in patients with cancer^26,27,28,29. In addition, the item selection process may have excluded items with higher difficulty or those with less difficulty. Thus, further studies are required to determine the importance of these items.

The CAT simulations achieved high measurement accuracy using a small number of items, exhibiting strength in shortening the health measurement scales. The significant correlation with the PHQ-9 score implies the ability of the scale to measure depression. Moreover, the CAT simulations showed a higher measurement accuracy than the estimation using the calibrated PHQ-9. Thus, the scale can be employed in clinical settings to efficiently evaluate depression in patients with cancer. The CAT developed in this study could be made available online, as in the PROMIS project, which would allow efficient assessment of depression in patients with cancer to be applied in clinical settings, such as palliative care and psycho-oncology.

This study has several limitations that need to be addressed. First, we excluded patients who were undergoing psychiatric treatments, which may have affected item selection and limited the situations of the CAT’s usage. Second, the sample size was insufficient. GRM reportedly requires more than 500 samples to estimate parameters of 25 items appropriately²². However, we recruited only 393 participants to estimate the parameters of 62 items. Third, we could not perform a DIF analysis for the history of depression because only few participants had it, which is likely due to the exclusion of participants under psychiatric treatments. Fourth, the final item set is limited, and only a small number of them are available for adaptive administration at each level of depression severity. Fifth, the moderate correlation between the CAT and the PHQ-9, with a correlation coefficient of 0.67, may imply inadequate measurement of depression by the scale. This result might also suggest that the items did not cover all subdomains of depression, such as somatic symptoms. Further investigation is necessary to examine the correlation between the CAT and the HADS. Finally, we did not confirm that the scale could accurately classify whether a patient has a major depressive disease or not. Several studies examined the diagnostic performance of the developed CAT for patients diagnosed through gold standard measures, such as structured interviews^24,25. Such examination for diagnostic ability is also required for the newly developed CAT in the present study.

In conclusion, this study developed a scale for measuring depression in patients with cancer based on IRT and CAT, providing a useful and improved way for clinicians to evaluate depression in patients with cancer.

Data availability

The datasets analyzed during the current study are not publicly available because the approval of data sharing has not been obtained from the institutional review board but are available from the corresponding authors on reasonable request.

References

Mitchell, A. J. et al. Prevalence of depression, anxiety, and adjustment disorder in oncological, haematological, and palliative-care settings: A meta-analysis of 94 interview-based studies. Lancet Oncol. 12, 160–174 (2011).
Article Google Scholar
Lu, D. et al. Clinical diagnosis of mental disorders immediately before and after cancer diagnosis: A nationwide matched cohort study in Sweden. JAMA Oncol. 2, 1188–1196 (2016).
Article Google Scholar
Fujisawa, D. et al. Impact of depression on health utility value in cancer patients. Psychooncology. 25, 491–495 (2016).
Article Google Scholar
Fang, F. et al. Suicide and cardiovascular death after a cancer diagnosis. N. Engl. J. Med. 366, 1310–1318 (2012).
Article Google Scholar
Henson, K. E. et al. Risk of suicide after cancer diagnosis in England. JAMA Psychiat. 76, 51–60 (2019).
Article Google Scholar
Harashima, S. et al. Death by suicide, other externally caused injuries and cardiovascular diseases within 6 months of cancer diagnosis (J-SUPPORT 1902). Jpn. J. Clin. Oncol. 51, 744–752 (2021).
Article Google Scholar
Holmenlund, K., Sjøgren, P. & Nordly, M. Specialized palliative care in advanced cancer: What is the efficacy? A systematic review. Palliat. Support Care 15, 724–740 (2017).
Article Google Scholar
Kassianos, A. P., Ioannou, M., Koutsantoni, M. & Charalambous, H. The impact of specialized palliative care on cancer patients’ health-related quality of life: A systematic review and meta-analysis. Support Care Cancer. 26, 61–79 (2018).
Article Google Scholar
Kroenke, K., Spitzer, R. L. & Williams, J. B. The PHQ-9: Validity of a brief depression severity measure. J Gen Intern Med. 16, 606–613 (2001).
Article CAS Google Scholar
Fann, J. R. et al. Depression screening using the patient health questionnaire-9 administered on a touch screen computer. Psychooncology 18, 14–22 (2009).
Article Google Scholar
Zigmond, A. S. & Snaith, R. P. The hospital anxiety and depression scale. Acta Psychiatr. Scand. 67, 361–370 (1983).
Article CAS Google Scholar
Reise, S. P. & Waller, N. G. Item response theory and clinical measurement. Annu. Rev. Clin. Psychol. 5, 27–48 (2009).
Article Google Scholar
Clover, K. et al. PROMIS depression measures perform similarly to legacy measures relative to a structured diagnostic interview for depression in cancer patients. Qual. Life Res. 27, 1357–1367 (2018).
Article Google Scholar
Stone, A. A., Broderick, J. E., Junghaenel, D. U., Schneider, S. & Schwartz, J. E. PROMIS fatigue, pain intensity, pain interference, pain behavior, physical function, depression, anxiety, and anger scales demonstrate ecological validity. J. Clin. Epidemiol. 74, 194–206 (2016).
Article Google Scholar
Wagner, L. I. et al. Bringing PROMIS to practice: Brief and precise symptom screening in ambulatory cancer care. Cancer 121, 927–934 (2015).
Article Google Scholar
Baum, G., Basen-Engquist, K., Swartz, M. C., Parker, P. A. & Carmack, C. L. Comparing PROMIS computer-adaptive tests to the brief symptom inventory in patients with prostate cancer. Qual. Life Res. 23, 2031–2035 (2014).
Article Google Scholar
Reeve, B. B. et al. Psychometric evaluation and calibration of health-related quality of life item banks: Plans for the Patient-Reported Outcomes Measurement Information System (PROMIS). Med. Care 45, S22–S31 (2007).
Article Google Scholar
De Beurs, D. P., de Vries, A. L., de Groot, M. H., de Keijser, J. & Kerkhof, A. J. Applying computer adaptive testing to optimize online assessment of suicidal behavior: A simulation study. J. Med. Internet. Res. 16, e207 (2014).
Article Google Scholar
Stochl, J., Böhnke, J. R., Pickett, K. E. & Croudace, T. J. An evaluation of computerized adaptive testing for general psychological distress: Combining GHQ-12 and Affectometer-2 in an item bank for public mental health research. BMC Med. Res. Methodol. 16, 58 (2016).
Article Google Scholar
Gibbons, L. E. et al. Migrating from a legacy fixed-format measure to CAT administration: Calibrating the PHQ-9 to the PROMIS depression measures. Qual. Life Res. 20, 1349–1357 (2011).
Article Google Scholar
Kline, P. A handbook of test construction (Methuen, 1986).
Google Scholar
Reise, S. P. & Yu, J. Parameter recovery in the graded response model using MULTILOG. J. Educ. Meas. 27, 133–144 (1990).
Article Google Scholar
Ponterotto, J. G. & Ruckdeschel, D. E. An overview of coefficient alpha and a reliability matrix for estimating adequacy of internal consistency coefficients with psychological research measures. Percept. Mot. Skills. 105, 997–1014 (2007).
Article Google Scholar
Gibbons, R. D. et al. Development of a computerized adaptive test for depression. Arch. Gen. Psychiatry 69, 1104–1112 (2012).
Article Google Scholar
Gibbons, R. D. et al. Development of the CAT-ANX: A computerized adaptive test for anxiety. Am. J. Psychiatry 171, 187–194 (2014).
Article Google Scholar
Akechi, T. et al. Symptom indicator of severity of depression in cancer patients: A comparison of the DSM-IV criteria with alternative diagnostic criteria. Gen. Hosp. Psychiatry 31, 225–232 (2009).
Article Google Scholar
Pergolotti, M. et al. Mental status evaluation in older adults with cancer: Development of the Mental Health Index-13. J. Geriatr. Oncol. 10, 241–245 (2019).
Article Google Scholar
Bjorner, J. B. et al. Use of item response theory to develop a shortened version of the EORTC QLQ-C30 emotional functioning scale. Qual. Life Res. 13, 1683–1697 (2004).
Article CAS Google Scholar
van der Donk, L. J. et al. The value of distinct depressive symptoms (PHQ-9) to differentiate depression severity in cancer survivors: An item response approach. Psychooncology. 28, 2240–2243 (2019).
Article Google Scholar

Download references

Acknowledgements

This work was partly supported by a grant-in-aid for Clinical Cancer Research (Grant Number H22-033) and MHLW EA Program (Grant Number JPMH20EA1012).

Author information

Authors and Affiliations

Department of Stress Sciences and Psychosomatic Medicine, Graduate School of Medicine, The University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, Tokyo, 113-8655, Japan
Ken Kurisu, Tetsuro Ishizawa, Osamu Shibayama, Shuji Inada & Kazuhiro Yoshiuchi
Department of Psychosomatic Medicine, Sunagawa City Medical Center, Sunagawa, Hokkaido, Japan
Masayuki Hashimoto
Department of Psychosomatic Medicine, Yokohama Rosai Hospital, Yokohama, Kanagawa, Japan
Osamu Shibayama
Department of Psychosomatic Medicine, Kindai University Hospital, Osakasayama, Osaka, Japan
Shuji Inada
Department of Neuropsychiatry and Palliative Care Center, Keio University School of Medicine, Tokyo, Japan
Daisuke Fujisawa
Department of Psycho-Oncology, National Cancer Center Hospital East, Kashiwa, Chiba, Japan
Daisuke Fujisawa & Asao Ogawa
Department of Psycho-Oncology, National Cancer Center, Tokyo, Japan
Hironobu Inoguchi
Department of Hygiene and Preventive Medicine, School of Medicine, Iwate Medical University, Iwate, Japan
Haruki Shimoda
Department of Mental Health, The University of Tokyo, Tokyo, Japan
Haruki Shimoda
Department of Neuropsychiatry, Okayama University, Okayama, Japan
Shinichiro Inoue
Center for Psycho-Oncology and Palliative Care, Nagoya City University Hospital, Nagoya, Aichi, Japan
Tatsuo Akechi
Department of Psychiatry and Cognitive-Behavioral Medicine, Nagoya City University, Graduate School of Medical Sciences, Nagoya, Aichi, Japan
Tatsuo Akechi
Department of Psycho-Oncology, Cancer Institute Hospital, Tokyo, Japan
Ken Shimizu
Innovation Center for Supportive, Palliative and Psychosocial Care, National Cancer Center, Tokyo, Japan
Yosuke Uchitomi
Department of Biostatistics, School of Public Health, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Yutaka Matsuyama

Authors

Ken Kurisu
View author publications
You can also search for this author in PubMed Google Scholar
Masayuki Hashimoto
View author publications
You can also search for this author in PubMed Google Scholar
Tetsuro Ishizawa
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Shibayama
View author publications
You can also search for this author in PubMed Google Scholar
Shuji Inada
View author publications
You can also search for this author in PubMed Google Scholar
Daisuke Fujisawa
View author publications
You can also search for this author in PubMed Google Scholar
Hironobu Inoguchi
View author publications
You can also search for this author in PubMed Google Scholar
Haruki Shimoda
View author publications
You can also search for this author in PubMed Google Scholar
Shinichiro Inoue
View author publications
You can also search for this author in PubMed Google Scholar
Asao Ogawa
View author publications
You can also search for this author in PubMed Google Scholar
Tatsuo Akechi
View author publications
You can also search for this author in PubMed Google Scholar
Ken Shimizu
View author publications
You can also search for this author in PubMed Google Scholar
Yosuke Uchitomi
View author publications
You can also search for this author in PubMed Google Scholar
Yutaka Matsuyama
View author publications
You can also search for this author in PubMed Google Scholar
Kazuhiro Yoshiuchi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All the authors contributed to the preparation of the protocol. K.K. performed the statistical analyses. K.Y. supervised the study. All authors participated in interpreting the results and writing the manuscript and approved the final version of the manuscript.

Corresponding author

Correspondence to Kazuhiro Yoshiuchi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kurisu, K., Hashimoto, M., Ishizawa, T. et al. Development of computer adaptive testing for measuring depression in patients with cancer. Sci Rep 12, 8247 (2022). https://doi.org/10.1038/s41598-022-12318-x

Download citation

Received: 24 January 2022
Accepted: 29 April 2022
Published: 17 May 2022
DOI: https://doi.org/10.1038/s41598-022-12318-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

A case study of an individual participant data meta-analysis of diagnostic accuracy showed that prediction regions represented heterogeneity well

Revisiting the theoretical and methodological foundations of depression measurement

The feasibility of a Bayesian network model to assess the probability of simultaneous symptoms in patients with advanced cancer

Introduction

Methods

Ethical approval

Study design and participants

Data collection

Overview of statistical analyses

Descriptive statistics

Evaluation of assumptions of the IRT model

Graded response model

Evaluation of DIF

CAT simulations

Calibration of PHQ-9 to the IRT model

Results

Study participants

Item selection and parameter estimation

CAT simulations

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links