Methods for stroke severity assessment by chart review in the Atherosclerosis Risk in Communities study

Koton, Silvia; Patole, Shalom; Carlson, Julia M.; Haight, Taylor; Johansen, Michelle; Schneider, Andrea L. C.; Pike, James Russell; Gottesman, Rebecca F.; Coresh, Josef

doi:10.1038/s41598-022-16522-7

Download PDF

Article
Open access
Published: 19 July 2022

Methods for stroke severity assessment by chart review in the Atherosclerosis Risk in Communities study

Silvia Koton^1,2,
Shalom Patole²,
Julia M. Carlson³,
Taylor Haight⁴,
Michelle Johansen⁵,
Andrea L. C. Schneider⁶,
James Russell Pike⁷,
Rebecca F. Gottesman⁸ &
…
Josef Coresh²

Scientific Reports volume 12, Article number: 12338 (2022) Cite this article

1605 Accesses
5 Citations
Metrics details

Subjects

Abstract

Stroke severity is the most important predictor of post-stroke outcome. Most longitudinal cohort studies do not include direct and validated measures of stroke severity, yet these indicators may provide valuable information about post-stroke outcomes, as well as risk factor associations. In the Atherosclerosis Risk in Communities (ARIC) study, stroke severity data were retrospectively collected, and this paper outlines the procedures used and shares them as a model for assessment of stroke severity in other large epidemiologic studies. Trained physician abstractors, who were blinded to other clinical events, reviewed hospital charts of all definite/probable stroke events occurring in ARIC. In this analysis we included 1,198 ischemic stroke events occurring from ARIC baseline (1987–1989) through December 31, 2009. Stroke severity was categorized according to the National Institutes of Health Stroke Scale (NIHSS) score and classified into 5 levels: NIHSS ≤ 5 (minor), NIHSS 6–10 (mild), NIHSS 11–15 (moderate), NIHSS 16–20 (severe), and NIHSS > 20 (very severe). We assessed interrater reliability in a subgroup of 180 stroke events, reviewed independently by the lead abstraction physician and one of the four secondary physician abstractors. Interrater correlation coefficients for continuous NIHSS score as well as percentage of absolute agreement and Cohen Kappa Statistic for NIHSS categories were presented. Determination of stroke severity by the NIHSS, based on data abstracted from hospital charts, was possible for 97% of all ischemic stroke events. Median (25%-75%) NIHSS score was 5 (2–8). The distribution of NIHSS category was NIHSS ≤ 5 = 58.3%, NIHSS 6–10 = 24.5%, NIHSS 11–15 = 8.9%, NIHSS 16–20 = 4.7%, NIHSS > 20 = 3.6%. Overall agreement in the classification of severity by NIHSS category was present in 145/180 events (80.56%). Cohen’s simple Kappa statistic (95% CI) was 0.64 (0.55–0.74) and weighted Kappa was 0.79 (0.72–0.86). Mean (SD) NIHSS score was 5.84 (5.88), with a median score of 4 and range 0–31 for the lead reviewer (rater 1) and mean (SD) 6.16 (6.10), median 4.5 and range 0–36 in the second independent assessment (rater 2). There was a very high correlation between the scores reported in both assessments (Pearson r = 0.90). Based on our findings, we conclude that hospital chart-based retrospective assessment of stroke severity using the NIHSS is feasible and reliable.

Development and verification of a nomogram for predicting short-term mortality in elderly ischemic stroke populations

Article Open access 03 August 2023

Prevalence and incidence of stroke in Latin America and the Caribbean: a systematic review and meta-analysis

Article Open access 26 April 2023

Predictors of stroke favorable functional outcome in Guinea, results from the Conakry stroke registry

Article Open access 21 January 2022

Stroke severity is the most important predictor of post-stroke outcome¹ and has consistently been associated with short- and long-term mortality and disability after stroke^2,3,4,5,6.

In the Atherosclerosis Risk in Communities (ARIC) prospective study, broad information on personal characteristics, socioeconomic data and prevalence of cardiovascular risk factors was collected from 15,792 Black and White individuals through interviews and physical exams at baseline, and follow-up exams, phone interviews, and active surveillance of discharges from local hospitals⁷. Although ARIC includes high quality individual-level information on strokes in a large cohort collected starting in 1987–1989; direct measures of stroke severity on admission had not previously been collected. The National Institutes of Health Stroke Scale (NIHSS) is a simple, valid, reliable, and widely used systematic tool for quantitative measurement of ischemic stroke‐related neurological deficits⁸. Moreover, regulators require adjustment of stroke outcomes for stroke severity, and the NIHSS has become the generally accepted and agreed metric for regulatory compliance⁹; therefore adding data on NIHSS for stroke events occurring in ARIC is important.

In 2018, we initiated a nested study aimed at collecting data from ARIC stroke de-identified hospital charts. In this manuscript, our goals are to demonstrate the feasibility of generating retrospective NIHSS ratings in a large epidemiologic study, to show that chart-based NIHSS ratings are reliable, and to document the procedures in ARIC and share them as a model for assessment of stroke severity in other large epidemiologic studies.

Methods

Study setting

The Atherosclerosis Risk in Communities (ARIC) study is a community-based prospective cohort study, including 15,792 adults 45–64 years old at baseline (1987–1989). Participants were sampled from four US communities: Forsyth County, North Carolina; suburbs of Minneapolis, Minnesota; Washington County, Maryland; and Jackson, Mississippi⁷.

In ARIC, stroke events and deaths are identified since 1987–1989 through review of hospital records as well as phone interviews initially conducted annually, then semi-annually from 2012, and in-person ARIC visits¹⁰. As has been standard practice in ARIC since the initiation of the cohort, after an initial computer-generated algorithm is applied¹¹, stroke physician reviewers verify stroke events and adjudicate those verified events as definite, probable, or possible ischemic or hemorrhagic strokes. In cases of disagreement with the algorithm, a second physician review is conducted¹¹. However, data on stroke severity had not been collected.

In order to determine severity of stroke events, in 2018, we initiated a nested study to collect data from ARIC stroke de-identified hospital charts that were previously adjudicated as definite or probable incident or recurrent stroke events, as part of ongoing surveillance in ARIC. All ischemic stroke ARIC charts were reviewed and data on all items in the NIHSS were evaluated according to an algorithm for retrospective collection of the NIHSS score that has been shown to be valid across the entire stroke severity spectrum^12,13. For the present analysis, we included events occurring in ARIC from 1987–1989 to December 31st, 2009 that were adjudicated as definite or probable ischemic strokes. Standardized criteria for ischemic stroke have previously been reported¹¹. Data on severity of stroke were collected and managed using the secure, web-based software platform REDCap (Research Electronic Data Capture) electronic data capture tools hosted at Johns Hopkins University^14,15. The data that support the findings of this study are available from the corresponding author upon reasonable request.

Chart-based retrospective assessment of NIHSS

Before initiation of this study, participating reviewers, who were all physicians, were required to complete the NIH Stroke Scale online training (https://learn.heart.org/lms/nihss) and obtain a certificate of completion issued by the American Stroke Association. Following certification, they all participated in a training session aimed at achieving a standardized approach for retrospective assessment of NIHSS using data in ARIC stroke charts. Training included cross-review of 20 charts of incident/recurrent stroke events randomly selected.

Collection of data from charts of all ischemic stroke events in ARIC from 1987–1989 to December 31st, 2009 was conducted by a lead abstraction physician to minimize inter-rater variation. The following information available in hospitalization charts was used to evaluate severity of stroke on admission: admission notes and discharge summary, imaging reports (magnetic resonance imaging scans, computerized tomography scans, carotid ultrasound, angiographies and other imaging reports if available), neurology consult notes, progress notes, clinical evaluation and examination of the patient and autopsy reports if available. We aimed to assess stroke severity on admission; therefore, we prioritized abstraction of information from neurological evaluations reported in notes on admission. If there was more than one neurological evaluation in the admission notes, we used information reported for the first exam. In case of inconsistencies in the available data, we looked for additional evidence. Our priorities for collection of data were: 1. Neurology consult (if performed on admission or soon after admission), 2. Admission notes, 3. Other notes. According to the algorithm for retrospective evaluation of NIHSS, missing physical examination data were scored as normal¹³. This approach was suggested by Williams et al. based on the assumption that neurological examination items not mentioned in the hospitalization charts are usually normal¹³. In cases in which there was doubt regarding the determination of severity level, all the existing data were re-examined in a process led by the project PI (SK) and an agreement on the severity score was reached.

Because of variability of available records, and the wide range of time from which these records were available, each reviewer reported both the NIHSS estimated score and the estimated severity category, classified into 5 levels according to the NIHSS score: NIHSS ≤ 5 (minor), NIHSS 6–10 (mild), NIHSS 11–15 (moderate), NIHSS 16–20 (severe), and NIHSS > 20 (very severe). Completeness and quality of data were routinely assessed during the entire process of data review and abstraction by the project PI (SK), in order to assure collection of valid information.

Reliability assessment

For the assessment of inter-rater reliability, a second review was conducted, independently from the first review by the lead reviewer, by one of the four additional physician reviewers that went through the training process. Overall, 180 charts (15% of all definite/probable ischemic strokes) were reviewed by a second reviewer.

Interrater correlation coefficients for continuous NIHSS score and percentage of absolute agreement and Cohen Kappa Statistic for NIHSS categories were calculated. Symmetry test was performed to evaluate possible differences by raters in the propensity to select higher and lower NIHSS categories. We did not assess intra-rater reliability since we felt that, if given the same case to review twice, the probability a reviewer would remember their previous classification of severity would be high, due to the comprehensive review of all the documents in the stroke charts conducted in order to abstract severity data.

Ethics approval and consent to participate

The ARIC study protocol was approved by the Institutional Review Board (IRB) of each participating center, and informed consent was obtained from participants at each study visit. The present study was reviewed and approved by the Johns Hopkins University IRB. The informed consent included permission to obtain hospital records and research was performed in accordance with relevant guidelines and regulations.

Results

After the training period, chart review and abstraction of data required on average 15 min per chart, with a range of 10–35 min, depending on the extent of data available in the charts.

Among 1159 (96.7%) events with information required to define the NIHSS total score, median (25–75%) NIHSS score was 5 (2–8). Definition of the exact NIHSS total score was not possible for 39 events; however, in 3 out of these 39 events, charts included information that allowed to define severity by NIHSS category. Therefore, severity group was determined for 1162 (97%) events. The distribution of NIHSS categories was NIHSS ≤ 5 = 58.3% (n = 677), NIHSS 6–10 = 24.5% (n = 285), NIHSS 11–15 = 8.9% (n = 103), NIHSS 16–20 = 4.7% (n = 55), NIHSS > 20 = 3.6% (n = 42). To note, determination of severity category was not possible in 36 ischemic stroke events due to absence of relevant notes or very scarce information in the charts.

The participant characteristics associated with stroke events included in the interrater agreement sample were similar to those excluded from the interrater agreement sample with the exception of diabetes (21.9% versus 31.5%, p = 0.01) (Table 1). Mean age at stroke was higher in the interrater agreement sample (72.9 years versus 68.0 years, p < 0.001).

Table 1 Baseline characteristics of ARIC participants with ischemic stroke, overall and by inclusion in the inter-rater reliability assessment.

Full size table

The distributions of NIHSS categories by rater, percent agreement and Cohen’s Kappa statistic are displayed in Table 2. Overall agreement in the classification of severity by NIHSS category was present in 145/180 events (80.56%). Symmetry test was non-significant (p = 0.79) suggesting that the two raters who were reviewing the same chart had the same propensity to select higher and lower NIHSS categories. Cohen’s simple Kappa statistic (95% CI) was 0.64 (0.55–0.74) and weighted Kappa was 0.79 (0.72–0.86).

Table 2 Distribution of NIHSS category by rater and inter-rater agreement measures, N = 180.

Full size table

Mean (SD) NIHSS score was 5.84 (5.88), with a median score of 4 and range 0–31 for the lead reviewer (rater 1) and mean (SD) 6.16 (6.10), median 4.5 and range 0–36 in the second independent assessment, done by one of the four secondary reviewers. Agreement between raters in NIHSS score is shown in Fig. 1. The correlation (Pearson r) between the scores reported by the lead reviewer and the second reviewers was 0.90.

Discussion

Hospital chart-based retrospective assessment of stroke severity using the NIHSS is feasible and reliable. Using data from hospitalization charts of ischemic stroke events occurring in ARIC starting in 1987–1989, we were able to apply the retrospective algorithm developed by Williams et al. in 2000¹³ to categorize severity of ischemic stroke on admission, both in incident and recurrent events. Assessment of severity was feasible despite the variability in the extent of available data collected on stroke events occurring in ARIC over a period of 20 years.

The NIHSS is the most widely used scale for the assessment of stroke severity. It has become the “gold standard” for stroke severity rating in clinical trials⁹ on prevention, acute treatment and recovery after stroke. However, data in clinical trials and in observational studies differ; therefore, our findings contribute to the generalizability of retrospective evaluation of NIHSS. Although NIHSS is sometimes used to categorize stroke severity both in ischemic stroke and intracerebral hemorrhages, and has been reported reliable for prediction of outcome in intracerebral hemorrhages¹⁶, its accuracy is considered higher for evaluation of severity in ischemic compared with hemorrhagic strokes, therefore the present paper focuses on severity of ischemic stroke in ARIC.

Our findings show that the distribution of ischemic stroke severity in ARIC is skewed toward less severe strokes, with a median (25%-75%) NIHSS score of 5 (2–8). These findings are consistent with data from the 2016 National Inpatient Sample regarding NIHSS reporting in claims from the first quarter of implementation of optional reporting¹⁷, as well as previous findings from large clinical registries and population based studies^18,19.

The importance of collecting NIHSS prospectively both in research and routine clinical practice is well recognized; however, our findings showing that chart-based assessment of NIHSS is feasible and reliable are useful for the majority of large epidemiologic studies on stroke that have not routinely collected data on NIHSS. The mostly qualitative information available in such studies, can be abstracted and translated into the most commonly used scale for quantitative evaluation of severity of stroke. Since severity is the most important predictor of stroke outcome, data on severity of stroke are important and useful for a wide range of studies looking at short- and long-term physical, cognitive, mental and social outcomes of stroke. The protocols for the study are available and already being adopted by a large dementia consortium.

Conclusion

Hospital chart-based assessment of stroke severity using the NIHSS is feasible and reliable. Data on stroke severity are essential to accurately assess short- and long-term outcomes of stroke. Our findings are important and useful for a wide range of studies on stroke.

Data availability

The data that support the findings of this study are available from the corresponding author upon reasonable request. The protocols for the study are available and already being adopted by a large dementia consortium.

Abbreviations

ARIC:: Atherosclerosis Risk in Communities
CI:: Confidence interval
NIHSS:: National Institutes of Health Stroke Scale
REDCap:: Research electronic data capture
SD:: Standard deviation
US:: United States

References

Rost, N. S. et al. Stroke severity is a crucial predictor of outcome: an international prospective validation study. J. Am. Heart Assoc.: Cardiovasc. Cerebrovasc. Dis. 5(1), e002433 (2016).
Article Google Scholar
Gaughan, J., Gravelle, H., Santos, R. & Siciliani, L. Long-term care provision, hospital bed blocking, and discharge destination for hip fracture and stroke patients. Int. J. Health Econ. Manag. https://doi.org/10.1007/s10754-017-9214-z (2017).
Article PubMed PubMed Central Google Scholar
Johnston, K. C., Connors, A. F. Jr., Wagner, D. P. & Haley, E. C. Jr. Predicting outcome in ischemic stroke: external validation of predictive risk models. Stroke 34(1), 200–202 (2003).
Article Google Scholar
Konig, I. R. et al. Predicting long-term outcome after acute ischemic stroke: a simple index works in patients from controlled clinical trials. Stroke 39(6), 1821–1826 (2008).
Article Google Scholar
Koton, S., Telman, G., Kimiagar, I. & Tanne, D. NASIS Investigators Gender differences in characteristics, management and outcome at discharge and three months after stroke in a national acute stroke registry. Int. J. Cardiol. 168(4), 4081–4084 (2013).
Article Google Scholar
Saver, J. L. & Altman, H. Relationship between neurologic deficit severity and final functional outcome shifts and strengthens during first hours after onset. Stroke 43(6), 1537–1541 (2012).
Article Google Scholar
Wright, J. D. et al. The ARIC (Atherosclerosis Risk In Communities) study: JACC focus seminar 3/8. J. Am. Coll. Cardiol. 77(23), 2939–2959 (2021).
Article Google Scholar
National institute of neurological disorders and stroke rt-PA stroke study group. Tissue plasminogen activator for acute ischemic stroke. N. Engl. J. Med. 333(24), 1581–1587 (1995).
Article Google Scholar
Lyden, P. Using the national institutes of health stroke scale: A cautionary tale. Stroke 48(2), 513–519 (2017).
Article Google Scholar
Koton, S. et al. Stroke incidence and mortality trends in US communities, 1987 to 2011. JAMA 312(3), 259–268 (2014).
Article CAS Google Scholar
Rosamond, W. D. et al. Stroke incidence and survival among middle-aged adults: 9-year follow-up of the Atherosclerosis Risk in Communities (ARIC) cohort. Stroke 30(4), 736–743 (1999).
Article CAS Google Scholar
Lindsell, C. J. et al. Validity of a retrospective National Institutes of Health Stroke Scale scoring methodology in patients with severe stroke. J. Stroke Cerebrovasc. Dis.: Off. J. Natl. Stroke Assoc. 14(6), 281–283 (2005).
Article Google Scholar
Williams, L. S., Yilmaz, E. Y. & Lopez-Yunez, A. M. Retrospective assessment of initial stroke severity with the NIH stroke scale. Stroke 31(4), 858–862 (2000).
Article CAS Google Scholar
Harris, P. A. et al. The REDCap consortium: Building an international community of software platform partners. J. Biomed. Inform. 95, 103208 (2019).
Article Google Scholar
Harris, P. A. et al. Research electronic data capture (REDCap)–a metadata-driven methodology and workflow process for providing translational research informatics support. J. Biomed. Inform. 42(2), 377–381 (2009).
Article Google Scholar
Finocchi, C. et al. National institutes of health stroke scale in patients with primary intracerebral hemorrhage. Neurol. Sci. 39(10), 1751–1755 (2018).
Article Google Scholar
Saber, H. & Saver, J. L. Distributional validity and prognostic power of the national institutes of health stroke scale in US administrative claims data. JAMA Neurol. 77(5), 606–612 (2020).
Article Google Scholar
Reeves, M. et al. Distribution of national institutes of health stroke scale in the Cincinnati/northern Kentucky stroke study. Stroke 44(11), 3211–3213 (2013).
Article Google Scholar
Sacco, R. L., Shi, T., Zamanillo, M. C. & Kargman, D. E. Predictors of mortality and recurrence after hospitalized cerebral infarction in an urban community: The northern Manhattan stroke study. Neurology 44(4), 626–634 (1994).
Article CAS Google Scholar

Download references

Acknowledgements

The authors thank the staff and participants of the ARIC study for their important contributions.

Disclaimer

This article was prepared partially while Dr. Rebecca Gottesman was employed at the Johns Hopkins University School of Medicine. The opinions expressed in this article are the author’s own and do not reflect the view of the National Institutes of Health, the Department of Health and Human Services, or the United States Government.

Funding

The Atherosclerosis Risk in Communities Study is carried out as a collaborative study supported by National Heart, Lung, and Blood Institute contracts (HHSN268201700001I, HHSN268201700002I, HHSN268201700003I, HHSN268201700004I, HHSN268201700005I). Neurocognitive data is collected by U01 2U01HL096812, 2U01HL096814, 2U01HL096899, 2U01HL096902, 2U01HL096917 from the NIH (NHLBI, NINDS, NIA and NIDCD), and with previous brain MRI examinations funded by R01-HL70825 from the NHLBI. RFG was supported by K24 AG052573 when she was employed by Johns Hopkins University and is currently supported by the NINDS Intramural Research Program. MCJ is supported by K23 NS112459.

Author information

Authors and Affiliations

Sackler Faculty of Medicine, The Herczeg Institute on Aging and The Stanley Steyer School of Health Professions, Tel Aviv University, 6997801, Tel Aviv, Israel
Silvia Koton
Department of Epidemiology, Johns Hopkins University School of Public Health, Baltimore, MD, USA
Silvia Koton, Shalom Patole & Josef Coresh
Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
Julia M. Carlson
Department of Neurology, Virginia Commonwealth University, Richmond, VA, USA
Taylor Haight
Department of Neurology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Michelle Johansen
Department of Neurology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Andrea L. C. Schneider
Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
James Russell Pike
Stroke Branch, National Institute of Neurological Disorders and Stroke Intramural Research Program, NIH, Bethesda, MD, USA
Rebecca F. Gottesman

Authors

Silvia Koton
View author publications
You can also search for this author in PubMed Google Scholar
Shalom Patole
View author publications
You can also search for this author in PubMed Google Scholar
Julia M. Carlson
View author publications
You can also search for this author in PubMed Google Scholar
Taylor Haight
View author publications
You can also search for this author in PubMed Google Scholar
Michelle Johansen
View author publications
You can also search for this author in PubMed Google Scholar
Andrea L. C. Schneider
View author publications
You can also search for this author in PubMed Google Scholar
James Russell Pike
View author publications
You can also search for this author in PubMed Google Scholar
Rebecca F. Gottesman
View author publications
You can also search for this author in PubMed Google Scholar
Josef Coresh
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.K., R.F.G., J.C. conceived the study. S.P., J.M.C., T.H., M.J. and A.L.C.S. reviewed the charts and collected data on stroke severity. S.K. organized and supervised chart-based data collection, routinely assessed the completeness and quality of data collected on stroke severity. J.R.P. and S.K. conducted the analysis, and R.F.G., J.C. contributed to review of results. S.K. drafted the manuscript; all other authors reviewed it and approved the final manuscript.

Corresponding author

Correspondence to Silvia Koton.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Koton, S., Patole, S., Carlson, J.M. et al. Methods for stroke severity assessment by chart review in the Atherosclerosis Risk in Communities study. Sci Rep 12, 12338 (2022). https://doi.org/10.1038/s41598-022-16522-7

Download citation

Received: 04 March 2022
Accepted: 11 July 2022
Published: 19 July 2022
DOI: https://doi.org/10.1038/s41598-022-16522-7

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.