Introduction

Understanding the scale and natural history of long-COVID is essential to planning health and social care. The majority of studies report the prevalence of long-COVID at a single timepoint post-infection1,2,3,4,5,6,7, with some adjusting for pre-existing symptoms8,9. Less is known about changes in long-COVID over time. Studies with serial outcome measurements have been restricted to selected groups (e.g. hospitalised patients10,11,12,13, older patients14, or veterans with break-through infections15) or specific (e.g. mental health) outcomes16, or have lacked a comparison group making it difficult to distinguish persistent or late-onset symptoms of long-COVID from symptoms that would have occurred anyway in the absence of SARS-CoV-2 infection11,12,13,17,18.

Hospital cohorts have variously reported no change10,11,13, improvement10,11, and deterioration over time10,14. In one cohort of 807 people, there was no change in the proportion reporting full recovery between 5 months and 1 year post discharge13. A study of 61 subjects reported no change in quality of life, but improvement in 6-min walking test distance11. A meta-analysis of seven cohort studies included 2883 people with repeat measures, in whom the prevalence of depression declined over follow-up and was not significantly different to the comparison group beyond 2 months16.

Some studies have highlighted the possibility of late-onset sequelae. In an ambidirectional cohort study, conducted on 1276 patients, the proportion reporting at least one symptom decreased from 68% at 6-month follow-up to 49% at 12 months10. However, both breathlessness (26% to 30%) and anxiety/depression (23% to 26%) increased over time. Three- to six-month follow-up of veterans with break-through SARS-CoV-2 infections (infections despite vaccination) revealed increased risk of new (HR 1.13, 95% CI 1.07–1.20) as well as persistent (HR 1.90, 95% CI 1.77–2.04) symptoms15. In a retrospective cohort study using linked electronic health records, 37% of people had at least one of nine long-COVID features 3–6 months after SARS-CoV-2 infection19. Of these, 40% had not had these features in the first 3 months of follow-up. A hospital cohort of 1438 patients, 60 years of age or older, reported increased risk of progressive and late onset cognitive decline at 12-months follow-up among severe cases14. In an online survey of 3762 participants who had previous suspected or confirmed SARS-CoV-2 infection, 86% reported re-occurrence of symptoms over time17.

Therefore, while long-COVID may be a stable condition in some, existing evidence suggests that others may experience recovery, relapse, or progression. We use serial questionnaire data from the long-COVID in Scotland Study (Long-CISS)20 to investigate the natural history of long-COVID in an unselected, general population cohort with laboratory-confirmed SARS-CoV-2 infection compared with symptoms in an age-, sex-, and socioeconomically-matched group of people who have never been infected.

Results

Of the 4,049,590 questionnaires sent out, 345,673 (9%) were completed by 288,173 unique individuals, of whom 257,341 (89%) consented to record linkage, required to obtain their test result (Fig. 1). Following linkage, 53,530 were excluded because they reported a previous positive test that was not recorded on the database, 5687 because they had asymptomatic infections, and 37,343 because they were recruited beyond 6 months follow-up. Of the remaining 160,781 individuals, 80,332 (50%) had previous symptomatic, laboratory-confirmed SARS-CoV-2 infection and 80,449 (50%) had never had a positive test for SARS-CoV2 infection. Of the 80,332 people who had previous symptomatic infections, 12,947 have so far completed questionnaires at both 6- and 12-month follow-up and 4196 have completed questionnaires at both 6- and 18-month follow-up. The corresponding figures for the 80,449 individuals never infected were 11,026 and 1711, respectively. The sample size was 23,973 (12,947 symptomatic infected and 11,026 never infected) for the primary analysis of 6- and 12-month follow-up. The index test dates of included participants ranged from 20 April 2020 to 30 November 2021.

Fig. 1: Participant flow diagram.
figure 1

PCR Polymerase Chain Reaction.

Changes in recovery status

Six months following SARS-CoV-2 infection, 6407 (49.5%) people reported being fully recovered, 5649 (43.6%) partially and 891 (6.9%) not recovered. At 12-month follow-up, the figures were 6575 (50.8%), 5412 (41.8%) and 960 (7.4%), respectively (χ2 trend, p = 0.323). Forty-one percent of people reported full recovery at both 6- and 12-month follow-up, 35% reported persistent incomplete/no recovery, 12% reported improvement and 12% deterioration (Table 1). Between 6 and 18 months, the figures were 36%, 36%, 14% and 15%, respectively.

Table 1 Trajectories of recovery status following symptomatic SARS-CoV-2 infection

Of those who felt partially recovered at 6 months, 1179/5649 (21%) had improved by 12 months and 421/1934 (22%) by 18 months and, of the 891 people not recovered at 6 months, 404 (45%) had some degree of improvement by 12 months, and 28 (3%) had fully recovered. Of those who felt partially recovered at 6 months, 458/5649 (8%) reported deterioration at 12 months and 189/1934 (10%) at 18 months. In addition, of 6407 who reported being fully recovered at 6 months, 1039 (16%) reported deterioration by 12 months.

Depression prior to SARS-CoV-2 infection and socioeconomic deprivation were more common among people who reported deterioration in recovery status between 6 and 12 months (Table 2). Similar patterns were observed comparing 6- and 18-month follow-up (Supplementary Table 1). Among those not fully recovered at 6 months, improvement at 12 months was less likely among older people and those with depression prior to COVID-19 and more likely among the most affluent, after adjusting for potential confounders (Table 3; unadjusted odds ratios in Supplementary Table 2). Among those who reported full or partial recovery at 6 months, deterioration at 12 months was less likely among older people and the most affluent and more likely among people with prior depression (Table 3). The associations were not statistically significant comparing 6- and 18-month follow-up (Supplementary Table 3; unadjusted odds ratios in Supplementary Table 4).

Table 2 Characteristics of participants by infection status and recovery status trajectory between 6 and 12 months
Table 3 Binary logistic regression of factors associated with improvement and deterioration in recovery status between 6 and 12 months

Changes in symptoms

The percentage who reported at least one of the 26 symptoms did not change between 6- and 12-month (72% versus 71%, respectively), and 6- and 18-month (73% versus 74%, respectively) follow-up, among people with previous symptomatic SARS-CoV-2 infection but increased significantly among those never infected (54% versus 57%, and 52% versus 55%, respectively; Table 4). Previously symptomatic participants had a higher prevalence of new and persistent symptoms than those never infected, at both 12- and 18-month follow-up compared with 6-month follow-up (Supplementary Fig. 2).

Table 4 Prevalence of symptoms reported at 6 versus 12 months follow-up, and 6 versus 18 months follow-up, among people who had symptomatic SARS-CoV-2 infection and those never infected

The prevalence of confusion and altered taste and smell decreased significantly between 6 and 12 months after SARS-CoV-2 infection contrasting with no significant change in confusion and altered smell, and an increase in altered taste, among those never infected (Table 4). The reductions were significant compared to those never infected after adjusting for potential confounders (Table 5; unadjusted odds ratios in Supplementary Table 5).

Table 5 Binary logistic regression models of the factors associated with symptoms at 12 months adjusted for symptoms at 6 months

Reduced prevalence of altered taste/smell and confusion was specific to those who reported an improvement in their recovery status following SARS-CoV-2 infection (Supplementary Table 6). The prevalence of confusion 6 months following symptomatic SARS-CoV-2 infection was significantly higher among those with a history of depression or anxiety than those without (1090/5839 (18.7%) versus 780/7108 (11.0%); p < 0.001) and improvement in confusion between 6- and 12-months was less likely among people with pre-existing depression or anxiety (Table 5).

People with previous symptomatic SARS-CoV-2 infection reported significant increases in the prevalence of both dry and productive cough between 6- and 12-month follow-up (Table 4). However, these symptoms were also reported more frequently over time in the never infected group. The increased prevalence of both dry and productive cough remained significantly higher among those previously infected than those never infected, after adjusting for confounders (Table 5). The factors associated with increased prevalence of dry cough were younger age, more pre-existing long-term conditions, and specifically pre-existing depression/anxiety (secondary effect estimates; Table 5). Increased prevalence of productive cough was associated with male sex and pre-existing respiratory disease (secondary effect estimates; Table 5). Following SARS-CoV-2 infection, late onset cough was specific to those who reported deterioration in their recovery status (Supplementary Table 6).

Increases in the prevalence of hearing problems between 6- and 12-month follow-up were reported by both those with previous symptomatic SARS-CoV-2 infection and those never infected (Table 4). After adjustment for confounders, the increased prevalence of hearing problems was significantly higher among those previously infected than those never infected (Table 5). Other factors associated with late onset hearing problems were socioeconomic deprivation, SARS-CoV-2 infection severity, and more pre-existing long-term conditions and specifically depression/anxiety (secondary effect estimates; Table 5).

Between 6- and 18-months follow-up, increased prevalence of dry cough, productive cough and hearing problems were all significant compared to those never infected after adjusting for potential confounders (Supplementary Table 7; unadjusted odds ratios in Supplementary Table 8).

Changes in quality of life

Following symptomatic SARS-CoV-2 infection, median EQ-5D VAS score decreased slightly from 75 (IQR 55–86) at 6 months to 74 (IQR 53–85) at 12 months (p < 0.001). However, it also fell among those never infected, from 80 (IQR 64–90) to 77 (IQR 61–90) (p < 0.001). In the fully adjusted Poisson regression model, symptomatic infection was associated with a larger fall in EQ-5D VAS score compared with those never infected (IRR 0.98, 95% CI 0.98–0.98).

Discussion

This study reports the trajectory of long-COVID in the general population compared to contemporaneous changes in symptoms and quality of life in a comparison group that had never been infected. Beyond 6 months following SARS-CoV-2 infection, there was no significant overall change in either self-reported recovery status or the percentage of people reporting at least one symptom known to be associated with previous SARS-CoV-2 infection20. However, 12% of people reported improvements in their recovery status, and 12% reported deterioration. These different trajectories were driven by different symptoms. In some people, altered taste, smell and confusion (‘brain fog’) resolved over time whereas others reported late onset dry or productive cough and hearing problems. These changes were not explained by underlying trends or confounding. Our findings demonstrate the importance of exploring individual symptoms rather than only grouping them together as a composite outcome.

Our analyses of serial outcomes corroborated our previous finding of late onset cough20, and identified a new finding of late onset hearing problems. Respiratory impairment following COVID-19 is well-recognised. A meta-analysis of 15 studies that followed-up 3066 patients hospitalised for SARS-CoV-2 infection reported that 56% had residual lung CT abnormalities and 44% had abnormal lung function tests: 35% impaired diffusion, 16% restrictive impairment and 8% obstructive impairment21. Systematic reviews had demonstrated that sudden sensorineural hearing loss occurring during acute SARS-CoV-2 infection can persist22. Proposed mechanisms include a direct effect of viral invasion via ACE2 receptors located in the ear23, indirect effects via hypoxia, immune-mediated damage or coagulative disorders24, and ototoxic medications used to treat COVID-1922. The prevalence of several common long-COVID symptoms, for example fatigue, muscle aches/weakness, headache, and anxiety/depression, remained stable within previously infected individuals over time. This has implications for individuals living with long-COVID and for clinical practice.

Socioeconomic deprivation and depression are known to be associated with development of long-COVID4,19,20. Our findings elaborate by showing both to also be associated with reduced risk of improvement over time and increased risk of deterioration. Biological mechanisms may partly explain these observations. The relationship between depression and inflammation is bidirectional25,26,27. Novel immune therapeutic targets are being investigated for the treatment of depression28, and there is a well-documented link between acute or chronic psychological stress and immune markers29. The stress resulting from socioeconomic deprivation has been linked to changes in immune response and wider detrimental health effects30,31. However, in this study, socioeconomic differences in self-reported recovery status were not corroborated by different changes in specific symptoms over time, other than a weak association with hearing problems. Therefore, it is also plausible that more deprived groups have less capacity to adapt their lives to ongoing health problems or poorer access to support.

In our study, 70.7% of previously infected people who provided 12-month follow-up data had at least one symptom at 12 months following SARS-CoV-2 infection. Estimates from previous longitudinal studies ranged from 28%12 to 77%18. Our 12-month prevalence rates of specific symptoms were comparable to those reported in a random-effects meta-analysis of 18 studies2: fatigue/weakness (28% versus 28%), dyspnoea (24% versus 18%), arthromyalgia (24% versus 26%), depression (21% versus 23%), and concentration difficulties or confusion (15% versus 18%). However, the high prevalence of these symptoms among people never infected reinforces the importance of a comparison group, as does the increase over time in the prevalence of at least one symptom in this group (from 54% to 57% between 6- and 12-month follow-up, and from 52% to 55% between 6- and 18-month follow-up).

The major strengths of our study included national, non-selective coverage, self-reported plus laboratory-confirmed SARS-CoV-2 infection status, and inclusion of a comparison group, never infected over the same period of the pandemic. Other studies have relied on historic controls or controls sampled earlier in the pandemic16. Recovery status, ongoing symptoms and quality of life were not subject to recall bias because participants reported them at the time of completing questionnaires. Follow-up to 18 months is longer than previously reported. While every adult in Scotland with a positive SARS-CoV-2 PCR test was invited to take part in the study, participation was voluntary, so response bias is possible.

A study limitation was that differential attrition by exposure could not be assessed within the permissions of the data sharing agreement because the researchers could not identify the individuals who moved from the negative to infected group. Furthermore, it is possible that some individuals in the comparison group had SARS-CoV-2 infection that was not detected by a PCR test. This risk was reduced by excluding from the analyses 53,530 participants who had only negative PCR tests recorded but who reported that they had had SARS-CoV-2 infection. However, classification error due to undiagnosed, asymptomatic infection remains. A further limitation, associated with any observational study, is residual confounding due to unknown or unmeasured confounders.

In conclusion, while long-COVID appeared to be a stable condition in many, both improvement and deterioration occurred in others. Improvements in altered taste, smell and confusion were reassuring. In contrast, the findings of late-onset cough and hearing problems one year following infection, that could not be explained by background trends or confounding, merit further investigation.

Methods

Study design and participants

The Long-COVID in Scotland Study (Long-CISS) is an ambidirectional, general population cohort. The National Health Service Scotland notification platform for SARS-CoV-2 PCR results was used to identify eligible participants and invite them via automated SMS text messages. Every adult (>16 years) in Scotland with a positive PCR test from April 2020 was invited along with a comparison group who had had a negative test but never a positive test (hereafter referred to as never infected), matched by age, sex, deprivation quintile, and time period of test20. People in the latter group were reallocated to the infected group if, and when, they had a positive test. The study commenced in May 2021 and recruited both retrospectively and prospectively based on existing and new test results, respectively. Participants provided electronic consent and study approval was obtained from the West of Scotland Research Ethics Committee (ref. 21/WS/0020) and Public Benefit and Privacy Panel (ref. 2021-0180). The ethics committee deemed that those under 18 (and over 16) years could provide informed consent.

An online questionnaire (Supplementary Fig. 1), self-completed 6, 12 and 18 months after the index test (first positive test or, for comparison group, most recent negative test), collected information on pre-existing health conditions and 26 current symptoms (harmonised with the ISARIC questionnaire)32 and health-related quality of life using the EuroQoL-5D visual analogue scale (EQ-5D VAS) score. Respondents who had tested positive also self-assessed their current recovery status (fully, partially or not recovered). The questionnaire data were linked retrospectively to the PCR test, vaccination, hospital admissions (Scottish Morbidity Record; SMR01 and SMR04) and dispensed prescriptions (Prescribing Information System; PIS) databases.

Exclusion and inclusion criteria

Only people who completed a 6-month follow-up questionnaire plus at least one subsequent questionnaire (12-month, 18-month or both) were included. We excluded people who had asymptomatic SARS-CoV-2 infection, usually detected during occupational or travel-related screening, and those who reported a positive test not recorded on the database, as we could not corroborate the accuracy and date of tests performed outside Scotland.

Definitions

Linkage to the test database provided date and result of the index PCR test plus age, sex and postcode of residence. The latter was used to derive the Scottish Index of Multiple Deprivation (SIMD) from aggregated data on: income, employment, education, health, access to services, crime and housing33. Severe infection was defined as hospital admission with an International Classification of Diseases v10 (ICD-10) code U07.1 between 1 day prior to the index test and 2 weeks after. Vaccination status (0, 1 or ≥2 doses) at the time of the index test was obtained via linkage to the vaccination database. SARS-CoV-2 variants were defined as dominant if they accounted for ≥95% of cases genotyped that week (https://sars2.cvr.gla.ac.uk/cog-uk/). Ethnic group was self-reported using the questionnaire. Pre-existing health conditions were ascertained from self-report using the questionnaire, as well as linkage to previous hospitalizations and dispensed prescriptions. Respiratory disease was defined as ICD10 codes J40-J47, J98.2 or J98.3, or bronchodilators, inhaled corticosteroids, cromoglycate, leukotriene or phosphodiesterase type-4 inhibitor (British National Formulary (BNF) 3.1-3.3), or self-report. Coronary heart disease was defined as ICD10 codes I11.0, I13.0, I13.2, I20-I25 (excluding I24.1), I50, T82.2, or Z95.5, or self-report. Depression was defined as ICD10 codes F30-F33, or anti-depressant, hypnotic or anxiolytic use (BNF 4.1;4.3), or self-report. Diabetes was defined as ICD10 codes E10-E14, G590, G632, H280, H360, M142, N083, O240-O243 or self-report34. Total number of self-reported health conditions was categorised as 0, 1, 2–3 or ≥4.

The outcomes measured were changes in self-reported recovery status following symptomatic SARS-CoV-2 infection, and changes in 26 individual symptoms and quality of life score compared with those never infected. Improvement in recovery status was defined as change from no recovery to partial/full recovery, or from partial recovery to full recovery. Deterioration was defined as change from full recovery to partial/no recovery or from partial recovery to no recovery.

Statistical analyses

Participant characteristics were summarized using frequencies/percentages and medians/inter-quartile ranges for categorical and continuous variables and compared using χ2 and Mann–Whitney U tests, respectively. The analyses were first conducted comparing 6- and 12-month follow-up, then repeated comparing 6 and 18 months. Separate binary logistic regression models were used to determine the factors associated with improvement and deterioration in recovery status over time following symptomatic infection; univariately then adjusted for covariates (age, sex, deprivation quintile, ethnic group, individual and total number of long-term conditions, vaccination status, and dominant variant).

Change in the prevalence of the 26 individual symptoms from 6 to 12 months was compared in those with previous, symptomatic SARS-CoV-2 infection and those never infected using McNemar’s tests. Separate binary logistic regression models were run for the presence of each symptom at 12 months. The models were adjusted for whether the person had been infected or never infected, whether the symptom was present at 6 months, as well as the confounders listed above. The analyses were repeated for the change in prevalence of individual symptoms from 6 to 18 months.

Median EQ-5D VAS score was calculated at 6 and 12 months for the infected and never infected groups. A Poisson regression model was run for EQ-5D VAS score at 12 months, adjusting for whether the person had been infected or never infected, the score at 6 months, and the confounders listed above. All analyses were performed using Stata v16. All statistical tests were two-tailed.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.