Introduction

The UK has experienced one of the largest epidemics of COVID-19 (symptomatic SARS-CoV-2 infection) in Europe1. As well as the risk of hospitalisation and death from COVID-19 some people have a prolonged and debilitating illness that may continue for weeks or months (Long COVID or post-COVID syndrome)2,3.

Estimates of symptom persistence following COVID-19 vary substantially, arguably due to heterogeneous study designs, study settings, follow-up periods and definitions. A recent meta-analysis of 194 studies with follow-up data from participants with confirmed or self-reported COVID-19 symptoms estimated a pooled prevalence of persistent symptoms at a mean follow-up of 126 days amongst hospitalised COVID-19 patients of 52.6% [95% CI 43.5, 61.6]. The most common persistent symptoms were fatigue, general pain or discomfort, impaired sleep, breathlessness, and impaired usual activity4. In the non-hospitalised group, the pooled prevalence was 34.5% [95% CI 21.9, 49.7]. The most frequent persistent symptoms were fatigue, breathlessness, muscle pain or myalgia, affected sleep and loss of sense of smell4. Most existing studies are based on small sample size, unrepresentative study populations and low response rate, so estimates are unlikely to be representative of the general population4.

Our own initial estimates from the REal-time Assessment of Community Transmission-2 (REACT-2) study, six rounds of repeat cross-sectional random samples of the population to evaluate community prevalence of SARS-CoV-2 anti-spike protein antibody positivity in England5, suggested that 21.6% of adults with evidence of prior infection experienced one or more symptoms 12 weeks after their initial illness6. However, this study lacked a negative control group. Recently, the Long-COVID in Scotland study (Long-CISS), a nationwide study including people with severe, mild and asymptomatic infections and a never-infected comparison group, found that 8% of symptomatic participants had not recovered by 6 or 12 months7. Participants with previous symptomatic infection were more likely to self-report 24 (of the 26 surveyed) persistent symptoms than people never infected. The largest effect sizes were observed for changes in taste and smell, breathlessness, chest pain, palpitations, and confusion. Similarly, a nationwide population cohort study in the Netherlands (Lifelines COVID-19) with COVID-19-positive cases and matched negative controls concluded that core symptoms of Long COVID were chest pain, difficulties with breathing, lump in throat, pain when breathing, painful muscles, heavy arms or legs, ageusia or anosmia, feeling hot and cold alternately, tingling extremities, and general tiredness8.

The REACT programme is one of the world’s largest and most comprehensive coronavirus monitoring studies. In addition to REACT-2 described above, the REACT-1 study included 19 rounds of cross-sectional random samples of the population to track community SARS-CoV-2 infection with PCR tests9. The REACT programme is able to identify individuals with persistent symptoms who have not been hospitalised and to compare them with people whose symptoms have resolved and those who have never had COVID-19. In this study, we use data from a follow-up survey of REACT participants to describe the duration of symptoms in people with a history of symptomatic infection, assess factors associated with symptom persistence beyond 12 weeks and with recovery after that point. We also compare current self-reported health and quality of life and specific symptoms for those with persistent symptoms to those who have never had COVID-19 or have recovered.

Results

Overview of study population

Of the 800,000 REACT participants who were sent invitations between 1 August–1 December 2022, 282,780 (35·3%) registered for the study, of whom 276,840 (97·9%) completed the questionnaire. Differential non-response was observed by sociodemographic characteristics, including age, sex, ethnicity, and deprivation (Supplementary Table 1).

Of those who completed the questionnaire, 266,854/276,840 (96·4%) reported whether they had a confirmed SARS-CoV-2 infection. In total, 157,668/266,854 (59·1%) participants had tested positive for SARS-CoV-2. Overall, 24,142 respondents were excluded because COVID-19 episode date was within 12 weeks of their survey completion date (Supplementary Fig. 1). Table 1 shows the key sociodemographic and COVID-19 characteristics of all participants included in the study.

Table 1 Comparison of participant sociodemographic and COVID-19 characteristics by COVID-19 history (n = 242,712)1

Factors associated with persistent symptoms following COVID-19 and recovery

Of the 133,526 people who reported at least one episode of test-confirmed SARS-CoV-2 infection, 3275 reported no symptoms, and 130,251 reported symptomatic COVID-19. In these 130,251 participants, the median duration of reported symptoms was 1.3 weeks, (mean 5·4 weeks, range 1 day to 3·0 years, IQR 6 days to 2 weeks), with 10·2%, 7·5%, and 5·2% reporting ongoing symptoms beyond 4 weeks, 12 weeks and 52 weeks, respectively.

Figure 1 shows the Kaplan Meier Survival Curve of time to symptom end date in those self-reporting symptomatic SARS-CoV-2 infection, overall and by dominant variant at the time of infection. Figure 2 shows factors associated with having symptoms following COVID-19 persisting for ≥12 weeks (“Long COVID”, LC) and ≥52 weeks (“very Long COVID”, VLC) compared to those who were asymptomatic or whose symptoms resolved within 4 weeks (Supplementary Table 2). In mutually adjusted logistic regression models, LC and VLC were both associated with being female compared to male (adjusted odds ratio [aOR] 1·42 [95% CI 1·35, 1·50] and 1·49 [1·38, 1·62] for LC and VLC respectively), having ≥1 comorbidities compared to no comorbidities (aOR 1·31 [1·19, 1·44], 1·52 [1·31, 1·76] respectively for 1, and 1·46 [1·27, 1·75], 2·35 [1·85, 2·97] for ≥2 comorbidities), and having had moderate or severe symptoms compared to mild at the time of infection (aOR 1·76 [1·63, 1·89], 1·47 [1·32, 1·64] respectively for moderate and 4·87 [4·52, 5·25], 3·55 [3·19, 3·96] for severe). The odds of LC and VLC were lower in people of Asian than white ethnicity (aOR 0·80 [0·69, 0·93], 0·71 [0·57, 0·88]), and in people infected at a time when Alpha (aOR 0·60 [0·56, 0·64], 0·59 [0·54, 0·64]), Delta (OR 0·38 [0·35, 0·41], 0·32 [0·29, 0·36]) and Omicron (OR 0·12 [0·11, 0·13] for LC, insufficient follow-up time for VLC) were dominant compared to Wild-type. In a sensitivity analysis to test whether the lower risk with more recent variants was due to unmeasured time-varying factors we restricted the comparison to cases close to the transition period and also found a reduction in risk for Alpha compared to Wild-type and Omicron compared to Delta (Supplementary Table 3).

Fig. 1: Kaplan–Meier survival curve of time to symptom end date in those self-reporting symptomatic SARS-CoV-2 infection, overall and by dominant variant at the time of infection.
figure 1

The curve shows the probability that a participant continues to have symptoms beyond time t. Participants infected at a time when Wild-type (blue) was dominant had a higher probability of symptoms continuing beyond time t compared to Alpha (red), Delta (grey) and Omicron (orange). The number at risk table below the curve shows, overall and by dominant variant at the time of infection, the number at risk at any specific time point. This is equal to the total number of participants remaining in the study including any individuals who experience the event of interest (symptom end date) or participants who are censored at this time point. The unit of time is “weeks,” so the number at risk is those participants who have not yet experienced the event of interest or been censored at the beginning of the week (before any event or censoring could occur). The number of participants who recovered within the previous time window are shown below the number at risk table. This is equal to the total number of participants with a symptom end date within the time window ending at this time point.

Fig. 2
figure 2

Factors associated with persistent symptoms following COVID-19 lasting i) ≥12 weeks and ii) ≥52 weeks versus those who reported being asymptomatic or symptoms resolved within 4 weeks. Logistic regression models with one or more COVID-19 symptoms lasting ≥12 weeks (y/n) or ≥52 weeks (y/n) as the binary outcome variables. Modelling of persistent symptoms as a function of biological and demographic variables. In the forest plot, data were presented as adjusted odds ratios (central dot) and 95% confidence intervals (bars). Adjusted odds ratios compare participants with persistent symptoms lasting i) ≥12 weeks (black) or ii) ≥52 weeks (blue) with those who reported being asymptomatic or symptoms resolved within 4 weeks (n = 126,016 participants for ≥12 weeks analysis / n = 121,142 participants for ≥52 weeks analysis). Mutually adjusted for age, sex, ethnicity, IMD, comorbidities, smoking status, severity of initial infection, dominant variant at the time of infection, and vaccination status. Data used: Supplementary Table 2.

There was also a gradient of reducing odds with lower deprivation (Fig. 2). There was a suggestion of lower risk of LC, but not VLC, in older compared to younger people (Fig. 2).

We fitted mutually adjusted Accelerated Failure Time models to assess factors associated with the rate of recovery from persistent symptoms (Supplementary Table 4). For people who had LC ( ≥ 12 weeks), longer time to recovery was found for females (adjusted time to recovery [aTR] 1·14 [1·06, 1·23] compared to males) and people with comorbidities (aTR 1·24 [1·08, 1·42] and 2·05 [1·58, 2·66] respectively for 1 and ≥2 comorbidities compared to no comorbidities). Shorter time to recovery of LC was found in people of other and mixed ethnicity (aTR 0·63 [0·45, 0·89] and 0·75 [0·57, 0·99] for other and mixed compared to white), people living in the least deprived areas (aTR 0·78 [0·69, 0·88] compared to most deprived), not-current smokers (aTR 0·73 [0·62, 0·86] compared to current smokers), later variants at the time of infection (aTR 0·79 [0·72, 0·86], 0·89 [0·79, 0·99], 0·69 [0·61, 0·78] for Alpha, Delta and Omicron respectively compared to Wild-type).

Current symptom profile and health-related quality of life

In our study population, the most common symptoms experienced by individuals with ongoing persistent symptoms were mild fatigue (66·9%), difficulty thinking or concentrating (54·9%) and joint pains (54·6%). However, mild fatigue and joint pains were also common in those with no history of COVID-19 and in those in whom symptoms had resolved (Fig. 3). The greatest difference in symptom prevalence between those with ongoing persistent symptoms and other participants were for loss or change of sense of smell (aOR 9·31, [8·64, 10·04]) or taste (aOR 8·47; [7·85, 9·15]), shortness of breath (aOR 6·69; [6·29, 7·12]), severe fatigue (aOR 6·19; [5·66, 6·77]), difficulty thinking or concentrating (aOR 4·97; [4·68, 5·27]), chest tightness or pain (aOR 4·71; [4·37, 5·08]) and poor memory (aOR 4·40; [4·15, 4·66]) (Fig. 3).

Fig. 3
figure 3

Current symptoms profile by COVID-19 history a) Forest plot of current symptoms in those reporting ongoing persistent symptoms following COVID-19 versus all other respondents, and b) Prevalence of current symptoms by COVID-19 history. Panel a shows the results of logistic regression models with 29 individual symptoms currently experienced (y/n) as the binary outcome variable and COVID-19 history as the primary exposure variable of interest (n = 242,712 participants). In the forest plot, data were presented as adjusted odds ratios (central dot) and 95% confidence intervals (bars). Odds ratios adjusted for age, sex, ethnicity, IMD, comorbidities, smoking status. Panel b shows the prevalence of 29 symptoms surveyed (n = 242,712 participants). Unweighted survey data presented. Data used: Supplementary Tables 5, 6.

Participants with ongoing symptoms lasting ≥12 weeks following COVID-19 reported worse current health, including a higher number of symptoms within the last two weeks, and greater reduction in ability to carry out daily activities due to these symptoms (Table 2). Post-exertional malaise, characterised by asking respondents who reported fatigue about worsening of fatigue symptoms after minimal physical and mental effort, and whether exercise makes fatigue symptoms worse, was also more common in individuals reporting ongoing persistent symptoms (Table 2, Supplementary Table 7). Additionally, worse mental health and health-related quality of life were reported by participants with ongoing symptoms lasting ≥12 weeks (Table 2, Supplementary Table 7). Table 2 also shows that for the 3221 people who previously had post-COVID-19 symptoms lasting ≥12 weeks and report having recovered, their health status is broadly similar to those with shorter recovery or who never had COVID-19.

Table 2 Current symptom profile and health-related quality of life of participants by COVID-19 history (n = 242,712)

Discussion

We show that symptomatic SARS-CoV-2 infection in England in adults is usually short-lived with most people reporting a short illness with symptom resolution within 2 weeks. However, in our study population, one in 10 people with symptomatic SARS-CoV-2 infection report symptoms for more than 4 weeks, one in 13 for more than 12 weeks (meeting the WHO definition for “post COVID-19 condition (Long COVID)”10), and 1 in 20 for more than 52 weeks. In our study population, 69% of those with persistent symptoms at 12 weeks still had symptoms at 52 weeks, meaning that 31% recovered within a year. We found that female sex, higher deprivation, having a pre-existing health condition, more severe symptoms at onset, and being infected when the original Wild-type variant was dominant was associated with having symptoms persisting for ≥12 weeks and ≥52 weeks. The above variables have previously been identified as risk factors for Long COVID2,7,11,12,13. We found a suggestion of lower reporting of persistent symptoms in older ages unlike another population-based study in the UK which found a positive linear association between age and Long COVID7.

The variant at the time of infection, initial severity and presence of pre-existing health conditions had the biggest impact on persistent symptoms, consistent with previous findings6,7,14,15. Compared to Wild-type, those infected when Omicron was dominant were 88% less likely to report symptoms beyond 12 weeks; this may reflect changing immunity in the population from previous exposure to the virus and vaccination. A recent case-control study conducted in the UK found lower odds of Long COVID with the Omicron versus the Delta variant, ranging from OR 0·2 (95% CI 0·2, 0·3) in those vaccinated >6 months prior to infection to 0·5 (95% CI 0·4, 0·6) in those vaccinated <3 months prior to infection16. We did not find conclusive evidence of effectiveness of vaccination against Long COVID. Vaccination reduces the severity of COVID-1917 and it may be through this indirect route that it has an impact on the risk of persistent symptoms post-infection. However, recent systematic reviews suggest that vaccination before SARS-CoV-2 infection could reduce the risk of subsequent Long COVID13,18,19.

The reporting of current symptoms was high across all groups in our study. For example, while 66·9%, 54·9% and 54·6% of individuals with ongoing persistent symptoms post-COVID-19 reported currently experiencing mild fatigue, difficulty thinking or concentrating and joint pains, respectively, the prevalence of these symptoms in those who never had COVID-19 was also high, at 31·1%, 15·2% and 35·5%. Indeed, a high level of symptom reporting was also observed for those who had recovered from COVID-19; the prevalence of mild fatigue, difficulty thinking or concentrating and joint pains in those with asymptomatic SARS-CoV-2 infection or who had recovered from COVID-19 within 4 weeks was 38·3%, 21·3% and 34·4%, respectively. These findings of high symptom prevalence in comparison groups have been observed elsewhere7,20,21, and could be due to higher participation in studies of people with current symptoms. Alternatively, this may reflect the timing of our survey which included months with high levels of upper-respiratory and influenza-like illness in the population. However, our data did show that the most specific persistent symptoms following COVID-19 were loss or change of sense of smell or taste, shortness of breath, severe fatigue, and difficulty thinking or concentrating, which were nine, seven, six and five times more likely, respectively, than in other participants. Of the few studies with a COVID-19-negative comparator group, one showed that COVID-19 cases had a higher likelihood of mood disorder, anxiety, and insomnia when compared to people with influenza or respiratory tract infection22. Another study found that in comparison to community controls, COVID-19 cases had a higher prevalence of symptoms at 6- and 9-months, including fatigue, sleep difficulties, hair loss, smell disorder, taste disorder, palpitations, chest pain, and headaches23.

There were substantial differences in currently reported health and well-being between individuals reporting ongoing persistent post-COVID-19 symptoms and those who had never had COVID-19 or had recovered, consistent with published evidence7,14,24. Encouragingly, those whose symptoms had resolved, even after 52 weeks, had general health and quality of life scores similar to those with no COVID-19 history or who recovered quickly. The dyspnoea and post-exertional malaise (PEM) scales were asked of everyone reporting shortness of breath (Dyspnoea 12) or fatigue, and individuals reporting persistent symptoms following COVID-19 scored higher (i.e. worse symptoms) than others, suggesting these symptoms may be more specific. A meta-analysis of 12 studies that evaluated health-related quality of life in individuals with Long COVID reported a pooled prevalence of poor quality of life (EQ5D Visual Analogue Scale - EQ‐VAS) of 59% (95% CI 42, 75)24. Similarly, the Long-COVID in Scotland study found that symptomatic SARS-CoV-2 infection was associated with a wide range of impaired daily activities and reduced health-related quality of life7.

A strength of our study is that we have addressed some of the limitations of existing studies by having a comparison group and including people in the general population who had severe, mild, and asymptomatic SARS-CoV-2 infections. We compared contemporaneous symptom profiles of community-based adults reporting ongoing persistent symptoms post-COVID-19 versus those who have never had COVID-19 or have recovered. Our study is the largest yet to look at these questions and goes further than previous questionnaire-based studies with COVID-19 negative8 and never-infected7 controls by identifying factors associated not only with recovery (yes/no) but rate of recovery. Indeed, our findings highlight the importance of having a comparator cohort of participants who tested negative and experienced the pandemic and national lockdowns. However, we acknowledge the possibility of misclassification bias in our comparator groups as infections may have gone undetected particularly in stages of the pandemic when free universal testing was not available in the UK.

We also recognise that the subjective nature of symptoms creates the potential for reporting and recall bias. We used information regarding presence and duration of symptoms rather than whether participants described themselves as having “Long COVID” to reduce potential reporting bias. The data on symptoms at the time of PCR testing were retrospective which introduces the possibility of recall bias, although we have previously shown that REACT participant reports of symptom onset date closely mirrored the epidemic curve25. There is also a risk that recall bias may have differentially affected reporting of symptoms by participants infected at different times, along with other time-varying factors, such as behaviour, seasonal weather patterns and changing pandemic restrictions, knowledge and expectations26, which may account for at least part of the association between persistent symptoms and Wild-type infection. However, studies looking at individuals with confirmed infections of different SARS-CoV-2 strains also show lower risks with more recent variants27,28.

We used validated instruments to assess mental health29,30, quality of life31. dyspnoea32, and fatigue33 but recognise the limitations of self-reporting and floor and ceiling effects (i.e, if a higher percentage of individuals achieve either maximum or minimum scores). The PHQ-9 scale used is a diagnostic tool for depression. However, some of the somatic questions have been found to be strongly correlated with symptoms that are common in Long COVID, including fatigue, sleep disruption and brain fog34. As such, by using this scale we might be overestimating the level of depression. This issue was raised by Re’em et al. who suggest the PHQ-235 screening criteria may be more appropriate for Long COVID as they do not include somatic items and simply require a score of 3 or more from the first two questions of PHQ-934. The percentage of participants in our study across all COVID-19 categories with a PHQ-2 score ≥3 was lower than the percentage with PHQ-9 ≥ 10 and the difference was more marked in those with ongoing persistent symptoms post-COVID-19 (Supplementary Table 7, Supplementary Table 8).

Our questionnaire response rate was 34·6%; response rates in the Long-COVID in Scotland Study and a Dutch population-based cohort were 16% and 33-39%, respectively7,8. Like these studies, our participants were more likely to be female, older, of white ethnicity and from the least deprived areas compared with the general adult population. These issues might cause selection bias in our study; however, we did not observe substantial differences between those invited and those who participated in the study on the measured sociodemographic characteristics (Supplementary Table 1).

A further limitation is that we do not present estimates for the population prevalence of persistent symptoms. To do so would require weighting but production of weights is far from straightforward given the composition of our sample. The probability of being in the sample was dependent upon the composition of the base population, varying response rates by sociodemographic group and across REACT-1 and REACT-2 rounds. We also oversampled participants who tested positive for SARS-CoV-2 and who reported persistent symptoms. Producing weights that take account of all these factors would involve making extensive assumptions which would likely introduce unknown biases.

In summary, our study provides timely data about the effect and implications of the pandemic on adults in England with and without ongoing persistent symptoms following COVID-19. Although COVID-19 is usually of short duration, some adults experience persistent and burdensome illness, although a sizeable proportion still recover after a prolonged period. Differences in symptoms and recovery are being further explored in an in-depth interview study with REACT participants to further understand the varying presentations and trajectories of post-COVID conditions36.

Methods

Study sample frame: the REACT programme

The REACT programme sampled random cross-sections of the population in England to quantify community prevalence of virus by RT-PCR (REACT-1, 19 rounds between May 2020 and March 2022) and of IgG anti-SARS-CoV-2 antibody based on a self-administered lateral flow immunoassay (LFIA) test (REACT-2, six rounds between June 2020 and May 2021)37. Methods for the studies are published elsewhere5,9,37. The average response rate across both studies and all rounds was 23·4%. Each round of data collection included between 90,000 and 210,000 participants38. Overall, 3,099,386 adults registered to take part in the REACT programme. Of these participants, 2,494,309/3,099,386 (80·5%) consented to both recontact and data linkage of routine health records.

Study design and participants

In this study, we aimed for a sample size of at least 160,000. Assuming a 20·0% response rate, we obtained a sample of 800,000 adults aged ≥18 years using as a sample frame REACT-1 and REACT-2 participants who had consented to both re-contact and data linkage (n = 2,494,309) (Supplementary Fig. 1). Personalised invitations were sent via email for one round of data collection between 1 August–1 December 2022. To increase our sample of individuals with persistent symptoms following COVID-19 we first invited all individuals in the following subgroups:

  1. 1.

    Individuals from REACT-1 or REACT-2 with a previous history of self-reported test confirmed or suspected COVID-19 who reported persistent symptoms of \(\ge\)12 weeks (n = 52,501)

  2. 2.

    Individuals from REACT-1 who tested positive for SARS-CoV-2 as part of the study (n = 13,482)

  3. 3.

    Individuals from REACT-2 who tested positive for SARS-CoV-2 IgG as part of the study and had not been vaccinated at the time (n = 85,757)

To achieve the 800,000-participant size, a random sample (n = 648,260) of all remaining adults not meeting the above criteria was selected.

Participants registered via an online portal. Those registered completed an online questionnaire38. It was designed to collect data on current health, well-being, functionality and recent symptoms, followed by questions about history of COVID-19 (SARS-CoV-2 PCR and lateral flow devices (LFD) test results, frequency, severity, duration). Current health status included a set of 29 symptoms potentially related to COVID-19 including: (i) loss or change of sense of smell or taste, (ii) coryzal symptoms, (iii) gastrointestinal symptoms, (iv) fatigue-related symptoms, (v) respiratory or cardiac symptoms (vi) memory or cognitive symptoms, (vii) other flu-like and miscellaneous symptoms. Mental and physical health outcomes were collected using the following validated questionnaires:

  1. 1.

    Quality of life/functioning: EuroQol five-dimension five-level (EQ-5D-5L)31

  2. 2.

    Assessment of breathlessness in people reporting this symptom: Dyspnoea-1232

  3. 3.

    Assessment of post-exertional malaise (PEM) in people reporting fatigue: three PEM-items from the DePaul Symptom Questionnaire39

  4. 4.

    Mental health: Generalized Anxiety Disorder 7-item scale (GAD-7)29 and Patient Health Questionnaire-9 (PHQ-9)30

Data linkage

The UK Health Security Agency (UKHSA) received results of all SARS-CoV-2 PCR tests in England from community settings (Pillar 2)40. In addition, members of the public were encouraged to submit results of at-home self-testing using lateral flow devices (Pillar 2)40. To obtain additional information on dates of positive SARS-CoV-2 tests, participant study data were linked to their Pillar 2 records using their unique National Health Service (NHS) number and other personal identifiers.

To obtain information on dates of received COVID-19 vaccine doses, participant study data were linked to their NHS records from NHS Digital on COVID-19 vaccination events41. This was done using their unique NHS number and other personal identifiers.

Statistical analyses

In the original REACT programme (REACT-1 and REACT-2) limited information was collected about persistent symptoms as this was not the main objective of these studies. Here, all participants were surveyed about current symptoms (as experienced on day of survey completion) and then later in the questionnaire participants were asked (retrospectively) the date their symptoms they ascribed to COVID-19 initially started and whether they thought their COVID-19 symptoms had resolved (and date of symptom resolution). This information was used to divide the study participants into different categories depending on symptom duration and whether or not they had resolved.

Our primary analyses focused on prevalence of individual symptoms currently reported at the time of questionnaire completion and validated self-reported physical and mental health outcome measures by the following COVID-19 categories of participants:

  1. 1.

    No COVID: no history or evidence of SARS-CoV-2 infection or COVID-19;

  2. 2.

    Asymptomatic or resolved short COVID < 4: Asymptomatic SARS-CoV-2 infection or COVID-19 symptoms resolved within 4 weeks;

  3. 3.

    Resolved short COVID ≥ 4 to < 12: COVID-19 symptoms resolved within 4-12 weeks;

  4. 4.

    Resolved persistent COVID: post-COVID-19 symptoms lasting ≥12 weeks but no longer symptomatic; this was further divided into those lasting less than 52 weeks and those lasting ≥52 weeks before resolution.

  5. 5.

    Ongoing persistent COVID: post-COVID-19 symptoms lasting ≥12 weeks and ongoing; this was further divided into those lasting less than 52 weeks to date and those lasting ≥52 weeks to date.

We present data on numbers and percent of participants in each of the above categories that are unweighted for potential differential response rates between groups.

We included only symptomatic SARS-CoV-2 infections confirmed by a positive test result (PCR or LFD) in our definition of COVID-19. This included self-reported test positives (survey question), REACT-1 test positives, unvaccinated REACT-2 SARS-CoV-2 IgG test positives and Pillar 2 test positives. Asymptomatic infections were defined as test (PCR or LFD) positives with no reported symptoms. A repeat positive test result was included as a separate infection if performed ≥90 days after a previous positive test42. The COVID-19 episode date used was symptom onset date for symptomatic infections or date of a positive SARS-CoV-2 PCR or LFD test for asymptomatic infections. We excluded individuals with less than 12 weeks follow up from their COVID-19 episode date to their questionnaire completion date.

Index of Multiple Deprivation (IMD) 2019 was used as a measure of relative deprivation, based on seven domains at a community level (Lower Layer Super Output Area, approximately 1500 residents) across England (income, employment, education, health, crime, barriers to housing and services, and living environment)43. Participants were allocated to quintiles of deprivation based on their residential postcode. A valid COVID-19 vaccine dose was defined as a date of vaccination 14 days or more prior to the COVID-19 episode date. The Wild-type strain was dominant in the UK prior to December 2020. Alpha dominated between December 2020 and April 2021 followed by Delta (May 2021 to mid-December 2021) and Omicron (late December 2021 onwards)44.

Continuous variables were presented as median (IQR) or mean (SD), as appropriate. Binary and categorical variables were presented as counts and percentages. A χ² test was used to identify differences in proportions across COVID-19 categories. For normally distributed continuous data, analysis of variance (ANOVA F-test) was used to test differences across categories, with Kruskal-Wallis tests used for non-normally distributed data.

We used logistic regression (adjusted Odds Ratios (aOR) and 95% Confidence Intervals (CI) adjusted for age, sex, ethnicity, IMD, comorbidities (presence of a pre-existing health condition) and smoking status (not-current smokers [including ex-smokers] vs. current smokers) to compare current self-reported specific symptoms for those with ongoing persistent symptoms following COVID-19 to those who had never had COVID-19 or had recovered. Further, we used mutually adjusted logistic regression to quantify the associations of age, sex, ethnicity, IMD, comorbidities, smoking status, severity of initial illness, COVID-19 vaccination status, and dominant UK circulating SARS-CoV-2 variant at time of infection with symptoms following COVID-19 lasting ≥12 weeks and lasting ≥52 weeks.

The dataset was converted into a format suitable for survival analysis techniques. Participants were followed up from their COVID-19 symptom onset date until the reported symptom end date (participants provided one date for when all symptoms had resolved) or, if they reported ongoing symptoms, until the survey completion date. We constructed Kaplan-Meier plots of time to symptom end date. To assess factors associated with symptom recovery in participants with symptom persistence beyond 12 weeks, we used an Accelerated Failure Time model45 with a Log-Normal distribution to quantify the associations between COVID-19 symptom discontinuation beyond 12 weeks and the following factors: age, sex, ethnicity, comorbidities, IMD, smoking status, severity of acute illness, COVID-19 vaccination status, and dominant UK circulating SARS-CoV-2 variant at time of symptom onset. Mutually adjusted Time Ratios (aTR) and 95% Confidence Intervals (CI) were estimated. An aTR >1 is interpreted as a slower symptom recovery rate beyond 12 weeks in participants with COVID-19 symptoms lasting ≥12 weeks.

All tests were two-tailed and p values of less than 0·05 were considered statistically significant. Data were analysed using the statistical package STATA version 15·0.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.