Introduction

Infection with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) increases the risk of arterial thrombotic events (ATE), such as myocardial infarction (MI) and ischaemic stroke, and venous thrombotic events (VTE), such as pulmonary embolism (PE) and lower limb deep vein thrombosis (DVT)1. This elevation in risk is highest immediately after infection, and higher after severe COVID-191.

The dominant SARS-CoV-2 variant changed during the pandemic, and different variants may be associated with different subsequent risks of major vascular events. The Delta variant (B.1.617.2), which emerged in late 2020 and was dominant by mid-20212,3,4,5,6, was associated with a greater risk of hospitalisation and death than the Alpha variant7,8,9,10,11.

The UK’s COVID-19 vaccine rollout started on December 8th, 2020, with eligibility in order of priority groups determined by the Joint Committee on Vaccination and Immunisation (JCVI), based on age, clinical vulnerability and health and social care occupation12. All adults in England became eligible to receive a first vaccination by June 18th 2021, and a second vaccination by August 202113.

Cohort studies have reported lower incidence of arterial and venous thrombotic events after COVID-19 vaccination than in unvaccinated people14 and that rates of hospitalisation for acute MI and ischemic stroke 31-120 days after COVID-19 are lower in vaccinated than unvaccinated people15. Randomised trials suggest that seasonal influenza vaccination reduces rates of cardiovascular events and cardiovascular death16. However, uncertainties remain. In younger people, the rare complications of the Pfizer–BioNTech BNT162b2 and Moderna mRNA-1273 mRNA vaccines (myocarditis)17,18 and the Oxford–AstraZeneca ChAdOx1 nCoV-19 AZD1222 vaccine (vaccine induced thrombotic thrombocytopenia)19 are balanced by reduced risk of severe COVID-19, but the role of these vaccines in modifying the incidence of common vascular events after COVID-19 is less clear.

Using linked anonymised electronic health records on 18.2 million adults registered with English general practices (GP), we compared the incidence of vascular diseases after COVID-19 with the incidence before or without COVID-19, in a pre-vaccination cohort followed during the wild-type and Alpha variant eras; and in vaccinated and unvaccinated cohorts followed during the Delta variant era (Fig. 1). Differences between associations in the pre-vaccination and unvaccinated cohorts should relate to effects of changing variants, while differences between the vaccinated and unvaccinated cohorts should relate to the effect of vaccination.

Fig. 1: Estimated number of COVID-19 cases identified by community testing in England between January 2020 and January 2022.
figure 1

The start and end of follow-up for each cohort, the periods during which different variants were dominant and the times that mass testing became available and vaccination rollout commenced are shown. Code and underlying data are available in the following repository: https://github.com/grpEHR/covid_variants.

Results

Characteristics of study cohorts

Among 18,210,937 people in the pre-vaccination cohort, 1,150,299 had a COVID-19 diagnosis during follow-up of whom 75,667 (6.6%) were hospitalised. There were 844,235 COVID-19 diagnoses (15,342 (1.8%) hospitalised) among 13,572,399 people in the vaccinated cohort and 162,103 (9,250 (5.7%) hospitalised) among 3,161,485 people in the unvaccinated cohort (Table 1). Among 75,667 people in the pre-vaccination cohort who were hospitalised because of COVID-19 37,881 (50.1%) were hospitalised on the day of diagnosis and 6,278 (8.3%) were hospitalised the day after diagnosis: the remainder were hospitalised 2–28 days after diagnosis (Table S1). In the pre-vaccination cohort, the median age was 49 years (interquartile range (IQR) 34–64), a slight majority (50.2%) were female, and 78.0%, 6.4% and 2.2% were recorded as being White, South Asian and Black ethnicities respectively. Differences between the vaccinated and unvaccinated cohorts reflected predictors of COVID-19 vaccine uptake20. The median (IQR) age was 54 (IQR 39–68) years in the vaccinated cohort, compared with 36 (IQR 28–47) years in the unvaccinated cohort. The proportions of females were 52.1% and 42.0% in the vaccinated and unvaccinated cohorts respectively, while the proportions recorded as of White ethnicity were 82.1% and 61.6% respectively, and the proportions living in the most deprived areas were 16.4% and 29.8% respectively. Compared with the vaccinated cohort, people in the unvaccinated cohort were more likely to be smokers, less likely to consult their GPs and less likely to have prior medical problems recorded (Table S2).

Table 1 Patient characteristics in the pre-vaccination, vaccinated and unvaccinated cohorts

Number of events and incidence rates

The numbers of events, person-years, and incidence rates per 100,000 person-years of vascular events before any COVID-19 diagnosis, after hospitalised COVID-19 and after non-hospitalised COVID-19, are presented, for each cohort and outcome, in Table 2. There was a total of 212,557, 57,425 and 3,316 ATE in the pre-vaccination, vaccinated and unvaccinated cohorts respectively. The corresponding total numbers of VTE were 117,730, 29,107 and 3,178 respectively. In each cohort, the incidence of each arterial thrombotic and venous thrombotic event was higher after COVID-19 than before or without COVID-19. For each outcome and cohort, the highest incidence rates were after hospitalised COVID-19. Incidence rates were generally lower in the unvaccinated cohort than in the vaccinated cohort, as expected given that the median age of the unvaccinated cohort (36 years) was much lower than that of the vaccinated cohort (54 years).

Table 2 Number of cardiovascular events in the pre-vaccination cohort, vaccinated cohort and unvaccinated cohort, with person-years of follow-up, by COVID-19 severity

Comparisons of event rates after COVID-19 diagnosis versus before or without COVID-19 diagnosis

Adjusted hazard ratios were estimated using Cox-proportional hazards models to quantify the associations between COVID-19 diagnosis (time-varying exposure) and a first cardiovascular event including arterial thrombotic, venous thrombotic and other cardiovascular events. Minimally adjusted models accounted for age, sex and region and maximally adjusted models accounted additionally for ethnicity, area deprivation, smoking status, number of GP-patient interactions and history of comorbidities. In each cohort, maximally adjusted HRs (aHRs) comparing the incidence of each outcome after COVID-19 diagnosis with the incidence before or without COVID-19 diagnosis were attenuated compared with age-, sex- and region-adjusted HRs (Tables 35, Table S3). For all outcomes, hazard ratios were extremely high on the day of COVID-19 diagnosis (day 0), particularly among individuals hospitalised with COVID-19 on the day of diagnosis. The incidence of each outcome in each cohort was also elevated during weeks 1–4 after COVID-19 diagnosis, compared with before or without COVID-19 diagnosis. aHRs were lower in subsequent time periods than during weeks 1–4 after COVID-19 diagnosis, though they were generally greater than 1 throughout follow-up in each cohort (Figs. 2 and 3, Tables 35). aHRs during weeks 1-4 after COVID-19 diagnosis were substantially lower in the vaccinated cohort than in the pre-vaccination and unvaccinated cohorts, and generally remained lower than in other cohorts during weeks 4–28 (Figs. 2 and 3, Tables 35). For each outcome and in each cohort, aHRs were substantially higher after hospitalised than non-hospitalised COVID-19 (Fig. 2, Tables 3 and 4, Table S3).

Table 3 Adjusted hazard ratios (95% CI) comparing the incidence of arterial thrombotic events after versus before or without a COVID-19 diagnosis, in the pre-vaccination, vaccinated and unvaccinated cohorts, overall and according to COVID-19 severity
Table 4 Adjusted hazard ratios (95% CI) comparing the incidence of venous thrombotic events after versus before or without a COVID-19 diagnosis, in the pre-vaccination, vaccinated and unvaccinated cohorts, overall and according to COVID-19 severity
Fig. 2: Maximally adjusted hazard ratios and 95% CIs comparing the incidence of arterial thrombotic and venous thrombotic events after versus before or without a COVID-19 diagnosis, in the pre-vaccination, vaccinated and unvaccinated cohorts, overall and by COVID-19 severity.
figure 2

Upper panels: Maximally adjusted hazard ratios and 95% CIs for arterial thrombotic events. Lower panels: Maximally adjusted hazard ratios and 95% CIs for venous thrombotic events. Left panels: all COVID-19 diagnoses: Middle panels: hospitalised COVID-19. Right panels: non-hospitalised COVID-19. The numbers of people in the pre-vaccination, vaccinated and unvaccinated cohorts were 18,210,937; 13,572,399 and 3,161,485 respectively. The numbers of COVID-19 diagnoses were 1,150,299 (75,667 hospitalised) in the pre-vaccination cohort, 844,235 (15,342 hospitalised) in the vaccinated cohort and 162,103 (9250 hospitalised) in the unvaccinated cohort. Maximally adjusted hazard ratios and 95% CIs are plotted at the median time of the outcome event within each follow up period in each cohort. Events on the day of COVID-19 diagnosis (day 0) were excluded. The numerical values of hazard ratios and their 95% CIs are displayed in Tables 3 and 4.

Table 5 Maximally adjusted hazard ratios (95% CI) comparing the incidence of other vascular events after versus before or without a COVID-19 diagnosis, in the pre-vaccination, vaccinated and unvaccinated cohorts, overall and according to COVID-19 severity
Fig. 3: Maximally adjusted hazard ratios and 95% CIs comparing the incidence of arterial thrombotic, venous thrombotic, and other vascular events after versus before or without a COVID-19 diagnosis, in the pre-vaccination, vaccinated and unvaccinated cohorts.
figure 3

Upper left panel: Acute myocardial infarction. Upper right panel: Ischaemic stroke. Second row left panel: Pulmonary embolism. Second-row right panel: Deep vein thrombosis. Third row left panel: Heart failure. Third row right panel: Angina. Lower left panel: Transient ischaemic attack. Lower right panel: Subarachnoid haemorrhage and haemorrhagic stroke. The numbers of people in the pre-vaccination, vaccinated and unvaccinated cohorts were 18,210,937; 13,572,399 and 3,161,485, respectively. The numbers of COVID-19 diagnoses was 1,150,299 in the pre-vaccination cohort, 844,235 in the vaccinated cohort and 162,103 in the unvaccinated cohort. Maximally adjusted hazard ratios and 95% CIs are plotted at the median time of the outcome event within each follow-up period in each cohort. Events on the day of COVID-19 diagnosis (day 0) were excluded. The numerical values of hazard ratios and 95% CIs are displayed in Tables 3, 4 and 5.

The incidence of ATE during weeks 1-4 after COVID-19 diagnosis, compared with before or without COVID-19 diagnosis, was elevated in the pre-vaccination and unvaccinated cohorts (aHRs 4.40 (95% CI 4.16–4.65) and 8.53 (7.20–10.1) respectively) but less markedly elevated in the vaccinated cohort (2.09 (1.92–2.28)) (Fig. 2, Table 3). The incidence of ATE remained elevated during weeks 5-28 in the unvaccinated cohort (1.54 (1.16–2.04)) and up to weeks 53–102 in the pre-vaccination cohort (1.22 (1.14–1.30)). During weeks 1–4 the aHRs for ATE were substantially lower in the vaccinated cohort than in the unvaccinated or pre-vaccination cohorts (ratios of aHRs 0.28 (0.25–0.32) and 0.36 (0.33–0.38) respectively, Table S4). Although attenuated, aHRs remained lower in the vaccinated cohort than in the unvaccinated or pre-vaccination cohorts during weeks 5–28 (ratios of aHRs 0.70 (0.52–0.94) and 0.73 (0.66–0.82), respectively).

The aHRs for ATE were substantially higher during weeks 1–4 after hospitalised COVID-19, versus before or without COVID-19 diagnosis (pre-vaccination cohort 12.1 (11.2–13.1), unvaccinated cohort 19.6 (15.6–24.5)) than after non-hospitalised COVID-19 (pre-vaccination cohort 2.70 (2.50–2.92), unvaccinated cohort 4.35 (3.36–5.64)). In sensitivity analyses restricted to primary diagnoses of ATE, aHRs during weeks 1–4 after hospitalised COVID-19 (including both day 0 and the rest of that period) were attenuated compared with aHRs for all ATEs (Figure S1). Estimated hazard ratios were similar in sensitivity analyses removing censoring at first vaccination in the unvaccinated cohort (Table S5). In additional analyses splitting follow-up during weeks 1-4 into shorter time intervals, hazard ratios for ATE declined steadily from days 1–6 to days 21–27 after COVID-19 diagnosis, in all cohorts (Table S6).

The aHRs for VTE during weeks 1-4 after COVID-19 diagnosis, versus before or without COVID-19 diagnosis, were substantially higher than for ATE, particularly in the pre-vaccination and unvaccinated cohorts (aHRs 16.6 (95% CI 15.9–17.2) and 29.6 (26.7–32.9) respectively), but less markedly in the vaccinated cohort (4.87 (4.53–5.23)) (Fig. 2, Table 4). The incidence of VTE remained elevated, compared with before or without COVID-19 diagnosis, during weeks 5–28 in all cohorts and up to weeks 53–102 in the pre-vaccination cohort (1.20 (1.09–1.32)). During weeks 1–4 the aHRs for VTE were substantially lower in the vaccinated cohort than in the unvaccinated or pre-vaccination cohorts (ratios of aHRs 0.17 (0.15–0.19) and 0.24 (0.23–0.26) respectively, Table S4). Although attenuated, aHRs remained lower in the vaccinated cohort than in the unvaccinated or pre-vaccination cohorts during weeks 5–28 (ratios of aHRs0 0.63 (0.49–0.80) and 0.61 (0.55–0.68), respectively).

The aHRs for VTE were substantially higher during weeks 1–4 after hospitalised COVID-19 (pre-vaccination cohort 88.5 (84.1–93.2), vaccinated cohort 46.7 (42.2–51.6), unvaccinated cohort 199.8 (174.7–228.5)) than after non-hospitalised COVID-19 (pre-vaccination cohort 5.99 (5.61–6.40), vaccinated cohort 2.13 (1.91–2.38), unvaccinated cohort 7.64 (6.34–9.20)). The incidence of VTE was still markedly elevated during weeks 5–28 after hospitalised COVID-19 in the pre-vaccination, vaccinated and unvaccinated cohorts (aHRs 4.39 (4.01–4.80), 5.67 (4.63–6.94) and 7.50 (5.07–11.1) respectively). In sensitivity analyses restricted to primary diagnosis of VTE, aHRs after COVID-19 diagnosis were attenuated compared with aHRs for all VTEs (Figure S2). This attenuation was particularly marked during weeks 1–4 (including both day 0 and the rest of that period) and after hospitalised COVID-19. In additional analyses splitting follow-up during weeks 1–4 into shorter time intervals, hazard ratios for VTE were generally similar during days 1–6 and days 7–13 after hospitalised COVID-19, then declined during days 14–20 and days 21–27 (Table S6). Hazard ratios after non-hospitalised COVID-19 did not markedly decline between days 1-6 and days 21–27.

In each cohort, aHRs for acute MI during weeks 1–4 after COVID-19 diagnosis, versus before or without COVID-19 diagnosis, were similar to those for ischaemic stroke (Fig. 3, Table 3). In the pre-vaccination cohort, aHRs for acute MI remained elevated during weeks 29–52 (1.16 (1.09–1.24)) and weeks 53–102 (1.31 (1.19–1.45)), but the incidence of ischaemic stroke was only slightly elevated from 29 weeks onwards (aHR 1.16 (1.05–1.27) during weeks 53–102). In all cohorts, aHRs during weeks 1–4 were markedly higher for PE (pre-vaccination 31.7 (30.3–33.1)), vaccinated cohort (9.10 (8.36–9.90), unvaccinated cohort 82.8 (72.7–94.3)) than for DVT, and aHRs for PE remained higher than for DVT during weeks 5–28 (Fig. 3, Table 4). By contrast, in the pre-vaccination cohort aHRs for DVT during weeks 29–102 were higher than for PE.

The incidence of heart failure, angina, and subarachnoid haemorrhage and haemorrhagic stroke during weeks 1–4 after COVID-19 diagnosis was substantially elevated in each cohort, versus before or without COVID-19 diagnosis, although aHRs were lower in the vaccinated cohort than the pre-vaccination or unvaccinated cohorts (Fig. 3, Table 5). Compared with these outcomes, the incidence of transient ischaemic attack was less markedly elevated during weeks 1–4. Though greater than 1, aHRs for these four outcomes were markedly lower during weeks 5–28 than weeks 1-4 after COVID-19 diagnosis. In the pre-vaccination cohort, the incidence of heart failure during weeks 53–102 was similar to the incidence before or without COVID-19 diagnosis (aHR 1.04 (0.98–1.11)). The incidence of angina and transient ischaemic attack was slightly elevated (aHRs between 1.10 and 1.16) and remained elevated during weeks 29–102. aHRs for subarachnoid haemorrhage and haemorrhagic stroke were 1.32 (1.14–1.52) during weeks 29–52 and 1.42 (1.15–1.76) during weeks 53–102.

In subgroup analyses, aHRs for both ATE and VTE were generally lower in younger age groups, in females, and in those reporting white ethnicity (Tables S7, S8, Figures S3, S4). Estimated excess risks of ATE 6 months post-COVID-19 diagnosis were 642, 229 and 718 per 100,000 people diagnosed with COVID-19 in the pre-vaccination, vaccinated and unvaccinated cohorts respectively (Fig. 4, Table S9). Corresponding estimated excess risks of VTE were 797, 270, and 1094 per 100,000 people diagnosed with COVID-19, respectively.

Fig. 4: Estimated absolute increase in the risk of arterial thrombotic and venous thrombotic events over time since COVID-19 diagnosis, compared with no COVID-19 diagnosis, in the pre-vaccination, vaccinated and unvaccinated cohorts.
figure 4

Upper panels: Estimated absolute increase in risk for arterial thrombotic events. Lower panels: Estimated absolute increase in risk for venous thrombotic events. Left panels: pre-vaccination cohort. Middle panels: vaccinated cohort. Right panels: unvaccinated cohort. The numbers of people in the pre-vaccination, vaccinated and unvaccinated cohorts were 18,210,937; 13,572,399 and 3,161,485 respectively. The number of COVID-19 diagnoses was 1,150,299 in the pre-vaccination cohort, 844,235 in the vaccinated cohort and 162,103 in the unvaccinated cohort. Increases in risks were estimated within sex and age groups, and the estimated overall increase in risk is the average of these (shown in black), weighted according to the proportions in each sex and age group in the pre-vaccination cohort. Estimated excess events at 28 weeks are displayed in Table S9.

Discussion

These cohort studies of up to ~18.2 million people demonstrate that COVID-19 vaccination substantially attenuates the elevated incidence of arterial and venous thrombotic events after COVID-19 diagnosis. This attenuation appears mainly attributable to the reductions in the severity of COVID-19 caused by vaccination. Among people diagnosed with COVID-19 before the availability of vaccination the incidence of ATEs and VTEs remained elevated by around 20% during the second year after COVID-19 diagnosis, and more so after hospitalised COVID-19, compared with the incidence before or without COVID-19 diagnosis. The associations of VTEs, and particularly PE, with hospitalised COVID-19 were stronger than those for ATEs. The elevations in incidence of arterial and venous thrombotic events after COVID-19 diagnosis was of similar magnitude in the pre-vaccination and unvaccinated cohorts, suggesting that changes in the dominant SARS-CoV-2 variant had little impact compared with that of vaccination.

Most previous studies of cardiovascular events following SARS-CoV-2 infections were conducted prior to the delta era and widespread availability of COVID-19 vaccination1,21,22,23. For example, a study of over 6 million patients from the US Department of Veterans Affairs healthcare system, including over 162,000 who had a positive COVID-19 test between March 2020 and January 2021 estimated HRs 30 days to 12 months post COVID-19 for several outcomes we considered, including MI (HR: 1.63, 95% CI: 1.51–1.75), DVT (2.09, 1.94-2.24), and heart failure (1.72, 1.65–1.80)23. Another study using data from the US Department of Veterans Affairs healthcare system found that vaccination against COVID-19 confers partial protection against a range of health outcomes including cardiovascular outcomes24.

Many pathways have been postulated to explain the increased risk of vascular events after infection25. Respiratory infections including SARS-CoV-2 activate immune responses beyond the respiratory epithelium. Influenza leads to inflammation, smooth muscle proliferation and fibrin deposition in atherosclerotic plaques in mice, which may increase risks of MI if replicated in human infection26. SARS-CoV-2 infection generally increases inflammatory cytokines, activating endothelium and increasing biomarkers of thrombosis, but probably does not directly infect the vascular epithelium (although this is contested)27. Agents that attenuate the inflammatory response (dexamethasone, tociluzamab and baricitinib) reduce mortality from SARS-CoV-2, but it is unclear whether these reduce vascular-specific mortality28,29,30,31. Cardiovascular sequalae of SARS and MERS are not well defined due to the small number of patients who have been followed up and lack of appropriate control groups, but the evidence points to potentially substantial post-infection cardiovascular risks32.

Although the risk of thrombotic events is high soon after infection, clinical trials have not demonstrated clear benefit from antithrombotic agents to most people with COVID-19. In RECOVERY, aspirin did not reduce the chance of death at 28 days after randomisation (rate ratio 0.96, 95%CI: 0.89–1.04), but did modestly reduce hospital stay33. In ATTACC, ACTIV-4a, and REMAP-CAP, therapeutic anticoagulation with heparins did not clearly increase organ-support free days in patients who were hospitalised with COVID-19 and were critically unwell (OR: 0.83, CrInt 0.67–1.03), but did in patients who were not critically unwell (4% (CrInt 0.5–7.2))34,35. Oral anticoagulation after admission did not clearly reduce mortality in people after hospital admission in HEALCOVID (n = 402)36 and ACTION (n = 615)37, but there was a suggestion of benefit (and no evidence of bleeding) in MICHELLE (n = 320)38.

We analysed comprehensive linked electronic health record data on ~40% of the English population. The large dataset size allowed us to estimate associations in important subgroups as well as overall, and we controlled for a substantial range of potential confounding factors. Limitations of our study include first that we did not have access to SARS-CoV-2 sequencing data so could not confirm the SARS-CoV-2 variant for individual infections. Second, we could not analyse an Omicron-era cohort as mandatory testing for SARS-CoV-2 in England stopped at the end of March 2022. Third, a substantial proportion of cardiovascular events were recorded on the day of COVID-19 diagnosis (day 0), so it is possible that some COVID-19 diagnoses were made because patients with cardiovascular events were examined in hospitals or in other healthcare settings, rather than because COVID-19 caused these cardiovascular events. However, we separated day 0 from weeks 1-4 and the differences between aHRs in the vaccinated and unvaccinated cohorts during weeks 1-4 suggest that reverse causation was not the main reason for the high hazard ratios. Fourth, although we adjusted for a broad range of potential confounders, our results may be biased by unmeasured confounding due to missing information or measurement error. For example, body mass index is not systematically recorded in electronic health records and may be incorrect in some instances. In addition, we did not adjust for potential time-varying confounders that varied after the cohort start date and may have predicted both COVID-19 and cardiovascular outcomes. Fifth, we did not account for reinfections, although these will have been rare until the Omicron variant became dominant. Sixth, older people in the vaccinated cohort were eligible for booster vaccination from September 2021 so associations between COVID-19 and CVD in that cohort will relate to the effects of the booster as well as the second vaccination. Seventh, outcome misclassification or delays in diagnosis can occur because individuals with mild symptoms do not present to healthcare or because it is difficult to identify a thrombotic disorder in individuals who are very unwell with COVID-19. Eighth, estimated hazard ratios were broadly similar for the pre-vaccination and unvaccinated cohorts. These may have been influenced, beyond the different SARS-CoV-2 variants studied in the two cohorts, by factors such as evolving testing strategies over time, the effect of lockdowns and changing access to healthcare services during the pandemic. Finally, we studied the incidence of cardiovascular events after diagnosis of COVID-19. Some COVID-19 cases will not have been recorded in electronic health records for reasons including lack of availability of mass testing in the first phase of the pandemic in 2020 and lack of testing of asymptomatic individuals. The inclusion of some follow-up after COVID-19 in the comparison group is likely to have led to the underestimation of the elevation in the incidence of cardiovascular events after COVID-19.

The absence of clear benefit for antithrombotic strategies after infection, and the clear reduction in thrombotic complications after vaccination means that vaccination is critical to prevent severe COVID-19 disease and its cardiovascular sequelae, particularly in population groups at the highest risk and in which coverage is currently low. In conclusion, COVID-19 vaccination reduces the risk of arterial thrombotic, venous thrombotic and other cardiovascular events after COVID-19 diagnosis. People who had COVID-19 before being vaccinated are at higher risk of these events for an extended period of at least two years: further follow-up is required to establish the duration of this higher risk.

Methods

The study design and pre-determined methods are described in detail in a publicly available protocol: https://github.com/opensafely/post-covid-vaccinated/tree/main/protocol.

Study design and data source

Using a cohort study design, linked electronic health records were analysed through OpenSAFELY (https://opensafely.org), a data analytics platform created to address COVID-19-related research questions39. The platform provides secure access to primary care records managed by the GP software provider, The Phoenix Partnership (TPP) SystmOne software, which covers around 40% of the population in England. This OpenSAFELY-TPP population is representative of the general population of England in terms of age, sex, ethnicity, index of multiple deprivation, and causes of death40. The TPP primary care data is securely linked at the individual level to the Second Generation Surveillance System for Pillars 1 and 2 SARS-COV-2 infection laboratory testing data, COVID-19 vaccination records (National Immunisation Management System), National Health Service (NHS) hospitalisations (Secondary Uses Services data) and the Office of National Statistics (ONS) death registry, including causes of death.

Outcomes were derived from primary care records, hospital admissions and causes of death. Clinically verified SNOMED-CT (Systematized Nomenclature of Medicine-Clinical Terms) and ICD-10 (International Classification of Disease, 10th revision) rule-based phenotyping algorithms were used to define arterial thrombosis outcomes (acute MI, ischaemic stroke and a composite ‘arterial thrombotic event’ (ATE)), venous thrombosis outcomes (PE, DVT and a composite ‘venous thrombotic event’ (VTE)), and other cardiovascular outcomes (heart failure, angina, transient ischaemic attack, subarachnoid haemorrhage and haemorrhagic stroke). Codes in any position (main or underlying diagnosis or cause of death) were used (see protocol: https://github.com/opensafely/post-covid-vaccinated/tree/main/protocol).

The date of COVID-19 diagnosis was defined as the earliest of the date of a positive SARS-COV-2 test, the date of a confirmed COVID-19 diagnosis in primary or secondary care or the date of death with SARS-COV-2 infection listed as primary or underlying cause. Individuals who were hospitalised with a COVID-19 primary diagnosis within 28 days of first COVID-19 diagnosis were categorised as ‘hospitalised COVID-19’, otherwise they were categorised as ‘non-hospitalised COVID-19’. Covariates identified as potential confounders included age; sex; ethnicity; region; area socioeconomic deprivation; smoking status; number of GP-patient interactions in the last 12 months; and previous history of a specific comorbidity (binary) for a range of diseases (details in Table S10).

Study population

Three cohorts were defined. In the ‘pre-vaccination’ cohort, follow-up started on January 1st 2020 (baseline) and ended on the earliest of December 14th 2021 (when the Omicron variant became dominant in England41, Fig. 1), the date that the outcome event of interest was recorded, and date of death. Exposure was defined as a recorded COVID-19 diagnosis between baseline and the earliest of eligibility for COVID-19 vaccination, date of first vaccination and June 18th 2021 (when all adults became eligible for vaccination): this exposure period was before the Delta variant became dominant in England. The other two cohorts were followed during the period when the Delta variant was dominant in England (Fig. 1): between June 1st 2021 (baseline) and December 14th 2021 (study end date). Follow-up in the ‘vaccinated’ cohort started at the later of baseline and two weeks after a second COVID-19 vaccination and ended at the earliest of the study end date, outcome event date, and date of death. The ‘unvaccinated’ cohort had not received a COVID-19 vaccine by 12 weeks after they became eligible for vaccination. Follow-up started at the later of baseline and 12 weeks after vaccination eligibility and ended at the earliest of the study end date, outcome event date, date of death, and date of first vaccination.

Individuals eligible for each cohort had been registered with an English GP for at least six months before the cohort baseline, were alive and aged between 18 and 110 years at baseline, and had known sex, region and area deprivation. Individuals with a history of COVID-19 before the cohort baseline were excluded. For each analysis, individuals with a prior history of the outcome analysed were excluded: these individuals were included in one of the subgroup analyses. In the vaccinated cohort, individuals who received a COVID-19 vaccination before December 8th 2020, or a second dose before or less than three weeks after their first dose, or received more than one type of vaccine before May 7th 2021, were excluded. In the unvaccinated cohort, individuals who could not be assigned to a vaccination priority group as defined by JCVI were excluded.

Statistical analyses

For each cohort, baseline demographic and clinical characteristics were described. The number of events per outcome, person-years of follow-up and incidence rates (per 100,000 person-years) of events before and after all, hospitalised and non-hospitalised COVID-19 were calculated. We tabulated the distribution of the number of days between COVID-19 diagnosis and subsequent COVID-19 hospitalisation. Time to first event was analysed for each outcome. Cox models were fitted with calendar time scale using the cohort-specific baseline as the origin (time zero). Hazard ratios (HRs) for follow-up after versus before or without a COVID-19 diagnosis were estimated, splitting follow-up into the day of COVID-19 diagnosis (‘day 0’), the remainder of 1-4 weeks and 5-28 weeks after COVID-19 diagnosis for all cohorts, and additionally 29-52 and 53-102 weeks after COVID-19 diagnosis for the pre-vaccination cohort. Because of limited numbers of events in some groups, in subgroup and sensitivity analyses, we estimated hazard ratios by combining day 0 with the rest of weeks 1–4. We additionally estimated hazard ratios 0, 1–6, 7–13, 14–20 and 21–27 days after COVID-19 diagnosis. For each outcome and cohort, we estimated: (i) age and sex; and (ii) maximally (including all potential confounders) adjusted HRs. We calculated ratios of hazard ratios (with corresponding 95% CI) comparing the maximally adjusted HRs for ATE and VTE during weeks 1-4 and 5-28 between the three cohorts. Subgroup analyses were conducted according to whether individuals had been hospitalised for COVID-19 within 28 days of COVID-19 diagnosis. In each cohort absolute excess risks (AER) of any ATE and any VTE after COVID-19, weighted by the proportions of individuals in age and sex strata in the pre-vaccination cohort, were derived based on the maximally adjusted hazard ratios for the cohort. Further details of the statistical analyses and AER calculations are provided in the supplementary material.

For the outcomes ATE and VTE, we conducted additional subgroup analyses by age group, sex, ethnicity, and prior history of the outcome. Further sensitivity analyses included removing censoring at first vaccination in the unvaccinated cohort and identifying outcomes using primary position only (main diagnosis or cause of death).

Data management and analyses were conducted in Python version 3.8.10 and R version 4.0.2.

Ethical approval and information governance

This study was approved by the Health Research Authority [REC reference 22/PR/0095] and by the University of Bristol’s Faculty of Health Sciences Ethics Committee [reference 117269]. Authors involved in data management/analysis successfully passed information governance training and obtained ONS safe researcher accreditation. NHS England is the data controller of OpenSAFELY-TPP. All outputs underwent disclosure checks and were approved by NHS England. Further details of OpenSAFELY information governance are provided in supplemental methods.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.