Introduction

Two mRNA COVID-19 vaccines (BNT162b2 from Pfizer-BioNTech and mRNA-1273 from Moderna) are currently authorized and fully approved in the United States1,2. The interval between the 2-doses of the primary series recommended by the Food and Drug Administration (FDA) is three weeks for Pfizer-BioNTech and four weeks for Moderna3. While the majority of mRNA vaccine recipients in the U.S. received their primary doses close to these recommended timings, some missed the second dose or received it outside the recommended interval4. There were substantial differences in the completion of the primary series and adherence to the recommended schedule by race, ethnicity, age, and/or jurisdiction4.

Vaccine dosing schedules have an important public health relevance, especially in resource-limited settings. When facing a shortage of vaccine supply while experiencing high level of SARS-CoV-2 transmission, countries have considered delaying the administration of the second dose of mRNA vaccines as a pragmatic approach to achieve a higher coverage of the single dose in the population5. For example, the U.K. delayed the timing of second dose administration from three weeks to twelve weeks6. Because of the high relevance to public health, not only the immunogenicity7 but also the total public health impact of different dosing protocols should be rigorously estimated to inform policy decisions.

The variability of vaccine effectiveness or efficacy against SARS-CoV-2 infection by different timing of second dose administration was evaluated. Test-negative design (TND) studies in Canada found that, compared to the manufacturer-specified 3-4-week interval between the first and second doses, a 7–8-week interval increased the effectiveness against SARS-CoV-2 infection among the general population in British Columbia and Quebec8 and healthcare workers in British Columbia9. In contrast, no evidence of different effectiveness by dosing interval was found in longitudinal studies among households10 and healthcare workers in the U.K.11. While these findings provide important insights, the underlying study designs have limitations, such as the potential for selection bias for TND studies12. Also, previous studies compared effectiveness based on the incidence of COVID-19 only after people received the second dose. This fails to capture the fact that longer interdose intervals result in longer time at the lower level of protection afforded by a single dose, increasing the risk of infection during the interdose interval. In order to identify the optimal dosing protocol, it is critical to compare the risk of infection not only after people are fully vaccinated but also throughout the whole course of vaccination including the interdose interval.

In this work, we use a target trial emulation (TTE) approach to evaluate the effectiveness (direct effect) of dosing protocols based on different interdose intervals of the mRNA COVID-19 vaccines. A TTE accounts for the duration of time at the sub-optimal levels of protection experienced during the inter-dose interval, while avoiding common biases that can occur when the date of treatment receipt and the start of follow-up differ13. We show that delaying the timing of the second dose administration by approximately 1–2 weeks may provide stronger long-term protection against SARS-CoV-2 infection, but a longer delay would increase the risk.

Results

Descriptive statistics

A total of 6,128,364 mRNA COVID-19 vaccine recipients in Georgia, U.S., were included in our analysis (Fig. 1, Table 1, Supplementary Table 1). Of these, 517,966 (8.5%) people had confirmed SARS-CoV-2 infection before vaccination, 26,255 (0.4%) people had infection between the first and second doses, and 388,119 (6.3%) people had infection after the second dose (Supplementary Table 2). Of 5,350,766 individuals who had completed the primary series during the study period, 38,539 (0.7%) people received their second dose before the recommended interval, 4,337,660 (81.1%) received within the recommended interval, 834,219 (15.6%) received within the late-but-allowable interval, and 140,348 (2.6%) received within the late interval. Of 777,598 people who had received the first dose but not their second dose by the end of the study period, 717,051 (92.2%) were classified as the late group in the analysis because ≥43 days for Pfizer-BioNTech and ≥50 days for Moderna had passed since their first dose administration. White and Asian individuals were more likely to receive the second dose during the recommended interval compared to other racial groups (Supplementary Table 3).

Fig. 1
figure 1

Study population for the main analysis.

Table 1 Characteristics of the vaccine recipients stratified by vaccine manufacturers in Georgia, United States, December 2020-March 2022 (N = 6,128,364)

Target trial emulation: clone-censor weight analysis

For Pfizer-BioNTech recipients, the FDA-recommended protocol yielded the lowest weighted cumulative risk of SARS-CoV-2 infection until about 55 days after the first dose administration (Fig. 2). After that point, the weighted cumulative risk was considerably lower under the late-but-allowable protocol (risk ratio (RR) = 0.79 (95% confidence interval (CI): 0.78-0.81) on Day 120 after the first dose, compared to the FDA-recommended protocol) (Table 2). The risk under the FDA-recommended protocol became similar to that of the late protocol around Day 90 after the first dose. For Moderna recipients, the recommended protocol had the lowest risk until about 70 days after the first dose administration, while the late-but-allowable protocol had the lowest risk after that (RR = 0.89 (95% CI: 0.87-0.91) on Day 120 after the first dose, compared to the FDA-recommended protocol). The late protocol consistently yielded the highest risk. The estimated cumulative risk and 95% CI on Day t after the first dose (t = 1, 2, …, 180) can be found in the Supplement (csv file).

Fig. 2: Estimates of inverse probability of censoring-weighted cumulative risk functions of SARS-CoV-2 infection by the protocol for Pfizer-BioNTech (panel a) and Moderna recipients (panel b).
figure 2

Data are presented as point estimates (solid lines) and 95% confidence intervals (shaded areas represent) using a nonparametric bootstrap based on 200 resamples. Blue lines are estimates under the recommended protocol. Yellow lines are estimates under the late-but-allowable protocol. Pink lines are estimates under the late protocol.

Table 2 Inverse probability of censoring-weighted risk of SARS-CoV-2 infection on 50 and 120 days after the first dose administration by protocol, Georgia, United States, December 2020-March 2022

The late-but-allowable protocol resulted in the lowest risk for both individuals with and without prior reported infection (Supplementary Fig. 1). For adults ≥65 years of age, the recommended protocol and late-but-allowable protocol yielded similar risks, while the late protocol consistently resulted in the highest risk (Supplementary Fig. 2).

Sensitivity analysis

After delaying each interval by one week for the Pfizer-BioNTech vaccine, the weighted risk of infection was estimated to be consistently lowest under the scenario where the second dose was administered from 24-32 days after the first dose (i.e., FDA-recommended interval for Moderna) (Fig. 3). The estimated cumulative risk of infection under the “first dose only” protocol was similar to the risk under the late protocol (Supplementary Fig. 3). When analyzing data up to September or November 2021, the estimated cumulative risk was similar under the recommended and late-but-allowable protocols until five months post first dose administration, after which the recommended protocol had lower risk. The late protocol consistently resulted in the highest risk (Supplementary Figs. 45). For the rest of the sensitivity analyses, the results did not meaningfully change (Supplementary Figs. 610).

Fig. 3: Results of sensitivity analysis: Estimates of inverse probability of censoring-weighted cumulative risk functions of SARS-CoV-2 infection for Pfizer-BioNTech recipients, using Moderna’s dosing schedules.
figure 3

Data are presented as point estimates (solid lines) and 95% confidence intervals (shaded areas represent) using a nonparametric bootstrap based on 200 resamples. Blue lines are estimates under the recommended protocol. Yellow lines are estimates under the late-but-allowable protocol. Pink lines are estimates under the late protocol.

Discussion

Our approach compared the risk of SARS-CoV-2 infection under scenarios where the total study population in Georgia, U.S. had followed each of the protocols that varied the timing of second dose administration. The infection risk was compared for the whole course of vaccination, not only after the completion of the primary series but also between the first and second doses. Our findings suggested that mRNA vaccine recipients may gain stronger long-term protection against SARS-CoV-2 infection (as well as disease) by delaying their second dose by approximately 1-2 weeks, especially for Pfizer-BioNTech, in this setting. Previous studies in the U.K. and Canada found higher levels of neutralizing antibodies after the delayed second dose14,15,16, which may explain one of the possible mechanisms behind our findings. Multiple factors likely contributed to the differences in the cumulative risk of infection across protocols, such as the increased protection due to possibly improved immune response from the delayed second dose, the increased risk of infection during the extended interdose intervals, and waning immunity. With the TTE approach, we were able to evaluate the comparative effectiveness of different interdose intervals, incorporating these various factors. It is important to note that a delayed second dose leads to a short-term higher risk, which may be particularly relevant if the force of infection in the community is high during that time. We found that a longer delay in the second dose would pose a negative impact in both the short and long-term due to a prolonged time at the lower level of protection induced by a single dose. Interestingly, for Pfizer-BioNTech, the risk of infection under the FDA-recommended protocol was similar to that under the late protocol, and the late-but-allowable protocol conferred stronger long-term protection. The sensitivity analysis showed that Pfizer-BioNTech recipients would gain stronger long-term protection when delaying the second dose administration by a week, providing public health implications.

Our results showed trends consistent with current data and knowledge of COVID-19 mRNA vaccines. First, the decrease in the slope of the estimated cumulative risk of infection after around Day 7-14 since the first dose administration was consistent with trial data that showed that a similar time would be required to obtain immunological protection after the first dose17. Second, the estimated cumulative risks were identical until around Day 30 when the FDA-recommended protocol began to show stronger protection, consistent with expectations of the timing of increased protection after receipt of the second dose. Similarly, the cumulative risk under the late-but-allowable protocol decreased at the expected time.

Like other more commonly-used methods, the TTE approach handles baseline confounding (by making exact copies of individuals) and informative censoring by protocol non-adherence (with inverse probability of censoring weights (IPCW) that accounts for covariates). Additionally, TTE approach has important advantages. First, our approach yields results with a similar interpretation as those from a trial that randomized people to different vaccine protocols. Second, the method avoids the immortal person-time bias that can occur when the completion of the protocol (i.e., second dose administration) and the start of follow-up (i.e., date of first dose administration) are misaligned. Using this approach, we were able to account for the extended duration of weak-protection time with a single dose under the delayed protocols, enabling us to evaluate the effectiveness of the whole course of different dosing protocols and providing implications for public health policy. This differs from the more traditional analysis, such as that using the Cox proportional hazards (PH) model, that compares the hazard of infection after the completion of the primary doses across individuals who received the second dose at different timing. Expected changes in comparative effectiveness over time, due to waning immunity for example, would also violate the proportional hazards assumption making this analysis inappropriate. The Cox PH model was used for the calculation of IPCW because these factors do not affect the probability of remaining uncensored. Recognizing its advantages, researchers have started applying TTE approaches to vaccine evaluation18,19,20,21,22,23, and it could be applied to inform boosting strategies13,18,24. Some important research questions remain for the use of TTE for vaccine evaluation, such as interference. In our study, as we focused on vaccinated individuals, we assumed that the impact of changes in the population-level transmission under each protocol would be smaller, as the transmission level was largely determined by unvaccinated individuals.

Limitations of our study include that the reported data on vaccination and test results, as well as their linkage, may not be perfectly accurate. If Georgia residents moved out of state, received a dose outside the state, or relied solely on at-home COVID tests, such information was not captured. There may be a difference in testing rates between individuals who received their second dose during the FDA-recommended interval and those who received it later. We expected that the impact of this difference would be smaller as we only analyzed data from vaccinated individuals and did not include unvaccinated people who were more likely to have different testing rates. We could not analyze outcomes other than SARS-CoV-2 infection, such as death and hospitalization, because the information was frequently missing and the dates associated with those outcomes were unreliable because of the challenges with case follow-up. We could not adjust for variables that may be time-varying and may have influenced infection and the timing of second dose administration, such as comorbidities, employment status, use of non-pharmaceutical interventions, and results of at-home testing, due to the lack of data. To assess the impact of these time-varying factors, it is essential to collect relevant data through longitudinal cohort studies. Longitudinal cohort data and surveillance data each have distinct advantages and limitations. While statewide surveillance data do not capture detailed individual-level health data and time-varying factors, it enabled us to evaluate dosing schedules across fine intervals among the general population throughout the state. The availability of at-home tests changed over time, especially around late 2021 and early 2022. The rate of at-home testing may have differed across interdose interval groups, although the difference might be smaller than that between vaccinated and unvaccinated groups. Another limitation is the lack of data on competing risks, with the primary concern being deaths from non-COVID causes. In conventional survival analysis methods, such as Kaplan-Meier curves, competing risks could theoretically introduce bias in vaccine effectiveness. However, cumulative risks in our study were calculated without relying on hazard functions sensitive to competing risks25; they were rather calculated by dividing the weighted number of cumulative cases on each day by the total number of cohort samples. We were unable to appropriately censor people with competing risks from the censor weight model due to the lack of data on the presence and timing of these competing risks, which may lead to bias when estimating the censoring weights.

Our study showed how the effectiveness of the mRNA COVID-19 vaccines against SARS-CoV-2 infection varied by the timing of the second dose administration among the general population in Georgia, providing policy-relevant implications. Delaying the timing of the second dose administration by approximately 1-2 weeks may help to reduce the risk of SARS-CoV-2 infection in the longer term, especially for Pfizer-BioNTech vaccine. The evaluation for multi-dose vaccination campaigns should be conducted early and periodically to provide evidence of vaccine effectiveness as an outbreak evolves.

Methods

Study population

Our study population included individuals who received at least one dose of an mRNA COVID-19 vaccine between December 13, 2020 and March 16, 2022 in Georgia, U.S. (Fig. 1). We excluded 4374 (0.1% of mRNA COVID-19 vaccine recipients) people who received their second dose ≤3 days after their first dose because of likely data entry errors. Children <5 years of age were excluded as they were not eligible for COVID-19 vaccination during our study period and their primary dosing schedule was different from those for people ≥5 years of age. Recipients of non-mRNA COVID-19 vaccines were not included in the study. We excluded 89,885 (1.4%) individuals who received their second dose more than 180 days after their first dose, since individuals who received their second dose beyond this time likely received a booster dose at that time while their true second dose was received outside of Georgia or otherwise misrecorded (Supplementary Fig. 11).

The number of confirmed COVID-19 cases can be found on the Georgia Department of Public Health (GDPH) website26. The most common SARS-CoV-2 variant in Georgia during the study period was Alpha (February-June 2021), Delta (July-November 2021), and Omicron (December 2021-March 2022)27.

Data source

We extracted the information on vaccine manufacturer, date of receipt of each vaccine dose, demographic characteristics (age, gender, race, ethnicity), and geographic region of residency (18 public health districts of residency)28 from the GDPH vaccine database29. Race (White, Black, Asian, American Indian and Alaska Native Resources, Native Hawaiian and Pacific Islander, or other) and ethnicity (Hispanic or non-Hispanic) were self-reported. We also extracted SARS-CoV-2 test results from the State Electronic Notifiable Disease Surveillance System (SendSS), an electronic database to track patients with notifiable diseases, including COVID-19 cases, across Georgia. Data are reported to the GDPH from laboratories, hospitals, and providers through SendSS and/or Electronic Laboratory Reports (ELR). The vaccine data and SARS-CoV-2 test results were linked by GDPH, using first name, last name, and date of birth.

Exposure and outcome

The protocols under investigation were defined based on the timing of the second dose administration of an mRNA COVID-19 vaccine relative to the first dose. We used the following three categories to characterize different interdose intervals: the FDA-recommended interval (17–25 days for Pfizer-BioNTech and 24-32 days for Moderna; “recommended” protocol), longer than the FDA-recommended interval but within the allowable interval (26-42 days for Pfizer-BioNTech and 33-49 days for Moderna; “late-but-allowable” protocol), and after the allowable interval (≥43 days for Pfizer-BioNTech and ≥50 days for Moderna; “late” protocol) (Supplementary Table 4)4. Our outcome was SARS-CoV-2 infection defined as a positive result of real-time reverse transcriptase PCR test or antigen test.

Covariates

We included demographic characteristics (age in years, sex, race, and ethnicity), public health districts of residence, and the presence of reported COVID-19 infection before vaccination to account for confounding in the analysis. We also adjusted for the calendar month and year of the first dose of vaccination (categorical) to account for changing levels of community transmission throughout the pandemic, varying SARS-CoV-2 prevention policies over time (e.g., mask mandates), and the different severity and transmissibility of SARS-CoV-2 variants.

TTE (clone-censor weight analysis)

We employed a TTE approach (clone-censor-weight analysis) to understand how the different intervals between the first and second doses of the primary series of mRNA COVID-19 vaccines may change the risk of SARS-CoV-2 infection after the first dose administration (Table 3)13,24,30. This method mimics a per-protocol analysis of a randomized controlled trial in which individuals are randomly allocated to alternative dosing protocols.

Table 3 Specification and emulation of a target trial of different interdose intervals between the first and second doses of mRNA COVID-19 vaccines and the risk of SARS-CoV-2 infection in Georgia, U.S. in 2020-2022, using observational data from Georgia Department of Public Health

We created three copies of the longitudinal dataset corresponding to the aforementioned three mRNA COVID-19 vaccination protocols of interest (FDA-recommended, late but allowable, and late)18. In each copy, individuals were followed up from the index date (i.e., the day each individual received their first dose) until at the earliest of SARS-CoV-2 infection, protocol nonadherence, or end of study. This method addresses measured confounding at baseline because the copies of each observation are identical at the start of follow-up. In each protocol-specific copy, a vaccine recipient who did not follow a given protocol was considered nonadherent and was censored at the time their vaccination course differed from the protocol. To explain the process of cloning, we created an illustrative example of the study population with five individuals in Fig. 4. Individual A in Fig. 4 received the second dose within the FDA-recommended interval, and thus, it was followed up until the end of the study period in the copy for the FDA-recommended protocol (i.e., survival time T days), while it was censored on the day of the second dose administration (Day 21) in the copies for the late-but-allowable protocol and the late protocol (i.e., survival time 21 days). Individual B was censored on the date of second dose administration (Day 13) in all copies. Individual C was censored on the last day of the FDA-recommended interval (Day 25) in the FDA-recommended protocol copy (i.e., survival time 25 days), while it was followed up until the day of COVID-19 infection in the late-but-allowable protocol copy (i.e., survival time 36 days) and censored on the day of second dose administration (Day 31) in the late protocol copy (i.e., survival time 31 days). Individual D received the second dose during the late interval, and thus, it was censored on the last day of the FDA-recommended interval in the FDA-recommended protocol copy (i.e., survival time 25 days) and on the last day of the late-but-allowable interval in the late-but-allowable protocol copy (i.e., survival time 42 days). Individual D was followed up until the end of the study period in the late protocol copy (i.e., survival time T days). Individual E was followed up until the day of COVID-19 infection (Day 7) in all copies (i.e., survival time 7 days). The complete set of conditions on censoring in each of the three copies of the longitudinal dataset corresponding to the mRNA COVID-19 vaccine protocols is available in Supplementary Table 5.

Fig. 4
figure 4

Example of the longitudinal dataset for five vaccine recipients and its three copies corresponding to different vaccine protocols (recommended, late but allowable, and late) for Pfizer-BioNTech mRNA COVID-19 vaccine.

Informative censoring due to protocol non-adherence was addressed with an IPCW. We fit a Cox PH model to the longitudinal dataset under each protocol where the outcome of being censored was adjusted for the aforementioned covariates. Subsequently, this model was used to estimate the probability of remaining uncensored at each person’s event time. The reciprocal of this probability served as the censoring weights. The weights were designed to upweight individuals who remain adherent to the vaccine protocol at each time to have the same covariate distribution as the entire study population, thus creating a weighted population that represents the entire study population had all individuals remained adherent to the certain vaccine protocol throughout follow-up.

For each of the three protocol-specific copies, we calculated the cumulative risk of SARS-CoV-2 infection had the total study population followed the corresponding protocol. We calculated the RR, setting the FDA-recommended protocol as a reference. We computed 95% CIs using a nonparametric bootstrap based on 200 resamples31,32. The cumulative risk of infection was also stratified by age group (<65 and ≥65 years of age) and the presence of reported prior infection.

Sensitivity analysis

In the first sensitivity analysis, we aligned the duration and timing of each protocol for both Pfizer-BioNTech and Moderna vaccines (early: ≤23 days; recommended: 24-32 days; late-but-allowable: 33-49 days; late: ≥50 days) (Supplementary Table 4). Second, we ended the follow-up period at the earliest of SARS-CoV-2 infection, protocol nonadherence, the end of the study period, or 180 days after the first dose administration. Third, we ran the analysis without excluding people who received their second dose >180 days after their first dose administration. Fourth, we excluded 1,335,643 people (21.8%) with missing information on sex, race, ethnicity, and/or public health district. This was because it is probable that their infection data were missing, although we considered them not infected by SARS-CoV-2 in the main analysis. Fifth, we created an additional protocol, “first dose only,” under which we followed up individuals until the earliest of SARS-CoV-2 infection, receipt of the second dose, or the end of the study period. Sixth, we estimated the comparative effectiveness for different time periods: up to September 2021 (before the booster dose became available) and up to November 2021 (before the Omicron wave). Seventh, we used natural splines for age and the date of the first dose administration in the Cox PH model to calculate the probability of being censored. Lastly, we increased the number of the nonparametric bootstrap resamples from 200 to 1000 to computed 95% CIs.

Software

All analyses were conducted with R (R Center for Statistical Computing; Vienna, Austria) v4.2.1. Censoring weights were estimated using the ‘survival’ package33. R scripts can be found at the following GitHub repository: https://github.com/KayokoShioda/COVID_mRNA_TTE_2ndDose.

Ethics statements

This activity was determined by the GDPH Institutional Review Board to be non-research and consistent with public health surveillance as per title 45 code of Federal Regulations 46.102(l)(2).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.