Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

A model and predictions for COVID-19 considering population behavior and vaccination


The effect of vaccination coupled with the behavioral response of the population is not well understood. Our model incorporates two important dynamically varying population behaviors: level of caution and sense of safety. Level of caution increases with infectious cases, while an increasing sense of safety with increased vaccination lowers precautions. Our model accurately reproduces the complete time history of COVID-19 infections for various regions of the United States. We propose a parameter \(d_I\) as a direct measure of a population’s caution against an infectious disease that can be obtained from the infectious cases. The model provides quantitative measures of highest disease transmission rate, effective transmission rate, and cautionary behavior. We predict future COVID-19 trends in the United States accounting for vaccine rollout and behavior. Although a high rate of vaccination is critical to quickly ending the pandemic, a return towards pre-pandemic social behavior due to increased sense of safety during vaccine deployment can cause an alarming surge in infections. Our results predict that at the current rate of vaccination, the new infection cases for COVID-19 in the United States will approach zero by August 2021. This model can be used for other regions and for future epidemics and pandemics.


Coronavirus Disease 2019 (COVID-19) began as a localized outbreak in Wuhan, China in December 2019 and quickly spread internationally to become a global pandemic. More than a year later, over 113 million people have become infected with COVID-19 with more than 2.5 million deaths worldwide1. To combat the spread of this virus, the Pfizer–BioNTech COVID-19 vaccine was approved in the United Kingdom on December 2, 20202, and the Pfizer–BioNTech and Moderna vaccines were subsequently approved for emergency use authorization in the United States3. In light of these recent developments, the potential impact of the distribution for COVID-19 vaccines is tremendous. Giving the public, health officials, and government, model-based trend predictions and additional guidance on effective vaccine distribution and potential problems is critical. In the last year, many papers have been published on modeling the COVID-19 pandemic4,5,6,7,8,9,10,11,12,13. Several modeling studies are based on differential equation compartment models involving compartments for susceptible, infectious, and recovered individuals, commonly referred to as SIR models14,15,16,17,18,19. Introducing additional compartments allows researchers to study the effect of vaccination and examine how to optimally distribute a vaccine. Matrajt et al.20 used an age-stratified population to determine the consequence of vaccine effectiveness and population coverage of the vaccine to indicate the optimal vaccine allocation. Bubar et al.21 accounted for the possibility of ruling out individuals with antibodies from receiving the vaccine using a serological test, and added an age-dependent effectiveness of the vaccine. Effects of vaccination have been examined in past outbreaks such as the 2009 H1N1 Swine Flu outbreak and the 2014–2016 Ebola epidemic22,23,24,25,26,27. These studies have introduced population compartments that separate the population by their location and give insights into what locations should receive vaccines first28.

A critical aspect of COVID-19 vaccination that remains unexplored is a population’s behavioral changes during the prolonged period of vaccination. While behavioral responses have not been addressed with respect to vaccines, efforts have been made to study the effects of non vaccine related behavioral changes for previous pandemics. These studies vary from the models on the effectiveness of social measures like quarantining and social distancing29,30,31 to characterizing the nature of spread of the disease32,33. One particular example that was reasonably effective in modeling behavioral changes during a pandemic was the closed-loop feedback in the compartmental SIR model presented by Perra et al.29. The authors examined behavioral changes by modeling the rate at which individuals enter self-imposed quarantine dependent on the number of infectious individuals. Over the last year, the time history of new infected regional cases has fluctuated drastically and has posed significant challenges for the infectious disease modeling community. A model that can represent the region/population specific COVID-19 cases accurately for the entire period of this pandemic has not yet been reported. We propose a mathematical model and a framework that incorporates the naturally occurring behavioral responses of a population to infectious cases coupled with possible additional behavioral changes exhibited during vaccination that has the ability to represent infection dynamics for the entirety of available COVID-19 case data in the United States (US) regions. We define “level of caution” to represent a population’s precautionary/safe behavior during an ongoing pandemic that results from a combination of increased social distancing, use of personal protection equipment, improved hygiene, and lockdown regulations. We also introduce “sense of safety” to represent a population’s return to normal, pre-pandemic behavior as more and more people are vaccinated. We introduce suitable mathematical forms to represent these two important behavioral aspects and incorporate these dynamic functions into our differential SIRDV model framework.

Fitting our model to available daily new infection case data for four major US states (Massachusetts, California, Florida and South Dakota) and two major US cities (Atlanta and New York City) for the first year of the pandemic, we show that our modeling framework is versatile at capturing a large range of developments of the COVID-19 pandemic over time, and it provides valuable insights into each population’s underlying social behavior. Introducing a vaccine to the population in our model, we analyzed the interaction between vaccine distribution rate and vaccine related additional behavioral responses of the population. We used our model to predict future trends for the pandemic with the advent of vaccine distribution.


We find that the time dependent infectious disease transmission rate \(\beta (t)\) is best given by \(\beta =\beta _0 \,f_I \, f_V\), where \(\beta _{0}\) is the population maximum infection transmission rate observed in the absence of any preventative societal measures. \(f_I\) and \(f_V\) are level of caution and sense of safety functions, respectively, proposed as:

$$\begin{aligned} \begin{array}{l} f_I=e^{-d_I I} \\ f_V=\dfrac{1}{f_I}+\left( 1-\dfrac{1}{f_I}\right) e^{-d_V V} .\\ \end{array} \end{aligned}$$

The function \(f_I\) models caution in a population, where its individuals take measures to reduce disease transmission through social distancing, personal protective equipment, hygiene, and local government mandates. The population’s level of caution to the number of infectious cases is determined by a factor \(d_I\), which was observed to change several times over a long duration in a given population due to changing population awareness and response, pandemic fatigue, seasons, and changing government mandates. These changes in sensitivity of the population to the number of infectious cases gives rise to the multiple peaks in the number of new infected cases observed nearly universally during the COVID-19 pandemic. \(f_I\) approaches 0 in the limiting case of very high values of level of caution factor \(d_I\), reflecting extreme cautionary measures by the population against the pandemic and leads to negligible disease transmission. A \(d_I\) value approaching 0 gives \(f_I\) as 1, reflecting a population whose behavior is approaching pre-pandemic levels of minimal disease related precautionary actions.

In addition, we included a competing sense of safety in our model, in which measures to reduce disease transmission are gradually decreased due to an increasing proportion of the population becoming vaccinated, offsetting the effects of a reduced transmission rate arising from an underlying level of caution. However, as modeled in Eq. (1), the net transmission rate will never exceed the base maximum transmission rate \(\beta _0\). As the sense of safety factor \(d_V\) approaches a very small value (population not dropping its guards down due to vaccinations), the sense of safety function \(f_V\) approaches 1 and \(f_V\) has no effect on \(\beta\). On the contrary, a high \(d_V\) reflects an increased sense of safety, causing cautionary measures against the disease transmission to be significantly reduced, leading to \(f_V = \frac{1}{f_I}\). In this case, infection-related level of caution is completely negated and the disease transmission rate \(\beta\) approaches the population’s highest transmission rate \(\beta _0\).

Infection data fit and interpretation for COVID-19 in the United States

We show that our modeling framework mathematically incorporates the dynamic level of caution within a SIRDV differential framework (shown in Fig. 7, represented by Eq. (2) and discussed in detail in the modeling approach section) is able to fit and predict the entire COVID-19 case history for the selected representative populations within the US. The model works for both pre-vaccine and during vaccination periods. Before vaccines become available, the vaccinated population fraction V in Eq. (1) stays zero giving \(f_V=1\) and naturally leading to no effects from vaccine related sense of safety. For conciseness, we show results for selected key populations. The model can be applied to other regions/populations within or outside of the United States as well. The model was fit to four US states (Massachusetts, California, Florida, and South Dakota) and two major US cities (New York City and Atlanta). These regions were chosen to represent a variety of population densities and varying geographical locations. We accounted for the fact that the reported cases were lower than the actual infection cases in the population due to lack of testing and asymptomatic cases using a factor M. M has a high value at the beginning of the pandemic due to lack of testing and reduces to a lower value as testing becomes more available. The simplified M shown in Fig. 1a is assumed following the Centers for Disease Control and Prevention's assessment that only 1 out of 4.6 COVID-19 cases were reported in the US for 2020. As shown in Fig. 1, our behavioral model was able to accurately model and fit representative states and cities across the United States with few parameters for each region. Estimated parameters for each region are shown in Table 1.

The level of caution factor \(d_I\) in Eq. (1) is shown in Fig. 2a for different regions over the first year of the COVID-19 pandemic. A high level of caution factor indicates that the population was quick to adapt their behavior in response to an increase in infections by taking increasingly stringent measures to reduce their transmission rate. Level of caution in a specific population changes due to addition or removal of local government regulations, new information regarding the disease, seasonal changes in behavior, pandemic fatigue, news leading to additional fear or any other factor that causes widespread changes in behavior and disease transmission rate. As expected, the results show that sudden drops in level of caution factor \(d_{I}\) tend to precede surges in new cases due to relaxed social measures. Conversely, a reduction in the influx of new cases will occur due to significant increase in \(d_{I}\). This level of caution is independent of the baseline maximum transmission rate, \(\beta _0\), and therefore provides a measure to compare social outlook towards the disease between different populations/regions. Time varying COVID-19 transmission rate \(\beta\) for each of the selected regions is shown in Fig. 2b. \(\beta _{0}\) describes transmission in the earliest stages of the pandemic for a population, when knowledge of the disease and social measures against it were limited. Therefore, this value also describes the transmission that a specific population can be expected to return to when the precautions against the infection becomes minimal, either as infectious cases approaches zero or as social response to the disease becomes very low. In addition to the infectious disease's inherent contagious characteristics, the base transmission rate depends on factors such as population density, contact rate, and everyday pre-pandemic behavior of its individuals. Likely due to such factors, we found that bustling New York City, was on the higher end of the baseline transmission rate with basic reproductive ratio for New York City obtained to be \(R_0 = \frac{\beta }{\gamma } = 4.5\)), whereas a less densely populated state like South Dakota has a much lower baseline transmission rate and a lower \(R_0\) value of 2.5. \(R_0\) values for other regions can be found from Table 1.

Figure 1
figure 1

Results of our behavioral model’s fit to COVID-19 data across six populations in the United States. (a) Shows M representing the ratio of actual infectious cases to reported cases. Reported new infection case data versus model fit for (b) Massachusetts, (c) Florida, (d) South Dakota, (e) California, (f) New York City, and (g) Atlanta. Gray bars represent the daily reports of new COVID-19 cases. The dashed brown lines are the corresponding 7-day average. Our model’s predictions for reported daily cases are shown as the solid red lines.

Table 1 Parameters for the four states and two cities for the 2020 COVID-19 pandemic.

To illustrate direct correlations between our model fit predictions and real life events, we take New York City as an example. Starting March 22, New York implemented the “New York State on PAUSE’ executive order, closing all non-essential businesses, canceling all non-essential gatherings, and mandated social distancing. This local government regulation is directly represented in the model results for level of caution which shows a significant increase in \(d_I\) following this mandate (Fig. 2a). Corresponding model results also show a sharp transition from one of the highest levels of disease transmission rates (Fig. 2b) to one of the lowest levels of transmission rates following this government mandate. Between September 1, 2020 and January 15, 2021, the model based \(d_I\) values show that the level of caution in New York City transitioned from one of the highest values to one of the lowest. When we examine real events, we find that starting September 2020, a series of citywide re-openings were introduced, including the opening of gyms, malls (at 50% capacity), public K-12 schools, and indoor dining (25% occupancy). This reopening coincided with the holiday season in the US at the end of the year 2020 and resulted in a significant spike in the new infection cases directly correlating with our model predictions. Therefore, the level of caution parameter \(d_I\) is a metric that quantifies a population’s behavior in response to an infectious disease outbreak; estimates of future \(d_I\) values will allow predictions of new infectious cases. There are clear trends in COVID-19 cases captured by our model that directly relate to local government mandated health regulations. These results suggest that we may be able to incorporate possible behavioral changes into our model representing future government regulations, along with changes in vaccination rates, to predict infection outcomes.

Figure 2
figure 2

(a) Level of caution factor \(d_I\) and (b) disease transmission rate \(\beta\) over the course of COVID-19 in six populations across the United States.

Future COVID-19 dynamics with vaccination

The model incorporates the effect of vaccination and the behavioral response of the population to growing number of people getting vaccinated due to the population’s sense of safety. This sense of safety counteracts the underlying level of caution that a population always has in response to the number of infectious cases. Predictions from our model with the presence of vaccination show that the future trajectory of the pandemic will strongly depend on population’s behavior in response to the disease and vaccination. In Fig. 3, we show a range of potential infection outcomes for different levels of caution to the infection and different senses of safety due to vaccination. The selected range of \(d_{I}\) and \(d_V\) represent reasonable extremes of the level of caution and sense of safety factors. For the future trend predictions, we use starting values based on California’s data, as a representative population (to avoid multiple curves and repetition) which has reasonable correlation with overall United States COVID-19 trends. The results, normalized as population fractions, provide critical COVID-19 future trends and insights that will be applicable to other regions as well. We have assumed a vaccine effectiveness \(\eta\) of 95% based on the initial estimates of the two leading vaccines34. \(\eta\) can be suitably selected for any other vaccination types or in the case of a different pandemic or epidemic with different vaccine effectiveness. The results are shown for estimated actual cases.

Figure 3
figure 3

Effect of (a) sense of caution factor, \(d_I\), and (b) sense of safety factor, \(d_V\) on pandemic trajectory. The dashed blue bar represents the introduction of a vaccine with 95% effectiveness, at a fixed rate of 0.3% of the population per day. Extreme values of \(d_{I}\) represent the extremes of social behavior: the most responsive (black dotted) and the least (red). Sensitivity analysis in (a) considered a fixed sense of safety \(d_V=0\), and in (b) considered a fixed level of caution \(d_I=500\).

Figure 3a models a population that does not alter their underlying behavior in response to introduction of a vaccine, and instead only responds to the increasing infectious cases by increasing personal safety measures. All simulated curves show a swift reduction in cases following the vaccine, though decisive action and stronger level of caution in response to the infection does show a considerable reduction in total infections. Figure 3b shows a scenario in which the population responds to the introduction of a vaccine by relaxing the social measures meant to slow the transmission. Although this sense of safety and increased normalcy may be a natural response to vaccines becoming more available, our predictions show that unfailing and continuing commitment to social preventative measures can significantly reduce the total number of future infections and even prevent a new surge and new peak that can happen if the population relaxes too soon. Note, regardless of the value of the sense of safety factor \(d_{V}\), the number of new cases per day drops to zero around the same time; however, the peak number of cases while progressing toward this point is very different for different \(d_{V}\) values.

We treat vaccination as a one-time event where an individual would receive an entire dose of the vaccine, despite the fact that some current vaccines require two doses that must be delivered at different times35. Given the much longer time scale of COVID-19 predictive curves, compared to the time gap between the two doses of m-RNA vaccines, it is reasonable to model vaccination rate by ignoring the time gap between the two doses and take the two doses combined as a single completed vaccine without significant loss of predictive accuracy. Unless otherwise noted, all vaccine distribution in this paper was modeled at a fixed rate of 0.3% of the population per day (0.6% receiving single dose of the two dose vaccines per day). This number represents approximately 2 million vaccine doses that are currently administered daily in the US. Because of the asymptomatic and unreported cases, the constant rate vaccination was applied proportionally to both the susceptible (\(\alpha _s\)) and recovered (\(\alpha _R\)) populations. For the remainder of the simulations (after Jan 2021), we assume that sense of caution, \(d_{I}\), remains fixed at a high level (\(d_{I}=500\)).

Figure 4
figure 4

Impact of vaccination rate, \(\alpha\), with two social responses to the vaccine. The population in (a) shows no changes in behavior in response to the vaccine, while the population in (b) relaxes its preventative measures as the vaccine is distributed.

The effects of the vaccination rate \(\alpha\) were examined (Fig. 4). We chose three values of \(\alpha\): the current rate of vaccination of \(0.3\%\) of the population per day36, a low (\(0.1\%\) per day), and a high rate (\(0.5\%\) per day). Note, the vaccination rate of \(0.1\%\) is not expected in the US but is shown to illustrate the consequences of low vaccination rate. As the vaccine distribution rate increases, the number of cases per day tends to zero quickly. However, as is shown in Fig. 4a,b, the population’s social response to the vaccine, \(d_V\), has a significant effect on the pandemic trajectory. In cases where preventative measures were abandoned more quickly and the population had an increased sense of safety in response to the vaccines (high values of \(d_V\)), increased vaccination rates still result in cases quickly tending toward zero, but before this happens, the number of cases per day increases rapidly. This behavior worsens as \(d_{V}\) increases, and this sharp increase occurs earlier as vaccination rate \(\alpha\) increases. Our results show that in the US, COVID-19 can be reasonably controlled by late summer of 2021 proceeding with the currently planned vaccination rate. The results elucidate the importance of local health and government authorities becoming aware of the fact that the sense of safety and vaccine distribution rate are related parameters. As has been shown, a faster vaccination rate significantly decreases the duration of the pandemic. However, if authorities intend to distribute a vaccine very quickly, they must be extra cognizant of the population’s behavioral response to it, as population relaxing its cautious practices could result in a noticeable increase in cases post vaccine rollout. If neglected, this peak under extreme circumstances could be disastrous. Therefore, based on our results, we recommend that proper disease transmission mitigating behavior be maintained, while welcoming a fast vaccine distribution rate.

Figure 5
figure 5

Contour plot of the sense of normalcy, \(d_{V}\), and vaccination rate, \(\alpha\), versus the total infections since the start of vaccination (red values shown as proportion of the total population). Shown for a high level of caution case (\(d_{I}\) = 500). The pink box shows a possibility of increasing number of total infected cases with increasing vaccination rate at very high sense of safety, and the teal box shows that once the vaccination rate crosses a threshold, the total number of infection cases drop with vaccination rate even at a very high sense of safety.

To further quantify the relative effects of the sense of safety and the vaccine distribution rate on the total number of infectious cases, the total number of individuals infected (as population fraction) after the start of vaccination were plotted with respect to \(\alpha\) and \(d_{V}\) in Fig. 5. This was done for a special case of a very high level of caution during vaccine rollout. As expected, for a given value of vaccination rate, the number of total infected cases increases as the sense of safety factor increases along the x-axis. This behavior is especially pronounced for low vaccination rates, when large increases in the sense of safety can result in significant numbers of total infections, up to 26\(\%\) as is shown in the yellow region of Fig. 5). Also, note that for very high value of \(d_V\), as the vaccination rate increases from 0, the number of infected cases quickly increases and then start to decrease again (pink box). For a very slow vaccination rate, vaccinated population dependent behavioral effects are limited due to our proposed relation for the sense of safety function \(f_{V}\), but quickly increase as the vaccinated individuals increase. This explains the increase in the total infections as one travels vertically in the pink box in Fig. 5. However, total infections then begin to decrease due to a critical vaccination rate being achieved, shown in the teal box. This reinforces the argument for the necessity of maximizing vaccine distribution; low vaccination rates can lead to behavior-related spikes in total cases, but these effects are mitigated as widespread vaccination outweighs these behavioral factors.

Figure 6
figure 6

Results for certain fractions of the population remaining unvaccinated. The entire COVID-19 case history and future predictions are shown in terms of estimated actual new infection cases for California as a representative example. Gray bars represent the daily reports of new COVID-19 cases. The dashed brown lines are the corresponding 7-day average. Our model predictions for the entire duration are also shown as green, gray and yellow curves.

We can expect some portion of the population to be unwilling or unable to receive a COVID-19 vaccine. The effect of the size of this group on the duration and severity of the pandemic was examined in Fig. 6. If large populations refuse vaccination, the duration of the pandemic can be prolonged.


Mathematical model

As shown in Fig. 7a, our model extends the general SIRD framework by adding the effect of vaccination and incorporating behavior based dynamics as an important capability specific to our study. The model consists of five compartments: Susceptible (S), Vaccinated (V), Infectious (I), Recovered (R), and Deceased (D). Here S, V, I, R, and D represent time dependent fractional variables with respect to the total population of the region of interest. Beginning in the susceptible compartment, individuals can follow the standard infection pathway through the infectious compartment then to either recovered or deceased. Alternatively, they can enter the vaccinated compartment following a fixed rate of vaccination \(\alpha\), where depending on the vaccine effectiveness \(\eta\), a subset of the vaccinated population \(V_S\) can become infected. The remainder of the vaccinated group \(V_R\) is successfully vaccinated and have no risk of becoming infected. Currently, the reinfection rate is very low and its effect can be neglected for the timescale of our study. Note that given the uncertainty in how rapidly the vaccines will be deployed in the future, for our predictions, we have used constant vaccination rates and have shown sensitivities to different vaccination rates. Time dependent vaccination rates can be easily implemented by selecting a suitable function for \(\alpha (t)\) in our model. Our SIRDV model for a region/population is described by the following equations:

$$\begin{aligned} \begin{array}{l} \dfrac{dS}{dt}=-\beta SI-\alpha \\ \dfrac{dV}{dt}=-\beta \left( 1-\eta \right) V I+\alpha \\ \dfrac{dR}{dt}=\gamma I \\ \vspace{0.03in} \dfrac{dI}{dt}=\Big (\beta S+\beta \left( 1-\eta \right) V-\mu \gamma -\gamma \Big )I \\ \dfrac{dD}{dt}=\mu \gamma I\\ \end{array} \end{aligned}$$

where \(\beta\) represents the dynamic transmission rate, \(\mu\) represents the mortality rate, and \(\gamma\) represents the recovery rate. The family of curves represented by this set of equations with constant parameters is considerably limited as it assumes that the population does not change its behavior at all over the course of the outbreak.

The significant differences and variations in disease transmission across different populations and over the course of the COVID-19 pandemic have shown that an understanding and modeling of dynamic population behavior changes is critical in predicting a real-world pandemic. To model these population behavioral attributes, we have incorporated a simple framework for a behavior-based, time-dependent net disease transmission rate \(\beta\) that is dependent on both the current infectious and vaccinated populations. With \(\beta _{0}\) as population maximum transmission rate, \(f_I (0 < f_I \le 1)\) level of caution function and \(f_V (1 \le f_V \le \frac{1}{f_I})\) as sense of safety function, we propose the following mathematical forms for behavior dependent transmission rate:

$$\begin{aligned} \begin{array}{l} f_I=e^{-d_I I} \\ f_V=\dfrac{1}{f_I}+\left( 1-\dfrac{1}{f_I}\right) e^{-d_V V} \\ \beta =\beta _0 \,f_I \, f_V \end{array} \end{aligned}$$

All the model parameters are described in Fig. 7b. The resultant effects of infectious and vaccinated populations on disease transmission rate \(\beta\) are shown in Fig. 7c,d for a range of \(d_I\) and \(d_V\) values. As shown, transmission rate decays to a smaller value at high infectious populations due to more cautionary and preventive actions with a higher level of caution. This decay slows significantly as a higher percentage of the population gets vaccinated due to the sense of safety from vaccination. The sensitivity of \(\beta\) to infectious population size is determined by the population’s \(d_I\), while the extent to which preventative measures are abandoned due to vaccine distribution is determined by \(d_V\). Note, from the mathematical form in Eq. (3), in the absence of vaccines, \(V=0 \implies f_V = 1\), which is physically and intuitively correct.

Figure 7
figure 7

(a) SIRDV compartment model with susceptible (S), vaccinated (V), infectious (I), recovered (R), and deceased (D) populations. Flow between between compartments is shown with arrows. (b) Description of model parameters. (c) and (d) Show variations in disease transmission rate \(\beta\), due to social response to infectious and vaccinated populations. (c) \(\beta\) for a range of level of caution factor \(d_I\) (100, 200, 400), and (d) \(\beta\) for a range of sense of safety factor \(d_V\) (1, 2, 4).

Model fit to specific regions

Combining Eq. (2) with our dynamic behavior model in Eq. (3), we are able to fit the complex, multimodal infection curves observed during the course of the COVID-19 pandemic.

We take the disease mortality rate and infectious period, \(\mu\) and \(\gamma\) (inverse of \(\gamma\) represents infectious period) as constants. The baseline transmission rate \(\beta _{0}\) was determined to be the parameter which was able to best fit the first rise in cases, where there was limited social response to the rise in infections. To represent the behavioral changes that were evident in the multiple peaks of the pandemic, we introduced multiple behavioral regions for each population. The model fit shows that each region had different level of caution factors (infection responsiveness) \(d_{I}\), which provides an estimate of how public perception of the disease varied in each region over the course of the pandemic. Each behavioral response is represented by a fixed \(d_I\) and smooth transitions were implemented via cosine interpolation, as displayed in Fig. 2a. To reduce the risk of overfitting our model, we limited the number of behavioral response changes for each location and found that, for the locations that were selected, a minimum of either three or four behavioral regions was sufficient to accurately represent the complete reported infection data. The differential equation model was implemented in MATLAB.

Model parameters were fit to daily new cases time series of four states and two cities: Massachusetts, California, Florida, South Dakota, New York City, and Atlanta. These regions were chosen to represent a variety of population densities, locations across the US, and responses to the pandemic. To determine the parameters, we used MATLAB’s bound-constrained optimization function37, minimizing the root mean square error between our model predictions and the reported number of cases per day. Simulations for each population began on March 15 and had an initial infected population equal to the number of new cases in the previous \(\frac{1}{\gamma }\) days, to estimate those who were still in the infectious period.

The reported infection case data is limited by the fact that tests were not universally available during the beginning stages of the pandemic. Additionally, certain infected populations did not show any symptoms38, but still may have transmitted the disease, further complicating estimates of new cases. To compare the actual case predictions from the model against the reported case, we introduced a factor M, which represents the number of actual cases per reported case. At the early stages of the pandemic, the awareness and testing was lacking but later on it improved significantly. We account for this by using a value of \(M = M_0\) for the initial stages of the pandemic which transition to a lower value of \(M= M_f\) well into the pandemic when testing becomes widely available. The transition between the two values of M is taken to be smooth using a sigmoidal function and is shown in Fig. 1a. The variation of M with time can be represented with

$$\begin{aligned} M=M_f+\frac{M_0-M_f}{1+e^{\delta _M(t - t_{M})}}, \end{aligned}$$

where \(\delta _M\) and \(t_{M}\) describe the smooth transition from \(M_0\) to \(M_f\).

Disease dynamics with vaccination

After fitting the model to reported real-world data, the effects of vaccination and its potential effects on behavioral response (sense of safety) were examined. Specifically, we chose the state of California as a representative case, which displayed an average infectivity rate of COVID-19 among the states and cities that we surveyed and an infection curve that was somewhat representative of the overall United States. Varying the vaccine distribution rate \(\alpha\), the sense of safety parameter \(d_{V}\), and the fraction of unvaccinated individuals \(S_{unvaccinated}\) in our model, we evaluated the progression of the disease by examining the predicted number of cases reported per day in the future. The predicted results are presented in Figs. 3, 4, 5, and 6.


We have developed an infectious disease dynamics model which accounts for behavioral changes in a population considering level of caution due to growing infectious individuals as well as a counteracting trend towards increasing normalcy, relaxing precautionary measures due to a sense of safety from increasing vaccine deployment. Our mathematical model accurately captures the infection trends for the first year of the COVID-19 pandemic for all of the US regions examined with a small number of parameters. A comparison of model parameters between different regions allows comparative insights between them. It demonstrates direct relationships between population behavior model parameters and major government actions that impact population behavior. It allows measurement of several important population and infectious disease specific quantities including highest disease transmission rate \(\beta _0\), disease transmission rate at any given time \(\beta\), and a measure of population’s behavior to reduce the disease transmission through parameter \(d_I\), where, in the absence of significant vaccination, \(\frac{d_I}{100}>> 1\) indicates safe response, and \(\frac{d_I}{100} < 1\) represents lack of caution. We found that although faster vaccine rollout will bring the COVID-19 end more quickly, there exist scenarios where fast vaccine rollout can give false sense of safety to the population, which will lead to a large short-term increase in infectious cases. This sense of safety could also cause weakening of restrictions by the local authorities, further exacerbating the pandemic, especially for areas hit with strains exhibiting lower vaccine efficacy in spite of decent vaccine coverage. We also found that if a large proportion of the population chooses to stay unvaccinated, this can have an adverse effect on the length of the pandemic. Prudence is required on the part of authorities to understand, predict, and limit any potential surge by increasing encouragement of all cautionary measures to prevent the spread of the virus. Our results indicate that in the United States, COVID-19 can be reasonably controlled by August 2021.

While our model is built on significant physical insights, population’s future behavioral aspects, presence of asymptomatic cases, a lack of exact knowledge about future vaccination rates, and other factors create some uncertainties. Therefore, although the quantitative predictions from our study are important, all possible uncertainties should be considered. Due to a reasonable homogeneity of vaccine distribution in the U.S., we have assumed that the vaccination rate is constant over the period of interest and for each population. It is important to note that vaccine roll out rate varies from region to region, especially when considering different parts of the world. In addition, for a specific region, the vaccination rate can vary significantly over a period of time. In this case, the vaccination rate \(\alpha\) can be assumed to be time dependent function in our model. In our predictions, we have not considered varying efficacies of vaccines against different SARS-COV-2 variants. In the future, as more data on the efficacy is available, this information can be incorporated in the model based predictions.

The results allow new insights into future COVID-19 trends and sensitivity of pandemic dynamics to various behavioral and other model parameters. As more exact information becomes available, new data can be directly incorporated in our model to produce more accurate results. Our study involved a reasonably diverse range of populations and their responses to the pandemic. The numerical values and ranges for the model parameters found in this study could be used as estimates to predict potential infection outcomes for scenarios where limited data is available (e.g., future pandemics). The proposed model provides a new framework for predicting infection dynamics of future pandemics and epidemics. As model based predictions get increasingly accurate, we expect that they will help guide informed policy decisions for the general public.

Data availability

COVID data was obtained from the Center for Systems Science and Engineering (CSSE) COVID-19 Data Repository at Johns Hopkins University39. Estimates for the total population of each region were obtained from the United States Census Bureau40.


  1. World Health Organization. WHO Coronavirus Disease (COVID-19) Dashboard (2021).

  2. Ledford, H., Cyranoski, D. & Noorden, R. V. The UK has approved a COVID vaccine—Here’s what scientists now want to know. Nature 588, 205–206 (2020).

    ADS  CAS  Article  Google Scholar 

  3. Commissioner, O. o. t. COVID-19 Vaccines. FDA (2021).

  4. Bertozzi, A. L., Franco, E., Mohler, G., Short, M. B. & Sledge, D. The challenges of modeling and forecasting the spread of covid-19. Proc. Nat. Acad. Sci. 117, 16732–16738 (2020).

    MathSciNet  CAS  Article  Google Scholar 

  5. Estrada, E. Covid-19 and sars-cov-2 modeling the present, looking at the future. Phys. Rep. 869, 1–51 (2020).

    ADS  MathSciNet  CAS  Article  Google Scholar 

  6. Giordano, G. et al. Modelling the covid-19 epidemic and implementation of population-wide interventions in Italy. Nat. Med. 26, 855–860 (2020).

    CAS  Article  Google Scholar 

  7. Kaxiras, E., Neofotistos, G. & Angelaki, E. The first 100 days: Modeling the evolution of the covid-19 pandemic. Chaos Solitons Fractals138 (2020).

  8. Korolev, I. Identification and estimation of the seird epidemic model for covid-19. J. Econ. 220, 63–85 (2021).

    MathSciNet  Article  Google Scholar 

  9. Tsay, C., Lejarza, F., Stadtherr, M. A. & Baldea, M. Modeling, state estimation, and optimal control for the us covid-19 outbreak. Sci. Rep. 10, 10711 (2020).

    ADS  CAS  Article  Google Scholar 

  10. Kucharski, A. J. et al. Early dynamics of transmission and control of covid-19: A mathematical modelling study. Lancet. Infect. Dis 20, 553–558 (2020).

    CAS  Article  Google Scholar 

  11. He, S., Peng, Y. & Sun, K. Seir modeling of the covid-19 and its dynamics. Nonlinear Dyn. 101, 1667–1680 (2020).

    Article  Google Scholar 

  12. Prem, K. et al. The effect of control strategies to reduce social mixing on outcomes of the covid-19 epidemic in Wuhan, China: A modelling study. Lancet Public Health 5, e261–e270 (2020).

    Article  Google Scholar 

  13. Kennedy, D. M., Zambrano, G. J., Wang, Y. & Neto, O. P. Modeling the effects of intervention strategies on covid-19 transmission dynamics. J. Clin. Virol.128 (2020).

  14. Roda, W. C., Varughese, M. B., Han, D. & Li, M. Y. Why is it difficult to accurately predict the covid-19 epidemic?. Infect. Dis. Modell. 5, 271–281 (2020).

    Article  Google Scholar 

  15. Liu, M., Thomadsen, R. & Yao, S. Forecasting the spread of covid-19 under different reopening strategies. Sci. Rep. 10, 20367 (2020).

    CAS  Article  Google Scholar 

  16. Acemoglu, D., Chernozhukov, V., Werning, I. & Whinston, M. D. Optimal targeted lockdowns in a multi-group sir model. Working Paper 27102, National Bureau of Economic Research (2020).

  17. Postnikov, E. B. Estimation of covid-19 dynamics “on a back-of-envelope”: Does the simplest sir model provide quantitative parameters and predictions?. Chaos Solitons Fractals 135 (2020).

  18. Brauer, F. Mathematical epidemiology: Past, present, and future. Infect. Dis. Modell. 2, 113–127 (2017).

    Article  Google Scholar 

  19. Kermack, W. O. & McKendrick, A. A contribution to the mathematical theory of epidemics. Proc. R. Soc. A 115, 700–721 (1927).

    ADS  MATH  Google Scholar 

  20. Matrajt, L., Eaton, J., Leung, T. & Brown, E. R. Vaccine optimization for COVID-19: Who to vaccinate first? medRxiv 2020.08.14.20175257 (2020).

  21. Bubar, K. M. et al. Model-informed COVID-19 vaccine prioritization strategies by age and serostatus. medRxiv 2020.09.08.20190629 (2020).

  22. Feng, Z., Towers, S. & Yang, Y. Modeling the effects of vaccination and treatment on pandemic influenza. AAPS J. 13, 427–437 (2002).

    Article  Google Scholar 

  23. Scherer, A. & McLean, A. Mathematical models of vaccination. Br. Med. Bull. 62, 187–199 (2002).

    Article  Google Scholar 

  24. Chowell, G., Tariq, A. & Kiskowski, M. Vaccination strategies to control Ebola epidemics in the context of variable household inaccessibility levels. PLoS Negl. Trop. Dis. 13 (2019).

  25. Lee, B. Y., Haidari, L. A. & Lee, M. S. Modelling during an emergency: The 2009 H1N1 influenza pandemic. Clin. Microbiol. Infect. 19, 1014–1022 (2013).

    CAS  Article  Google Scholar 

  26. Larson, R. C. & Teytelman, A. Modeling the effects of H1N1 influenza vaccine distribution in the United States. Value Health J. Int. Soc. Pharmacoecon. Outcomes Res. 15, 158–166 (2012).

    Article  Google Scholar 

  27. Potluri, R. et al. Impact of prophylactic vaccination strategies on Ebola virus transmission: A modeling analysis. PLoS ONE 15 (2020).

  28. Yu, Z. et al. Efficient vaccine distribution based on a hybrid compartmental model. PLoS ONE 11 (2016).

  29. Perra, N., Balcan, D., Gonçalves, B. & Vespignani, A. Towards a characterization of behavior-disease models. PLoS ONE6 (2011).

  30. Sardar, T., Nadim, S. S., Rana, S. & Chattopadhyay, J. Assessment of lockdown effect in some states and overall India: A predictive mathematical study on COVID-19 outbreak. Chaos Solitons Fractals 139 (2020).

  31. Kim, S., Seo, Y. B. & Jung, E. Prediction of COVID-19 transmission dynamics using a mathematical model considering behavior changes in Korea. Epidemiol. Health 42 (2020).

  32. Sarkar, K., Khajanchi, S. & Nieto, J. J. Modeling and forecasting the COVID-19 pandemic in India. Chaos Solitons Fractals 139 (2020).

  33. Reiner, R. C. et al. Modeling COVID-19 scenarios for the United States. Nat. Med. 1–12 (2020).

  34. Kim, J. H., Marks, F. & Clemens, J. D. Looking beyond covid-19 vaccine phase 3 trials. Nat. Med. (2021).

  35. News, V. K., Kaiser Health. Biden Aims for 100 Million COVID Vaccinations in First 100 Days (2021).

  36. Massachusetts Department of Public Health. COVID-19 Vaccination Program \(|\) (2021).

  37. MathWorks. Matlab bound constrained optimization using fminsearch (2021).

  38. Nogrady, B. What the data say about asymptomatic COVID infections. Nature 587, 534–535 (2020).

    ADS  Article  Google Scholar 

  39. Dong, E., Du, H. & Gardner, L. An interactive web-based dashboard to track COVID-19 in real time. Lancet. Infect. Dis 20, 533–534 (2020).

    CAS  Article  Google Scholar 

  40. Bureau, U. C. County Population Totals 2010–2019 (2019).

Download references


Authors would like to thank John Antolik and Thomas Bohac for discussions during the early phase.

Author information

Authors and Affiliations



T.U. and Z.L. performed simulations and helped develop the model. V.S. conceptualized and led the development of the model. All authors contributed to the writing of the manuscript.

Corresponding author

Correspondence to Vikas Srivastava.

Ethics declarations

Competing interest

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Usherwood, T., LaJoie, Z. & Srivastava, V. A model and predictions for COVID-19 considering population behavior and vaccination. Sci Rep 11, 12051 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:

Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing