Modeling and forecasting the early evolution of the Covid-19 pandemic in Brazil

Bastos, Saulo B.; Cajueiro, Daniel O.

doi:10.1038/s41598-020-76257-1

Download PDF

Article
Open access
Published: 10 November 2020

Modeling and forecasting the early evolution of the Covid-19 pandemic in Brazil

Saulo B. Bastos¹ &
Daniel O. Cajueiro^1,2,3

Scientific Reports volume 10, Article number: 19457 (2020) Cite this article

6037 Accesses
97 Citations
Metrics details

Subjects

Abstract

We model and forecast the early evolution of the COVID-19 pandemic in Brazil using Brazilian recent data from February 25, 2020 to March 30, 2020. This early period accounts for unawareness of the epidemiological characteristics of the disease in a new territory, sub-notification of the real numbers of infected people and the timely introduction of social distancing policies to flatten the spread of the disease. We use two variations of the SIR model and we include a parameter that comprises the effects of social distancing measures. Short and long term forecasts show that the social distancing policy imposed by the government is able to flatten the pattern of infection of the COVID-19. However, our results also show that if this policy does not last enough time, it is only able to shift the peak of infection into the future keeping the value of the peak in almost the same value. Furthermore, our long term simulations forecast the optimal date to end the policy. Finally, we show that the proportion of asymptomatic individuals affects the amplitude of the peak of symptomatic infected, suggesting that it is important to test the population.

Modelling transmission and control of the COVID-19 pandemic in Australia

Article Open access 11 November 2020

Inferring the spread of COVID-19: the role of time-varying reporting rate in epidemiological modelling

Article Open access 24 June 2022

Differences in COVID-19 cyclicity and predictability among U.S. counties and states reflect the effectiveness of protective measures

Article Open access 31 August 2023

Introduction

The world has seen an ongoing pandemic of COVID-19 (coronavirus 2) caused by severe acute respiratory syndrome SARS-CoV-2. According to the World Health Organization (WHO)¹, although most people infected with it will present mild respiratory symptoms, or no signs of the disease, and recover without needing special treatment, older people, and those with severe medical conditions like diabetes, cardiovascular disease, or chronic respiratory disease may develop serious illness. While the COVID-19 outbreak was first identified in Wuhan, Hubei, China, in December 2019, we could only confirm the first case in Brazil on February 25, 2020. The first known patient in Brasil was a 61-year-old man from São Paulo who had returned from Lombardy (Italy) and tested positive for the virus. Since then, we may confirm 4579 cases and 159 deaths (March 30, 2020) in roughly the entire Brazilian territory. Like in the rest of the world², the Brazilian government response to the pandemic has been the introduction of measures to ensure social distancing, such as schools closure, restricting commerce, banning public events and home office.

We use the Brazilian recent data from February 25, 2020 to March 30, 2020 to model and forecast the evolution of the COVID-19 pandemic. Our study focuses on the early period of the pandemics that accounts for unawareness of the epidemiological characteristics of the disease in a new territory, sub-notification of the real numbers of infected people and the timely introduction of social distancing policies to flatten the spread of the disease. This work has had the practical appeal for providing preliminary estimates of Covid-19 epidemiological parameters and the duration of the social distancing policy in Brazil.

The computational modeling of infectious diseases comprises a large collection of models^3,4,5. In order to model the evolution of the Covid-19 in Brazil we modify two versions of the the Susceptible-Infected-Recovered (SIR) model⁶ to consider the effects of social distancing measures in the evolution of the disease. The SIR model describes the spread of a disease in a population split into three non-intersecting classes: susceptible (S) are individuals who are healthy but can contract the disease; Infected (I) are individuals who are sick; Recovered (I) are individuals who recovered from the disease. Due to the evolution of the disease, the size of each of these classes change over time and the total population size N is the sum of these classes

$$\begin{aligned} N(t)=S(t)+I(t)+R(t). \end{aligned}$$

(1)

Let $\beta $ be the average number of contacts that are sufficient for transmission of a person per unit of time t. Then $\beta I/N$ is the average number of contacts that are sufficient for transmission with infective individuals per unit of time of one susceptible and $(\beta I/N)S$ is the number of new cases per unit of time due to the S susceptible individuals. Furthermore, let $\gamma $ be the recovery rate, which is the rate that infected individuals recover or die, leaving the infected class, at constant per capita probability per unit of time.

Based on these definitions, we can write the SIR model as

$$\begin{aligned} \begin{array}{rcl} \displaystyle \frac{dS}{dt} &{} = &{} \displaystyle -\frac{\beta IS}{N}\\ \displaystyle \frac{dI}{dt} &{} = &{} \displaystyle \frac{\beta IS}{N} - \gamma I \\ \displaystyle \frac{dR}{dt} &{} = &{} \displaystyle \gamma I \\ \end{array}\;\;\;\mathbf{[SIR] }. \end{aligned}$$

(2)

It is worth mentioning that we can also evaluate the number of recovered individuals from Eq. (1) using also the number of susceptible and infected individuals, since in this version of the SIR model (Eq. 2) the population is constant. Actually, since we are modeling a short term pandemic, we do not consider the demographic effects and we assume that an individual does not contract the disease twice. We do not implement this model, we only included it for the sake of reference.

We actually want to estimate the fraction of people that die from the disease. Then we include a probability $\rho $ of an individual in the class I dying from infection before recovering⁴. In this case, we get the following set of equations

$$\begin{aligned} \begin{array}{rcl} \displaystyle \frac{dS}{dt} &{} = &{} \displaystyle - \frac{\beta I S}{N}\\ \displaystyle \frac{dI}{dt} &{} = &{} \displaystyle \frac{\beta I S}{N} - \gamma I - \frac{\rho }{1 - \rho } \gamma I = \displaystyle \frac{\beta I S}{N} - \frac{\gamma I}{1 - \rho }\\ \displaystyle \frac{dR}{dt} &{} = &{} \displaystyle \gamma I \\ \displaystyle \frac{dD}{dt} &{} = &{} \displaystyle \frac{\rho }{1-\rho } \gamma I\\ \end{array}\;\;\;\mathbf{[SIRD] }, \end{aligned}$$

(3)

where $\frac{\rho }{1-\rho } \gamma I$ is the number of people in the population that die due to the disease per unity of time and D is the number of people that die due to the disease. Note that in this case the number of individuals in the population reduces due to the infection according to $\frac{dN}{dt} = -\frac{\rho }{1 - \rho } \gamma I$. For the ease of reference, we call this model “SIRD” (Susceptible-Infected-Recovered-Dead) model.

Since, in the case of the COVID-19, there is a relevant percentage of the infected individuals that are asymptomatic, we split the class of infected individuals in symptomatic and asymptomatic^7,8,9:

$$\begin{aligned} \begin{array}{rcl} \displaystyle \frac{dS}{dt} &{} = &{} \displaystyle - (\beta _A I_A + \beta _S I_S)\frac{ S}{N} \\ \displaystyle \frac{dI_A}{dt} &{} = &{} \displaystyle (1-p)(\beta _A I_A + \beta _S I_S)\frac{ S}{N} - (\gamma _A) I_A \\ \displaystyle \frac{dI_S}{dt} &{} = &{} p (\beta _A I_A + \beta _S I_S)\frac{ S}{N} - \frac{\gamma _S I_S}{1 - \rho } \\ \displaystyle \frac{dR_A}{dt} &{} = &{} \displaystyle \gamma _A I_A \\ \displaystyle \frac{dR_S}{dt} &{} = &{} \displaystyle \gamma _S I_S \\ \displaystyle \frac{dD}{dt} &{} = &{} \displaystyle \frac{\rho }{1 - \rho } \gamma _S I_S \\ \end{array}\;\;\;{\mathbf{[SIRASD] }}, \end{aligned}$$

(4)

where $I_A$ is the number of asymptomatic individuals, $I_S$ is the number of symptomatic individuals, $R_A$ and $R_S$ are the recovered individuals from the asymptomatic and symptomatic infection, respectively, and p is the proportion of individuals who develop symptoms. For ease of reference, we call this model “SIRASD” (Susceptible-Infected-Recovered for Asymptomatic-Symptomatic and Dead) model. Like the SIRD model, the condition that N is constant does not hold anymore and if we need to evaluate N over time, we need to integrate $\frac{dN}{dt} = -\frac{\rho }{1 - \rho } \gamma _S I_S$.

In order to consider the effect of the social distancing policy, we modify the transmission factors of Eqs. (3) and (4) by multiplying them by a parameter $\psi \in [0, 1]$, when the date belongs to the period of the implementation of government policy. Otherwise, we use $\psi =1$. To be precise, we replace $\beta $ in Eq. (3) by $\psi \beta $, $\beta _A$ in Eq. (4) by $\psi \beta _A$ and $\beta _S$ in Eq. (4) by $\psi \beta _S$. Note that doing this procedure we avoid the introduction and estimations of new “$\beta $s” and we may use $\psi $ to evaluate the effectiveness of social distancing policy. In the end, we may measure the social distance as $1-\psi $.

Our models provide estimates of the epidemiological parameters, that are consistent with the international literature, and good forecasts of the short-term Brazilian time series of infected individuals in Brazil. Furthermore, one of our models assesses the number of asymptomatic (or individuals with mild symptoms that do not look for the hospitals and are not being tested). We use these models to simulate long-term scenarios of the pandemics that depend on the level of engagement of the Brazilian social distancing policy. We show that: (1) the social distancing policy imposed by the government is able to flatten the pattern of contamination provided by the COVID-19; (2) there is an optimal date for abandoning the social distancing policy; (3) short-term social distancing policies only shift the peak of infection into the future keeping the value of the peak in almost the same value. (4) The proportion of asymptomatic individuals affects the amplitude of the peak of symptomatic infected, meaning that it is important to invest in testing the population, massively or by random sampling.

Our work relates to the recent interesting contributions^10,11,12,13 in the sense that all these works try to model the spread of the COVID-19 and to evaluate the countermeasures against this virus. However, our paper differs from these works in the following dimensions: (1) data: our work focuses in Brazilian data. This is an important characteristic since different countries may present different demographies and we know that the COVID-19 is riskier for older populations that appear with higher proportion in developed countries. Furthermore, the level of nutrition of the population of the country may affect the probability of contracting and developing the disease. The quality of data may vary from developed countries to underdeveloped ones and, in our paper, we do not use data from other countries to calibrate our models. (2) Model: we use variations of the SIR model mentioned above. One of the advantages of the SIR model is the simplicity and researchers have used this model in several successful attempts to model the spread of infectious diseases^14,15,16,17. (3) Estimation: our paper estimates all the parameters based on a clear hierarchical procedure based on squared error minimization.

Results

Data analysis

Table 1 Estimated values of the epidemiological parameters.

Full size table

Table 2 Estimated values of $\psi $ for the SIRD and SIRASD models and the impact on the basic reproductive number R.

Full size table

We use the real data provided by the Ministry of Health of Brazil from February 25, 2020 to March 30, 2020 in our estimations. If we change the final date of the period of estimation of the epidemiological parameters of the model, we note that there is a structural change in the data suggesting the effectiveness of the social distancing policy. It is worth mentioning that it is hard to know exactly when social distance measures took effect mostly because there is a variable incubation period of the virus given by a range from 2 to 10 days¹⁸ and some initiatives of social distance measures (such as home office) started even before the official implementation of the social distancing policy. In fact, after March 23, 2020, we are able to see in the data three consecutive reductions in the first difference of the cumulative number of infections, so depending on the final date that is used for the estimation of the SIRD model, the estimated parameters cannot fit the real data anymore, as shown in Fig. 1. Thus, we define two estimation periods: (1) February 25, 2020–March 22, 2020, in which we estimate the epidemiological parameters of Eqs. (3) and (4); (2) March 23, 2020–March 30, 2020, in which we estimate the paramter $\psi $.

Regarding the estimation of the epidemiological parameters of Eqs. (3) and (4), we estimate all parameters of our model by minimizing the squared error of integrated variables and their real values^5,28. We proceed in a hierarchical procedure. We start by estimating the parameters of the SIRD model, namely $\beta $, $\gamma $ and $\rho $ by minimizing the squared error

$$\begin{aligned} \begin{array}{cc} \min _{\beta , \gamma , \rho }&\frac{1}{2} \left( \sum _{t}{ f \left( [(I_t^{cum} - D_t) - (\hat{I}_t+\hat{R}_t)]^2 \right) + f\left( [D_t - \hat{D}_t]^2\right) } \right) \end{array}, \end{aligned}$$

(5)

where $I_t^{cum}$ and $D_t$ are the cumulative number of infected individuals and deaths, which are the real data provided by the Ministry of Health of Brazil, and $\hat{I}_t$, $\hat{R}_t$ and $\hat{D}_t$ are estimated values of the infected, recovered and deaths, respectively. We use the nonlinear function $f(z) = C^2 \log \left( (g(z)/C)^2 \right) $ to correct the exponential characteristic of the series so that the errors of the last values of the series do not dominate the minimization, where $g(z)=\log (1+z)$. Furthermore, we use the scaling parameter $C=2$ to soft threshold between inliers and outliers. Using this procedure, we note that the estimated epidemiological parameters vary less among simulations with different random seeds.

After estimating the SIRD model, we proceed by estimating the SIRASD model. Note that we lack information on the number of asymptomatic individuals, since the clear recommendation of the Ministry of Health is to test for the virus only if one has moderate or severe symptoms. Otherwise, follow the “stay at home” policy, which recommends individuals with mild symptoms to stay at home and do not seek for medical attention. Furthermore, the mortality rate is evaluated mostly over the symptomatic ones, since the asymptomatic are in many cases not tested. Therefore, we suppose that $\beta _S=\beta $, $\gamma _S=\gamma $ and we keep the value of $\rho $. Using these parameters, and assuming that there is only one asymptomatic individual in the beginning of the simulation, we estimate the parameters $\beta _A$, $\gamma _A$ and p in order to minimize the squared error

$$\begin{aligned} \begin{array}{cc} \min _{\beta _A, \gamma _A, p}&\frac{1}{2} \left( \sum _{t}{ f \left( [(I_t^{cum} - D_t) - (\hat{I}_{S,t}+\hat{R}_{S,t})]^2 \right) + f\left( [D_t - \hat{D}_t]^2\right) } \right) \end{array}, \end{aligned}$$

(6)

where $I_t^{cum}$ and $D_t$ are real data provided by the Ministry of Health of Brazil, the cumulative number of infected individuals and deaths, and $\hat{I}_{S,t}$, $\hat{R}_{S,t}$ and $\hat{D}_t$ are estimated values of the symptomatic infected and recovered individuals, and deaths.

Table 1 presents the epidemiological parameters of our model and some reference values. We also show other values for these epidemiological parameters obtained from other simulations in Table 5 in “Methods”. Some of the lines of Table 1 deserve remarks. First, the basic reproductive number $R_0$ in both models are comparable to the values for China and Italy. Second, the death rate $\rho $ is very close to the values disclosed by the Brazilian Ministry of Health and the average of international values. We point out that our estimation of the death rate uses data that presumes there are places in hospitals to treat patients with severe infections, that is the situation that is present in the data now. Depending on the government policy, we do not know whether this is true or not at the peak of infection. Third, the proportion of symptomatic individuals p is smaller than the international reference due to the Brazilian Ministry of Health policy “only test if you have strong symptoms”. In fact, the same problem of underdiagnosis also seems to have happened in the early epidemics in China²⁹.

In the last step of the estimation procedure, in order to estimate the parameter $\psi $, we keep all model parameters as previously estimated and we also minimize the mean squared error using loss functions similar to the ones defined in Eqs. (5) and (6), depending on the case, in the period after March 23, 2020. Furthermore, in order to evaluate the effectiveness of the social distancing policy, we estimate a new value of $\psi $ for each new point of the time series as shown in Table 2, where the column 2 shows the estimations of $\psi $ for the SIRD model and column 4 shows estimations of the same parameter for the SIRASD model. Although there is a small gap between the values of $\psi $ for different models (SIRD or SIRASD), both columns suggest that the social distance factor $\psi $ is going down, meaning that more people are joining the government policy. According to the models, the transmission rate is reduced to approximately 62% of its original value. Table 2 also presents the effective reproductive number derived from the impact of $\psi $ on the transmission factors.

Forecasts

Figures 2 and 3 present respectively the short-term forcasts of the SIRD and the SIRASD models, where the models incorporate the $\psi $ factor in order to rescale the transmission factors ($\beta $, $\beta _A$ and $\beta _S$) in the scenario with the social distancing policy imposed by the government. Note that Fig. 3 explicitly shows the proportion of unknown asymptomatic individuals that when added to the symptomatic individuals skew the total value of infected individuals upwards.

We also use the SIRD and SIRASD models to provide long term forecasts of the evolution of the COVID-19 pandemic in Brazil depending on the social distancing policy considered. While Fig. 4 shows the forecasts for the SIRD model, Fig. 5 shows the forecasts for the SIRASD model. In particular, we may note that while the SIRASD model predicts that the number of infected is higher than the estimates of the SIRD model, it also predicts a lower peak for the infected with symptoms, which are the ones that could require medical attention.

We explore four cenarios: (I) no measures of social distancing policy (black line); (II) current social distancing policy imposed by the government for an indefinite time (blue line); (III) 2-month social distancing policy imposed by the government (yellow line); and (IV) optimum limited time social distancing policy imposed by the government, so that the second infection peak is not greater than cenario II (red line). Scenario III suggests that policies based on short-term social distancing policy are not enough to constrain the evolution of the pandemic, that is, if social distancing policy measurements are released before the optimal time, a second peak should be experienced. The peaks and dates in which they occur are detailed in Table 3. In the case of Scenario IV, the last day of the social distancing policy is June 22, 2020 for the SIRD model and June 16, 2020 for the SIRASD model.

In addition to Fig. 5, we also present the evolution of the proportion of asymptomatic and symptomatic in Fig. 6. In this figure, we show the instant proportion [$I_{S,t}/(I_{A,t}+I_{S,t})$ for the symptomatic and $I_{A,t}/(I_{A,t}+I_{S,t})$ for the asymptomatic] and the cumulative proportion as well. Note that the proportion of individuals who develop symptoms, p in Eq. (4), alters the transmission rate, so it also affects the evolution of the number of asymptomatic and symptomatic individuals over time. So this plot estimates the evolution of this proportion. The last column of the last line of Table 1 shows that the proportion of asymptomatic may vary from 29 to 37%, but this value is not fixed and evolves over time²⁶. Our estimates suggest that the proportion of cumulative asymptomatic is approximately 68% in March 30, 2020, which converged to $1-p$ (with p given in Table 1); that may account for some individuals with mild symptoms that were not tested.

Table 3 Peaks in each scenario and the dates of occurrence.

Full size table

Finally, it is worth considering that the SIRASD differential equations, presented in Eq. (4), need an initial condition for the number of asymptomatic individuals. If we find the parameters values $(\beta _A, \gamma _A, p)$ by solving the optimization problem of Eq. (6) using different conditions, we get different results, that is, different peak values for the symptomatic individuals. If the proportion of asymptomatic individuals is larger, then this may be good news since it may represent less pressure for the health care system. But since we do not have enough tests to map the whole population, we need to work with hypotheses. Figure 7 shows the effect of different initial conditions in the symptomatic percentage and the peak value of symptomatic, that is, we vary the initial conditions, evaluate the symptomatic proportion (parameter p in the SIRASD model), then calculate the peak value of symptomatic infected. So if we assume that the number of asymptomatic (symptomatic) individuals in data is larger (smaller) today, the number of asymptomatic (symptomatic) individuals will also be larger (smaller) in the time of the peak, leading to a smaller peak for the symptomatic.

Discussion

We use the Brazilian recent data from February 25, 2020 to March 30, 2020 to model and forecast the evolution of the COVID-19 pandemic in Brazil.

We estimate two variations of the SIR model using historical data and we find parameters that are in accordance with the international literature. We also introduce a factor $\psi $ to account for the effect of the government social distancing measures. Our methodology is able to estimate the asymptomatic individuals, that may not be entirely present in data. Since the Brazilian government does not have enough tests for mass testing, this measure may provide some additional information. In fact, we show the relevance of the number of asymptomatic individuals, since the larger the number of asymptomatic individuals, the smaller the number hospital beds needed. The “stay at home” and “only test if you have strong symptoms” policies present contradictory effects in the disease control. While they avoid an increase in the number of infected people and the use of extra resources with people that present only mild symptoms, they reduce the amount of information about the real number of infected individuals. In particular, it explains the low value of the parameter that measures the proportion of individuals who present symptoms, since we count many individuals with mild symptoms as asymptomatic.

While our short-term forecasts are in great accordance with the data, our long-term forecasts may help us to discuss different types of social distancing policies. We also show that the social distancing policy imposed by the government is able to flatten the pattern of contamination provided by the COVID-19, but short-term policies are only able to shift the peak of infection into the future keeping the value of the peak in almost the same value. Furthermore, we define the idea of the optimal social distancing policy as the finite social distancing policy that the second peak that happens after stopping the policy is not larger than the first. Based on this definition, we provide an estimate of the optimal date to end the social distancing policy.

An important discussion is about the effectiveness of vertical containment policies, where only people at risk follow social distance policies. In these kinds of policies, the two fractions of the population, the one at risk and the other one, present very different behaviors. First, the dynamics of the population at risk behaves similarly to the case with social distancing measures, but with a higher death rate. Second, the dynamics of the population that is not at risk behaves similarly to the case without social distance measures but with a low death rate. Third, since the fraction of the population that is at risk is much smaller than the rest of the population, the number of infected of the total population behaves similarly to the case without control. In fact, the policy’s effectiveness is not in reducing the number of infected, but in reducing the number of deaths by confining individuals at risk. It is worth mentioning that the effectiveness of these vertical containment polices depends strongly on the ability to separate the individuals at high risk from the individuals at low risk and on the number of vacancies in hospitals to treat the disease. We may extend our model to explore these type of scenarios and we leave for future work.

Finally, another interesting research path is to evaluate the economic side effects of pandemic control^30,31 and to propose measures to minimize these impacts³².

Methods

Table 4 Parameters estimation region.

Full size table

The solution of the systems of differential equations

We find the numerical solutions of Eqs. (3) and (4) through integration using the explicit Runge-Kutta method of order 5(4)³³. While this method controls the error assuming accuracy of the fourth-order, it uses a fifth-order accurate formula to take the steps. We use the implementation “solve_ivp” of the scipy Python’s library.

The solution of the systems of differential equations depends on the definition of initial conditions. We use $N_0=210147125$, that is the Brazilian population according to Brazilian Institute of Geography and Statistics (IBGE) which is the agency responsible for official collection of statistical, geographic, cartographic, geodetic and environmental information in Brazil, for both models. For the case of the SIRD model, we use $S_0=N_0-1$ and $I_0=1$. For the case, SIRASD model, we use $I_{S0} = 1$ and $S0 = N_0 - I_{A0} - I_{S0}$. We use $I_{A0}=1$ in all simulations of the paper but the simulations presented in Fig. 7, since we want to learn about the effect of $I_{A0}$ in the proportion of symptomatic and asymptomatic individuals in the peak date.

The estimation procedure

Our estimation procedure requires simultaneous integration of the differential equations (SIRD or SIRASD model depending on the case) and minimization of the loss functions [(5) or (6)] depending on the case for each time t. We minimize the loss functions using the method “optimize.least_squares” also from the scipy Python’s library³⁴ using the cauchy loss with scaling parameter $C=2$^35,36. To minimize the impact of the initial point assumption and data incompleteness, we repeat the estimation procedure 100 times using random initial conditions, but we discarded estimations which did not converge. Since this is a difficult nonlinear problem we bound the parameters estimation region. In particular, we use the bounds presented in Table 4. To be clear, the fact that $\beta _S=\beta $ and $\gamma _S=\gamma $ is a consequence of our hierarchical estimation procedure previously described in “Results” section. Furthermore, $\beta _A\in [0,\beta _S]$ means that $\beta _A\le \beta _S$⁷, since the asymptomatic individuals do not have symptoms that may help the spread of the infection.

Finally it is worth mentioning that this estimation procedure is sensitive to the random seed used by the algorithm as an initial condition. In particular, depending on this seed, we have found different epidemiological parameters in different simulations of the SIRD model, as presented in Table 5. We have chosen the simulation results that provided the closest value of the median of the parameter $\gamma $, which is the one that used the random seed 7. We emphasize that although any of the presented epidemiological parameters could be a possible estimation and we could use them in the main part of this paper, this choice does change the qualitative analysis and the conclusions of our paper.

Table 5 Random seeds used in simulations and their respective estimated values of the epidemiological parameters.

Full size table

The long term forecasts

The long term forecasts use the estimations presented in Table 1 and the integration of the systems of differential equations as described in the beginning of this section. We build the 95% confidence intervals of these curves randomizing the values of the parameters in the 95% confidence intervals presented in Table 1.

Data availability

The datasets and codes used to generate all the results of the current study are available in the Zenodo repository, https://zenodo.org/record/4041467#.X2joy4Zv88o.

References

World Heath Organization. Coronavirus overview. https://www.who.int/health-topics/coronavirus (2020).
Adam, D. The simulations driving the world’s response to covid-19. How epidemiologists rushed to model the coronavirus pandemic?. Nature 20, 20 (2020).
Article Google Scholar
Grassly, N. & Fraser, C. Mathematical models of infectious disease transmission. Nat. Rev. Microbiol. 6, 477–487 (2008).
Article CAS PubMed PubMed Central Google Scholar
Keeling, M. J. & Rohani, P. Modeling infectious diseases in humans and animals (2011).
Brauer, F., Castillo-Chavez, C. & Feng, Z. Mathematical models in epidemiology (2019).
Kermack, W. O. & McKendrick, A. G. A contribution to the mathematical theory of epidemics. Proc. R. Soc. A 115, 700–721 (1927).
ADS MATH Google Scholar
Robinson, M. & Stilianakis, N. I. A model for the emergence of drug resistance in the presence of asymptomatic infections. Math. Biosci. 243, 163–177 (2013).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Arino, J., van-den Brauer, F., Driessche, P., Watmough, J. & Wu, J. How will country-based mitigation measures influence the course of the covid-19 epidemic?. J. Theoret. Biol. 253, 118–130 (2008).
Article MathSciNet Google Scholar
Longini, I. M. Jr., Halloran, M. E., Nizam, A. & Yang, Y. Containing pandemic influenza with antiviral agents. Am. J. Epidemiol. 159, 623–633 (2004).
Article PubMed Google Scholar
Kucharski, A. J. et al. Early dynamics of transmission and control of covid-19: A mathematical modelling study. Lancet Infect. Dis. 20, 1–7 (2020).
Article Google Scholar
Berger, D., Herkenhoff, K. & Mongey, S. An Seir Infectious Disease Model with Testing and Conditional Quarantine (Tech. Rep, Federal Reserve Bank of Minneapolis, 2020).
Read, J. M., Bridgen, J. R. E., Cummings, D. A. T., Ho, A. & Jewell, C. P. Novel coronavirus 2019-ncov: Early estimation of epidemiological parameters and epidemic predictions. medRxivhttps://doi.org/10.1101/2020.01.23.20018549 (2020). https://www.medrxiv.org/content/early/2020/01/28/2020.01.23.20018549.full.pdf.
Walker, P. G. T. et al.The Global Impact of Covid-19 and Strategies for Mitigation and Suppression (Tech. Rep, Imperial College, 2020).
Shaman, J., Karspeck, A., Yang, W., Tamerius, J. & Lipsitch, M. Real-time influenza forecasts during the 2012–2013 season. Nat. Commun. 4, 2837 (2013).
Article ADS PubMed Google Scholar
Berge, T., Lubuma, J.-S., Moremedi, G., Morris, N. & Kondera-Shava, R. A simple mathematical model for ebola in Africa. J. Biol. Dyn. 11, 42–74 (2017).
Article MathSciNet CAS PubMed Google Scholar
Osthus, D., Hickmann, K. S., Caragea, P. C., Higdon, D. & Valle, S. Y. D. Forecasting seasonal influenza with a state-space sir model. Ann. Appl. Stat. 11, 202–224 (2017).
Article MathSciNet PubMed PubMed Central Google Scholar
Khaleque, A. & Sen, P. An empirical analysis of the ebola outbreak in west Africa. Sci. Rep. 7, 42594 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
World Heath Organization. Novel coronavirus(2019-ncov). situation report-7. Tech. Rep., World Heath Organization (2020).
World Heath Organization. Coronavirus disease 2019 (covid-19)situation report-72. https://www.who.int/docs/default-source/coronaviruse/situation-reports/20200401-sitrep-72-covid-19.pdf?sfvrsn=3dd8971b_2 (2020).
Brazilian Ministry of Health. Coronavirus panel. https://covid.saude.gov.br/ (2020). Accessed 27 Mar 2020.
Brazilian Ministry of Health. Coronavirus panel. https://covid.saude.gov.br/ (2020). Accessed 29 Mar 2020.
Wu, J. T. et al. Estimating clinical severity of covid-19 from the transmission dynamics in Wuhan, China. Nat. Med. 20, 20 (2020).
Google Scholar
Wu, J. T., Leung, K. & Leung, G. M. Nowcasting and forecasting the potential domestic and international spread of the 2019-ncov outbreak originating in Wuhan, China: A modelling study. Lancet 395, 689–697 (2020).
Article CAS PubMed PubMed Central Google Scholar
Remuzzi, A. & Remuzzi, G. Covid-19 and Italy: What next?. Lancet 20, 1–4 (2020).
Article Google Scholar
Zhang, S. et al. Estimation of the reproductive number of novel coronavirus (covid-19) and the probable outbreak size on the diamond princess cruise ship: A data-driven analysis. Int. J. Infect. Dis. 93, 201–204 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mizumoto, K., Kagaya, K., Zarebski, A. & Chowell, G. Estimating the asymptomatic proportion of coronavirus disease 2019 (covid-19) cases on board the diamond princess cruise ship. Eurosurveillance 25, 2000180 (2020).
Article PubMed Central Google Scholar
Nishiura, H. et al. Estimation of the asymptomatic ratio of novel coronavirus infections. Forthcom. Int. J. Infect. Dis. 20, 20 (2020).
Google Scholar
Bard, Y. Nonlinear Parameter Estimation (1974).
Nishiura, H. et al. The rate of underascertainment of novel coronavirus (2019-ncov) infection: Estimation using Japanese passengers data on evacuation flights. J. Clin. Med. 4, 419 (2020).
Article Google Scholar
Eichenbaum, M. S., Rebelo, S. & Trabandt, M. The macroeconomics of epidemics. Working Paper 26882, National Bureau of Economic Research (2020). https://doi.org/10.3386/w26882.
Gormsen, N. J. & Koijen, R. S. J. Coronavirus: Impact on stock prices and growth expectations. Working Paper of the University of Chicago 1–27 (2020).
Hone, T. et al. An empirical analysis of the ebola outbreak in west Africa. Lancet 7, 1575–1583 (2019).
Google Scholar
Dormand, J. R. & Prince, P. J. A family of embedded Runge-Kutta formulae. J. Comput. Appl. Math. 6, 19–26 (1980).
Article MathSciNet Google Scholar
Virtanen, P. et al. SciPy 1.0. Fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mayorov, N. Robust nonlinear regression in scipy (Tech, Rep, 2015).
Triggs, B., McLauchlan, P. F., Hartley, R. I. & Fitzgibbon, A. W. IWVA 1999: Vision Algorithms: Theory and Practice, chap. Bundle Adjustment—A Modern Synthesis (Springer, Berlin, 1999).
Google Scholar

Download references

Acknowledgements

Daniel O. Cajueiro is indebted to CNPQ for partial financial support under grant 302629/2019-0. Since this is an ongoing pandemic we received very useful comments from many people in Brazil directly involved with the study and the pandemics management. We would like to thank the comments of R.F.S. Andrade who is at Center for Data and Knowledge Integration for Health (CIDACS - Fio Cruz), G.C. Cardoso who is at the Department of Physics of University of São Paulo (USP), I.V.B. Freitas who belongs to the technical staff of the Brazilian Senate, M.M. Morato and J.E. Normey-Rico who are at the Department of Automation and Systems of the Universidade Federal de Santa Catarina (UFSC), G. Riella who is at Brazilian School of Public and Business Administration of Getúlio Vargas Fundation (EBAPE-FGV), H.F.C. Velho who is at the National Institute of Space Research (INPE), several colleagues of the Department of Economics of University of Brasília (unB) and several colleagues of the Machine Learning Laboratory for Finance and Organizations (LAMFO-UnB) on an earlier version of the manuscript.

Author information

Authors and Affiliations

Department de Economics, FACE, Universidade de Brasília (UnB), Campus Universitário Darcy Ribeiro, Brasília, 70910-900, Brazil
Saulo B. Bastos & Daniel O. Cajueiro
Nacional Institute of Science and Technology for Complex Systems (INCT-SC), Universidade de Brasília (UnB), Brasília, Brazil
Daniel O. Cajueiro
Machine Learning Laboratory for Finance and Organizations (LAMFO), FACE-Universidade de Brasília (UnB), Campus Universitário Darcy Ribeiro, Brasília, 70910-900, Brazil
Daniel O. Cajueiro

Authors

Saulo B. Bastos
View author publications
You can also search for this author in PubMed Google Scholar
Daniel O. Cajueiro
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.B.B. and D.O.C. designed the research, performed the research and wrote the manuscript.

Corresponding author

Correspondence to Daniel O. Cajueiro.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bastos, S.B., Cajueiro, D.O. Modeling and forecasting the early evolution of the Covid-19 pandemic in Brazil. Sci Rep 10, 19457 (2020). https://doi.org/10.1038/s41598-020-76257-1

Download citation

Received: 09 April 2020
Accepted: 05 October 2020
Published: 10 November 2020
DOI: https://doi.org/10.1038/s41598-020-76257-1

This article is cited by

Real-time estimation and forecasting of COVID-19 cases and hospitalizations in Wisconsin HERC regions for public health decision making processes
- Srikanth Aravamuthan
- Juan Francisco Mandujano Reyes
- Dörte Döpfer
BMC Public Health (2023)
EKF-SIRD model algorithm for predicting the coronavirus (COVID-19) spreading dynamics
- Abdennour Sebbagh
- Sihem Kechida
Scientific Reports (2022)
Reconstruction of the transmission dynamics of the first COVID-19 epidemic wave in Thailand
- Chaiwat Wilasang
- Natcha C. Jitsuk
- Charin Modchang
Scientific Reports (2022)
Influence of state reopening policies in COVID-19 mortality
- Ka-Ming Tam
- Nicholas Walker
- Juana Moreno
Scientific Reports (2022)
Optimal Control Approach for the COVID-19 Pandemic in Bahia and Santa Catarina, Brazil
- Igor M. L. Pataro
- Marcelo M. Morato
- Julio E. Normey-Rico
Journal of Control, Automation and Electrical Systems (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.