Abstract
In Italy, 128,948 confirmed cases and 15,887 deaths of people who tested positive for SARSCoV2 were registered as of 5 April 2020. Ending the global SARSCoV2 pandemic requires implementation of multiple populationwide strategies, including social distancing, testing and contact tracing. We propose a new model that predicts the course of the epidemic to help plan an effective control strategy. The model considers eight stages of infection: susceptible (S), infected (I), diagnosed (D), ailing (A), recognized (R), threatened (T), healed (H) and extinct (E), collectively termed SIDARTHE. Our SIDARTHE model discriminates between infected individuals depending on whether they have been diagnosed and on the severity of their symptoms. The distinction between diagnosed and nondiagnosed individuals is important because the former are typically isolated and hence less likely to spread the infection. This delineation also helps to explain misperceptions of the case fatality rate and of the epidemic spread. We compare simulation results with real data on the COVID19 epidemic in Italy, and we model possible scenarios of implementation of countermeasures. Our results demonstrate that restrictive socialdistancing measures will need to be combined with widespread testing and contact tracing to end the ongoing COVID19 pandemic.
Main
After a novel strain of coronavirus, SARSCoV2, was identified in Wuhan (Hubei), China^{1,2}, an exponentially growing number of patients in mainland China were diagnosed with COVID19, prompting Chinese authorities to introduce radical measures to contain the outbreak^{3}. Despite these measures, a COVID19 pandemic ensued in the following months. The World Health Organisation report dated 5 April 2020 reported 1,133,758 total cases and 62,784 deaths worldwide^{4}.
Italy has been severely affected^{5}. After the first indigenous case on 21 February 2020 in Lodi province, several suspect cases (initially epidemiologically linked) began to emerge in the south and southwest territory of Lombardy^{6}. A ‘red zone’, encompassing 11 municipalities where SARSCoV2 infection was endemic, was instituted on 22 February 2020, and put on lockdown to contain the emerging threat. A campaign to identify and screen all close contacts with confirmed cases of COVID19 resulted in taking 691,461 nasal swabs as of 5 April 2020. Of the 128,948 detected cases, 91,246 were currently infected (28,949 hospitalized, 3,977 admitted to intensive care units (ICUs) and 58,320 quarantined at home), 21,815 had been discharged due to recovery and 15,887 had died^{7}. In the early days of the epidemic in Italy, both symptomatic and asymptomatic people underwent screening. A government regulation dated 26 February 2020 limited screening to symptomatic subjects only^{8}. On 8 March 2020, to further contain the spread of SARSCoV2, the red zone was extended to the entire area of Lombardy and 14 more northern Italian provinces. On 9 March 2020, lockdown was declared for the entire country^{9} and progressively stricter restrictions were adopted.
COVID19 displays peculiar epidemiological traits when compared with previous coronavirus outbreaks of SARSCoV and MERSCoV. According to Chinese data^{10}, a large number of transmissions, both in nosocomial and community settings, occurred through humantohuman contact with individuals showing no or mild symptoms. The estimated basic reproduction number (R_{0}) for SARSCoV2 ranges from 2.0 to 3.5^{11,12,13}, which seems comparable, or possibly higher, than for SARSCoV and MERSCoV. High viral loads of SARSCoV2 were found in upper respiratory specimens of patients showing little or no symptoms, with a viral shedding pattern akin to that of influenza viruses^{14}. Hence, inapparent transmission may play a major and underestimated role in sustaining the outbreak.
Predictive mathematical models for epidemics^{15,16,17,18} are fundamental to understand the course of the epidemic and to plan effective control strategies. One commonly used model is the SIR model^{19} for humantohuman transmission, which describes the flow of individuals through three mutually exclusive stages of infection: susceptible, infected and recovered. More complex models can accurately portray the dynamic spread of specific epidemics. For the COVID19 pandemic, several models have been developed. Lin and colleagues extended a SEIR (susceptible, exposed, infectious, removed) model considering risk perception and the cumulative number of cases^{20}, Anastassopoulou and colleagues proposed a discretetime SIR model including dead individuals^{21}, Casella developed a controloriented SIR model that stresses the effects of delays and compares the outcomes of different containment policies^{22} and Wu and colleagues used transmission dynamics to estimate the clinical severity of COVID19^{23}. Stochastic transmission models have also been considered^{24,25}. Here, we propose a new meanfield epidemiological model for the COVID19 epidemic in Italy that extends the classical SIR model, similar to that developed by Gumel and colleagues for SARS^{26}. A summary of the main findings, limitations and implications of the model for policymakers is shown in Table 1.
Our model, named SIDARTHE, discriminates between detected and undetected cases of infection and between different severity of illness (SOI), nonlifethreatening cases (asymptomatic and paucisymptomatic; minor and moderate infection) and potentially lifethreatening cases (major and extreme) that require ICU admission.
The total population is partitioned into eight stages of disease: S, susceptible (uninfected); I, infected (asymptomatic or paucisymptomatic infected, undetected); D, diagnosed (asymptomatic infected, detected); A, ailing (symptomatic infected, undetected); R, recognized (symptomatic infected, detected); T, threatened (infected with lifethreatening symptoms, detected); H, healed (recovered); E, extinct (dead). The interactions among these stages are shown in Fig. 1. We omit the probability rate of becoming susceptible again after having recovered from the infection. Although anecdotal cases are found in the literature^{27}, the reinfection rate value appears negligible. A detailed discussion of the model considerations and parameters is provided in the Methods.
For the COVID19 epidemic in Italy, we estimate the model parameters based on data from 20 February 2020 (day 1) to 5 April 2020 (day 46) and show how the progressive restrictions, including the most recent lockdown progressively enforced since 9 March 2020, have affected the spread of the epidemic. We also model possible longerterm scenarios illustrating the effects of different countermeasures, including social distancing and populationwide testing, to contain SARSCoV2.
The model parameters have been updated over time to reflect the progressive introduction of increased restrictions. On day 1, the basic reproduction number was R_{0} = 2.38, which resulted in a substantial outbreak. On day 4, R_{0} = 1.66 as a result of the introduction of basic social distancing, awareness of the epidemic, hygiene and behavioral recommendations, and early measures by the Italian government (for example, closing schools). At day 12, asymptomatic individuals were almost no longer detected, and screening was focused on symptomatic individuals (leading to R_{0} = 1.80). On day 22, a partially incomplete lockdown, of which the effectiveness was reduced by the movement of people from the north to the south of Italy when the countrywide lockdown was announced but not yet enforced, yielded R_{0} = 1.60. When the national lockdown was fully operational and strictly enforced, after day 28, R_{0} = 0.99, finally reaching below 1. Moreover, R_{0} = 0.85 was achieved after day 38 due to a wider testing campaign that identified more mildly symptomatic infected individuals. Figure 2a shows the model evolution with the estimated parameters up to day 46; in the earliest epidemic phase, the number of infected was considerably underestimated. Of the total cases, 35% were undetected. In Fig. 2b, the infected individuals are partitioned into the different subpopulations (diagnosed or not, with different SOI classification). Over a 350day horizon, in the absence of further policy changes, Fig. 2c predicts that 0.61% of the population will contract the virus (and 0.45% will be diagnosed), while 0.06% of the population will die from COVID19. The peak of the number of concurrently infected individuals will occur on around day 50 at 0.19% of the population, while the peak of concurrently diagnosed infected individuals will occur later (around day 56) and amounts to 0.17% of the population. The actual case fatality rate (CFR) is 9.8% and the perceived CFR is 13%. Figure 2d shows that each infected subpopulation reaches its peak at a different time.
Extended Data Fig. 1 shows how the situation could have evolved if milder or stronger measures had been implemented earlier. The curve following day 22 shows the importance and effectiveness of a prompt lockdown. The actual epidemic evolution corresponds to an intermediate scenario: the lockdown measures had a moderate effect, probably due to their incremental nature.
We predict a range of possible future scenarios, with different measures enforced after day 50.
Figure 3a,b shows, if the lockdown is weakened, a sudden and strong increase of the spread of disease, a prolonged emergency and more deaths (0.12% of the population in the first 350 days). Figure 3c,d shows the benefits of stricter lockdown measures: after 350 days, 0.41% of the population would contract the virus (0.30% diagnosed) and 0.04% of the population would die.
A policy of populationwide testing and contact tracing would help to rapidly end the epidemic, as suggested by Peto^{28}. Figure 4a,b shows the effect of such measures: the peak would be reached sooner and, after 350 days, 0.43% of the population would contract the virus (0.33% diagnosed), with an estimated 0.05% dying. Figure 4c,d shows the effect of combining a milder lockdown with widespread testing and contact tracing: after 350 days, 0.52% of the population would contract the virus (0.41% diagnosed) and 0.05% would die.
Hence, the current adopted lockdown measures are vital to contain the epidemic and cannot be relieved. Rather, they should be even more restrictive. The enforced lockdown could be mitigated in the presence of widespread testing and contact tracing, which would strongly contribute to a rapid resolution of the epidemic.
Distinguishing between diagnosed and nondiagnosed cases highlights a distortion in disease statistics. The discrepancy between the actual CFR (total number of deaths due to the infection, divided by the total number of infected people) and the perceived CFR (number of deaths ascribed to the infection, divided by the number of people diagnosed as infected) can be quantified, which explains the gap between the actual infection dynamics and perception of the outbreak. Performing an insufficient number of tests underestimates the transmission rate and overestimates the CFR. Our model can predict the longterm effects of underdiagnosis.
Concerning diagnostic tests for COVID19, currently, standard molecular methods to detect the presence of SARSCoV2 in respiratory samples are based on nonspecific realtime polymerase chain reaction with reverse transcription methods, which target RNAdependent RNA polymerase and E genes^{29}. These tests are timeconsuming and cannot be done on all susceptible individuals in the population; high false negatives rates have been reported and certified laboratories with expensive equipment are needed. Rapid tests with high sensitivity and specificity that can be easily adapted to reallife settings (schools, airports, train stations) are urgently required. Some laboratories are moving in this direction, developing a 15 min test to detect SARSCoV2 immunoglobulins IgM and IgG simultaneously in human blood^{30}.
Our model confirms that diagnosis campaigns can reduce the infection peak (the diagnosed population enters quarantine and is therefore less likely to affect the susceptible population) and help end the epidemic more quickly^{28}. Healthcare workers are more likely to be exposed and their risk of infection is increased, as supported by reports from China^{31,32} suggesting that disease amplification in healthcare settings will occur despite restrictive measures.
The model does not consider reduced availability of medical care due to the healthcare system reaching or even surpassing its capacity^{33}. These analyses can only be done indirectly. For example, when the number of seriously affected individuals is high (above a threshold), the mortality coefficient will be increased due to an insufficient number of ICUs.
We compare scenarios with control measures of varying strength and nature, predicting for each the timing and magnitude of the epidemic peak, including the peak of ICU admissions. According to our findings, a partial implementation of lockdown measures results in a delay in the peak of infected individuals and patients admitted to the ICU, contrasting with an only moderate decrease in the total number of infected individuals and ICU admissions. Conversely, the implementation of very strong socialdistancing strategies would result in an anticipated lower peak of infected individuals and patients admitted to the ICU, with a marked decrease in the total number of infected individuals and ICU admissions due to the disease.
Our findings provide policymakers with a tool to assess the consequences of possible strategies, including lockdown and social distancing, as well as testing and contact tracing. Our simulation results, achieved by combining the model with the available data about the COVID19 epidemic in Italy, suggest that enforcing strong socialdistancing measures is urgent, necessary and effective, in line with other reports in the literature^{2,22,24}. The earlier the lockdown is enforced, the stronger the effect obtained. The model results also confirm the benefits of mass testing, whenever facilities are available^{28}. We believe these indications can be useful to manage the epidemic in Italy, as well as in countries that are still in the early stages of outbreak.
Although the mortality rate (number of deaths in the whole population) of COVID19 can be decreased with restrictive measures that reduce the spread of SARSCoV2, the CFR (number of deaths in the infected population) is essentially constant in different scenarios, unaffected by the extent of social restriction and testing. Despite rigid isolation policies, COVID19 patients may still be burdened with excess case fatality, and efforts should be focused on developing more effective treatment strategies to combat COVID19. As new drugs and vaccines are being tested and evaluated, the current scenario will evolve to account for these ongoing innovations^{34,35,36,37}.
Methods
SIDARTHE mathematical model
The SIDARTHE dynamical system consists of eight ordinary differential equations, describing the evolution of the population in each stage over time:
where the uppercase Latin letters (state variables) represent the fraction of population in each stage, and all the considered parameters, denoted by Greek letters, are positive numbers. The interactions among different stages of infection are visually represented in the graphical scheme in Fig. 1. The parameters are defined as follows:
α, β, γ and δ respectively denote the transmission rate (the probability of disease transmission in a single contact multiplied by the average number of contacts per person) due to contacts between a susceptible subject and an infected, a diagnosed, an ailing or a recognized subject. Typically, α is larger than γ (assuming that people tend to avoid contacts with subjects showing symptoms, even though diagnosis has not been made yet), which in turn is larger than β and δ (assuming that subjects who have been diagnosed are properly isolated). These parameters can be modified by socialdistancing policies (for example, closing schools, remote working, lockdown). The risk of contagion due to threatened subjects, treated in proper ICUs, is assumed negligible.
ε and θ capture the probability rate of detection, relative to asymptomatic and symptomatic cases, respectively. These parameters, also modifiable, reflect the level of attention on the disease and the number of tests performed over the population: they can be increased by enforcing a massive contact tracing and testing campaign^{28}. Note that θ is typically larger than ε, as a symptomatic individual is more likely to be tested.
ζ and η denote the probability rate at which an infected subject, respectively not aware and aware of being infected, develops clinically relevant symptoms, and are comparable in the absence of specific treatment. These parameters are diseasedependent, but may be partially reduced by improved therapies and acquisition of immunity against the virus.
µ and ν respectively denote the rate at which undetected and detected infected subjects develop lifethreatening symptoms; they are comparable if there is no known specific treatment that is effective against the disease, otherwise µ may be larger. Conversely, ν may be larger because infected individuals with more acute symptoms, who have a higher risk of worsening, are more likely to have been diagnosed. These parameters can be reduced by means of improved therapies and acquisition of immunity against the virus.
τ denotes the mortality rate (for infected subjects with lifethreatening symptoms) and can be reduced by means of improved therapies.
λ, κ, ξ, ρ and σ denote the rate of recovery for the five classes of infected subjects; they may differ significantly if an appropriate treatment for the disease is known and adopted for diagnosed patients, but are probably comparable otherwise. These parameters can be increased thanks to improved treatments and acquisition of immunity against the virus.
Discussion on modeling choices
In the model, we omit the probability rate of becoming susceptible again, after having already recovered from the infection, because this appears to be negligible based on early evidence^{27}. Given the scarcity of available data, it is impossible to have conclusive evidence about immunity at this stage. Immunity might also be temporary^{38}. Although some reports suggest the possibility of SARSCoV2 reinfection^{27,39,40}, the indicated presence of viral RNA in respiratory samples might reflect a persistence rather than a true recurrence. The literature on the recrudescence of related members of the coronavirus family, such as SARSCoV and MERSCoV, is similarly sporadic. MERSCoV reinfection despite serum detection of neutralizing antibodies has been described only in animals^{41,42}, while the presence of neutralizing antibodies in serum via primary infection or passive transfer has been shown to prevent respiratory tract replication of SARSCoV in a murine model^{43}. From a modeling perspective, we are particularly interested in predictions over a relatively short horizon within which the temporary immunity is likely still to be in place, and the possibility of reinfection would negligibly affect the total number of susceptible individuals and so there would be no substantial difference in the evolution of the epidemic curves we consider. To provide solid support to this claim, Extended Data Fig. 2 shows the results of numerical simulation of the model when the possibility of reinfection is introduced: the evolution is almost identical, with the only difference being that the recovered population of course decreases over time. Hence, based on the evidence at hand, although we cannot rule out that adaptive immunity against SARSCoV2 may not provide longlasting protection, we may reasonably consider the probability of reinfection to be negligible within the scope of our model.
Also, our model accounts for a distinction between nondiagnosed individuals, who spread the infection more because they are not in isolation, and diagnosed individuals, who transmit the disease much less thanks to proper isolation and complying with strict rules, either in hospital or at home. Because Italy is on lockdown, extended emergency measures nationwide are being applied to contain the epidemic: unless indispensable for fundamental activities, people are forced to stay at home in family settings, drastically reducing the risk of spreading the disease. Persontoperson household transmission of SARSCoV2 has been described in China^{44,45}. Although the infection of household members of COVID19positive individuals is possible, the rate of this occurrence is difficult to estimate so far. The only way to completely avoid such risk is to separate infected individuals in dedicated quarantine centers^{46}, as has been done partially in Italy, confining infected people in individual hotel rooms. Even with reduced admissions to hospital, patients that are treated at home and assisted by household members strictly comply with the home isolation guidelines issued by experts^{47}, ranging from sanitary hygiene measures (including waste management, cleaning of contaminated surfaces and household laundering) to interhuman contact measures among family members (the caregiver of a suspected or confirmed COVID19infected individual in home isolation must be in good health and maintain a distance of at least 1 m, avoiding direct contact with oral or respiratory secretions, faeces and urine; moreover, a surgical mask and disposable gloves should always be used). Hence, we can safely assume that inhouse transmission is severely limited.
Although we do consider a delay in the emergence of symptoms, through asymptomatic (or paucisymptomatic) patients, categorized as undetected (infected) and detected (diagnosed), our model does not account for a possible latency between exposure to the virus and onset of infectiousness, because there is mounting evidence that an infected individual can transmit the virus at an early, preclinical stage of the disease, based on epidemiological investigation of COVID19 clusters^{45,48,49,50}. Moreover, recent studies estimated median serial interval values for COVID19 to be close to or shorter than the median incubation period^{51,52}, further proving the possibility of presymptomatic transmission of the disease. For this reason, we deemed it unnecessary to include an additional stage: although asymptomatic, individuals exposed to the virus retain a potential of viral transmission and thus reasonably fit within the infected and diagnosed stages.
Finally, the SIDARTHE model is a meanfield type of model, where the average effect of phenomena involving the whole population is captured. Social mixing patterns are incorporated into our contagion parameters in an averaged fashion over the whole population, irrespective of age. However, our model is fully flexible and suited to include, for example, a distinction between age classes, which would require splitting each variable of the model into N variables if N age classes are considered. Another possible future development is to extend the model to predict the simultaneous evolution of other diseases, which, due to the epidemic emergency, may be overestimated, underestimated or not treated appropriately because the healthcare system is overloaded, thus leading to an increased number of ‘collateral’ deaths not directly linked to the virus.
Analysis of the mathematical model
The SIDARTHE model (1)–(8) is a bilinear system with eight differential equations. The system is positive: all the state variables take nonnegative values for t ≥ 0 if initialized at time 0 with nonnegative values. Note that H(t) and E(t) are cumulative variables that depend only on the other ones and their own initial conditions.
The system is compartmental and demonstrates the mass conservation property: as can be immediately checked, \(\dot S\left( t \right) + \dot I\left( t \right) + \dot D\left( t \right) + \dot A\left( t \right) + \dot R\left( t \right) + \dot T\left( t \right) + \dot H\left( t \right) + \dot E({\mathrm{t}}) = 0\), hence the sum of the states (total population) is constant. Because the variables denote population fractions, we can assume
where 1 denotes the total population, including deceased.
Given an initial condition S(0), I(0), D(0), A(0), R(0), T(0), H(0), E(0) summing to 1, we can show that the variables converge to an equilibrium
with \(\bar S + \bar H + \bar E = 1\). So only the susceptible, the healed and the deceased populations are eventually present, meaning that the epidemic phenomenon is over. All the possible equilibria are given by \(\left( {\bar S,0,0,0,0,0,\bar H,\bar E} \right)\), with \(\bar S + \bar H + \bar E = 1\).
To understand the system behavior, we partition it into three subsystems: the first includes just variable S (corresponding to susceptible individuals), the second includes I, D, A, R and T (the infected individuals), which are nonzero only during the transient, and the third includes variables H and E (representing healed and defunct). We focus on the second subsystem, which we denote the IDART subsystem. An important observation is that when (and only when) the infected individuals I + D + A + R + T are zero are the remaining variables S, H and E at equilibrium. Variables H and E (which are monotonically increasing) converge to their asymptotic values \(\bar H\) and \(\bar E\), and S (which is monotonically decreasing) converges to \(\bar S\) if and only if I, D, A, R and T converge to zero.
The overall system can be recast in a feedback structure, where the IDART subsystem can be seen as a positive linear system subject to a feedback signal u as follows.
Defining x = [I D A R T]^{⊤}, we can rewrite the IDART subsystem as
where r_{1} = ε + ζ + λ, r_{2} = η + ρ, r_{3} = θ + μ + κ, r_{4} = ν + ξ and r_{5} = σ + τ. The remaining variables satisfy the differential equations
Because the timevarying feedback gain S(t) eventually converges to a constant value \(\bar S\), we can proceed with a parametric study with respect to the asymptotic feedback gain \(\bar S\). A key property is given in the following proposition.
Proposition 1
The IDART subsystem with susceptible population \(\bar S\) is asymptotically stable if and only if
Proof of proposition 1
The dynamical matrix of the linearized system around the equilibrium \(\left( {\bar S,0,0,0,0,0,\bar H,\bar E} \right)\) is
where r_{1} = ε + ζ + λ, r_{2} = η + ρ, r_{3} = θ + μ + κ, r_{4} = ν + ξ and r_{5} = σ + τ.
The matrix has three null eigenvalues, and five eigenvalues roots of the polynomial
where
The transfer function from u to y_{S} in the system (9)–(13) is G(s) = N(s)/D(s). Because the system is positive, the H_{∞} norm of G(s) is equal to the static gain G(0) = N(0)/D(0).
Then, by standard root locus (small gain argument) on the positive system G(s), we can say that the polynomial is Hurwitz (all roots in the lefthand plane) if and only if expression (17) holds, where \(\bar S^ \ast = 1/G\left( 0 \right)\), which proves the result.
We observe that, therefore, we are well justified to define the basic reproduction parameter
and stability of the equilibrium occurs for \(\bar S R_0 < 1\).
(Notice also that R_{0} = G(0) is the H_{∞} norm of the transfer function G(s).) QED
The threshold \(\bar S^ \ast\) is of fundamental importance. Because, asymptotically, S(t) converges monotonically to a constant \(\bar S\), such a constant \(\bar S\) must ensure convergence of the IDART subsystem to zero (hence stability; otherwise, S could not converge to \(\bar S\)). Therefore, we have the following result.
Proposition 2
For positive initial conditions, the limit value \(\bar S = \mathop {{\lim }}\limits_{t \to \infty } S(t)\) cannot exceed \(\bar S^ \ast\).
Proof of proposition 2
Because S(t) is monotonically decreasing and nonnegative, it has a limit \(\bar S \ge 0\). For t large enough, we have \(S\left( t \right) \approx \bar S\). Then the system converges to the linear system corresponding to the linearization in \(\bar S\). If, by contradiction, \(\bar S\) renders this system unstable, then x(t) diverges, as the Metzler matrix \(F + b\bar Sc^ \top\) has a positive dominant eigenvalue. In turn, this implies that x(t) cannot converge to zero, hence its components remain positive, which means that αI + βD + γA + δR > 0 does not converge to zero. As a consequence, \(\dot S =  S\left( {\alpha I + \beta D + \gamma A + \delta R} \right) < 0\) also does not converge to zero, hence S(t) cannot converge to a nonnegative value \(\bar S \ge 0\). We have reached a contradiction. QED
The threshold value of expression (17) has a deep meaning. The limit \(\bar S\) represents the fraction of population that has never been infected. This value is a decreasing function of the parameters α, β, γ and δ, which are the infection parameters. The action
has a destabilizing effect on the IDART subsystem, which would be stable without this feedback. To preserve the stability of the IDART subsystem and ensure that the equilibrium \(\bar S\) is reached, either the infection coefficients are small or the final value \(\bar S\) is small. Defining the basic reproduction number as
we have that stability of the equilibrium occurs for
At the outset of the epidemic we have \(\bar S \simeq 1\), so that stability occurs for
which essentially represents an immediate recovery with no large involvement of the population. Larger values of R_{0} imply a strong affection of the population according to equation (19).
We can provide an important formula that relates the coefficient R_{0} with the steadystate value \(\bar S\) (and \(\bar H\), \(\bar E\)).
Proposition 3
For positive initial conditions, the limit values \(\bar S = \mathop {{\lim }}\limits_{t \to \infty } S(t)\), \(\bar H = \mathop {{\lim }}\limits_{t \to \infty } H(t)\) and \(\bar E = \mathop {{\lim }}\limits_{t \to \infty } E(t)\) are given by
where f_{0} = −c^{⊤}F^{−1}x(0), f_{H} = −f^{⊤}F^{−1}x(0), f_{E} = −d^{⊤}F^{−1}x(0), R_{H} = −f^{⊤}F^{−1}b and R_{E} = −d^{⊤}F^{−1}b.
Proof of proposition 3
From expression (14), we have \(\dot S\left( t \right)/S\left( t \right) =  y_S\left( t \right)\), namely \( y_S\left( t \right) = \frac{{{\rm{d}}\log \left( {S\left( t \right)} \right)}}{{{\rm{d}}t}}\). By integration we have
Now, with constant F and b, we integrate \(\dot x\left( t \right)\):
Since \(\dot S\left( t \right) =  S\left( t \right)y_S\left( t \right)\) and x(∞) = 0, we have
We premultiply by c^{⊤}F^{−1} and take into account that y_{S}(t) = c^{⊤}x(t):
Simple calculations show that −c^{⊤}F^{−1}b = R_{0}, with R_{0} defined in equation (18). Denoting f_{0} = −c^{⊤}F^{−1}x(0), we have
The formulas for \(\bar H\) and \(\bar E\) can be obtained by premultiplying the expression of \(\mathop {\smallint }\limits_0^\infty x\left( \phi \right){\rm{d}}\phi\) above by f^{⊤} and d^{⊤}, respectively. QED
If we consider an initial condition in which only undiagnosed infected I(0) > 0 are present, while D(0) = A(0) = R(0) = T(0) = 0, then we can explicitly compute \(f_0 =  c^ \top F^{  1}\left[ {\begin{array}{*{20}{c}} {I(0)} & 0 & 0 & 0 & 0 \end{array}} \right]^ \top\) as
It is important to stress that equation (23) could be totally misleading for a longterm prediction, because in the long run the coefficients of matrix F are going to change. So, if there is a change in the parameters at time t_{0}, for example due to imposed restrictions and countermeasures, the prediction has to be adjusted by considering f_{0} = −c^{⊤}F^{−1}x(t_{0}), where F includes the new parameter values and x(t_{0}) = (I(t_{0}) D(t_{0}) A(t_{0}) R(t_{0}) T(t_{0}))^{⊤}. Clearly equation (20) also has to be updated by considering the new S(t_{0}).
An important indicator of the dynamics of an epidemiologic model is the CFR, which is the ratio between the number of deaths and the number of infected. Our model allows us to distinguish between the actual CFR M(t) and the perceived CFR P(t), which are defined as
Taking into account that
we can provide the explicit formulas
with equilibria
Fit of the model for the COVID19 outbreak in Italy
We infer the model parameters based on the official data (source: Protezione Civile and Ministero della Salute) about the evolution of the epidemic in Italy from 20 February 2020 (day 1) through 5 April 2020 (day 46). The official data we gathered are provided in Supplementary Table 1. We turn the data into fractions over the whole Italian population (~60 million).
The estimated parameter values are based on the data about the number of currently infected individuals with different SOI (asymptomatic or paucisymptomatic, quarantined at home, roughly corresponding to variable D(t) in our model; symptomatic and hospitalized, roughly corresponding to variable R(t) in our model; symptomatic in lifethreatening conditions, admitted to ICUs, roughly corresponding to variable T(t) in our model) and the number of diagnosed individuals who recovered (roughly corresponding to the quantity \({\int}_0^t {\left[ {\rho D\left( \phi \right) + \xi R\left( \phi \right) + \sigma T\left( \phi \right)} \right]{\rm{d}}\phi }\) that can be computed based on our model). Although we also show plots comparing the model prediction to cumulative case data, we did not fit the model to the cumulative case counts, but to the number of currently infected cases, to avoid the pitfalls described by King and others^{53}.
Data about the number of deaths (corresponding to E(t) in our model) appear particularly high with respect to the CFR reported in the literature; this can be largely explained by the age structure of the Italian population, which is the second oldest in the world (the reported CFR across all countries increases steeply with the age of the patient), and by the extensive intergenerational contacts in Italian society, which enhanced the spreading of the virus among older and more fragile generations^{54}. Perhaps more importantly, it can also be explained by the Italian criteria for (provisional) statistics, which lead to overestimation. In fact, unlike other countries, the official numbers for COVID19 deaths provisionally include the deaths of all people tested positive for the SARSCoV2 virus, even when they had multiple preexisting lifethreatening diseases and the exact cause of death had not yet been ascertained, so these numbers still need to be confirmed^{55}. Thus, an important challenge in tuning the model is that the initial data are affected by statistical distortion: in particular, the values of the ratio death/infected are highly overestimated. The model fitting process must take this problem into account. Therefore, we decided to fit the parameters based on the data about the diagnosed infected population and the number of recovered diagnosed patients, but not on the data about deaths. It is also worth stressing that, in the long run, the model is weakly sensitive to the initial conditions; for this reason, the initial mismatch concerning the mortality data has little impact.
We adopt a bestfit approach to find the parameters that locally minimize the sum of the squares of the errors. The model involves many state variables, as well as a large number of uncertain parameters whose numerical determination is a very challenging problem; it is likely that an infinite number of different parameter sets could be found, matching the data equally well. On the other hand, our parameters are control tuning knobs whose values should realistically reproduce the data and the reproduction number R_{0} in plausible scenarios. Relying on a priori epidemiological and clinical information about the relative parameter magnitude (as discussed above), and starting from a random initial guess, the model parameters have been fitted by reiterated local minimization of the sum of the squares of the errors. During the course of the simulation, the parameters have been updated based on the successive measures, of increasing strength, adopted by policymakers.
In particular, the fraction of the population in each stage at day 1 is set as: I = 200/60e6, D = 20/60e6, A = 1/60e6, R = 2/60e6, T = 0, H = 0, E = 0; S = 1 – I – D – A – R – T – H – E. The parameters are set as α = 0.570, β = δ = 0.011, γ = 0.456, ε = 0.171, θ = 0.371, ζ = η = 0.125, μ = 0.017, ν = 0.027, τ = 0.01, λ = ρ = 0.034 and κ = ξ = σ = 0.017. The resulting basic reproduction number is R_{0} = 2.38.
After day 4, as a consequence of basic socialdistancing measures due to the public being aware of the epidemic outbreak and due to recommendations (such as washing hands often, not touching one’s face, avoiding handshakes and keeping distance) and early measures (such as closing schools) by the Italian government, we set α = 0.422, β = δ = 0.0057 and γ = 0.285, so the new basic reproduction number becomes R_{0} = 1.66.
Also, after day 12, we set ε = 0.143 as a consequence of the policy limiting screening to symptomatic individuals only; thus, totally asymptomatic individuals are almost no longer detected, while individuals with very mild symptoms are still detected (hence ε is not set exactly to zero). Due to this, R_{0} = 1.80.
After day 22, the lockdown, at first incomplete, yields α = 0.360, β = δ = 0.005 and γ = 0.200; also, ζ = η = 0.034, μ = 0.008, ν = 0.015, λ = 0.08 and ρ = κ = ξ = σ = 0.017. Hence, the new basic reproduction number becomes R_{0} = 1.60.
After day 28, the lockdown is fully operational and gets stricter (working is no longer a good reason for going out: gradually, nonindispensable activities are stopped): we get α = 0.210 and γ = 0.110, hence R_{0} = 0.99.
After day 38, a wider testing campaign is launched: this yields ε = 0.200, and also ρ = κ = ξ = 0.020, while σ = 0.010 and ζ = η = 0.025. Therefore, R_{0} = 0.85.
The parameters above were used to simulate the model and generate the graphs reported in Fig. 2. The comparison between the official data and the curves resulting from the SIDARTHE model are provided in Extended Data Fig. 3. The current number of infected (including all stages), the number of recovered and the cumulative number of diagnosed cases are well reproduced, but a small mismatch can be noted in the last days when distinguishing between different SOI. This discrepancy can have two interpretations: on the one hand, the model considers infected with different severities (for example, T(t) is the number of lifethreatened patients that would need ICU admission) while the data report the actual treatment that the patients received (for example, the number of patients actually admitted to ICUs, which is constrained by the number of available beds and can be limited if the infected suddenly and quickly worsen, leading to death, before admission to the hospital). Hence, our overestimation of ICU patients may be due to saturation of the healthcare system, which is neglected in the model, or to the sudden worsening of infected who die at home before having the time to reach the ICU. Another possible explanation for our overestimation of patients with symptoms, and lifethreatening symptoms, and our underestimation of patients that are asymptomatic or paucisymptomatic, is that the average age of infected people is getting lower and lower, and younger patients are less likely to show serious or lifethreatening symptoms.
In the possible future scenarios reported in Figs. 3 and 4, the parameters are changed after day 50 as follows. In Fig. 3a,b, α = 0.252, hence R_{0} = 0.98 (increased). In Fig. 3c,d, α = 0.105, hence R_{0} = 0.50 (significantly decreased). In Fig. 4a,b, ε = 0.400, hence R_{0} = 0.59 (decreased, although not as much as in the previous scenario). In Fig. 4c,d, α = 0.420 but also ε = 0.600, therefore R_{0} = 0.77 (reduced, although not as much as in the previous two scenarios).
Conversely, Extended Data Fig. 1 shows the epidemic evolution that would have been predicted by the model for the COVID19 outbreak in Italy if, after day 22, socialdistancing countermeasures had been absent (Extended Data Fig. 1a,b), mild (Extended Data Fig. 1c,d), strong (Extended Data Fig. 1e,f) and very strong (Extended Data Fig. 1g,h). In all cases, the actual CFR is ~7.2%, while the perceived CFR is ~9.0%.
Extended Data Fig. 1a,b shows that, in the absence of further countermeasures after day 22 (just closing schools and hygiene recommendations), we have α = 0.422, γ = 0.285 and β = δ = 0.0057, hence R_{0} = 1.66 and the model predicts an evolution that leads to 73% of the population having contracted the virus (and ~64% having been diagnosed) and ~5.2% of the population having died because of the contagion over a 300day horizon (Extended Data Fig. 1a). The peak of the number of concurrently infected individuals occurs at around 76 days and amounts to ~44% of the population; however, the peak of concurrently diagnosed infected individuals occurs later, around 82 days, and amounts to 39% of the population. Extended Data Fig. 1b shows how the different subpopulations of infected individuals evolve over time, and it is interesting to notice that each subpopulation reaches its peak at a different time. In particular, the fraction of infected who need intensive care reaches its peak, almost 16.5% of the population, after 107 days.
Extended Data Fig. 1c,d shows that, with socialdistancing countermeasures after day 22 having a mild effect, α = 0.285 and γ = 0.171, hence R_{0} = 1.13, still larger than 1. Hence, the peak is delayed (and reduced in amplitude), because the increase in the number of new infected is reduced. Over a 500day horizon, as shown in Extended Data Fig. 1c, the model predicts an evolution that leads to a peak in the number of concurrently infected individuals around day 170, amounting to 11.7% of the population (10.6% of the population have been diagnosed). Eventually, 35% of the population have contracted the virus (and ~30% have been diagnosed) and ~2.5% of the population have died because of the contagion. The fraction of patients in need of intensive care, as shown in Extended Data Fig. 1d, reaches its peak on day 198, amounting to 5.3% of the population. The adopted socialdistancing policy, although mild, has some impact and helps gain more time to strengthen and supply the healthcare system, but is still insufficient.
Extended Data Fig. 1e,f shows that, with stronger socialdistancing countermeasures, able to yield α = 0.200 and γ = 0.086, hence R_{0} = 0.787, now lower than 1, the peak is not delayed, but anticipated, because the increase in the number of new infected is reduced so much that it soon becomes a decrease. Over a 300day horizon, as shown in Extended Data Fig. 1e, the model predicts an evolution of the situation that leads to a peak in the number of concurrently infected individuals around day 50, amounting to 0.092% of the population; the peak in diagnosed infected occurs at day 54 and amounts to 0.083% of the population. Eventually, 0.25% of the population have contracted the virus (and ~0.22% have been diagnosed) and ~0.02% of the population have died because of the contagion. The fraction of patients in need of intensive care, as shown in Extended Data Fig. 1f, reaches its peak on day 85, amounting to 0.04% of the population.
Extended Data Fig. 1g,h shows that, with even stronger socialdistancing countermeasures, α = γ = 0.057, hence R_{0} = 0.0329, significantly lower than 1. Over a 300day horizon, as shown in Extended Data Fig. 1g, the model predicts an evolution of the situation that leads to a peak in the number of concurrently infected individuals around day 25, amounting to 0.057% of the population; the peak in diagnosed infected occurs at day 35 and amounts to 0.048% of the population. Eventually, 0.086% of the population have contracted the virus (and ~0.074% have been diagnosed) and ~0.006% of the population have died because of the contagion. The fraction of patients in need of intensive care, as shown in Extended Data Fig. 1h, reaches its peak on day 64, amounting to 0.02% of the population.
These scenarios, although surpassed, are fundamental to prove that lockdown was an appropriate policy, given that, in the absence of socialdistancing countermeasures, the epidemic could have had tragic outcomes; also, they suggest—for countries early on in the outbreak evolution—that strictly enforcing the lockdown as early as possible leads to enormous benefits with respect to a delayed intervention.
Model sensitivity analysis
We now investigate the sensitivity of the model to parameter variations, focusing in particular on the parameters that can be influenced by policymakers: transmission parameters, related to lockdown measures (α, β, γ and δ), and testing parameters, related to testing and contact tracing policies (ε, θ). To illustrate the effect of changing the parameter values in the model, our sensitivity analysis results are reported in Extended Data Figs. 4–10.
Interestingly, the model is particularly sensitive to variations in the value of α and of ε. Increasing α significantly increases all the curves (Extended Data Fig. 4). Also increasing the other transmission parameters, β, γ and δ, increases all the curves—that is, increases the values of all the variables, point by point, over time (Extended Data Figs. 5–7), although the sensitivity is smaller. All these parameters can be decreased by policymakers, by enforcing lockdown and socialdistancing measures, and stringent safety procedures in hospitals and for home assistance of diagnosed infected.
Conversely, increasing ε significantly decreases all the curves (Extended Data Fig. 8). Also increasing the other testing parameter θ decreases all the curves—that is, decreases the values of all the variables, point by point, over time (Extended Data Fig. 9), but the sensitivity is smaller. These two parameters can be increased by policymakers by enforcing populationwide testing and contact tracing, focused on discovering, respectively, asymptomatic and symptomatic infections. Discovering infected people at an earlier stage appears to help reduce the contagion more.
The other parameters are harder to control with prevention and mitigation strategies (Extended Data Fig. 10). Increasing ζ and η decreases the final number of infected and recovered, but also increases the number of deaths; the number of symptomatic and lifethreatening infections initially increases, to decrease afterwards. Increasing μ and ν decreases the final number of infected and recovered, but also increases the number of deaths; the number of lifethreatening infections initially increases, to decrease afterwards. Increasing λ, as well as the other healing parameters ρ, κ, ξ and σ, decreases all the curves, apart from the curve of recovered patients, which initially increases (due to a higher recovery rate) and then eventually decreases (due to fewer infections overall). Increasing τ leaves all the curves almost unaffected, apart from the curve of lifethreatened infected, which is decreased, also leading to a small decrease in the curve of all infected cases, a decrease in the curve of recovered and an increase in the curve of deaths.
Discussion of the model features
The key feature of our proposed model is the distinction between detected and undetected infection cases, and between cases with different SOI classifications (mild and moderate versus major and extreme). Distinguishing between diagnosed and not diagnosed cases allows us to highlight the perceived distortion in disease statistics, such as the number of infected individuals, the transmission rate and the CFR (the ratio between the number of deaths ascribed to the infection and the number of diagnosed cases). The discrepancy between the actual CFR (total number of deaths due to the infection, divided by the total number of people who have been infected) and the perceived CFR (number of deaths ascribed to the infection, divided by the number of people who have been diagnosed as infected) can be quantified based on this model. Therefore, the model can explain the possible discrepancy between the actual infection dynamics and the perception of the phenomenon. Misperception (either resulting in underestimating or overestimating) can be particularly relevant in the early phases of an epidemic phenomenon due to the lack of thorough information: for example, performing an insufficient number of tests may lead to underestimating the transmission rate (because many infected subjects are not diagnosed as such) and overestimating the CFR (because critical or fatal cases hardly go undetected). The model thus provides a rough quantification of the error in estimating the actual number of infected people due to the lack of proper diagnostic tests, or due to insufficient number of diagnostic tests being performed. Also, it can explain and predict the longterm effects of underdiagnosis, including the (apparently surprising) increased number of infections and fatalities, with sudden outbreaks after long silent periods.
Once the model parameters have been estimated on the basis of the available clinical data, the model enables us to reproduce and predict the dynamic evolution of the epidemic and to evaluate the possible underestimation or overestimation of the epidemic phenomenon based on current statistics, which are heavily subject to bias (for example, asymptomatic patients may get tested according to some protocols, not tested according to others).
The model helps evaluate and predict the effect of the implementation of different guidelines and protocols (for example, more extensive screening for the disease or stricter socialdistancing measures), which typically results in a change in the model parameters.
The model predictions in the long run are not very sensitive to the initial conditions, but they are sensitive to the parameter values (and in particular extremely sensitive to some of these, as our sensitivity analysis has indicated), which are deeply uncertain and can vary due to several factors, such as population density, cultural habits, environmental conditions and age distribution of the population. The predictions must also consider parameter variations due to the measures imposed by the government. This is a fundamental aspect: in the long term, not imposing drastic measures leads to catastrophic outcomes, even when the initially affected population is a small fraction.
Socialdistancing measures are modeled by reducing the infection coefficients α, β, γ and δ. The infection peak time is not monotonic with increasing restrictions. Partial restrictions on population movements postpone the peak, while strong restrictions anticipate the peak. Mild containment measures may have negative effects, for example augmenting the fraction of the population with lifethreatening symptoms with respect to the fraction of population with mild symptoms.
Diagnosis campaigns can reduce the infection peak, because the diagnosed population enters quarantine and hence is less likely to affect the susceptible population.
Reporting Summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
We gathered epidemiological data from the following publicly available data sources: Italian Civil Protection (http://www.protezionecivilfor exampleov.it/mediacomunicazione/comunicatistampa) and the Ministry of Health (http://www.salutfor exampleov.it/portale/home.html). All the epidemiological information we used is documented in the Extended Data and Supplementary Tables. Raw data are reported in Supplementary Table 1 and are also included in Extended Data Fig. 3.
Code availability
The codes are available at http://users.dimi.uniud.it/~giulia.giordano/docs/papers/SIDARTHEcode.zip.
References
Velavan, T. P. & Meyer, C. G. The COVID19 epidemic. Trop. Med. Int. Health 25, 278–280 (2020).
Wu, Z. & McGoogan, J. M. Characteristics of and important lessons from the coronavirus disease 2019 (COVID19) outbreak in China: summary of a report of 72,314 cases from the Chinese center for disease control and prevention. JAMA 323, 1239–1242 (2020).
Guan, W.J.et al. Clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. https://doi.org/10.1056/NEJMoa2002032 (2020).
WHO. Coronavirus Disease 2019 (COVID19): Situation Report 76 (WHO, 2020).
Remuzzi, A. & Remuzzi, G. COVID19 and Italy: what next? Lancet Health Policy 395, 1225–1228 (2020).
Giuffrida, A. & Beaumont, P. Coronavirus: inquiry opens into hospitals at centre of Italy outbreak. The Guardian (26 February 2020).
Ministero della Salute (Italian Ministry of Health). http://www.salute.gov.it/imgs/C_17_notizie_4403_0_file.pdf (5 April 2020).
Italian Civil Protection. Situazione Italia al 5 marzo. http://www.salute.gov.it/portale/nuovocoronavirus/dettaglioNotizieNuovoCoronavirus.jsp?lingua=italiano&menu=notizie&p=dalministero&id=4157 (5 March 2020).
Chronology of main steps and legal acts taken by the Italian Government for the containment of the COVID19 epidemiological emergency. http://www.protezionecivile.gov.it/documents/20182/1227694/Summary+of+measures+taken+against+the+spread+of+C19/c16459ad4e524e9090f3c6a2b30c17eb (accessed 12 March 2020).
Wang, Y., Wang, Y., Chen, Y. & Quin, Q. Unique epidemiological and clinical features of the emerging 2019 novel coronavirus pneumonia (COVID19) implicate special control measures. J. Med. Virol. 92, 568–576 (2020).
Fisman, D., Rivers, C., Lofgren, E. & Majumder, M. S. Estimation of MERSCoronavirus reproductive number and case fatality rate for the Spring 2014 Saudi Arabia outbreak: insights from publicly available data. PLoS Curr. https://doi.org/10.1371/currents.outbreaks.98d2f8f3382d84f390736cd5f5fe133c (2014).
Zhao, S. et al. Preliminary estimation of the basic reproduction number of novel coronavirus (2019nCoV) in China, from 2019 to 2020: a datadriven analysis in the early phase of the outbreak. Int. J. Inf. Dis. 92, 214–217 (2020).
Read, J., Bridgen, J. R., Cummings, D. A. T., Ho, A. & Jewell, C. P. Novel coronavirus 2019nCoV: early estimation of epidemiological parameters and epidemic predictions. Preprint at medRxiv https://doi.org/10.1101/2020.01.23.20018549 (2020).
Zou, L. et al. SARSCoV2 viral load in upper respiratory specimens of infected patients. N. Engl. J. Med. 382, 1177–1179 (2020).
Anderson, R. M. & May, R. M. Infectious Diseases of Humans (Oxford Univ. Press, 1991).
Diekmann, O. & Heesterbeek, J. A. P. Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation (Wiley, 2000).
Hethcote, H. W. The mathematics of infectious diseases. SIAM Rev. 42, 599–653 (2000).
Brauer, F. & CastilloChavez, C. Mathematical Models in Population Biology and Epidemiology 2nd edn (Springer, 2012).
Kermack, W. O. & McKendrick, A. G. A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. 115, 700–721 (1927).
Lin, Q. et al. A conceptual model for the coronavirus disease 2019 (COVID19) outbreak in Wuhan, China with individual reaction and governmental action. Int. J. Inf. Dis. 93, 211–216 (2020).
Anastassopoulou, C., Russo, L., Tsakris, A. & Siettos, C. Databased analysis, modelling and forecasting of the COVID19 outbreak. PLoS One 15, e0230405 (2020).
Casella, F. Can the COVID19 epidemic be managed on the basis of daily data? Preprint at https://arxiv.org/abs/2003.06967 (2020).
Wu, J. et al. Estimating clinical severity of COVID19 from the transmission dynamics in Wuhan, China. Nat. Med. 26, 506–510 (2020).
Hellewell, J. et al. Feasibility of controlling COVID19 outbreaks by isolation of cases and contacts. Lancet Global Health 8, e488–e496 (2020).
Kucharski, A. J. et al. Early dynamics of transmission and control of COVID19: a mathematical modelling study. Lancet Global Health https://doi.org/10.1016/S14733099(20)301444 (2020).
Gumel, A. B. et al. Modelling strategies for controlling SARS outbreaks. Proc. R. Soc. B Biol. Sci. https://doi.org/10.1098/rspb.2004.2800 (2004).
Lan, L. et al. Positive RTPCR test results in patients recovered from COVID19. JAMA https://doi.org/10.1001/jama.2020.2783 (2020).
Peto, J. Covid19 mass testing facilities could end the epidemic rapidly. Br. Med. J. 368, m1163 (2020).
Corman, V. M. et al. Detection of 2019 novel coronavirus (2019nCoV) by realtime RTPCR. Euro Surveill. 25, 2000045 (2020).
Li, Z. et al. Development and clinical application of a rapid IgMIgG combined antibody test for SARSCoV2 infection diagnosis. J. Med. Virol. https://doi.org/10.1002/jmv.25727 (2020).
Roosa, K. et al. Shortterm forecasts of the COVID19 epidemic in Guangdong and Zhejiang, China: February 13–23, 2020. J. Clin. Med. 9, E596 (2020).
Wang, C. et al. Risk management of COVID19 by universities in China. J. Risk Financ. Manag. 13, 36 (2020).
Ji, Y., Ma, Z., Peppelenbosch, M. P. & Pan, Q. Potential association between COVID19 mortality and healthcare resource availability. Lancet Global Health 8, e480 (2020).
Wang, M. et al. Remdesivir and chloroquine effectively inhibit the recently emerged novel coronavirus (2019nCoV) in vitro. Cell Res. 30, 269–271 (2020).
Chang, Y.C. et al. Potential therapeutic agents for COVID19 based on the analysis of protease and RNA polymerase docking. Preprint at Preprints https://www.preprints.org/manuscript/202002.0242/v1
Diao, B. et al. Reduction and functional exhaustion of T cells in patients with coronavirus disease 2019 (COVID19). Preprint at medRxiv https://doi.org/10.1101/2020.02.18.20024364 (2020).
Chen, W.H., Strych, U., Hotez, P. J. & Bottazzi, M. E. The SARSCoV2 vaccine pipeline: an overview. Curr. Trop. Med. Rep. https://doi.org/10.1007/s40475020002016 (2020).
Epidemiologist: ‘Too early to say’ if infected people develop immunity from the coronavirus (interview to David Heymann). The Hill https://thehill.com/policy/healthcare/490059epidemiologisttooearlytosayifinfectedpeopledevelopimmunityfrom (29 March 2020).
Chen, D. et al. Recurrence of positive SARSCoV2 RNA in COVID19: a case report. Int. J. Infect. Dis. 93, 297–299 (2020).
Zhou, L. et al. Cause analysis and treatment strategies of ‘recurrence’ with novel coronavirus pneumonia (covid19) patients after discharge from hospital. Zhonghua Jie He He Hu Xi Za Zhi 43, E028 (2020).
Houser, K. V. et al. Enhanced inflammation in New Zealand white rabbits when MERSCoV reinfection occurs in the absence of neutralizing antibody. PLoS Pathog. 13, e1006565 (2017).
Hemida, M. G. et al. Longitudinal study of middle east respiratory syndrome coronavirus infection in dromedary camel herds in Saudi Arabia, 2014–2015. Emerg. Microbes Infect. 6, e56 (2017).
Subbarao, K. et al. Prior infection and passive transfer of neutralizing antibody prevent replication of severe acute respiratory syndrome coronavirus in the respiratory tract of mice. J. Virol. 78, 3572–3577 (2004).
Ji, N.L. et al. Clinical features of pediatric patients with COVID19: a report of two family cluster cases. World J. Pediatr. https://doi.org/10.1007/s12519020003562 (2020).
Qian, G. et al. A COVID19 transmission within a family cluster by presymptomatic infectors in China. Clin. Infect. Dis. https://doi.org/10.1093/cid/ciaa316 (2020).
Fineberg, H. V. Ten weeks to crush the curve. N. Engl. J. Med. https://doi.org/10.1056/NEJMe2007263 (2020).
Istituto Superiore di Sanità. Recommendations for People in Family Isolation and Their Caregivers http://www.salutfor exampleov.it/portale/nuovocoronavirus/dettaglioNotizieNuovoCoronavirus.jsp?lingua=italiano&menu=notizie&p=dalministero&id=4266 (accessed 4 April 2020).
Pung, R. et al. Investigation of three clusters of COVID19 in Singapore: implications for surveillance and response measures. Lancet 395, 1039–1046 (2020).
Rothe, C. et al. Transmission of 2019nCoV infection from an asymptomatic contact in Germany. N. Engl. J. Med. 382, 970–971 (2020).
Kimball, A. et al. Asymptomatic and presymptomatic SARSCoV2 infections in residents of a longterm care skilled nursing facility. King County, Washington, March 2020. Morb. Mortal Wkly Rep. 69, 377–381 (2020).
Nishiura, H., Linton, N. M. & Akhmetzhanov, A. R. Serial interval of novel coronavirus (COVID19) infections. Int. J. Infect. Dis. 93, 284–286 (2020).
Du, Z. et al. Serial interval of COVID19 among publicly reported confirmed cases. Emerg. Infect. Dis. 26, 6 (2020).
King, A. A. et al. Avoidable errors in the modelling of outbreaks of emerging pathogens, with special reference to Ebola. Proc. Biol. Sci. 282, 20150347 (2015).
Dowd, J. B. et al. Demographic science aids in understanding the spread and fatality rates of COVID19. OSF https://osf.io/fd4rh (2020).
Istituto Superiore di Sanità. Characteristics of COVID19 patients dying in Italy. https://www.epicentro.iss.it/en/coronavirus/sarscov2analysisofdeaths (2020).
Acknowledgements
We acknowledge the huge efforts of the whole COVID19 IRCCS San Matteo Pavia Task Force. ID staff: R. Bruno, M.U. Mondelli, E. Brunetti, A. Di Matteo, E. Seminari, L. Maiocchi, V. Zuccaro, L. Pagnucco, B. Mariani, S. Ludovisi, R. Lissandrin, A. Parisi, P. Sacchi, S.F.A. Patruno, G. Michelone, R. Gulminetti, D. Zanaboni, S. Novati, R. Maserati, P. Orsolini, M. Vecchia. ID residents: M. Sciarra, E. Asperges, M. Colaneri, A. Di Filippo, M. Sambo, S. Biscarini, M. Lupi, S. Roda, T. Chiara Pieri, I. Gallazzi, M. Sachs, P. Valsecchi. Emergency Care Unit (ECU) staff: S. Perlini, C. Alfano, M. Bonzano, F. Briganti, G. Crescenzi, A.G. Falchi, R. Guarnone, B. Guglielmana, E. Maggi, I. Martino, P. Pettenazza, S. Pioli di Marco, F. Quaglia, A. Sabena, F. Salinaro, F. Speciale, I. Zunino. ECU residents: M. De Lorenzo, G. Secco, L. Dimitry, G. Cappa, I. Maisak, B. Chiodi, M. Sciarrini, B. Barcella, F. Resta, L. Moroni, G. Vezzoni, L. Scattaglia, E. Boscolo, C. Zattera, M. Fidel Tassi, V. Capozza, D. Vignaroli, M. Bazzini. Intensive care unit: G. Iotti, F. Mojoli, M. Belliato, L. Perotti, S. Mongodi, G. Tavazzi. Paediatric unit: G. Marseglia, A. Licari, I. Brambilla. Virology staff: D. Barbarini, A. Bruno, P. Cambieri, G. Campanini, G. Comolli, M. Corbella, R. Daturi, M. Furione, B. Mariani, R. Maserati, E. Monzillo, S. Paolucci, M. Parea, E. Percivalle, A. Piralla, F. Rovida, A. Sarasini, M. Zavattoni. Virology technical staff: G. Adzasehoun, L. Bellotti, E. Cabano, G. Casali, L. Dossena, G. Frisco, G. Garbagnoli, A. Girello, V. Landini, C. Lucchelli, V. Maliardi, S. Pezzaia, M. Premoli. Virology residents: A. Bonetti, G. Caneva, I. Cassaniti, A. Corcione, R. Di Martino, A. Di Napoli, A. Ferrari, G. Ferrari, L. Fiorina, F. Giardina, A. Mercato, F. Novazzi, G. Ratano, B. Rossi, I.M. Sciabica, M. Tallarita, E. Vecchio Nepita, P. Marone. Pharmacy unit: M. Calvi, M. Tizzoni. Hospital Leadership: C. Nicora, A. Triarico, V. Petronella, C. Marena, A. Muzzi, P. Lago, S. Cutti, V. Novelli. Data unit: F. Comandatore, G. Bissignandi, S. Gaiarsa, M. Rettani, C. Bandi, A. Ferrari. Also, we acknowledge financial support through Italian grant PRIN 2017 ‘Monitoring and Control Underpinning the EnergyAware Factory of the Future: Novel Methodologies and Industrial Validation’ (ID 2017YKXYXJ).
Author information
Authors and Affiliations
Contributions
G.G., F.B. and P.C. proposed the model and performed the mathematical derivations, the fitting and the simulations. R.B., A.D.F., A.D.M. and M.C. provided firsthand insight into the disease evolution and provided the clinical contextualization and interpretation of the results. All authors wrote and approved the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Jennifer Sargent was the primary editor on this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Alternative scenarios for epidemic evolution.
Epidemic evolution that would have been predicted by the model for the COVID19 outbreak in Italy if, after day 22, socialdistancing countermeasures had been: absent (panels a and b), mild (panels c and d), strong (panels e and f) and very strong (panels g and h). In all cases, the actual Case Fatality Rate is around 7.2%, while the perceived CFR is around 9.0%. Panels (a), (c), (e), (g) show the difference between the actual (real cases) and the perceived (diagnosed cases) evolution of the epidemics, while panels (b), (d), (f), (h) distinguish between the different categories of infected patients. Note the different scales between the panels, having different orders of magnitude, which testify the enormous impact of socialdistancing and lockdown.
Extended Data Fig. 2 Sensitivity analysis with respect to loss of immunity.
Sensitivity analysis showing the effect of introducing lack of immunity (hence, the possibility of reinfection) after day 50: recovered individuals can become susceptible again, so we add a term +χH(t) in equation (1) and a term −χH(t) in equation (7), where χ represents the rate at which immunity is lost. We show the evolution of the various model variables when χ = 0, χ = 0.1, χ = 0.8. Panel (a) shows the variation in the total number of cases, panel (b) in the number of recovered individuals (green) and deaths (black), panel (c) in the total number of currently infected individuals, panels (d)–(h) in the number of infected in different categories. Apart from the number of recovered individuals, which is drastically reduced after loss of immunity, all the other curves are essentially unaffected: the increase in the number of infected and deaths, hence the increase in the number of cumulative infected, is hardly visible.
Extended Data Fig. 3 Model simulation compared to real data.
Comparison between the official data (red dots histogram) and the results with the calibrated SIDARTHE model (blue line). Panel (a): number of reported infected with no (or mild) symptoms, who are quarantined at home. Panel (b): number of reported infected with symptoms, who are hospitalised. Panel (c): number of reported infected with lifethreatening symptoms, admitted to ICU. Panel (d): number of reported recovered individuals. Panel (e): total number of reported infected in all categories. Panel (f): number of cumulative reported cases.
Extended Data Fig. 4 Sensitivity analysis with respect to α.
Sensitivity analysis showing the effect of varying the transmission coefficient α, whose nominal value is α = 0.21, after day 50. We multiply the nominal value of α by 0.5, 0.8, 1, 1.1, and 1.2, and show the corresponding evolution of the model variables. Panel (a) shows the variation in the total number of cases, panel (b) in the number of recovered individuals (green) and deaths (black), panel (c) in the total number of currently infected individuals, panels (d)–(h) in the number of infected in different categories. Increasing α significantly increases all the curves: the model is extremely sensitive to variations in the value of α.
Extended Data Fig. 5 Sensitivity analysis with respect to β.
Sensitivity analysis showing the effect of varying the transmission coefficient β, whose nominal value is β = 0.0050, after day 50. We multiply the nominal value of β by 0.5, 0.8, 1, 1.2, 2, and show the corresponding evolution of the model variables. Panel (a) shows the variation in the total number of cases, panel (b) in the number of recovered individuals (green) and deaths (black), panel (c) in the total number of currently infected individuals, panels (d)–(h) in the number of infected in different categories. Increasing β increases all the curves, although the sensitivity is smaller than with respect to α.
Extended Data Fig. 6 Sensitivity analysis with respect to γ.
Sensitivity analysis showing the effect of varying the transmission coefficient γ, whose nominal value is γ = 0.11, after day 50. We multiply the nominal value of γ by 0.5, 0.8, 1, 1.2, 2, and show the corresponding evolution of the model variables. Panel (a) shows the variation in the total number of cases, panel (b) in the number of recovered individuals (green) and deaths (black), panel (c) in the total number of currently infected individuals, panels (d)–(h) in the number of infected in different categories. Increasing γ increases all the curves, although the sensitivity is smaller than with respect to α and β.
Extended Data Fig. 7 Sensitivity analysis with respect to δ.
Sensitivity analysis showing the effect of varying the transmission coefficient δ, whose nominal value is δ = 0.0050, after day 50. We multiply the nominal value of δ by 0.5, 0.8, 1, 1.2, 2, and show the corresponding evolution of the model variables. Panel (a) shows the variation in the total number of cases, panel (b) in the number of recovered individuals (green) and deaths (black), panel (c) in the total number of currently infected individuals, panels (d)–(h) in the number of infected in different categories. Increasing δ increases all the curves, although the sensitivity is smaller than with respect to α.
Extended Data Fig. 8 Sensitivity analysis with respect to ε.
Sensitivity analysis showing the effect of varying the testing coefficient ε, whose nominal value is ε = 0.2000, after day 50. We multiply the nominal value of ε by 0.75, 0.8, 1, 1.2, 2, and show the corresponding evolution of the model variables. Panel (a) shows the variation in the total number of cases, panel (b) in the number of recovered individuals (green) and deaths (black), panel (c) in the total number of currently infected individuals, panels (d)–(h) in the number of infected in different categories. Increasing ε significantly decreases all the curves: the model is extremely sensitive to variations in the value of ε.
Extended Data Fig. 9 Sensitivity analysis with respect to θ.
Sensitivity analysis showing the effect of varying the testing coefficient θ, whose nominal value is θ = 0.3705, after day 50. We multiply the nominal value of θ by 0.5, 0.8, 1, 1.2, 2, and show the corresponding evolution of the model variables. Panel (a) shows the variation in the total number of cases, panel (b) in the number of recovered individuals (green) and deaths (black), panel (c) in the total number of currently infected individuals, panels (d)–(h) in the number of infected in different categories. Increasing θ decreases all the curves, but the sensitivity is smaller than with respect to ε.
Extended Data Fig. 10 Sensitivity analysis with respect to the other parameters.
Sensitivity analysis showing the effect of varying, after day 50: the worsening coefficients ζ and η leading to clinically relevant symptoms, whose nominal values are ζ = η = 0.0250 (row a); the worsening coefficients μ and ν leading to lifethreatening symptoms, whose nominal values are μ = 0.0080 and ν = 0.0150 (row b); the healing coefficient λ, whose nominal value is λ = 0.0800 (row c); the healing coefficients ρ, κ, ξ and σ, whose nominal values are ρ = κ = ξ = 0.0200 and σ = 0.0100 (row d); the mortality coefficient τ, whose nominal value is τ = 0.0100 (row e). In all cases, the nominal value of all the considered parameters is multiplied by 0.5, 0.8, 1, 1.2, 2, and the corresponding evolution of the model variables is shown. Increasing ζ and η decreases the final number of infected and recovered, but also increases the number of deaths; the number of symptomatic and lifethreatening infections initially increases, to decrease afterwards. Increasing μ and ν decreases the final number of infected and recovered, but also increases the number of deaths; the number of lifethreatening infections initially increases, to decrease afterwards. Increasing λ, as well as the other healing parameters, decreases all the curves, apart from the curve of recovered patients, which initially increases (due to a higher recovery rate) and then eventually decreases (due to less infections overall). Increasing τ leaves all the curves almost unaffected, apart from the curve of lifethreatened infected that is decreased, leading to a small decrease in the curve of all infected cases, the curve of recovered that is decreased and the curve of deaths that is increased.
Supplementary information
Supplementary Information
Supplementary Table 1.
Rights and permissions
About this article
Cite this article
Giordano, G., Blanchini, F., Bruno, R. et al. Modelling the COVID19 epidemic and implementation of populationwide interventions in Italy. Nat Med 26, 855–860 (2020). https://doi.org/10.1038/s4159102008837
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s4159102008837
Further reading

COVID19 and hospital management costs: the Italian experience
BMC Health Services Research (2022)

Human behaviour, NPI and mobility reduction effects on COVID19 transmission in different countries of the world
BMC Public Health (2022)

Reliability of predictive models to support early decision making in the emergency department for patients with confirmed diagnosis of COVID19: the Pescara Covid Hospital score
BMC Health Services Research (2022)

Inference on the dynamics of COVID19 in the United States
Scientific Reports (2022)

Iterative datadriven forecasting of the transmission and management of SARSCoV2/COVID19 using social interventions at the countylevel
Scientific Reports (2022)