Introduction

As one of the key neglected tropical diseases (NTDs)1,2, dengue fever is a viral infection transmitted to humans through the bite of infected mosquitoes, primarily Aedes albopictus and Aedes aegypti3. The global incidence of dengue fever has notably increased over the past two decades. According to the World Health Organization (WHO), reported cases surged tenfold from 2000 to 2019, escalating from 0.5 million to 5.2 million cases4. Since 2023, sustained transmission of dengue, coupled with an unexpected upsurge in cases, has led to over 5 million infections reported across more than 80 countries/territories, which included more than 5000 dengue-related deaths4,5. Therefore, dengue fever poses a significant threat to the health of billions of individuals and has a substantial burden worldwide.

Approximately 75% of the global prevalence for dengue infections locates in Southeast Asia and the Western Pacific regions close to the tropics6,7,8. In China, the expansion of dengue poses a significant public health concern. Prior to the year 2000, occurrences of dengue fever in China were predominantly localized in tropical regions along the southern coastline9. However, there has been a gradual northward progression of dengue outbreaks over time10. An epidemic of dengue fever occurred in China during 2014, with the number of infected people reaching more than 40,00011,12. In addition, our previous study suggests that frequent human travel increases the possibility that individuals will be exposed to disease vectors, allowing the rapid spread of dengue virus (DENV) to susceptible populations13. Some previous studies have identified long-term urbanization, human mobility and population growth are main influential drivers shaping dengue’s distribution and dynamics14,15. In addition, the interactions between recent climate change and key socio-environmental drivers have also been shown to contribute to dengue’s expansion16. Although our previous studies have concerned about the national burden of dengue fever, the studies mainly focused on identifying the connectivity between local and imported cases as well as the fundamental factors for dengue epidemics17,18. In addition, a recent study has projected future risk of dengue related to hydrometeorological conditions in China and quantify its associated disease burden in a future with a changed climate19. However, we have limited knowledge on the time-evolving transmission dynamic features and burden associated with nationwide spread of dengue fever in China up to now.

To understand the transmission of the DENV and better control its progression in the future, it is critical that the epidemiological characteristics underpinning these outbreaks are quantified and analyzed in space and time. Here, we performed this large-scale study to characterize the epidemiology of dengue fever based on national surveillances. A city-resolved metapopulation network model was used to simulate the transmission of dengue infection among the 337 cities in China. The metapopulation network model can flexibly generate patterns of complex population movements between different regions. The model depicts national data on dengue infections and is coupled with an iterative Bayesian inference algorithm, named the ensemble adjustment Kalman filter (EAKF), which assimilates observations of daily cases in each city, as well as human movement between cities12,13. The incorporation of Bayesian inference supports a fitting of the model to case observations and estimation of the unobserved state variables (for example, new infections per day within a city) and system parameters (for example, transmission rate in each city). Our findings provide the most comprehensive quantification to date of the time-evolving epidemiological features and burden associated with successive epidemics waves of dengue fever in China over the period of 2013–2020.

Results

Seasonal and pandemic characteristics of dengue fever

From 2013 to 2020, a total of 95,339 cases of dengue fever were reported in investigated 337 cities in China, including 83,318 local cases and 12,021 imported cases. In terms of spatial distribution, dengue fever cases were mainly distributed in the southeastern part of China, specifically, south of the Hu line (Fig. 1a). Over the span of eight years, there was a noticeable expansion in the geographical spread of dengue cases, with a particular increase observed in imported cases (Figure S2, S3). Almost all cities in south of the Hu line were reported to have imported dengue infections from outside the country in 2019 (Figure S3). In terms of temporal distribution, dengue fever in China usually has an epidemic peak every year, usually between July and November, with sporadic cases at other times (Fig. 1b, c, g). Among 83,318 local dengue cases, 47,100 (56.53%) of the reported dengue cases were laboratory-confirmed cases, 34,986 (41.99%) were clinically diagnosed cases, and 1232 (1.48%) were probable cases. Typically, the diagnosis of reported dengue cases was predominantly laboratory-confirmed. However, during the stronger dengue outbreaks in 2014 and 2019, more than half of the cases were clinically diagnosed (Fig. 1d).

Fig. 1: Spatial and temporal distribution of local and imported cases of dengue fever, 2013-2020.
figure 1

a Spatial distribution of local and imported dengue cases (log-transformed scale). The shades of red in the city blocks indicate the severity of local cases, with three breakpoints: 1.10, 3.22 and 5.88, while the shades of blue in the city blocks indicate the severity of imported cases, with three breakpoints: 0.69, 2.40 and 4.11. The Jenks natural breaking method was used for data classification. The blue dashed line is Hu Line, a geographical demarcation line for population distribution and economic development in China. b Seasonal distribution of dengue cases, plotted as the sum of the weekly number of cases throughout the year from 2013 to 2020. c Time series of weekly cases of dengue. For (a)–(c), natural logarithmic transformation (ln(cases + 1)) is applied. As a widely used data normalization method, logarithm transformation of data was performed to make the data distribution more uniform in order to better explore the seasonal fluctuation characteristics of dengue incidence. d Time series of weekly probable, clinically diagnosed and laboratory-confirmed cases of dengue. The subfigure providing a zoomed-in view of the weekly dengue cases on the vertical coordinate. e Source-sink relationships between original countries with imported dengue cases and Chinese provincial administrations with relatively high imported cases. f Relationships between original countries with imported dengue cases and Chinese provinces with relatively low imported cases. g Temporal distribution of imported dengue cases across provinces.

From 2013 to 2020, Myanmar and Cambodia emerged as the primary original countries with imported dengue cases, with most of imported cases being brought into the Yunnan and Guangdong provinces in China. Other provinces, except Xinjiang and Tibet, also reported dengue fever cases imported from abroad (Fig. 1e, f). The proportion of imported dengue cases from original countries varied in different years. Myanmar emerged as the primary importing country during the period of 2013–2017, while Vietnam replaced this role during 2018–2020 (Figures S4, S5). Dengue cases were mainly imported into Yunnan province in 2013–2017 and to Guangdong province in 2018–2020 (Figures S4, S5). The temporal distribution of imported cases exhibited an upward trend starting in March, reaching a peak in October, followed by a decline (Fig. 1g).

Except for the low incidence of dengue fever among children and adolescents aged 0–19 years, there was no substantial difference in the incidence of dengue fever among local dengue cases across different age groups (Figure S6). The incidence rate was slightly higher in young adults aged 20–49 years in Guangxi than in other age groups, and slightly higher in the individuals over 60 years in Fujian than in other age groups.

The median time from onset to diagnosis was consistent at 4 days (IQR 2–7 days) for both local dengue cases and imported cases. The probability density distribution from onset to diagnosis showed that the time to diagnosis was slightly shorter for reports in the months following the start of the annual dengue outbreak than before the outbreak (Figure S7). The probability density distributions of incidence to diagnosis were exhibited similar for different age groups (Figure S7).

Nationwide, local cases of dengue fever exhibited an annual peak of activity, typically peaking in September. There was no significant difference (P = 0.3514) in the time of reaching the annual peak of dengue epidemic across difference provinces (Fig. 2c, f). With increasing dimensions of dengue epidemics, the annual amplitude fluctuations decreased (P = 0.0001) (Fig. 2a, d). Outside the annual amplitude of dengue, there were more sporadic cases in the south than in the north, with consistent of results for semi-annual periodicity (P = 0.0016) (Fig. 2b, e).

Fig. 2: Characteristics of a latitudinal gradient in the periodicity and peak timing of dengue fever epidemics in China.
figure 2

a Amplitude of the annual periodicity (F = 19.62, P = 0.0001). b Amplitude of the semi-annual periodicity (F = 12.24, P = 0.0016). c Timing of primary annual dengue fever peak, in weeks from 1st January (F = 0.90, P = 0.3514). Colors represent different climatic zones (red = tropical, yellow = sub-tropical, green = warm-temperate, blue = mid-temperate, black = cold-temperate). Symbol size is proportional to the number of dengue cases in each province. The solid black line indicates a linear regression fit (regression weighted by the average annual number of dengue fever cases). The results of the F test and the corresponding P values were employed to testing the significance of the linear regression fit. P values are given on the graphs. d Amplitude of the annual periodicity in each province. e Amplitude of the semiannual periodicity in each province. f Timing for primary annual peak of dengue fever in each province. NA in the legend indicates that the region is temporarily unable to calculate these indicators (No dengue cases are reported in Tibet during the study time, and data for Hong Kong, Macau, Taiwan, Spratly Islands, and other parts of China outside of mainland China are not currently available).

The annual amplitude trend of dengue imported cases is consistent with local cases, showing a trend of strong in the South and weak in the north (P = 0.0179), with no significant difference in the semi-annual periodicity between the north and south regions (P = 0.8988) (Figure S19). The peak of imported dengue cases reached their annual peak later in the southern region than in the northern region, which was statistically significant (P = 0.0015) (Figure S15-S19).

Intrinsic dynamic characteristics of DENV transmission

By fitting the inferred model of DENV transmission to the daily observed case, we successfully captured dengue outbreaks at city, urban agglomerations, and national scales in 2019 (Fig. 3). The peak times of dengue outbreaks were very consistent across regions, but there were still some inconsistencies. For example, in the Yunnan Guangxi border area, the infections number peaked in mid-September, while in the Pearl River Delta, infections peaked in mid-October (Fig. 3c, d). We also estimated the number of monthly total infections (reported and unreported infections) across the country in 2019. Most dengue infections were unreported cases (Fig. 3b). It was estimated that the total number of dengue infections in the country peaks in September, with 25,691 (95% CI 23,214-28,333) new infections in one month (Fig. 3b).

Fig. 3: Model fitting and estimation of national dengue infections.
figure 3

a, cf Model fit for the number of daily dengue cases (orange star symbols) in China as a whole and for four urban agglomerations. The solid purple line indicates the median of the estimates. Dark shaded intervals and light shaded intervals indicate the interquartile range and 95% confidence intervals (CIs) of the estimates, respectively. b Confirmed dengue cases (yellow bars) and estimated monthly total infections (blue bars) in China. Distributions were obtained from 300 ensemble members. The blue bars represent medians and whiskers show 95% CIs.

Next, we focused on these major urban agglomerations and the city scales in 2019. Overall, the estimated number of infections and related parameters across the regions began to rise in June, peaked around September and October, and then declined sharply (Fig. 4). The Yunnan Guangxi border region, the Fujian Zhejiang coast area, and Chongqing city had the highest estimated number of infections in September, with 11,659 (95% CI 10,025-14,019), 3063 (95% CI 2619–3524), and 2527 (95% CI 2141–2971) cases, respectively (Fig. 4a, c, d). The highest estimated number of infections in the Pearl River Delta occurred in October, with 2674 (95% CI 2296–3053) cases (Fig. 4b). Changes in transmission rates were broadly consistent across the four agglomerations regions (Fig. 4e–h). The Reff began to rise gradually in June, exceeding one during the dengue epidemic, and then began to fall in October until it tended to zero (Fig. 4i–l). The force of dengue infection was low in the four regions, with only a small increase during the epidemic (Fig. 4m–p). We also estimated the number of infections in four major urban agglomerations and all 337 cities, as well as parameter inferred information (Table 1, Table S4).

Fig. 4: Estimation of dengue infections and inference of time-varying transmission parameters in four urban metropolitans.
figure 4

ad Confirmed dengue cases (yellow bars) and estimated monthly total infections (blue bars) in four urban metropolitans. Distributions were obtained from 300 ensemble members. The blue bars represent medians and whiskers show 95% CIs. eh Distribution of dengue transmission rates. il Distribution of dengue effective reproduction number. (mp) Distribution of dengue force of infection. The center and box boundaries represent the median (50th), 75th, and 25th percentiles, respectively. In the box plots, the poles of the whiskers represent the distance from the 75th percentile to the maxima and the 25th percentile to the minima, respectively. The posterior estimate for the day is also displayed. The distribution is derived from 300 ensembled members.

Table 1 Parameters estimation derived from transmission model in four major urban agglomerations

The variation of Reff was crucial for revealing the intrinsic transmission dynamics characteristics of DENV. We found that the Reff of dengue fever varied between cities across the nation (Fig. 5). In general, the Reff started to rise in June, reached a maximum around September and October, and then decreased sharply to around zero. Cities with high Reff was concentrated in the south of China, especially some border and coastal cities. The analysis results of the effective reproduction number Reff across cities suggested that the current epidemic of dengue in China is not a long-term sustained epidemic state. The results of model inference further confirm the seasonal fluctuations of dengue epidemic, but also suggest that seasonal dengue transmission are spatially heterogeneous. We similarly gave the change in dengue transmission rates over time by city (Figure S21).

Fig. 5: Estimates of effective reproduction numbers in cities.
figure 5

(ag) Estimates of effective reproduction numbers for dengue on the first day of each month in each city. Blue shades represent the magnitude of the effective reproduction number, while gray shades represent the absence of inference.

We revealed the infections in the main months of seasonal epidemics in each city, as well as the infections throughout the annual period. In August, infections were predominantly observed in coastal and border cities, whereas in September and October, dengue cases were reported in some inland cities located in the southern regions. (Fig. 6a–e). We further depicted the network of population movement and the relationship between the infections number resulting from population movement (Fig. 6f–j). Infections resulting from population movement were mainly found in the southern pivotal cities, which were centers of high population movement, and radiated to around cities. Additional information regarding the network of population movements and the number of infections resulting from population movements throughout the year and is given in the Supplementary Information file (Figure S22).

Fig. 6: Estimates of dengue infections in cities and the relationship between population mobility and dengue infections.
figure 6

(ae) Estimates of the number of dengue infections in each city for each month and for the whole year. Redder colors in the cities indicate more severe infections, and the legends record the infections’ range of specific numbers. (fj) Estimated infections due to population movement in each city and the network of population movement intensity between cities. Estimated infections attributed to population movement in each city, along with a network illustrating the intensity of population movement among cities. Dots represent infections in cities. The thickness of the blue line represents the intensity of population movement between cities. (ko) Population outflows in five cities. The lines and dots indicate the location of the population outflow from the city. Larger pink dots indicate higher population outflows. Note that the South China Sea part in the figure k-o is consistent with the previous figure, which is not shown due to the limited space in the figure.

The model inference results are robust to the setting of parameters and variables (Figure S25-29). Furthermore, we generated simulated outbreaks based on different scenarios. Tests on fitting simulated dengue epidemics demonstrate that the inference method can recover key time-varying parameters in different simulation scenarios (Figure S24). We also did model fitting for dengue case data from 2013 to 2018 and estimated the total number of actual infections per month (Figure S30). The inferred results of this study demonstrated the inter-annual variations in epidemic size from 2013 to 2019, but it exhibited a consistent peak in annual activity over time (Fig. 3 and Figure S30).

Discussion

Our study of more than 90,000 cases of dengue fever reported to the NNDSS during 2013–2020 in China gives a comprehensive quantification of the national burden of dengue disease and its intrinsic dynamics over time up to now. The geographical distribution of dengue cases was expanding in recent years, with obvious changes in the main imported areas as well as external sources of imported cases over the study period. The main external source country of imported cases changed from Myanmar to Cambodia and the main imported province changing from Yunnan to Guangdong. Our large study covers 337 climatologically diverse cities and suggests that although cases of dengue tend to arise in the autumn of the year throughout the country, annual amplitude of epidemics varies by latitude. The peak intensity mainly characterized by high in southern cities and low in northern cities. Our study demonstrated that the metapopulation network model coupled with an iterative Bayesian inference algorithm provided accurate simulations for city-level dengue transmission trajectories in retrospective forecasts of 337 cities in China. The proposed model accurately predicted the magnitude and temporal peak of seasonal epidemics as well as outbreaks of dengue.

Our study revealed that the age profile of dengue infection was in consistency with previous studies found in Asia20,21. Our study indicated a slightly higher incidence of dengue in the elderly in several regions, most likely due to a gradual decline in their immunity with age. A similar situation exists for young adults in Guangxi, which is located in southern China. Overall, there was little difference between the different age groups. The slight difference of dengue incidence among different age groups reminds us that the whole population may be susceptible to dengue infection.

A national, population-representative study of the spatio-temporal disease burden and characteristics of dengue infections could significantly enhance comprehension of the disease dynamics over time scales as well as across geographical regions. By inferring the results of dengue infections in China from 2013–2019, we found that the seasonal trend of dengue epidemics was consistent, but the interannual intensity varied across years. This may due to the fact that local conditions (i.e., meteorological factors, mosquito density, urbanization growth, and other unknown factors) and population mobility jointly determine the epidemic size in most years. Results from similar studies also support the assumption that seasonal epidemics of dengue are accompanied by variations in epidemic size due to inter-annual variations in local conditions and other factors22,23,24. This study also revealed that dengue infection was predominantly occurred in the south-eastern region of China. Over the span of eight years, there was a noticeable expansion in the geographical spread of dengue cases, especially the imported cases. One of the reason for these transmission trends might be the temperature changes due to climatic hazards, which expanded the area suitable for mosquito breeding25,26. Additionally, the uncontrolled urban growth, rapid and global traveling, difficulty of controlling vector spread, and environmental degradation were also potential factors contribute to the spread of dengue18,27,28,29,30. Further research is required to elucidate the underlying factors contributing to the dengue transmission.

Regarding the regional distribution of dengue cases, our analysis based on Bayesian inference algorithm coupled with disease transmission dynamics model revealed dengue outbreaks at the national, city, and major urban agglomerations scale. The results indicated that this acute highly infectious mosquito-borne infection is endemic in several major urban clusters31. In each of the major urban agglomerations, we estimated that the number of infections peaks in September or October, which was broadly in line with our previous studies12,13. In addition, similar trends in the Reff estimated by the model revealed that dengue fever gradually tends to subside around October. The identification of peak timing and the Reff played a vital role in targeting measures to prevent and control dengue outbreaks. As a key indicator, changes of the Reff in cities across the country were also demonstrated. Our research revealed that the cities with high Reff were mainly concentrated in southern China, especially in some border and coastal cities. Despite the incomplete consistency of the mathematical models used, the estimates of Reff in different regions in our study are consistent with the results of some previous studies in partial regions of China32,33,34,35. Dengue burden surveys helped to elucidate the temporal distribution of infected cases, which is likely attributed to factors such as particular temperature and high humidity27,36,37, and these in turn had informed the spatial distribution of dengue. As data from more years and regions become available in the future, we hope to further validate and refine the temporal, spatial and population distributions of dengue fever.

Another potentially important trigger for the incidence of dengue fever was the increased mobility of the population13,38,39. As confirmed by our study, dengue fever spread with a lag in China. Dengue fever in China occurs in coastal and border cities from August to October, and gradually spreads inland. The results of population movement partially underpinned this trend. Population movements could lead to infections mainly in the southern pivotal cities with high population movements and radiate to around cities. These findings might be driven by a number of factors that regulate population mobility, including national policies and economic developments40,41. Further research is needed on this preliminary finding to alleviate the burden of dengue fever.

On the basis of the knowledge of the transmission mechanism of DENV depicted by a set of ODEs12, the mathematical model consisting of both mosquito and human compartments was developed to simulate dengue transmission in this study. We further incorporated the transmission dynamics model with population movement, and then constructed the city-resolved metapopulation network system to simulate the transmission of DENV between cities of China. An iterative Bayesian inference algorithm-the EAKF was applied to assimilate observations of daily dengue cases in each city, as well as population movement between cities. Our previous study demonstrated that the unobserved state variables (for example, new infections per day within a city) and system parameters (for example, transmission rate in each city) of the model can be iteratively calibrated and estimated by a fitting of the model to case observations using Bayesian inference13,42. Our results showed that the method can recover key time-varying transmission parameters of DENV in multiple simulation scenarios. The model fitting to the observed dengue case data accurately captured the successive epidemic waves occurring at national and city scales.

Some limitations in our study should be noted. First, although we used real measurements from 89 mosquito vector surveillance sites in our analysis, the current surveillance of mosquito vector density in some remote areas is not comprehensive, resulting in possible biases in the vector density estimates obtained. Second, this study examined the spread of dengue fever based on large-scale population movements between cities. However, the study was unable to determine the subtle differences between the actual distance individuals traveled, the amount of time spent in different locations, and the frequency of commuting by populations in specific types of habitats (e.g., residential, recreational, and commercial facilities)13. Third, population movement is one of the major triggers for cross-regional transmission of dengue fever. Although we considered cross-regional transmission within the country, international population movement could not be well estimated and applied in the model. It is hoped that the effects caused by population mobility can be elaborated in more detail in further studies. Finally, although our model takes into account the heterogeneity of disease transmission among populations in different regions (e.g., dengue transmission capacity, and so on), the study did not take into account the different effects such as classifications of primary and post-primary dengue cases and cases with different dengue severity on disease transmission since the information is not yet available. Although the data of dengue fever cases used in our study were obtained from the NNDSS, currently the largest national surveillance platform of infectious diseases in China, the influence of misclassification of dengue cases on the estimated results still cannot be totally eliminated.

In conclusion, this study gave a comprehensive quantification of the time-evolving epidemiological features and burden associated with successive epidemics waves of dengue fever in China over recent years. Our study offers robust, population based, national data to understand epidemiological characteristics and transmission dynamics of dengue fever. These efforts will facilitate the transformation of dengue transmission inference into decision-making. Future mitigation policies should take into account the heterogeneity of the disease burden found.

Methods

Ethics statement

This study complies with all relevant ethical regulations. The approval was obtained from the Ethics Committee of National Institute Communicable Disease Control and Prevention Chinese Center for Disease Control and Prevention (Ethical Approval NO. ICDC-2016008).

Study domain

In this study, the unit of data analysis was defined as a prefecture-level administrative region or municipality (hereunder referred to uniformly as cities). A total of 333 prefecture-level administrative regions (including 293 prefecture-level cities, 30 autonomous prefectures, 7 prefectures and 3 leagues) were included in our analysis. We also included 4 municipalities (including Beijing, Shanghai, Tianjin, and Chongqing) in our analysis because dengue fever was active in these places. Therefore, all 337 cities in mainland China were included in this study. Additionally, we divided these 337 cities with severe dengue outbreaks into major urban agglomerations (Supplementary pp 20).

Dengue fever incidence data

Surveillance data on daily incidence of dengue fever during the period from 1st January, 2013 to 31st December, 2020 were obtained from the National Notifiable Diseases Surveillance System (NNDSS) of the Chinese Center for Disease Control and Prevention (China CDC). All dengue fever cases were diagnosed according to the diagnostic criteria for Dengue Fever (WS 216-2008 and WS 216-2018)43,44. Case definition for a probable, clinically diagnosed and laboratory-confirmed case of dengue fever disease can be found in the appendix document (Supplementary pp 6). The comprehensive diagnosis was made according to the epidemiological evidence, clinical manifestations and laboratory examination results of the patients. The information of dengue fever cases includes basic demographic information (gender, age, date of birth, national standard code of individual present residential address, and population classification), disease diagnosis (probable cases, clinically diagnosed cases, and laboratory-confirmed cases), date of disease onset, date of diagnosis, and date of death (if applicable). The daily dengue fever cases were aggregated to each city based on individual present residential address.

Mosquito abundance data

Two types of mosquito abundance data were simultaneously collected specifically for this study and utilized to evaluate mosquito larvae and adult mosquito density in each region. Mosquito adult density (Aedes aegypti and Aedes albopictus) was sampled by mosquito light traps. In high-risk areas with frequent dengue outbreaks in recent years, mosquito vector activity is monitored twice a month. In areas where local cases have occurred in recent years or where the risk of dengue outbreaks is relatively high, mosquito vector activity is monitored once a month. The number of adult mosquitoes was evaluated in several circumstances, such as agricultural areas, livestock sheds, residential areas (> 50 households sampled per month), and parks. Breteau index (BI) was calculated to evaluate the mosquito larvae density45. \(BI={N}^{\ast }/N\times 100\) evaluates the breeding of mosquito larvae in indoor and outdoor small water containers. \({N}^{\ast }\) is the number of Aedes positive water storage containers, and N is the total number of households surveyed (> 50 households sampled per month). All operation was performed according to the standard of National Vector Detection Program46. The locations of 89 survey sites for mosquito vector data are shown in Supplementary Fig. 8.

Meteorological surveillance data

Daily meteorological data including ambient temperature, daily maximum temperature and daily minimum temperature from 1st January, 2013 to 31st December, 2019 were obtained from the China Meteorological Data Sharing System (http://data.cma.cn/). This dataset comprises information collected from 2441 meteorological stations in mainland China, and organized based on their respective administrative regions (Supplementary Fig. 8). In cases where multiple meteorological stations are available within a single administrative region, the regional average, maximum, and minimum temperature were calculated as the average, maximum, and minimum values of multiple stations, respectively.

Demographic and population mobility data

Demographic and population mobility data was collected from the results of China’s seventh census in 2020, which can be publicly obtained from the National Bureau of Statistics of China (http://www.stats.gov.cn/). In order to match this outflow and inflow of population to our prefecture-level administrative and to obtain a more accurate and detailed understanding of the daily population commute, we obtain daily city-level population migration data from the Gaode Migration Big Data platform (https://trp.autonavi.com/migrate/page.do). The platform of Gaode Migration Big Data is based on location-based service (LBS), which collects the geographical location changes of users’ mobile phone positioning to depict the migration intensity between cities. To ensure privacy and confidentiality, the Gaode platform only provides the weight parameter of the daily flow scale between cities, named migration index. Hence, we calculate the daily inflow and outflow population between city by \({n}_{A\leftarrow B}(t)={\eta }_{A\leftarrow B}(t)/{\sum}_{k\ne A}{\eta }_{A\leftarrow k}(t)\times {\rho }_{A}\). Here, \({n}_{A\leftarrow B}(t)\) is the migration index from city B to city A on day t. The \({\rho }_{A}\) is a static inflow population of city A from the census data. Noted that since the platform was opened around June 2018, to reduce the impact of the COVID-19 pandemic on population mobility patterns, we selected the migration index for the whole year from 1st June, 2018 to 31st May, 2019 to match each calendar year. We assume that the population mobility patterns of each calendar year during the study period were similar.

Descriptive analysis of local and imported dengue cases

We included all cases of dengue fever from 1st January, 2013 to 31st December, 2020 in our analysis. We categorized all cases into local and imported cases and assessed the temporal and spatial geographic distribution in 31 provincial administrations. We categorized cities into 4 × 4 categories according to the number of local and imported cases in each city by using the method of Jenks natural discontinuity classification47, and a map was generated. To quantify the seasonal pattern of local dengue cases in each region, we plotted heat maps of dengue cases throughout the year and on a weekly basis for local cases of dengue. We also stratified local cases by different diagnostic categories. We stratified imported cases by original country and imported Chinese provincial administrations to map the source-sink relationship of dengue imported cases.

Latitudinal gradient characteristics of dengue cases distribution

We also fitted models of seasonal multiple linear regression to weekly time series of local and imported dengue cases (Supplementary pp 11), including time trends and harmonic terms to represent annual and semiannual periodicity48,49,50. We extracted the amplitudes and peak times of the annual and semiannual cycles based on the model coefficients48. The amplitude measures the difference between the maximum and minimum values of the epidemic curve for the disease50. The F-test method was applied to explore the statistical significance of seasonal multiple linear regression models and determine whether the overall linear relationship holds.

Metapopulation network modeling for local dengue cases

We developed a metapopulation network model focusing on simulating the spread of dengue fever at prefecture city level in mainland China in 2019. This model extends the traditional isolated compartment model by treating each city as a subpopulation and interconnecting then through population movement, allowing for superior performance even when the internal space does not satisfy the assumption of population homogeneity. It has shown superiority in tracking the large-scale spatial spread of infectious disease12,13,42.

Dengue fever is an infectious disease transmitted by mosquitoes, primarily Aedes aegypti and Aedes albopictus. DENV is represented by four distinct serotypes (DENV-1 to DENV-4) that co-circulate in tropical and sub-tropical regions and can repeatedly infect humans. Transmission processes are divided into two steps: infected individuals infect susceptible mosquitoes, then infected mosquitoes infect susceptible individuals. This assumption is based on evidence that infection with any serotype of DENV leads to a short period of heterotypic immunity (that is, immunity against another serotype)51 and the reality that four DENV serotypes are commonly transmitted across different regions in mainland China52. Therefore, a mathematical model consisting of both mosquito and human compartments (SI-SIR model) is considered in our study to simulate dengue transmission. Susceptible (\({S}_{H}\)), infected (\({I}_{H}\)) and removed (\({R}_{H}\)) humans are included in the human compartment.

The mosquito compartments consist of both susceptible (\({S}_{M}\)) and infected (\({I}_{M}\)) mosquitoes. This model has been validated in previous studies12,13. The SI-SIR model comprises the following system of ordinary differential equations (ODEs):

$$\frac{d{S}_{M}}{dt}= -\frac{\lambda \tau (t){\beta }_{M}{S}_{M}{I}_{H}}{{N}_{H}}-\alpha {S}_{M}+{\mu }_{b}(t)({S}_{M}+(1-U){I}_{M})-{\mu }_{d}{S}_{M}\\ \frac{d{I}_{M}}{dt}= \frac{\lambda \tau (t){\beta }_{M}{S}_{M}{I}_{H}}{{N}_{H}}+\alpha {S}_{M}+{\mu }_{b}(t)U{I}_{M}-{\mu }_{d}{I}_{M}\\ \frac{d{S}_{H}}{dt}= -\frac{\lambda \tau (t){\beta }_{H}{S}_{H}{I}_{M}}{{N}_{H}}\\ \frac{d{I}_{H}}{dt}= \frac{\lambda \tau (t){\beta }_{H}{S}_{H}{I}_{M}}{{N}_{H}}-\frac{{I}_{H}}{D}$$

Here, \({N}_{H}\) represents the total population number; \(\lambda\) is the adjusted transmission rate between humans and mosquitoes; \(\tau (t)\) is the biting rate; \({\beta }_{M}\) is the probability for the infection transmission from human to mosquitoes; \({\beta }_{H}\) is the probability for the infection transmission from mosquitoes to human; \(\alpha\) is the random rate of DENV seeding into mosquitoes; \({\mu }_{b}(t)\) is the birth rate of mosquitoes at timet; \({\mu }_{d}\) is the mortality rate of mosquitoes; U is the vertical dissemination rate of infected mosquitoes and is constant over an outbreak; D is the average duration of contagiousness for human.

To use the compartment model across the national cities, it is necessary to take into account population movements between cities, for which we extend the isolated model to a metapopulation model. In this extended model, we included two types of population movements, daily regular commuting and diffusive random movements across the 337 cities in China42. The daily regular commuting is relatively stable in terms of demographic composition, and travel regularly between places of residence and work. In contrast, the diffusive random movements circulate among all subpopulations, following a Markov process specified by a transition matrix. Previous studies have shown that these two patterns of population movement have qualitatively different effects on the spatial spread of infectious diseases53,54. We therefore take both patterns of population movement into account.

The population exchange pattern by regular commuting is defined as follows. During the daytime, the population in location i, \({N}_{i}^{Hd}(t)={N}_{ii}^{H}+{\sum}_{k\ne i}{I}_{ki}^{Hr}(t)+{\sum}_{k\ne i}({N}_{ik}^{H}-{I}_{ik}^{Hr}(t))\), consists of three parts: individuals who live and work in location i; reported infected individuals who would otherwise move to location \(k(k\, \ne \, i)\); unreported infected individuals who work in location i. For the subpopulation \({N}_{ij}\), the new infected mosquito come from both the reported and unreported infections in location i. For each susceptible mosquito \({S}_{i}^{M}\) in population \({N}_{i}\), the probability of contacting reported infected individuals is \({\sum}_{k}{I}_{ki}^{Hr}(t)/{N}_{i}^{Hd}(t)\), where \({\sum}_{k}{I}_{ki}^{Hr}(t)\) is the reported infected individuals who would otherwise move to location \(k(k\, \ne \, i)\), and the probability of contacting unreported infected individuals is \({\sum}_{k}{I}_{ik}^{Hu}(t)/{N}_{i}^{Hd}(t)\), where \({\sum}_{k}{I}_{ik}^{Hu}(t)\) is the unreported infected individuals who work in location i. Those contacts lead to new infected mosquito \(\frac{{\lambda }_{i}(t){\tau }_{i}(t){\beta }_{i}^{M}(t){S}_{i}^{M}(t){\sum}_{k}({I}_{ki}^{Hr}(t)+{I}_{ik}^{Hu}(t))}{{N}_{i}^{Hd}(t)}\) during a period of \(d{t}_{1}\) day. For each susceptible individuals \({S}_{ij}^{H}(t)\) in population \({N}_{i}\), the probability of contacting infected mosquito is \({I}_{i}^{M}(t)/{N}_{i}^{Hd}(t)\). Those contacts lead to new infected individuals \(\frac{{\lambda }_{i}(t){\tau }_{i}(t){\beta }_{i}^{H}(t){S}_{ij}^{H}(t){I}_{i}^{M}(t)}{{N}_{i}^{Hd}(t)}\) during a period of \(d{t}_{1}\) day.

The population exchange pattern by random movement is defined as follows. Apart from regular commuting, during the daytime, \(\theta d{t}_{1}{\bar{N}}_{ik}^{H}\) individuals (except for reported infected individuals) drawn from the location \(k(k \, \ne \, i)\) move to location i, are randomly redistributed into the subpopulation here. This population exchange pattern exists between any two locations. For instance, we compute the number of susceptible individuals entering into subpopulation \({S}_{ij}^{H}(t)\). In other location \(k(k \, \ne \, i)\), the probability that a random traveler is susceptible is \({\sum}_{l}{S}_{kl}^{H}(t)/({N}_{i}^{Hd}(t)-{\sum}_{l}{I}_{lk}^{Hr}(t))\), where the numerator \({\sum}_{l}{S}_{kl}^{H}(t)\) is the number of susceptible individuals from all other locations l to the location k, and the denominator \({N}_{k}^{Hd}(t)-{\sum}_{l}{I}_{lk}^{Hr}(t)\) is the total mobile population (total population minus reported infected individuals) in location k. Therefore, the total number of susceptible individuals entering location i is \(\theta d{t}_{1}{\sum}_{k\ne i}{\bar{N}}_{ik}^{H}{\sum}_{l}{S}_{kl}^{H}(t)/({N}_{k}^{Hd}(t)-{\sum}_{l}{I}_{lk}^{Hr}(t))\). These individuals are redistributed into subpopulations present in location i, where the proportion of population in subpopulation \({N}_{ij}\) is \(({N}_{ij}^{H}-{I}_{ij}^{Hr}(t))/{N}_{i}^{Hd}(t)\). The number of susceptible individuals entering the subpopulation \({S}_{ij}^{H}(t)\) is calculated by \(\theta d{t}_{1}\frac{{N}_{ij}^{H}-{I}_{ij}^{Hr}(t)}{{N}_{i}^{Hd}(t)}{\sum}_{k\ne i}\frac{{\bar{N}}_{ik}^{H}{\sum}_{l}{S}_{kl}^{H}(t)}{{N}_{k}^{Hd}(t)-{\sum}_{l}{I}_{lk}^{Hr}(t)}\).

In addition, we calculate the number of susceptible individuals leaving the subpopulation \({S}_{ij}^{H}(t)\). The \(\theta d{t}_{1}{\sum}_{k\ne i}{\bar{N}}_{ki}^{H}\) individuals are redistributed leaving location i. The proportion of susceptible people from \({N}_{ij}\) is \({S}_{ij}(t)/({N}_{i}^{Hd}(t)-{\sum}_{l}{I}_{li}^{Hr}(t))\). Finally, the number of susceptible persons leaving the subpopulation \({S}_{ij}(t)\) is calculated by \(\theta d{t}_{1}\frac{{S}_{ij}^{H}(t)}{{N}_{i}^{Hd}(t)-{\sum}_{l}{I}_{li}^{Hr}(t)}{\sum}_{k\ne i}{\bar{N}}_{ki}^{H}\).

Random movement for other human compartments can be computed similarly. Note there is no random movement in reported infected individuals \({I}_{ij}^{Hr}(t)\) and mosquito compartments because we assume infections and mosquitoes were not able to move long distance. Similarly, we can also describe the human and mosquito transfer process at nighttime.

These regular commuting and random movement only occur between human populations, by comparison, mosquitoes generally don’t move long distances. Therefore, this study assumed that the mosquitoes are fixed in a single city from birth to death. We also distinguish between reported infected and unreported infected individuals, and defined different movement patterns for them. To better fit real case observations as well as unobservable estimates of state variables and system parameters, we coupled a city-resolved metapopulation model with an iterative Bayesian inference algorithm, named the EAKF55. The model is able to depicts both reported and unreported infections and recover key epidemiologic time-varying parameters12,13,56,57,58,59. The specific details of the extended metapopulation network model are provided in the Supplementary Information pp12-15.

Transmission dynamics and parameters of dengue fever

The transmission model estimates the total infections (both reported and unreported) in China and the four major urban agglomerations. The estimated infection due to population movements is calculated as follows: \({I}_{{{{\rm{move}}}}}={I}^{Hr}+{I}^{Hu}-{I}_{ii}^{Hr}-{I}_{ii}^{Hu}\), which is simply the total infectious in each city minus the infectious caused by local commuters. The transmission rate \({\beta }_{i}(t)\) from mosquitoes to human in location i can be estimated by \({\beta }_{M\to H}(t)={\lambda }_{i}(t){\tau }_{i}(t){\beta }_{i}^{H}(t)\). Similarly, the transmission rate from humans to mosquitoes \({\beta }_{H\to M}(t)={\lambda }_{i}(t){\tau }_{i}(t){\beta }_{i}^{M}(t)\).

For a constant transmission rate, the basic reproduction number (\({R}_{0}\)) derived from the next generation matrix60 is estimated by the formula \({R}_{0}=\sqrt{\frac{{\beta }_{M\to H}{\beta }_{H\to M}{N}_{M}D}{{N}_{H}({\mu }_{d}-{\mu }_{b}U)}}\). As time evolves, the basic reproduction number \({R}_{0}\) cannot track the progress of the transmission because the susceptible individuals and mosquitoes (\({S}_{H}\) or \({S}_{M}\)) are gradually consumed, become infected and finally removed from the disease transmission cycle. Therefore, the effective reproduction number (Reff) was defined to track the evolution of the indicator61 in the following equation

$${R}_{eff}={R}_{0}\sqrt{\frac{{S}_{i}^{H}(t)}{{N}_{i}^{H}}\frac{{S}_{i}^{M}(t)}{{N}_{i}^{M}}}=\sqrt{\frac{{\lambda }_{i}^{2}(t){\tau }_{i}^{2}(t){\beta }_{i}^{M}(t){\beta }_{i}^{H}(t){S}_{i}^{M}(t){S}_{i}^{H}(t)D}{({\mu }_{d}-{\mu }_{b}(t)U){({N}_{i}^{H}(t))}^{2}}},$$

where \({S}_{i}^{H}(t)={S}_{ii}^{H}(t)+{\sum}_{k\ne i}{S}_{ik}^{H}(t)\) is the number of susceptible individuals in location i; \({N}_{i}^{H}(t)={N}_{ii}^{H}+{\sum}_{k\ne i}{I}_{ki}^{Hr}(t)+{\sum}_{k\ne i}({N}_{ik}^{H}-{I}_{ik}^{Hr}(t))\) is the number of populations in locationi. For this calculation, \({S}_{i}^{H}(t)={N}_{i}^{H}(t)\) is assumed because the infected individuals are less than 3‰ populations in each location, and then the indicator Reff can be calculated by the following equation

$${R}_{eff}={R}_{0}\sqrt{\frac{{S}_{i}^{H}(t)}{{N}_{i}^{H}}\frac{{S}_{i}^{M}(t)}{{N}_{i}^{M}}}=\sqrt{\frac{{\lambda }_{i}^{2}(t){\tau }_{i}^{2}(t){\beta }_{i}^{M}(t){\beta }_{i}^{H}(t){S}_{i}^{M}(t)D}{({\mu }_{d}-{\mu }_{b}(t)U){S}_{i}^{H}(t)}},$$

where \({R}_{eff}\) represents the average number of generations of infected people caused by each imported or local case on a given day in each city.

Furthermore, we defined the force of infection (\({F}_{i}(t)\)) based on this transmission model \({F}_{i}(t)=\frac{{\lambda }_{i}(t){\tau }_{i}(t){\beta }_{i}^{H}(t){I}_{i}^{M}(t)}{{N}_{i}^{H}(t)}=\frac{{\lambda }_{i}(t){\tau }_{i}(t){\beta }_{i}^{H}(t){I}_{i}^{M}(t)}{{S}_{i}^{H}(t)}\). This indicator \({F}_{i}(t)\) represents the probability of a susceptible person becoming infected with dengue, depending on the likelihood that the susceptible person acquires the DENV from an infected mosquito. Note that the above \({R}_{eff}\) and \({F}_{i}(t)\) ignore the influence of random movement.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.