Abstract
In this work, we aim to accurately predict the number of hospitalizations during the COVID19 pandemic by developing a spatiotemporal prediction model. We propose HOIST, an Ising dynamicsbased deep learning model for spatiotemporal COVID19 hospitalization prediction. By drawing the analogy between locations and lattice sites in statistical mechanics, we use the Ising dynamics to guide the model to extract and utilize spatial relationships across locations and model the complex influence of granular information from realworld clinical evidence. By leveraging rich linked databases, including insurance claims, census information, and hospital resource usage data across the U.S., we evaluate the HOIST model on the largescale spatiotemporal COVID19 hospitalization prediction task for 2299 counties in the U.S. In the 4week hospitalization prediction task, HOIST achieves 368.7 mean absolute error, 0.6 \({R}^{2}\) and 0.89 concordance correlation coefficient score on average. Our detailed number needed to treat (NNT) and cost analysis suggest that future COVID19 vaccination efforts may be most impactful in rural areas. This model may serve as a resource for future county and statelevel vaccination efforts.
Similar content being viewed by others
Introduction
The COVID19 pandemic has caused enormous social and economic loss. With over 90 million confirmed cases and 1 million deaths in the U.S. by Aug 2022^{1}, the pandemic placed a heavy burden on national and global healthcare systems. The disparities in medical resource availability among U.S. counties—including ventilators, hospital beds, and critical care staff—can and, in some cases, have devastatingly impacted patient outcomes^{2}. Nationwide vaccination efforts have primarily favored urban areas, and the urbanrural disparity in vaccination coverage has continued to worsen throughout the pandemic^{3}. Rural vaccination efforts are technically challenging due to a combination of more limited access to healthcare, generally lower perception of the severity of COVID19 by residents of rural communities, and usually higher levels of vaccine hesitancy^{3}. This multiyear trend of relative rural underimmunization raises the question of whether further vaccination efforts might be more impactful in these underimmunized areas regarding hospitalization, death, and costeffectiveness. Answering this question is technically challenging but clinically meaningful; if counties can be stratified in terms of improved outcomes per vaccination, this may provide meaningful insight into how local and statewide efforts might be distributed to maximize impact. To accomplish this task, we propose the following technical contributions:

1.
Extracting interregion similarities. The pandemic progression might be similar in regions with similar geographical and socioeconomic properties. If so, extracting and utilizing these similarities would help the model better predict pandemic progression in the future. Most existing spatiotemporal pandemic prediction methods use predefined location graphs to explicitly inform the spatial connections to the model^{4,5,6,7}. In contrast, these manually defined location graphs may differ significantly from the groundtruth interregional relationships. Other works learn the spatial connectivity purely based on the historical pandemic progressions^{8,9}, regardless of the underlying social, economic, or geographical similarities, which cannot capture the interregion similarities in full view and lead to inferior prediction results. Therefore, it is nontrivial to model spatial dependencies with multisource background data in a flexible way.

2.
Utilizing complex innerregion influence factors. Local COVID19 hospitalization rates are likely affected by many factors, including the number of infected patients, highrisk patient population size, and vaccination rates. Estimating these effects becomes more challenging with more granular information, such as the number of COVID19 immunizations per individual. Leveraging these complex innerregion influence factors, such as medical claims and vaccination statistics, may significantly improve hospitalization prediction performance. Recent research combines the susceptibleinfectedremoved (SIR) and susceptibleexposedinfectedremoved (SEIR) differential models with deep learning models to improve predictions by simulating realworld SIR dynamics^{4,5,6,7}. However, when applied to hospitalization prediction tasks, these models face the following issues: (1) Limited data sources and model parameters in the traditional epidemiological model make it difficult to utilize these complex highdimensional innerregion factors effectively. Ignoring complex innerregion factors may cause the model to fail to capture the underlying disease distribution, virus subvariants, and vaccination effectiveness rates across the state, which could lead to biased results. While purely deep learningbased timeseries prediction models such as recurrent neural networks (RNN) can handle highdimensional data and extract complex nonlinear relationships, they ignore realworld progression dynamics and have minimal interpretability compared to epidemiologybased models. (2) Most epidemiological models offer a simple increasingdecreasing trend with different rates, while the hospitalization curves are more complex and may not follow this simple trend.

3.
Moreover, the SIR dynamics are not designed for spatiotemporal prediction tasks, so these models cannot directly utilize for hospitalization prediction.
To address the above challenges, we propose an Ising dynamicsbased deep learning model for spatiotemporal COVID19 hospitalization prediction. We have combined multiple data sources, including disease and vaccination statistics from realworld medical claims, medical resource usage, census, and geographical and mobility data, to create an evidence hub for the ability to train a complex evidencedriven spatiotemporal prediction model, HOIST. The proposed HOIST model can handle complex innerregional influence factors from multiple data sources and adaptively learn the interregional relationships using the locational census and geographical and interlocational mobility data without requiring a predefined location graph with fixed edges.
The model learning process is guided by the Ising dynamics, a mathematical statistical mechanics model to estimate site spin configurations in square lattices^{10}. By drawing the analogy between locations and lattice sites, we find the Ising dynamics a natural choice to model the innerregion factors and interregion similarities simultaneously: (1) the Ising dynamics guide the model to extract and utilize interregion spatial relationships by taking prediction results from similar locations as kinetic energy; (2) the Ising dynamics model the complex influence of granular innerregion factors from the realworld clinical evidence as potential energy (i.e., external fields). Both kinetic energy and potential energy jointly decided the energy of the location (predicted hospitalization case to increase or decrease). The Ising dynamics in this paper are used as a regularizer to guide the model learning and prediction process, which enables the model to learn complex nonlinear patterns flexibly. We use recurrent neural networks (RNNs) to learn the parameters of the Ising model, so our model can also extract and utilize temporal patterns in the data.
The overview of HOIST is shown in Fig. 1. The HOIST model first uses static data to calculate the distances between locations in the latent space. These distances are further normalized into 0 and 1, which indicate the connectivity between locations learned by the model. The dynamic data are used to estimate the external fields (EFs) of locations and used to generate predictions with LSTM. The estimated EFs and the learned connectivity are used to calculate the Ising dynamic loss, which is then used to regularize the prediction results and better model the realworld connectivity and effects of various influence factors.
We evaluate the HOIST model on the largescale spatiotemporal COVID19 hospitalization prediction task for 2299 counties in the United States. This scale is much larger and more granular than existing COVID19 spatiotemporal predictive works. In the 4week hospitalization prediction task, HOIST achieves 368.7 MAE, 0.6 \({R}^{2}\) and 0.89 CCC scores. We also conduct experiments with different lengths of prediction windows from 1 to 5 weeks. Compared to the best baseline model, HOIST achieves 48% lower MAE, 65% lower MSE, 272% higher \({R}^{2}\) and 51% higher CCC on average. The prediction performances under the temporal data split setting show that HOIST can consistently achieve low prediction error regardless of the underlying data distribution shift caused by new virus variants. These results suggest the HOIST model can accurately predict both longterm trends and shortterm variations, enabling broader realworld applications.
Unlike totally blackbox deep learning timeseries models, the fusion of realworld dynamics in HOIST allows us to look deeper into the model to see how various influence factors affect the model predictions. By analyzing the weights of external fields, we find that the booster vaccination rate has a more significant negative correlation with future hospitalization cases (i.e., more profound impact) compared to the first and second vaccinations in the series. We also find that the effects of two major vaccination brands, Pfizer and Moderna, have no statistical difference in our model. By simulating varying immunization rates, we conduct a detailed analysis of the marginal benefit of the vaccination ratio by answering two questions: (1) How many more vaccinations are necessary to prevent one hospitalization case for a specific location? (2) What is the cost ratio between these vaccinations and the average COVID19 hospitalization for a specific location? Quantitatively, our model shows that increasing the vaccination ratio by 10% can reduce the number of current hospitalization cases by 15% on average for all locations. We also find that the cost ratio is generally highest in much of the rural Midwest and Rock Mountain regions, suggesting that prioritization of vaccine efforts in these counties could most significantly reduce the overall statewide healthcare financial burden of COVID19. For 368 counties among 43 states all over the United States, rural vaccination outreach efforts are likely costsaving endeavors. We believe these results and our model may inform clinicians, healthcare institutions, and policymakers to improve their decisions and ultimately reduce the negative economic and health impacts caused by the pandemic. All detailed countylevel analysis results are available at the online visualization platform at https://v1xerunt.github.io/HOIST/.
Results
Problem formulation and data sources
We develop the HOIST model to predict the total number of COVID19 hospitalization cases in an upcoming 4week period at a county level across the United States. Throughout this paper, we use \(N\) to denote the number of locations (counties) and \(T\) to denote the length of timesteps (days). The model uses static location background data to adaptively learn location connectivity and then uses dynamic data to learn the temporal patterns. The model uses the learned spatiotemporal patterns to conduct predictions. Concretely, we formulate model inputs and the prediction task as follows:
Input 1 (static data)
The static data include distance matrices \({{{{{\bf{D}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{\bf{G}}}}}}\in {{\mathbb{R}}}^{N\times N}\), population demographics statistics \({{{{{\bf{M}}}}}}\in {{\mathbb{R}}}^{N\times 9}\) and economics and healthcare statistics \({{{{{\bf{E}}}}}}\in {{\mathbb{R}}}^{N\times 4}\). \({{{{{\bf{D}}}}}}\) is the geographical distance matrix and \({{{{{\bf{G}}}}}}\) is the mobility distance matrix. The geographical distances are the Haversine distances between \(N\) locations. The mobility distance includes the average mobility flows between \(N\) locations during 2020 and 2021. The mobility scores are collected from the Multiscale Dynamic Human Mobility Flow Dataset, which analyzed millions of anonymous mobile phone users’ visits to various places provided by SafeGraph^{11}. The census features \({{{{{\bf{M}}}}}}{{{{{\boldsymbol{,}}}}}}{{{{{\bf{E}}}}}}\) include populations of different age and race groups and medical resource statistics, which are collected from the countylevel census dataset provided by ref. ^{12}.
Input 2 (dynamic data)
All dynamic data are threedimensional tensors, including the daily new infected COVID19 case counts \({{{{{\bf{I}}}}}}{{{{{\boldsymbol{\in }}}}}}{{\mathbb{R}}}^{N\times T\times 1}\), medical claim statistics \({{{{{\bf{C}}}}}}{{{{{\boldsymbol{\in }}}}}}{{\mathbb{R}}}^{N\times T\times 20}\), vaccination statistics \({{{{{\bf{V}}}}}}{{{{{\boldsymbol{\in }}}}}}{{\mathbb{R}}}^{N\times T\times 17}\), and medical resource usage statistics \({{{{{\bf{U}}}}}}\in {{\mathbb{R}}}^{N\times T\times 4}\). The daily new cases \({{{{{\bf{I}}}}}}\) are collected from the Johns Hopkins COVID19 Data Repository^{13}. Claim statistics \({{{{{\bf{C}}}}}}\) include daily statistics of total patients, patients older than 65, and patients with certain comorbidities. The age threshold and comorbidities are derived from the CDC COVID19 guidelines^{14} and the Charlson Comorbidity Index^{15}. The vaccination statistics \({{{{{\bf{V}}}}}}\) include daily statistics of 13 different vaccination CPT codes. Both vaccination and claim statistics are collected from IQVIA’s realworld claims dataset^{16}. The medical resource usage statistics \({{{{{\bf{U}}}}}}\) include the usage of inpatient and ICU beds and their usage for COVID19 patients, which are collected from HealthData.gov^{17}. Though \({{{{{\bf{U}}}}}}\) also provides information for hospitalization cases, we use \({{{{{\bf{C}}}}}}\) instead of \({{{{{\bf{U}}}}}}\) as the groundtruth case numbers. This is because \({{{{{\bf{U}}}}}}\) only provides weekly statistics, and the statistics are collected from fewer healthcare institutions compared to the claims data. We include medical resource usage statistics \({{{{{\bf{U}}}}}}\) to help reduce the potential biases in the claims data.
The feature list and statistics can be found in Supplementary Tables 1–4.
Task (spatiotemporal COVID19 hospitalization prediction)
Given all static and dynamic data of \(N\) locations and \(T\) timesteps, our task is to predict the total number of COVID19 hospitalization cases in future \(L\) days for each of the N locations, denoted as \({{{{{\bf{y}}}}}}\in {{\mathbb{R}}}^{N}\). Since the claims data do not show the primary cause for hospitalizations, we define a hospitalization case as a patient hospitalized within 35 days after the COVID19 diagnosis in the claims dataset, and we calculate the total hospitalization cases for each location to get \({{{{{\bf{y}}}}}}\). This setting is inspired by previous COVID19 hospitalization prediction works^{18,19}.
Experiment settings
In the experimental phase, we extract all the required data from September 2020 to May 2022. We set the input window \(\tau\) to 5 weeks and the prediction window \(L\) to 4 weeks. We select counties using the FIPS codes. The final number of counties is 2299. We only exclude counties that have zero cases for all timesteps. Note that our experiment scale is much larger and more granular than existing works^{4,5}, which allows for a wider scope of possible applications.
We use the mean square error (MSE), mean absolute error (MAE), the coefficient of determination (\({R}^{2}\)), and the average concordance correlation coefficient (CCC) to evaluate model prediction. The CCC and \({R}^{2}\) are computed as:
where \({\mu }_{\hat{y}}\) and \({\mu }_{y}\) are the means for the predictions and ground truth, and \({\sigma }_{\hat{y}}^{2}\) and \({\sigma }_{y}^{2}\) are the corresponding variances. \(\rho\) is the correlation coefficient between the two variables \(\hat{y}\) and \(y\). Note that the range of \({R}^{2}\) is \((\infty,1)\), so an extreme value may significantly affect the average value. In contrast, the range of CCC is \(({{{{\mathrm{1,1}}}}})\), so this will be less affected by extreme values.
We split the sequence into train, validation, and test sets in a 3:1:1 ratio. The training sequences are from Sep 04, 2020, to Sep 02, 2021; the validation sequences are from Sep 03, 2021, to Dec 23, 2021, and the test sequences are from Dec 24, 2021, to Apr 15, 2022. We train the model on the training set and save the model and hyperparameters with the best performance on the validation set. We then test the model on the testing set and report the performance. We train all models five times with different random initializations and calculate the standard deviations. Due to diverse location characteristics, the average hospitalization cases vary from a few to tens of thousands of cases. This large variation in case numbers poses a challenge for the deep learning model to learn stable parameters. We therefore conduct the log transformation on the prediction targets and scale the model predictions back to the original scale, then calculate the performance metrics. We also provide the prediction uncertainty at 90% confidence in all countylevel and statelevel prediction plots using the conformal methods^{20}. The conformal algorithms possess explicit and nonasymptotic guarantees without distributional assumptions or model assumptions so that they can be easily applied to all trained models. The prediction interval α is estimated as:
where \(\hat{y}\) is the average prediction using the five models with different random initializations, \(\sigma\) denotes the standard deviation of five models. The value of parameter \(s\) depends on the required confidence. For example, if we aim to obtain the prediction interval with 90% confidence, we calculate the \(s\) to make the prediction interval \(a\) covers 90% ground truth on the validation set. Then we applied the \(s\) on the test set to calculate the test prediction interval. We provide a more detailed model uncertainty analysis in Supplementary Fig. 4.
We use Python 3.9, PyTorch 1.12^{21}, scikitlearn 1.2, and NumPy 1.19 to collect the data and implement the models. We use the minibatch gradient descent strategy to train the models, and the batch size is set to 128. We use the Adam optimizer with a learning rate of 0.01 for 300 epochs. We save the model with the highest score on the validation set and report the prediction performances on the testing set. All the experiments are done on the server with Intel i913900K CPU, 64 GB RAM, and one NVIDIA RTX 4090 GPU. The HOIST source code is publicly available on GitHub (https://github.com/v1xerunt/HOIST).
Baseline models
We compare HOIST with the following epidemiology and deeplearning methods.

1.
DELPHISEIR^{22}: This is a variant of the SEIR (susceptible, exposed, infectious, and recovered) epidemiology model. Compared with the traditional SEIR model, DELPHISEIR can model hospitalization trends and policy strategies. We use the trust region^{23} optimization strategy to estimate the model’s parameters.

2.
GRU: We input all features into a gated recurrent unit (GRU) model and predict the future number of cases. The GRU model is a variant of RNN, widely applied in multiple pandemic prediction works^{24,25,26}. The hidden dimension of GRU is set to 128.

3.
LSTM: We input all features into a long shortterm memory (LSTM) model and predict the future number of cases. The LSTM model is another variant of RNN. The hidden dimension of LSTM is also set to 128.

4.
ColaGNN^{9}: ColaGNN learns the location graph with sequential data to learn spatial relationships for pandemic progression. The hidden dimension of RNN is set to 128, and the convolution filter dimension is set to 64.

5.
ACTS^{8}: ACTS is a COVID19 forecasting model which uses the interseries attention mechanism to learn spatial relationships between locations. The convolution filter dimension is set to 64, and the segment length is set to 14.

6.
CovidGNN^{27}: CovidGNN uses a graph neural network with skip connections to predict future COVID19 cases. We use a twolayer graph attention network, and the graph network dimension is set to 64.

7.
STAN^{4}: STAN fuses the SIR dynamics into a spatiotemporal prediction model for COVID19 case prediction. The graph network dimension is set to 64, and the hidden dimension of GRU is set to 128. Since the SIR dynamics do not apply to the hospitalization prediction task, we remove the SIR constraints in the STAN model.
All models can access the same data sequences with the same input window and are evaluated on the same testing set. For the spatiotemporal prediction models (i.e., ColaGNN, ACTS, CovidGNN, STAN), we use the static data to build the location graph or calculate the location similarities in their algorithms. For the GRU and LSTM models, we concatenate the static data with the original inputs at each timestep to the model. The SEIR model cannot take the static location data as inputs. All model hyperparameters are decided by using grid search on the validation dataset. For the DELPHISEIR model, we use the deployed version of the DELPHISEIR model (i.e., DELPHISEIR V4) and recommend optimal parameters. To assess the performance improvement from the Ising dynamics, the adaptative connectivity learning, and the realworld evidence, we conduct an ablation study by comparing HOIST against the following ablation versions from both a data perspective and a method perspective:

1.
HOISTVaccination: We remove the vaccination statistics data from the model input sequences of HOIST.

2.
HOISTRisk: We remove the highrisk patient statistics in the realworld claims data from model input sequences of HOIST.

3.
HOISTAC: We remove the adaptative connection learning module from HOIST. The \({{{{{\bf{S}}}}}}\) matrix is learned by calculating the sequence similarities instead of using location background data.

4.
HOISTIsing: We remove the Ising dynamic loss and replace the EF modeling module with a naïve LSTM network in HOIST.
Model performance analysis
We design experiments to answer the following research questions:

1.
How well does HOIST perform in the hospitalization prediction task?

2.
How well does HOIST perform with different lengths of prediction window?

3.
How well does HOIST perform under temporal data split setting?

4.
What is the analysis of the learned external field weights?

5.
How does the HOIST model help to increase vaccination rates effectively?
28day hospitalization prediction performance
Table 1 shows the performance of the 28day hospitalization prediction task. Compared to the best baseline model, HOIST reaches 70% lower MSE and 50% lower MAE. It also achieves an \({R}^{2}\) score of 0.6 and a CCC of 0.89, while the best baseline model achieves 0.16 \({R}^{2}\) and 0.6 CCC. We find ColaGNN and ACTS achieve better prediction performance among the compared models; this may be because both CovidGNN and STAN require a fixed location graph structure as input, while ColaGNN and ACTS can learn the spatial relationships based on the sequence similarity. Compared to the infected case prediction task, the spatial patterns of hospitalization cases may be more complex and thus cannot be predefined using a fixed location graph. Therefore, models that can learn flexible connections commonly outperform all others in this task. By utilizing the locationstatic background data, HOIST can better extract spatial patterns of the pandemic progression. We also find that traditional epidemiology SEIR models fail to predict accurately in this task, probably because the hospitalization curves do not follow a simple increasingdecreasing trend, which is the underlying assumption of most epidemiological models. Additionally, the groundtruth curves may have multiple peaks and complex shortterm variations, which increases the difficulty of the predictive task. Note that when calculating \({R}^{2}\) and CCC scores, the average value \({\mu }_{y}\) used in the denominator is the average value in the test time phase, which is often difficult to beat since the future \({\mu }_{y}\) is unavailable at the prediction time. Therefore, in all experiments, any positive \({R}^{2}\) or high CCC scores can be impressive to achieve.
We conduct the Student’s ttest to evaluate the significance of the performance differences. The results show that the performance differences between HOIST and the best baseline models are significant (p < 0.001). HOIST also outperforms all ablation versions of HOIST by a large margin, validating the effectiveness and necessity of all proposed modules. According to the performance analysis, the Ising dynamics are the most critical component of HOIST. By modeling various influence factors as external fields and using spatiotemporal dynamics to regularize the model learning process, HOIST confirms that the Ising dynamics can improve spatiotemporal predictive performance by more closely resembling the realworld curve. Furthermore, integrating realworld evidence data, such as highrisk patient cohorts and vaccination statistics, can reduce prediction error. Using background data to learn adaptative spatial connectivity in HOIST also enables it to outperform the reduced model, which only uses sequence similarities (i.e., HOISTAC). The adaptative connectivity learning module also helps the HOIST learn clustered spatial embeddings using the demographics data. We provide visualization plots of the learned embeddings in Supplementary Fig. 2. The results show that HOIST can not only extract geographical similarities between locations but also can identify geographically distant but socioeconomically similar locations. We also provide the predicted curve plots for all US states in Supplementary Fig. 1.
Prediction performance with different lengths of prediction window
In this section, we evaluate each model’s longterm and shortterm prediction performance by changing the prediction window \(L\) from 1 to 5 weeks and then train and evaluate all the models with the same strategy. The MAE, MSE, \({R}^{2}\), and CCC of HOIST and the 6 best baseline models are shown in Fig. 2. Due to its poor performance, we exclude the SEIR model.
The results demonstrate that the MSE and MAE of all models increase as the length of the prediction window increases. This is expected since predicting further into the future becomes progressively more challenging. However, HOIST consistently achieves a much lower MAE and MSE than all baseline models. HOIST also achieves consistently high \({R}^{2}\) and CCC scores. Compared to the baseline models, HOIST achieves, on average, a 48% lower MAE, 65% lower MSE, 272% higher \({R}^{2}\), and 51% higher CCC. The results demonstrate that the HOIST model accurately predicts both longterm trends and shortterm variations, enabling broader realworld applications.
Prediction performance under temporal data split setting
New variants of the COVID19 virus can lead to different disease severity. It is nontrivial for a model to accurately predict future hospitalization under such distribution transitions. We design experiments to evaluate how model performance evolves over time. We use a sliding window training setting by using 10 weeks data for training and using the next 4 weeks data for testing. We split the time from Sep 2020 to April 2022 into seven periods, and the model is tested in seven testing time phases. We compare HOIST against the other three best baseline models (i.e., LSTM, ACTS, and ColaGNN). The results are shown in Fig. 3.
The results show that HOIST consistently outperforms all baseline models in terms of MSE and MAE in all testing phases. HOIST consistently achieves low prediction errors and high CCC scores and is less affected by temporal data distribution shifts. We notice that some baseline models experience low prediction performance on Feb 2021 and Jan 2022. This may be due to distribution shifts between training and testing data caused by emerging of new variants of the COVID19 virus. However, HOIST achieves lower prediction errors than baselines on these time phases.
Analysis of learned external fields
Compared with the baseline blackbox models, modeling external fields with Ising dynamics allows us to analyze the importance of different factors for different locations, providing us with more insight into which features are most predictive of hospitalization. In the learned EF parameters \(\hat{{{{{{\bf{f}}}}}}}\), each dimension denotes the weight of the corresponding input factor. To visualize the effect of the number of vaccinations on the hospitalization rate, we plot the corresponding weights of the following vaccination features: (1) the total number of vaccinations, (2) the number of first vaccination administrations, (3) the number of second vaccination administrations, and (4) the number of booster administrations, as shown in Fig. 4.
The negative EF weights occur because the vaccination ratio is negatively correlated with the hospitalization case count. However, the results show that the EF weights of all four features are increasing, meaning the predictiveness of vaccinations is decreasing over time. This is possibly due to the increased prevalence of new variants of the virus, many of which are more resistant to vaccinations, which is consistent with conclusions in recent medical research^{28,29}. However, we can still observe that the booster doses (the purple line) are more effective than the first and second doses (p < 0.05). We also compare the model effect (i.e., the EF weights) of Pfizer (CPT code 0001A ~ 0004A) and Moderna (CPT code 0011A ~ 0013A, 0064A) COVID19 vaccines, and we find that their effects do not have a statistical difference in our model (pvalue 0.1).
We also explore the EF weights of highrisk conditions, which are condition codes defined by the CDC COVID19 People with Certain Medical Conditions guideline^{14} and the Charlson Comorbidity Index^{15}. These features are proven to have significant impacts on COVID19 patients’ outcomes^{30,31}. We find the feature with the highest EF weight is the number of patients older than 65. The top five conditions are renal disease, dementia, immunodeficiency, malignancy, and chronic lung disease. The results are consistent with CDC guidelines, and we do not observe statistically significant differences in these five features.
Predicted hospitalization hotspot map and error analysis
We present the predicted hospitalization hotspot map in Fig. 5. Figure 5a is the hotspot map colored by the number of hospitalization cases and Fig. 5b is colored by the ratio of hospitalization cases over the total population. There are some blank areas in the map in cases of no valid data or zero case counts in these counties. These heatmaps can be accessed in an online visualization platform at https://v1xerunt.github.io/HOIST/.
We find that the spatial patterns of hospitalization are similar in both figures. West coast, Northeast coast, and Midwest areas generally have high hospitalization case counts and perperson occurrence. Interestingly, we also find that some locations—such as the San Diego county (A) and Riverside county (B) along the West coast—have very high case counts but only mediumlevel hospitalization ratios. A possible interpretation could be that the people from these counties tend to travel to Los Angeles for hospitalization.
We also plot a heatmap to show the spatial patterns of the mean average percentage error (MAPE). We only calculate the MAPE for locations where case counts are larger than 100 in the testing phase because a small prediction error can lead to very large MAPE for locations with low counts. The results are shown in Fig. 5c.
We find that HOIST achieves low MAPE for most locations, with only 15 locations having a MAPE greater than 0.5. For these locations, HOIST tends to underestimate the case counts. This may be because there are some pattern changes in the testing time window; for example, in Effingham, Georgia, HOIST has 0.64 MAPE. When we plot the predicted hospitalization curve and the number of infected cases (Fig. 6), we find several peaks in the infected case count curve during both the validation and testing phases, which is quite different from training patterns. These sudden fluctuations in input features may cause HOIST to fail to accurately predict the surge of hospitalization cases during the testing phase. We provide another two case examples for error analysis in Supplementary Figure 3.
Number needed to treat prediction with HOIST
Increasing a community’s vaccination rate is an effective method of reducing both the occurrence and severity of COVID19^{32}. However, there is still uncertainty surrounding how and whether this effectiveness is influenced by local vaccination rates. In other words, we ask the question, “Due to the disproportionately higher vaccination rates in urban areas, do inequalities exist regarding the number of additional vaccinations necessary to reduce a single hospitalization or death?” We also calculate the costeffectiveness of future vaccination efforts based on the countylevel average cost of COVID19 hospitalization. Accurately answering these questions at the county level may allow policymakers and healthcare institutions with limited distribution capabilities and finite medical resources to develop more targeted vaccination efforts.
The high predictive accuracy of HOIST enables us to answer these questions. By changing the vaccination rates in the input features, HOIST can simulate the changes in the predicted hospitalization curve. We use the predicted number of hospitalizations to calculate the number needed to treat (NNT) with respect to how many vaccinations are needed to prevent one hospitalization in each county. For each location, we increase the vaccination rate by 10% (i.e., 10% total population) and calculate the predicted hospitalization case reduction between the original prediction \(\hat{y}\) and the simulated prediction \({\hat{y}}_{{sim}}\). The NNT for hospitalization is calculated as:
where \({{{{{\rm{CER}}}}}}\) denotes the control event rate (original predicted hospitalization rate), \({{{{{\rm{EER}}}}}}\) denotes the experimental event rate (simulated hospitalization rate after the vaccination rate increases by 10%), and \(p\) denotes the population size of the location.
Furthermore, we acknowledge that morbidity and mortality among individuals hospitalized with COVID19 occur disproportionately in certain ethnic and racial minority groups. To address these outcome disparities in our model, we adjust the NNT for each county based on its demographics and their respective risk ratios:
where \({r}_{{{{{{\rm{white}}}}}},{{{{{\rm{nh}}}}}}}\), \({r}_{{{{{{\rm{asian}}}}}},{{{{{\rm{nh}}}}}}}\) and \({r}_{{{{{{\rm{black}}}}}},{{{{{\rm{nh}}}}}}}\) denote the population ratio of White, Asian, and Black nonHispanic persons, and \({r}_{{{{{{\rm{hispanic}}}}}}}\) denotes the population ratio of Hispanic or Latino persons. All countylevel population ratios are extracted from the census data. The risk adjustment factor for each race and ethnicity is based on national COVID19 death risk ratios reported by the CDC^{33,34}. While \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{h}\) represents the predicted number of vaccinations needed to prevent one hospitalization, \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{a}\) is not a direction calculation of NNT for death prevention; instead, its purpose is to illustrate how locations with similar \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{h}\) might be further stratified based on the predicted outcomes of those predicted hospitalizations. The resultant \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{a}\) favors locations with more Black/African American, Hispanic, and Latino populations. Future analytical and predictive efforts incorporating countylevel death rates are warranted.
The heatmaps of \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{h}\) and \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{a}\) are shown in Fig. 7. Note how the count and ratio maps identify hotspots primarily among urban and metropolitan areas. In contrast, the NNT values are generally lowest in the rural region, which are highly clustered in certain states, including North Dakota, South Dakota, Kansas, Nebraska, Montana, etc. Quantitatively, increasing the vaccination ratio by 10% can reduce the number of current hospitalization cases by 15% on average for all locations.
We further calculate the cost ratio between state average vaccination costs and state average COVID19 hospitalization costs. We collect the statelevel average vaccine cost from the Centers for Medicare and Medicaid Services (CMS)^{35} and hospitalization cost for both complex and noncomplex COVID19 inpatients from the COVID19 Cost Tracker^{36}. The cost ratio is then calculated as:
where \({r}_{{{{{{\rm{complex}}}}}}}\) and \({r}_{{{{{{\rm{noncomplex}}}}}}}\) are the average complex and noncomplex COVID19 inpatients ratio extracted from the hospital resource data. \({c}_{{{{{{\rm{complex}}}}}}}\) and \({c}_{{{{{{\rm{noncomplex}}}}}}}\) are hospitalization costs for complex and noncomplex COVID19 inpatients. \({c}_{{{{{{\rm{shot}}}}}}}\) is the cost of one vaccine. All parameters to calculate the \({r}_{{{{{{\rm{cost}}}}}}}\) vary by location.
The cost ratio heatmap is shown in Fig. 7c. A cost ratio greater than 1 indicates that increasing the number of vaccinations in that location by the NNT is less expensive than the cost of a single prevented hospitalization. The heatmap demonstrates a ratio favoring future vaccination efforts in primarily rural regions, most commonly among centrally located and landlocked counties. We list the \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{h}\) and \({{{{{\rm{NN}}}}}}{{{{{{\rm{T}}}}}}}_{a}\) of the top 15 locations with the highest (most favorable) cost ratios in Table 2. The full countylevel table can be accessed in the GitHub repository and the visualization platform. These results suggest three meaningful conclusions. First, even smallscale vaccination efforts in targeted counties are likely to prevent at least one hospitalization and its related sequelae. Second, rural vaccination outreach efforts are likely costsaving endeavors in 368 counties among 43 states (with a cost ratio >1). Third, we find that locations that have high cost ratio also have high adjust ratio. This may indicate large healthcare disparities for different races in these locations. More life could be saved by improving vaccination fairness in these locations.
Discussion
In this work, we propose an Ising dynamicsbased deep learning model, HOIST, for spatiotemporal COVID19 hospitalization prediction. The HOIST model is built with multiple data sources, including disease and vaccination statistics from realworld medical claims, medical resource usage, census, and geographical and mobility data. HOIST can handle complex innerregion influence factors and adaptively learn the interregional relationships without requiring a predefined location graph with fixed edges. By drawing the analogy between locations and lattice sites, we use the Ising dynamics as a regularizer to guide the model learning and prediction process, which is a natural choice to model the innerregion factors and interregion similarities simultaneously.
We evaluate the HOIST model on the largescale spatiotemporal COVID19 hospitalization prediction task for 2299 counties in the United States. In the 4week hospitalization prediction task, HOIST achieves 368.7 MAE, 0.6 \({R}^{2}\) and 0.89 CCC scores. For different lengths of prediction window from 1 to 5 weeks, HOIST achieves 48% lower MAE, 65% lower MSE, 272% higher \({R}^{2}\) and 51% higher CCC on average compared to the best baseline models. We conducted a detailed analysis of the model results and learned parameters. We find that the booster shot of vaccination population percentage has a more significant negative correlation to future hospitalization cases than the first and second vaccination shots. We also find that the effects of two major vaccination brands, Pfizer and Moderna, have no statistical difference in our model.
We note that, in contrast to many previous models, our HOISTbased clinical and economic predictions suggest a need to prioritize future vaccination efforts in rural areas over urban centers. The results indicate that increasing the total vaccination ratio by 10% can make the number of current hospitalization cases reduce by 15% on average for all locations. We believe these predictions accurately reflect the growing disparity in vaccination rates between these communities. Additionally, we note a positive correlation between vaccination rate and NNT; in fact, the adjusted NNT map appears strikingly similar to maps depicting percentages of fully vaccinated residents by county^{37}. We surmise that communities with lowerthanaverage proportions of vaccinated individuals are more likely to benefit from further vaccination efforts, irrespective of rurality. For 368 counties among 43 states, increasing the vaccination ratio in these counties can significantly reduce the overall healthcare costs for COVID19 patients. We hope these results and our model can inform clinicians, healthcare institutions, and policymakers to improve their decisions and ultimately reduce the negative economic and health impacts caused by the pandemic.
This research is not without limitations. In the data collection process, the number of hospitalization cases may have bias. We define a hospitalization case as a patient hospitalized within 35 days after the COVID19 diagnosis. We extract these statistics from the claim dataset, so we cannot know if COVID19 is the primary cause of hospitalization. Though this definition is the same as existing COVID19 prediction works^{18,38}, the model may conflate hospitalization trends due to other causes with those accurately attributed to COVID19. This is especially true in the pediatric population, where COVID19 infectivity is high but severe cases are uncommon. Despite these limitations, we still select claims data as our primary data source as it provides valuable granular countylevel statistics.
The second limitation concerns data preprocessing. The original data distribution of hospitalization cases is highly skewed. Our dataset is much larger than previous research; over 80% of locations have fewer than 1000 hospitalization cases in the 28day prediction window, and only 2% have more than 10,000 hospitalization cases. To maintain the stability of the machine learning models, we conduct a log transformation on the prediction target, followed by a zscore transformation to normalize the sequence. Therefore, model prediction results require two reverse transformations to return to the original scale. This transformation may cause a small error in the prediction scale to grow exponentially as it converts to the original scale. As a result, we observe that baseline models sometimes have abnormally large prediction errors and standard deviations (e.g., over 10,000 MAE), especially when the prediction window is long. We manually remove these outliers from the performance table. Though we do not find these issues in HOIST, which further demonstrates the stability of the model, we still believe this is an open research challenge in the largescale spatiotemporal prediction work. The data scale issue might be solved by proposing more advanced preprocessing techniques or new scaleinvariant models.
A third limitation exists at the methodology level. Though the spatial connectivity learning module is more flexible than previous works requiring fixed graph structures, the learned connectivity does not change with time. In realworld scenarios, spatial connectivity may be affected by several factors, including weather, holidays, and travel restrictions. Therefore, future works may include integrating more data sources, such as realtime mobility data between locations, to further improve the extracted spatial patterns, interpretability analysis on the feature contributions, and more granular age information. Besides, the Ising dynamics maybe not be the only choice to model the spatial and social background factors for hospitalization prediction. The reason we choose the Ising dynamics is that it can incorporate both interregion spatial relationships and innerregion factors and can handle these complex factors compared to naïve SIR and SEIR models. We are inspired by previous sociology studies that use Ising models for human behavior tendencies^{39}. Though the Ising dynamics show good prediction performances in HOIST, we hope our exploratory work can provide more inspiration for future studies using other realworld dynamics.
Methods
We follow the recommendations set out in the Global Code of Conduct for Research in ResourcePoor Settings when designing, executing, and reporting the research, and this research does not use individuallevel data. Our study complies with the recommendations of the GATHER statement.
Background
Machine learning and deep learning models have been widely applied in pandemic predictions. Statistical epidemic prediction models, including SIR, SEIR, and their variants, have achieved some success in infection, hospitalization, and mortality prediction tasks^{25,40,41}. To extract complex temporal patterns, some works have applied the recurrent neural network (RNN) and its variants, such as long shortterm memory network (LSTM) and gated recurrent unit network (GRU), to predict future infected and hospitalization cases^{24,25,26}.
To further improve the spatiotemporal prediction performance, a significant line of research focuses on extracting and utilizing spatial dependencies. Graph neural networks (GNN)^{4,27,42,43} and metapopulation analysis models^{44,45,46} have achieved remarkable success in solving this issue. In these works, counties and states are modeled as nodes in the graph, and the edges are defined using geographical and sociological similarities or mobility scores. By combining the GNNs and RNNs, these models can extract spatiotemporal patterns in the data and make better predictions. However, most of these models rely on predefined location graphs. These manually defined location graphs may differ greatly from the groundtruth interregional relationships and spatial pandemic transmission patterns. Besides, some metapopulation models are only applicable for infectious diseases since the underlying assumption is the population flow, which makes it difficult to extract other spatial patterns such as hospitalizations. Some works aim to solve this issue by adaptively learning the connection weights between locations. Jin et al.^{8} and Deng et al.^{9} utilize the attention mechanism to predict future infected cases in pandemics. Their work learns the connection weights based on the similarities of historical case curves at different locations but largely ignores the underlying social, economic, medical, and geographical similarities, which may have a profound impact on the interlocation relationships. Therefore, further exploration is needed to discern how best to model spatial dependencies with multisource background data in a flexible way.
Another line of research focuses on integrating realworld disease transmission dynamics into deep learning models. Though deep learning networks can extract complex temporal patterns, they can only predict known data patterns and thus have worse longterm prediction accuracy. Integrating epidemiology models such as SIR and SEIR can help deep learning models predict curves resembling realworld transmission patterns. Gao et al.^{4} used an SIR model as a regularizer term in the loss function to help the GATGRU model predict countylevel COVID19 infection case numbers in the United States. Storlie et al.^{6} proposed a stochastic SIR model to predict future COVID19 cases. Though these models can be applied in the COVID19 hospitalization prediction, their main approach involves dividing the infected population into hospitalized and nonhospitalized cohorts and then modifying the transmission equation accordingly. However, as mentioned in the introduction, the relatively few parameters in most epidemiological models may not fully describe the influences of complex underlying disease distribution, viral variants, and vaccination effects. Additionally, these epidemiological models are not designed for general spatiotemporal predictions and thus cannot extract and utilize interregional interactions. Therefore, integrating appropriate realworld dynamics that can naturally model spatiotemporal hospitalizations may improve the prediction performance. The Ising model is a statistical mechanics model for site spin configuration estimations; it has been applied in sociology research to model social behavior by considering each individual as a site^{39}. In this work, instead of directly using statistical methods to estimate the Ising parameters in a lowdimensional and staticvariableonly dataset, we combine the Ising dynamics with deep learning techniques to model the spatiotemporal pandemic hospitalization patterns from complex, highdimensional multisource data.
Learning adaptative connectivity in the latent space
Pandemic progression patterns have spatial similarity depending on the underlying population, economic, geographic, and mobility factors in various locations. Recent works suggest that utilizing these spatial similarities can guide the model to better extract progression patterns and make more accurate predictions^{4,27,42,43}. In this work, instead of predefining a connected location graph with fixed edges, we aim to learn the adaptative connectivity between locations. Concretely, we measure the latent distances between locations from multiple perspectives from the static data and then normalize the distances to similarities.
In HOIST, each location will be embedded in a multidimensional latent space. The latent space is created by considering the set of sociodemographic variables as dimensions so that each location will have a unique position, and locations that are similar in these sociodemographic factors will be embedded closely in the space (also known as the Blau space in sociology study)^{47}. To calculate the distance of two locations in the latent space, previous works directly use the sum of Euclidean distance over all dimensions^{39}. This is applicable in smallscale settings, but our setting is largescale and highdimensional, containing all counties in the United States. The scale differences in some factors are substantial (e.g., population size and geographic distance). Using Euclidean distance may lead to biased connectivity. Furthermore, we also aim to capture the complex nonlinear similarity, which is difficult for scaleinvariant distance metrics such as Mahalanobis distance. In this work, we calculate the distances using the graph attention mechanism^{48}. Concretely, for location \(i\) and \(j\), the latent distance \({l}_{{ij}}\) is calculated as:
where \({{{{{{\bf{k}}}}}}}_{i}\) denotes the \(i\)th row vector of the \({{{{{\bf{K}}}}}}\) matrix (i.e., the population demographics \({{{{{\bf{M}}}}}}\) and the economics and healthcare statistics \({{{{{\bf{E}}}}}}\)), \({d}_{{ij}}\) and \({g}_{{ij}}\) denote the value at \(i\)th row and \(j\)th column in the matrices \({{{{{\bf{D}}}}}}\) and \({{{{{\bf{G}}}}}}\) (i.e., the distance between location \(i\) and \(j\)), \({{{{{{{\bf{W}}}}}}}_{K,2}{{{{{\boldsymbol{,}}}}}}{{{{{\bf{W}}}}}}}_{K,1},\,{{{{{{\bf{W}}}}}}}_{D,1}{{{{{\boldsymbol{,}}}}}}\,{{{{{{\bf{W}}}}}}}_{D,2}\) are weight matrices, \(\sigma\) denotes the sigmoid activation function, and \((\cdot {}\cdot)\) denotes the concatenation operation. Using this attention mechanism, the model can adaptatively learn the distance between two locations from multiple perspectives, including demographics, populations, economics, healthcare, geographic distance, and mobility. The distances are further normalized to obtain the similarity \({s}_{{ij}}\) as:
where \({s}_{{ij}}\) indicates the normalized adaptative similarity between location \(i\) and \(j\) learned by the model. The higher the \({s}_{{ij}}\), the more similar the two locations. The similarity scores of all locations formulate the matrix \({{{{{\bf{S}}}}}}{{\in }}{{\mathbb{R}}}^{N\times N}\) which is used to inform how the model utilizes the learned spatial connectivity in the following sections.
Modeling the external fields in Ising dynamics
In the Ising model, the energy of a spin configuration \({{{{{\bf{y}}}}}}\) of lattice sites is given by the Ising Hamiltonian \(I\), which takes the form:
where \({s}_{{ij}}\) denotes the interaction between two sites \(i\) and \(j\), and \({e}_{i}\) denotes the external field (EF) interacting with site \(i\). The Ising Hamiltonian \(I\) can be considered as the sum of the energy related to the interactions between sites and the energy related to the external field. This Ising Hamiltonian form is applied to capture pandemic hospitalization patterns. First, the hospitalization patterns are highly correlated with the locationspecific distributions on comorbidity, vaccination, and hospital resource usage. This can be modeled as the first term in the Ising Hamiltonian \({e}_{i}{y}_{i}\). Second, the hospitalization patterns are highly correlated across locations depending on the underlying socioeconomic factors. The second term in the Ising Hamiltonian matches location correlation. We propose to use the Ising dynamics to guide the model learning process in this manner.
To fuse this dynamic equation into the HOIST model, the initial step is to model the external fields using Ising dynamics. More specifically, we model the external fields of a specific location from multiple perspectives. The idea is to utilize historical statistics to infer future hospitalization cases. For example, if the number of infected cases among highrisk cohorts has risen in the past few months, the number of hospitalization cases is also likely to rise soon. This temporal delay is expected since many infected patients, even those at high risk for critical illness, do not develop severe manifestations immediately. Of course, we also take vaccinations into consideration; as the number of immunized individuals increases, the hospitalization rate decreases. All sequence vectors are flattened into vectors in a historical window \(\tau\) instead of just the values at timestep \(t\) (e.g., \({{{{{{\bf{u}}}}}}}_{t}^{{\prime} }=[{{{{{{\bf{u}}}}}}}_{t\tau+1},{{{{{{\bf{u}}}}}}}_{t\tau+2},\ldots,{{{{{{\bf{u}}}}}}}_{t}]\)). Likewise, we create the historical new infected case sequence \({{{{{\bf{i}}}}}}^{\prime} _{t}\), claim statistics sequence \({{{{{{\bf{c}}}}}}}_{t}^{{\prime} }\), vaccination statistics \({{{{{{\bf{v}}}}}}}_{t}^{{\prime} }\), and medical resource usage sequence \({{{{{{\bf{u}}}}}}}_{t}^{{\prime} }\) at timestep \(t\). Note that we omit the location index \(i\) in this section because we are only discussing one specific location. The parameters of external fields are modeled as:
where \(\phi :\,{{\mathbb{R}}}^{F}\to {{\mathbb{R}}}^{F}\) denotes the projection function to generate scaling factors for each sequential factor. By multiplying the scaling factors and the sequential factors, the final predictions are calculated as:
Here \({{{{{\rm{LSTM}}}}}}(\cdot)\) denotes a multilayer perceptron to capture temporal progressions, the \((\cdot)\) operation denotes the Hadamard product and \((\cdot {}\cdot)\) denotes the concatenation operation. This process is essentially learning weights (i.e., \({{{{{{\bf{f}}}}}}}_{I}\), \({{{{{{\bf{f}}}}}}}_{C}\), \({{{{{{\bf{f}}}}}}}_{V}\) and \({{{{{{\bf{f}}}}}}}_{U}\)) for different influence factors. The larger the weight, the greater the influence of the corresponding factors on the prediction result. This is also why we added the negative symbol before \({{{{{{\bf{f}}}}}}}_{V}\), since the vaccinations should be negatively correlated with hospitalization cases. By concatenating all the scaling factors, we obtain the EF strength vector \(\hat{{{{{{\bf{f}}}}}}}\). We further use an LSTM network to model the temporal progression of the EF.
To generate the final predictions for location \(i\) (here, we bring back the location subscript as \({\hat{{{{{{\bf{h}}}}}}}}_{i,t}\)), we calculate the weighted sum of the final hidden state from the LSTM network from other locations. We use the adaptative connectivity \({{{{{\bf{S}}}}}}\) as the connection weight as:
In this way, the final prediction result \({\hat{y}}_{i,t}\) considers not only the temporal progression of the pandemic and influence factors (Eq. 3) but also the relevant spatial relationships (Eqs. 2 and 4).
Endtoend learning with the spatiotemporal Ising loss function
Our loss function consists of a prediction loss and an Ising dynamics constraint loss. First, the prediction loss is calculated by using the mean square error:
The second loss term is the Ising constraint loss \({{{{{{\mathscr{L}}}}}}}_{d}\), which takes the form:
The term \({e}_{i}\) denotes the summation over all dimensions of \({\hat{{{{{{\bf{f}}}}}}}}_{i}\), and \({\hat{f}}_{i,k}\) denotes the \(k\)th dimension in the \({\hat{{{{{{\bf{f}}}}}}}}_{i}\) vector, which turns the EF vector into a scalar, denoting the EF strength. \({s}_{{ij}}\) is the value at the \(i\)th row, \(j\)th column in the \({{{{{\bf{S}}}}}}\) matrix, which is the similarity score between location \(i\) and \(j\). \({{{{{{\mathscr{L}}}}}}}_{d}\) is essentially another mean square error loss between the groundtruth value and the estimated value using the Ising dynamics (i.e., \({\hat{y}}_{i,t}^{d}\)). In the original Ising dynamics setting, the system’s energy should be as low as possible to keep the system stable. Here, we use the same idea to reduce the prediction error \({{{{{{\mathscr{L}}}}}}}_{d}\) as part of our loss function. It is worth noting that \({\hat{y}}_{i,t}\) is the final prediction result and \({\hat{y}}_{i,t}^{d}\) is only used as an auxiliary output to optimize the Ising constraint parameters. The final loss takes the form:
By optimizing the Ising constraint loss and the prediction loss together, the model can extract the temporal hospitalization progression patterns as well as optimize the spatial adaptative connectivity. Using the Ising dynamics as a constraint instead of directly using \({\hat{y}}^{d}\) as the prediction result and only optimizing \({{{{{{\mathscr{L}}}}}}}_{d}\) allows the model to have more flexibility to learn spatiotemporal patterns from highdimensional data.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The mobility scores are collected from the Multiscale Dynamic Human Mobility Flow Dataset^{11} (https://github.com/GeoDS/COVID19USFlows). The census features are collected from the countylevel census dataset provided by the US Countylevel Dataset^{12} (https://github.com/JieYingWu/COVID19_US_Countylevel_Summaries). The daily new cases are collected from the Johns Hopkins COVID19 Data Repository^{13} (https://github.com/CSSEGISandData/COVID19). The medical resource usage statistics are collected from HealthData.gov^{17} (https://healthdata.gov/Hospital/COVID19ReportedPatientImpactandHospitalCapa/anagcw7u). All the processed data are available at https://github.com/v1xerunt/HOIST. Our model can be trained without using claims data. Source data of tables and figures are provided with this paper. Source data are provided with this paper.
Code availability
The codes for model construction, training, and inference used in this paper are publicly available at https://github.com/v1xerunt/HOIST. The visualization results are available at https://v1xerunt.github.io/HOIST/.
References
Centers for Disease Control and Prevention. CDC COVID Data Tracker. https://covid.cdc.gov/coviddatatracker/#datatrackerhome (2022).
Hassan, E. M. & Mahmoud, H. N. Impact of multiple waves of COVID19 on healthcare networks in the United States. PLoS ONE 16, e0247463 (2021).
Centers for Disease Control and Prevention. Disparities in COVID19 Vaccination Coverage Between Urban and Rural Counties. https://www.cdc.gov/mmwr/volumes/71/wr/mm7109a2.htm (2022).
Gao, J. et al. STAN: spatiotemporal attention network for pandemic prediction using realworld evidence. J. Am. Med. Inf. Assoc. 28, 733–743 (2021).
Kargas, N. et al. Stelar: spatiotemporal tensor factorization with latent epidemiological regularization. In Proc. 2021 AAAI Conference on Artificial Intelligence, Vol. 35, 4830–4837 (AAAI, 2021).
Storlie, C. B. et al. A hierarchical Bayesian model for stochastic spatiotemporal SIR modeling and prediction of COVID19 cases and hospitalizations. Preprint at https://arxiv.org/abs/2104.04033 (2021).
Fan, C. et al. Parameter estimation for the SEIR model using recurrent nets. Preprint at https://arxiv.org/abs/2105.14524 (2021).
Jin, X., Wang, Y.X. & Yan, X. Interseries attention model for COVID19 forecasting. In Proc. 2021 SIAM International Conference on Data Mining (SDM), 495–503 (SIAM, 2021).
Deng, S., Wang, S., Rangwala, H., Wang, L. & Ning, Y. ColaGNN: crosslocation attention based graph neural networks for longterm ILI prediction. In Proc. 29th ACM International Conference on Information & Knowledge Management, 245–254 (ACM, 2020).
Cipra, B. A. An introduction to the Ising model. Am. Math. Mon. 94, 937–959 (1987).
Kang, Y. et al. Multiscale dynamic human mobility flow dataset in the US during the COVID19 epidemic. Sci. Data 7, 1–13 (2020).
Killeen, B. D. et al. A countylevel dataset for informing the United States’ response to COVID19. Preprint at https://arxiv.org/abs/2004.00756 (2020).
Dong, E., Du, H. & Gardner, L. An interactive webbased dashboard to track COVID19 in real time. Lancet Infect. Dis. 20, 533–534 (2020).
Centers for Disease Control and Prevention. People with Certain Medical Conditions. https://www.cdc.gov/coronavirus/2019ncov/needextraprecautions/peoplewithmedicalconditions.html (2022).
Sundararajan, V. et al. New ICD10 version of the Charlson comorbidity index predicted inhospital mortality. J. Clin. Epidemiol. 57, 1288–1294 (2004).
IQVIA. Harness the power of Real World Data. https://www.iqvia.com/solutions/realworldevidence/realworlddataandinsights (2022).
HealthData. COVID19 Reported Patient Impact and Hospital Capacity by State. https://healthdata.gov/dataset/COVID19ReportedPatientImpactandHospitalCapa/6xf2c3ie/data (2022).
Gao, J. et al. MedML: fusing medical knowledge and machine learning models for early pediatric COVID19 hospitalization and severity prediction. Iscience 25, 104970 (2022).
The BARDA Community Challenge. BARDA Community Challenge—Pediatric COVID19 Data Challenge. https://www.synapse.org/#!Synapse:syn25875374/wiki/ (2022).
Lin, Z., Trivedi, S. & Sun, J. Conformal prediction intervals with temporal dependence. Preprint at https://arxiv.org/abs/2205.12940 (2022).
Paszke, A. et al. Pytorch: an imperative style, highperformance deep learning library. Adv. Neural Inf. Process. Syst. 32, 8026–8037 (2019).
Li, M. L. et al. Forecasting COVID19 and analyzing the effect of government interventions. Oper. Res. 71, 184–201 (2022).
Byrd, R. H., Gilbert, J. C. & Nocedal, J. A trust region method based on interior point techniques for nonlinear programming. Math. Program. 89, 149–185 (2000).
Kavouras, I. et al. COVID19 spatiotemporal evolution using deep learning at a European level. Sensors 22, 3658 (2022).
Yang, Z. et al. Modified SEIR and AI prediction of the epidemics trend of COVID19 in China under public health interventions. J. Thorac. Dis. 12, 165 (2020).
Nikparvar, B., Rahman, M., Hatami, F. & Thill, J.C. Spatiotemporal prediction of the COVID19 pandemic in US counties: modeling with a deep LSTM neural network. Sci. Rep. 11, 1–12 (2021).
Kapoor, A. et al. Examining COVID19 forecasting using spatiotemporal graph neural networks. Preprint at https://arxiv.org/abs/2007.03113 (2020).
Callaway, E. Fastspreading COVID variant can elude immune responses. Nature 589, 500–502 (2021).
Eyre, D. W. et al. Effect of COVID19 vaccination on transmission of alpha and delta variants. N. Engl. J. Med. 386, 744–756 (2022).
Kuswardhani, R. T. et al. Charlson comorbidity index and a composite of poor outcomes in COVID19 patients: a systematic review and metaanalysis. Diabetes Metab. Syndr. 14, 2103–2109 (2020).
Christensen, D. M. et al. Charlson comorbidity index score and risk of severe outcome and death in Danish COVID19 patients. J. Gen. Intern. Med. 35, 2801–2803 (2020).
Tenforde, M. W. et al. Association between mRNA vaccination and COVID19 hospitalization and disease severity. J. Am. Med. Assoc. 326, 2043–2054 (2021).
Centers for Disease Control and Prevention. Risk for COVID19 Infection, Hospitalization, and Death By Age Group. https://www.cdc.gov/coronavirus/2019ncov/coviddata/investigationsdiscovery/hospitalizationdeathbyage.html (2022).
Centers for Disease Control and Prevention. Risk for COVID19 Infection, Hospitalization, and Death By Race/Ethnicity. https://www.cdc.gov/coronavirus/2019ncov/coviddata/investigationsdiscovery/hospitalizationdeathbyraceethnicity.html (2022).
Centers for Medicare and Medicaid Services. Medicare COVID19 Vaccine Shot Payment. https://www.cms.gov/medicare/covid19/medicarecovid19vaccineshotpayment (2022).
FAIRHealth. COVID19 Cost Tracker. https://www.fairhealth.org/statesbythenumbers/covid19heatmap (2022).
The New York Times. COVID19 Vaccinations: County and State Tracker. https://www.nytimes.com/interactive/2020/us/covid19vaccinedoses.html (2022).
Bennett, T. D. et al. Clinical characterization and prediction of clinical severity of SARSCoV2 infection among US adults using data from the US National COVID Cohort Collaborative. JAMA Netw. Open 4, e2116901 (2021).
GodoyLorite, A. & Jones, N. S. Inference and influence of network structure using snapshot social behavior without network data. Sci. Adv. 7, eabb8762 (2021).
Pei, S. & Shaman, J. Initial simulation of SARSCoV2 spread and intervention effects in the continental US. Preprint at medRxiv https://doi.org/10.1101/2020.03.21.20040303 (2020).
Albani, V. V., Velho, R. M. & Zubelli, J. P. Estimating, monitoring, and forecasting COVID19 epidemics: a spatiotemporal approach applied to NYC data. Sci. Rep. 11, 1–15 (2021).
Panagopoulos, G., Nikolentzos, G. & Vazirgiannis, M. Transfer graph neural networks for pandemic forecasting. in Proc. AAAI Conference on Artificial Intelligence, Vol. 35, 4838–4845 (AAAI, 2021).
Skianis, K., Nikolentzos, G., Gallix, B., Thiebaut, R. & Exarchakis, G. Predicting COVID19 positivity and hospitalization with multiscale graph neural networks. Sci. Rep. 13, 5235 (2023).
Colizza, V. & Vespignani, A. Epidemic modeling in metapopulation systems with heterogeneous coupling pattern: theory and simulations. J. Theor. Biol. 251, 450–467 (2008).
Calvetti, D., Hoover, A. P., Rose, J. & Somersalo, E. Metapopulation network models for understanding, predicting, and managing the coronavirus disease COVID19. Front. Phys. 8, 261 (2020).
Coletti, P. et al. A datadriven metapopulation model for the Belgian COVID19 epidemic: assessing the impact of lockdown and exit strategies. BMC Infect. Dis. 21, 1–12 (2021).
McPherson, M. A Blau space primer: prolegomenon to an ecology of affiliation. Ind. Corp. Change 13, 263–280 (2004).
Veličković, P. et al. Graph attention networks. Preprint at https://arxiv.org/abs/1710.10903 (2017).
Acknowledgements
This work was supported by NSF awards SCH2205289, SCH2014438, IIS2034479, NIH award R01 1R01NS10729101, IQVIA, and OSF Healthcare. J.G. acknowledges the receipt of studentship awards from the Health Data Research UKThe Alan Turing Institute Wellcome PhD Programme in Health Data Science (Grant Ref: 218529/Z/19/Z). We thank Zhen Lin from the University of Illinois Urbana Champaign for assisting with the model uncertainty estimation. Icons in the figures are designed using resources from Flaticon.com. For the purpose of open access, the author has applied a Creative Commons Attribution (CC BY) license to any authoraccepted manuscript version arising from this submission.
Author information
Authors and Affiliations
Contributions
J.G. developed the method and conducted all the experiments. C.M. and L.G. provided the datasets. J.H. managed the project meetings. A.C. provided the clinical assessment. A.C. and J.S. coled the overall project. All authors (J.G., J.H., C.M., L.G., A.C., and J.S.) participated in paper writing.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gao, J., Heintz, J., Mack, C. et al. Evidencedriven spatiotemporal COVID19 hospitalization prediction with Ising dynamics. Nat Commun 14, 3093 (2023). https://doi.org/10.1038/s41467023387563
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467023387563
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.