Introduction

The COVID-19 virus has caused a worldwide pandemic with progressive cases and fatalities. Not only has the United States been hard hit by the pandemic (as of this publication it was reporting the highest number of deaths and cases on a per country basis), but it is also leading many research efforts to combat the virus1. A key component of many of these research studies focused on evaluating relationships between COVID-19 mortality and numerous variables such as pre-existing conditions and age2. This renewed interest in public health and epidemiological studies has led researchers to identify several socioeconomic factors that could be correlated to prevalence and fatality of COVID-19. Specifically, the socioeconomic variables that had the highest risk ratios for both infection and fatality due to COVID-19 were low education and black ethnicity even when compared to unemployment, income, lack of health insurance, COPD, diabetes, and obesity3. Considering a country’s social and demographic attributes is essential when modeling factors influencing the pandemic. A country’s current health expenditure as a function of GDP4 and income inequality within a country5 are also linked to COVID-19 spread and vulnerability. While many socioeconomic factors have been investigated to frame evidence-based recommendations in our fight against COVID-19, one factor that has been recently, but only superficially analyzed is altitude. An observational and modeling study detected an association between higher altitude with decreased infection, ultimately recommending a deeper analysis of this idea6. Furthermore, two additional studies have been conducted supporting the idea that increased altitude may decrease the pathogenesis of COVID-196,7,8.

The link between altitude and pathology is not a new revelation. Numerous studies have looked into the effect of moderate altitude gain with respect to fatality in other pathologies, including lowering of overall mortality in cardiovascular and cerebrovascular diseases9. Also, higher altitudes have been identified as a risk factor for undiagnosed COPD10 and lung cancer11. These generalized associations suggest that altitude is likely to have some effect on COVID-19 etiology. Temperature is another geographic variable commonly evaluated with altitude that was initially thought to influence the spread of COVID-1912. The results concerning temperature are mixed, with a pair of studies12,13 showing higher temperatures slowing the contagion rates, and others showing temperature having a weak correlation5,14 or varied effect of COVID-19 spread depending on altitude15. Etiological factors can be related to the physiological adaptation often seen in response to chronic exposure to higher altitudes. Such exposure may result in increased lung diffusion capacity, increased capillary blood volume, decreased blood pressure, polycythemia, and increased O2 carrying capacity of blood16. The positive effects of altitude on lung function could be an indication of alterations in the expression level of Angiotensin-converting enzyme 2 (ACE2), which is the primary route of entry for the COVID-19 virus into cells. In addition, it seems to play a pivotal role in acute lung injury and other organ damage17. Therefore, it is a reasonable hypothesis that these physiological responses could positively impact the defense against COVID-19, ultimately reducing fatality rates in moderate-high altitude populations. There have been studies early on in the pandemic evaluating this altitude’s effect on fatality rates with an active COVID-19 infection, however these studies did not consider any sociodemographic covariates18,19, or were done very early in the pandemic without an equal distribution of active COVID-19 infections20,21,22.

The association of higher altitude to decreased COVID-19 fatality rates has not been evaluated in the context of comorbidity risk factors (e.g., COPD, CHD, obesity, and smoking) or socioeconomic risk factors (e.g., gender, age, ethnicity, education, and income), which are known to be associated to COVID-19 fatalities. We revisit this premise by presenting this study that evaluates the effect of altitude on COVID-19 fatality rates while simultaneously considering confounding effects of comorbidities and demographic risk factors. The study was performed using county aggregated metrics for states located in the Mountain region of the United States, this region includes approximately 25 million people which is close to 8% on the total population in the country. We focus on this region as it provides us with the required dynamic representation of altitude as all eight states in this region have a wide range of altitude. Our study provides further information that connects COVID-19 to etiological factors, leading to advanced management strategies to reduce fatality rates and improve outcomes.

Methods

Study design

The study was designed to focus on state and regional trends between associations of COVID-19 fatalities over one year (March 2020 to March 2021) to altitude while considering the effect of comorbidity rates and demographic factors. To achieve this, a statistical modeling approach that used county-wide aggregated data to elucidate associations was developed and adjusted to population density. The study focused on the Mountain region of the western United States; this region contains the US Rocky Mountains, which is the major mountain range in western North America and is a sub region recognized by the US Census Bureau. All states included in the Mountain region are presented in Fig. 1. All data used in this study was obtained from previously published sources or public databases without the use of any specific identified data. All methods were performed in accordance with relevant guidelines and regulations.

Figure 1
figure 1

States included in this study. These states are defined as the Mountain State Region as recognized by the United States Census Bureau.

Altitude, comorbidity, demographics, and COVID-19 fatality data

County altitude data was obtained from a previously published dataset evaluating lung cancer incidence11. Altitude data from that study was a population-weighted mean average, which was calculated for all counties in the United States by subdividing each county into a census block, then determining the weighted average of the block's altitude. This method is argued by the authors to be a better estimate of an individual’s exposure because it accounts for the geographic distribution of population density. The COVID-19 fatality rate for each individual county was obtained from each state’s official COVID-19 website (resources available as Supplementary File 1). The data in this study records all COVID-19 cases and deaths reported on each of the state’s official websites as of March 14th, 2021. Our study used only official reported values and did not attempt to recalculate or considered possible COVID-19 fatalities not officially defined. Altitude and COVID-19 county data averaged by state and Mountain region are presented in Table 1.

Table 1 Mean and standard deviations for county altitude and COVID-19 cases, deaths and case fatality by state and by mountain region. Fatality rate is expressed as the fatal COVID-19 cases/Total number of cases for each county.

Comorbidity data by county for obesity, chronic obstructive pulmonary disease (COPD), coronary heart disease (CHD), asthmas, diabetes, smoking, and chronic kidney disease (CKD) was obtained from the Population Level Analysis and Community Estimates (PLACES) project which is sponsored by the Centers for Disease Control and Prevention23. Comorbidities were generated using the results from the 2018 Behavioral Risk Factor Surveillance System (BRFSS). Demographic data used for this study was obtained from the American Community Survey (ACS) 2019, 5-year subject tables from the United States Census Bureau24. Comorbidity and demographic factor data averaged by state and Mountain region in this study are presented in Supplementary Table 1.

Statistical analysis

This is study used Nonparametric Regression models applied through a Generalized Additive Models (GAMs) approach. GAMs were selected because they can address deviations from normality, which are a limitation of Generalized Linear Models (GLMs)25,26. These models use non parametric smoothing splines that a linked to the predictor evaluated27. The models evaluate associations between COVID-19 fatality rate, which was set as the dependent variable, to altitude, comorbidity, and demographic factor effects, which were set as independent variables. Each of the evaluations was performed by state and all models are adjusted to population density. Regional analysis (as a meta-analysis) was done independently for each variable. All models used the individual county as the experimental unit. Dependent variables were introduced into the model as non-parametric smoothing splines with 3 degrees of freedom. GAM used were as follows:

$${{COVID}}{\text{-}} {19}\;{{Fatality}}\;{{rate}}_{{{\text{ijkl}}}} = \beta_{0} + s_{i} \;(Population\;density_{i} ) + s_{j} \;(Altitude_{j} ) + s_{k} \;(Covariates_{k} ) + \varepsilon_{ijkl}$$

where COVID-19 Fatality rateijkl is the dependent variable measured on the individual county. β0 is the intercept. si, sj and sk are smoothing functions estimated in a non-parametric fashion. Population densityi, Altitudej, and Covariatesk are the specific population density, altitude and covariates (sociodemographic and comorbidities) at the individual county. εijkl is the random error inherent to each measurement (residuals) which is assumed to be independent of other observations. In summary, the generalized GAM equation is very similar to the GLM but adds spline functions that models internally each independent parameter included in the model. All models assumed Gaussian distribution for the residuals, which were corroborated through preliminary residual plots. All non-parametric modeling was performed using PROC GAM in SAS/STAT v.9.4 (SAS Inc., Cary, NC). Modeling effects were estimated as Risk Ratios with 95% confidence intervals; effect directions were determined by the estimate coefficient sign. A negative coefficient would reduce the fatality rate while a positive one would increase the rate. Standardization of coefficients was done using a normally distributed z-score transformation. Descriptive statistics were calculated by state and by region using PROC MEANS in SAS/STAT. All significant associations presented considered a two tailed test. Bonferroni level significance was declared per family of tests which is disclosed in each figure. The code and data used in the analysis is available as supplemental material.

Results

County aggregated data analyzed with a Nonparametric Regression approach though Generalized Additive Models adjusted by population density was used to evaluate the association between COVID-19 fatality rate to altitude, comorbidities, and demographic factors. The outcomes of this modeling approach are presented in Fig. 2. Between the states within the Mountain region, some association with altitude and fatality rate (Colorado, Idaho, and Wyoming) was observed, demonstrating higher altitude lowering the COVID-19 fatality rate. This association was also observed for the whole Mountain region when analyzed as a meta-analysis. Higher comorbidity rates were associated with higher COVID-19 fatality rates within states (except for Montana and Nevada) and for the Mountain region collectively. Associations significant to a Bonferroni level were only detected for the Mountain region. The states of Arizona, Colorado, Idaho, and Wyoming displayed an association to income; in all cases a higher median income was associated with lower COVID-19 fatality rate. This trend was also observed for the Mountain region significant at a Bonferroni confidence level. Associations to age, education, employment status and poverty were only detected for the Mountain region. The effect of race was detected for the states of Arizona and Utah and for the whole Mountain region.

Figure 2
figure 2

COVID-19 fatality rate association to county-wide comorbidities and demographic factors adjusted by population density. Only those associations that were detected as significant (P ≤ 0.05) are shown. Asterisk (*) indicates significance of a specific predictor at a Bonferroni threshold (37 tests). Mountain region corresponds to all states evaluated as a meta-analysis.

In a further evaluation of the altitude effect, we performed subsequent modeling that simultaneously considered the effect of altitude along with comorbidities and demographic factors was conducted. The outcomes for these models are presented in Fig. 3 and are adjusted for population density. When simultaneously evaluating the effects of several factors known to have an association with the dependent variable, the expectation is that they will remain significant when their contribution is independent of each other or that they will drop out when they share their covariance. In current models, the effect of altitude is consistently present when evaluating the data from the states of Colorado, Idaho, Wyoming and the whole Mountain region. The state of Colorado and the Mountain region were the only ones that consistently reached Bonferroni significance. Interestingly, using this set of models, the state of New Mexico displayed a reverse altitude effect where higher altitude was associated to a higher fatality rate. It is noteworthy to mention that in the states of Colorado, Idaho, and Wyoming, the predominant effect of comorbidities was eliminated suggesting that altitude is a more reliable predictor in these states. The states of Arizona, New Mexico, and Utah continued to display an association to comorbidities. Income continued to appear for several states and remained significant in Utah for the whole Mountain region. The higher risk of fatality due to COVID-19 observed for the whole Mountain region for Native Americans continues to be observed even when considering altitude although this effect was not significant at a Bonferroni level.

Figure 3
figure 3

COVID-19 fatality rate association to county-wide comorbidities and demographic factors while simultaneously evaluating mean county altitude associations adjusted by population density. Only those associations that were detected as significant (P ≤ 0.05) are shown. Asterisk (*) indicates significance of a specific predictor at a Bonferroni threshold (72 tests). Mountain region corresponds to all states evaluated as a meta-analysis.

Based on the consistent detection of associations between comorbidities even while simultaneously considering the effect of altitude for the Mountain region, an all-inclusive model that included all comorbidities and altitude simultaneously and was adjusted for population density was developed. This model allows the detection and ranking of effects for predicting COVID-19 fatality rate. The outcome of this model is presented in Fig. 4, where raw parameter estimates are presented along with standardized effect ratios. In this model, comorbidities such as obesity, COPD, Asthma and Diabetes drop out in significance while only the effect of altitude and smoking continue to be significantly associated at a Bonferroni level. The standardized evaluation of these effects shows that altitude has a standardized effect of − 2.94, which is comparable in magnitude to the effects of smoking and CKD (3.48 and 2.58 respectively), although in the opposite direction. CHD displayed a significant but opposite effect to other comorbidity factors suggesting that patients with this condition may be self-selecting to live at higher altitudes.

Figure 4
figure 4

Elevation and comorbidities parameter risk ratio effect estimates for the mountain region adjusted by population density. (A) Generalized Additive Model parameter risk ratio estimates. (B) Standardized risk ratio estimates and confidence intervals. Values in bold are significant at P ≤ 0.05. Asterisk (*) indicates significance at a Bonferroni threshold (8 tests).

Our findings suggest that altitude is not only a strong predictor for COVID-19 fatality, but that it has a fatality lowering effect, in some cases, this effect is more relevant than well documented comorbidities. Our study also confirmed and provided further evidence that comorbidities and demographics are important predictors of COVID-19 fatality.

Discussion

The main question of evaluating the effect of altitude on COVID-19 fatality rates was explored early in the pandemic with some studies finding significant associations13,18,28,29,30 and some studies finding no associations18,20,21,22,31,32. As the pandemic advances and we have more information we believe it is important to revisit this argument. Studies that found no effect for altitude had several caveats which limited their conclusions earlier in the pandemic, which can now be reassessed with the new information available. These caveats can be put in three categories: The first group consists of studies that presented an argument based on early pandemic data13,20,21,22,30. These early studies closed their fatality data collection between April and July of 2020, with these dates being just a couple months into the pandemic. During that time, the spread of the virus was not geographically uniform, therefore, the case fatality data gathered earlier in the pandemic may not be an accurate representation of the true effect of altitude. Even when the virus was discovered in December of 2019, it was not declared a pandemic until March of 2020. This was followed by the implementation of travel and restrictions and social distancing mandates which further slowed the geographic expansion of the virus33. The second group consists of articles that make a regional assessment based on a small range of altitudes21,22,30,34. Studies done in Brazil, Spain, and Italy do not consider a wide range of altitudes in their analysis, which diminishes the ability to evaluate the effect of altitude on case fatality rate. The countries in these studies have the majority of their major population centers at altitudes less than 1 km above sea level, not accurately representing high altitude for this analysis. This group also includes studies done in locations that can have non generalizable characteristics such as those assessments done at a single location at extreme altitude in Peru32. The third, and last group includes studies that do not consider any covariate adjustments within their methodology18,19,30. Sociodemographic factors are strong determinants of health outcomes, so it is important to consider these variables for an effective assessment. Overall, many of the reports published early on the pandemic, such as the studies discussed above, fall into more than one of these groups.

The study we present revisits the main question of the effect of altitude on case fatality and addresses the caveats identified in previous studies and addresses them within the methodology: it includes over a full year of data ending in May 2021 which provides a more extensive representation of risk across geographies. It also focuses on a region that has a large altitude range and includes several major population centers at altitudes much higher than the aforementioned publications. Furthermore, it considers sociodemographic covariates in the analysis. The US Mountain Region is a sub region recognized by the US Census Bureau as a large generalizable area with defined geographic characteristics that facilitate context, excluding the relatively flat regions of the US, which dilutes the effect and increases the power of this study.

In our results, Colorado, Idaho, and Wyoming all displayed a strong altitude effect in which higher altitude decreased COVID-19 fatality rate; Arizona, Montana, Nevada, and Utah did not display any effect; and New Mexico displayed an inverse effect. Population density may have been a source of this discrepancy, and for that reason all our models are adjusted to this effect. As evident in Table 1, Colorado, Idaho, and Wyoming have higher average county altitudes of (2.061 km, 1.301 km, 1.687 km, respectively) than both Arizona (1.068 km) and Montana (1.151 km)24. CO, ID, and WY have, on average, higher average county altitudes, this causes the predictive effect of altitude on case-fatality rate is more likely observed. Population distribution across altitude within each state is another factor that contributes to the differing effects between states. For example, Colorado has 47% of its population in Arapahoe, Denver, El Paso, and Jefferson counties, where average county altitudes range from 1.637 km (Denver County) to 1.942 (El Paso County). On the other hand, Arizona24 has 62% of its population in Maricopa County (altitude of 0.385 km), and 75.3% of Nevada’s population lives in Clark County (altitude of 0.655 km)24. With a larger percentage of the population living at lower altitude in Nevada and Arizona, it is less likely the effect of high altitude on case-fatality rate would be seen. Other contributions to the differences in effects between states is the size and number of counties in each state. States that had a strong altitude effect, such as Colorado and Idaho, are divided into more counties than states that did not have an effect, such as Arizona, Nevada, and Utah. This discrepancy in the number of counties and the population within them is also addressed with the population density adjustment performed in all models. Altitude seems to have less predictive value on COVID-19 case fatality in states with fewer counties since each county covers a larger range of altitudes. States with more counties, however, allow for a more granular look at the data as the average altitude of each county encompasses a smaller area.

In the state of New Mexico, altitude was shown to have an association with an increase in the fatality rate. An explanation for this is the large Native American population that resides in northwest New Mexico; the state of New Mexico has one of the highest Native American populations at just under 200,00035. Native Americans have been impacted disproportionately by the pandemic36, and since the Native American population in this state is predominantly distributed in the high-altitude counties, that may have flipped the direction of the effect. A visual presentation of this distribution is displayed in Supplementary Fig. 1. Native Americans were the only racial group within the Mountain West Region that demonstrated a higher fatality rate from COVID-19. This is seen in Fig. 3. Although, this was not observed every time, most likely due to power limitations. Furthermore, Native Americans have been shown to be more prone to developing chronic illnesses like COPD, hypertension, diabetes, chronic kidney disease, and coronary artery disease. These chronic diseases have been linked to an increase in the rate of COVID-19 fatality36,37. Research on certain genes in the Native American population that alter ACE238 is ongoing, potentially providing another explanation for the higher rates of COVID-19 fatality in this population. While this is an explanation for the different COVID-19 fatality trends seen in New Mexico, more research is required to provide a strong association.

Overall, altitude was shown to consistently decrease in fatality from COVID-19 infection even when controlling for comorbidities known to be associated with an increase in COVID-19 fatality. While this trend was not seen in every state individually, it was seen as a trend for the Mountain region collectively.

Much speculation has been presented already describing the human body’s physiological responses to high altitude and how it would play a role in a COVID-19 infection30. At high altitude, the body is required to adjust primarily as a result of the decreased PO239. High altitude also has a significant effect on the cardiovascular system, which could potentially influence the progression of the COVID-19 virus. This is because much of the acclimatization to higher altitudes relies on improving oxygen extraction and delivery in the tissues. Therefore, patients with pre-existing conditions have a higher risk of severe COVID-19 infections40. Cardiovascular diseases (CVD) have been linked to increased severity of COVID-1941. Interestingly, residents at higher altitudes often have lower rates of cardiovascular diseases42,43. With healthier cardiovascular systems overall, patients with COVID-19 may be able to better handle the cardiovascular impact of the virus, thereby decreasing case-fatality rate. Similar to CVD, studies have shown that altitude and obesity have a similar inverse relationship44,45,46,47. Obesity has demonstrated an adverse effect on COVID-19 outcomes48. With lower rates of obesity at higher altitudes, the case-fatality rate from COVID-19 infections is likely to be lower. Our study showed that although comorbidities are important predictors to COVID-19 fatalities, the effect of altitude is in some cases more prominent and carries a positive protective effect. An explanation for documented comorbidities appearing to be less relevant as compared to altitude may be related to the geographic delimitation of our study. The Mountain region has a significant representation of communities at high altitudes as compared to the rest of the United States where altitude variation is not as extreme. Also, states within this region have a lower prevalence of comorbidity factors49. These two factors may contribute to the detection of the protective effect of altitude and the diminished relevance of other comorbidities.

Our study has some inherent limitations related to the methodology used and the geography of the region. We did not challenge official values by recalculating or considering possible fatalities. Counties are not broken up uniformly by state and may contain wide altitude ranges. As explained in the discussion, some states such as Arizona have only 17 counties while others such as Colorado have 64 counties even though they have comparable total areas. Future research could break down the analysis by smaller geographic areas such as by zip code, in order to obtain a more granular look at the effect of altitude. In addition, future research should be directed towards establishing the physiological changes that occur at altitude and how those changes affect COVID-19 fatality.

In summary, the goal of our study was to revisit a question asked early in the pandemic but with more information that is now available. We evaluated possible correlations between altitude and COVID-19 case fatality rate in a geographic region that has a wide dynamic representation of altitudes, all while considering comorbidities and sociodemographic factors. Our statistical analysis demonstrated that COVID-19 case-fatality rate decreased with higher altitude in the Mountain region, and this effect is evident even when controlling for pre-existing conditions and certain demographic variables. The protective effects of high altitude are likely influenced by both physiologic changes and demographic trends that are associated with life at high altitude. This effect is evident even when accounting for several variables known to impact COVID-19 case fatality rate; our study shows that case-fatality rate and altitude have an inverse relationship across the Mountain West Region. It is not realistic to make a broad recommendation to vulnerable populations to move to altitude to be better protected against poor COVID-19 outcomes. Nonetheless, our study highlights specific populations that are still vulnerable despite this extra protective factor. Allocation of resources to these specific population can improve overall outcomes.