Introduction

The objectives of this study are: (1) to specify evacuation return and home-switch stability as two critical milestones of short-term recovery during and in the aftermath of disasters; and (2) to understand the presence of disparities among subpopulations in duration of these critical recovery milestones. The intensity and frequency of extreme weather events—flooding, winter storms, and hurricanes—have increased in the past few decades1,2,3. Monitoring the recovery from these extreme weather events enables determination of whether people have returned to their pre-disaster life and prepared for the following event. The literature4,5,6,7 has recognized that community disaster recovery is a complex process that involves several factors, including resource allocation, population vulnerability, and infrastructure resilience. Yet the assessments of recovery stages are often descriptive, subjective, and lack quantitative and data-driven measures to assess and proactively monitor the progress of community disaster recovery to inform recovery implementation and resource allocation8,9,10. Hence, departing from the standard approach for designation of recovery stages descriptively, we aim to determine critical milestones based on population activity patterns embedded in location-based big data, which is crucial to help decision makers and responders understand and monitor community recovery progress and allocate resources to communities.

In this study, critical recovery milestones are defined as times at which community functionality, such as life activities and commerce, return to steady state. This study focuses on two critical milestones of short-term recovery during and in the aftermath of disasters: evacuation return and home-switch stability. In particular, evacuation return and home-switch stability represent when people in an area return from evacuation and the home move-out rates are stabilized. During disasters, people may evacuate to escape life-threatening circumstances. Persons severely impacted by disasters who fail to evacuate may incur physical health issues as well as long-term mental health problems11,12,13. Houses damaged by the disaster may need repair before they are again inhabitable, forcing residents to relocate until their homes are inhabitable. Thus, by employing evacuation return and home-switch stability as indicators of short-term community disaster recovery, we can enable the understanding of community short-term recovery progress at fine scales. Also, specifying and monitoring these short-term critical recovery milestones reveals the trajectory of long-term recovery for some subpopulations. The evacuation return and home-switch stability can help decision-makers monitor community disaster recovery progress proactively, reveal disparate recovery progress in subpopulations, understand population responses to disasters, and reduce the influence on health and the local economy for future extreme weather events.

With location-based data, several studies have examined population mobility during disasters14,15,16,17,18,19 and assessed disaster impacts20,21,22,23,24,25; however, the majority of these studies focus on evacuation patterns26,27, disruption in mobility28,29,30, and mobility resilience31,32,33,34. Despite the recognition that population mobility and disaster impact extent are important factors in community resilience and recovery in disasters, few studies have attempted to characterize the disaster recovery process based on patterns of population activities. Currently, it is a common approach to collect data for assessing disaster recovery via public surveys from respondents who experienced disaster events35. Compiling and analyzing survey data has significant lag and puts the burden of providing information on affected people. Other researchers used information such as satellite imagery36 and geotagged twitter data37 to examine disaster recovery. Yet satellite imagery has limited ability to capture short-term recovery milestones in such hurricane events due to the blockage of clouds. Geotagged Twitter data may suffer bias issues because of the limited number of geo-coded tweets and the unbalanced user population among socioeconomic groups and affected areas. Location-based big data, on the other hand, provides opportunities to investigate the post-disaster recovery on a much finer scale and in a timely manner. Recognizing this, a number of recent studies have examined disaster recovery using location-based data. Yabe et al.38 observed human mobility for five extreme events with more than 1.9 million mobile phone users to examine macroscopic population mobility recovery patterns with consideration of connectedness to neighboring cities and house damage levels. Despite these efforts, little attention has been paid to examining short-term critical recovery milestones in terms of evacuation return and home-switch stability.

In this study, we utilized aggregated location-based data to assess critical community recovery milestones, evacuation return and home-switch stability, in the aftermath of the 2017 Hurricane Harvey in Harris County, Texas. As shown in Fig. 1, two important indices, evacuation and home move-out rates, are defined and examined as indicators for the short-term community recovery milestones of evacuation return and home-switch stability. Here, the return durations of evacuation and home move-out rates capture the time it took for a census tract to have evacuation and home move-out rates back to a steady state. The return of evacuation rate indicates that people have returned to their homes after evacuation. This milestone indicates that the hazard impacts (such as road inundations and power outages) have diminished, and residents felt safe to return to their homes. On the other hand, a steady state of home move-out rate represents that people ceased moving out of their home census block group (CBG), and the community has returned to stasis in terms of home switch. A greater than normal home CBG switch could suggest that residents whose homes were impacted are moving to other areas. The return of home-switch rate to a stable state suggest that impacted residents have found a new residence (new permanent home or temporary home). There are various reasons for switching homes after disasters; for example, in the context of flooding events, people may decide to sell their homes due to the lack of flood insurance, or people may want to relocate to places of relatively higher elevations to avoid future flooding impacts. Also, people who want to repair their current homes may need to move to temporary quarters while their damaged homes are under repair and restoration. People living in rental homes may be required to relocate to other properties or apartments due to necessary repairs and restoration. The return of the move-out rates to that of normal levels indicates that people stop switching homes. Accordingly, we used location-based big data with disaster impact and socio-demographic data to: (1) assess the duration of the evacuation and the time for the move-out rates to return to steady state after a hurricane, and (2) in responding to disasters, reveal potential disparities of evacuation return and home-switch stability among subpopulations from different income, race, and ethnicity. The remainder of this paper proceeds as follows: In “Results” section explains the results of the evacuation and home move-out rates and their corresponding time to return after the disaster, in “Discussion and concluding remarks” section discusses and concludes the disparate recovery patterns of different sub-populations and the main contribution of this study, and in “Materials and methods” section introduces the data and methods used in this study to assess evacuation return and home-switch stability.

Figure 1
figure 1

Schematic illustration of two critical community recovery milestones, evacuation return and home-switch stability assessing by evacuation and home move-out rates (The figure is for illustration purposes and not drawn to scale).

Results

Duration of evacuation and move-out rates returning to steady state

Figure 2 shows the distributions of the duration of the evacuation and move-out rates returning to steady state in Harris County, where the percentage represents the number of census tracts over the number of census tracts in Harris County. According to the results of the evacuation rate, residents of more than half of the census tracts in Harris County stopped evacuating and were able to return to their homes within 5 days after landfall of Hurricane Harvey. The ability to return home may also indicate the extent of impacts, such as road inundations and power outages, had been reduced to acceptable levels. On the other hand, for the return duration of the move-out rate, the results show that people living in more than half of the census tracts in Harris County stopped moving out of their homes within 6 weeks after the landfall of Hurricane Harvey. The return duration for the move-out rate takes longer than the evacuation rate due to the nature of switching homes. For example, insurance for covering costs of home repairs and whether to continue living in an area (due to work or school considerations) are important factors for residents. The drivers of the decision to evacuate and move out are different; the consideration of evacuation is usually based on the anticipated impacts of events, while the decision to move out may depend on actual damages in the aftermath of events. Thus, the return duration of the move-out rate is more dispersed temporally than the evacuation rate. Despite that, the move-out rate of most of the census tracts returned to steady state within 8 weeks, and only a few census tracts have the return durations for the move-out rate longer than 9 weeks. Overall, most of the census tracts in Harris County took less than 5 days to return to steady state for evacuation and, got move-out rates, 8 weeks.

Figure 2
figure 2

Distributions of the return duration of the evacuation rate (left) and home move-out rate (right) in Harris County, where the probability represents the number of census tracts over the total number of census tracts, which is 786, in Harris County.

Return duration in flooded and non-flooded areas

This section presents the results of the duration of returning to steady state with the consideration of flood impacts. Based on flood impact data, we categorized census tracts into two groups: flooded and non-flooded. Due to different degrees of impact, the patterns of the evacuation and move-out rates are likely to be different. For example, the return duration of evacuation rate for people living in severely flooded areas may be of longer duration due to the wait for water to recede from their homes and remediation to be completed. Due to the non-normality of the residuals, this study used the Kruskal–Wallis test, or one-way ANOVA, on ranks, to examine the difference in the return patterns of different groups of populations. A significant result of the Kruskal–Wallis test indicates that the median values of different groups of census tracts are different. As shown in Fig. 3, the probability for relatively long return durations of evacuation and move-out rates in the flooded census tracts is generally higher than in the non-flooded census tracts. In addition, the p values for both comparisons are less than 0.05, indicating that the return durations in the flooded census tracts for both evacuation and move-out rates were significantly longer than in the non-flooded census tracts.

Figure 3
figure 3

Distributions of the return duration of the evacuation rate (left) and home move-out rate (right) in flooded and non-flooded census tracts in Harris County, where the probability represents the percentage of census tracts. The results indicate that the durations for the flooded census tracts for both evacuation and move-out rates were significantly longer than for the non-flooded census tracts. In this study, we identified 193 flooded and 593 non-flooded census tracts in Harris County. The solid lines are kernel density estimations (KDE).

Examination of disparities in evacuation return and home-switch patterns

After we identified the return durations of the evacuation and home move-out rates, we analyzed whether the patterns of achieving short-term critical recovery milestones among various socio-demographic statuses were different. For example, the low-income population may experience difficulty evacuating from their damaged homes due to a lack of resources and the need for assistance from agencies or rescue organizations. Specifically, in the case of flooding, for people living in flooded or high-humidity houses without evacuating, the possibility of contracting viral diseases and infections could increase. The high-income population, however, may have the means to evacuate to temporary shelters, such as hotels and the homes of friends or relatives that are distance away from impact areas, to mitigate physical impacts. The analysis results in this section examines socio-demographic status: median household income, the ratio of Black and Hispanic populations to total population, and the percentage of people living in rental homes, with the flooded impact data and the specified critical recovery milestones, evacuation return and home-switch stability, to understand the recovery patterns in different subpopulations and communities. Also, we investigated the differences between long and short return durations in terms of socio-demographic status.

Persons of different socio-demographic status may exhibit different return patterns. The return patterns of census tracts affected by flooding may differ from those not impacted by flooding. Specifically, we compared the return patterns among combinations of high and low median-household-income levels, as well as flooded and non-flooded. Figure 4 illustrates the comparison results in two settings for the evacuation and move-out rates: (1) classify population into three groups, which are all populations, high income population (above the third quartile), and low income population (below the first quartile) and compare the patterns of flooded and non-flooded areas in each subpopulation group, as shown in Fig. 4A and B, and (2) classify population into two groups, which are all population and population in flooded areas and compare the patterns of high and low median household income in each subpopulation group, as shown in Fig. 4C and D.

Figure 4
figure 4

Return patterns of the evacuation and move-out rates for different subpopulations, where the vertical axis is the cumulative probability that represents the cumulative percentage of census tracts returning to a steady state. (A) and (B) demonstrate the comparison between flooded (F) and non-flooded (NF) areas in all, high-, and low-income populations. (C) and (D) demonstrate the comparison between high- and low-income populations in all and flooded census tracts.

In Fig. 4A and B, the differences between flooded and non-flooded areas are significant in all populations and high-income populations. That is, the return durations of the evacuation and home move-out rates were longer in the flooded areas for all populations and high-median-household-income areas. For the low-income subpopulation, however, the difference between flooded and non-flooded areas are not significant for both the evacuation and move-out rates. In other words, the flood impacts did not significantly affect the return patterns of the evacuation and move-out rates in the low-income population. In Fig. 4C and D, the differences are significant between high- and low-income populations in the flooded areas for both the evacuation and move-out rates. For the comparison of high- and low-income populations in all study areas, only the evacuation rate shows significant differences between high- and low-income levels. The difference in the return patterns between the high- and low-income populations in all study areas for the move-out rate is insignificant. In addition, all return patterns show two-phase return progress that the return pace of the first 80% of the population is faster than the remaining 20% of the population, which indicates that most of the communities return to steady state earlier and 20% of the areas return with considerable lag.

According to these analysis results, in flooded census tracts in the study area, the time for the evacuation and home move-out rates to return to steady state was longer. The longer return duration compared to the non-flooded census tracts is intuitive: residents in the flooded areas must wait for the flooding to recede before returning to homes from evacuation and making relocation decisions. Yet the exception is for the low-income subpopulations; the differences between flooded and non-flooded status for the low-income census tracts are insignificant for both the evacuation and move-out rates. This may show the inability of low-income population to evacuate and relocate to mitigate the impact of the flooding. For the flooded areas, it is significant that the low-income population has a shorter return duration for both the evacuation and move-out rates than the high-income population. A further comparison between long and short return durations of the evacuation and move-out rates is addressed in the following section. Overall, based on the analysis of the two indicators of short-term critical recovery milestones, the immediate responses and the community recovery progresses in terms of evacuation and home-switch are different between high- and low-income populations, as well as between flooded and non-flooded areas.

Comparisons between long and short return duration

The results in the previous section examined the effect of income-level and flooding status on the return patterns on the achievement of short-term critical recovery milestones of the evacuation and move-out rates. To understand the difference between long and short return durations of the evacuation and move-out rates, we compared these milestones with respect to income, housing type, and race. Based on the distribution of return durations for the evacuation and move-out rates, the first quartile durations of return to steady state in flooded areas is 5 days for evacuation and 5 weeks for move-out rates. The third quartile values are 7 days for evacuation and 8 weeks for move-out rates. We applied the following criteria to distinguish long and short return durations. Long return duration was greater than 7 days for evacuation and 8 weeks for home move-out rates. Short return duration was less than 5 days for evacuation and less than 5 weeks for home move-out rates. Figure 5 shows the location of census tracts with long and short return durations in terms of the evacuation and home move-out rates.

Figure 5
figure 5

Locations of census tracts with long and short return durations in terms of the evacuation rate (left) and the home move-out rate (right). The criteria for long return duration was greater than 7 days for evacuation rate and 8 weeks for home move-out rate. The criteria for short return duration was less than 5 days for evacuation rate and less than 5 weeks for home move-out rate. The dotted areas are the census tracts identified as flooded census tracts.

We first compared the differences between long and short return durations with respect to median household income and the ratio of the population living in rental home. Figure 6 compares long and short return durations to the median household income and the ratio of the population living in rental homes in the evacuation and move-out rates in the flooded census tracts. The statistical test results indicate that the difference between long and short return durations are significant except for the difference in the move-out rate in terms of the ratio of population living in rental homes; even though it is insignificant, it is apparent that the census tracts with long return duration of move-out rate tend to have a lower ratio of residents living in rental properties. Thus, according to this result, the census tracts with lower median household income and a higher ratio of persons living in rental homes had short return durations in the evacuation and move-out rates.

Figure 6
figure 6

Comparison between long and short return durations with respect to the median household income (A,B) and the ratio of the population living in rental homes (C,D) of the evacuation (A,C) and move-out rates (B,D). The statistical test results indicate that the differences between long and short return durations are significant except for the difference in the move-out rate in terms of the ratio of living in renting homes.

The shorter return duration of evacuation rate for the low-income population may indicate the inability to remain evacuated, which is reported in other studies in the literature39,40. Several studies41,42,43 indicated that low-income population were less likely to remain evacuated due to barriers linked with financial constraints. For example, high-income population can evacuate to other cities or stay in hotels. In contrast, the low-income population may not have options other than returning to their homes due to work obligations or inability to afford accommodations. The shorter home move-out rate return duration for the low-income population compared to the high-income population may be related to their housing types (renting versus owning), the damage level of their homes, and the capability to repair their homes (influenced by flood insurance coverage). The census tracts with low median household income usually have a higher ratio of the population of living in rental apartments/homes. The residents may be required to move to other properties or apartment units because of flooding damages. The high-income population is able to live in houses less vulnerable to flooding44 or have flood insurance and financial resources45,46, allowing them more time to consider and make decisions regarding rebuilding their home and relocation. People unable to afford repair costs or living in rental apartments/homes are less likely to return to their homes due to insufficient financial resources47. Thus, a shorter move-out return duration does not necessarily signal a positive trend in short-term recovery and these indicators should be interpreted in light of the housing type and socio-demographic characteristics of each area.

Figure 7 compares long and short return durations in terms of the Black and Hispanic populations in the evacuation and move-out rates in the flooded census tracts. The statistical test results indicate that the differences between long and short return durations in the evacuation rate are significant but for the home move-out rate, are insignificant. Since the evacuation rate indicates the immediate response of the population to the disaster, an ideal result of the difference between long and short return duration should be solely affected by the flood damage level and not related to the socio-demographic. However, a census tract with a shorter return duration of evacuation rate tends to have a higher ratio of minority populations. This result indicates that minority populations require help and resources to overcome challenges to remain evacuated. On the other hand, based on the statistical results, there is no significant difference in the move-out return duration with respect to the ratio of minority populations.

Figure 7
figure 7

Comparison between long and short return durations with respect to the Black population ratio (A,B) and Hispanic population ratio (C,D) in the evacuation (A,C) and move-out rates (B,D). The statistical test results indicate that the differences between long and short return durations in the evacuation rate are significant but insignificant in the home move-out rate.

Discussion and concluding remarks

By employing location-based big data with disaster impact and socio-demographic data, this study specified two critical recovery milestones, evacuation return and home-switch stability, to address the objectives of the paper. Specifically, we used the evacuation rate to assess results demonstrating that more than half of the census tracts in Harris County returned from evacuation within 5 days. In addition, the populations more than half of the census tracts stopped moving out after 6 weeks. Return durations of the high-income census tracts when flooded were longer than those for when non-flooded; however, there was no significant difference between flooded and non-flooded in the low-income census tracts. This finding indicates the inability of the low-income population to evacuate and relocate. When the census tracts were flooded during Hurricane Harvey, the disparate return patterns in the evacuation and home move-out rates were significant in that the low-income population returned sooner than the high-income population. The flooded census tracts with short evacuation return (less than 5 days) had lower median household income, higher ratio of persons living in rental homes, and a higher percentage of minority populations compared to those with long evacuation returns. On the other hand, there is no significant difference between long and short return durations of the home move-out rate in the flooded census tract with minority groups; the differences between them are mainly related to income and housing types. The long return durations (more than 8 weeks) of the home move-out rate tended to be more prevalent in high-income census tracts compared to the short return durations. Often, we view the areas with shorter return durations as more resilient to disaster based on the ability to return to steady state. According to the results and discussions in this study, however, the fact might be just the opposite. A shorter evacuation return and relocation progress may indicate that challenges faced by low-income and minority populations to evacuate and relocate and require additional assistance and resources to mitigate the impact of disasters.

This study provides three contributions to the study of parity recovery and remediation after a disaster: specifying critical short-term recovery milestones, revealing disparate community recovery patterns in different subpopulations, and observing non-uniform recovery duration and patterns. First, this study specified two critical milestones-evacuation return and home-switch stability-related to short-term disaster recovery to be used for more data-driven and proactive monitoring of recovery. The standard approaches for community recovery monitoring have significant lags and put the burden of data collection on affected people via public surveys. The indicators used in this study were obtained from privacy-protective and aggregated location-based data to provide a finer-resolution insight into the recovery at the census-tract level and to monitor community recovery progress in a more data-centric manner during and in the aftermath to support decision-makers and responders proactively. Second, the findings showed that a shorter duration of critical recovery milestone indicators in flooded areas is not necessarily a positive indication. A short duration of evacuation return could be due to challenges to evacuation faced by low-income residents. A short home move-out return could be due to living in rental property or a lack of flood insurance to properly effect home repairs and relocation. In practice, early return of evacuation and home-switch in the context of flooding events may signal the absence of resources and may require support from officials and decision-makers. Third, the skewed distribution of return durations for both the evacuation return and home-switch stability were observed in all subpopulation groups. All return patterns show a two-phase return process that the first 80% of population returned faster than the remaining 20% of the population. This phenomenon indicates that the recovery patterns are non-uniform. Hence, in evaluation and monitoring of recovery, it is important to consider the socio-demographic information of both unusual short and long return duration to identify potential issues before making decisions.

Some limitations and concerns need to be addressed in future studies. Since the location-based data used in this study is anonymous, the analyses of this study focusing on specifying short-term recovery milestones are based on the socio-demographic information at the census tract level instead of the individual level. Therefore, the results demonstrated in this study only reveal the short-term recovery milestones and the disparities aggregated at the census tract level rather than evacuation dynamics. We make no implication regarding individual household behaviors. Future study is needed to integrate field surveys with location-based dtata to understand the relationship between the return duration of the evacuation and move-out rates and socio-demographic variables at the household level. While this study focuses on the progression of return on evacuation and home switch rates of census tracts in the aftermath of disasters to understand the recovery progress of an area, we note that population behaviors such as where people evacuate and relocate to are also critical and thus require further studies to investigate. The home CBG information used in this study is identified and aggregated by Spectus with its algorithms considering users’ dwell durations, dwell start and stop time, and dwell locations using GPS information; however, the algorithm is not publicly available. In addition, this study only considered the impact of flooding; Hurricane Harvey did not cause extensive power outages. Further investigation into the effects of the power outage on return duration can be made using data from events with extensive power outage. Despite the limitation and concerns, from a practical perspective, the indicators and findings in this study could inform disaster managers and public officials to make recovery decisions and allocate resources in a more proactive, data-driven, and equitable manner. Such data-driven approaches could overcome lags and inefficiencies in disaster recovery management and enhance community resilience.

Materials and methods

Study area and period

The study collected and analyzed data from Harris County, Texas, which includes the Houston metropolitan area, one of the most adversely affected areas by the 2017 Hurricane Harvey. On August 25, 2017, Hurricane Harvey, a devastating Category 4 hurricane, made landfall and led to heavy rainfall in Harris County. In addition, Houston downtown and some western areas of Harris County were flooded. There was limited mandatory evacuation issued for Harris County. Also, due to the release of water from Barker and Addicks Reservoir, the west part of Harris County experienced an extensive and prolonged flooding. The impacts of Hurricane Harvey continued until September 1, 2017, when Hurricane Harvey left Harris County, and people started to recover from the impacts afterward. To understand the progression of return on evacuation and home switch rates, we obtained data from Harris County at the census tract level with 786 census tracts from the period between July 9, 2017, to July 28, 2018, within which the pre-disaster period is from July 9 to August 24, 2017, and the post-disaster period is from August 25 to July 28, 2018.

Data sources

Location-based data

The aggregated location-based data used in this study is from Spectus Inc. Spectus has a location intelligence platform which collects mobility data of anonymized devices of users who have opted in to provide access to their location data anonymously for research purposes through a CCPA (California Consumer Privacy Act)- and GDPR (General Data Protection Regulation)-compliant framework. Spectus creates their geo-behavioral dataset by collaborating with app developers directly to capture offline behavior data at fine-granular scales with accurate locations based on Bluetooth technology, as well as GPS, Wi-Fi, and IoT (Internet of Things) signals. For anonymous, opted-in each user, Spectus collects more than a hundred data points daily on average, which provides a more accurate understanding of the population’s mobility patterns than the conventional mobility survey. Current daily active user count collected by Spectus is roughly 15 million in the United States. Through its Data for Good program, Spectus provides mobility insights for academic research and humanitarian initiatives. By analyzing the aggregated mobility patterns of more than 500,000 anonymous Spectus users (representing 12.5% of the population of the Puget Sound region under analysis), Wang et al.48 determined that Spectus data, as compared to cellular network and in-vehicle GPS data, benefitted from a superior combination of large scale, high accuracy, precision, and observational frequency. Beyond validating scale and accuracy, the research48 found that Spectus data is highly representative in terms of population mobility patterns. In addition, multiple existing studies49,50 on Spectus data have demonstrated the representativeness of the data.

Spectus aggregates data using artificial intelligence and machine learning techniques. Spectus’ responsible data sharing framework enables us to query anonymized, aggregated, and privacy-enhanced data, by providing access to an auditable and on-premise sandbox environment. In this study, we used one of the Spectus aggregated datasets, Daily Metric by Device, to assess the return patterns and progresses after disasters. The Daily Metric by Device table provides information at the device level, including users’ home census block group tags and hours users stay at home census block groups per day and night. Based on the aggregated data, we calculated the evacuation and home move-out rate, which are introduced in Data Processing section, to understand population’s response to disasters. All the location-based data used in this study were aggregated to the census tract level in order to further preserve privacy.

Flood impacts data

In this study, we used flood inundation percentages within a census tract as a measure of flood impacts. Specifically, we calculated flood inundation percentages based on the flood inundation map of Hurricane Harvey produced by Federal Emergency Management Administration (FEMA). We overlaid the map of Harris County at the census tract level with the FEMA flood inundation map to compute the flood inundation areas within each census tract and its corresponding flood inundation percentage. For the analysis in this study, we used 10% of the flood inundation percentage in each census tract as a threshold to distinguish flooded and non-flooded areas. The 10% threshold was selected based on the comparison with the claim data from Federal Emergency Management Agency’s National Flood Insurance Program. Figure 8 shows the spatial distribution of the flooded and non-flooded census tracts in Harris County.

Figure 8
figure 8

Map of the flooded and non-flooded census tracts in Harris County based on the flood inundation map of Hurricane Harvey produced by FEMA.

Socio-demographic data

We retrieved demographic and household socioeconomic data from the American Community Survey database administrated by US Census Bureau at the census tract level to understand whether the socio-demographic status affects return patterns and progresses. The data used in this study is the 2017 5-year estimates data, representing the estimates over the 5-year period from 2013 through 2017. The socio-demographic data obtained in this study included the median household income, the ratio of the Black population, the ratio of the Hispanic population, and the housing types in each census tract in Harris County. We then compared this socio-demographic data with the recovery milestones such as evacuation and home switch return patterns.

Data processing

We first identified residents of Harris County during the hurricane period based on their home census block group tags in the aggregated data provided by Spectus. That is, we extracted all users and their corresponding information if their home tags occurred in Harris County at least once. For the evacuation rate, we calculated the percentage of people who left their homes daily. Also, we elicited the move-out rate from the changes of users’ home tags in the aftermath of the event. Calculation of the evacuation and move-out rates is discussed in the following sub-sections.

Evacuation rate

The evacuation rate was calculated based on the data at the census block group level, the finest geospatial level of users’ home locations. In this study, people who left their home census block groups and dwelled in the other census block groups for an entire day were viewed as evacuated populations. In other words, the evacuation rate indicates that the percentage of people who left their home census block groups during Hurricane Harvey. To this end, we extracted the data from July 9, 2017, through November 19, 2017, from the aggregated data provided by Spectus to capture the pre-disaster period and avoid the effect of the Thanksgiving holiday. To ensure the quality of the data and avoid data biases, we analyzed only records with at least 240 min of location information in a day. Thus, the evacuation rate for each census tract was calculated based on the average user count per day per census tract is 126 users. Then we calculated the evacuation rate for each census tract as the number of evacuated users divided by the total number of users in a census tract. Also, to understand fluctuations in the evacuation rate during Hurricane Harvey, we calculated the percent change of the evacuation rate according to the baseline rate for CBGs, which is the average rate that indicates the percentage of people left their home during the pre-disaster period (July 9, 2017, through August 5, 2017). The baseline rate is calculated based on the day of a week to account for the difference between weekdays and weekends. The calculation of the percent change is shown as Eq. (1).

$$\begin{aligned} Percent \; change \; of \; ER_{t,d,c}= {\frac{(ER_{t,d,c}-BER_{d,c})}{BER_{d,c}}} \end{aligned}$$
(1)

where \(ER_{t,d,c}\) is the evacuation rate on day t and day d of a week in census tract c, and \(BER_{d,c}\) is the baseline evacuation rate on the day d of a week in census tract c.

Home move-out rate

The home move-out rate, an index assessing home-switch return patterns and progress, is calculated according to the home CBG information in the aggregated data. Since home switches do not happen often, we obtained data from July 9, 2017, to July 28, 2018, and further aggregated it to a weekly period. Specifically, we aggregated the home tags of all users who had at least one home tag in Harris County to a weekly table so that every user has a home tag every week. If no identified home information can be found from Spectus for a user for a specific week, we filled the data from the previous home tag by assuming no home switch for this user during this period. We then used the home information and calculated the home move-out rate at the census-tract level. To this end, the move-out rate is defined as the number of users switching their homes during a specific week from a census tract over the number of users in the census tract. Likewise, we calculated percent change of the move-out rate to understand fluctuations in the aftermath of Hurricane Harvey based on the pre-disaster baseline data from July 9, 2017, through August 12, 2017. That is, the move-out rate for each census tract was calculated based on the average user count per week per census tract is 192 users. Each census tract has a baseline home move-out rate for calculating percent changes using Eq. (2).

$$\begin{aligned} Percent \; change \; of \; MOR_{w,c}= {\frac{(MOR_{w,c}-BMOR_{c})}{BMOR_{c}}} \end{aligned}$$
(2)

where \(MOR_{w,c}\) is the move-out rate on week w in census tract c, and \(BMOR_{c}\) is the baseline move-out rate in census tract c.

Identifying duration returning to steady state

The times at which the evacuation and home move-out rates return to steady state are critical milestones of community short-term recovery. Thus, we developed a duration identification approach to specify the duration of evacuation and move-out rates returning to steady state after the event. During the impact of a disaster, the evacuation rate increases as persons evacuate from their homes to avoid injury and due to physical damage and return to their homes when the impact level decreases. Similarly, people switch their homes during and in the aftermath of a disaster due to damaged property. The move-out rate increases directly following a disaster then returns to steady state. We used the percent change of the evacuation and move-out rates and applied the rolling average method to obtain trends of the percent change of the evacuation and move-out rates during the evaluation periods. In particular, we calculated the 7-day average changes for the evacuation rate when there were more than 4 days of data within each 7-day period. Also, to understand fluctuations in the move-out rate, we used 4-week average changes in cases in which at least 2 weeks of data were known.

After determining trends in the percent change of the evacuation and move-out rates for each census tract in the study area, we looked for (1) the maximum change and (2) the durations of reaching the maximum change during the evaluation period. Using that information, we identified the duration for return to steady state in each census tract in the period after the maximum change and then specified the beginning of a steady state. Since a new steady state in the aftermath of disasters can vary compared to the pre-disaster status, we defined a steady state as having no substantial difference in terms of the percent changes. That is, the difference of the percent change of the evacuation and home move-out rates between two consecutive days/weeks is within a threshold, assumed to be 10% in this study. Thus, we defined the returned day/week of the evacuation and move-out rates for a census tract to be the first day/week with two consecutive differences less than 10%. Through this approach, we identified the duration of the evacuation and move-out rates returning to steady states for each census tract in Harris County in the aftermath of Hurricane Harvey.