Introduction

Disasters not only cause physical damage to infrastructure systems and building facilities but also affect the societal dynamics of communities by disrupting the lifestyles of populations. Lifestyles can be characterized by essential and non-essential activities, such as visiting grocery stores to meet food needs or visiting local shopping centers to buy clothing (FEMA, 2020; Li et al., 2021; Podesta et al., 2021). The disruption of essential lifestyles negatively impacts populations, as they are unable to access critical resources to maintain their health and well-being. The restoration of non-essential services can also signal a return to normalcy, as people are more able and comfortable to resume their earlier activities (Horney et al., 2017; Links et al., 2018). During the rebuilding and recovery process following a disaster, people may incrementally resume their standard lifestyles. The return to pre-disruption lifestyle standards could represent an important facet of the societal impacts of disasters, and thus, has the potential to unveil a critical milestone in community recovery (Ma et al., 2022; Yuan et al., 2021).

Recovery is a vital component of the disaster management cycle. According to FEMA (2016), disaster recovery involves the return of all formal and informal systems to a normal state. Eid and El-adaway (2018) show that the disaster recovery process is multidimensional and nonlinear which involves a multitude of strategies and capacities to rebuild community livelihoods. Indeed, it is well-established that recovery is a complex process of multiple dimensions (Horney et al., 2017; Platt et al., 2020). Finucane et al. (2020) discuss how recovery planning should be adaptive to community needs and establish formal mechanisms for assessment. Recent literature has also stressed the importance of considering social connections (Delilah Roque et al., 2020; Masud-All-Kamal & Monirul Hassan, 2018; Sadri et al., 2018) and social inequities (Emrich et al., 2022; Peacock et al., 2014; Sovacool et al., 2018) in varying levels of recovery. According to Mayer (2019), the recovery process continues to be enhanced by interventions and capacity building in individuals and communities, which is being shown in programs across the globe.

Although there is no concrete consensus on recovery, several recovery frameworks have been developed. For example, Cutter et al. (2008) presents a system perspective on recovery using factors of ecological, social, economic, infrastructure, institutional, and community competence. Based on the ratings of a Delphi-panel to expert opinion, Jordan and Javernick-Will (2013) viewed recovery through infrastructure, social, economic, and environmental components. Horney et al. (2017) recognized the importance of developing potential indicators for recovery including but not limited to housing, infrastructure, and community planning and capacity building.

Recovery frameworks must also include elements of social recovery through institutions and services. These have singled out the significance of measuring people’s life activities as part of the rebuilding phase after a disaster (Links et al., 2018; Tatsuki, 2007). The framework by Links (2018) focuses on the variety of human needs by presenting the concepts of community functioning, prevention and mitigation, and population. In addition, Tatsuki (2007) mentioned several factors of long-term recovery process such as community rebuilding and government relations. Other research studies have examined measured recovery through the amount of population returns (Finch et al., 2010), the functionality of infrastructure services (Chang et al., 2014), the extent of housing damage (Peacock et al., 2014), and the rates of neighborhood poverty (Lee, 2020); however, to the authors’ knowledge, few studies have examined the lifestyle activities of populations at the granular scale of census block group (CBG) level. Therefore, the analysis of lifestyle patterns before and after disasters could bring unique insights about a population’s return to normalcy and supplement established recovery frameworks.

However, there is a gap of knowledge related to the specification, characterization, and quantification of recovery milestones (FEMA, 2016; Mayer, 2019; Tierney & Oliver-Smith, 2012). First, most recovery indicators are qualitative which can lead to a vague understanding of the disaster impact and recovery efforts. Quantitative measures of recovery are needed to aid in the allocation of resources and facilitate the restoration of critical systems. In fact, there is a need to produce quantifiable measures of recovery to not only help us better understand the recovery process but also evaluate the effectiveness of recovery efforts (Cutter et al., 2013). Second, recovery indicators are limited from a spatiotemporal perspective. They fail to capture the changing nature of individuals, populations, and communities in a post-disaster setting. Thus, the research introduces the concept of lifestyles to bring a much-needed quantitative and dynamic aspects of recovery. It also proposes the use of location intelligence data to quantify lifestyle patterns.

Location intelligence data can reveal important insights about complex disaster phenomena pertaining to population activities and interactions with critical facilities (Huang et al., 2022; Yuan, Esmalian, et al., 2022). For example, location intelligence data has examined disaster impacts (Rosenblum et al., 2021; Wang & Taylor, 2018; Yu et al., 2018), damage reporting (Yuan & Liu, 2020; Yuan, Yang, et al., 2022), evacuation patterns (Deng et al., 2021; Han et al., 2019; Hong et al., 2021), and rapid impact assessments (Kontokosta & Malik, 2018; Salley et al., 2019). Hong et al. (2021) categorized the evacuation patterns caused by Hurricane Harvey into shelter-in-place, stable, distressed, and abandoned households. Certain research studies found that human mobility has distinctive patterns during stable and disruptive periods (Wang & Taylor, 2014; Yabe et al., 2020). Podesta et al. (2021) measured periods of disruption to points of interest (POIs) in Harris County after the landfall of Hurricane Harvey; the study concluded that POI systems have differential recovery rates to their baselines. To expand on the understanding of accessibility to essential services, Esmalian et al. (2022), utilized location intelligence data to determine equitable access to critical facilities during disruptive events. Despite these recent advances, little of the existing work has leveraged location intelligence data for specifying and characterizing community recovery. Unlike traditional datasets (e.g., surveys) used for recovery, location intelligence data has greater spatial granularity, larger samples, and shorter lags in data collection. It could also relieve the burden of data collection from affected populations. Thus, examining lifestyle recovery as a critical recovery milestone can yield understanding about recovery trajectories at a granular level.

In this study, we leveraged location intelligence data, specifically POI data and human mobility data, to examine lifestyle activities at the CBG level. In doing so, this study presents a novel methodology to specify lifestyle patterns and understand lifestyle recovery as a critical milestone following a disaster. Due to their holistic and dynamic nature, lifestyles encompass routine changes across various spatial units after the disaster impact, and lifestyle recovery is a critical milestone based on capturing people’s interactions with infrastructure, businesses, and social interactions in the community (Cutter et al., 2008; Podesta et al., 2021).We processed and analyzed datasets from Spectus and SafeGraph, two prominent commercial companies that collect and process location-based data at a granular scale. We combined that data with publicly available building footprint data to specify population visitations to POI. The study context was Hurricane Harvey in Harris County, Texas, from August 2017 through November 2017. The objectives of the research are to (1) assess the extent of impact of disasters on lifestyle activities, (2) identify patterns of recovery trajectories, and (3) examine the variations of recovery given flooding and demographic attributes. The analysis focused on answering the following research questions: (1) What are distinctive patterns in typical lifestyles of populations during normal periods?; (2) To what extent, do disasters impact lifestyle activities and how quickly do lifestyle patterns recover?; and (3) To what extent, do the duration and trajectory of lifestyle recovery vary across different areas with different flooding impacts and demographic attributes? The aim of the research is to guide decision-makers such as emergency managers, public leaders, and service companies with more quantitative understanding of lifestyle recovery and bring attention to the potential disparities after a hazard event.

Data and methods

Study context

We analyzed and quantified the recovery of lifestyles using location intelligence data captured before, during, and after the 2017 Hurricane Harvey in Harris County, Texas. Hurricane Harvey was a Category 4 storm that made landfall in August 2017 and greatly impacted the Texas coastal region. Extensive flooding led to the inaccessibility of essential and non-essential facilities(CBS/AP. (2017)). The Texas Department of Transportation reported nearly 350 Houston road locations being affected (Dong et al., 2022; Zipline’s Logistics consultants, 2017), and the Electric Reliability Council of Texas (ERCOT, 2017) reported thousands of power disruptions. These disruptions had significant impacts on people’s lifestyle activities in the aftermath of Harvey. The impacts of Harvey and the metropolitan setting of Harris County provide a suitable context for examining lifestyle recovery patterns (Blake & Zelinsky, 2018).

Data sources

To specify daily lifestyle patterns, we processed multiple datasets to determine daily visitations from each home CBG to different POIs. Figure 1 depicts an overview of the methods for processing and analyzing the datasets. Human mobility data was primarily obtained from Spectus, a location intelligence data company that provided location data (e.g., Global Positioning System data) of high spatial accuracy and a very high frequency of observations (Spectus, n.d.). Every day, more than 100 data points on average are collected from each anonymous device. Currently, there are roughly 15 million daily active users in the United States. The Spectus panel consists of adults who provided informed consent to anonymized data collection for research purposes, through a General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA) compliant framework. Compared to traditional Call Detail Record (CDR) data, which provides general information about telephone exchanges and interactions, location intelligence data with precise GPS information provides exact data about destinations, which is extremely useful for detecting human lifestyle patterns in detail. Spectus’s main database is built using third-party apps that capture opted-in users’ anonymized location points if they consent to share their location with these apps. Data is further anonymized and erased from system, then aggregated at the CBG level to ensure consumer privacy and confidentiality.

Fig. 1: Integrating multiple data sources to specify population visitation to POIs for characterizing lifestyle patterns.
figure 1

Notes: Raw data includes multiple data sources. Data analysis includes the merging of data sources for visits at POIs and the overall methods for clustering the data. Metrics are the final outputs of the methods including calculating the recovery trajectories.

For 2017, Spectus data lacked North American Industry Classification System (United States Office of Management and Budget, 2022). NAICS codes to distinguish the type of each POI (stop points of unique device IDs). To address this, we utilized the location and type of different POIs consistent with NAICS codes from the SafeGraph dataset (SafeGraph). Safegraph’s core dataset contains spatial coordinates and addresses of POIs, as well as basic information about each POI, such as brand, operating time, latitude, and longitude, and NAICS code. SafeGraph gathers POIs by scraping open store locators on the web using publicly available APIs and scraping open web domains that provide updated locations. To fill in the gaps, SafeGraph independently performs processing and modeling as well as licenses third-party data to infer additional attributes. After collecting the POIs from Safegraph’s core dataset, we used Microsoft’s open-building footprint dataset to determine the polygon of each POI (Microsoft Open Source). The 1,542,887 building footprint polygon geometries in the Houston metropolitan area were created by Microsoft using their computer vision algorithms on satellite imagery.

To evaluate the spatial variation of lifestyle impacts and recovery based on flooding status, we used flooding data from the estimated flood depths on August 29, 2017 (FEMA, 2018). The data had a gridded horizontal resolution of three meters, which was processed appropriately for the CBG scale analysis. The percentage of CBG area flooded during Hurricane Harvey was an indicator of flooding extent. Demographic information, including total population, median income, percentage of the white population, and percentage of elderly residents above 65 years old, were collected at a CBG level from the US Census data (US Census Data (2017)).

Data processing of human visitation to POIs

Home_CBG to stop table

Data processing focused on the period of August to November 2017. The cut-off date was the end of November to minimize the effect of the holiday season on population lifestyle patterns. To associate each anonymous device’s home CBG in Harris County, we first retrieved data from the device matrix table in the Spectus core database. We recognized a CBG as a user’s home_CBG if the device consistently resided there for more than one day. The device matrix includes the user_id and home_CBG_id. Second, we used the Spectus core dataset’s stop table to capture anonymous human mobility. The stop points were captured in the dataset if individuals stopped at a location for a suitable time scale. User_id, latitude and longitude, stop-by date, and stop-by time were all stored in the stop database. By merging the device matrix table and stop table through the user_id, home_CBG was acquired.

POI polygon table

To specify the polygon for each POI, the place_id in Spectus’s dataset was extracted using the POI table in Spectus’s core datasets. This table has basic information about each POI, such as place id, place name, place address, and POI’s latitude and longitude. We obtained each place id with its associated building polygon by merging the POI table with the building footprint dataset.

Home_CBG to POI table

By merging the latitude and longitude of the POI table to the building footprint dataset, we matched each stop point of POI with a unique place_id. Then we classified each POI using the NAICS code, which provides information on the associated sector, subsector, industry group, and industries. Since the 2017 Spectus data lacks the NAICS code, we can match the polygon information (latitude and longitude) to Safegraph’s data, which does include the NAICS code. Informed by prior studies (Li et al., 2021; Podesta et al., 2021), we identified eleven POIs which contained five essential (gasoline stations, grocery and merchandise, health and personal care stores, medical facilities, and education) and six non-essential (banks, stores and dealers, restaurants, entertainment, recreation and gym centers and beauty care). The classification of essential was further influenced by FEMA definition of community lifelines, where FEMA discusses how community lifelines promote the food, water, health, and shelter needs of the community (FEMA, 2020). Table SI.1 in the Supplementary Information has the detailed POI-NAICS table with essential and non-essential lifestyles. By doing so, the daily visit data was generated from the home_CBG to POI, and weekly data was generated by aggregating seven days of daily data. The rationale for weekly aggregation is that lifestyle patterns might vary between weekdays but are usually consistent on weekly basis (Sevtsuk & Ratti, 2010).

To analyze and quantify lifestyle recovery for different CBGs, we first established a baseline to compare against the lifestyle impacts. The baseline was calculated by averaging the number of visits during the first two weeks of August 2017, the period before Hurricane Harvey made landfall. Second, we grouped POIs into essential and non-essential POIs to establish the baseline number of visits to each group of POIs. Then, we compared the number of visits in the subsequent weeks against the baseline to measure lifestyle recovery.

We will briefly acknowledge the limitations of this data processing. Due to the data availability and scope, we were not able to collect data from before 2017 and did not feel it was appropriate to use data after 2018 to establish the baseline. Published work has used similar processing methods (Esmalian et al., 2022; Hong et al., 2021). As the field and analysis of mobility data grows, future studies could control for seasonal and temporal patterns by observing previous periods of annual activity. To ensure the validity of the data and results, we also need to ensure that the population activity and mobility datasets used are representative. Regarding the representativeness of Spectus data, researchers from the University of Washington evaluated the utility of this data by mapping trips within an origin-destination transportation analysis (Wang et al., 2019). By analyzing the aggregated mobility patterns of more than 500,000 anonymous Spectus users (representing 12.5% of the population of the Puget Sound region under analysis), the researchers determined that Spectus data, as compared to the cellular network and in-vehicle GPS data, benefited from a superior combination of large scale, high accuracy, precision, and observational frequency. Beyond validating scale and accuracy, the research found that Spectus data is highly demographically representative(Wang et al., 2019). Additional existing studies on Spectus data have also demonstrated the representativeness of the data (Aleta et al., 2020; Vugrin et al., 2010). SafeGraph data have also been used to study consumer preferences (Athey et al., 2018).

A CBG is considered as recovered at the specific week if the number of visits to POIs is 90% of the baseline value. The concept of recovery of essential and non-essential lifestyles was modified from studies observing the resilience of critical systems (Nan & Sansavini, 2017; Vugrin et al., 2010). CBGs that did not return to the 90% threshold are labeled at 15 weeks of recovery which is the maximum amount of time for the data analysis. A sensitivity analysis of recovery to the baseline (thresholds of 90%, 80%, 70%) was performed and is found in Figure SI.4 of the Supplementary Information.

Primary and secondary k-means clustering

K-means clustering is an unsupervised learning method for creating clusters and cluster centers in a set of unlabeled data (Lenormand et al., 2015; Pappalardo et al., 2015; Thuillier et al., 2018; Toole et al., 2015). Instead of analyzing the lifestyle patterns of each CBG separately, we used k-means clustering to group CBGs with similar POI visitation patterns (i.e., the lifestyle patterns in the primary clusters) and recovery rates (i.e., differential recovery trajectories in the secondary clusters). In other words, k-means clusters similar CBGs based on the available data and does not depend on manual input to establish the boundaries of each dataset.

As shown in Fig. 2, k-means clustering is used in two steps. The first step created the clusters of CBGs based on their non-disrupted lifestyle patterns before the disruption of Hurricane Harvey; the second step clustered the CBGs within each primary cluster based on the lifestyle recovery after disruption. A minimum-maximum formula standardized the visitation values. The primary clustering was obtained from the POI visitations during the baseline period of the first two weeks of August 2017 (before the landfall of Hurricane Harvey). Vector data consisted of the aggregate frequency of visits to POIs over the total number of visits to all POIs. For example, the first input in the vector would consider the number of visits to grocery stores divided by the total number of visits to all POIs for a particular CBG within the same period. The secondary clustering of the lifestyle recovery is based on four features: the maximum point of disruption in essential POI visits, the maximum point of disruption in non-essential POI visits, the duration of recovery in essential POI visits, and the duration of recovery in non-essential POI visits. The maximum point of disruption is defined as the greatest percent change in visits to essential and non-essential POIs following the baseline period. Duration of recovery is the number of weeks for CBGs to recover to 90% of the baseline. In both clustering methods, the number of clusters is found through the elbow method as shown in Figure SI.1 in the Supplementary Information.

Fig. 2: Primary and secondary clustering to determine the extent of lifestyle characterization and disruption.
figure 2

Notes: Primary clustering groups home CBGs from prominent visits to POIs. POIs are then grouped into essential and non-essential lifestyles. Secondary clusters group within each of the primary clusters based on the maximum point of disruption and duration of recovery for essential and non-essential POIs. Finally, trajectories of recovery are examined based on mean and distribution.

Results

Primary clustering of k-means characterizes the lifestyles of CBGs for the baseline period before Hurricane Harvey. It shows the level of dependence that the cluster normally has on different essential and non-essential lifestyles based on the frequency of visits. The elbow method (Figure SI.1 in Supplementary Information) found that four clusters were the optimal number of k-clusters. The median values and distribution of the demographic characteristics of the four clusters are found in Figure SI.2 and Table SI.2 in the Supplementary Information. In essence, 60.26% of CBGs were in Cluster 1, 3.68% in Cluster 2, 27.94% in Cluster 3, and 8.12% in Cluster 4. According to the merged census data, Cluster 1 contained about 3.09 million people within an area of 2.86*10^9 square miles; Cluster 2 contained 1.42*10^5 people within an area of 6.46*10^7 square miles; Cluster 3 contained about 1.48 million people within an area of 2.87 with 1.48*10^9 square miles; and Cluster 4 contained about 3.21*10^5 people within an area of 2.05*10^8 square miles. Figure 3 shows the spatial distribution and relative frequency of visits to the essential and non-essential lifestyles. The exact values of the visit frequency to the essential and non-essential POI categories are tabulated in the Supplementary Information for Tables SI.3a and SI.3b. Non-essential categories had consistent rankings across the four clusters with the top three rankings as stores and dealers, restaurants, and beauty care. Essential categories had consistent rankings for Cluster 1, 3, and 4 with the top three being health and personal care stores, grocery and merchandise, and medical facilities. However, Cluster 2 had a slight adjustment with the top three ordered rankings as health and personal care stores, medical facilities, and grocery and merchandise.

Fig. 3: Results show the frequency distribution of the essential and non-essential lifestyles across the four primary clusters.
figure 3

Notes: POIs with greater visits means that populations are more dependent on those POIs. The bar plots present the prominent weekly POI visitations (essential and non-essential) within each lifestyle cluster. The x-axis shows the number of weekly visitations, and the y-axis shows the type of POI. In the map, Clusters 1 and 3 represent the majority of primary lifestyle clusters with 60.26% of CBGs and 27.94% of CBGs respectively.

Table 1 displays how the relative frequency of each lifestyle varied across the clusters, which was confirmed by analysis of variance (ANOVA) testing with a statistical p-value of < 0.05. This indicates that each primary cluster has distinct dependence on its lifestyles when compared to the other clusters. It is important to recognize the subtle differences which make up the essential and non-essential lifestyles of the primary clusters. Cluster 1 had the highest relative frequency for essential lifestyles with health and personal care stores (77.40%) and education (0.32%) as well as for non-essential lifestyles with stores and dealers (65.57%). When compared to other clusters, Cluster 2 had the highest relative frequency for essential lifestyles with medical facilities (31.97%) and gasoline stations (3.60%) as well as for non-essential lifestyles with restaurants (34.27%), recreation and gyms (2.21%), and entertainment (0.99%). Cluster 3 had no distinct high frequency to the essential and non-essential lifestyles. Lastly, Cluster 4 had the highest relative frequency for essential lifestyles with grocery and merchandise (28.40%) and non-essential lifestyles of beauty care (21.86%) and banks (13.37%).

Table 1 ANOVA testing on the frequency of lifestyles for primary clusters.

Secondary clustering of essential and non-essential lifestyles

Secondary clustering of k-means within each primary cluster will determine the state of lifestyles during disrupted periods (e.g., Hurricane Harvey) and detect disproportionate rates of recovery within the same lifestyle. Using the elbow method, the analysis found three sub-clusters for each primary cluster. Among the total twelve sub-clusters, we identified four patterns of recovery trajectories: immediate recovery duration (IRD) which had 0 weeks of lifestyle disruption, short recovery duration (SRD) which had 1–6 weeks of disruption, moderate recovery duration (MRD) which had 7–14 weeks of disruption, and extreme recovery duration (ERD) which had 15 weeks or more of disruption. Figure 4 shows the spatial distribution of these secondary clusters. Figure 5 shows the data distribution and median values of the recovery trajectories of essential and non-essential lifestyles for each of the sub-clusters, where each CBG in the sub-cluster is represented by a point. Examples of the individual recovery rates of CBG is on Figure SI.5 in the Supplementary Information.

Fig. 4: Spatial distribution of the twelve secondary clusters and the four types of recovery trajectories.
figure 4

Notes: The maps show the spatial distribution of the secondary clusters including the immediate recovery duration (IRD) which had 0 weeks of lifestyle disruption, short recovery duration (SRD) which had 1–6 weeks of disruption, moderate recovery duration (MRD) which had 7–14 weeks of disruption, and extreme recovery duration (ERD) which had 15 weeks or more of disruption.

Fig. 5: Box-whisker plots of the duration of recovery for secondary clusters which show prominent essential and non-essential POIs for the primary clusters.
figure 5

Notes: The box-whisker plots display the distribution of the duration of recovery for essential and non-essential lifestyles of the CBGs in the secondary clusters. The icons and percentages show the relative highest frequency of visits to the essential and non-essential POIs for the primary clusters. Each CBG is represented by a point on the plot.

ANOVA testing found that the differences between the recovery trajectories of the secondary clusters were statistically significant at p < 0.05 (Table 2). This result shows that within the same primary cluster, or the same lifestyle, there could be disproportionate rates of recovery. For example, Cluster 2 has a higher frequency of visits to medical facilities when compared to other clusters which suggest that medical facilities could be a distinguishing feature of the lifestyle of Cluster 2. However, Cluster 2 had three rates of recovery meaning that certain CBGs had an immediate recovery duration, a short recovery duration, or an extreme recovery duration. For certain clusters, essential lifestyle recovery occurred before non-essential lifestyles and vice versa, which shows that lifestyle recovery may not be sequential.

Table 2 ANOVA testing on essential and non-essential lifestyle recovery for secondary clusters within primary clusters.

Exploration of demographics and flooding to lifestyle recovery

The research also examined lifestyle recovery patterns in connection to demographics and flooding. For the secondary clusters, Table 3 summarizes the count of CBGs, the median values and percentages of demographics across the CBGs, and the percentage of CBGs that have a 1% flooding threshold. The 1% flooding threshold means that at least 1% of the CBG area experienced flooding impacts. To clarify, the research examined the disruption of lifestyles from people’s home CBG to their ability to accommodate their essential and non-essential lifestyles. We consider the impact of flooding at the home CBG level, and we do not consider the flooding disruptions in accessibility to essential and non-essential lifestyles (e.g., flooding at critical roads to reach lifestyles).

Table 3 Median values of the demographics for the secondary clusters.

The results showed some variation between the demographic attributes of populations and their lifestyle recovery; however, the results did not have a general pattern of lifestyle recovery for different race and income groups to yield statistically conclusive findings. ANOVA testing found no statistical differences between the mean or median values of the demographic characteristics for the three lifestyle recovery trajectories within each lifestyle cluster. It is worth noting the demographic differences from an exploratory perspective, which can be further investigated in future research. For instance, Cluster 2-1 experienced extreme recovery duration (15 or more weeks) along with had a $32,674 median income and a 60% minority population. Meanwhile, Cluster 2-2 had an immediate recovery duration (0 weeks) but a median income of $67,857 and a 19% minority population. Thus, Cluster 2-1 had an extreme recovery duration coupled with lower income and higher percentage of minority population while Cluster 2-2 had an immediate recovery duration with higher income and lower percentage of minority population. This demonstrates the possible social disparities of recovery trajectories associated to vulnerable groups within similar lifestyles. However, the opposite trend is observed for Cluster 3. Cluster 3-0 had an extreme recovery duration (15 or more weeks) with a median income of $61,319 while Cluster 3-1 had a short recovery duration (2 weeks) with a median income of $47,721. Furthermore, the results indicate that the flooding of the home CBGs are not always connected with impacts of lifestyle recovery. Approximately 89% of non-flooded home CBGs still experienced at least 1 week of disruption in lifestyles, which emphasizes that the impacts on lifestyle can extend beyond direct flooding of the household. Particularly, Cluster 2-2 had 33% of home CBGs with at least 1% flooding and had immediate recovery duration (0 weeks) while Cluster 4-2 which similarly experienced 33% of home CBGs with at least 1% flooding had extreme recovery duration (15 or more weeks). However, Cluster 2-2 did hold a higher median income of $67,857 while Cluster 4-2 held a lower median income of $37,283. In this particular instance, lower income and not flooding percentage could be associated with the longer recovery duration. Though the research study did not find statistically significant relationships, these exploratory connections bring attention to the role of demographics and flooding in the household. Thus, future research should investigate the interrelationships of demographics and flooding for the lifestyle analysis of other disasters.

Closer examination of extreme recovery duration of lifestyles

We further examined disparities in lifestyle recovery by focusing on the CBGs in secondary clusters which had extreme recovery duration (15 or more weeks). Approximately 500 CBGs from Cluster 1-1, 4 CBGs from Cluster 2-1, 92 CBGs from Cluster 3-0, and 12 CBGs from Cluster 4-0 did not recover to 90% of their essential and non-essential lifestyles after 15 weeks. Table 4 shows the median values of the demographics from these CBGs as well as the count of CBGs, the percentage from the secondary and primary clusters, and the percentage of CBGs with 1% flooding.

Table 4 Median values of the demographics for CBGs with extreme recovery duration (15 or more weeks for essential and non-essential lifestyles).

Figure 6 displays the spatial distribution of the CBGs with extreme recovery duration (15 or more weeks) including their associated secondary clustering. For example, Cluster 2-1 and Cluster 4-2 had extreme recovery durations (15 or more weeks) but these clusters had median incomes below the median incomes of all CBGs. However, Cluster 1-2 and Cluster 3-0 also had extreme recovery durations (15 or more weeks), and these clusters had median incomes above the median income of all CBGs. Though these secondary clusters had different ranges of income, they still exhibited the same lifestyle recovery. This result suggests that people with different demographic attributes can have similar lifestyle patterns as well as similar lifestyle recovery. Furthermore, home CBGs with an extreme recovery duration may not always be subjected to any direct flooding. In total, 297 CBGs in Cluster 1-2, 2 CBGs in Cluster 2-1, 50 CBGs in Cluster 3-0, and 8 CBGs in Cluster 4-2, which equates to almost 59% of the CBGs with extreme recovery duration, did not experience at least 1% flooding of the area. The findings further indicate that the mechanisms driving the recovery of lifestyles may be beyond the extent of flooding in the household even in the most extreme circumstances. It demonstrates that the spatial structure of communities, which is primarily formed by human mobility and distribution of POI facilities, extends the spatial reach of flood impacts on people’s lifestyles regardless of the flood status of their home CBG. Due to the scope of the dataset, we do acknowledge that the research did not investigate flooding in relation to people’s ability to access essential and non-essential lifestyles. As location-intelligence data grows for the disaster research field, future studies can examine how to trip distance to lifestyles from home CBGs can play a role in the recovery process.

Fig. 6: Spatial distribution of CBGs with extreme recovery duration (15 or more weeks of essential and non-essential) at the secondary cluster level.
figure 6

Notes: The spatial distribution of the CBGs with extreme recovery duration (15 or more weeks) labeled with their associated primary cluster. Approximately 500 CBGs from Cluster 1-1, 4 CBGs from Cluster 2-1, 92 CBGs from Cluster 3-0, and 12 CBGs from Cluster 4-0 did not recover to 90% of their essential and non-essential lifestyles.

Discussion

Current strategies for measuring community recovery rely heavily on qualitative data which can be prone to subjectivity and biases, put the burden of data collection on impacted people, and provide lagging indicators of recovery (Kontokosta & Malik, 2018; Rosenblum et al., 2021). Recognizing these knowledge gaps, the research presents a transformative way to efficiently quantify and proactively monitor community recovery through lifestyle analysis. It argues that lifestyles could be a critical milestone of the overall recovery process. The recovery of lifestyles captures the combined effects of people’s activities, restoration of infrastructure, and access to businesses and services in the community. In the research methodology, we leveraged location intelligence data at the CBG level to specify distinct patterns of lifestyles and to quantify the extent of disaster impacts and speed of recovery. While location intelligence data has been utilized for examining other disaster-related phenomena such as evacuation (Han et al., 2019; Wang & Taylor, 2014, 2016), flooding impacts (Farahmand et al., 2022; Farahmand et al., 2021; Fotovatikhah et al., 2018), community resilience (Yabe et al., 2020; Yuan et al., 2021), and access to facilities (Lenormand et al., 2015), Fan et. al (2022), no prior work had attempted to leverage location intelligence data in the context of recovery of lifestyles. In particular, the integration of population lifestyle patterns with other relevant datasets is novel. It overlays human mobility, POI facility location, building footprint, sociodemographic, and flooding datasets to uncover lifestyle patterns over space and time. The following will discuss the implications of the significant findings of the research as well as connections to how decision-makers can interpret and adjust the methodology.

Characterizing communities based on dependence on essential and non-essential lifestyles

Restoration of these essential lifestyles, such as those which provide food, water, shelter, and health services, can promote overall community recovery. However, individuals may have distinct relationships with these essential and even non-essential lifestyles. In the case of Harris County, the clustering analysis identified four distinctive lifestyles with subtle differences in the dependencies to POIs. When compared to other clusters, Cluster 2 is more dependent on medical facilities (32%) which may also suggest that the recovery of Cluster 2 is also more linked to the recovery of medical facilities. In addition, Cluster 1 has the highest relative frequency of visits to health and personal stores (77%) and stores and dealers (66%) while Cluster 4 has the highest relative frequency of visits to grocery stores (28%) and beauty care (22%). These findings can help emergency managers understand which lifestyles are critical components of the standard lifestyles within a community. Such information can better prioritize the restoration of essential and non-essential lifestyles to efficiently maintain the well-being of the disrupted areas. For example, resources could be directed to restoring medical facilities for Cluster 2 and grocery stores for Cluster 1. Though the classification of essential and non-essential lifestyles is supported by previous studies (FEMA, 2020; Podesta et al., 2021), these could be adjusted by decision-makers based on the community’s needs. For example, banks could be considered critical to the community’s economic activity based on the decision-makers.

Detecting disproportionate rates of recovery trajectories

Furthermore, the results advance our understanding of differential rates of recovery among locations and populations by analyzing the recovery of lifestyles at the CBG level. In examining Harris County, there were four diverse recovery trajectories which included immediate recovery duration, short recovery duration, moderate recovery duration, and extreme recovery duration. These recovery trajectories also indicate that the recovery of essential and non-essential lifestyles does not have a fixed sequence meaning that essential lifestyle recovery can occur before or after non-essential lifestyle recovery. Within the same essential and non-essential lifestyle clusters, there are disproportionate rates of recovery. Thus, decision-makers must not only consider the unique lifestyles in a community but also the different abilities and capacities to recover from these lifestyles. For example, medical facilities may be prioritized for Cluster 2; however, further prioritization may be directed to areas that have extreme recovery duration. The current approach to recovery is rather reactive; however, by building a foundational knowledge of lifestyle, decision-makers could be more proactive in detecting potentially disproportionate rates of recovery and efficiently allocating resources.

Considering the impacts on vulnerable populations

The distinct demographics of these home CBGs could inform how emergency managers and public officials prioritize and manage the recovery of these areas as they can thoroughly understand which populations are being impacted. Disaster research argues that socially vulnerable populations, including those of low income and a high percentage of minority residents, have unique challenges to recovering from disasters (Mitsova et al., 2019; Mitsova et al., 2018). Literature has shown that such populations experience greater hardships and have lower capacities to withstand disrupted services (Coleman et al., 2020a, 2020b). Indeed, demographic characteristics are rooted in equitable recovery literature and lifestyles signal the direct needs and dependencies of populations. Though the data did not show an overall statistically significant pattern, it did uncover instances of disparity such as the extreme duration recovery and low median incomes in Clusters 2 and 4. Decision-makers should be on alert that delays in lifestyle recovery for vulnerable populations could lead to significant well-being impacts. Combining location-intelligence data with publicly available data on demographic information could assist in detecting communities that may face exacerbated impacts from disrupted lifestyles.

Understanding the extent of flooding on lifestyle patterns

The findings imply that the flooding impact on lifestyles extends beyond the direct flood exposure to home CBGs. The spatial structures of communities may be more formed by human mobility and facility distribution. Hence, populations living in non-flooded home CBGs could still experience significant disruption to their essential and non-essential lifestyles. This is demonstrated as approximately 89% of non-flooded home CBGs still experienced 1 week of disruption and 58% of home CBGs with extreme recovery duration were also non-flooded home CBGs. The finding reveals that greater investment should be made to study the network dynamics and spatial spillover effects of flooding and lifestyle populations. Decision-makers should take note of the systemic effects of flooding on the broader community. This means that even home CBGs that do not experience direct flooding can be negatively impacted and must be considered in the recovery process. Flooding could instead impact the ability of residents to meet their lifestyle needs by disrupting the normal route of travel. The connection between trip distance and the inability to access essential and non-essential lifestyles is outside the scope of this research; however, future studies can examine this as a potential avenue of the impact on lifestyles.

Steps for future research

The research findings will advance our understanding of lifestyle recovery as a critical milestone of overall community recovery. Future research could evaluate the underlying mechanisms of lifestyle recovery and other influencing factors such as demographics and flooding in several methods. First, future research could investigate how the trip distance and travel time to essential and non-essential facilities could play a role in lifestyle recovery. Second, supervised learning methods (such as regression, decision trees, and neural networks) could predict how sociodemographic features of the population and the attributes of a hazard event influence the recovery of lifestyles. Third, the analysis of lifestyle patterns will increase in reliability as more data is collected for population activity and human mobility on a broader temporal and finer spatial granularity. With a combined effort, future research will further the understanding of the social, physical, and environmental disparities associated with lifestyle recovery.

Concluding recommendations

This study provides a novel approach to quantifying and examining community recovery based on the fluctuations of population lifestyles. The characterization of community recovery would enable proactive monitoring of when communities bounce back at granular spatial scales. The results also could inform decision-makers and emergency managers in the following aspects:

  • First, the findings show that decision-makers should understand the relationship of communities to essential and non-essential services during normal periods. Such knowledge can set an initial state of the community to compare future disruptive events. By characterizing lifestyles early-on, decision-makers can prioritize the restoration of important services based on how the population depends on certain essential and non-essential services in their lifestyle.

  • Second, even within similar lifestyles, there are differential rates of recovery in the aftermath of disruptive events. This insight can bring attention to vulnerable populations and areas in a community that may not have the capabilities to recover from lifestyle disruptions.

  • Third, the research shows that demographics and flooding extent could partially explain the differential rates of lifestyle recovery. Using publicly available datasets, decision-makers could preemptively know which areas are prone to extreme recovery durations based on vulnerable demographic characteristics and flooding extent.