Cross-platform analysis of public responses to the 2019 Ridgecrest earthquake sequence on Twitter and Reddit

Ruan, Tao; Kong, Qingkai; McBride, Sara K.; Sethjiwala, Amatullah; Lv, Qin

doi:10.1038/s41598-022-05359-9

Download PDF

Article
Open access
Published: 31 January 2022

Cross-platform analysis of public responses to the 2019 Ridgecrest earthquake sequence on Twitter and Reddit

Tao Ruan¹,
Qingkai Kong²,
Sara K. McBride³,
Amatullah Sethjiwala¹ &
…
Qin Lv¹

Scientific Reports volume 12, Article number: 1634 (2022) Cite this article

3589 Accesses
19 Citations
15 Altmetric
Metrics details

Subjects

Abstract

Online social networks (OSNs) have become a powerful tool to study collective human responses to extreme events such as earthquakes. Most previous research concentrated on a single platform and utilized users’ behaviors on a single platform to study people’s general responses. In this study, we explore the characteristics of people’s behaviors on different OSNs and conduct a cross-platform analysis of public responses to earthquakes. Our findings support the Uses and Gratification theory that users on Reddit and Twitter are engaging with platforms that they may feel best reflect their sense of self. Using the 2019 Ridgecrest earthquakes as our study cases, we collected 510,579 tweets and 45,770 Reddit posts (including 1437 submissions and 44,333 comments) to answer the following research questions: (1) What were the similarities and differences between public responses on Twitter and Reddit? (2) Considering the different mechanisms of Twitter and Reddit, what unique information of public responses can we learn from Reddit as compared with Twitter? By answering these research questions, we aim to bridge the gap of cross-platform public responses research towards natural hazards. Our study evinces that the users on the two different platforms have both different topics of interest and different sentiments towards the same earthquake, which indicates the necessity of investigating cross-platform OSNs to reveal a more comprehensive picture of people’s general public responses towards certain disasters. Our analysis also finds that r/conspiracy subreddit is one of the major venues where people discuss the 2019 Ridgecrest earthquakes on Reddit and different misinformation/conspiracies spread on Twitter and Reddit platforms (e.g., “Big one is coming” on Twitter and “Nuclear test” on Reddit).

Improving microbial phylogeny with citizen science within a mass-market video game

Article Open access 15 April 2024

Worldwide divergence of values

Article Open access 09 April 2024

Anger is eliminated with the disposal of a paper written because of provocation

Article Open access 09 April 2024

Introduction

Online social networks (OSNs) have become an essential component of people’s everyday life and these different platforms serve as important hubs for public expression and interactions. Meanwhile, although most people use OSNs merely as a way of recording daily life, the potential insight behind the social media data goes far beyond. Previous studies have utilized data collected from OSNs to analyze public responses to extreme events including natural hazards or major social events^1,2,3. While most of these existing studies characterize one single OSN in the context of specific events, our study explores a different perspective: Given the wide variety of OSNs, can the investigations on different platforms reveal a more comprehensive picture of people’s general public responses towards certain disasters? Do people behave similarly on different platforms and can we gain new insights using data collected from multiple online social platforms or channels? Although researchers consider social media as an active fertile ground for contemplation and study, these publications frequently focus on one platform only or are limited to Twitter and Facebook. Nevertheless, like Reddit, other platforms can provide diverse insights as users engage with that channel differently than Twitter, particularly during earthquakes, based on our study.

To answer these questions, using a unique earthquake sequence that occurred in Southern California (SoCal) in 2019 as a case study, we provide an empirical illustration of people’s cross-platform engagement on two leading OSNs, Twitter and Reddit, a leading microblog platform and a popular news aggregation platform⁴. While Twitter leverages a following-follower structure, Reddit centers around subreddits which are communities with different interests⁵.

The Ridgecrest earthquake sequence is relatively unique because two major earthquakes (one M6.4 foreshock and one M7.1 mainshock) hit the same area (Trona/Ridgecrest, CA) within a short period of time (July 4 to July 5). It was the first large earthquake(s) to be felt widely in Southern California in 20 years⁶. This unique event sequence provides an opportunity to study people’s responses and the potential cumulative effects of such sequential extreme events⁷. Further complicating the earthquake response online, this was the first time that ShakeAlert, the earthquake early warning system of the West Coast of the USA, would have been able to provide alerts. However, the publicly available app, developed by the City of Los Angeles, did not alert users⁸, which caused users to react in a variety of ways online.

The use of social media channels to communicate and information-seek has some theoretical basis. We note that the Uses and Gratifications theory is particularly useful for our research, as it suggests that people access specific media channels based on how this channel reflects their personal values or sense of self⁹. Massey et al.¹⁰ found that the Uses and Gratifications theory applies to information-seeking behavior during the 1989 Loma Prieta earthquake. Further, this theory was explored in the 2015 Nepal earthquake with Twitter¹¹. These networks, such as Reddit and Twitter, can also benefit crisis response as communities of online volunteers self-organize to assist from around the world^12,13. Uses and gratifications theory contributes to our understanding why people may choose these different channels as well as how they express emotions, share information, and converse with each other about these events.

Given the importance of social media during a crisis, we found a dearth in the literature about how people responded to the same natural hazard on different OSNs. The vast majorities of studies focus on Twitter or Facebook, but rarely multiple platforms or channels in tandem, to compare and contrast this discourse. In a recent study by McBride et al.¹⁴, news media and social media discourse were compared and contrasted however this is not a common method. In this work, building upon a previous study by Ruan et al.⁷, which analyzed public responses to the 2019 Ridgecrest earthquakes on Twitter, we conduct a cross-platform analysis that combines Twitter and Reddit data to explore the following questions regarding public response to the earthquakes:

RQ1: What were the similarities and dissimilarities between users’ responses on Twitter and Reddit?

RQ2: Considering the different platforms of Twitter and Reddit, what is unique about people’s responses on Reddit as compared with Twitter?

We have collected a dataset containing 510,579 earthquake-related tweets and 45,770 Reddit posts (1437 submissions and 44,333 comments) posted between July 3rd and July 10th. After careful data preprocessing procedures, we compare the topics, emotions, and temporal variations between Twitter and Reddit.

By focusing on cross-platform OSN analysis of public responses to natural hazards, our work makes the following contributions:

(1)
Extracted Reddit posts (including submissions and comments) that are related to the 2019 Ridgecrest earthquake sequence, along with filtering such as checking the ratio of earthquake-related comments under individual submissions;
(2)
Identified different responses to the same earthquakes on Reddit and Twitter. More specifically, users’ responses on Reddit were much less emotionally negative and covered more diverse topics than those on Twitter;
(3)
Identified the most popular subreddits discussing earthquakes during the Ridgecrest earthquake sequence, and one of the most popular venues was r/conspiracy, indicating that rumor discussions may be more prevalent than expected; and
(4)
Discovered diverse responses in different subreddits, as reflected by users’ response time and conversation networks in the main subreddits during the earthquake sequence.

Related work

Use of social network during extreme events

Social media has become an increasingly important tool for people’s communication and news aggregation. Twitter and Reddit are two of the leading OSNs. Researchers have been utilizing the OSNs to collect information for extreme events, including both natural hazards and social events^1,2,3. Case studies include photos of the 2007 Southern California wildfire¹⁵, the 2010 Haiti earthquake¹⁶, the 2017 Hurricane Harvey¹⁷, the 2019 Indonesia fire¹⁸, and earthquake detection using Tweets^19,20,21. Some closely relevant research also discussed people’s responses to natural hazards, such as earthquakes²², the 2012 Hurricane Sandy²³, the 2015 Typhoon Etau²⁴, the 2016 Hurricane Matthew²⁵, and the recent COVID-19²⁶. Most studies only focused on a single platform, and there is limited work on cross-platform analysis. However, many different platforms are becoming increasingly popular and people use them with different motivations²⁷. Therefore, it is important to understand whether these studies on one single OSN can give us a full picture of people’s general responses to extreme events.

Topic modeling and emotion analysis for short text

The most well-known technique for topic modeling is latent Dirichlet allocation (LDA)²⁸ and it is effective for analyzing long documents. However, most posts are relatively short on OSNs. For example, Twitter is based on messages (i.e., tweets) with a limit of 280 characters in length, in which case LDA is not suitable due to the rare word-occurrence. Previous research has proposed new algorithms designed for short text topic modeling. The literature on short text topic modeling describes four overarching categories: Dirichlet multinomial mixture (DMM) based methods²⁹, global word co-occurrence-based methods, self-aggregation based methods³⁰, and pseudo-document-based topic model³¹. In our study, we performed one of the DMM based method named GPU-PDMM and used another global word co-occurrence-based methods named word network topic model (WNTM)³² to verify the results because they have been shown to perform well on Twitter and Reddit data³³.

Emotion analysis can be regarded as a computational treatment of opinions, sentiments, and subjectivity of text in order to find the viewpoint of authors on specific entities^34,35. Linguistic Inquiry and Word Count (LIWC) is a software that has been widely used for emotion analysis in social media study³⁶. It evaluates the frequency of a certain corpus containing words in predefined psychological or structural categories^36,37. We utilized LIWC to analyze the proportion of positive/negative words in tweets/ Reddit posts and explore how people’s emotions vary on these two platforms.

Cross-platform OSN analysis

There are many different social media platforms driven by different conceptual frameworks and motivations, and they are used by different groups of people. These platforms can provide different types of information during the same disaster. As such, cross-platform OSN analysis can potentially generate useful new insights for crisis informatics from different perspectives. However, since most prior works concentrated on single-platform analyses, analyses are lacking in the social media research domain, especially in crisis informatics analysis, with cross-platform data. While study by Hall et al.³⁸ gives an overview that addresses the “methodological, analytical, conceptual, and technological challenges and opportunities of cross-platform analysis in social media ecosystems”, limited cross-platform analysis exists in the literature that uses cross-platform analysis to explore public responses in crisis^{39,40,41,42,43}.

Methodology

The 2019 Ridgecrest earthquakes

We focus this study on the earthquake sequence near Ridgecrest in Southern California in July 2019. A M6.4 foreshock occurred on July 4 at 10:33 a.m. PDT, and 34 h later, another M7.1 mainshock struck again on July 5 at 8:19 p.m. PDT along with more than 100,000 aftershocks⁴⁴. Given these earthquakes (M6.4, M7.1) were felt by a large number of people, we focus our study on these two events.

Public responses to earthquake sequence

We used Twitter and Reddit to conduct our study to investigate cross-platform public responses on OSNs during the earthquake sequence. Both Twitter and Reddit are leading OSNs used by millions of people globally but they have very different structures and mechanisms, thus people using them have different motivations²⁷. Unlike Twitter, which is based on micro-blogging of information with maximum of 280 characters, Reddit has a 40,000 character limit (https://www.reddit.com/r/changelog/comments/39hf9x/) and contain subreddits, where people can congregate for certain topics. Many of these subreddits are user-created, with thousands of different groups throughout the site⁴⁵. These subreddits bring together people by interests in specific topics, communities of practice, or geographical areas⁴⁶. Using User and Gratifications theory as a framework, we suggest that interactions with these subreddits align with people’s sense of self and values. In contrast, Twitter allows anyone to send and receive 280-character text messages (tweets) via any Internet-enabled device, such as a Web page, mobile device, or third-party Twitter applications. Twitter does not have similar community-level information as Reddit but Twitter allows users to follow each other and therefore forms another type of “community” created by follower–followee relationships and their postings. Twitter also has many accounts with verified identities while Reddit is anonymous. The different mechanisms of two platforms can provide complementary information to characteristic public responses. In previous Twitter analysis, the verified account information was used to explore how different accounts including authorities (e.g., @USGS: U.S. Geological Survey), news media (e.g., @latimes: Los Angeles Times) and celebrities responded in the earthquake sequence⁷.

Different aspects can be used for OSN analysis, including structures, content, and user behaviors⁴⁷, reflecting Uses and Gratifications theory. We compare the corpus on Twitter and Reddit from the following aspects: emotion, topic, and user responses. Emotion analysis and topic modeling are two effective approaches to capturing how people felt about the events and what topics attracted people’s attention. Response time is another aspect that can be used to examine how responsive the users were on different platforms to these earthquakes.

Owing to the difference between Twitter and Reddit, even though there are some conversations on Twitter, most tweets are not replied to or retweeted, while on Reddit, people’s conversations are more pervasive. Some users posted submissions while other people then discussed the post in the comments. Therefore, those conversations between users represent critically important content on Reddit. Due to the special mechanism of Reddit, we also performed the following analysis based on its unique features, e.g., subreddits and user conversations:

1.
We examined diverse behavior by users on the different subreddits during the earthquake. Specifically, we examined users’ response time in the main subreddits;
2.
Based on the conversations of users, we constructed earthquake-conversation networks in those subreddits. We visualized these networks and used some quantitative measurements to quantify the differences among them.

Twitter data collection and filtering

Our Twitter data were collected from Pushshift (https://pushshift.io), which utilized the Twitter Stream API to obtain 25,376,348 tweets from July 3 to July 10, 2019, around the epicenter of the M6.4 foreshock. Note that July 3 to July 10, 2019 is the time period for the data collection since it covers the Ridgecrest earthquake sequence, but in the later analysis, we only need to focus on a shorter time period around the event. In order to select the earthquake-related tweets, we used the following keyword list to filter the relevant tweets: ‘earthquake,’ ‘gempa,’ ‘temblor,’ ‘terremoto,’ ‘sismo’ from a previous research by Earle et al.⁴⁸, and ‘aftershock,’ ‘epicenter,’ ‘tremor,’ ‘seismometer,’ ‘seism,’ ‘seismology,’ ‘seaquake,’ ‘epicentre,’ ‘seismicity,’ ‘ridgecrest,’ ‘ridgecrestearthquake,’ ‘quake’ as new keywords in our research.

However, some of the remaining tweets were not relevant to the earthquakes of interest. We excluded earthquakes that happened in other places through another keyword list: ‘canada,’ ‘british,’ ‘vancouver,’ ‘china’. A soccer club named San Jose Earthquakes Soccer Club also led to many irrelevant tweets so we used ‘sjearthquake,’ and ‘quake74’ to remove them. Finally, we verified the language feature in the raw data and only kept the English tweets, which resulted in 510,579 tweets in the end , which were contributed by 314,583 unique Twitter users (1.62 Tweets/user)).

Reddit data collection and filtering

The Reddit data were also collected from Pushshift⁴⁹. Reddit has a different structure from Twitter and two different datasets were provided: RS (i.e., Reddit submissions) and RC (i.e., Reddit comments). Pushshift maintains all the Reddit data in its database and releases monthly Reddit data. We, therefore, used the RS and RC for July 2019. Unlike the Twitter data, in which we limited tweets geographically around the epicenter, the Reddit data were from the whole platform and therefore included much more irrelevant data. We performed more complicated filtering to further refine the earthquake-related data.

Figure 1 elaborates our preliminary data filtering process. Because the Reddit raw dataset is stored on a monthly basis, we need to start from the whole July 2019 dataset. First, we traversed the RC_2019_07 (Reddit Comments in July 2019) dataset and used the same keyword listed above to obtain all the earthquake-related comments (about 8 million). Then based on the ‘link_id’ feature of those comments, we retrieved 27,208 corresponding submissions. Meanwhile, we also used the same keyword list to directly check the RS_2019_07 (Reddit Submissions in July 2019) dataset and extracted 14,991 submissions. The two sets of submissions (39,153 in total due to some overlap) and their comments constituted our preliminary earthquake-related Reddit posts.

However, this preliminary collection still contains “noisy” data. We discovered that some comments were related to earthquakes but most of the other comments were not. For example, we found some popular sport game threads during our study period had a number of related comments but few users mentioning the actual earthquakes. In order to exclude such cases, we further filtered the Reddit posts. For the first set of submissions (from comments’ ‘link_id’), we checked their comments. The submissions were retained only when the ratio of comments containing earthquake-related keywords was larger than 15% and the number of such comments is larger than 5. For the second set of submissions (from directly checking RS_2019_07), when the earthquake-related comment ratio is more than 15%, the submissions were kept. The second submission set used a looser standard because we found submissions were much more likely to be related to the earthquake topic if the submission body included earthquake-related keywords. Following this method, we collected 45,770 Reddit posts (including 1437 submissions and 44,333 comments), which were contributed by 25,462 unique Reddit users (1.79 posts/user), for Reddit analysis. Figure 2 shows the number of Reddit submissions and comments in a 15-min time window after our filtering process. Similar to the findings in⁷, two peaks of activity started shortly after the actual occurrence of the two major earthquakes, which verifies the rationality of our filtered Reddit data. Besides 15-min time window, we also examine other time windows including 5-min, 10-min and 30-min. All different time windows present consistent results, and we pick 15-min here because the result is smooth and also representative.

Preprocessing

Before applying topic modeling on the tweets or Reddit posts, preprocessing was required. In our study, we used standard natural language processing methods to preprocess all the corpus. All the mentions(@), hashtags(#), punctuation, and URLs were removed through regular expressions. We also used the simple_preprocess function provided in the gensim python package to strip tags, punctuation, multiple white-spaces, short words, and digits as well as remove stop words. All sentences were lower-cased, tokenized, and de-accented so that a list of tokens were obtained for each tweet or Reddit post and they were prepared for topic modeling. Finally, we removed all non-English tokens using the English dictionary.

Time division

In order to compare people’s responses to the two earthquakes on the OSNs, we used time windows. The first time window is between the foreshock and mainshock, while the second one is from the mainshock and with the same length as the first one. These two windows of the same length can help us directly compare the time periods.

Comparison of Twitter and Reddit data

In this section, we aim to answer RQ1: What were the similarities and dissimilarities of the earthquake-related public responses between Twitter and Reddit?

Emotion analysis

We used LIWC (Linguistic Inquiry and Word Count)³⁶, which has been introduced in the “Related work” section, to detect the proportion of emotional (positive vs. negative scores) language used in the different corpora. LIWC is a popular software widely used in social media research that counts words in psychologically meaningful categories. Two corpora are constructed from the tweets and Reddit data before and after the mainshock separately, which represent people’s responses after the foreshock and after the mainshock on the two different OSNs, respectively.

We plot the time series of the mean negative/positive LIWC scores in every 15-min time window in Fig. 3 to illustrate the temporal difference of people’s emotions after the two earthquakes on Twitter and Reddit.

Emotion differences

Figure 3 plots people’s emotional dynamics during the earthquakes. We observe significant differences in emotions on the two OSNs. As discussed in the previous study by Ruan et al.⁷, Twitter users became increasingly anxious after the mainshock, shown by the immediate large deviation of positive and negative LIWC scores after the mainshock. People’s overall negative emotion was larger than positive emotions. However, Reddit’s patterns scored higher positively than negatively. These positive scores may indicate that users tend to express their negative feelings on Twitter, possibly due to the word limit restrictions on Twitter. Twitter is also used widely for the ongoing discussion and instant evaluation of newsworthy events⁵⁰. Therefore, people may have rushed to Twitter and expressed their immediate feelings shortly after the event. In contrast, Reddit does not have a length limit and is more like a news aggregation platform. People have more time to discuss and reflect on events, rather than express their immediate emotions about the earthquakes.

Topic modeling

As discussed in the “Related work” section, GPU-PDMM was employed in our study to compare people’s response topics after the major earthquakes on two different OSNs. More specifically, we applied GPU-PDMM on each corpus after the foreshock and mainshock respectively. The output keywords by GPU-PDMM can help identify people’s focuses after the two earthquakes on different platforms. In addition, WNTM was applied to compare with GPU-PDMM results. There was considerable overlap between the topics output from the two topic modeling approaches (67% overlapping topics for foreshock, 80% overlapping topics for mainshock). We used the GPU-PDMM results for further analysis since GPU-PDMM utilizes the semantics of words (word-embedding) and its modeling results tend to have better cohesion.

We explored different topic numbers in the application (5, 10, 15, etc.) and manually compared the quality of the topics identified. We found that using 15 topics yields the most explicit and comprehensive topics. Meanwhile, we utilized coherence score for assessing the quality of the learned topics and 15 topics output a reasonable topic coherence.

Topic modeling results

Tables 1 and 2 show the output keywords of topics on Reddit by GPU-PDMM for the foreshock and mainshock. Comparing the results with the topics on Twitter⁷, Reddit topics have obviously different features from those from Twitter. As discussed in the previous Twitter analysis work, emotional keywords can be observed in many topics and thus emotional (either positive or negative) topics were common both after two earthquakes on Twitter. However, Tables 1 and 2 contain fewer emotional keywords. Instead of expressing personal emotions, Reddit posts are descriptions, covering a vast range of topics. We also observed note-worthy topics on Reddit that never appeared on Twitter. For instance, the Alert topic clearly indicates that people were discussing the earthquake early warning and performance expectations. As discussed in the “Methodology” section, during our preprocessing steps we followed the standard approach to remove all the words that are not in the English dictionary, and “ShakeAlert” was therefore removed. That is why “ShakeAlert” is not in the list of keywords. To the best of our knowledge, ShakeAlertLA was the only EEW app that could send alerts to the Los Angeles region at that time. It is possible that some of the people were talking about other earthquake notification apps but the topic modeling process combined them into the same topic.

The Nuclear topic focused on one rumor that the earthquake was due to a nuclear bomb test in the Naval Air Weapons Station (NAWS) at China Lake near the epicenter. The Hazards topic indicates that Reddit users also discussed other natural hazards (e.g., hurricane, tornado) when the earthquakes happened.

There were a few topics that crossed platforms. For example, the Big one has been shown to be a popular rumor topic on Twitter⁷ (Big one means an extremely large earthquake of M7.8 or even higher striking California) and people were also actively talking about it on Reddit. The Self-rescue and Preparedness are two other common topics on both platforms that focus on how people should protect themselves (e.g., whether people should run or hide and where is the safest place indoor) and what should be prepared (e.g., food, water, toolkit, etc.) during natural hazards. These topics generated interest and discussions from users on both platforms which may represent more general public concerns after the large, impactful earthquakes. It is notable that “Self-rescue” topic appears on Reddit both after foreshock and mainshock. Some of the keywords in this topic are very relevant to damage description such as “fall, building, shake”. We conclude the topic is about “Self-rescue ”due to the fact that when people talked about keywords like r“un, outside, stay, cover, safe, head” in Table 2, they were always discussing what is the correct choice during an earthquake: to run outside or stay inside and find someplace to cover the head. We assign the “Self-rescue” topic in Table 1 because this topic shares many keywords with the “Self-rescue” topic in Table 2 but we acknowledge that the “Self-rescue” in Table 1 is not as obvious as “Self-rescue” in Table 2.

The topic modeling results and the comparisons between Reddit and Twitter show that a single platform analysis can only partially cover the general public responses. When combining two or more OSNs we can potentially obtain a better comprehensive understanding of the public responses. This can assist science and emergency management agencies to discover and address issues of public interest more effectively. We can use the “Southern CA” topic after the foreshock as an example: this topic is an example of people reporting the specific locations that were severely affected by the earthquake. Typically such information needs to be collected by surveys (e.g., “Did You Feel It?” by USGS Earthquake Hazards Program). With the OSN data, we can obtain this information in a different way. Emergency management agencies can potentially get such kind of actionable information more promptly and allocate the rescue resources to the regions that need help most.

Table 1 Top 15 topics on Reddit after the M6.4 foreshock using GPU-PDMM. Topics on Twitter can be found in⁷.

Full size table

Table 2 Top 15 topics on Reddit after the M7.1 mainshock using GPU-PDMM. Topics on Twitter can be found in⁷

Full size table

Response time

To explore how people responded differently on the two OSNs, we use response time as a measure of response efficiency to compare them, which is defined as the time duration between the original post and retweets/comments of that post⁵¹.

Furthermore, we explored how users on those two platforms responded differently to the external information, such as the URLs pointing to other websites. We first extracted all the external URLs cited on the two OSNs and then obtained the common 197 URLs that appeared on both platforms.

Similar to what has been reported for Twitter users⁷, people’s response time on Reddit also followed the power-law phenomenon but had an obvious flat area as shown in Fig. 4. The reason for the flatter slope is due to the different mechanism that Reddit has: Reddit users may have posted the submissions in the evening and many other people replied to them the next morning. This was a common scenario on Reddit but not on Twitter.

Figure 5 plots the CDF of the response time difference for the common external URLs and the scatter plot for the posted time for the common URLs on two OSNs. As shown in the scatter plot, for the common URLs, Twitter users had much faster responses towards the external information than Reddit users since more points are below the y=x line. More specifically, 127 of the 197 common URLs were cited on Twitter first while only 70 appeared on Reddit first. The CDF plot also supports this observation: t(Reddit) - t(Twitter) \(\in \) [0, 6 h] has the highest bar, which indicates more common URLs appeared 0–6 h earlier on Twitter than Reddit. It is noteworthy that there are two peaks at around ± 30 h. This is because 34-h is the time between the foreshock and mainshock. Some common URLs were first cited on one platform after the foreshock and then repeated on the other platform after the mainshock.

Unique information retrieved from Reddit

Our second study aims to answer RQ2: Considering the different mechanisms between Twitter and Reddit, what unique information and insights about public responses can be gained from Reddit?

Subreddit analysis

To explore which subreddits were the most popular places where Reddit users talked about the Ridgecrest earthquakes, we examined each post’s subreddit and sort the subreddits by the number of posts. We found many different subreddits got involved in the Ridgecrest earthquake discussions, among which the most popular subreddits were r/news (12,702), r/LosAngeles (3864), r/conspiracy (2414), and r/Earthquakes (1942). Those subreddits’ names represented different themes of the user groups: r/news is where the latest news is aggregated and discussed while r/LosAngeles is the nearest large metropolitan area where people felt light to moderate shaking, and r/Earthquakes focuses specifically on earthquakes. Note that r/conspiracy was also one of the most popular subreddits. According to previous research, many earthquake-related rumors indeed spread in California locally⁵². It is intriguing that the conspiracy became people’s main focus on Reddit, which has users from around the world and is not subject to geographic constraints.

Response time within subreddits

We examined how users in different subreddits behaved during the earthquakes. Specifically, we looked into users’ response time in these four main subreddits. Figure 6 shows the CDF of how users’ response time differs in these four subreddits.

The first two plots in Fig. 6 indicate that after the two earthquakes, the local subreddit (r/LosAngeles) and news subreddit were the quickest to respond, while the r/Earthquakes and r/conspiracy were slower. This is reasonable since, after the earthquakes, people first turned to the geographic subreddit, to gain local perspectives and news subreddits to information seeking from media outlets. Then people search or post more detailed information on r/Earthquakes; we suggest this may be because the events seemed less pressing. The conspiracies were attractive to users but our research suggests it requires more time to create conspiratorial associations or stories.

The bottom 4-panel plot in Fig. 6 intends to compare how users on the same subreddit responded differently after foreshock and mainshock. Notably, people responded faster to the foreshock than the mainshock in the three of the subreddits (r/conspiracy, r/Earthquakes, and r/LosAngeles). It may seem surprising considering the mainshock was much larger than the foreshock. One potential reason for this could be because the mainshock occurred in the evening thus most discussions occurred the next day. However, the r/news subreddit was different: timeliness is more important on r/news than others and users in this subreddit typically paid attention to recent news with fewer people would respond to news from the previous day.

Conversation networks

Based on the conversations between users, we constructed the conversation networks in the four popular subreddits.

To visualize the networks, we used Gephi⁵³ to obtain Fig. 7 which shows how users in the four subreddits had conversations with others during the earthquakes. The subreddits’ interaction patterns were also different even though they were talking about the same event.

To quantify their differences, we calculated some features for the four networks. We did not use measures such as diameter and density due to the different sizes of the subreddits. Instead, we utilized measures that are agnostic to network sizes, such as transitivity and reciprocity⁵⁴.

Table 3 Conversation network measures in different subreddits. Reciprocity is the likelihood of mutual connections. Transitivity is the probability that the adjacent nodes of a node are connected. The columns Mean, SD, Median, Min, Max are calculated for degrees. Note that reciprocity and transitivity calculations are based on the whole graph instead of each node so standard deviations cannot be derived as degrees.

Full size table

As illustrated in Table 3, we used degree (i.e., how many interactions with others per user), reciprocity (i.e., the likelihood of mutual connections), transitivity (i.e., the clustering coefficient, or the probability that the adjacent nodes of a node are connected) to compare different subreddits’ structures. From this table, we can see that r/conspiracy has high transitivity (indicating more clustered around a few nodes) and the highest reciprocity (indicating more mutual conversations); Subreddit r/news has very low transitivity (connections are relatively evenly spread among all the nodes), and very low reciprocity (evidence of hierarchical relationships-for instance, media contributing content but unlikely to interact much)⁵⁵. These different measures indicate that people turned to different subreddits for different purposes. As such, their interaction patterns can differ considerably across subreddits.

Summary and impacts

Our work presents a first-of-its-kind cross-platform analysis of public responses to the 2019 Ridgecrest earthquake sequence on two different social media platforms: Twitter and Reddit. We conclude the paper with a summary and its potential impacts.

Summary

In this work, we utilized the Reddit and Twitter data to analyze people’s responses across different social media platforms in response to the 2019 Ridgecrest earthquakes. We collected user responses from the two platforms related to the Ridgecrest earthquakes, which comprises 510,579 tweets and 45,770 Reddit posts (including 1437 submissions and 44,333 comments). When filtering earthquake-related Reddit posts, we combine keywords and the ratio of earthquake-related comments under submissions, which led to a more reasonable Reddit dataset related to the Ridgecrest earthquakes. Based on the refined datasets, we compare people’s behaviors on the two OSNs from different perspectives. We first compared users’ emotions during the Ridgecrest earthquakes on Reddit and Twitter. Our results suggest that Twitter users had communicated more negative emotions than Reddit users, especially after the mainshock. We also explored the topics discussed on the two OSNs. Topic modeling results supported the above emotion analysis results in that the Twitter corpus topics generated significantly more emotional keywords than that from Reddit while the Reddit corpus covered more diverse topics (e.g., Nuclear, Alert). We also examined the common external URLs on the two OSNs and explored whether Reddit and Twitter had different response patterns toward this external information. The results showed the responses to the external URLs on Twitter were more active and faster. Meanwhile, based on Reddit’s unique mechanisms, we discussed the different response patterns in the popular subreddits and explored the users’ conversations in those subreddits. We found that even on the same Reddit platform, people’s response patterns and behaviors can vary significantly, based on which subreddit they chose.

Impacts

Aggregated responses, which are then used to develop themes, can assist emergency managers and science agencies responsible for communicating with the public. By thematically analyzing and grouping major questions or points of concern, emergency managers’ communication can be more effective in times of crisis^56,57. This tactic was used by emergency managers during the M6.2 Christchurch earthquake response⁵⁸, the 2008 Wenchuan and 2013 Ya’an earthquakes⁵⁹, the 2019 Albania Earthquake⁶⁰, and by science agencies in the 2016 Kaikoura Earthquake⁶¹, among other examples. These types of analysis can assist agencies to launch more effective and targeted crisis communication responses, either via using the same social media channels (e.g., Twitter or Reddit), but we argue that given the value of the insights, use the questions to also frame media responses. By not listening to social media, opportunities to engage and answer questions online may be lost, as what occurred during the Bombay Beach Swarm in 2016¹⁴.

Through the combination of different OSNs and performing cross-platform analysis, we can potentially help science response and emergency management agencies to gain a more comprehensive understanding of people’s concerns and public awareness during extreme events. For instance, misinformation or conspiracies can spread after natural hazards, but different kinds of misinformation may exist on different platforms. In our study case, Twitter users were actively talking about “Big one is coming” while Reddit users were talking about the earthquakes being caused by a nuclear test. Those topics can help science agencies monitor what types of misinformation are being spread online and then take corresponding actions to correct them and therefore prevent them from misleading more people. However, we found little evidence of cross-platform social media analysis in previous research, even less can be found for crisis informatics during natural hazards. Our research can be regarded as initial steps encouraging the use of more diverse data sources for exploring social aspects of disaster resilience²⁵. Our work analyzes two different platforms during the 2019 Ridgecrest earthquake sequence that was felt by a large number of people and demonstrates that a single-platform analysis cannot fully represent general public responses, thus motivating more cross-platform analysis in the future to obtain a more comprehensive view. Furthermore, since these OSNs have different mechanisms, diverse methods need to be applied when extracting useful information from them.

In our research, we present a workflow of extracting useful information with different approaches on Reddit than the previous work on Twitter⁷, including filtering earthquake-related posts and performing specific analyses based on the unique structure of the Reddit platform (e.g., subreddit, conversations). Our methodology can be beneficial to Reddit analysis on other topics as well. Reddit has been largely overlooked as a platform for study, as opposed to Twitter, which has a voluminous body of research. Our results show that the combination of multiple OSNs, rather than a single platform, can help emergency managers and science response agencies obtain a more comprehensive understanding of public responses, which plays a prominent part in evaluating and enhancing collective actions for rapid reconnaissance, disaster preparedness, and recovery strategies¹⁷. Finally, our work is consistent with the Uses and Gratifications theory’s main argument: that users are attracted and use platforms that best reflect their values and perceptions of self⁹.

Limitations and future work

There are several limitations in this study that could be addressed in future work. First, our study is representative of the English-speaking population and people having some experiences with earthquakes (i.e., those living in California). However, there are other languages spoken in the United States, e.g., by the Spanish communities in California. Meanwhile, US citizens on the east coast who rarely experience earthquakes can respond differently than people living in California. Public responses to earthquakes in other non-English-speaking communities and people with less earthquake experience could be explored in future work. Second, OSN users do not represent all age-groups. Based on previous research, some platforms such as Reddit are mainly used by young people (18–29). Therefore, analysis performed on those platforms may only represent the responses of younger population. Third, different cultures can prefer different OSNs, for example, Twitter is heavily used in Indonesia, but less widely used in China and Russia⁶². Therefore, even though the techniques in this study can be applied with a change of keywords and language analysis for other events, researchers should also be aware of the relevant platforms for that region. Last, our study does not address the potential change of topic trending in extreme events, partially due to the short period of our analysis. However, other extreme events such as a hurricane can last for a longer time and affect larger areas. Hence, the topics can change markedly. In this case, keywords filtering will need to be adjusted based on current events and region of interest accordingly.

Data availability

Our Reddit data are from Pushshift website (https://pushshift.io) and the Twitter data are collected from Twitter Academic API. Earthquake-related data can be easily retrieved following the extraction methodology of the paper. The other data that support the results of this study are available from the corresponding author upon reasonable request.

References

Saroj, A. & Pal, S. Use of social media in crisis management: A survey. Int. J. Disaster Risk Reduct. 48 (2020). https://doi.org/10.1016/j.ijdrr.2020.101584
Tang, J., Yang, S. & Wang, W. Social media-based disaster research: Development, trends, and obstacles. Int. J. Disaster Risk Reduct., 102095 (2021). https://doi.org/10.1016/j.ijdrr.2021.102095
Lachlan, K. A., Spence, P. R., Lin, X. & Greco, M. D. Screaming into the wind: Examining the volume and content of tweets associated with Hurricane Sandy. Commun. Stud. 65, 500–518. https://doi.org/10.1080/10510974.2014.956941 (2014).
Article Google Scholar
Priya, S. et al. Where should one get news updates: Twitter or Reddit. Online Soc. Netw. Media 9, 17–29. https://doi.org/10.1016/j.osnem.2018.11.001 (2019).
Article Google Scholar
Ovadia, S. More than just cat pictures: Reddit as a curated news source. Behav. Soc. Sci. Libr. 34, 37–40. https://doi.org/10.1080/01639269.2015.996491 (2015).
Article Google Scholar
Hauksson, E. et al. The normal-faulting 2020 m w 5.8 Lone Pine, Eastern California, earthquake sequence. Seismol. Res. Lett. 92, 679–698. https://doi.org/10.1785/0220200324 (2020).
Article Google Scholar
Ruan, T., Kong, Q., Zhang, Y., McBride, S. K. & Lv, Q. An analysis of Twitter responses to the 2019 Ridgecrest earthquake sequence. In 2020 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data and Cloud Computing, Sustainable Computing and Communications, Social Computing and Networking (ISPA/BDCloud/SocialCom/SustainCom), 810–818 (IEEE, 2020). https://doi.org/10.1109/ISPA-BDCloud-SocialCom-SustainCom51426.2020.00127
Chung, A. I. et al. Shakealert earthquake early warning system performance during the 2019 Ridgecrest earthquake sequence. Bull. Seismol. Soc. Am. 110, 1904–1923. https://doi.org/10.1785/0120200032 (2020).
Article Google Scholar
Katz, E., Blumler, J. G. & Gurevitch, M. Uses and gratifications research. Public Opin. Q. 37, 509–523 (1973).
Article Google Scholar
Massey, K. B. Analyzing the uses and gratifications concept of audience activity with a qualitative approach: Media encounters during the 1989 Loma Prieta earthquake disaster. J. Broadcast. Electron. Media 39, 328–349. https://doi.org/10.1080/08838159509364310 (1995).
Article Google Scholar
Malasig, B. J. C. & Quinto, E. J. M. Functions of and communication behavior on Twitter after the 2015 Nepal earthquake. J. Komunikasi Malays. J. Commun. 32 (2016).
Beatson, A., Buettner, A. & Schirato, T. Social media, crisis mapping and the Christchurch earthquakes of 2011. MEDIANZ Media Stud. J. Aotearoa N. Z. 14 (2014). https://doi.org/10.11157/medianz-vol14iss1id105
Fischer, D. Social networking sites in the aftermath of a crisis-the enabling role for self-organization. In Proceedings of the 51st Hawaii International Conference on System Sciences (2018). https://doi.org/10.24251/HICSS.2018.012
McBride, S. K., Llenos, A. L., Page, M. T. & Van Der Elst, N. # earthquakeadvisory: Exploring discourse between government officials, news media, and social media during the 2016 Bombay Beach Swarm. Seismol. Res. Lett. 91, 438–451. https://doi.org/10.1785/0220190082 (2019).
Article Google Scholar
Liu, S. B. et al. In search of the bigger picture: The emergent role of on-line photo sharing in times of disaster. In Proceedings of the Information Systems for Crisis Response and Management Conference (ISCRAM), 4–7 (2008).
Yates, D. & Paquette, S. Emergency knowledge management and social media technologies: A case study of the 2010 Haitian earthquake. Int. J. Inf. Manag. 31, 6–13. https://doi.org/10.1016/j.ijinfomgt.2010.10.001 (2011).
Article Google Scholar
Rajput, A. A., Li, Q., Zhang, C. & Mostafavi, A. Temporal network analysis of inter-organizational communications on social media during disasters: A study of Hurricane Harvey in Houston. Int. J. Disaster Risk Reduct. 46, 101622 (2020). https://www.sciencedirect.com/science/article/pii/S221242091931595X. https://doi.org/10.1016/j.ijdrr.2020.101622
Hasfi, N., Fisher, M. R. & Sahide, M. A. Overlooking the victims: Civic engagement on Twitter during Indonesia’s 2019 fire and haze disaster. Int. J. Disaster Risk Reduct. 60, 102271. https://doi.org/10.1016/j.ijdrr.2021.102271 (2021).
Article Google Scholar
Earle, P. et al. OMG earthquake! Can Twitter improve earthquake response?. Seismol. Res. Lett. 81, 246–251. https://doi.org/10.1785/gssrl.81.2.246 (2010).
Article Google Scholar
Sakaki, T. et al. Earthquake shakes Twitter users: Real-time event detection by social sensors. In Proceedings of the 19th International Conference on World Wide Web, WWW ’10, 851–860 (2010). https://doi.org/10.1145/1772690.1772777
Poblete, B. Twicalli: An earthquake detection system based on citizen sensors used for emergency response in Chile. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 1359–1359 (2017). https://doi.org/10.1145/3077136.3096474.
Vo, B.-K.H. & Collier, N. Twitter emotion analysis in earthquake situations. Int. J. Comput. Linguist. Appl. 4, 159–173 (2013).
Google Scholar
Hughes, A. L. et al. Online public communications by police and fire services during the 2012 Hurricane Sandy. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 1505–1514 (2014). https://doi.org/10.1145/2556288.2557227
Kitazawa, K. & Hale, S. A. Social media and early warning systems for natural disasters: A case study of Typhoon Etau in Japan. Int. J. Disaster Risk Reduct. 52, 101926 (2021). https://www.sciencedirect.com/science/article/pii/S221242092031428X. https://doi.org/10.1016/j.ijdrr.2020.101926
Yuan, F., Li, M. & Liu, R. Understanding the evolutions of public responses using social media: Hurricane Matthew case study. Int. J. Disaster Risk Reduct. 51, 101798. https://doi.org/10.1016/j.ijdrr.2020.101798 (2020).
Article Google Scholar
Behl, S., Rao, A., Aggarwal, S., Chadha, S. & Pannu, H. Twitter for disaster relief through sentiment analysis for COVID-19 and natural hazard crises. Int. J. Disaster Risk Reduct. 55, 102101. https://doi.org/10.1016/j.ijdrr.2021.102101 (2021).
Article Google Scholar
Hughes, D. J., Rowe, M., Batey, M. & Lee, A. A tale of two sites: Twitter vs. Facebook and the personality predictors of social media usage. Comput. Hum. Behav. 28, 561–569. https://doi.org/10.1016/j.chb.2011.11.001 (2012).
Article Google Scholar
Blei, D. M., Ng, A. Y. & Jordan, M. I. Latent Dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003).
MATH Google Scholar
Yin, J. & Wang, J. A Dirichlet multinomial mixture model-based approach for short text clustering. In Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 233–242 (2014). https://doi.org/10.1145/2623330.2623715
Quan, X., Kit, C., Ge, Y. & Pan, S. J. Short and sparse text topic modeling via self-aggregation. In Twenty-Fourth International Joint Conference on Artificial Intelligence (2015).
Zuo, Y. et al. Topic modeling of short texts: A pseudo-document view. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2105–2114 (2016). https://doi.org/10.1145/2939672.2939880
Zuo, Y., Zhao, J. & Xu, K. Word network topic model: A simple but general solution for short and imbalanced texts. Knowl. Inf. Syst. 48, 379–398. https://doi.org/10.1007/s10115-015-0882-z (2016).
Article Google Scholar
Qiang, J., Qian, Z., Li, Y., Yuan, Y. & Wu, X. Short text topic modeling techniques, applications, and performance: a survey. IEEE Trans. Knowl. Data Eng.https://doi.org/10.1109/TKDE.2020.2992485 (2020).
Feldman, R. Techniques and applications for sentiment analysis. Commun. ACM 56, 82–89. https://doi.org/10.1145/2436256.2436274 (2013).
Article Google Scholar
Medhat, W., Hassan, A. & Korashy, H. Sentiment analysis algorithms and applications: A survey. Ain Shams Eng. J. 5, 1093–1113. https://doi.org/10.1016/j.asej.2014.04.011 (2014).
Article Google Scholar
Tausczik, Y. R. & Pennebaker, J. W. The psychological meaning of words: LIWC and computerized text analysis methods. JLS 29, 24–54. https://doi.org/10.1177/0261927X09351676 (2010).
Article Google Scholar
Pennebaker, J. W., Francis, M. E. & Booth, R. J. Linguistic inquiry and word count: LIWC 2001. Mahway: Lawrence Erlbaum Associates 71, 2001 (2001).
Google Scholar
Hall, M., Mazarakis, A., Chorley, M. & Caton, S. Editorial of the special issue on following user pathways: Key contributions and future directions in cross-platform social media research. https://doi.org/10.1080/10447318.2018.1471575 (2018).
Reuter, C., Ludwig, T., Kaufhold, M.-A. & Pipek, V. XHELP: Design of a cross-platform social-media application to support volunteer moderators in disasters. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 4093–4102 (2015). https://doi.org/10.1145/2702123.2702171
Shen, S., Murzintcev, N., Song, C. & Cheng, C. Information retrieval of a disaster event from cross-platform social media. Inf. Discov. Deliv.https://doi.org/10.1108/IDD-01-2017-0003 (2017).
Kaufhold, M.-A., Rupp, N., Reuter, C. & Habdank, M. Mitigating information overload in social media during conflicts and crises: Design and evaluation of a cross-platform alerting system. Behav. Inf. Technol. 39, 319–342. https://doi.org/10.1080/0144929X.2019.1620334 (2020).
Article Google Scholar
Backfried, G. et al. Cross-media analysis for communication during natural disasters. In International Conference on Advances in Information Technology, 13–22 (Springer, 2013). https://doi.org/10.1007/978-3-319-03783-7_2
Bossu, R., Laurin, M., Mazet-Roux, G., Roussel, F. & Steed, R. The importance of smartphones as public earthquake-information tools and tools for the rapid engagement with eyewitnesses: A case study of the 2015 Nepal earthquake sequence. Seismol. Res. Lett. 86, 1587–1592. https://doi.org/10.1785/0220150147 (2015).
Article Google Scholar
Ross, Z. E. et al. Hierarchical interlocked orthogonal faulting in the 2019 Ridgecrest earthquake sequence. Science 366, 346–351. https://doi.org/10.1126/science.aaz0109 (2019).
Article ADS CAS PubMed Google Scholar
Mills, R. A. Reddit. com: A census of subreddits. In Proceedings of the ACM Web Science Conference, 1–2 (2015). https://doi.org/10.1145/2786451.2786491
Hara, N., Shachaf, P. & Stoerger, S. Online communities of practice typology revisited. J. Inf. Sci. 35, 740–757. https://doi.org/10.1177/0165551509342361 (2009).
Article Google Scholar
Lim, B. H., Lu, D., Chen, T. & Kan, M.-Y. # mytweet via instagram: Exploring user behaviour across multiple social networks. In 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), 113–120 (IEEE, 2015). https://doi.org/10.1145/2808797.2808820
Earle, P. S. et al. Twitter earthquake detection: earthquake monitoring in a social world. Ann. Geophys. 54 (2012). https://doi.org/10.4401/ag-5364
Baumgartner, J., Zannettou, S., Keegan, B., Squire, M. & Blackburn, J. The Pushshift Reddit dataset. Proc. Int. AAAI Conf. Web Soc. Media 14, 830–839 (2020).
Google Scholar
Bruns, A. & Burgess, J. Researching news discussion on Twitter: New methodologies. J. Stud. 13, 801–814. https://doi.org/10.1080/1461670X.2012.664428 (2012).
Article Google Scholar
Wang, B. & Zhuang, J. Crisis information distribution on Twitter: A content analysis of tweets during Hurricane Sandy. Nat. Hazards 89, 161–181. https://doi.org/10.1007/s11069-017-2960-x (2017).
Article Google Scholar
Whitney, D. J. et al. Earthquake beliefs and adoption of seismic hazard adjustments. Risk Anal. 24, 87–102. https://doi.org/10.1111/j.0272-4332.2004.00414.x (2004).
Article PubMed Google Scholar
Bastian, M., Heymann, S. & Jacomy, M. Gephi: An open source software for exploring and manipulating networks. International AAAI Conference on Weblogs and Social Media (2009). http://www.aaai.org/ocs/index.php/ICWSM/09/paper/view/154
Staudt Willet, K. B. & Carpenter, J. P. Teachers on Reddit? Exploring contributions and interactions in four teaching-related subreddits. J. Res. Technol. Educ. 52, 216–233. https://doi.org/10.1080/15391523.2020.1722978 (2020).
Article Google Scholar
Hogan, B. Online social networks: Concepts for data collection and analysis. In The Sage Handbook of Online Research Methods, 2nd edn (eds Fieldng, N. G. et al.), 241–258 (Sage Publications, 2016). https://ssrn.com/abstract=3047869
Wukich, C. et al. Social media use in emergency management. J. Emerg. Manag. 13, 281–294. https://doi.org/10.5055/jem.2015.0242 (2015).
Article PubMed Google Scholar
Dong, R., Li, L., Zhang, Q. & Cai, G. Information diffusion on social media during natural disasters. IEEE Trans. Comput. Soc. Syst. 5, 265–276. https://doi.org/10.1109/TCSS.2017.2786545 (2018).
Article PubMed Google Scholar
Bruns, A. & Burgess, J. Crisis communication in natural disasters: The Queensland floods and Christchurch earthquakes. In Twitter and Society [Digital Formations], Vol. 89 (eds Bruns, A. et al.), 373–384 (Peter Lang Publishing, 2014). https://eprints.qut.edu.au/66329/
Li, L. X. Involvement of social media in disaster management during the Wenchuan and Ya’an earthquakes. Asian J. Public Opin. Res. 1, 249249–267. https://doi.org/10.15206/ajpor.2014.1.4.249 (2014).
Article Google Scholar
Bossu, R. et al. Rapid public information and situational awareness after the November 26, 2019, Albania earthquake: Lessons learned from the LastQuake system. Front. Earth Sci. 8, 235. https://doi.org/10.3389/feart.2020.00235 (2020).
Article ADS Google Scholar
Woods, R. J. et al. Science to emergency management response. Bull. N. Z. Soc. Earthq. Eng. 50, 329–337. https://doi.org/10.5459/bnzsee.50.2.329-337 (2017).
Article Google Scholar
Newman, N. et al. Reuters Institute Digital News Report 2021 (Reuters Institute for the Study of Journalism, 2021). https://ssrn.com/abstract=3873260

Download references

Acknowledgements

Any use of trade, firm, or product names is for descriptive purposes only and does not imply endorsement by the U.S. Government. We thank our anonymous reviewers and the internal reviewer at the U.S. Geological Survey. Qingkai Kong’s work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract Number DE-AC52-07NA27344. This is LLNL Contribution Number LLNL-JRNL-823001. We also thank Jason Baumgartner from PushShift for getting the raw data from both Twitter and Reddit.

Author information

Authors and Affiliations

Department of Computer Science, University of Colorado Boulder, 430 UCB, Boulder, CO, 80309, USA
Tao Ruan, Amatullah Sethjiwala & Qin Lv
Lawrence Livermore National Laboratory, 7000 East Ave, Livermore, CA, 94550, USA
Qingkai Kong
U.S. Geological Survey, 345 Middlefield Road, MS 977, Menlo Park, CA, 94025, USA
Sara K. McBride

Authors

Tao Ruan
View author publications
You can also search for this author in PubMed Google Scholar
Qingkai Kong
View author publications
You can also search for this author in PubMed Google Scholar
Sara K. McBride
View author publications
You can also search for this author in PubMed Google Scholar
Amatullah Sethjiwala
View author publications
You can also search for this author in PubMed Google Scholar
Qin Lv
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the interpretation of the data and the manuscript text. T.R. performed data analysis and prepared figures. Q.L. conceptualized and designed the research idea. Q.K. provided the seismology-related background and analysis directions. S.K.M. provided the social science theory and background.

Corresponding authors

Correspondence to Tao Ruan or Qin Lv.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ruan, T., Kong, Q., McBride, S.K. et al. Cross-platform analysis of public responses to the 2019 Ridgecrest earthquake sequence on Twitter and Reddit. Sci Rep 12, 1634 (2022). https://doi.org/10.1038/s41598-022-05359-9

Download citation

Received: 13 September 2021
Accepted: 04 January 2022
Published: 31 January 2022
DOI: https://doi.org/10.1038/s41598-022-05359-9

This article is cited by

Beyond phase-in: assessing impacts on disinformation of the EU Digital Services Act
- Luca Nannini
- Eleonora Bonel
- Michele Joshua Maggini
AI and Ethics (2024)
Dynamics and characteristics of misinformation related to earthquake predictions on Twitter
- Irina Dallo
- Or Elroy
- Abraham Yosipof
Scientific Reports (2023)
Social informedness and investor sentiment in the GameStop short squeeze
- Kwansoo Kim
- Sang-Yong Tom Lee
- Robert J. Kauffman
Electronic Markets (2023)
Twitter data from the 2019–20 Australian bushfires reveals participatory and temporal variations in social media use for disaster recovery
- R. Ogie
- A. Moore
- T. Dilworth
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Improving microbial phylogeny with citizen science within a mass-market video game

Worldwide divergence of values

Anger is eliminated with the disposal of a paper written because of provocation

Introduction

Related work

Use of social network during extreme events

Topic modeling and emotion analysis for short text

Cross-platform OSN analysis

Methodology

The 2019 Ridgecrest earthquakes

Public responses to earthquake sequence

Twitter data collection and filtering

Reddit data collection and filtering

Preprocessing

Time division

Comparison of Twitter and Reddit data

Emotion analysis

Emotion differences

Topic modeling

Topic modeling results

Response time

Unique information retrieved from Reddit

Subreddit analysis

Response time within subreddits

Conversation networks

Summary and impacts

Summary

Impacts

Limitations and future work

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Beyond phase-in: assessing impacts on disinformation of the EU Digital Services Act

Dynamics and characteristics of misinformation related to earthquake predictions on Twitter

Social informedness and investor sentiment in the GameStop short squeeze

Twitter data from the 2019–20 Australian bushfires reveals participatory and temporal variations in social media use for disaster recovery

Comments

Search

Quick links