- 1Creative Language Systems, School of Computer Science, University College Dublin, Dublin, Ireland
- 2Department of Modern Languages, Literatures, and Culture, University of Bologna, Bologna, Italy
The words we use to talk about the current epidemiological crisis on social media can inform us about how we are conceptualizing the pandemic and how we are reacting to its development. This paper provides an extensive explorative analysis of how the discourse about Covid-19 reported on Twitter changes through time, focusing on the first wave of the pandemic. Based on an extensive corpus of tweets (produced between 20th March and 1st July 2020), we first show, using topic modeling, how the topics associated with the development of the pandemic changed through time. Second, we show how the sentiment polarity of the language used in the tweets changed from a relatively positive valence during the first lockdown toward a more negative valence in correspondence with the reopening. Third, we show how the average subjectivity of the tweets increased linearly, and fourth, how the popular and frequently used figurative frame of WAR changed when real riots and fights entered the discourse.
Introduction
Covid-19 was first officially reported by the Chinese authorities as a virus that originated in Wuhan city, Hubei province, China, on 31st December 2019. According to official notifications of the World Health Organization (2020), at the time of revising this manuscript the disease has infected more than 106 million people worldwide and claimed more than 2.3 million lives.
The issues related to the development of the global pandemic are challenging and complex, because they carry deep consequences not only in the medical domain, but also in the social, economic, political, and behavioral ones. While the recent release of different types of vaccines suggests that we might be experiencing the last phases of this health crisis, the consequences of such a long-lasting worldwide pandemic will certainly be felt beyond the actual end of the medical emergency, and in various aspects of our lives.
Online discourse on Twitter, in this regard, has recently attracted a number of contributions, because the texts (the tweets) found on this platform are considered to be a good proxy for public opinion and perception of the pandemic we are currently experiencing (Bruns and Weller, 2016). It follows that understanding and interpreting such discourse, its evolution over time, and its interdependence with real-world events can help us understand how people conceptualize and react to the global crisis.
In particular, understanding how the topics discussed on Twitter in relation to the pandemic change over time can be crucial for understanding which aspects of the crisis are perceived to be most salient and important for the population (Zhou et al., 2020). In a very recent study, Wicke and Bolognesi (2020) analyzed the topics of discussion in a corpus of tweets that covered 2 months (20th March−20th May 2020). In the discussion of their findings, the authors suggest that topics are likely to change over time. Therefore, adding a temporal dynamic to the topic modeling analysis may provide a clearer view of how the pandemic is processed in the minds of the speakers and discussed on Twitter.
Mining the sentiment polarity of tweets through the analysis of the words used therein can provide precious information about how social measures such as travel bans, social distancing, and so forth were received by the population during the first wave. By observing potential changes in sentiment polarity through time, and interpreting them in relation to major events and governmental decisions issued during the first wave, it may become possible to predict how similar measures are going to affect us now that we are experiencing a new wave.
Tweets that contain language loaded with affective information are likely to express opinions rather than facts, and therefore tend to be subjective rather than objective. Mining the amount of affective information (positive or negative) associated with the language used in the tweets can thus shed light on the temporal dynamics of the overall subjectivity of the tweets. In other words, it becomes possible to observe the distribution of fact-based vs. opinion-based tweets over time (De Smedt and Daelemans, 2012). This type of analysis can provide an interesting indicator of our eagerness to report, trust, and discuss facts and potentially objective information, as opposed to opinions.
Finally, understanding how a specific conceptual framing used in the discourse about Covid on Twitter changes over time can provide a different type of indirect measure of people's attitude toward the pandemic. In particular, previous research has shown that various sensitive topics such as cancer, drugs, crime, and epidemics are typically framed using the pervasive metaphorical frame of WAR (Flusberg et al., 2017; Thibodeau et al., 2017; Wicke and Bolognesi, 2020). In some cases, however, the use of war-related terms to talk about sensitive topics has been proven to have negative effects on the people directly affected by the problem under discussion. For example, using war-related terms to talk about cancer affects patients' general attitude toward their own medical condition (Hendricks et al., 2018). Conversely, the use of alternative, more positive frames, such as JOURNEY or DANCE, can positively affect patients' attitude and general well-being. Since previous work has shown that, generally speaking, the WAR frame is particularly frequent in the discourse about Covid-19 (Wicke and Bolognesi, 2020), we hereby explore how the distribution of the lexical units within this figurative frame changes over time, possibly covering and expressing topics associated with the new stages of the pandemic, in a temporal perspective.
In line with the variables outlined above, the research questions addressed in this study can be summarized as follows:
1. Which topics are discussed on Twitter in relation to Covid and how do they change over time, with the development of the pandemic?
2. What valence (sentiment polarity) emerges from the tweets about Covid and how does it change over time?
3. How does the subjectivity of the tweets (i.e., opinion-based focus, vs. the objective fact-based focus) change over time?
4. How does the use of the pervasive figurative framing of WAR change over time?
Following the research questions outlined above, we formulated the following hypotheses.
1. TOPICS: The pandemic is in constant development and change. The topics of discussion on Twitter are likely to change accordingly, in concurrence with the most recent events associated with Covid-19. We therefore predict that different topic models, based on different degrees of granularity, will capture different events covered by the media and the press in relation to Covid-19.
2. SENTIMENT POLARITY: The corpus of tweets on which the current analysis is performed contains mainly data produced by American English users, collected between 20th March (first official day of lockdown in many States) and 1st July 2020. In this period of time the number of active cases increased steadily in the USA, according to the World Health Organization (2020). We therefore expect to find an increase in the negative feelings associated with the tweets, over time.
3. SUBJECTIVITY: Because of the development of the pandemic, the increase in daily cases, and the (possibly) negative feelings emerging from the tweets, we expect the tweets to contain an increasing number of words loaded with affective content. It follows that we expect the tweets to become increasingly opinion-based (loaded with emotion), rather than fact-based (neutral), as the epidemic progresses.
4. FRAMING: We do not have a specific hypothesis in relation to this research question, but we expect to observe possible changes in the way in which the WAR frame is used to talk about the virus. In particular, while words such as “fight” and “war” may continue to be frequently used, we might observe new words within this frame becoming common in the Covid discourse. This would suggest that the lexical tools used to frame the Covid discourse have been extended and developed, confirming the centrality and pervasiveness of the WAR figurative frame.
The remainder of the paper is organized as follows: after a brief overview of related work on these topics, we proceed by addressing each research question in order, explaining methods, results, and discussion of the data related to each analysis. Finally, we take all the results together and provide a final general discussion of our findings.
Theoretical Background and Related Work
The information encoded in the short texts produced by private internet users on Twitter (the tweets) provides useful clues that in some cases can be used by experts. A growing body of research on social media discourse associated with disasters and crises is based on Twitter. Yeo et al. (2018), for instance, reported a study of social media discourse about the 2016 Southern Louisiana flooding in which they used Twitter data to construct a response communication network and showed culture-specific characteristics of this discourse. In a more recent study, Yeo et al. tracked topics, sentiments, and patterns in longitudinal Twitter data on the same phenomenon (Yeo et al., 2020). Thanks to this analysis, they provided an overview of the long-term crisis recovery with respect to the dominant voices, sentiments, and participant numbers. The authors highlighted the need for long-term recovery communication, utilizing social media, and supporting local voices after a disaster. A spatiotemporal analysis of the Twitter discourse about Hurricane Matthew was conducted by Martín et al. (2017). The authors conducted a temporal analysis and tracked disaster-related tweets over a week for different states in the US in order to correlate the distance to the hurricane with Twitter activity. With a fine-grained analysis they were able to observe evacuees and travel information during the development of this disaster, which allowed them to check evacuation compliance.
In relation to previous epidemics, the linguistic data extracted from Twitter has been correlated with the actual spreading of the virus, showing that the number of tweets discussing flu symptoms predicted official statistics about the virus spread, such as those published by the Centers for Disease Control and Prevention and the Health Protection Agency (Culotta, 2010). Quantitative analyses of linguistic data have been conducted during the development of various types of diseases to mine the information that internet users encode in language while experiencing medical crises, such as the dengue fever in Brazil (Gomide et al., 2011), the Zika disease (Wirz et al., 2018; Pruss et al., 2019), the measles outbreak in the Netherlands in 2013 (Mollema et al., 2015), and more recently, the Coronavirus epidemic (Wicke and Bolognesi, 2020).
Topics
In relation to the spreading of the Zika virus in 2015, Miller and colleagues (Miller et al., 2017) used a combination of natural language processing and machine learning techniques to determine the distribution of topics in relation to four characteristics of Zika: symptoms, transmission, prevention, and treatment. The authors reported the most persistent concerns or misconceptions regarding the Zika virus extracted from the corpus of tweets, and provided a complex map of topics that emerged from the analysis. For example, in relation to the prevention of the virus spreading, they observed the emergence of the following topics: need for control and prevention of spread, need for money, ways to prevent spread, bill to get funds, and research. In a different study, Pruss et al. (2019) provided a cross-linguistic analysis of the discourse around the Zika virus, based on a corpus of tweets in three different languages (Spanish, Portuguese, and English). Using a multilingual topic model, the authors identified key topics of discussion across the languages and their distribution, demonstrating that the Zika outbreak was discussed differently around the world. Lazard and colleagues, instead, analyzed the topics related to the discourse around the Ebola outbreak in 2014, in particular after a case of Ebola was diagnosed on US soil. The authors reported that the main topics of concern for the American public were the symptoms and lifespan of the virus, the disease transfer and contraction, whether it was safe to travel, and how they could protect themselves from the disease. In a parallel study, Tran and Lee (2016) built Ebola-related information propagation models to mine the information encoded in the tweets about Ebola and explored how such information is distributed across the following six topics: 1. Ebola cases in the US, 2. Ebola outbreak in the world, 3. fear and prayer, 4. Ebola spread and warning, 5. jokes, swearing, and disapproval of jokes, and 6. impact of Ebola on daily life. The authors found that the second topic had the lowest focus, while the fifth and sixth had the highest. Finally, in a very recent study, Park et al. (2020) propose a topic analysis related to the discourse around Covid on Twitter, analyzing a corpus of Indian, South Korean, Vietnamese, and Iranian tweets in a temporal perspective. The authors report some cultural differences, showing that in Iran and Vietnam, unlike in South Korea, the number of tweets did not correlate with the dates of specific events taking place in these countries, which were used by the authors as baselines. In their temporal analysis they report that the official epidemic phases issued by governments do not match well with the online attention to the epidemic. Nonetheless, the authors compared similarities in major topics across these countries over time and found that in Iran, Vietnam, and India, the peak of the daily tweet trend preceded the peak of the daily confirmed cases. This suggests that mining tweets can help to monitor public attention toward the diffusion of the epidemic.
Finally, Twitter-based studies that use topic modeling techniques or sentiment analysis are starting to appear in relation to the Covid discourse. However, to the best of our knowledge, they use significantly different methodologies. Those works include sentiment analysis with deep learning classifiers (Chakraborty et al., 2020; Li et al., 2020), much shorter time spans (Abd-Alrazaq et al., 2020; Xue et al., 2020), and analyses of specific emotions without topic models (Lwin et al., 2020; Mathur et al., 2020). Because topic modeling is an exploratory, bottom-up, data-driven technique of data mining, we believe that a broader and more explorative approach, which takes into account multiple topic modeling solutions and a longer time span, may provide better insights into the themes discussed by Twitter users over time.
Sentiment Polarity
Many of these linguistic studies of social media discourse aim to mine the sentiments of the population experiencing a pandemic, by understanding people's feelings toward the topics related to the disease. For example, Mollema and colleagues found that during the measles outbreak in the Netherlands in 2013 many Twitter users were extremely frustrated by the increasing number of citizens who refused to vaccinate. That outbreak began among Orthodox Protestants, who often refuse vaccination for religious reasons.
The main distinction among sentiments observed within a given text is between positive and negative feelings. This dimension is commonly defined as emotional valence in cognitive science and cognitive psychology, and more typically as sentiment polarity in the machine-learning subfield called sentiment analysis. The exploration of the emotional valence encoded in tweets has been used in some cases to predict future behavior, for example to predict whether a customer was likely to use a given service a second time, under the assumption that positive feedback left on Twitter implies that a client might be more inclined to use that service again. In the case of political messaging during electoral campaigns, positive feedback might correlate with voters' support for a specific candidate. In some cases, as pointed out by recent studies, social media analyses during crisis situations may be used to investigate real-time public opinion and thus help authorities gain insight for quickly deciding on the best assistance policies (Mathur et al., 2020).
Temporal analyses of sentiments expressed in Twitter data have previously been conducted on a variety of topics, including the FIFA Confederations Cup in Brazil (Alves et al., 2014), the changes in voters' feelings during the US elections (Paul et al., 2017), and the changes of sentiments on a monthly, daily, and hourly level across different geographical regions (Hu et al., 2019).
Opinions
As suggested by Liu (2010), facts are objective expressions about events, entities, and their properties, whereas opinions are usually subjective expressions that describe sentiments, appraisals, and feelings toward events, entities, and their properties. Research on subjectivity detection, that is, the distinction between texts that express opinions and texts that express facts, is becoming increasingly central in various fields, such as computer science, journalism, sociology, and political science (see Chatterjee et al., 2018 for a review). The reasons for this interest are varied. There are business-related issues, such as companies interested in understanding whether consumers have strong opinions toward a specific brand or whether they are instead indifferent. Politics is another field where many use data to understand whether a specific candidate triggers opinions or leaves voters indifferent.
Distinguishing fact-based and opinion-based texts in social media is an operation usually performed by different types of analysts to fulfill different goals. Detecting fact-based texts in the wild (thus filtering out opinion-based texts) is an operation that can be performed by analysts interested in detecting events and capturing factual data, for the automated and fast identification of (for example) breaking news from social media streams. Conversely, detecting opinion-based texts in the wild is an operation that enables analysts to capture users' beliefs and feelings. This is usually done by companies to develop marketing strategies toward their brand. In both types of tasks, Twitter has been used as a valuable resource of linguistic data for fact and opinion data mining (Li et al., 2012).
Subjectivity detection is a major subtask involved in sentiment analysis (Chaturvedi et al., 2018). Before analyzing the positive and negative feelings involved in a corpus of texts, those texts that have a neutral connotation, that is, those texts that are not subjective, need to be filtered out (Liu, 2010). This is usually done in order to ensure that only opinionated information is processed by a classifier that can distinguish between positive and negative feelings. A thorough review of the methods and the challenges involved in distinguishing between facts and opinions for sentiment analysis lies beyond the scope of the present paper (but see Chaturvedi et al., 2018 for a literature review). The following heuristic might summarize how subjectivity and sentiment polarity are related to one another: the more a text includes words that are loaded with (positive or negative) emotional content, the more that text is arguably subjective, as it expresses personal opinions, beliefs, and sentiments toward a specific topic. Conversely, texts that feature neutral words, not loaded with emotions, are likely to be more informative and objective.
Framing
In cognitive linguistics and communication sciences, and in particular in metaphor studies, public discourse is often analyzed in relation to different figurative and literal communicative framings (Burgers et al., 2016). A frame is hereby defined as a selection of some aspects of a perceived reality, which taken together make a standpoint from which a topic can be seen. Such a standpoint is “constructed to promote a particular problem definition, causal interpretation, moral evaluation, and/or treatment recommendation for the item described” (Entman, 1993, p. 53). Within this definition of framing, metaphors may be used to establish a perspective on a given topic. In health-related discourse, for example, “war metaphors” are often used to talk about illnesses and treatments. For instance, in a pioneering work, Sontag and Broun (1977) described and criticized the popular use of war metaphors to talk about cancer, a topic of research recently investigated also by Semino et al. (2017). Their argumentation suggested that the use of military metaphors bears negative implications for clinical patients (see also Hendricks et al., 2018). Nevertheless, military metaphors are widely used and highly conventionalized, for their ability to provide a very effective structural framework that can be used to communicate about abstract topics, usually characterized by a strong negative emotional valence. Military metaphors, as suggested by Flusberg et al. (2017), draw on basic knowledge that everyone has, even though for most people this is not knowledge coming from first-hand experience. These metaphors are very efficient in expressing the urgency associated with a very negative situation, and the necessity for actions to be taken in order to achieve an outcome quickly. As recently reported by Wicke and Bolognesi (2020), this frame is also frequently used to talk about Covid-19 on Twitter. As the authors show, the WAR frame (and thus war-related metaphors) is much more commonly used than alternative figurative frames that can be found in the discourse about Covid. The authors also show that the most commonly used lexical units related to the WAR framing are “fighting,” “fight,” “battle,” and “combat.” This may be attributed to the stage of the pandemic during which the study was conducted (peaks of the first wave, March–April 2020). As the authors suggest, it could be the case that different stages in the development of the pandemic are characterized by different uses of the WAR framing in relation to Covid. For example, it could be the case that new lexical units within the WAR framing become frequently used, to express aspects of the sociocultural situation that were previously non-existent. These intuitions are tested in the present study, in section How Does the Figurative Framing of WAR Change Over Time?
Which Topics are Discussed on Twitter in Relation to Covid-19 and How do they Change Over Time with the Development of the Pandemic?
Methods
Data Acquisition
Twitter counts around 152 million active users worldwide (Statista, 2020). Through the publicly accessible Application Programming Interface (API) services, the platform allows analysts to mine the tweets that users post online, in compliance with the privacy regulations set by the platform programmers. According to the official Twitter redistribution policy1, tweets and the metadata associated with them (user's name, date, etc.) may not be shared; only tweet IDs, user IDs, and other meta-information may be redistributed.
Based on the extensive resource of tweet IDs collected by Lamsal (2020), we created a subcorpus of Covid-related tweets. The original dataset of tweet IDs collected by Lamsal contains 3–4 million tweets per day, in English, retrieved from Twitter based on a list of 90+ keywords that are possibly related to Covid, such as “corona,” “coronavirus,” or “pandemic.”2 This resource contains both tweets and retweets, produced by any type of tweeter. For the purpose of constructing a balanced, representative, and computationally manageable corpus of tweets from this extensive archive, we sampled 150,000 tweets per day from Lamsal's resource. From each sample we retained only one tweet per user and dropped retweets. Keeping only one tweet per user allowed us to balance compulsive tweeters against less involved Twitter users, thus preserving the representativeness and balance of the language used on Twitter to talk about Covid. The resulting corpus, on which the current analyses were performed, contains 1,698,254 tweets from individual users (without retweets), produced between 20.03.2020 and 01.07.2020.
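A minimal sketch of this daily sampling and filtering step, assuming the hydrated tweets for one day are available as records with the hypothetical fields tweet_id, user_id, text, and is_retweet:

```python
import random

def sample_daily_tweets(day_tweets, sample_size=150_000, seed=42):
    """Sample tweets for one day, drop retweets, keep one tweet per user.

    `day_tweets` is assumed to be a list of dicts with the hypothetical
    keys "tweet_id", "user_id", "text", and "is_retweet".
    """
    random.seed(seed)
    sample = random.sample(day_tweets, min(sample_size, len(day_tweets)))
    seen_users, kept = set(), []
    for tweet in sample:
        if tweet["is_retweet"]:
            continue  # drop retweets
        if tweet["user_id"] in seen_users:
            continue  # one tweet per user balances compulsive tweeters
        seen_users.add(tweet["user_id"])
        kept.append(tweet)
    return kept
```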
Topic Modeling
The topic modeling analysis hereby implemented builds on an approach presented by Wicke and Bolognesi (2020), which uses Latent Dirichlet Allocation (LDA) (Blei et al., 2003). The standard LDA algorithm is an unsupervised machine learning algorithm that describes samples of data in terms of heterogeneous categories. Since the LDA algorithm is unsupervised, the analyst needs to specify the number of topics to be modeled. For example, by specifying N = 4, each tweet in the corpus receives a likelihood of belonging to each of four categories automatically identified by the algorithm. The categories are defined by words that co-occur most strongly with each other.
The processing and modeling pipeline can be summarized as follows:
• Stopword removal: the most common English words, e.g. “a,” “the,” “but,” are filtered out, based on established stopword lists (Stone et al., 2011; Wei et al., 2015; NLTK3).
• Tokenization: by means of the NLTK Tweet Tokenizer4
• Gibbs sampling: by means of the Mallet wrapper for Gensim (Rehurek and Sojka, 2010), used to apply Gibbs sampling in our LDA training.
• Number of topics: we explored all topic modeling solutions from topic number N = 2, 3, 4 … to N = 32, and then a highly granular solution, N = 64. Based on the Cv coherence measure of the cluster solutions (Syed and Spruit, 2017), we retained the best solutions.
Internal topic coherence is evaluated through the elbow method: all topic numbers are plotted in relation to their internal coherence, and the selected solutions are those at which the function shows a clear bend, suggesting that for the next solution the coherence slope drops significantly. For the purpose of this study, we aimed to pick four different cluster solutions that vary in their degree of granularity. We partitioned the data into both a smaller and a larger number of topics in order to see potential differences emerge between a broad analysis and a more fine-grained analysis of the topics within our corpus.
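A minimal sketch of this model-selection loop, assuming tokenized_tweets is the list of stopword-filtered, tokenized tweets (Gensim's built-in LDA is used here for self-containment; the actual analysis relied on the Mallet wrapper):

```python
from gensim.corpora import Dictionary
from gensim.models import LdaModel, CoherenceModel

def coherence_by_topic_number(tokenized_tweets, topic_numbers):
    """Train one LDA model per topic number and return its Cv coherence."""
    dictionary = Dictionary(tokenized_tweets)
    corpus = [dictionary.doc2bow(tokens) for tokens in tokenized_tweets]
    scores = {}
    for n in topic_numbers:
        model = LdaModel(corpus=corpus, id2word=dictionary,
                         num_topics=n, random_state=42, passes=10)
        cm = CoherenceModel(model=model, texts=tokenized_tweets,
                            dictionary=dictionary, coherence="c_v")
        scores[n] = cm.get_coherence()
    return scores

# Explore N = 2..32 plus the highly granular N = 64; the elbow of the
# resulting coherence curve guides the choice of the retained solutions.
scores = coherence_by_topic_number(tokenized_tweets, list(range(2, 33)) + [64])
```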
Temporal Analysis
Figure 1 displays the steps involved in the topic modeling based on the corpus of Covid-related tweets. The 100 groups of daily tweets were fed into the topic modeling algorithm, which assigns each tweet a probability distribution over topics. The result of the temporal modeling analysis is a series of clusters for each topic, for each day in the corpus. Based on these temporal distributions, we provide an analysis of the observed patterns co-occurring with events in the news.
Figure 1. Processing pipeline for the constructed corpus. The corpus of tweets follows one path with processing steps, topic modeling, and model selection (blue arrow). On the other path (green arrow), the same corpus is presented to the models in order to create four temporal topic analyses over 100 days with different resolutions, i.e., different topic numbers.
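A minimal sketch of this temporal aggregation step, assuming daily_corpora maps each date to its list of bag-of-words tweet vectors and model is one of the selected LDA models:

```python
import numpy as np

def daily_topic_proportions(model, daily_corpora):
    """Average the per-tweet topic distributions for each day.

    `daily_corpora` is assumed to map a date string to a list of
    bag-of-words vectors (the output of Dictionary.doc2bow).
    """
    proportions = {}
    for day, bows in daily_corpora.items():
        totals = np.zeros(model.num_topics)
        for bow in bows:
            # minimum_probability=0 returns a value for every topic
            for topic_id, prob in model.get_document_topics(bow, minimum_probability=0):
                totals[topic_id] += prob
        proportions[day] = totals / len(bows)  # per-day topic proportions
    return proportions
```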
Results
Following the processing pipeline depicted in Figure 1, we created 32 LDA models, each with a different number of topics. The evaluation of the Cv coherence measure revealed an elbow of the function at N = 20 (see a plot of the curve in the Open Science Framework (OSF) repository5 for this paper). In addition to this, we selected a model that allowed for a broader analysis of the topics, hence a smaller number of (more inclusive) topics. Based on our previous experience with topic modeling and on the coherence value function, we selected N = 12, together with N = 32 as a fine-grained solution, and N = 64, our most fine-grained solution. It should be noted that the LDA algorithm itself involves some degree of randomness, and therefore different runs are likely to yield different models, even when trained on the same data. Yet, our selection is based on the evaluation of the coherence measure, in order to mitigate this statistical randomness.
Model A (N = 12), B (N = 20), C (N = 32), and D (N = 64) are stored in the OSF online repository, where the plots can be dynamically explored using an interactive web-service that we created using pyLDAvis6 (Sievert and Shirley, 2014).
In order to capture the temporal dynamics involved in the discourse, the groups of tweets collected for each of the 100 days (an average of 12,598 tweets per day, once the corpus is filtered for retweets and unique users) were fed into the topic modeling algorithm. Figures 2–5 illustrate the four analyses, displaying the temporal line on the horizontal axis, the topics as different chromatic shades on the vertical axis, and the proportion of tweets within each topic (day by day) as colored areas. The labels in the legend for each topic consist of the top 3 or 4 most important words within each topic. These are visible in larger fonts in the interactive versions of these topic modeling solutions.
Figure 2. Temporal development of the topics (N = 12 topics, 100 days). Interactive version: https://bit.ly/3cfDq0V.
Figure 3. Temporal development of the topics (N = 20 topics, 100 days). Interactive version: https://bit.ly/2FNACwb.
Figure 4. Temporal development of the topics (N = 32 topics, 100 days). Interactive version: https://bit.ly/35RotkG.
Figure 5. Temporal development of the topics (N = 64 topics, 100 days). Interactive version: https://bit.ly/3kx6XGo.
Figure 2 displays the least granular topic modeling analysis (N = 12), which is likely to capture broader and more generic topics associated with the discourse about Covid-19. Three main observations can be made, based on the changes in the colored areas. In Figure 6 we have highlighted those three bands, which occurred roughly in the first week of April, the fourth week of April, and the fourth week of May. The pandemic-related events we correlate with the results of the topic modeling are informed by official statements released by the World Health Organization (2020).
Figure 6. Event-related analysis of the N = 12 topic development. The three bands (first week of April, fourth week of April and fourth week of May) have been highlighted, and possible correlated events are provided in labeled text boxes.
In the first week of April, we observe a change in the topics “stop|government|money” and “year|fucking|years|months.” We interpret this in relation to a major event that took place on the second of April: a record 6.6 million Americans filed claims for unemployment.7 As a consequence, we argue, the first week of April had a strong impact on people's opinions on continuing or stopping government financial aid throughout the upcoming months and year.
The fourth week of April shows a strong increase for the topic “mask|masks|vaccine|face”. The following additional keywords are associated with this topic: “kill,” “cure,” “disease,” “human,” “wear,” “person,” “treatment,” “wearing,” “body,” “science,” “light,” “research,” “sense,” “common,” and “study.” Comparing this topic with the news reported by the press, it appears that on 23rd April Donald Trump suggested (ironically or otherwise) that coronavirus might be treated by injecting disinfectant or by UV lights.8 It is likely that this comment triggered the increased discussion on Twitter about common sense, science, and effective treatment (such as masks, vaccines, face masks).
The 20-topic solution shows the following trends:
• The topic marked as “home|stay|lockdown” displays a large portion of tweets in March; then the concentration decreases, to finally increase again in June. These trends might be related to the “Stay at home” guidelines issued by the WHO on 12th March and updated and extended on 29th March.
• In the first half of April there is a concentration of tweets in the topic labeled as “stop|spread|spreading.” We interpret this as a reaction to some notifications issued by the WHO, such as the confirmation of over 1 million cases of COVID-19 reported on 4th April 2020, the updated guidance on how to use masks on 6th April, and the publication of a draft landscape of COVID-19 candidate vaccines on 11th April. In relation to this, the increased concentration of tweets in the topic labeled as “positive|patients|test” around the end of April/beginning of May, might be related to the fact that in this period the USA became the first country in the world to hit 1M cases.
• A substantial concentration of tweets is observed in early April, in the topic labeled as “fight|India|country.” This is when the virus started to spread exponentially in this country.
• An increase of tweets in the topic labeled with the keyword “narendramodi” (Narendra Modi is the Indian Prime minister) can be observed around the first half of June, when the spreading of the virus was particularly fast in this country, and the Prime Minister was appearing often in the media, with messages related to the pandemic. Moreover, in the beginning of June he held summits with authorities in France and in the USA. Finally, another peak can be observed around the 28th June. We interpret this as an anticipation of the major event that took place on the 30th June, when Modi addressed the whole nation with a strong message, explaining that people had become more irresponsible and careless about COVID-19 prevention guidelines since the start of their first “Unlock 1.0.”9
• The topic labeled as “Trump” displays three main peaks of tweets, on 24th and 26th April, as well as on 21st June. The first two dates correspond to the days that followed the statement by Donald Trump in which he was floating the idea (ironically or otherwise) of ingesting disinfectants as a potential coronavirus treatment. The latter date corresponds to the date in which he held his first campaign rally since the US coronavirus lockdown began, in front of a smaller than expected crowd in Tulsa, Oklahoma.
• Finally, a crucial topic, previously undocumented in the N = 12 solution, becomes particularly relevant around the end of May. This topic is labeled as “lives|pandemic|police” and its appearance coincides with the murder of African American George Floyd by police officers. This is further described in relation to the most fine-grained topic solution, N = 64.
We also acknowledge a couple of concentrations of tweets on 18th May in the topics labeled, respectively, as “video|Youtube|watch|live” and “open|house|week|close,” for which we were not able to identify any specific event that may be associated with this specific date.
The N = 32 cluster solution, which is more fine-grained than the previous solutions described above, displays a few interesting trends in addition to those emerging from previous analyses:
• A peak of tweets can be noted on 18th May on the topic “government|state|control,” which may precede by a couple of days the official CDC announcement (probably leaked to the press a few days before its official release). This peak coincides with the introduction of a Community Mitigation Framework that includes updated guidance for communities, schools, workplaces, and events to mitigate the spread of COVID-19.10 This is also related to a peak in the topic “vaccine|cure|disease,” observed on 19th May.
• A peak of tweets on the topic “fucking|house|stupid” is observed on 1st April, when the WHO issued a report with specific guidelines for Public Health and Social Measures for the COVID-19 Pandemic.
• A substantial increase of tweets within the topic “social|mask|distancing” is observed in June, concurrently with the gradual reopening of various countries and the need to remind people to keep a safe distance.
Finally, the N = 64 topic development provides the most fine-grained analysis of the topics. These are reported in Figure 7. Here it can be observed that many events already mentioned in the previous analyses are captured also by this model. Moreover, the more detailed analysis reveals an increase in the topic “india|spreading|indian” around 31st March, in addition to the peaks observed and described in the previous models. Around this time, India and Pakistan intensified their efforts to contact-trace participants of the Tablighi Jamaat coronavirus hotspot in Delhi, with more than 4,000 confirmed cases.11 We therefore take this feature of the N = 64 model as a good example of topic modeling capturing local events with a greater number of topics.
Figure 7. Event-related analysis of the N = 64 topic development. The related events (white boxes) are linked to anomalies, peaks in the topic development (circular marks). Photograph left: Donald Trump speaking with supporters at a campaign rally (Credit: Gage Skidmore. Under CC A-SA 2.0 Generic License. Desaturated.) Photograph right: Black Lives Matter protest (Credit: Kety Duran @ketyduran).
Focusing on the latter half of the 100 days, we can explore how apparently unrelated topics are entering the Covid-19 discourse. For example, on Saturday 23rd May we can observe an increase for the topic labeled “world|happy|hope|rest.” Upon closer investigation of the topic model, we identify the related topic words: “allah,” “pray,” “save,” “bless,” “month,” and “protect”. This might refer to the end of Ramadan and the Eid Al-Fitr (festival of breaking the fast), since it correlates directly with the words “month,” “pray,” “allah,” and “bless.” These topics at first sight might appear to be unrelated to COVID-19. However, these are extracted from tweets that feature one or more of the COVID-19-related keywords. Tweeters are therefore likely to have expressed some connection between these events and the pandemic.
Finally, on 25th May a video of African American George Floyd's arrest in Minneapolis showed him being pinned to the ground by a police officer for 8 min and 46 s; he was murdered while under restraint in police custody. This video ignited widespread condemnation and nationwide protests in the U.S. In Figure 7, despite the large number of topics displayed, we can clearly observe how this event affected the discourse about the COVID-19 pandemic. On 25th May the topic labeled “police|protests|killed” shows a great concentration of tweets. The most important words in this topic are: “police,” “protests,” “killed,” “dead,” “protest,” “thousands,” “killing,” “mass,” “riots,” and “protesting.” Although it might appear that this topic is related to a different set of events, it is worth remembering that all the tweets on which the analysis is performed contain a keyword associated with Covid-19 and its variants. At the same time, the graph shows an increase for the topic labeled “lives|black|human|matter,” which indicates how the Black Lives Matter (BLM) movement gained momentum after the murder of George Floyd. This topic, too, is discussed in relation to Covid-19.
Discussion
The topic analyses show different trends. The less granular analysis, based on a limited number of topics (N = 12), shows a macro-level distinction into topics of discussion, where general themes emerge. Conversely, the more the number of topics increases, the more the tweets are partitioned into smaller clusters, which are more thematically coherent and seem to capture more specific events reported by the media and discussed by Twitter users. Overlaps can be observed as well, across the various topic modeling solutions, with some trends emerging in generic as well as in more granular topic models. Nevertheless, we showed that an approach based on multiple partitionings of the data provides a better view of the trends in the data.
The explorative nature of the topic modeling approach allows analysts to mine large collections of linguistic data in a bottom-up manner, to observe tendencies of language use emerging from authentic texts. However, it should be acknowledged that the association of linguistic trends with specific events reported in the news is an interpretative process subject to a degree of variability. Although we based our interpretations on the keywords that emerged from the topic models and on major sources of information such as the WHO and the CDC websites, in principle it could be argued that different (but related) events reported in the media might explain the changes in topics observed through our topic modeling analyses.
What Valence Emerges from the Tweets About Covid-19 and How Does it Change Over Time?
Methods
For each of the 100 days in our corpus, the average polarity of the words used in the tweets was assessed using the TextBlob library (Loria et al., 2014). The obtained polarity score was a numeric value within the range [−1.0, 1.0] where −1.0 represents a very negative sentiment and 1.0 represents a very positive sentiment.
TextBlob's sentiment analysis (previously applied to Twitter data, see Hawkins et al., 2016, Reynard and Shirgaokar, 2019) is based on the Pattern library, which uses a lexicon of hand-tagged adjectives, with values for polarity and subjectivity (De Smedt and Daelemans, 2012).
After calculating the sentiment scores, we identified the most appropriate function to describe the change of sentiment emerging from the distribution of the tweets over time. For that, we started from a polynomial regression, y = β₀ + β₁x + β₂x² + … + βₖxᵏ + ε, where ε is an unobserved random error. Specifically, we performed an ordinary least-squares regression for increasing polynomial degrees until we explained much of the data variance with significant confidence.
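A minimal sketch of this scoring and curve-fitting step, assuming daily_tweets maps each of the 100 days to its list of tweet texts (statsmodels is used here for the OLS fit; the original fitting routine is not specified in the text):

```python
import numpy as np
import statsmodels.api as sm
from textblob import TextBlob

# Average TextBlob polarity per day; polarity lies in [-1.0, 1.0].
daily_polarity = [
    np.mean([TextBlob(text).sentiment.polarity for text in tweets])
    for day, tweets in sorted(daily_tweets.items())
]

# Fit OLS models of increasing polynomial degree and compare the fits.
x = np.arange(len(daily_polarity))
for degree in (1, 2, 3):
    X = sm.add_constant(np.column_stack([x**d for d in range(1, degree + 1)]))
    fit = sm.OLS(daily_polarity, X).fit()
    print(degree, fit.rsquared, fit.f_pvalue)
```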
Results
Figure 8 reports the average polarity scores for each of the 100 days in the corpus. A linear function (f(x) = 0.0622 − 0.0001x) could only explain 8.8% of the variance of the data (R²: 0.088, p > 0.003, F-statistic = 9.480). Therefore, we modeled a polynomial function of second degree (Figure 8, curved dashed black line) with f(x) = 0.0513 + 0.0006x − 0.000007x². This non-linear model of the correlation between time and polarity provides a better fit (R²: 0.356, p < 0.001, F-statistic = 26.81), explaining 35.6% of the variance. Notably, higher-degree polynomials provide an even better fit, yet they do not serve our goal of identifying a simple trend and risk overfitting the data.
Figure 8. Average polarity scores for the tweets over the 100 days. Fitted polynomial regression to approximate the development of polarity over time is depicted as a dashed black line.
As Figure 8 shows, the sentiment scores follow an inverted U shape, which suggests that the sentiments expressed by the tweets are increasingly positive from the beginning of our timeframe to roughly the middle of the interval (therefore, from 20th March to the beginning of May), and then drop toward the negative end of the valence spectrum.
Discussion
The overall sentiment polarity over the 100 days emerging from the tweets in the corpus is slightly positive (>0). The polynomial regression indicates that the average sentiment becomes increasingly positive during the first 40 days of the pandemic, while it drops dramatically in the second half of the timeframe. We interpret this trend in the following way. Within the first month of the pandemic, the general attitude of the population tended to be slightly optimistic. These first 40 days cover the last 10 days of March and the whole month of April. Many countries during this period were in lockdown, and despite the fear of the unknown situation, a positive attitude, even if only verbally expressed in the tweets, might have been a form of mutual encouragement. This is the period in which a series of positively framed hashtags began to emerge, such as #StayHome, #FlattenTheCurve, and #StayHomeSavesLives, encouraging people to embrace the difficult situation and hold on tight, in order to fight together against the virus. This is the type of attitude expressed by the collective mind, emerging from Twitter. An example that summarizes this attitude is the following tweet, which was anonymized in compliance with the Twitter regulations:
“Probably for the best, don't need everyone's grubby hands digging in them spreading corona virus around. #brightside” (20th March 2020)
It should be noticed that during this first phase (March/early April), the financial and societal repercussions of the lockdown were not yet as obvious and impactful as in the latter half of the 100 days (May/June). During the later dates in fact, the attitude in the collective mind of the population dropped substantially, toward a much more negative end. The following tweet exemplifies this trend (polarized toward the negative end):
“Bullshit!!! Our country is the worst in the world for the pandemic because of you, not China. Stop blaming everyone else & trying to defect blame. The state of our country is your fault & yours alone. RESIGN!!!” (1st July 2020)
This trend toward negative sentiments arguably signals a change in the general well-being of the population in relation to the current pandemic. In line with previous research, we believe that keeping track of the polarity of the sentiments expressed on social media may be beneficial to health practitioners and politicians, who could use it to understand which measures will be most effective in counteracting the deterioration of the general well-being of the population. For example, it may be suggested that restrictive measures such as hard lockdowns imposed during periods when general feelings are particularly negative may lead to extreme and undesirable individual and collective actions.
As a caveat of our analysis, it should be acknowledged that TextBlob (like many other lexical approaches to sentiment analysis) does not distinguish genuine positive sentiments from sarcastic ones. This is an open issue in sentiment analysis, and a major bottleneck in machine learning in general, which is currently being tackled by scholars who are developing tools for the automatic detection of sarcasm and irony (Ghosh and Veale, 2016; Reyes et al., 2013). The issue is exemplified by the following tweet:
“Best healthcare system in the world ” (1st July 2020)
While the average polarity of this tweet leans toward the positive end, the pragmatics of this message and the use of specific emoji that express negative emotions such as skepticism and frustration suggest that the user is being sarcastic. Therefore, the tweet would need to be interpreted as emotionally negative. The real polarity of the tweet is, in this case, carried by the emoji rather than by the verbal text. Further research might need to disentangle genuinely positive tweets related to the pandemic from tweets that express sarcastic comments toward the current situation, which might need to be classified as negative even though they feature words loaded with positive feelings.
How Does the Subjectivity of the Tweets (i.e., Opinion-Based Focus, vs. The Objective Fact-Based Focus) Change Over Time?
Methods
The TextBlob tool for sentiment analysis provides a subjectivity score in addition to the polarity score. The subjectivity score is in the range [0.0, 1.0], where 0.0 is very objective (facts) and 1.0 is very subjective (opinions). As for the previous analysis, we averaged the subjectivity scores for each day and then identified the most appropriate function to describe the changes in subjectivity. For that, we used an ordinary least-squares regression for increasing polynomial degrees until the function explained a large portion of the data variance with significant confidence.
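The corresponding sketch for subjectivity mirrors the polarity analysis above (again assuming the hypothetical daily_tweets mapping):

```python
import numpy as np
import statsmodels.api as sm
from textblob import TextBlob

# Average TextBlob subjectivity per day; subjectivity lies in [0.0, 1.0],
# where 0.0 is very objective (facts) and 1.0 very subjective (opinions).
daily_subjectivity = [
    np.mean([TextBlob(text).sentiment.subjectivity for text in tweets])
    for day, tweets in sorted(daily_tweets.items())
]

# Here a first-degree (linear) fit already explains most of the variance.
x = np.arange(len(daily_subjectivity))
fit = sm.OLS(daily_subjectivity, sm.add_constant(x)).fit()
print(fit.params, fit.rsquared, fit.f_pvalue)
```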
Results
The result for the subjectivity score over the 100 days is depicted in Figure 9. A linear regression of the subjectivity scores indicated a good fit. The linear regression f(x) = 0.3452 + 0.0003x showed R² = 0.69 and a highly significant fit (F-test p = 1.06e−26, p < 0.0001), implying that about 69% of the variability in subjectivity scores is explained by the passing days.
Figure 9. Average subjectivity scores for the tweets over the 100 days. Fitted linear regression to approximate the development of subjectivity over time is depicted as a dashed black line.
Here we report an exemplar tweet produced on 20th March, which is associated with a subjectivity value of 0, meaning that the text likely expresses facts rather than opinions, and is objective rather than subjective:
“things corona has done to us: made us wash our hands canceled allergy season/ coughing and sneezing otherwize people think we are possessed” (20th March 2020)
This tweet expresses substantially factual information, even though it is colored with a rather humorous nuance. This type of objectivity can be compared with the following tweet, also produced on 20th March and also colored with a humorous nuance, in which the author indicates a different type of consequence associated with Covid-19:
“a guy just messaged me and said “if corona doesn't take you out can i?” literally the worst thing coronavirus has done is fuel men's terrible flirting rhetoric” (20th March 2020)
The latter tweet is one of the few examples of tweets associated with high subjectivity scores produced early in our time frame (20th March). Comparing the former with the latter tweet, it becomes clear why the former scores low on subjectivity: while in the first case Covid-19 is associated with washing hands and avoiding sneezing (even though canceling allergy season is a hyperbolic statement), in the second case Covid-19 is associated with fueling men's terrible flirting rhetoric. The negative judgment toward specific flirting techniques is arguably subjective, and so is the attribution of its responsibility to Covid-19.
An example of a tweet that scores in the middle of the subjectivity scale (subjectivity = 0.5), posted on 10th May, in the middle of our time frame, is the following:
“I want to say Trump but Covid-19 is a bigger threat to the world” (10th May 2020)
This tweet, like most tweets, uses elliptical language in which words and punctuation are omitted. From a pragmatic perspective this opens up the field to various possible interpretations. In our opinion, this tweet seems to express two meanings: [1] Trump is a big threat to the world, and [2] Covid-19 is a bigger threat to the world than Trump. These two meanings are expressed linguistically in a way that helps the tweeter build a rather sophisticated argumentation. First, the tweeter suggests to the reader that she thinks that Trump is a big threat to the world. She expresses this statement as an opinion (“I want to say…”). Then, the tweeter introduces the second part as an assertive statement (Covid-19 is a bigger threat to the world than Trump). Here the tweeter wants the reader to take this statement as an objective, factual piece of information. In this construction, the first statement contributes to building the perceived objectivity expressed in the second statement. As a result, the average subjectivity of the tweet receives a medium value.
Finally, an example of a tweet produced on 1st July (end of the time frame), associated with a high subjectivity value (subjectivity = 1) is the following:
“maybe being stupid is a pre-existing condition that makes you susceptible to the corona virus?” (1st July 2020)
In this tweet, the high subjectivity stems from the fact that the tweeter introduces the statement with a “maybe,” signaling a possibility and thus suggesting an opinion, and then proposes stupidity (a highly subjective human trait) as a pre-existing condition possibly associated with Covid-19.
Discussion
The subjectivity analysis displayed in Figure 9 shows that with the development of the pandemic, Twitter users tend to focus more on their own introspections and to express more (subjective) opinions than (objective) facts. In other words, the subjectivity of the tweets increases linearly as a function of time, as we had hypothesized.
Taken together with the previous analysis, the subjectivity and polarity trends suggest that in the beginning of the pandemic Twitter users tended to communicate and rely on facts. They expressed little emotional content, which was on average initially negative. It is possible that this was characterized by feelings of fear toward the unknown situation. This initial negativity was followed by a positive trend concurrent with the lockdown measures that characterized several English-speaking countries. During these weeks, despite the difficulties experienced by many families, it is possible that a sense of community and a need for mutual encouragement led Twitter users to post tweets that were on average more positively valenced than those posted in the previous weeks. The trend dropped again toward the negative end of the scale around the end of April, through May and June. This phenomenon is concurrent with the gradual reopening of many States (which spanned from mid-April to mid-June). During these warmer months, the pandemic developed further in many English-speaking countries (notably in the USA), and the number of active cases increased exponentially, showing that the lockdown measures adopted provided only temporary relief. This negative slope in the polarity of the tweets is concurrent with a substantial increase in subjectivity, and therefore with an increasingly large number of tweets that express opinions, personal beliefs, and introspections, rather than facts.12
How Does the Figurative Framing of War Change Over Time?
Methods
To understand whether the use of the WAR frame changes over time, we used the list of war-related lexical units described in Wicke and Bolognesi (2020), which was created using the web-service relatedwords.org and the MetaNet repository entry for “war.”13 We then computed the distribution of war-related terms in our corpus. Since we had a varying number of tweets per day, we randomly sampled 9,500 tweets for each of the 100 days. For each sample of 9,500 tweets, we counted the occurrences of war terms. Finally, we looked at how the occurrence of specific war terms changed over time.
Through a regression analysis we described the distribution of war-related terms over time. Subsequently, we defined three time intervals that allowed us to investigate different events and their effect on some selected war-related terms. The three intervals span 30, 30, and 40 days, respectively, and overlap with the intervals that characterized two peaking waves of infections observed on the WHO Coronavirus Disease Dashboard (covid19.who.int). Table 1 describes the intervals and related dates. The methodology hereby adopted has been adapted from a study conducted by Damstra and Vliegenthart (2018).14
Table 1. The three chosen intervals for the detailed analysis of correlation between occurrences and war-related terms over 100 days.
For the selection of terms, we created two constructs. Since the occurrence of war-related terms in the COVID-19 discourse follows a Zipf distribution (Wicke and Bolognesi, 2020), on the one hand we considered only the four most frequently occurring terms as our first construct, which we labeled MostCommon. This includes the terms “fighting,” “fight,” “war,” and “attack.” On the other hand, by looking at the most common words occurring in each of the three time intervals, we noticed that the war-related terms “riot,” “violence,” “military,” and “soldiers” appeared to follow a different temporal distribution than the other terms. Hence, we grouped these four terms in a second construct labeled EventRelated. We then analyzed the distributions of these two constructs with a two-way ANOVA in order to test for differences across the two factors (time interval, construct). Finally, we performed post-hoc t-tests with a Bonferroni correction and analyzed the effect sizes using Cohen's d, to validate our intuitions about the distribution of war-related terms in relation to the timeline.
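A minimal sketch of the counting and ANOVA steps, assuming daily_samples maps each day to its 9,500 sampled tweet texts and a hypothetical helper interval_of returns the time interval (1–3) a day belongs to:

```python
import re
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

CONSTRUCTS = {
    "MostCommon": ["fighting", "fight", "war", "attack"],
    "EventRelated": ["riot", "violence", "military", "soldiers"],
}

# One whole-word, case-insensitive pattern per construct.
patterns = {
    construct: re.compile(r"\b(" + "|".join(terms) + r")\b", re.IGNORECASE)
    for construct, terms in CONSTRUCTS.items()
}

# Count, for each day and construct, how many sampled tweets contain a term.
rows = []
for day, tweets in daily_samples.items():
    for construct, pattern in patterns.items():
        count = sum(1 for text in tweets if pattern.search(text))
        rows.append({"interval": interval_of(day),  # hypothetical helper
                     "construct": construct, "count": count})
df = pd.DataFrame(rows)

# Two-way ANOVA: occurrence counts by time interval and construct.
model = ols("count ~ C(interval) * C(construct)", data=df).fit()
print(sm.stats.anova_lm(model, typ=2))
```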
Results
The total number of sampled tweets is 950,000 (100 days with a sample size of 9,500), and the total number of tweets containing war-related terms is 53,214, which corresponds to 5.60% of all sampled tweets. The linear regression f(x) = 707.26 − 3.5379x, with R² = 0.513, F-statistic = 103.1, and a significant p-value, showed that about 51.3% of the variability in occurrences is explained by the passing days. The linear regression (blue dotted line in Figure 10) explains the variance of the data better than a constant model that sets the occurrences to their mean, assuming they are equal throughout the 100 days (horizontal, black dotted line in Figure 10).
Figure 10. Occurrences of war related terms day by day. Brighter color values indicate higher number of occurrences. The three missing bars in the graph correspond to the three missing dates from Lamsal's dataset.
The results of the 2-way ANOVA showed significant differences for both factors, time intervals and constructs. The results are summarized in Table 2.
The post-hoc t-tests showed significant differences in all tested contrasts except one, as illustrated in Table 3 (row in bold). The tests evaluate the contrast between the constructs with respect to the time intervals. The difference between constructs irrespective of the time intervals is shown in the first row. The differences between the time intervals irrespective of the constructs are shown in the following three rows. The contrasts between the constructs within the time intervals are shown in the remaining rows. Except for the difference between time interval 2 and interval 3, irrespective of the construct, all differences are significant with p < 0.005. Notably, Cohen's D is interpreted in the direction of column A, with 0.2 considered a small, 0.5 a medium, and anything >0.8 a large effect size.
The distribution of occurrences over time for the construct of MostCommon terms can be observed in Figure 11, where the time intervals are indicated with vertical black lines. The figure shows how the most frequent lexical entries within the WAR frame ("fight," "fighting," "war," and "attack") are used day after day in the tweets about Covid-19. Overall, their frequency of usage appears to decrease over time. The distribution of occurrences over time for the construct of EventRelated terms can be observed in Figure 12. Here, the peak of frequency counts in the third time interval, for the words "riot," "violence," "military," and "soldiers," is clearly visible.
Figure 11. Occurrences of the four most prevalent war related terms over the course of 100 days: “fight” (A), “fighting” (B), “war” (C), and “attack” (D). These four words belong to the construct MostCommon. Brighter color values indicate higher number of occurrences. Vertical dotted lines indicate the separation of the three time intervals.
Figure 12. Occurrences of the four war-related terms of the EventRelated construct over the course of 100 days: "riot" (A), "violence" (B), "military" (C), and "soldiers" (D). Brighter color values indicate higher number of occurrences. Vertical dotted lines indicate the separation of the three time intervals.
Discussion
In previous research on WAR framing during the COVID-19 pandemic, it was observed that, roughly during the peak of the first wave of infections, between 20.03.20 and 02.04.20, about 5.32% of all tweets contained war-related terms (Wicke and Bolognesi, 2020). For a much longer timespan (20.03.20–01.07.20), we now report a similar figure: 5.60% of the tweets contain war-related terms. However, the war-related terms are not distributed equally across the corpus. A broad division of our data into three theoretically motivated timeframes (described in Table 1) shows that the total amount of war-related words differs significantly across time intervals. In particular, war-related terms show their highest concentration in the first interval, which we identified with the first global rise of infections. In this interval the WAR frame worked particularly well, because the identification of the enemy was clear and unambiguous. The sense of risk and urgency toward the situation was plain for everyone to see, and so were the fear and anxiety associated with it. These, according to Flusberg et al. (2017), are the mappings that make the WAR figurative frame salient and effective in many communicative contexts. The number of war-related terms decreases substantially in the second interval, in which the first global decline in infections is observed. In this situation the WAR frame loses its salience and its efficacy. Finally, in the third interval the number of war-related terms is again significantly lower than in the previous interval. This interval, however, corresponds to the second global rise of infections, with the highest number of daily new infections since the start of the pandemic. It is therefore quite interesting to observe that during this period the WAR frame did not rise again, as a second battle against the (same) enemy. This can be interpreted in different ways. One interpretation is that the WAR frame was no longer a good fit in the collective mind, and that alternative figurative frames may have been preferred. Another interpretation is that figuration in general was no longer a good fit, and that people preferred to use literal language. We provide a possible explanation for this below.
Looking at our data, we can see a peak in war-related terms roughly between 25th May and 15th June 2020. Informed by our topic modeling analysis, we can relate this increase of war-related terms in the corpus to the murder of George Floyd and the BLM protests. Notably, while the occurrences of words within the MostCommon construct ("fight," "fighting," and "war") decrease over time, words within the EventRelated construct increase dramatically between the end of May and mid-June. These words are "riot," "violence," "military," and "soldiers," as displayed in Figure 12. Crucially, these words are not used metaphorically: they are used in their literal meaning, in relation to the BLM protests sparked by the murder of George Floyd. In the third interval of our timeline, therefore, literal riots, soldiers, and violence became conflated with the figurative war terms previously used to discuss the pandemic. The following examples show how the words "riot" and "violence," which are featured within the list of lexical entries for the WAR frame, are used in their literal meaning, in association with Covid-19:
"Covid was the setup *for the riots. Business closed…pre-covid to run out of business..post-covid so no one is around when rioting. Funny strange too the masks requirement in these cities giving cover…it's a warped strategy by Left/Dems to win election & or coup" 2nd July 2020
“Think I need to switch off Twitter for a while, my feed is just full of senseless violence. #BlackLivesMatter for sure, but the world has temporarily forgotten we are also still fighting a global epidemic #Covid19” 2nd July 2020
Given the cruelty, vividness, and reality of the literal events, it could be the case that figurative uses of war terms lost (at least temporarily) their efficacy. These are open empirical questions that should be investigated in the near future.
General Discussion and Conclusion
In this paper we investigated how the topics, the emotional valence (sentiment polarity), the subjectivity, and the usage of words related to the WAR frame change over time, in the discourse about Covid-19.
We reported a series of qualitative and quantitative observations in relation to the changes of topics, based on topic models that partitioned our corpus into different numbers of clusters. We showed that more fine-grained solutions are likely to capture themes that do not emerge when the analysis is performed on a less granular scale. This is quite interesting, because by reporting observations on models that vary in their degree of granularity, we were arguably able to capture events that were reported in international as well as national press. The implications of our findings may be relevant and informative for other studies that adopt topic modeling to explore large corpora of data. As a matter of fact, many studies tend to report analyses based on a single topic modeling solution that typically encompasses a small number of topics (e.g., Lazard et al., 2015; Tran and Lee, 2016; Miller et al., 2017; Vijaykumar et al., 2017). These might miss crucial information that can be captured by a more fine-grained topic model. In our case, more fine-grained analyses captured changes in the discourse and the entrance of new topics associated with Covid, such as the protests taking place in the USA, in which police and civilians were involved in riots ignited by protests against social injustice and racial discrimination.
The analysis of the polarity of the sentiments expressed and of the change in subjectivity through time has shown that the pandemic affected our thoughts in different ways during the timeframe considered in this study (and likely still does so today). On one hand it pushed us toward deeper introspection and the expression of beliefs and opinions, and on the other hand it induced negative feelings. The observed increase in subjectivity seems particularly worrisome, because it could be interpreted as a gradual loss of trust in the data, the facts, and the objective information that should be conveyed by the press. The media may then be perceived as increasingly less trustworthy. This trend, however, may be shaped by the medium from which we collected our data: the social media platform Twitter. It has recently been shown that fake news spreads six times faster than real news on Twitter (Vosoughi et al., 2018). A sense of helplessness and skepticism toward the news found on Twitter is therefore substantiated, and it could also explain the negative feelings that increasingly characterize the tweets about Covid on this same platform. While these observations remain rather exploratory, we suggest the following as a potential further investigation: do the increasing subjectivity and the increasingly negative polarity of individuals' tweets correlate with a gradual loss of trust in the news that they read on this platform?
Finally, we reported a diachronic change in the way the WAR frame is used to talk about Covid. While the most frequent words within this frame, namely "fight," "fighting," "war," and "attack," remain the most frequent words used in the Covid discourse overall, their relative frequency of occurrence seems to decrease, concurrently with the overall decrease of WAR framing through the progression of the pandemic within the timeframe analyzed. We suggest that this could indicate that the WAR frame, used to figuratively frame the pandemic, may not be a good fit for all the stages of its development. Moreover, new words within the WAR frame became frequently used in the third time interval analyzed, in which we observed the emergence of topics associated with the riots in the USA that followed the murder of George Floyd. The lexical entries within the WAR frame that peak in this interval are different from those that peak in the other intervals, and in the third interval war-related words are used in their literal sense. "Riot," "violence," "military," and "soldiers" are real: they characterize actual events reported in the media, and they are frequently used in the tweets posted during this third time interval in the corpus of Covid tweets. While further empirical investigation may be needed to support our interpretation, we suggest that, because the discourse about Covid is now characterized by the description of events for which words within the WAR frame are used in their literal sense, this could contribute to the overall decrease of metaphorical uses of war-related terms.
Data Availability Statement
The data for this study can be found in the Open Science Framework (OSF) at https://osf.io/v523j/?view_only=63f03e24d48c4d58af1793e0f04ce28b.
Ethics Statement
Written informed consent was not obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.
Author Contributions
PW and MB contributed to the conception and design of the study. PW collected the data, organized the database, and performed the statistical analysis. MB is responsible for writing the Introduction, the theoretical background and related work, the Results and Discussion subsections of "Which topics are discussed on Twitter in relation to Covid-19 and how do they change over time with the development of the pandemic?," and the Discussion subsections of "What valence emerges from the tweets about Covid-19 and how does it change over time?," "How does the subjectivity of the tweets (i.e., opinion-based focus, vs. the objective fact-based focus) change over time?," and "How does the figurative framing of WAR change over time?" PW is responsible for writing the Methods subsections of all four research questions, and the Results subsections of "What valence emerges from the tweets about Covid-19 and how does it change over time?," "How does the subjectivity of the tweets (i.e., opinion-based focus, vs. the objective fact-based focus) change over time?," and "How does the figurative framing of WAR change over time?" The Results subsection of "Which topics are discussed on Twitter in relation to Covid-19 and how do they change over time with the development of the pandemic?" and the General Discussion and Conclusion were written together by the authors, who both revised and approved the submitted version. All authors contributed to the article and approved the submitted version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors would like to thank Johannes Wiesner and Christian Burgers for their advice on statistical analysis.
Footnotes
1. ^https://developer.twitter.com/en/developer-terms/policy
2. ^https://ieee-dataport.org/open-access/coronavirus-covid-19-tweets-dataset
3. ^https://www.nltk.org/book/ch02.html#code-unusual
4. ^https://www.nltk.org/_modules/nltk/tokenize/casual.html#TweetTokenizer
5. ^https://osf.io/v523j/?view_only=63f03e24d48c4d58af1793e0f04ce28b
6. ^N = 12: https://bit.ly/35PkIMt, N = 20: https://bit.ly/3iLIlcC, N = 32: https://bit.ly/3hKqZvw
7. ^https://www.bls.gov/opub/ted/2020/unemployment-rate-rises-to-record-high-14-point-7-percent-in-april-2020.htm?view_full
8. ^https://www.bbc.com/news/world-us-canada-52407177
9. ^https://www.oneindia.com/india/key-takeaways-from-pm-modi-s-address-to-nation-on-june-30-3112740.html
10. ^https://www.cdc.gov/coronavirus/2019-ncov/downloads/php/CDC-Activities-Initiatives-for-COVID-19-Response.pdf
11. ^https://www.washingtonpost.com/world/asia_pacific/india-coronavirus-tablighi-jamaat-delhi/2020/04/02/abdc5af0-7386-11ea-ad9b-254ec99993bc_story.html
12. ^In this supplementary plot (https://cutt.ly/ekYrv3l), stored in the OSF online repository of this project, we show that tweets expressing highly negative and highly positive sentiments (right area of the plot) are also associated with high levels of subjectivity.
13. ^https://metaphor.icsi.berkeley.edu/pub/en/index.php/Frame:War
14. ^We are thankful to Christian Burgers for bringing this study to our attention; it inspired the analysis reported here.
References
Abd-Alrazaq, A., Alhuwail, D., Househ, M., Hamdi, M., and Shah, Z. (2020). Top concerns of tweeters during the COVID-19 pandemic: infoveillance study. J. Med. Int. Res. 22:e19016. doi: 10.2196/19016
Alves, A. L. F., de Souza Baptista, C., Firmino, A. A., de Oliveira, M. G., and de Figueirêdo, H. F. (2014). “Temporal analysis of sentiment in tweets: a case study with FIFA confederations cup in Brazil,” in International Conference on Database and Expert Systems Applications (Springer, Cham), 81–88. doi: 10.1007/978-3-319-10073-9_7
Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022. doi: 10.5555/944919.944937
Bruns, A., and Weller, K. (2016). “Twitter as a first draft of the present: and the challenges of preserving it for the future,” in Proceedings of the 8th ACM Conference on Web Science (Hannover), 183–189. doi: 10.1145/2908131.2908174
Burgers, C., Konijn, E. A., and Steen, G. J. (2016). Figurative framing: shaping public discourse through metaphor, hyperbole, and irony. Commun. Theory 26, 410–430. doi: 10.1111/comt.12096
Chakraborty, K., Bhatia, S., Bhattacharyya, S., Platos, J., Bag, R., and Hassanien, A. E. (2020). Sentiment Analysis of COVID-19 tweets by deep learning classifiers—a study to show how popularity is affecting accuracy in social media. Appl. Soft Comput. 97:106754. doi: 10.1016/j.asoc.2020.106754
Chatterjee, S., Deng, S., Liu, J., Shan, R., and Jiao, W. (2018). Classifying facts and opinions in Twitter messages: a deep learning-based approach. J. Business Analyt. 1, 29–39. doi: 10.1080/2573234X.2018.1506687
Chaturvedi, I., Cambria, E., Welsch, R. E., and Herrera, F. (2018). Distinguishing between facts and opinions for sentiment analysis: survey and challenges. Informat. Fusion 44, 65–77. doi: 10.1016/j.inffus.2017.12.006
Culotta, A. (2010). “Towards detecting influenza epidemics by analyzing Twitter messages,” in Proceedings of the First Workshop on Social Media Analytics (Washington DC: District of Columbia, USA), 115–122. doi: 10.1145/1964858.1964874
Damstra, A., and Vliegenthart, R. (2018). (Un) covering the economic crisis? Over-time and inter-media differences in salience and framing. Journalism Studies, 19, 983–1003. doi: 10.1080/1461670X.2016.1246377
De Smedt, T., and Daelemans, W. (2012). Pattern for python. J. Mach. Learn. Res. 13, 2063–2067. doi: 10.5555/2188385.2343710
Entman, R. (1993). Framing: toward clarification of a fractured paradigm. J. Commun. 43, 51–58. doi: 10.1111/j.1460-2466.1993.tb01304.x
Flusberg, S. J., Matlock, T., and Thibodeau, P. H. (2017). Metaphors for the war (or race) against climate change. Environ. Commun. 11, 769–783. doi: 10.1080/17524032.2017.1289111
Ghosh, A., and Veale, T. (2016). “Fracking sarcasm using neural network,” in Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (San Diego, CA). 161–169. doi: 10.18653/v1/W16-0425
Gomide, J., Veloso, A., Meira, W. Jr, Almeida, V., Benevenuto, F., Ferraz, F., et al. (2011). “Dengue surveillance based on a computational model of spatio-temporal locality of Twitter,” in Proceedings of the 3rd International Web Science Conference (Koblenz). 1–8. doi: 10.1145/2527031.2527049
Hawkins, J. B., Brownstein, J. S., Tuli, G., Runels, T., Broecker, K., Nsoesie, E. O., et al. (2016). Measuring patient-perceived quality of care in US hospitals using Twitter. BMJ Quality Safety 25, 404–413. doi: 10.1136/bmjqs-2015-004309
Hendricks, R. K., Demjén, Z., Semino, E., and Boroditsky, L. (2018). Emotional implications of metaphor: consequences of metaphor framing for mindset about cancer. Metaphor. Symbol 33, 267–279. doi: 10.1080/10926488.2018.1549835
Hu, T., She, B., Duan, L., Yue, H., and Clunis, J. (2019). A systematic spatial and temporal sentiment analysis on geo-tweets. IEEE Access 8, 8658–8667. doi: 10.1109/ACCESS.2019.2961100
Lamsal, R. (2020). Data from: Coronavirus (COVID-19) Tweets Dataset. IEEE Dataport. Available online at: http://dx.doi.org/10.21227/781w-ef42
Lazard, A. J., Scheinfeld, E., Bernhardt, J. M., Wilcox, G. B., and Suran, M. (2015). Detecting themes of public concern: a text mining analysis of the Centers for Disease Control and Prevention's Ebola live Twitter chat. Am. J. Infect. Control 43, 1109–1111. doi: 10.1016/j.ajic.2015.05.025
Li, I., Li, Y., Li, T., Alvarez-Napagao, S., and Garcia, D. (2020). What are We Depressed About When We Talk About Covid19: Mental Health Analysis on Tweets Using Natural Language Processing. arXiv preprint arXiv:2004.10899. Available online at: https://arxiv.org/abs/2004.10899 (accessed at: January 9, 2021). doi: 10.1007/978-3-030-63799-6_27
Li, R., Lei, K. H., Khadiwala, R., and Chang, K. C. C. (2012). “Tedas: a twitter-based event detection and analysis system,” in 2012 IEEE 28th International Conference on Data Engineering. 1273–1276. Arlington, VA: IEEE. doi: 10.1109/ICDE.2012.125
Liu, B. (2010). “Sentiment analysis and subjectivity,” in Handbook of Natural Language Processing Vol. 2, eds N. Indurkhya, and F. J. Damerau (New York, NY: Chapman and Hall/CRC), 627–666. doi: 10.1201/9781420085938
Loria, S., Keen, P., Honnibal, M., Yankovsky, R., Karesh, D., and Dempsey, E. (2014). TextBlob: simplified text processing.
Lwin, M. O., Lu, J., Sheldenkar, A., Schulz, P. J., Shin, W., Gupta, R., et al. (2020). Global sentiments surrounding the COVID-19 pandemic on Twitter: analysis of Twitter trends. JMIR Public Health Surveillance 6:e19447. doi: 10.2196/19447
Martín, Y., Li, Z., and Cutter, S. L. (2017). Leveraging Twitter to gauge evacuation compliance: spatiotemporal analysis of Hurricane Matthew. PLoS ONE 12:e0181701. doi: 10.1371/journal.pone.0181701
Mathur, A., Kubde, P., and Vaidya, S. (2020). “Emotional Analysis using Twitter Data during Pandemic Situation: COVID-19,” in 2020 5th International Conference on Communication and Electronics Systems (ICCES). Coimbatore: IEEE, 845–848. doi: 10.1109/ICCES48766.2020.9138079
Miller, M., Banerjee, T., Muppalla, R., Romine, W., and Sheth, A. (2017). What are people tweeting about Zika? An exploratory study concerning its symptoms, treatment, transmission, and prevention. JMIR Public Health Surveillance 3:e38. doi: 10.2196/publichealth.7157
Mollema, L., Harmsen, I. A., Broekhuizen, E., Clijnk, R., De Melker, H., Paulussen, T., et al. (2015). Disease detection or public opinion reflection? Content analysis of tweets, other social media, and online newspapers during the measles outbreak in The Netherlands in 2013. J. Med. Int. Res. 17:e128. doi: 10.2196/jmir.3863
Park, S., Han, S., Kim, J., Molaie, M. M., Vu, H. D., Singh, K., et al. (2020). Risk Communication in Asian Countries: COVID-19 Discourse on Twitter. arXiv preprint arXiv:2006.12218. Available online at: https://arxiv.org/abs/2006.12218 (accessed at: January 9, 2021).
Paul, D., Li, F., Teja, M. K., Yu, X., and Frost, R. (2017). “Compass: Spatio temporal sentiment analysis of US election what twitter says!,” in Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Halifax, NS). 1585–1594. doi: 10.1145/3097983.3098053
Pruss, D., Fujinuma, Y., Daughton, A. R., Paul, M. J., Arnot, B., Albers Szafir, D., et al. (2019). Zika discourse in the Americas: a multilingual topic analysis of Twitter. PLoS ONE 14:e0216922. doi: 10.1371/journal.pone.0216922
Rehurek, R., and Sojka, P. (2010). "Software framework for topic modeling with large corpora," in Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks (Valletta).
Reyes, A., Rosso, P., and Veale, T. (2013). A multidimensional approach for detecting irony in twitter. Language Resources Evaluat. 47, 239–268. doi: 10.1007/s10579-012-9196-x
Reynard, D., and Shirgaokar, M. (2019). Harnessing the power of machine learning: can Twitter data be useful in guiding resource allocation decisions during a natural disaster? Transportation Res. Part D 77, 449–463. doi: 10.1016/j.trd.2019.03.002
Semino, E., Demjén, Z., Demmen, J., Koller, V., Payne, S., Hardie, A., et al. (2017). The online use of Violence and Journey metaphors by patients with cancer, as compared with health professionals: a mixed methods study. BMJ Support. Palliative Care 7, 60–66. doi: 10.1136/bmjspcare-2014-000785
Sievert, C., and Shirley, K. (2014). “LDAvis: a method for visualizing and interpreting topics,” in Proceedings of the Workshop on Interactive Language Learning, Visualization, and Interfaces (Baltimore, MD). 63–70. doi: 10.3115/v1/W14-3110
Statista (2020). Available online at: https://www.statista.com/ (accessed at: February 10, 2021).
Stone, B., Dennis, S., and Kwantes, P. J. (2011). Comparing methods for single paragraph similarity analysis. Top. Cognit. Sci. 3, 92–122. doi: 10.1111/j.1756-8765.2010.01108.x
Syed, S., and Spruit, M. (2017). “Full-text or abstract? Examining topic coherence scores using latent dirichlet allocation,” in 2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA). IEEE. 165–174. doi: 10.1109/DSAA.2017.61
Thibodeau, P. H., Hendricks, R. K., and Boroditsky, L. (2017). How linguistic metaphor scaffolds reasoning. Trends Cognit. Sci. 21, 852–863. doi: 10.1016/j.tics.2017.07.001
Tran, T., and Lee, K. (2016). “Understanding citizen reactions and Ebola-related information propagation on social media,” in 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE. 106–111. doi: 10.1109/ASONAM.2016.7752221
Vijaykumar, S., Nowak, G., Himelboim, I., and Jin, Y. (2017). Virtual Zika transmission after the first U.S. case: who said what and how it spread on Twitter. Am. J. Infect. Control 46, 549–557. doi: 10.1016/j.ajic.2017.10.015
Vosoughi, S., Roy, D., and Aral, S. (2018). The spread of true and false news online. Science 359, 1146–1151. doi: 10.1126/science.aap9559
Wicke, P., and Bolognesi, M. M. (2020). Framing COVID-19: how we conceptualize and discuss the pandemic on Twitter. PLoS ONE 15:e0240010. doi: 10.1371/journal.pone.0240010
Wirz, C. D., Xenos, M. A., Brossard, D., Scheufele, D., Chung, J. H., and Massarani, L. (2018). Rethinking social amplification of risk: Social media and Zika in three languages. Risk Analysis 38, 2599–2624. doi: 10.1111/risa.13228
World Health Organization (2020). Available online at: https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed at: February 10, 2021).
Xue, J., Chen, J., Hu, R., Chen, C., Zheng, C., Su, Y., et al. (2020). Twitter discussions and emotions about the COVID-19 pandemic: machine learning approach. J. Med. Int. Res. 22:e20550. doi: 10.2196/20550
Yeo, J., Knox, C. C., and Hu, Q. (2020). Disaster recovery communication in the digital era: social media and the 2016 Southern Louisiana flood. Risk Analysis. doi: 10.1111/risa.13652
Yeo, J., Knox, C. C., and Jung, K. (2018). Unveiling cultures in emergency response communication networks on social media: Following the 2016 Louisiana floods. Quality Quantity 52, 519–535. doi: 10.1007/s11135-017-0595-3
Zhou, J., Yang, S., Xiao, C., and Chen, F. (2020). Examination of Community Sentiment Dynamics Due to Covid-19 Pandemic: A Case Study From Australia. arXiv preprint arXiv:2006.12185. Available online at: https://arxiv.org/abs/2006.12185 (accessed at: January 9, 2021).
Keywords: twitter, corpus analysis, covid-19, topic modeling, sentiment analysis, figurative framing
Citation: Wicke P and Bolognesi MM (2021) Covid-19 Discourse on Twitter: How the Topics, Sentiments, Subjectivity, and Figurative Frames Changed Over Time. Front. Commun. 6:651997. doi: 10.3389/fcomm.2021.651997
Received: 11 January 2021; Accepted: 22 February 2021;
Published: 16 March 2021.
Edited by: Natalie Danielle Baker, Sam Houston State University, United States
Reviewed by: Claire Connolly Knox, University of Central Florida, United States; Qian Hu, University of Central Florida, United States
Copyright © 2021 Wicke and Bolognesi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Marianna M. Bolognesi, m.bolognesi@unibo.it