- 1Department of Management Science and Engineering, Business School, Beijing Institute of Technology, Zhuhai, China
- 2Research Base of Cross-Border Flow Risk and Governance, Beijing Institute of Technology, Zhuhai, China
The prevention and control of the coronavirus disease 2019 (COVID-19) epidemic in China has entered a phase of normalization. The basis for evaluating and improving public health strategies is understanding the emotions and concerns of the public. This study establishes a fine-grained emotion-classification model to annotate the emotions of 32,698 Sina Weibo posts related to COVID-19 prevention and control from July 2022 to August 2022. The Dalian University of Technology (DLUT) emotion-classification system was adjusted to form four pairs (eight categories) of bidirectional emotions: good-disgust, joy-sadness, anger-fear, and surprise-anticipation. A lexicon-based method was proposed to classify the emotions of Weibo posts. Based on the selected Weibo posts, the present study analyzed the Chinese public's sentiments and emotions. The results showed that positive sentiment accounted for 51%, negative sentiment accounted for 24%, and neutral sentiment accounted for 25%. Positive sentiments were dominated by good and joy emotions, and negative sentiments were dominated by fear and disgust emotions. The proportion of positive sentiments on official Weibo (accounts belonging to government departments and official media) is significantly higher than that on personal Weibo. Official Weibo users displayed a weak guiding effect on personal users in terms of positive sentiment and the two groups of users were almost completely synchronized in terms of negative sentiment. The linear discriminant analysis (LDA) was performed on the two negative emotions of fear and disgust in the personal posts. The present study found that the emotion of fear was mainly related to COVID-19 infection and death, control of people with positive nucleic acid tests, and the outbreak of local epidemic, while the emotion of disgust was mainly related to the long-term existence of the epidemic, the cost of nucleic acid tests, non-implementation of prevention and control measures, and the occurrence of foreign epidemics. These findings suggest that Chinese attitudes toward epidemic prevention and control are positive and optimistic; however, there is also a notable proportion of fear and disgust. It is expected that this study will help public health administrators to evaluate the effectiveness of possible countermeasures and work toward precise prevention and control of the COVID-19 epidemic.
1. Introduction
The coronavirus disease 2019 (COVID-19) pandemic poses a serious threat to human life and health (WHO, 2020; Mallah et al., 2021). Globally, COVID-19 has impacted not only people's physical health (Sohail et al., 2022; Yu et al., 2022a) but also their mental health (Holmes et al., 2020). The pandemic has been going on for more than two years, and localized outbreaks take place from time to time, significantly affecting people's mental health (Ren and Guo, 2020; Cui et al., 2022). Epidemic prevention and control measures, such as wearing masks, nucleic acid testing, vaccination, quarantine, and lockdowns, also have a negative impact on people's mental health (Budimir et al., 2021; Jin et al., 2021; Lin and Fu, 2022).
In China, the COVID-19 epidemic has entered a normalization phase, with a variety of prevention and control measures. In this normalization phase, how widespread are negative emotions among the Chinese public? What are the primary negative emotions? What are the causes of these negative emotions? Answering these questions is important for evaluating and improving prevention and control strategies. Sina Weibo, a microblog, is the most influential social media platform in China, and it is an important venue for the public to express their opinions and emotions. Using the emotion analysis and topic mining on Weibo, we can understand the emotions and concerns of the public and may therefore be able to help improve policies for the prevention and control of the COVID-19 epidemic.
Since the outbreak of the COVID-19 epidemic, researchers have analyzed social media and found that the epidemic has had a negative psychological impact on the public (Tan et al., 2021; Wang J. et al., 2022). Studies found that the factors that affect public sentiment include the number of COVID-19 cases (Da and Yang, 2020), gender (Naseem et al., 2021), and high-impact users (Xie et al., 2021). Prevention and control measures also have an impact on people's psychology. The subtypes of prevention and control strategies have different effects on sentiments and concerns (Sukhwal and Kankanhalli, 2022). The relationship between epidemic prevention measures and people's sentiments is complex Chum et al. (2021). Ogbuokiri et al. (2022) found that, in three different South African cities, citizens had different attitudes toward vaccines.
The sentiment analysis of social media can serve as a monitoring tool (Crocamo et al., 2021) and can help us understand public sentiment. However, it is not sufficient to understand emotions and their reasons. Emotion is a fine-grained sentiment and has an important impact on human behavior and decision-making (Hockenbury et al., 2016). There are various theories of emotion in psychology (Brady, 2021). Some psychologists believe that humans experience several basic emotions (Levenson, 2011). Ekman (1992) proposed six emotions: joy, sadness, anger, fear, disgust, and surprise. Plutchik (2001) proposed the emotion wheel model, which suggests that there are eight (four pairs) two-way emotions: joy and sadness, anger and fear, trust and disgust, and surprise and anticipation. The Ortony, Clore, and Collins (OCC) model defines 22 classes of basic emotions (Ortony et al., 1990). Parrott (2000) proposed a three-layer emotion model that has six primary emotions: love, joy, surprise, anger, sadness, and fear. Other psychologists argued that emotions are not discrete and have proposed a dimensional emotion model. The most influential are Russell's two-dimensional (2D) annular model and Plutchik's parabolic cone space model. Recently, Cowen extracted 27 emotions based on self-reports and believed that emotions were not clearly separated but gradually changed (Cowen and Keltner, 2017).
Social media primarily uses text and symbols. The emotion analysis of a text adopts a category emotion model that defines several basic emotions (Acheampong et al., 2020). Several studies have conducted emotion analyses on social media using different emotion-classification methods. Oliveira et al. (2022) analyzed tweets labeled with the emotions of anger, sadness, optimism, and joy. Ye et al. (2020) adopted Ekman's six basic emotions. Shen et al. (2021) adopted four categories of emotion: anger, fear, encouragement, and hope. Shi et al. (2022) classified emotions into seven categories. Wang H. et al. (2022) used the DLUT sentiment ontology, which includes seven types of emotions.
Text emotion analysis is a multi-classification problem (Murthy and Kumar, 2021). A lexicon-based approach can achieve high accuracy if the lexicon is complete, and the rules are well designed. In recent years, machine learning and deep learning models, such as Recurrent Neural Network (RNN), Long-Short Term Memory (LSTM), and Bidirectional Encoder Representations from Transformers (BERT), have also been used for text emotion analysis (Xu et al., 2020; Yu et al., 2022b). The commonly used BERT, a pre-trained model that needs to be fine-tuned on downstream tasks, can obtain better classification results (Acheampong et al., 2021). There are some BERT models pre-trained for the Chinese corpus, but it is difficult to fine-tune them due to the lack of a fine-grained emotion-labeled Chinese Weibo corpus (Xu et al., 2020). The lexicon-based method depends mainly on the lexicon and rules. English emotion lexicons, such as WorldNet-Affect (Strapparava and Valitutti, 2004), EmoSenticNet (Poria et al., 2014), and NRC word-emotion lexicons (Mohammad and Turney, 2013), are mature. The Chinese lexicons mainly include the HowNet, NTUSD, and Boson NLP. However, these lexicons provide only the positive and negative polarities of words. The DLUT sentiment ontology library adds “good” to Ekman's six basic emotions, and the emotions are divided into seven categories (Xu et al., 2008) that also give the intensity for each word. However, the DLUT has not been updated for a long time, lacks popular words from the Internet, and has some defects in emotional classification.
In summary, a few previous studies focused on the distribution of people's emotions during the normalization of the COVID-19 epidemic. The emotion classification of Chinese Weibo was also not reasonable. The lexicon-based sentiment classification method does not consider the synthesis of multiple sentences in microblogs. A few studies have analyzed whether official microblogs play a role in guiding public emotions.
This study aimed to explore the sentiments, emotions, and concerns of the Chinese public during the normalization phase of the COVID-19 prevention and control. First, a web crawler was used to obtain Weibo posts related to the prevention and control of COVID-19, and the data were preprocessed. Then, this study proposed an emotion annotation method that considers the role of sentences in different positions and the relationship between the emotion word and its modifiers. Finally, this study presented the distribution of sentiments and emotions of the Chinese public, analyzed the relationship between official and personal users, and performed LDA topic mining on negative emotion posts to understand public concerns. It is hoped that this study will help improve the policy of COVID-19 prevention and control and alleviate the negative emotions of the public.
2. Materials and methods
2.1. Data acquisition
During the normalization phase of the COVID-19 epidemic prevention and control, China adopted a dynamic clearing strategy, with different measures implemented in different regions according to each local situation. From July 2022 to August 2022, most of China was free of the epidemic, but a few regions such as Xinjiang, Hainan, and Tibet had localized epidemic, which is a typical scenario during the normalization phase.
Weibo posts related to COVID-19 prevention and control were collected using the Sina Weibo search template in the Octopus web crawler. The keywords were set as “New Coronavirus” OR “epidemic” OR “COVID-19” OR “prevention and control.” Because the Sina Weibo search engine provides only 50 pages of results at a time, this study conducted multiple data crawls, combined the results, de-duplicated them according to the publisher and content, and finally obtained a total of 32,698 posts. The fields of the dataset included publisher, content, publisher link, posting time, source, number of comments, number of retweets, number of likes, and current time.
2.2. Data preprocessing
Some fields in the dataset could not be used directly, and there was an abundance of irrelevant content in the posts that needed to be cleaned. In this study, Python programs were written to preprocess the data, such as the calculation of the posting date and time, user ID extraction, content cleaning, and user type determination.
The “posting time” in the original dataset is represented by “x seconds ago, “x minutes ago,” and “Today HH:MM.” In this study, the date and time of posting, based on the fields “current time” and “posting time., were calculated.
Due to privacy protection measures, the publishers are anonymous. To obtain basic information about the user, their user ID was extracted from the field “publisher link” using regular expression. For example, if the publisher link was https://weibo.com/7412756093?refer_flag=1001030103, the user ID “7412756093” was parsed using Python package re.
A considerable amount of content was mentioned in the Weibo posts that was not related to the sentiment analysis and had to be cleaned. In this study, irrelevant content, such as Weibo topic, user name, and URL (Uniform Resource Locator) using regular expressions, was eliminated and then stop words removed using the Baidu stop words dictionary.
Weibo users were divided into two categories: official users and personal users. Posts released by official media and government departments are often announcements on epidemic and policy propaganda. Official media includes television (TV) stations, radio stations, newspapers, magazines, and portal websites. Government departments, such as the Health and Welfare Commission and CDC, often publish Weibo posts as well. To determine the type of user, basic user information is crawled according to the user ID. The user dataset contains fields such as “username,” “brief description,” “certification,” and “industry.” The user type can be inferred from the fields “username” and “brief description.” In this study, a list of keywords that could be used to describe official users was defined. A Python program uses keyword-related regular expressions to find official users and then correct them manually.
After preprocessing, the fields of the dataset include publisher, user ID, user type, content, cleaned content, posting date, posting time, posting source, number of comments, retweets, and likes.
2.3. Emotion classification
Weibo posts may be long and may contain several sentences. First, the post is split into sentences using separators “. °! ?”. Sentences in different positions may play different roles in expressing emotions, and the first and last sentences may be more important. If the number of sentences n ≤ 2, let the weight of each sentence ; otherwise, for n>2, the weights of the first and last sentences are increased, and the weight of each sentence is calculated as follows:
An emotional score was calculated for each sentence. The sentence was matched with the emotion lexicon to extract the emotion word, and the emotion type of the sentence was determined by the emotion type of the emotion word. Dependent syntactic analysis was performed with the emotion word as the central word to obtain the emotion unit. The emotion unit is centered on the emotion word and may contain some degree of adverbs, negation words, and gerunds. Assuming that the type of emotion word is j, its intensity is q, the number of negation words is m, the number of degree adverbs or gerunds (which play an enhancing or weakening role) is l, and the adjustment factors for degree adverbs or gerunds are cv, v = 1, 2, ..., l, then, the score for emotion type j is calculated as follows:
The emotion unit is scored 0 for other emotion types.
One or more negative words may be present in a sentence, due to which the emotion score may be negative, and therefore needs to be converted to reverse emotion. However, seven emotions are mentioned in the DLUT ontology, among which good and disgust, happy and sadness, and anger and fear can be combined into two-way emotions, but there is no reverse emotion for the emotion of surprise. According to Plutchik's emotion wheel model, if the emotion type is surprise and the intensity score is negative, it is converted to anticipation. Therefore, emotions were divided into eight categories (four pairs): good-disgust, joy-sadness, anger-fear, and surprise-anticipation.
If there are multiple emotion units in a sentence, each with a different emotion type, the scores of all emotion units for each type of emotion are combined to obtain the emotion vector of the sentence as follows:
If the Weibo post contains multiple sentences, the emotion vector of each sentence is weighted and summed up to obtain the emotion vector of the post as follows:
The emotion type with the highest score in the vector 𝔼 was selected as the emotion type of the post, . If the maximum score , where δ is a predefined threshold, the emotion is classified as neutral.
2.4. Evaluation of the model
The present study randomly selected 2,000 posts from the Sina Weibo corpus for manual annotation to determine the sentiment and emotion. The emotion-classification algorithm explained was applied to the test set, and the performance metrics of the model were obtained by comparing its predictions with the annotated labels.
For sentiment classification, the accuracy of the model was 0.80. Performance metrics such as precision, recall, and F1-score are shown in Table 1. This is a multi-classification problem, and in this study macro-averaging was used to obtain the overall performance. The macro-average is an arithmetic average of the performance of each category. The macro precision was 0.83, macro recall was 0.79, and macro F1-score was 0.79.
The accuracy of the model for emotion classification was 0.78. The performance metrics of precision, recall, and F1-score are listed in Table 2. The macro precision was 0.81, macro recall was 0.66, and macro F1-score was 0.69.
2.5. Topic mining
Several popular topic models, including Latent Semantic Analysis (LSA), Probabilistic Latent Semantic Analysis (PLSA), and Latent Dirichlet Allocation (LDA). The LSA model is simple but lacks a solid foundation in mathematical statistics and requires Singular Value Decomposition (SVD) operations, which are inefficient. PLSA introduces a probabilistic model and uses the maximum likelihood for solving. However, the parameters to be estimated increase linearly with document size. The LDA is a model proposed by Blei et al. (2003). The LDA is based on the Bayesian theory, with a clear internal structure, and a number of parameters that is independent of the corpus; it is suitable for large-scale corpora. In the present study, LDA topic mining was implemented using the gensim (Ehek and Sojka, 2010) package in Python and extracted public concerns from the Weibo posts. LDA is a widely used topic mining method; however, it is difficult to determine the optimal number of topics. Blei et al. suggested selecting the number of topics with a minimal amount of perplexity. However, the larger the number of topics, the smaller the perplexity, which often leads to a very large optimal number of topics. Some scholars have used the elbow method to select the number of topics with an obvious turn in perplexity as the optimal number. Another frequently used metric is coherence, which measures the semantic similarity of keywords within the same topic. The greater the coherence, the higher the cohesion within the topic. However, coherence does not reflect whether different topics can be separated sufficiently.
The present study used both perplexity and coherence, combined with LDA visualization, to determine the optimal number of topics. First, the maximum number of topics was determined using the perplexity elbow method. Then, the possible optimal number of topics was selected based on coherence. Finally, the optimal number of topics was determined by observing the separation of topics using pyLDAvis (Python library for interactive topic model visualization) (Sievert and Shirley, 2014).
3. Results
3.1. Public sentiments and emotions
Emotions were annotated for the 32,698 Weibo posts. The proportion of each emotion was then calculated. To perform sentiment analysis, this study categorized the four emotions, good, joy, surprise, and anticipation, as positive sentiments, and categorized the other four emotions, fear, disgust, sadness, and anger, as negative sentiments. The sentiment and emotion distribution of Weibo posts during the epidemic normalization phase is shown in Table 3.
Positive sentiments accounted for 50.83% of the posts, neutral for 25.37%, and negative for 23.80%. It not only showed that more than 50% of the Weibo posts express positive sentiments, but also showed that close to one-fourth express negative sentiments, which is worth noting. The main emotions in positive sentiments were good (38.27%) and joy (12.13%), indicating that many Internet users were satisfied and optimistic about epidemic prevention and control. The main emotions in negative sentiments were fear (11.11%) and disgust (8.06%), indicating that some Internet users express fear and disgust, while a small number of them feel sadness (2.72%) and anger (1.91%).
3.2. Sentiment difference between official Weibo and personal Weibo
During the normalization phase of the COVID-19 prevention and control, the government takes responsibility for epidemic prevention and control, and the official media complies with reporting norms, becoming more cautious when publishing on Weibo. Personal users are free to publish and express their emotions and opinions directly. In this study, all posts were divided into two groups based on the type of publisher: official Weibo posts and personal Weibo posts. There were 6,411 official posts and 26,155 personal posts (a small number of posts were excluded due to difficulty in determining the type of publisher). A sentiment analysis was performed on the two groups of posts, and the sentiment and emotion distribution of the two groups is shown in Table 4.
On official Weibo, positive sentiment accounted for 60.7% of the posts, neutral sentiment accounted for 14% of the posts, and negative sentiment accounted for 25.3% of the posts. For personal Weibo, positive sentiment accounted for 48.4% of the posts, neutral sentiment accounted for 28.1% of the posts, and negative sentiment accounted for 23.5% of the posts. A comparison of sentiments between the two groups is shown in Figure 1. A chi-square test was conducted on the distribution of sentiments, with P < 0.05, indicating a significant difference between the two groups. The percentage of positive sentiments in the official group was significantly higher than that in the personal group, and the percentages of negative sentiments were similar in both groups, indicating that official users were more positive and optimistic toward the COVID-19 epidemic.
3.3. Guiding effect of official users
In the normalized phase of epidemic prevention and control, do official users have a guiding effect on personal users? If so, is the guiding effect strong or weak? To explore the relationship between official and personal users, two main positive emotions, good and joy, and two main negative emotions, fear and disgust, were selected in this study.
The present study counted the number of daily posts with the emotion of good on official and personal Weibo and plotted the time series, as shown in Figure 2. Similarly, the time series of daily posts with the emotion of joy is shown in Figure 3. To facilitate the comparison of trends, the number of official posts was scaled up.
Figure 2 reveals that most of the time, the trends for official and personal posts were the same, but in a few periods, official posts reached a peak before personal posts. For example, official posts reached a peak on July 2, while personal posts reached a peak on July 3. This phenomenon exists for both good and joy emotions. It is suggested that, in terms of positive sentiment, official users have a guiding effect on personal users, but the effect is weak.
Is there a similar phenomenon for negative emotions? In this study, the time series of official and personal posts with the emotions of fear and disgust, respectively, were plotted, as shown in Figures 4, 5, and found that trends for official and personal posts were almost identical. This is clearly different from the time series of good and joy emotions. It is presumed that when it comes to negative emotions, official users have no guiding effect on personal users.
3.4. Topics related to negative emotions
The percentage of negative sentiments in all of the Weibo posts was 23.80%, which is noteworthy. Negative sentiments may trigger risk events that require further analysis. Topics associated with negative sentiments are helpful for understanding public concerns. This study performed LDA topic mining on the personal posts with emotions of fear (9.38%) and disgust (9.07%), which accounted for a relatively large proportion of negative posts.
3.4.1. Topics related to emotion of fear
The number of topics in the LDA model was a hyperparameter that needed to be selected in advance. First, the topic number was set to range from 1 to 20 to obtain the relationship between the number of topics and perplexity, as shown in Figure 6. The elbow method was used to determine the upper limit of the number of topics, at 12. The coherence is then calculated for each topic number from 1 to 12. The relationship between the number of topics and the coherence is shown in Figure 7. Topic number 5, with the highest coherence, was selected for the LDA.
The present study visualized the LDA results using pyLDAvis, which is a dynamic interaction graph. The principal component analysis (PCA) of the two-dimensional (2D) projections of the five topics are shown in Figure 8. These five topics are clearly separated, and their keywords are significantly different. Then, the optimal number of topics is determined as 5.
The number of topics was set to five and the LDA performed on the personal posts with the emotion of fear. The keywords for the five topics are listed in Table 5, in descending order of weight.
Based on the keyword list for each topic, combined with qualitative analysis of typical Weibo posts, the topics associated with the emotion of fear were refined into this list:
• COVID-19 spread and mutation.
• The increase in confirmed cases within and outside the country.
• Infection and death caused by COVID-19.
• Control of personnel with positive nucleic acid tests.
• Local outbreak prevention and control.
These five topics have a semantic overlap and therefore can be grouped further into three topics:
• Infection and death due to the COVID-19 epidemic.
• Control of personnel with positive nucleic acid tests.
• Epidemics within and outside the country, especially locally.
3.4.2. Topics related to emotion of disgust
The LDA was also performed on personal posts expressing the emotion of disgust. First, the upper limit of the number of topics was set to 12 (see Figure 9). Then, the number six was selected as the optimal number of topics based on the coherence metric (see Figure 10).
The LDA was performed with topic number six and visualized using pyLDAvis. It was found that topics 2 and 3 overlapped on the PCA projection (see Figure 11), hence they were combined into one topic, and the topic number was adjusted to 5.
The the number of topics was set to five, the LDA performed, and a list of keywords obtained for each topic, as shown in Table 6.
According to the keywords of each topic, combined with typical Weibo posts, for qualitative analysis, the five topics associated with disgust were extracted as follows:
• Disgust with the outbreak, and the prevention and control of the epidemic.
• Nucleic acid testing charges.
• Someone is suspected of a crime that hinders the prevention and control of the epidemic.
• Infections outside the country, especially in the United States of America.
• Failure to implement epidemic prevention and control measures.
Summarizing these five topics further, the topics related to disgust are:
• Persistence of the COVID-19 epidemic.
• Non-compliance and non-implementation of prevention and control measures.
• Costs for nucleic acid testing.
• Outbreak of COVID-19 outside the country.
4. Discussions
The present study collected Sina Weibo posts during the normalization phase of the COVID-19 epidemic prevention and control. A fine-grained emotion-classification algorithm was designed with consideration given to the positions of sentences and syntactic structure of the emotion unit. The algorithm was used to label the emotions of the Weibo posts. Based on the labeled data, the present study analyzed the sentiments of the Chinese public. The outcome showed that most of the population was positively optimistic about the epidemic, with good and joy being the main emotions. However, there is also a certain proportion of negative sentiment, with the main negative emotions being fear and disgust. The LDA topic mining was conducted on personal Weibo posts containing negative emotions. The results showed that the negative emotions of the public were mainly related to the long-term existence of the epidemic, control of personnel with positive nucleic acid test results, non-implementation of prevention and control measures, and local outbreaks.
In this study, a lexicon-based emotion-classification method was developed. The emotion ontology of DLUT was expanded to form four pairs of bidirectional emotions. It solved the problem that the DLUT emotion system cannot handle when the emotion is surprise, and when the emotion's score is negative. This emotion-classification system may be useful for emotion analyses.
The present study used the LDA for topic analysis of negative sentiment posts. The LDA is a popular topic-mining method (Xie et al., 2021; Ogbuokiri et al., 2022; Oliveira et al., 2022; Wang H. et al., 2022). However, it was difficult to determine the optimal number of topics. Ogbuokiri et al. (2022) used Jaccard similarity to compare the similarity between two topics and then filtered the topics based on the metric of coherence. Due to the high dimensionality of the text data, results may differ when different thresholds are used. The present study used perplexity and coherence to initially select the number of topics, and then determined the optimal number of topics using pyLDAvis. This approach ensures a high degree of semantic consistency within topics as well as a good degree of topic separation.
The present study showed that positive sentiment accounted for more than 50% of the total sentiment between July 2022 and August 2022. Pan et al. (2021) studied the comment data of the People's Daily account on Sina Weibo, where positive sentiments accounted for more than 50%, which is close to our result. Wang J. et al. (2022) showed that when the World Health Organization (WHO) declared a global pandemic on 11 March 2020, the sentiment score of the global population dropped significantly, dominated by negative sentiments, but largely recovered to pre-pandemic levels in May 2020. Tan et al. (2021) reported similar findings, with a significant drop in public sentiment during the Wuhan 2020 epidemic. However, after the Wuhan 2020 epidemic ended, the overall sentiment of Chinese Weibo users recovered (although not to pre-epidemic 2020 levels), with sentiment scores fluctuating around 0.5, and with no correlation to new cases. These studies imply that despite the persistence of the epidemic, public sentiment gradually recovered after the end of the outbreak. The COVID-19 epidemic has now existed for more than 2 years, and with no serious outbreaks from July 2022 to August 2022, people's psychological resilience has largely recovered.
The present study showed that official users had a significantly higher positive sentiment than personal users and that official posts had a weak guiding effect on personal users in terms of positive sentiment. Some studies have focused on the roles of different users in social media. Pan et al. (2021) revealed that big V users were more important. However, there are not many studies on the interaction between official users and personal users. The government and official media play a very important role in epidemic prevention and control. Our study could help the government evaluate and improve the role and value of official media.
The present study showed that nearly one-fourth of the Weibo posts contained negative emotions, mainly fear and disgust. Fear was mainly related to infections and deaths due to the epidemic, the presence of the epidemic, and the control of people who tested positive for nucleic acids. Some other studies also noted that social media users expressed fear during the epidemic, but did not analyze the reasons behind the emotion. Oliveira et al. (2022) analyzed English Twitter feeds from 12 countries from January 2020 to April 2021, and found that the main negative emotions were fear and sadness. Wang H. et al. (2022) studied public sentiment related to the Omicron variant on Sina Weibo and found that fear accounted for the largest proportion of negative emotions. These studies suggest that fear may be one of the more important negative emotions during an epidemic. A few studies have focused on the emotion of disgust during epidemics. The present study revealed the percentage of disgust emotions and the topics related to disgust. Due to the prolonged existence of the COVID-19 epidemic, public disgust with epidemics is an important and noteworthy phenomenon. Failure of public health authorities to alleviate the disgust of the population may lead to the failure of prevention and control measures.
5. Conclusions
In this study, Sina Weibo posts were collected during the normalization phase of the COVID-19 epidemic prevention and control. The present study conducted an emotion analysis of the Weibo posts, which can reveal people's psychological feelings in more detail than a sentiment analysis. The present study improved the emotion-classification system of the DLUT and developed a lexicon-based emotion-classification method. During the normalization of the COVID-19 epidemic, more than 50% of the Weibo posts had positive sentiments, but more than 20% of the Weibo posts had negative sentiments. In terms of positive emotions, government departments and official media played a role in guiding personal users, but this effect was weak. The negative emotions were mainly fear and disgust, which were related to topics such as the persistence of the epidemic, control of nucleic acid-positive persons, non-implementation of prevention and control measures, and domestic and international outbreaks. This study can help public administrators understand public emotions and evaluate policy in a timely manner. The results of the study, such as the main negative emotions and related topics, provide clues to understand public concerns, which can help to adjust policies and achieve more effective prevention and control measures.
The present study has some limitations. First, the sentiment lexicon has not been updated in recent years because the online language has been changing rapidly. It is planned to collect buzz words from social media and update the lexicon. Second, there are texts, emojis, and even pictures and videos in Weibo posts, and integrating multiple types of media can be considered to determine emotions in the future. Finally, the topics obtained through LDA can only provide clues for understanding sentiments. To discover the mechanisms behind these sentiments, text mining must be combined with statistical analysis, interviews, experimental studies, etc.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
FZ contributed to the conception, design, analysis, and revision of the manuscript. QT contributed to data collection and analysis. JC contributed to the results and discussions. NH contributed to sentiment analysis programming. All authors contributed to the article and approved the submitted version.
Funding
This research was funded by the National Natural Science Foundation of China (Grant No. 71571190) and the Guangdong Province Key Research Base of Humanities and Social Sciences (Grant No. 2022WZJD012).
Acknowledgments
The authors acknowledge the Public Safety Emergency Management Team in Beijing Institute of Technology, Zhuhai, China.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Acheampong, F. A., Nunoo-Mensah, H., and Chen, W. (2021). Transformer models for text-based emotion detection: a review of BERT-based approaches. Artif. Intell. Rev. 54, 5789–5829. doi: 10.1007/s10462-021-09958-2
Acheampong, F. A., Wenyu, C., and Nunoo-Mensah, H. (2020). Text-based emotion detection: advances, challenges, and opportunities. Eng. Rep. 2. doi: 10.1002/eng2.12189
Blei, D., Ng, A., Jordan, M., and Lafferty, J. (2003). Latent Dirichlet allocation. J. Machine Learn. Res. 3, 993–1022.
Brady, M. (2021). Précis: emotions: the basics. J. Philosophy Emot. 3, 1–4. doi: 10.33497/2021.summer.1
Budimir, S., Probst, T., and Pieh, C. (2021). Coping strategies and mental health during COVID-19 lockdown. J. Ment. Health 30, 1–8. doi: 10.1080/09638237.2021.1875412
Chum, A., Nielsen, A., Bellows, Z., Farrell, E., Durette, P.-N., Banda, J. M., et al. (2021). Changes in public response associated with various COVID-19 restrictions in Ontario, Canada: observational infoveillance study using social media time series data. J. Med. Internet Res. 23, e28716. doi: 10.2196/28716
Cowen, A. S., and Keltner, D. (2017). Self-report captures 27 distinct categories of emotion bridged by continuous gradients. Proc. Natl. Acad. Sci. 114, E7900–E7909. doi: 10.1073/pnas.1702247114
Crocamo, C., Viviani, M., Famiglini, L., Bartoli, F., Pasi, G., Carr,à, G., et al. (2021). Surveilling COVID-19 emotional contagion on twitter by sentiment analysis. Eur. Psychiat. 64, e17. doi: 10.1192/j.eurpsy.2021.3
Cui, J., Lu, J., Weng, Y., Yi, G. Y., and He, W. (2022). COVID-19 impact on mental health. BMC Med. Res. Methodol. 22, 15. doi: 10.1186/s12874-021-01411-w
Da, T., and Yang, L. (2020). Local COVID-19 severity and social media responses: evidence from China. IEEE Access 8, 204684–204694. doi: 10.1109/ACCESS.2020.3037248
Ehek, R., and Sojka, P. (2010). “Software framework for topic modelling with large corpora,” in Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks (Valletta, Malta: ELRA). p. 45–50.
Ekman, P. (1992). An argument for basic emotions. Cognit. Emot. 6, 169–200. doi: 10.1080/02699939208411068
Hockenbury, S. E., Nolan, S. A., and Hockenbury, D. H. (2016). Discovering Psychology (7th Edition). New York city, NY: Worth Publishers.
Holmes, E. A., O'Connor, R. C., Perry, V. H., Tracey, I., Wessely, S., Arseneault, L., et al. (2020). Multidisciplinary research priorities for the COVID-19 pandemic: a call for action for mental health science. Lancet Psychiatry 7, 547–560. doi: 10.1016/S2215-0366(20)30168-1
Jin, Y., Sun, T., Zheng, P., and An, J. (2021). Mass quarantine and mental health during COVID-19: a meta-analysis. J. Affect Disorders 295, 1335–1346. doi: 10.1016/j.jad.2021.08.067
Levenson, R. W. (2011). Basic emotion questions. Emot. Rev. 3, 379–386. doi: 10.1177/1754073911410743
Lin, C., and Fu, X. (2022). A cross-sectional study of depression, anxiety, and insomnia symptoms in people in quarantine during the COVID-19 epidemic. Int. J. Public Health 67, 1604723. doi: 10.3389/ijph.2022.1604723
Mallah, S. I., Ghorab, O. K., Al-Salmi, S., Abdellatif, O. S., Tharmaratnam, T., Iskandar, M. A., et al. (2021). COVID-19: breaking down a global health crisis. Ann. Clin. Microb. Anti. 20, 35. doi: 10.1186/s12941-021-00438-7
Murthy, A. R., and Kumar, K. M. A. (2021). A review of different approaches for detecting emotion from text. Iop Conf. Ser. Mater Sci. Eng. 1110, 012009. doi: 10.1088/1757-899X/1110/1/012009
Naseem, U., Razzak, I., Khushi, M., Eklund, P. W., and Kim, J. (2021). COVID senti: a large-scale benchmark twitter data set for COVID-19 sentiment analysis. IEEE Trans. Comput. Soc. Syst. 8, 1003–1015. doi: 10.1109/TCSS.2021.3051189
Ogbuokiri, B., Ahmadi, A., Bragazzi, N. L., Nia, Z. M., Mellado, B., Wu, J., et al. (2022). Public sentiments toward COVID-19 vaccines in South African cities: an analysis of Twitter posts. Front. Public Heal. 10, 987376. doi: 10.3389/fpubh.2022.987376
Oliveira, F. B., Haque, A., Mougouei, D., Evans, S., Sichman, J. S., Singh, M. P., et al. (2022). Investigating the emotional response to COVID-19 news on twitter: a topic modelling and emotion classification approach. IEEE Access 10, 16883–16897. doi: 10.1109/ACCESS.2022.3150329
Ortony, A., Clore, G. L., and Collins, A. (1990). The Cognitive Structure of Emotions. Cambridge, MA: Cambridge University Press
Pan, W., Wang, R., Dai, W., Huang, G., Hu, C., Pan, W., et al. (2021). China public psychology analysis about COVID-19 under considering sina Weibo data. Front. Psychol. 12, 713597. doi: 10.3389/fpsyg.2021.713597
Plutchik, R. (2001). The nature of emotions: human emotions have deep evolutionary roots, a fact that may explain their complexity and provide tools for clinical practice. Am. Sci. 89, 344. doi: 10.1511/2001.4.344
Poria, S., Gelbukh, A., Cambria, E., Hussain, A., and Huang, G.-.B. (2014). EmoSenticSpace: a novel framework for affective common-sense reasoning. Knowl. Based Syst. 69, 108–123. doi: 10.1016/j.knosys.2014.06.011
Ren, F.-F., and Guo, R.-J. (2020). Public mental health in post-COVID-19 era. Psychiat Danub 32, 251–255. doi: 10.24869/psyd.2020.251
Shen, L., Yao, R., Zhang, W., Evans, R., Cao, G., Zhang, Z., et al. (2021). Emotional attitudes of Chinese citizens on social distancing during the COVID-19 outbreak: analysis of social media data. JMIR Med. Inform. 9, e27079. doi: 10.2196/27079
Shi, W., Zeng, F., Zhang, A., Tong, C., Shen, X., Liu, Z., et al. (2022). Online public opinion during the first epidemic wave of COVID-19 in China based on Weibo data. Hum. Soc. Sci. Commun. 9, 159. doi: 10.1057/s41599-022-01181-w
Sievert, C., and Shirley, K. (2014). “LDAvis: a method for visualizing and interpreting topics,” in Proceedings of the Workshop on Interactive Language Learning, Visualization, and Interfaces (Baltimore, Maryland, USA: Association for Computational Linguistics). p. 63–70. doi: 10.3115/v1/W14-3110
Sohail, A., Yu, Z., Arif, R., Nutini, A., and Nofal, T. A. (2022). Piecewise differentiation of the fractional order CAR-T cells-SARS-2 virus model. Results Phys. 33, 105046–105046. doi: 10.1016/j.rinp.2021.105046
Strapparava, C., and Valitutti, A. (2004). WordNet-affect: an affective extension of WordNet. LREC. 4, 1083–6.
Sukhwal, P. C., and Kankanhalli, A. (2022). Determining containment policy impacts on public sentiment during the pandemic using social media data. Proc. Natl. Acad. Sci. 119, e2117292119. doi: 10.1073/pnas.2117292119
Tan, H., Peng, S.-L., Zhu, C-. P., You, Z., Miao, M.-C., and Kuai, S.-G. (2021). Long-term effects of the COVID-19 pandemic on public sentiments in mainland China: sentiment analysis of social media posts. J. Med. Internet Res. 23, e29150. doi: 10.2196/29150
Wang, H., Sun, K., and Wang, Y. (2022). Exploring the Chinese public's perception of omicron variants on social media: LDA-based topic modeling and sentiment analysis. Int. J. Environ. Res. Pu. 19, 8377. doi: 10.3390/ijerph19148377
Wang, J., Fan, Y., Palacios, J., Chai, Y., Guetta-Jeanrenaud, N., Obradovich, N., et al. (2022). Global evidence of expressed sentiment alterations during the COVID-19 pandemic. Nat. Hum. Behav. 6, 349–358. doi: 10.1038/s41562-022-01312-y
Xie, R., Chu, S. K. W., Chiu, D. K. W., and Wang, Y. (2021). Exploring public response to COVID-19 on Weibo with LDA topic modeling and sentiment analysis. Data Inform. Manage. 5, 86–99. doi: 10.2478/dim-2020-0023
Xu, D., Tian, Z., Lai, R., Kong, X., Tan, Z., Shi, W., et al. (2020). Deep learning based emotion analysis of microblog texts. Inform. Fus. 64, 1–11. doi: 10.1016/j.inffus.2020.06.002
Xu, L., Lin, H., Pan, Y., and Chen, J. (2008). Constructing the affective lexicon ontology. J. China Soc. Sci. Inf. 27, 180–185.
Ye, Y., Long, T., Liu, C., and Xu, D. (2020). The effect of emotion on prosocial tendency: the moderating effect of epidemic severity under the outbreak of COVID-19. Front. Psychol. 11, 588701. doi: 10.3389/fpsyg.2020.588701
Yu, Z., Sohail, A., Arif, R., Nutini, A., Nofal, T. A., Tunc, S., et al. (2022a). Modeling the crossover behavior of the bacterial infection with the COVID-19 epidemics. Results Phys. 39, 105774–105774. doi: 10.1016/j.rinp.2022.105774
Keywords: public, sentiment, emotion, Sina Weibo, COVID-19, China
Citation: Zhang F, Tang Q, Chen J and Han N (2023) China public emotion analysis under normalization of COVID-19 epidemic: Using Sina Weibo. Front. Psychol. 13:1066628. doi: 10.3389/fpsyg.2022.1066628
Received: 04 November 2022; Accepted: 14 December 2022;
Published: 09 January 2023.
Edited by:
Juan C. Aceros, Industrial University of Santander, ColombiaReviewed by:
Zhenhua Yu, Xi'an University of Science and Technology, ChinaHoubing Herbert Song, Embry–Riddle Aeronautical University, United States
Copyright © 2023 Zhang, Tang, Chen and Han. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Fa Zhang, richter2000@163.com