A multi-label text sentiment analysis model based on sentiment correlation modeling

Ni, Yingying; Ni, Wei

doi:10.3389/fpsyg.2024.1490796

ORIGINAL RESEARCH article

Front. Psychol., 20 December 2024

Sec. Emotion Science

Volume 15 - 2024 | https://doi.org/10.3389/fpsyg.2024.1490796

This article is part of the Research TopicMethodology for Emotion-Aware Education Based on Artificial IntelligenceView all 5 articles

A multi-label text sentiment analysis model based on sentiment correlation modeling

Yingying Ni¹

Wei Ni²^*

¹School of Media & Communication Shanghai Jiao Tong University, Shanghai, China
²Department of Critical Care Medicine, Sir Run Run Shaw Hospital, Hangzhou, Zhejiang, China

Objective: This study proposes an emotion correlation-enhanced sentiment analysis model (ECO-SAM), a sentiment correlation modeling-based multi-label sentiment analysis model.

Methods: The ECO-SAM utilizes a pre-trained BERT encoder to obtain semantic embedding of input texts and then leverages a self-attention mechanism to model the semantic correlation between emotions. Additionally, it utilizes a text emotion matching neural network to make sentiment analysis for input texts.

Results: The experiment results in public datasets demonstrate that compared to baseline models, the ECO-SAM obtains the precision score increasing by 13.33% at most, the recall score increasing by 3.69% at most, and the F1 score increasing by 8.44% at most. Meanwhile, the modeled sentiment semantics are interpretable.

Limitations: The data modeled by the ECO-SAM are limited to text-only modality, excluding multi-modal data that could enhance classification performance. Additionally, the training data are not large-scale, and there is a lack of high-quality large-scale training data for fine-tuning sentiment analysis models.

Conclusion: The ECO-SAM is capable of effectively modeling sentiment semantics and achieving excellent classification performance in many public sentiment analysis datasets.

1 Introduction

Sentiment analysis is a significant task in natural language processing that aims to mine the emotional tendencies of given texts, thereby helping to gain a deeper understanding of the text content and its potential impact. Currently, with the rise of public social media platforms such as Sina Weibo and Twitter, sentiment analysis techniques have shown important roles in social sentiment analysis and event tracking (Mujahid et al., 2021; Omar and Abd El-Hafeez, 2023)^. Relevant researchers use sentiment analysis algorithms to identify the emotional tendencies of massive social media platform users’ posts, thereby comprehensively analyzing the trend of public opinion and taking corresponding measures. The sentiment analysis technology itself has also expanded from the traditional simple binary classification task to the multi-classification task, that is, identifying the specific emotions contained in the text, such as happy, sad, like, and angry.

However, compared to the traditional binary sentiment analysis, the multi-label sentiment analysis task faces challenges such as data sparsity, class imbalance, and difficulty in modeling emotional semantics. To this end, researchers have proposed various multi-label text emotion classification models based on statistics, machine learning, and deep learning techniques. For example, sentiment analysis models based on emotional dictionaries (Wu et al., 2019; Acerbi et al., 2023; Duan et al., 2021a) identify the emotional categories of texts by matching the retrieved words in the emotional dictionary. Text emotion dictionary models based on Naive Bayes and support vector machines (Neethu and Rajasree, 2013) use statistical learning methods to analyze and model word frequency statistical features to recognize the probability of text emotions. With the widespread application of deep learning in the field of natural language understanding (Vaswani et al., 2023; Mikolov et al., 2013; Yadav and Vishwakarma, 2020; Beridge et al., 2022), deep learning text emotion recognition models represented by recurrent neural networks (RNN) (Zhang et al., 2019) and large-scale pre-trained models (pre-trained model) (Duan et al., 2021b; Cortiz, 2021) have made significant progress in the identification of specific text emotion categories by relying on the powerful capabilities of deep learning in semantic representation modeling.

To efficiently mine and utilize semantic correlation between emotions to enhance multi-label sentiment analysis, in this study, we propose an emotion correlation-enhanced sentiment analysis model (ECO-SAM). Inspired by the widely used self-attention mechanism for language modeling and the basic emotion theory, we first design a novel attention-based emotion correlation modeling module that could automatically learn the semantic correlation between emotions from data and obtain correlation-enhanced emotion embedding representation. Next, we transform the multi-label sentiment analysis problem into an information retrieval problem, which aims to find the most suitable emotions from the emotion candidate list for a given query text. Then, we design an emotion-matching module that uses neural networks to learn the matching function between emotion and text embedding from data. Finally, we demonstrate the effectiveness of ECO-SAM via extensive experiments on two public sentiment analysis datasets. The experiment results unveil that the ECO-SAM obtains the precision score increasing by 13.33% at most, the recall score increasing by 3.69% at most, and the F1 score increasing by 8.44% at most. Meanwhile, the modeled sentiment semantics are interpretable.

2 Related work

2.1 Basic emotion theory

The basic emotion theory was proposed by American psychologist Ekman (1972). The theory believes that humans have six basic emotions: happiness, sadness, fear, anger, surprise, and disgust. These basic emotions are considered to be universally present across cultures and species. Based on the basic emotion theory, Ekman (1972) found some universality of emotional expressions through observing the facial expressions of people in different cultures. Izard (1977) expanded the basic emotion theory, discussing the relationship between basic emotions and the relationship between emotion and cognition. The study proposed a model of the emotional system, describing the relationships between basic emotions and how they interact and regulate each other. For example, the author pointed out that there is a close relationship between “anger” and “disgust,” while “happiness” and “sadness” have an antagonistic relationship. Russell (1980) proposed the circular emotion theory, which expanded the basic emotion theory and emphasized the construction and subjective experience of emotions, implying the idea of modeling the association between emotions. Cowen and Keltner (2017) explored how people describe and distinguish different emotional experiences in self-reports. The study found more fine-grained emotional experiences compared to the basic emotion theory, expanding the understanding of emotions and breaking through the traditional concept of basic emotions. It shows that emotions are complex and diverse and can be described and captured through multiple discrete emotion categories and continuous gradients.

In summary, the basic emotion theory first proposed the six basic elements of emotion. Relevant scholars have delved deeper into the construction of emotions and the relationships between emotions based on the basic emotion theory and developed a gradually more comprehensive emotional theory framework.

2.2 Sentiment analysis

Sentiment analysis is a text classification task that aims to identify the emotional category of a text based on its semantic features. According to the different distribution of emotion labels, sentiment analysis can be divided into emotion polarity classification (binary), emotion category classification (multi-class), and emotion label classification (multi-label). Sentiment analysis models include rule-based emotion dictionary methods (Wu et al., 2019; Acerbi et al., 2023; Xu et al., 2021), statistical machine learning-based methods (Neethu and Rajasree, 2013; Rastogi et al., 2021), and deep learning-based methods (Yadav and Vishwakarma, 2020; Duan et al., 2021a; Cortiz, 2021; Zhang et al., 2019; Ahmed et al., 2022). The rule-based emotion dictionary method is an unsupervised approach that uses emotion dictionaries to obtain the emotion values of emotional words in the document and then determines the overall emotional tendency of the document through weighted calculation. This method does not consider the connections between words, nor does it consider the changes in the emotional tendency of words due to the context.

Common emotion dictionaries include English dictionaries such as General Inquirer, SentiWordNet, Opinion Lexicon, and MPQA (Chatterjee et al., 2019), as well as Chinese dictionaries such as HowNet (Fu et al., 2017), NTUSD (Chen et al., 2022), and the Chinese emotion lexicon ontology (Deng et al., 2017). The statistical machine learning-based method is a supervised approach that trains machine learning classification models on text data with emotion labels and then applies the trained machine learning classification models to text emotion prediction tasks. For example, Gaye et al. (2021) proposed a text emotion recognition model based on support vector machines (SVMs), dividing the emotion analysis process into two strategies and four methods. Ghourabi et al. (2020) proposed a text emotion recognition method based on Naive Bayes, establishing a three-layer tree-structured emotion recognition structure. In addition, Patel and Urry (2024) proposed a text emotion recognition method that combines deep semantic and surface-level grammar, applicable to aspect-level sentiment analysis.

The deep learning-based method is also a supervised approach, training neural network classification models on text data with emotion labels, and utilizing the strong fitting ability of neural networks to accurately predict text emotion categories. For example, Grandjean et al. (2008) proposed a sentiment analysis model based on convolutional neural networks, where the dual convolutional layer structure can extract features from sentences of any length. Ji et al. (2019) proposed a sentiment analysis model based on deep belief networks, solving the problem of sparse text features. With the rise of large language models (LLMs) (Zhao et al., 2023; Elyoseph et al., 2023; Liu et al., 2023), the pre-trained LLM-based methods have emerged in sentiment analysis and achieved excellent performance on large-scale datasets. For instance, Valderrama et al. (2024) used the BERT model to obtain more complete text semantic representations, thereby more accurately predicting text emotion categories. Sailunaz et al. (2018) compared the sentiment analysis capabilities of various large language models in the research on user behaviors of spreading others’ privacy information on social networks. Gao et al. (2021) proposed to use prompt learning to enhance the classification performance of pre-trained models when the data volume is relatively small. In the multi-modal emotion recognition scenario, Zhu et al. (2020) proposed a sentiment analysis model based on improved ResNet to analyze and improve the accuracy of image emotion classification. Currently, deep learning models play a pivotal role in accurate sentiment analysis. As shown in Table 1, we count and list current state-of-the-art sentiment analysis methods based on previous research.

Table 1

Table 1. Current state-of-the-art (SOTA) sentiment analysis methods.

2.3 Deep learning and attention mechanism

The attention mechanism was first proposed by Bahdanau et al. (2016), which is a deep learning technique used to model the semantic association and related representation between different parts of the semantic sequence. In natural language processing, the attention mechanism is often used to model the semantic association between the context in the corpus, thereby achieving the correspondence between the model output results and the context in tasks such as text generation and text classification. The transformer model proposed by Vaswani et al. (2023) is a representative model using a self-attention mechanism. The transformer model has strong semantic representation and text output capabilities and is the foundation of many text classifiers and text sentiment recognition methods.

3 Sentiment analysis based on emotion correlation modeling

Existing sentiment analysis methods struggle to model the important role of emotion correlation in emotion recognition. Therefore, this study first proposes a text sentiment analysis method based on emotion correlation modeling (ECO-SAM). Subsequently, the superiority of the ECO-SAM in sentiment analysis and emotion correlation modeling is demonstrated on the Weibo text sentiment analysis dataset. Finally, the ECO-SAM is applied to text emotion analysis under a given topic.

3.1 An overview of ECO-SAM

The framework of the proposed ECO-SAM algorithm is shown in Figure 1. The framework consists of three modules: the text encoder module, the attention text correlation modeling module, and the emotion matching module. The text encoder module uses the large-scale pre-trained model BERT to encode the text input into a high-dimensional text semantic vector. The attention text correlation modeling module uses the attention mechanism to transform the trainable emotion inherent feature vectors and output feature vectors containing emotion correlations, while also outputting an emotion correlation matrix. The emotion classification neural network module matches the text semantic vector with each emotion-correlated emotion feature vector and calculates the probability of the text containing that emotion. The algorithm finally outputs the probability of emotion containment.

Figure 1

Figure 1. Structure of ECO-SAM.

In the training stage, the model’s emotion inherent feature vectors, attention text correlation modeling module, and emotion classification neural network are trained using a multi-label text emotion recognition dataset. In the inference stage, the parameters of ECO-SAM are frozen to achieve end-to-end sentiment analysis.

3.2 BERT text encoder

The BERT text encoder in the ECO-SAM (Cui et al., 2021) is a large-scale pre-trained text encoding model based on BERT (Vaswani et al., 2023). This module utilizes a masked language model (MLM) to generate deep bidirectional language representations. Experiments in the original BERT study (Vaswani et al., 2023) have demonstrated that BERT achieved state-of-the-art performance on 11 natural language processing tasks, which substantiates the efficacy of the BERT module in text semantic representation.

Formally, let the original text input be a character sequence $s = \{w_{1} w_{2} \dots w_{N}\}$ , then the encoding process of BERT can be formalized as shown in Equation 1:

\begin{array}{l} v_{s}^{(senti)} = f_{BERT} (w_{1} \dots w_{N}) & (1) \end{array}

where $v_{s}^{(senti)} \in ℝ^{D}$ , is the text semantic representation vector. The $D$ represents the dimension of the text semantic representation vector defined by BERT. In general, D $= 1, 024$ .

3.3 Attention-based emotion correlation modeling module

The attention-based emotion correlation modeling module uses the self-attention mechanism to model the semantic correlation of emotions, thereby addressing the lack of research on emotion correlation in existing studies. Specifically, the self-attention mechanism adopts the query-key-value (QKV) pattern. Each emotion in the framework has a trainable query vector, key vector, and value vector (the value vector corresponds to the inherent emotion feature vector in Figure 1). First, for a target emotion, its query vector is obtained, and the cosine similarity between the query vector and the key vector of each other emotion is calculated. The similarity with each other emotion reflects the semantic dependence of the target emotion, i.e., the extent to which the semantic representation of the target emotion depends on that particular emotion. Then, the feature vector containing the emotion correlation of the target emotion is calculated. This vector is the weighted average of the inherent feature vectors (value vectors) of each emotion, with the weights being the calculated semantic dependence. Finally, Pearson’s correlation coefficient between the feature vectors containing emotion correlations is calculated and the emotion correlation matrix is output.

Formally, one-hot encoding is used to mark each emotion. Let $S = {(s_{j k})}_{D \times K}$ denotes the emotion feature inherent vector matrix, $Q = {(q_{j k})}_{D \times K}$ denotes the evaluation query vector matrix, and $Z = {(z_{j k})}_{D \times K}$ denotes the emotion key vector matrix. In the prediction of emotion probabilities, this module first calculates the emotion feature $e_{k}$ using $S$ and the one-hot emotion vector $x_{k}$ , as shown in Equation 2. Meanwhile, it obtains the query and key vectors for each emotion, as shown in Equation 3 and Equation 4:

\begin{array}{l} e_{k} = S \times x_{k}, & (2) \end{array}

\begin{array}{l} q_{j} = Q \times x_{j}, j = 1, 2, \dots, K, & (3) \end{array}

\begin{array}{l} z_{j} = Z \times x_{j}, j = 1, 2, \dots, K, & (4) \end{array}

Subsequently, the semantic dependence similarity between the target emotion and each emotion is calculated using Equation 5:

\begin{array}{l} α_{k, j} = softmax (\frac{q_{k}^{⊤} z_{j}}{\sqrt{D}}) . & (5) \end{array}

Finally, the emotion-semantic embedding with correlation modeling for the target emotion is calculated, as presented in Equation 6:

\begin{array}{l} e_{k}^{(a t t)} = \sum_{j = 1}^{K} α_{k, j} \times e_{j} . & (6) \end{array}

The resulting calculation $e_{k}^{(a t t)}$ is the emotion vector representation that contains the emotion dependence relationship, which is used in the subsequent steps to recognize the emotion of the text.

3.4 Emotion matching module

The emotion matching module uses a neural network to compute the degree of matching between the text semantic representation and the emotion-semantic representation, thereby predicting the probability of each emotion in the text. Specifically, given the semantic representation vector of a sentence and the semantic representation vector of an emotion, this module uses a quadratic form neural network to predict the probability of the text emotion, as shown in Equation 7:

\begin{matrix} {\hat{y}}_{s, k} = sigmoid (v_{s}^{(a t t) ⊤} W e_{k}^{(senti)}) \\ = sigmoid (v_{s}^{(a t t) ⊤} (O^{⊤} Λ O) e_{k}^{(senti)}) \\ = sigmoid ({(O v_{s}^{(a t t)})}^{⊤} Λ (O e_{k}^{(senti)})) \end{matrix} (7)

where $W = O^{⊤} Λ O$ is the eigenvalue decomposition of the semantic matching matrix $W \in ℝ^{D \times D}$ . The above eigenvalue decomposition transformation implies that this neural network prediction process is equivalent to applying the same linear transformation to the text semantic vector and the emotion-semantic vector and then taking the element-wise weighted average, with the weights being the eigenvectors. The training process of the neural network is equivalent to optimizing the linear transformation and the eigenvectors, so that the predicted probability of text emotion is close to the true data label.

3.5 Loss function

Since the sentiment analysis problem addressed by the ECO-SAM is a multi-label classification problem, the cross-entropy loss is used as the loss function, as shown in Equation 8. During the model training process, the training objective of the ECO-SAM is to minimize the loss function value:

\begin{array}{l} L (Ω) = - \sum_{i = 1}^{N} \sum_{k = 1}^{C} y_{i, k} log (p_{i, k}) & (8) \end{array}

where $Ω$ represents all the trainable parameters in the ECO-SAM. N represents the number of samples (text samples in the training set). $C$ represents the number of possible emotion categories. $y_{i, k}$ represents whether the text contains the emotion, where $y_{i, k} = 1$ indicates the text contains the emotion $k$ , and $y_{i, k} = 0$ indicates otherwise. $p_{i, k}$ represents the probability predicted by the ECO-SAM that the text contains each emotion.

4 Text emotion recognition experiment

4.1 Experimental setup

This experiment compares the proposed multi-label sentiment analysis model, ECO-SAM, with various baseline text emotion prediction models using the public Weibo dataset. The goal is to verify the accuracy of the ECO-SAM in sentiment analysis and its ability to model emotion feature correlations. For the experimental datasets, this study used two publicly available datasets: NLPCC2014 and GoEmotions (Demszky et al., 2020). This module takes three inputs for emotion k: the feature inherent vector, query vector, and the corresponding key vectors for all emotions. Each text contains up to two emotions. The GoEmotions dataset consists of 58,000 text data from the English forum Reddit, with the original data containing 27 fine-grained emotion categories. Based on the basic emotion theory, we screened out the 7 emotions consistent with the NLPCC2014 dataset as well as the neutral case as the target of sentiment analysis and selected 32,445 valid samples. Next, we split each dataset into training, validation, and test sets in the ratio of 70%:10%:20%, respectively.

In terms of the experiment setting, all models implemented using Python 3.8, with the deep learning framework being PyTorch, and the operating system being Linux. The hardware configuration for running the experiments is a server with two 2.10GHz Intel Xeon E5-2620 v4 CPUs and one NVIDIA Tesla-A100 GPU.

4.2 Text emotion prediction experiment

The main experiments in this study include emotion prediction experiments and emotion feature correlation analysis. Finally, the ECO-SAM emotion prediction model is applied to sentiment analysis. In the emotion prediction experiment, the following baseline models are used:

Random: Random prediction. For each emotion, the text has a 1/2 probability of being classified into that emotion category. Whether an emotion prediction model performs better than random prediction is a basic criterion for its usability.

cnsenti (Deng and Nan, 2022): Chinese Sentiment, an emotion prediction model based on the HowNet emotion dictionary of Chinese Knowledge Network.

SVM (Mullen and Collier, 2004): Support Vector Machine, an emotion prediction model based on support vectors. In the experiment, BERT is used to encode the text into semantic vectors, which are then used as input to the SVM.

LSTM (Hochreiter and Schmidhuber, 1997): Long short-term memory is a type of recurrent neural network (RNN) architecture designed to address the vanishing gradient problem in traditional RNNs. LSTMs are particularly effective at learning long-term dependencies in data, making them well-suited for applications such as sentiment analysis and time series analysis.

BiLSTM (Graves and Schmidhuber, 2005): It is an extension of the traditional LSTM architecture that processes input sequences in both forward and backward directions. This bidirectional approach provides a more comprehensive understanding of the sequence.

BERT (Devlin et al., 2019) is a pre-trained transformer-based language model. BERT can encode raw texts into semantic vectors with rich information for downstream tasks. For the text emotion prediction task, we use a fully connected neural network as the downstream output layer.

T5 (Raffel et al., 2020): It is a transformer-based language model proposed by Google that unifies various NLP tasks by framing them all as text-to-text problems, where both input and output are text strings.

The results of the text emotion prediction experiment are shown in Table 2. Considering the characteristics of the multi-label classification task, the evaluation metrics are Micro Precision, Micro Recall, and Micro F1 Score. The higher the score for each of these evaluation metrics, the higher the accuracy of the model’s text emotion prediction.

Table 2

Table 2. Statistics of datasets after preprocessing.

Since GoEmotions is an English dataset, the baseline model cnsenti, which is based on the Chinese dictionary, is unable to recognize the text emotions in this dataset. From the above experimental results, it can be seen that the ECO-SAM proposed in this study outperforms the existing text emotion prediction baseline models in terms of precision, recall, and F1 score, with the highest increase in precision being 13.33%, the highest increase in recall being 3.69%, and the highest increase in F1 score being 8.44%. This proves that the ECO-SAM can predict text emotions more accurately compared to existing models (Table 3).

Table 3

Table 3. Experiment results of sentiment analysis.

Furthermore, among the baseline models, the BERT method also significantly outperforms other existing methods. The comparison between the BERT and cnsenti shows that the text emotion prediction model based on BERT pre-trained language encoding has better performance on Weibo emotion prediction than the traditional model based on rules and emotion dictionaries. The comparison between BERT and SVM shows that the text emotion prediction algorithm based on neural networks has better performance on Weibo emotion prediction than the algorithm based on SVM. Compared to the best baseline model BERT, our proposed ECO-SAM method further improves the performance of the text emotion prediction model based on BERT pre-trained language encoding through an innovative emotion feature modeling module.

4.3 Visualization experiment: emotion feature correlation modeling experiment

This section uses the NLPCC2014 dataset as an example to analyze the ability of the ECO-SAM to model emotional semantic similarity. The ECO-SAM text emotion prediction model improves the accuracy of text emotion prediction by modeling the correlation between emotion features through the attention-based emotion modeling module. This experimental stage mainly focuses on the modeling results of the emotional feature correlation in the ECO-SAM. In the ECO-SAM, emotional features are represented as $e_{k}^{(a t t)}$ , where k represents the emotion category sequence number. For any two emotions k1 and k2, this experiment uses Pearson’s correlation coefficient of the emotion features as the measure of emotion feature correlation, denoted as $Corr (k_{1}, k_{2})$ . This correlation coefficient ranges between −1 and 1. When $Corr (k_{1}, k_{2}) > 0$ , the two emotion features are positively correlated (similar); when $Corr (k_{1}, k_{2}) \approx 0$ , the two emotion features are uncorrelated (independent); when $Corr (k_{1}, k_{2}) < 0$ , the two emotion features are negatively correlated (semantically opposite). The results of the emotion feature correlation calculation are shown in the following figure, which includes seven emotions: anger, disgust, fear, happiness, like, sadness, and surprise. The brighter the color of each square in the figure, the greater the correlation value, and the stronger the association between the two emotions. According to Figure 2, the three emotions most strongly associated with each emotion are as follows:

• Anger: Disgust (0.99), Surprise (0.50), Fear (0.39).

• Disgust: Anger (0.99), Surprise (0.46), Fear (0.34).

• Fear: Surprise (0.97), Anger (0.39), Sadness (0.38).

• Happiness: Like (0.55), Surprise (0.48), Fear (0.37).

• Like: Happiness (0.55), Sadness (0.31), Anger (0.24).

• Sadness: Fear (0.38), Anger (0.35), Like (0.31).

• Surprise: Fear (0.97), Anger (0.50), Happiness (0.48).

Figure 2

Figure 2. Heatmap of the correlation of emotion features.

The above results show that different types of emotions, due to their semantic differences, either exhibit strong correlations or are mutually independent of each other. Some emotions, due to the consistency of their semantics, often exhibit a relatively strong clustering feature. For example, “anger” and “disgust” are both negative emotions, and their semantic correlation reaches 0.99. They also have relatively strong correlations with “fear,” indicating that the above four emotions are similar in semantic connotation, which is consistent with people’s intuition. At the same time, “happiness” and “like” have a relatively strong correlation, indicating that the two intuitively positive emotions also have similar semantic connotations. In addition, “surprise” has a relatively high semantic similarity with positive emotions such as “happiness,” as well as with negative emotions such as “fear.” This suggests that “surprise” as an emotion that an individual perceives due to sudden changes tends to be neutral. In other words, “surprise” can coexist with positive emotions (such as “pleasant surprise”) and also with negative emotions (such as “horrifying surprise”).

5 Discussion

The significance of this research is as follows: First, at the theoretical level, this study organically combines basic emotion theory and deep learning technology, innovatively proposes a large-scale pre-trained text emotion recognition method (ECO-SAM), and verifies the method’s accurate text emotion recognition and emotion-semantic correlation modeling capabilities through large-scale experiments on real datasets. In the task of sentiment analysis, accuracy is a core issue in related research and is also an important technical guarantee for public opinion monitoring. Therefore, the high performance of ECO-SAM in the experiments is undoubtedly of great significance for enhancing the effectiveness of public opinion monitoring. Second, by leveraging the emotion -semantic correlation modeling capability of ECO-SAM, this study also analyzes the correlation relationships between different emotions within this topic, providing important data references for related public opinion monitoring.

At the same time, this research still has some limitations. First, due to the limitations of available data, the training corpus built using the ECO-SAM is still not sufficient to fully unleash the model’s maximum performance, and the data volume needs to be further increased in future research. Second, in terms of text semantic parsing capability, the performance of the ECO-SAM method in recognizing the emotions of texts with large implicit information such as irony and sarcasm still needs to be improved. In the future research plan, on the one hand, we can further improve the text emotion recognition capability through methods such as expanding the dataset and optimizing the model architecture. On the other hand, with the rise of large language models (LLMs) (such as ChatGPT), we can combine the advantages of LLMs in text generation and emergent capabilities, as well as the advantages of ECO-SAM in strong semantic modeling and low computational cost, to develop more efficient sentiment analysis techniques. Furthermore, the topic and user distribution on online social platforms are complex and rich in information. How to leverage the rich topic and user information to assist text emotion recognition and public opinion monitoring, and explore the downstream applications of emotion recognition and emotion-semantic modeling, we also believe, is an important future research direction.

6 Conclusion

Online social platforms are highly susceptible to large-scale controversial network issues, many of which can easily escalate into emotionally charged irrational propagation. Existing sentiment analysis models have difficulty in modeling emotion correlation, and the accuracy of emotion prediction needs to be improved. To solve the above problems, this study first conducted extensive and in-depth-related research and innovatively proposed an emotion correlation-enhanced sentiment analysis model (ECO-SAM) based on basic emotion theory and deep learning technology, to achieve accurate text emotion recognition and emotion correlation modeling on online social platforms. The large-scale comparative experiments on the real text emotion recognition Chinese dataset NLPCC2014 and the English dataset GoEmotions verified the accurate text emotion recognition capability of the ECO-SAM. Emotion recognition comparative experiments showed that the ECO-SAM improved the precision, recall, and F1 score of text emotion recognition by 13.33, 3.69, and 8.44%, respectively, compared to the optimal baseline method BERT, effectively improving the accuracy of text emotion recognition. The emotion feature correlation experiment showed that emotions with similar emotional colors (positive/negative) have relatively strong semantic correlations; the “surprise” emotion has a relatively high semantic correlation with both positive emotions and negative emotions, acting as a bridge between the two in the emotion correlation graph.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found at: https://github.com/qweraqq/NLPCC2014_sentiment.

Author contributions

YN: Writing – original draft, Writing – review & editing, Conceptualization, Investigation, Project administration. WN: Data curation, Formal analysis, Resources, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by Zhejiang Provincial Health Science and Technology Program Project (2021KY757) and Special Anti-epidemic Project of Zhejiang Provincial Department of Education (Y202043731).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abdullah, T., and Ahmet, A. (2022). Deep learning in sentiment analysis: recent architectures. ACM Comput. Surv. 55, 1–37. doi: 10.1145/3548772