Skip to main content

ORIGINAL RESEARCH article

Front. Psychol., 17 August 2022
Sec. Quantitative Psychology and Measurement

A deep learning-based prediction model of college students’ psychological problem categories for post-epidemic era—Taking college students in Jiangsu Province, China as an example

\r\nYongheng Liu,Yongheng Liu1,2Yajing Shen*Yajing Shen1*Zhiyong CaiZhiyong Cai1
  • 1Department of Mental Health Education, Nanjing Audit University, Nanjing, China
  • 2Faculty of Statistics and Data Science, Nanjing Audit University, Nanjing, China

For a long time, it takes a lot of time and energy for psychological workers to classify the psychological problems of college students. In order to quickly and efficiently understand the common psychological problems of college students in the region for real-time analysis in the post-epidemic era, 2,000 college students’ psychological problems were selected as research data in the community question section of the “Su Xin” application, a psychological self-help and mutual aid platform for college students in Jiangsu Province. First, word segmentation, removal of stop words, establishment of word vectors, etc. were used for the preprocessing of research data. Secondly, it was divided into 9 common psychological problems by LDA clustering analysis, which also combined with previous researches. Thirdly, the text information was processed into word vectors and transferred to the Attention-Based Bidirectional Long Short-Term Memory Networks (AB-LSTM). The experimental results showed that the proposed model has a higher test accuracy of 78% compared with other models.

Introduction

As an increasingly popular research question, how to use the data collected on the Internet to automatically assess the mental health of users has been attracting in-depth research by scholars in the fields of computer science, statistics and psychology. The International Conference on Computational Linguistics (ICSL)1 has added the Annual Conference on Computational Linguistics and Clinical Psychology (CLPsych) since 2014, which promoted social network-driven automatic assessment of mental health data. However, automated detection and prevention of underlying mental health problems remains a significant challenge due to privacy secrecy and biased research, as well as inadequate models, lack of expertise, or underutilization.

Psychological research should aim at major issues of social and economic development, based on national conditions, pay attention to the frontiers of the world, make full use of scientific research technology, and build a statistical measure of new development patterns. Under the new development pattern, the types of data are complex and diverse, and the amount of data has grown tremendously, which requires new statistical research. Especially in the face of major public health emergencies such as COVID-19, a large amount of complex data was generated in a short period of time. It is even more necessary to combine actual data with new statistical measures, so as to find out the regularity of emergencies in time and conduct people’s health risk assessment. Therefore, strengthening the combination of contemporary psychological research and big data information can carry out effective real-time analysis, which has theoretical and practical guiding significance for formulating countermeasures and suggestions in the future and improving the focus of mental health education in colleges and universities. Although China has taken a series of effective measures to control the spread of COVID-19, the panic and anxiety brought about by the epidemic has an important impact on mental health (Qiu et al., 2020). In particular, college students as a special group has undergone tremendous changes in their learning styles and living conditions under the influence of the epidemic (Kecojevic et al., 2020). Mining and analyzing the big data of college students’ psychological problems, deepening the efficient combination of psychological problems and data information, will help promote the development of precise prevention and intervention for mental health, and allocate resources more scientifically and effectively, thereby promoting the formation of college students’ good mental health (Qiuqing et al., 2022).

We studied the psychological problems of college students under the epidemic prevention and control by means of text information cluster analysis, and constructed an accurate and effective prediction model based on this, which is of great significance to effectively intervene in the psychological problems of college students.

Literature review

Although the epidemic has been controlled in the more developed countries due the scale of vaccination, small-scale epidemics often fluctuate, which has changed the physiology, psychology and behavior of college students to some extent in the “post-epidemic era” (Ren and Guo, 2020). In particular, the college student group was in a period of mental immaturity and was extremely vulnerable to shocks in the face of external emergencies (Chang et al., 2020). Some studies have found that in the post-epidemic era, the incidence of psychological problems such as depression and anxiety among students is still relatively high (Cheng, 2021). Much of the information extracted from users’ social network data could help mental health professionals assess the severity of users’ psychological problems and better organize the treatment process (Gaur et al., 2019). Therefore, quickly identifying the types of psychological problems of college students is conducive to solving the difficulties they encounter more efficiently in the post-epidemic era.

Related research on clustering of psychological problems

Since the standards for identifying mental health are relatively abstract, scholars from different countries have different perspectives on the connotation and standards of mental health. For example, American psychologists Maslow and Mittelmann (1941) put forward ten standards of mental health, and pointed out that the standards of mental health are related to age, gender, culture, religious belief, and even country or region. In recent years, as social network platforms have become an inseparable part of people’s lives, the use of network data to evaluate users’ mental health has the advantages of abundant available data, strong timeliness, non-invasive, long-term tracking, wide coverage of evaluation objects, and convenient storage, in which way can effectively overcome the limitations of traditional mental health assessment methods (Ernala et al., 2019). Resnik et al. (2013) used direct topic modeling with LDA to generate interpretable, psychologically relevant “topics” that added value in the predictions of clinical assessments. Acevedo et al. (2021) used an algorithm to classify academic stress levels during COVID-19.

Related research on psychological problem prediction

Aiming at the prediction of psychological problems in major public health events in the past, Li et al. (2021) studied the mental health status of college students at different stages, and established a variety of scales combined with logistic regression models to explore the factors that affect college students’ psychological problems. They found that during COVID-19 epidemic, acute stress, anxiety and depression were common among college students, and significantly increased in the early stage of the epidemic. Through the experimental simulation of 250 groups of real data, Wang et al. (2019) improved the initial weights and thresholds of the college students’ psychological crisis warning model based on genetic BP neural network by using MATLAB. Ge et al. (2020) used a machine learning approach to conduct a longitudinal survey of college students to understand the prevalence of possible anxiety and possible insomnia, and to identify their risk factors.

To sum up, although some scholars have conducted research on the mental health of college students, they have found that COVID-19 has a greater negative impact on the mental health of college students. However, there is a certain lag in questionnaire research, which may affect the accuracy of related research on college students’ mental health.

The objectives of this research are as follows: (1) Objectively judge the mental health status of college students according to the expressions of the respondents, (2) choose a more effective method to obtain the psychological state of college students by analyzing the text review data, (3) acquire the accurate classification in the face of massive mental health data, in order to solve the psychology problems of college students more efficiently and pertinently, and save medical resources.

Tools and methods

Data sources

This study used crawler technology to crawl 1,456 college students’ psychological problems between January 2020 and May 2022 from the “Community Questions” section of the “Suxin” APP, a psychological self-help and mutual assistance platform for college students in Jiangsu Province, as research data. Then, each paragraph in the text comment was divided into a piece of data, and a total of 2000 pieces of data were finally obtained after preprocessing.

Model introduction

Latent Dirichlet allocation

The most commonly used model in text clustering is the vector space model, but the features of this model are extremely sparse, and it is often impossible to measure the similarity between texts by relying on the degree of co-occurrence of words. The Latent Dirichlet Allocation (LDA) (Blei et al., 2003) topic model selected in this paper can perfectly solve this problem and still has excellent performance in short texts. Using LDAvis (Sievert and Shirley, 2014), we visualized topic models to explain topic distributions and their associated terms ordered by probability for further analysis of college students’ psychological problems (see Supplementary Appendix 1).

Long-short term memory

Long-short term memory neural network (LSTM) is suitable for dealing with time-varying sequential problems, and the psychotext data to be processed in this study fits this characteristic. The way people usually speak is from front to back, and the next word is built on the logic of the previous word, so using LSTM networks to process mental text data is the most intuitive way. The LSTM’s hidden layers form a closed loop. The weight from LSTM hidden layer to hidden layer is the memory controller of the network, responsible for scheduling memory, and the state of the hidden layer will participate in the next prediction as the memory state at a certain moment (see Supplementary Appendix 2).

Predictive model of college students’ psychological problem types

The proposed attention-based bidirectional long short-term memory networks network model

The structure of the attention-based bidirectional long short-term memory networks (AB-LSTM) psychological problem type prediction model used in this paper is shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. Attention-based bidirectional long short-term memory networks.

Step1: Text preprocessing. Data cleaning was performed on the original data, the three parts of Chinese characters, numbers and English letters in the text were preserved, and punctuation marks, emoticons and meaningless characters in the text were deleted. After the cleaning was completed, the jieba Chinese word segmentation database was used for word segmentation, and then the stop words were removed using the stop word list. In order to avoid the interference of some numbers or words, this paper filtered the dictionary obtained after word segmentation, and filtered out more words according to the part of speech of the word segmentation. In order to reduce the complexity of text classification, this paper used keras’ Tokenizer to serialize and vectorize text. First, used the fit_on_texts method of Tokenizer to get the word_index of the mapping relationship between the corresponding words and numbers, and then used texts_to_sequences to get the serialized text data. The padding method was used to make up the same length, and then the Embedding layer that comes with Keras was used for vectorization.

Step2: Data partitioning. The original text sample vector obtained above was randomly divided into training set and test set according to the ratio of 9:1.

Step3:Network structure building. In this step, the Embedding layer and SpatialDropout1D layer (Salim et al., 2020) are introduced to process the data and output to the Bi-LSTM layer, and the network structure is constructed by combining the attention mechanism.

(1) Embedding layer:Embedding is a way to convert a discrete variable into a continuous vector representation. Embedding can be understood as finding a function or mapping, generating a new spatial expression, and mapping the X space information expressed by the word one-hot to the multi-dimensional space vector of Y. That is, multiply the one-hot encoding matrix by a randomly initialized weight matrix to map it into a new word vector. The mapping relationship is shown in the figure below.

[ 0 0 0 1 0 ] [ θ 11 θ 12 θ 13 θ 14 θ 21 θ 22 θ 23 θ 24 θ 31 θ 32 θ 33 θ 34 θ 41 θ 51 θ 42 θ 52 θ 43 θ 53 θ 44 θ 54 ] = [ θ 41 θ 42 θ 43 θ 44 ] (1)

(2) SpatialDropout1D layer: This paper used the SpatialDropout1 method provided by Ketkar (2017). Each time the layer is updated during training, the input units are randomly set to all zeros at a ratio of 0.2 to a particular latitude, which helps prevent overfitting.

(3) Bi-LSTM layer: Bi-LSTM can utilize the information of early and late sequences (Yu et al., 2020). In this layer, the word vector wj is combined into a sequence S as the input of Bi-LSTM, and then combined the hidden state hj−1 generated by the previous layer, to generate the hidden state hj corresponding to each word. The output h of the Bi-LSTM at time t contains forward layer h and backward layer hj, the specific formula is as follows:

h j = LSTM ( h j - 1 , w j ) (2)

(2)

h j = LSTM ( h j - 1 , w j ) (3)

Attention mechanism

Not all words are equally important to a sentence. Some words rich in emotional signals in the sentence, such as adjectives, usually play a decisive role in the emotional attitude of psychological problems. Therefore, the attention mechanism can be used to extract more important textual feature representationct (Wang et al., 2016). Within certain time steps, the attention mechanism assigns weights αt,j to different parts of the text, and the text feature representation ct is calculated as follows.

c t = j = 1 n α t , j [ h j , h j ] (4)

In this formula, αt,j is a weight of the feature vector, hj is the hidden state. In order to calculate αt,j, A layer of feedforward neural network was used to calculate etj as a representation of hj. The calculation formula is as follows:

e tj = δ ( W S [ h j , h j ] + b s ) (5)

Where Ws and bs are the weight matrix and bias, respectively. δ is the non-linear activation function Relu. The calculation of the weight at,j is as follows:

α t , j = exp ( e tj ) j exp ( e tj ) (6)

Finally, within certain time steps, ctcomposes the text feature expression.c The final classification result can be expressed as y^.

c = { c 1 ; c 2 ; c 3 ; ; c t } (7)
y ^ = ( w s c + b s ) (8)

In this formula,Ws is the weight matrix, and bs is the bias.

Model training

The optimization method selects the Adam algorithm (Kingma and Ba, 2014) to update the model parameters. Because it is a multi-classification problem, the cross-entropy loss is used as the loss function for psychological problem prediction. The expression of the cross entropy loss function is:

L = 1 N i L i = - 1 N i C = 1 M y ic log ( p ic ) (9)

Among them, M is the number of psychological problem categories, yic is the sign function (0 or 1), if the true category of sample i is equal to c, take 1, otherwise take 0. pic is the predicted probability that the observed sample i belongs to the category c. The specific algorithm is shown in Table 1.

TABLE 1
www.frontiersin.org

Table 1. Text content classification algorithm based on AB-LSTM.

Experiment and analysis

Cluster analysis

We used the LDA algorithm to extract 9 topics from the preprocessed documents, and simultaneously calculated the most relevant words and their probabilities under these 9 topics. The input text data was manually checked and summarized according to the given vocabulary, and the nine themes were named: “interpersonal relationship,” “emotional stress,” “academic stress,” “negative life events,” “consult the teacher,” “family relationship,” “mental disease,” “romantic relationship,” “personal growth.”

(1) Interpersonal Relationship (IR): When individuals encounter setbacks in interpersonal relationships and do not know coping strategies, general psychological problems including social fear, interpersonal conflict, interpersonal anxiety, fear of looking at each other, fear of peripheral vision, fear of the opposite sex, fear of telephone calls, communication disorders, fear of rejection, not being rejected, social avoidance, social difficulties, unclear, friendship jealousy, addiction to internet relationships, etc. will arise.

(2) Emotional Stress (ES): The psychological tension reaction or state formed by an individual under the action of emotions such as anxiety or fear. Situational stimuli such as major blows from nature or society, individuals experience emotional stress due to tension when they feel that they are unable to cope.

(3) Academic Stress (AS): In the study life, psychological problems caused by the failure of the election, unsuitable learning method and environment under COVID-19, unsatisfactory academic performance, high pressure, unable to keep up with the school’s training plan and teaching progress and so on.

(4) Negative Life Events (NLE): General psychological problems that an individual has as a result of negative events in his life. Negative life events refer to factors that can negatively affect people’s mental activities, including controllable factors such as quarrels with other people and uncontrollable factors such as natural and man-made disasters.

(5) Consult the Teacher (CT): Inquire about how to use the platform, reply to the teachers on the platform, or have opinions on the teachers of school, and hope that the teachers on the platform can give suggestions, etc.

(6) Family Relationship (FR): Since most college students have just grown up, they still have some rebellious mentality, and sometimes they cannot handle the relationship with their parents well. Such problems include psychological problems caused by conflicts with parents, or not knowing how to communicate with parents.

(7) Mental Disease (MD): Poor psychological tolerance, with serious psychological problems such as depression, bipolar disorder, schizophrenia, anxiety disorder, organic mental disorder and delusional disorder, which are in urgent need of psychological guidance from teachers.

(8) Romantic Relationship (RR): Psychological problems caused by incompatible patterns such as virgin complex, virgin complex, suspicion, emotional trauma, fear of love, emotional conflict, emotional crisis, shadow of lovelorn, mate choice anxiety, rejection of confession, fear of blind date, emotional violence, etc.

(9) Personal Growth (PG): Because individuals do not develop good coping strategies, lack courage and self-confidence, experience bad emotions, and encounter psychological problems caused by setbacks in the process of growth.

The bar chart is shown in Figure 2. It can be clearly seen that in all the review sample data, there are more problems involved “interpersonal relationship” and “emotional stress” and less problems involved “romantic relationship” and “personal growth.”

FIGURE 2
www.frontiersin.org

Figure 2. The distribution of psychological problems of college students.

Network structure establishment

Python3 was chosen to built the embedding layer, SpatialDropout1D layer, LSTM layer, and output layer of the AB-LSTM neural network structure. The embedding layer used a vector of length 100 to represent each word. The SpatialDropout1D layer randomly set the ratio of input units to 0.2 at each update during training, which helped prevent overfitting. The LSTM layer contained 100 memory cells. The output layer was a fully connected layer containing 9 categories, and the activation function was a softmax function to calculate the probability of multiple categories. The loss function was the cross entropy loss function, and the optimization algorithm was the Adam algorithm. Used the training data (X_train, Y_train) to fit the model, set the number of network iterations epochs = 15, and the batch size was 32.

Experimental data set and evaluation index

This research classified the text review data based on the LSTM algorithm, with a total of 2,000 samples, of which the training data accounted for 90% and the test data accounted for 10%. As shown in Figure 3, the left side of the figure was the loss curve of the model, the blue line was the loss of training data, the yellow line was the loss of test data, the abscissa was the number of iterations, and there were 20 iterations in total. When iterating to the 10th round, the loss of the model decreased slowly. Although the loss of training data is still decreasing, the loss of test data was basically stable, and even has an upward trend. The right picture showed the accuracy curve. The model fitting results were good. The highest accuracy rate for the test set was 0.82, and the model was not overfitting.

FIGURE 3
www.frontiersin.org

Figure 3. (A) Model loss diagram. (B) Model accuracy chart.

The confusion matrix compared the actual results in the data set with the predicted results in matrix form. The rows of the matrix represented the actual results, and the columns of the matrix represented the prediction results. Figure 4 demonstrates a confusion matrix of the best performing model. Each type of psychological problem is presented in an abbreviated form, such as Ir for Interpersonal relationship. AB-LSTM achieved an accuracy rate of more than 0.6 in each category.

FIGURE 4
www.frontiersin.org

Figure 4. Confusion matrix diagram.

Model comparison analysis

This paper counted all words (removing duplicates) by counting text words, and then used these words as feature vectors and the number of lines as a dimension. Divided the data set into a test set and a training set, and then used the feature vectorization library to transform the text into feature vectors (converted the text into multi-dimensional feature vectors), and trained the model parameters with the divided training data. Finally, used the trained model to predict the results and compare them with other models.

To verify the effectiveness of the proposed AB-LSTM model in this paper, it is compared with MultinomialNB (Xu et al., 2017) and SVC (Liu et al., 2022). The “Accuracy” for each label is the ratio of the number of correctly classified samples to the total number of samples. Different models have different performances in the test set, and the same model has different prediction accuracy in different topics. In order to compare the accuracy of the classification results of different classification algorithms, the results are shown in Table 2.

TABLE 2
www.frontiersin.org

Table 2. Accuracy of different classification algorithms.

This table shows the prediction accuracy of three modeling methods for various psychological problems, and the highest value has been bolded. It can be seen from the table that the prediction accuracy of the AB-LSTM model on psychological problems caused by interpersonal relationship, mental disease, emotional stress and other reasons is significantly higher than that of the other two models. It can also maintain a high accuracy rate for psychological problems caused by negative life events, romantic relationship, personal growth and other reasons.

The indicators selected in this paper were: precision rate and recall rate. Calculate the F1 score for measurement. The formula is as follows:

Pr ecision = Ture positive ( True Positive + False Positive ) (10)
Recall = Ture positive ( True Positive + False Negative ) (11)
F1 Score = 2 * ( Precision * Recall ) ( Precision + Recall ) (12)

In order to exclude the situation that the experimental results were not comparable due to different feature structures, we used the same feature processing except for the different models to ensure the authenticity and reliability of the comparison results. As shown in Table 3, AB-LSTM was superior to other models in precision, recall and F1 score, compared with the MultinomialNB model and the support vector machine model.

TABLE 3
www.frontiersin.org

Table 3. Score table of different classification algorithms.

Conclusion

Overview

The main research work of this paper was based on the research data of 2000 college students’ psychological problems in the community question section of the “Suxin” APP, a psychological self-help and mutual aid platform for college students in Jiangsu Province. The psychological data obtained in this paper for the first time had no labels, and innovatively introduced LDA clustering into the classification of psychological problems, which changed the previous thinking of simply using artificial methods to study it, and more scientifically clustered the psychological problems of college students. After preprocessing it with machine learning techniques, an improved AB-LSTM was proposed to predict the psychological problems of college students with an average accuracy of 78%. Due to the small sample size, the accuracy of the model has achieved desirable results.

In a one-way LSTM, the model actually only uses the “above” information without considering the “below” information. In the prediction of psychological problems, the information of the entire input sequence needs to be used, so the bidirectional propagation LSTM was adopted, and the attention mechanism was introduced to strengthen the long-distance information in the LSTM.

Although many students’ courses involve mental health, due to the repeated epidemics, students’ needs for psychological counseling have increased significantly. There are convincing evidence that timely counseling can help improve the mental health of college students (Thomas et al., 2014). The experimental results show that interpersonal relationship, emotional stress and academic stress are very common among college students’ psychological problems in the post-epidemic era. Most schools were forced to close down, students rarely participated in group activities, had more contact with roommates, and significantly increased interpersonal problems. At the same time, irritability is easy to appear during the isolation period. Due to the increase in the number of consultations, many students cannot get timely psychological consultation, which makes the emotional pressure continue to increase. As one of the most common emotions students experience in college, academic stress can sometimes have an impact on physical and mental health. Due to the COVID-19 pandemic, colleges and universities around the world have moved away from classrooms to offer virtual teaching methods, creating challenges, adaptations and more stress for students. Through the AB-LSTM model, the prediction accuracy of interpersonal problems is 0.62, the prediction accuracy of emotional stress is 0.8, and the prediction accuracy of academic stress is 0.8, which are better than MultinomialNB model and SVC model.

Contribution

The current study is one of the attempts to link machine learning theory with the analysis of psychological problems in college students. Using machine learning methods to predict the psychological problems of college students can improve the efficiency of psychological counseling and help managers to further improve the psychological health of college students under the epidemic.

Limitations

This study has several limitations. First, the model has low prediction accuracy for negative life events and counseling teachers, and cannot identify psychological problems other than these 9 categories. This requires more psychologists to label the data so that the model can identify more types of psychological problems. Secondly, the data are all from Chinese college students, and the results of the study are not universal. At the same time, there is an error in the manual labeling, which interferes with the subsequent model training. Finally, the model can only be used as an auxiliary tool to identify these 9 types of psychological problems, and cannot replace psychological counselors to help college students solve psychological problems.

Further research

In order to provide better psychological counseling services for college students, further research is needed, mainly focusing on the following issues:

(1) Collect more data on college students from different regions, colleges, and levels, improve samples of psychological problems, and strengthen the prediction model. Universal significance and popularity.

(2) Further optimize the text prediction ability, prediction speed and stability of the algorithm.

(3) After judging the types of college students’ psychological problems, they can give corresponding replies to the problems and reduce the burden of psychological counseling workers.

Data availability statement

The original contributions presented in this study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent from the participants or participants legal guardian was not required to participate in this study in accordance with the national legislation and the institutional requirements. Written informed consent was not obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

YL wrote the first draft and revised the manuscript. YS built the framework and revised the manuscript. ZC collected the data. All authors contributed to the article and approved the submitted version.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2022.975493/full#supplementary-material

Footnotes

  1. ^ http://clpsych.org

References

Acevedo, C. M. D., Gómez, J. K. C., and Rojas, C. A. A. (2021). Academic stress detection on university students during COVID-19 outbreak by using an electronic nose and the galvanic skin response. Biomed. Signal Proces. Control 68:102756. doi: 10.1016/j.bspc.2021.102756

CrossRef Full Text | Google Scholar

Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022.

Google Scholar

Cheng, Y., and Zhang, J. (2021). Psychological status analysis and moral education strategy research of vocational college students in the post-epidemic Era. J. High. Educ. Res. 2, 11–14. doi: 10.32629/jher.v2i1.250

CrossRef Full Text | Google Scholar

Chang, J., Yuan, Y., and Wang, D. (2020). Mental health status and its influencing factors among college students during the epidemic of COVID-19. Nan Fang Yi Ke Da Xue Xue Bao J. South. Med. Univ. 40, 171–176. doi: 10.12122/j.issn.1673-4254.2020.02.06

PubMed Abstract | CrossRef Full Text | Google Scholar

Ernala, S. K., Birnbaum, M. L., Candan, K. A., Rizvi, A. F., Sterling, W. A., Kane, J. M., et al. (2019). “Methodological gaps in predicting mental health states from social media: Triangulating diagnostic signals,” in Proceedings of the 2019 chi conference on human factors in computing systems(New York, NY) 1–16. doi: 10.1145/3290605.3300364

CrossRef Full Text | Google Scholar

Gaur, M., Alambo, A., Sain, J. P., Kursuncu, U., Thirunarayan, K., Kavuluru, R., et al. (2019). “Knowledge-aware assessment of severity of suicide risk for early intervention,” in The World Wide Web Conference(Geneva) 514–525. doi: 10.1145/3308558.3313698

CrossRef Full Text | Google Scholar

Ge, F., Zhang, D., Wu, L., and Mu, H. (2020). Predicting psychological state among Chinese undergraduate students in the COVID-19 epidemic: A longitudinal study using a machine learning. Neuropsychiatr. Dis. Treat. 16, 2111–2118. doi: 10.2147/ndt.s262004

PubMed Abstract | CrossRef Full Text | Google Scholar

Kecojevic, A., Basch, C. H., Sullivan, M., and Davi, N. K. (2020). The impact of the COVID-19 epidemic on mental health of undergraduate students in New Jersey, cross-sectional study. PLoS One 15:e0239696. doi: 10.1371/journal.pone.0239696

PubMed Abstract | CrossRef Full Text | Google Scholar

Ketkar, N. (2017). “Introduction to keras,” in Deep learning with Python (Berkeley, CA: Apress), 97–111. doi: 10.1007/978-1-4842-2766-4_7

CrossRef Full Text | Google Scholar

Kingma, D. P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv [Preprint]. doi: 10.48550/arXiv.1412.6980

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, Y., Zhao, J., Ma, Z., McReynolds, L. S., Lin, D., Chen, Z., et al. (2021). Mental health among college students during the COVID-19 pandemic in China: A 2-wave longitudinal survey. J. Aff. Disord. 281, 597–604. doi: 10.1016/j.jad.2020.11.109

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, T., Jin, L., Zhong, C., and Xue, F. (2022). Study of thermal sensation prediction model based on support vector classification (SVC) algorithm with data preprocessing. J. Build. Eng. 48:103919. doi: 10.1016/j.jobe.2021.103919

CrossRef Full Text | Google Scholar

Maslow, A. H., and Mittelmann, B. (1941). Principles of abnormal psychology: The dynamics of psychic illness. (London: Harper & brothers). doi: 10.1093/ptj/21.4.227

CrossRef Full Text | Google Scholar

Qiu, J., Shen, B., Zhao, M., Wang, Z., Xie, B., and Xu, Y. (2020). A nationwide survey of psychological distress among Chinese people in the COVID-19 epidemic: Implications and policy recommendations. Gen. Psychiatry 33:e100213. doi: 10.1136/gpsych-2020-100213corr1

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiuqing, M., Huixiang, X., and Zirong, Y. (2022). Analysis on Health Information Needs and Evolution of Internet Users in the Post-Epidemic Period. Libraly J. 41:119. doi: 10.13663/j.cnki.lj.2022.02.014

CrossRef Full Text | Google Scholar

Ren, F. F., and Guo, R. J. (2020). Public mental health in post-COVID-19 era. Psychiatr. Danubina 32, 251–255. doi: 10.24869/psyd.2020.251

PubMed Abstract | CrossRef Full Text | Google Scholar

Resnik, P., Garron, A., and Resnik, R. (2013). “Using topic modeling to improve prediction of neuroticism and depression in college students,” in Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (Washington, DC) 1348–1353.

Google Scholar

Salim, S. S., Ghanshyam, A. N., Ashok, D. M., Mazahir, D. B., and Thakare, B. S. (2020). “Deep LSTM-RNN with word embedding for sarcasm detection on Twitter,” in 2020 International Conference for Emerging Technology (INCET), (IEEE), 1–4. doi: 10.1109/INCET49848.2020.9154162

CrossRef Full Text | Google Scholar

Sievert, C., and Shirley, K. (2014). “LDAvis: A method for visualizing and interpreting topics,” in Proceedings of Workshop on Interactive Language Learning, Visualization, and Interfaces, Association for Computational Linguistics(Baltimore, MD) 63–70. doi: 10.3115/v1/w14-3110

CrossRef Full Text | Google Scholar

Thomas, S. J., Caputi, P., and Wilson, C. J. (2014). Specific attitudes which predict psychology students’ intentions to seek help for psychological distress. J. Clin. Psychol. 70, 273–282. doi: 10.1002/jclp.22022

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, J., Zhang, Z., Luo, H., Liu, Y., Chen, W., and Wei, G. (2019). Research on early warning model of college students’ psychological crisis based on genetic BP neural network. Am. J. Appl. Psychol. 8, 112–120. doi: 10.11648/j.ajap.20190806.12

CrossRef Full Text | Google Scholar

Wang, Y., Huang, M., Zhu, X., and Zhao, L. (2016). “Attention-based LSTM for aspect-level sentiment classification,” in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (Austin, TX). 606–615. doi: 10.18653/v1/d16-1058

CrossRef Full Text | Google Scholar

Xu, S., Li, Y., and Wang, Z. (2017). Advanced Multimedia and Ubiquitous Engineering Berlin:Springer 347–352. doi: 10.1007/978-981-10-5041-1_57

CrossRef Full Text | Google Scholar

Yu, X. M., Feng, W. Z., Wang, H., Chu, Q., and Chen, Q. (2020). An attention mechanism and multi-granularity-based Bi-LSTM model for Chinese Q&A system. Soft Comput. 24, 5831–5845. doi: 10.1007/s00500-019-04367-8

CrossRef Full Text | Google Scholar

Keywords: AB-LSTM, psychological problem categories, natural language processing, machine learning, text cluster analysis

Citation: Liu Y, Shen Y and Cai Z (2022) A deep learning-based prediction model of college students’ psychological problem categories for post-epidemic era—Taking college students in Jiangsu Province, China as an example. Front. Psychol. 13:975493. doi: 10.3389/fpsyg.2022.975493

Received: 22 June 2022; Accepted: 22 July 2022;
Published: 17 August 2022.

Edited by:

Sandra Maria Correia Loureiro, University Institute of Lisbon (ISCTE), Portugal

Reviewed by:

João Ferreira Do Rosário, Instituto Politécnico de Lisboa, Portugal
Eduardo Moraes Sarmento, Lusophone University of Humanities and Technologies, Portugal

Copyright © 2022 Liu, Shen and Cai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yajing Shen, anN0enN5ajA0MDFAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.