Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression

Patra, Braja Gopal; Sun, Zhaoyi; Cheng, Zilin; Kumar, Praneet Kasi Reddy Jagadeesh; Altammami, Abdullah; Liu, Yiyang; Joly, Rochelle; Jedlicka, Caroline; Delgado, Diana; Pathak, Jyotishman; Peng, Yifan; Zhang, Yiye

doi:10.3389/fpsyt.2023.1258887

BRIEF RESEARCH REPORT article

Front. Psychiatry , 20 November 2023

Sec. Perinatal Psychiatry

Volume 14 - 2023 | https://doi.org/10.3389/fpsyt.2023.1258887

This article is part of the Research Topic Women in Psychiatry 2023: Perinatal Psychiatry View all 12 articles

Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression

$\r\nBraja Gopal Patra&#x;&#x;$ Braja Gopal Patra^1†‡

Zhaoyi Sun^1†‡

Zilin Cheng^1†‡

Praneet Kasi Reddy Jagadeesh Kumar¹

Abdullah Altammami¹

Yiyang Liu¹

Rochelle Joly²

Caroline Jedlicka^3,4^‡

Diana Delgado⁴^‡

Jyotishman Pathak¹^‡

Yifan Peng¹^‡

Yiye Zhang^1*‡

¹Department of Population Health Sciences, Weill Cornell Medicine, New York, NY, United States
²Department of Obstetrics and Gynecology, Weill Cornell Medicine, New York, NY, United States
³Kingsborough Community College, City University of New York, New York, NY, United States
⁴Samuel J. Wood Library & C.V. Starr Biomedical Information Center, Weill Cornell Medicine, New York, NY, United States

Objective: Evidence suggests that high-quality health education and effective communication within the framework of social support hold significant potential in preventing postpartum depression. Yet, developing trustworthy and engaging health education and communication materials requires extensive expertise and substantial resources. In light of this, we propose an innovative approach that involves leveraging natural language processing (NLP) to classify publicly accessible lay articles based on their relevance and subject matter to pregnancy and mental health.

Materials and methods: We manually reviewed online lay articles from credible and medically validated sources to create a gold standard corpus. This manual review process categorized the articles based on their pertinence to pregnancy and related subtopics. To streamline and expand the classification procedure for relevance and topics, we employed advanced NLP models such as Random Forest, Bidirectional Encoder Representations from Transformers (BERT), and Generative Pre-trained Transformer model (gpt-3.5-turbo).

Results: The gold standard corpus included 392 pregnancy-related articles. Our manual review process categorized the reading materials according to lifestyle factors associated with postpartum depression: diet, exercise, mental health, and health literacy. A BERT-based model performed best (F1 = 0.974) in an end-to-end classification of relevance and topics. In a two-step approach, given articles already classified as pregnancy-related, gpt-3.5-turbo performed best (F1 = 0.972) in classifying the above topics.

Discussion: Utilizing NLP, we can guide patients to high-quality lay reading materials as cost-effective, readily available health education and communication sources. This approach allows us to scale the information delivery specifically to individuals, enhancing the relevance and impact of the materials provided.

1 Background and significance

Pregnancy is a vulnerable period that exposes patients to heightened anxiety, depression, and stress. One in seven birthing parents develops postpartum depression (PPD), a potentially life-threatening mental health condition and a much higher proportion of pregnant patients experience antenatal psychosocial stress (1, 2). Anxiety, depression, and stress lead to adverse maternal, infant, and family outcomes, disproportionately affecting disadvantaged families (1, 3). The negative impact can be mitigated by interventions from healthcare providers (4, 5). However, resource constraints, compounded with the sensitive nature of pregnancy and stigma against mental health, present a unique challenge in the prevention, screening, and management of mental health concerns in the clinical settings (1, 6). Disparities in the distance to healthcare, health literacy, socioeconomic status, and neighborhood characteristics further strangulate equitable access to clinical interventions during pregnancy (4, 7, 8).

In turn, patients resort to self-management to relieve stressors and resolve individual questions around pregnancy (9). Patients have grown to be active online information seekers (10–12), of which people of childbearing age are the most active members in the digital space in the US (13). Three-quarters of the US pregnant population is known to seek information about pregnancy health and birth online (14). However, a myriad of information and players with unspecified motives exist online, making it often difficult for lay audiences, particularly those with low health literacy, to comprehend and act appropriately (15, 16). Notably, the appropriateness of online content was identified as a barrier to user satisfaction and continued engagement among the pregnancy population (17, 18).

Our research is aimed to develop a platform to deliver personalized health education and communication materials. The Support Personalized prEgnancy Care with Artificial inteLligence (SPECIAL) platform (https://www.specialdayshealthinfo.com/) houses content on health education and communication developed by a commercial vendor. This study aimed to assess the feasibility of utilizing natural language processing (NLP) to repurpose publicly accessible lay reading materials from magazines and online sources as health education and communication content.

1.1 Related work

Numerous studies have explored strategies for evaluating online health information and utilizing it for health education, communication, and promotion (12, 19, 20). However, our approach in this study diverges from existing literature. We present an innovative method that streamlines the search for online lay articles and information based on relevance and topic, effectively alleviating the health literacy challenges and search burden experienced by patients. Studies most relevant to us are previous work in categorizing news from public datasets such as TagMyNews using various machine learning and optimization techniques (21). Likewise, there are studies on the classification of health-related information and news from social media, such as the extraction of information related to Zika virus, syndromic surveillance, and identifying misinformation from social media posts (22–25).

1.2 Objective

Extensive theoretical research and empirical evidence have consistently established the crucial role of health education and communication, particularly within the framework of social support, in determining health outcomes during pregnancy (9). As part of our ongoing study on preventing postpartum depression (PPD) through intervention development, we propose employing a machine learning approach to deliver health education and communication. This approach aims to assist patients in recognizing sources of support and enhancing their self-management abilities. Our objective is to develop an NLP model by curating relevant lay articles on pregnancy health that can serve as effective health education and communication materials for patients.

Our focus on lay articles stems from their accessibility and the ease with which patients can comprehend them, even without an advanced level of health literacy. In order to establish a scalable and sustainable approach, we hypothesize that NLP can be employed to identify current and credible reading materials pertaining to PPD from lay sources. By leveraging NLP techniques, we aim to provide patients with up-to-date and reliable resources related to PPD, thus enhancing their overall health literacy and empowering them to make informed decisions.

2 Materials and methods

2.1 Data construction

We identified articles from online sources such as Mayo Clinic, MedlinePlus, and American Pregnancy Association. We also collected medically reviewed articles from non-medical sources with easy accessibility and high recognition that are written with the lay population as the target audience. We looked for articles related to PPD risk factors and social support. Using Vaux's theory of social support (26), we centered the article classification around social support constructs and focused on articles and resources that serve as instrumental (diet, exercise), informational (health literacy), and emotional (wellness) support. Here, “informational” refers to articles that are related to pregnancy but do not fall into the previous three categories. We collected an initial set of articles by online search using keywords including pregnancy diet, pregnancy exercise, pregnancy fitness, pregnancy yoga, pregnancy mental health, pregnancy anxiety, pregnancy mood swings, depression during pregnancy, mental wellbeing pregnancy, mental health AND pregnancy, pregnancy and mental issues, Perinatal mood and anxiety disorder (PMAD) in pregnancy, anxiety during pregnancy, and dealing with stress during pregnancy. As we envisioned that the articles would be a preventative measure against PPD for pregnant readers, we intentionally excluded articles that mentioned PPD to avoid alerting patients to the potential disease risk (27).

Collected articles were then reviewed for selection following NIH recommendations on how to evaluate health information on the internet (28). The recommendations focused on evaluating the ownership, purpose, and funding source of the website hosting the information; the evidence used to support the information; reviewers of the information (medically reviewed or not); the year of publication; consumer information collected by the site, and availability of monitoring for online interactions among consumers. Based on these criteria, we prioritized articles that were medically reviewed or published by a non-profit organization to serve as the corpus of the pipeline. Our content validation process also included a credibility check on the collected articles to avoid misinformation and conflicting information across sources. This secondary process excluded articles that were published by non-US sources, were not written by certified specialists holding degrees such as M.D. and Ph.D., and were not published by websites well-known to the general public. Finally, we excluded articles on mixed topics that contained less than 80% of content related to pregnancy, and we also excluded articles published before 2019 to ensure the timeliness of the information.

We identified 777 articles based on the above criteria from online searches. There were 523 credible articles left after the content validation step. Finally, we had 392 articles in our defined categories for training the NLP models. As a control, we also selected 431 articles that were not related to pregnancy. A list of unrelated news article topics is mentioned in Supplementary Table S5. The details of the article collection are shown in Figure 1 and counts of articles are provided in Table 1. Each article was reviewed by two annotators and the Cohen kappa metric for inter-rater reliability was 0.95. Any disagreement between reviewers was resolved by a third reviewer. During the review process, we identified the sources of the collected articles. Going forward, we will utilize these verified and authentic sources to gather pregnancy-related articles. To streamline the process, we have developed automated two web crawlers specifically tailored for each source, enabling efficient article collection.

FIGURE 1

Figure 1. News articles inclusion criteria.

TABLE 1

Table 1. Characteristics of the dataset.

2.2 Model development

We employed machine and deep-learning models to classify the collected articles. We concatenated the title and full-length text as the input. For both titles and full-length text, we removed stop words using the Natural Language Toolkit (NLTK) (29). We experimented with two approaches: an end-to-end approach and a two-step approach. The end-to-end approach treated articles as five topics: diet, exercise, mental health, health literacy, or not related to pregnancy. The two-step approach identified articles in two steps. The first stage classified articles based on whether or not they are related to pregnancy. Then, the second step classified a pregnancy-relevant article into four topics: diet, exercise, mental health, and health literacy.

2.2.1 Experimental settings

In this study, we used 5-fold cross-validation to obtain a distribution of the experimental metrics and reported the macro-averaged precision (P), recall (R), and F1-scores (F). In each of the five folds, we used one-fold (20%) as the hold-out test set and the remaining four folds (80%) as the training set. We leveraged the state-of-the-art Bidirectional Encoder Representations from Transformers [BERT (30)] and further fine-tuned the pertained model in downstream tasks. The model took a sequence of tokens with a maximum length of 512 and produced a 768-dimensional sequence representation vector. For text that is shorter than 512 tokens, we added paddings (empty tokens) to the end of the text to make up the length. For text that is longer than 512 tokens, we used the first 512 tokens as the input. Then, two fully connected layers are appended on top of the pooler output layer of the BERT model. Finally, a SoftMax layer is used to map the representation vector to the target label space. For the BERT model, we fine-tuned the “BioBERT” (31) model with the training data for 20 epochs with a learning rate of 2*10⁻⁵ and batch size of 16. We adopted AdamW (32) as the optimizer and cross-entropy as the loss function. All BERT models were constructed on Amazon SageMaker with an NVIDIA T4 GPU with 128 GB of GPU memory. Pytorch (1.12.1) library was used to develop BERT models.

We compared the BERT model with the Generative Pre-trained Transformer model (GPT). We used the Dec 15, 2022 version of gpt-3.5-turbo to classify articles by question answering. Different questions corresponded to different classification methods and article types. For one-step approach, we asked gpt-3.5-turbo the question “Only give a one-word answer for which category this text belongs to: pregnancy-related diet, pregnancy-related exercise, pregnancy-related mental health, pregnancy-related health literacy, or pregnancy-unrelated.” This allowed gpt-3.5-turbo to directly classify the articles into one of five categories. For the two-step method and the classification of pregnancy-related and pregnancy-topics, we first used the question “Only answer yes or no to the question: Is the following text related to pregnancy and not” to classify the articles as pregnancy-related or not. Further, for the pregnancy-related articles, we asked the question “Only give a one-word answer for which category this text belongs to: diet, exercise, mental health, or health literacy” to distinguish the pregnancy topics. We also compared the proposed BERT models with traditional machine learning methods. In particular, we experimented with Random Forests (RF) using TF-IDF features. We set the word frequency to be (< 4) and capped the dimensionality to 1500 features. The number of trees in the forest was 200. Scikit-learn (1.0.2) library was used for the RF model.

3 Results

3.1 Overall performance

Table 2 shows the performance of RF, gpt-3.5-turbo, and BERT-based classifications using one-step and two-step approaches. In the case of the one-step approach, we observed that BERT obtained the best performance with a macro-averaged precision of 0.957, a recall of 0.951, and an F1-score of 0.953, which are 4.01, 8.60, and 6.50% higher than those of RF, and 18.6, 4.6, and 14.9% higher than those of gpt-3.5-turbo, respectively. The BERT-based two-step approach achieved micro-averaged precision, recall, and F1-score of 0.919, 0.926, and 0.921, respectively. The RF-based two-step approach achieved macro-averaged precision, recall, and F1-score of 0.888, 0.902, and 0.892, respectively. The gpt-3.5-turbo-based two-step approach obtained macro-averaged precision, recall, and F1-score of 0.876, 0.922, and 0.890, respectively. When comparing the end-to-end model with the two-step approach, we found that the end-to-end BERT model performed better than the two-step BERT-based approach. However, the end-to-end gpt-3.5-turbo model is worse than its counterparts.

TABLE 2

Table 2. Overall performances of RFs, gpt-3.5-turbo, and BERT models.

3.2 Classification by relevance to pregnancy

Figure 2 and Supplementary Table S1 show the binary classification performance comparison between RF, gpt-3.5-turbo, and BERT. We observed that BERT achieved the best performance with macro-averaged precision, recall, and F1-score of 0.975, 0.974, and 0.974, respectively for classifying pregnancy-related and unrelated articles. Detailed counts of BERT-based model such as true positive, false positive, and false negative of each fold can be found in Supplementary Table S2. RF and gpt-3.5-turbo showed lower F1-scores of 0.945 and 0.908 than BERT.

FIGURE 2

Figure 2. (A) F1-scores of the classification of pregnancy-relevant articles. (B) F1-scores of the classification of pregnancy topics.

3.3 Classification by pregnancy topics

Figure 2 and Supplementary Table S3 show that the gpt-3.5-turbo model achieved the best performance with macro-averaged precision, recall, and F1-score of 0.973, 0.972, and 0.972, respectively. Detailed counts of gpt-3.5-turbo model such as the true positive, false positive, and false negative of each fold can be found in Supplementary Table S4. The performance of gpt-3.5-turbo was 3.53, 3.43, and 3.53% higher than that of RF and 0.52, 0.48, and 0.56% higher than that of BERT.

4 Discussion

Our pipelines demonstrated the ability of the NLP algorithm to identify pregnancy-related online health education materials. The end-to-end BERT model obtained better results than the two-step BERT by 3.22% in the F1-score. For RF, although the two-step approach increased the recall by 3.74%, the precision decreased by 2.95%. Similarly, the two-step approach using gpt-3.5-turbo increased the F1-score by 8.6%. BERT outperformed RF and gpt-3.5-turbo in all scenarios. While end-to-end had better performance than two-step, it lacks interpretability and requires retraining when new topics are added. In comparison, in the two-step approach, we only need to train the second one and there is flexibility in adding additional topics for articles.

In the binary classification of pregnancy-related articles, RF obtained a satisfactory F1-score. The performance of BERT in this task is not an obvious advantage; although it improved by 2.9% it required more time and computing cost. In addition, the type of input text had little effect on the performance of the model, and the title achieved high accuracy under extremely fast computation speed. In the classification of pregnancy topics, the performance advantage of BERT and gpt-3.5-turbo are significant. Despite the demonstrated performance of gpt-3.5-turbo in general question answering, we discovered that, due to the specialized needs of our questioning (related to pregnancy but does not mention postpartum depression), we needed a dedicated NLP model for our use case. Still, gpt-3.5-turbo demonstrated superior performance in pregnancy topic classification, thus showing its potential for use in the future.

We performed our experiments using 392 pregnancy-related articles, and the BERT-based system performed better in most cases. In the future, we plan to add more articles with additional categories related to pregnancy. We searched the articles for our experiments manually, and it is possible that our search did not retrieve all relevant articles. We plan to implement automated web crawlers to collect the articles from the identified sources more efficiently in the future.

4.1 Limitations

Here are two limitations of our study. First, we intentionally focused on articles from 2019 onwards as the gold standard to capture recent trends and understandings in health topics. While we acknowledge the potential relevance and value of articles published prior to 2019, our current model was designed with a specific timeframe in mind to ensure contemporary relevance. However, we will consider expanding our dataset to include older articles in future iterations or expansions of the model. Second, the BERT-based system has a limitation of accepting only 512 tokens as input from each document. One potential solution to this challenge is to divide the document into multiple chunks and feed them individually to the BERT-based system. However, since the system already outperformed all other systems in terms of performance, we decided not to pursue this approach in order to avoid complicating the development process.

5 Conclusion

Given the abundance of high-quality lay articles and the knowledge- and labor-intensiveness of creating patient education materials, we proposed that lay articles related to pregnancy could be categorized and repurposed into educational and communicational materials for patients through NLP models. We explored state-of-the-art BERT models and gpt-3.5-turbo in an effort to obtain superior performances in classifying these articles. Current work is underway to incorporate the model based on gpt-3.5-turbo into our research website to scale the volume and diversity of content the website can provide to patients with diverse backgrounds and needs. Future work involves an on-going pilot evaluation with patients to assess acceptability and technical feasibility.

Data availability statement

The original contributions presented in the study are publicly available. This data can be found here: https://github.com/brajagopalcse/AI_Driven_Patient_Education_Materials.

Author contributions

BP: Conceptualization, Formal analysis, Investigation, Methodology, Software, Supervision, Validation, Writing—original draft, Writing—review & editing, Data curation, Resources, Visualization. ZS: Data curation, Formal analysis, Software, Writing—review & editing, Validation. ZC: Data curation, Formal analysis, Software, Writing—review & editing, Validation. PK: Data curation, Formal analysis, Methodology, Project administration, Software, Validation, Writing—review & editing. AA: Data curation, Formal analysis, Writing—review & editing. YL: Data curation, Formal analysis, Software, Writing—review & editing, Validation. RJ: Conceptualization, Data curation, Writing—review & editing. CJ: Conceptualization, Data curation, Writing—review & editing. DD: Conceptualization, Data curation, Writing—review & editing. JP: Writing—review & editing, Supervision. YP: Investigation, Writing—review & editing, Supervision. YZ: Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Visualization, Writing—original draft, Writing—review & editing, Data curation, Validation.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research is supported by the US Department of Transportation Center for Transportation, Environment, and Community Health, AWS Diagnostic Development Initiative Support (YP), and Amazon Research Award (YP).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2023.1258887/full#supplementary-material

References

1. Schetter CD, Tanner L. Anxiety, depression and stress in pregnancy: implications for mothers, children, research, and practice. Curr Opin Psychiatry. (2012) 25:141. doi: 10.1097/YCO.0b013e3283503680

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Mughal S, Azhar Y, Siddiqui W. Postpartum depression. In: StatPearls. StatPearls Publishing (2022).

Google Scholar

3. Walsh K, McCormack CA, Webster R, Pinto A, Lee S, Feng T, et al. Maternal prenatal stress phenotypes associate with fetal neurodevelopment and birth outcomes. Proc Nat Acad Sci. (2019) 116:23996–4005. doi: 10.1073/pnas.1905890116

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Panaite V, Bowersox NW, Zivin K, Ganoczy D, Kim HM, Pfeiffer PN. Individual and neighborhood characteristics as predictors of depression symptom response. Health Serv Res. (2019) 54:586. doi: 10.1111/1475-6773.13127

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Peahl AF, Zahn CM, Turrentine M, Barfield W, Blackwell SD, Roberts SJ, et al. The michigan plan for appropriate tailored healthcare in pregnancy prenatal care recommendations. Obstetr Gynecol. (2021) 138:593–602. doi: 10.1097/AOG.0000000000004531

PubMed Abstract | CrossRef Full Text | Google Scholar

6. O'Connor E, Senger CA, Henninger ML, Coppola E, Gaynes BN. Interventions to prevent perinatal depression: evidence report and systematic review for the us preventive services task force. Jama. (2019) 321:588–601. doi: 10.1001/jama.2018.20865

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Adhikari K, Patten SB, Williamson T, Patel AB, Premji S, Tough S, et al. Neighbourhood socioeconomic status modifies the association between anxiety and depression during pregnancy and preterm birth: a community-based Canadian cohort study. BMJ Open. (2020) 10:e031035. doi: 10.1136/bmjopen-2019-031035

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Zhang Y, Tayarani M, Wang S, Liu Y, Sharma M, Joly R, et al. Identifying urban built environment factors in pregnancy care and maternal mental health outcomes. BMC Preg Childbirth. (2021) 21:1–11. doi: 10.1186/s12884-021-04056-1

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Bedaso A, Adams J, Peng W, Sibbritt D. The relationship between social support and mental health problems during pregnancy: a systematic review and meta-analysis. Reprod Health. (2021) 18:1–23. doi: 10.1186/s12978-021-01209-5

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Lupton D. The use and value of digital media for information about pregnancy and early motherhood: a focus group study. BMC Pregn Childbirth. (2016) 16:1–10. doi: 10.1186/s12884-016-0971-3

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Harpel T. Pregnant women sharing pregnancy-related information on facebook: web-based survey study. J Med Internet Res. (2018) 20:e115. doi: 10.2196/jmir.7753

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Timmers T, Janssen L, Kool RB, Kremer JA. Educating patients by providing timely information using smartphone and tablet apps: systematic review. J Med Internet Res. (2020) 22:e17342. doi: 10.2196/17342

CrossRef Full Text | Google Scholar

13. TenBarge AM, Riggins JL. Responding to unsolicited medical requests from health care professionals on pharmaceutical industry-owned social media sites: three pilot studies. J Med Internet Res. (2018) 20:e285. doi: 10.2196/jmir.9643

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Sayakhot P, Carolan-Olah M. Internet use by pregnant women seeking pregnancy-related information: a systematic review. BMC Preg Childbirth. (2016) 16:1–10. doi: 10.1186/s12884-016-0856-5

CrossRef Full Text | Google Scholar

15. Pilgrim K, Bohnet-Joschko S. Selling health and happiness how influencers communicate on instagram about dieting and exercise: mixed methods research. BMC Public Health. (2019) 19:1–9. doi: 10.1186/s12889-019-7387-8

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Fung IC-H, Blankenship EB, Ahweyevu JO, Cooper LK, Duke CH, Carswell SL, et al. Public health implications of image-based social media: A systematic review of instagram, pinterest, tumblr, and flickr. Perm J. (2020) 24. doi: 10.7812/TPP/18.307

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Oviatt JR, Reich SM. Pregnancy posting: exploring characteristics of social media posts around pregnancy and user engagement. Mhealth. (2019) 5:46. doi: 10.21037/mhealth.2019.09.09

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Zhu C, Zeng R, Zhang W, Evans R, He R. Pregnancy-related information seeking and sharing in the social media era among expectant mothers: qualitative study. J Med Internet Res. (2019) 21:e13694. doi: 10.2196/13694

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Korp P. Health on the internet: implications for health promotion. Health Educ Res. (2006) 21:78–86. doi: 10.1093/her/cyh043

CrossRef Full Text | Google Scholar

20. Littlechild SA, Barr L. Using the internet for information about breast cancer: a questionnairebased study. Patient Educ Couns. (2013) 92:413–7. doi: 10.1016/j.pec.2013.06.018

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Romero F, Koochak Z. Assessing and Implementing Automated News Classification. Standford, CA: Standford University (2015).

Google Scholar

22. Mandal S, Rath M, Wang Y, Patra BG. Predicting zika prevention techniques discussed on Twitter: an exploratory study. In: Proceedings of the 2018 Conference on Human Information Interaction & Retrieval. (2018) p. 269–272. doi: 10.1145/3176349.3176874

CrossRef Full Text | Google Scholar

23. Ayoub J, Yang XJ, Zhou F. Combat covid-19 infodemic using explainable natural language processing models. Inf Process Manag. (2021) 58:102569. doi: 10.1016/j.ipm.2021.102569

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Wilson AE, Lehmann CU, Saleh SN, Hanna J, Medford RJ. Social media: a new tool for outbreak surveillance. Antimicrobial Steward Healthcare Epidemiol. (2021) 1:e50. doi: 10.1017/ash.2021.225

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Ravichandran BD, Keikhosrokiani P. Classification of covid-19 misinformation on social media based on neuro-fuzzy and neural network: a systematic review. Neur Comput Appl. (2023) 35:699–717. doi: 10.1007/s00521-022-07797-y

CrossRef Full Text | Google Scholar

26. Vaux A, Ruggiero M. Stressful life change and delinquent behavior. Am J Commun Psychol. (1983) 11:169. doi: 10.1007/BF00894365

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Ancker JS, Witteman HO, Hafeez B, Provencher T, Van de Graaf M, Wei E. “you get reminded you're a sick person”: personal data tracking and patients with multiple chronic conditions. J Med Internet Res. (2015) 17:e202. doi: 10.2196/jmir.4209

PubMed Abstract | CrossRef Full Text | Google Scholar

28. National Institutes of Health. Health information on the internet: Questions and answers. National Institutes of Health (2011).

Google Scholar

29. Bird S, Klein E, Loper E. Natural Language Processing With Python: Analyzing Text With the Natural Language Toolkit. Sebastopol, CA: O'Reilly Media, Inc. (2009).

Google Scholar

30. Devlin J, Chang MW, Lee K, Toutanova K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint. arXiv:1810.04805 (2018). doi: 10.48550/arXiv.1810.04805

CrossRef Full Text | Google Scholar

31. Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, et al. Biobert: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. (2020) 36:1234–40. doi: 10.1093/bioinformatics/btz682

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Kingma DP, Ba J. Adam: A method for stochastic optimization. arXiv preprint. arXiv:1412.6980 (2014). doi: 10.48550/arXiv.1412.6980

CrossRef Full Text | Google Scholar

Keywords: online health information, health communication, natural language processing, pregnancy, postpartum depression

Citation: Patra BG, Sun Z, Cheng Z, Kumar PKRJ, Altammami A, Liu Y, Joly R, Jedlicka C, Delgado D, Pathak J, Peng Y and Zhang Y (2023) Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression. Front. Psychiatry 14:1258887. doi: 10.3389/fpsyt.2023.1258887

Received: 14 July 2023; Accepted: 25 October 2023;
Published: 20 November 2023.

Edited by:

Laura Orsolini, Marche Polytechnic University, Italy

Reviewed by:

Balu Bhasuran, University of California, San Francisco, United States
Chi-Shin Wu, National Health Research Institutes, Taiwan

Copyright © 2023 Patra, Sun, Cheng, Kumar, Altammami, Liu, Joly, Jedlicka, Delgado, Pathak, Peng and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yiye Zhang, eWl6MjAxNEBtZWQuY29ybmVsbC5lZHU=

^†These authors share first authorship

^‡ORCID: Braja Gopal Patra orcid.org/0000-0003-2997-5314
Zhaoyi Sun orcid.org/0009-0003-8197-1465
Zilin Cheng orcid.org/0009-0004-0603-1964
Caroline Jedlicka orcid.org/0000-0002-3496-9632
Diana Delgado orcid.org/0000-0002-6290-3497
Jyotishman Pathak orcid.org/0000-0002-4856-410X
Yifan Peng orcid.org/0000-0001-9309-8331
Yiye Zhang orcid.org/0000-0003-3494-2699

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Automated classification of lay health articles using natural language processing: a case study on pregnancy health and postpartum depression

1 Background and significance

1.1 Related work

1.2 Objective

2 Materials and methods

2.1 Data construction

2.2 Model development

2.2.1 Experimental settings

3 Results

3.1 Overall performance

3.2 Classification by relevance to pregnancy

3.3 Classification by pregnancy topics

4 Discussion

4.1 Limitations

5 Conclusion

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher's note

Supplementary material

References

95% of researchers rate our articles as excellent or good

95% of researchers rate our articles as excellent or good