Skip to main content

ORIGINAL RESEARCH article

Front. Public Health
Sec. Public Mental Health
Volume 13 - 2025 | doi: 10.3389/fpubh.2025.1472050
This article is part of the Research Topic New Insights into Social Isolation and Loneliness, Volume II View all 7 articles

Identifying major depressive disorder among US adults living alone using stacked ensemble machine learning algorithms

Provisionally accepted
  • 1 West China Hospital, Sichuan University, Chengdu, China
  • 2 Sichuan Center for Disease Control and Prevention, Chengdu, Sichuan Province, China

The final, formatted version of the article will be published soon.

    It has been increasingly recognized that adults living alone have a higher likelihood of developing Major Depressive Disorder (MDD) than those living with others. However, there is still no prediction model for MDD specifically designed for adults who live alone.This study aims to investigate the effectiveness of utilizing personal health data in combination with a stacked ensemble machine learning (SEML) technique to detect MDD among adults living alone, seeking to gain insights into the interaction between personal health data and MDD.Our data originated from the US National Health and Nutrition Examination Survey (NHANES) spanning 2007 to 2018.We finally selected a set of 30 easily accessible variables encompassing demographic profiles, lifestyle factors, and baseline health conditions. We constructed a SEML model for MDD detection, incorporating three conventional machine learning algorithms as base models and a Neural Network (NN) as the meta-model. Furthermore, SHapley Additive exPlanations (SHAP) analysis was used to explain the impact of each predictor on MDD.The study included 2,642 adult participants who lived alone, of whom 10.6% (279 out of 2,642) had a PHQ-9 score of 10 or above, indicating the presence of MDD. The performance of our SEML model was robust, with an area under the curve (AUC) of 0.85. Further analysis using SHAP revealed positive correlations between the occurrence of MDD and factors such as sleep disorders, number of prescription medications, need for specific walking aids, leak urine during nonphysical activities, chronic bronchitis, and Healthy Eating Index (HEI) scores for sodium. Conversely, age, the Family Monthly Poverty Level Index (FMMPI), and HEI scores for added sugar showed negative correlations with MDD occurrence.Additionally, a U-shaped relationship was noted between the occurrence of MDD and both sleep duration and Body Mass Index (BMI), as well as HEI scores for dairy.The study has successfully developed a predictive model for MDD, specifically tailored for adults living alone using a stacked ensemble technique, enhancing the identification of MDD and its risk factors among adults living alone.

    Keywords: Major Depressive Disorder, adults living alone, stacked ensemble technique, machine learning, US NHANES

    Received: 28 Jul 2024; Accepted: 10 Feb 2025.

    Copyright: © 2025 Chen, liu, Zhang, Xing, Jiang, Xiang and Duan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence:
    Zhou Xiang, West China Hospital, Sichuan University, Chengdu, China
    Xin Duan, West China Hospital, Sichuan University, Chengdu, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.