A machine learning model to predict the risk of depression in US adults with obstructive sleep apnea hypopnea syndrome: a cross-sectional study

Li, Enguang; Ai, Fangzhu; Liang, Chunguang

doi:10.3389/fpubh.2023.1348803

ORIGINAL RESEARCH article

Front. Public Health, 08 January 2024

Sec. Public Mental Health

Volume 11 - 2023 | https://doi.org/10.3389/fpubh.2023.1348803

This article is part of the Research Topic: Artificial Intelligence and Mental Health Care

Enguang Li

Fangzhu Ai

Chunguang Liang^*

Department of Nursing, Jinzhou Medical University, Jinzhou, China

Objective: Depression is very common and harmful in patients with obstructive sleep apnea hypopnea syndrome (OSAHS). It is necessary to screen OSAHS patients for depression early. However, there are no validated tools to assess the likelihood of depression in patients with OSAHS. This study used data from the National Health and Nutrition Examination Survey (NHANES) database and machine learning (ML) methods to construct a risk prediction model for depression, aiming to predict the probability of depression in the OSAHS population. Relevant features were analyzed and a nomogram was drawn to visually predict and easily estimate the risk of depression according to the best performing model.

Study design: This is a cross-sectional study.

Methods: Data from three cycles (2005–2006, 2007–2008, and 2015–2016) were selected from the NHANES database, and 16 influencing factors were screened and included. Three prediction models were established using the logistic regression algorithm, the least absolute shrinkage and selection operator (LASSO) algorithm, and the random forest algorithm, respectively. The receiver operating characteristic (ROC) area under the curve (AUC), specificity, sensitivity, and decision curve analysis (DCA) were used to evaluate and compare the different ML models.

Results: The logistic regression model had lower sensitivity than the lasso model, while its specificity and AUC were higher than those of the random forest and lasso models. Moreover, when the threshold probability was in the ranges 0.19–0.25 and 0.45–0.82, the net benefit of the logistic regression model was the largest. The logistic regression model clarified the factors contributing to depression, including gender, general health condition, body mass index (BMI), smoking, OSAHS severity, age, education level, ratio of family income to poverty (PIR), and asthma.

Conclusion: This study developed three machine learning (ML) models (logistic regression model, lasso model, and random forest model) using the NHANES database to predict depression and identify influencing factors among OSAHS patients. Among them, the logistic regression model was superior to the lasso and random forest models in overall prediction performance. By drawing the nomogram and applying it to the sleep testing center or sleep clinic, sleep technicians and medical staff can quickly and easily identify whether OSAHS patients have depression to carry out the necessary referral and psychological treatment.

Introduction

Depression is a widespread mental health disorder that seriously limits patients’ psychological and social functioning and reduces their quality of life. At the same time, depression also brings severe financial and emotional stress to the families of patients. Its main features include persistent fatigue, low mood, reduced interest, and poor concentration (1). Depression is now a leading contributor to the global burden of disease. In addition to its severe influence on a person’s emotional and psychological state, it may seriously impact work and personal relationships (2). In 2015, the WHO announced that, globally, depression affects more than 300 million people, or 4.4% of the world’s population, and is the leading cause of disability (3), with about 1 million people dying of depression each year (4). Depression also imposes significant socioeconomic costs. The annual cost of treating depression in the US is reported to be as high as $210 billion (5). However, in high-income countries, nearly half of people with depression are not diagnosed or treated. In low- and middle-income countries, the proportion is as high as 80%–90%. Early detection and prevention of depression are, therefore, essential to reduce the global burden. Society needs to take action early in life and to address adversity and the impact of inequality (6).

Obstructive Sleep Apnea Hypopnea Syndrome (OSAHS) is a chronic disease characterized by recurrent upper airway collapse and obstruction during sleep (7), resulting in periodic reduction or cessation of ventilation, which causes hypoxia, hypercapnia, and sleep arousal (8). Cross-sectional studies have found that OSAHS may increase the risk of depression (9). In addition, a dose–response relationship has been found between the severity of OSAHS and the risk of depression (10). That is, the more severe the OSAHS, the higher the risk of depression. This means that the presence of OSAHS may cause more significant difficulties in the treatment of depression. A cross-sectional study of community and clinical populations found a relatively high prevalence of depression of 17% in patients with OSAHS (11). In contrast, the prevalence of depression among patients with a definite diagnosis of OSAHS in the sleep clinic showed wide variability, ranging from 5% to 63% (2). The causal relationship between OSAHS and depression has also been examined in a prospective longitudinal study that assessed depression one year later (12). Therefore, we suggest screening patients with OSAHS for psychiatric symptoms so that depression is detected in a timely manner, which allows it to be prevented and treated effectively and reduces its impact on quality of life and on the social economy.

Currently, one of the early screening methods for depression is to determine the presence or absence of depression by using the Depression Self-Rating Scale (DSRS). However, there is no self-assessment scale for depression designed specifically for patients with OSAHS. Although several self-rating depression scales have been shown to have good reliability and validity in OSAHS patients (13–16), these scales still do not accurately predict the risk of depression in OSAHS patients. Clinical predictive modeling was introduced to solve this problem. A clinical prediction model is a mathematical formula that estimates the probability that a particular individual is currently suffering from a disease or will experience a specific outcome. In this study, a prediction model was used to estimate the likelihood of depression in OSAHS patients, in order to assess their risk of depression more accurately and take appropriate interventions.

However, traditional statistical methods are only suitable for solving simple linear problems rather than for dealing with complex nonlinear relationships. Secondly, traditional models lack adaptive learning capabilities and require manual selection and extraction of variables. This process requires specialized knowledge and experience and relies on prior knowledge or specific rules to build and adapt. Moreover, the traditional model can only deal with small-scale data sets, and the processing effect on high-dimensional data is poor (17).

Therefore, introducing machine learning (ML), a powerful and intelligent tool, can help address these problems. ML models can adaptively learn and adjust from data without manually specified model parameters or rules. ML models also have good generalization ability: they can effectively generalize the patterns learned from the training data to new data. In addition, ML models can handle large-scale data, automatically extract features, and build models from the data. Finally, ML models can also offer high interpretability, helping researchers understand the behavior of the model and its decision-making process through visualization and explanation (18, 19).

At present, ML has been widely applied to the construction of depression risk prediction models. For example, Dai Su et al. used ML algorithms to construct a risk prediction model for depression in older Chinese adults (20). Fang Xia et al. developed a prediction model for depression caused by heavy metals in older people using the ML method based on the National Health and Nutrition Examination Survey (NHANES) database (21). These results show that ML has great potential for improving the accuracy of depression prediction, reducing error, and processing large volumes of data.

After a comprehensive literature search, we found that most previous studies focused on exploring the correlation between OSAHS and depression. At the same time, there are a large number of predictive studies of patients with OSAHS (9, 22–26). However, no studies have been found that predict whether patients with OSAHS will develop depression, whether using traditional statistical methods (such as logistic regression) or ML methods (such as random forest, SVM, etc.) to construct depression risk prediction models for OSAHS patients. Therefore, this study selected a large data sample from the NHANES database and screened for influencing factors associated with depression in order to construct, using ML methods, a risk prediction model that can predict whether OSAHS patients will have depression.

Materials and methods

Data and sample

Description of National Health and Nutrition Examination Survey data

The data used in this study come from the NHANES database published by the Centers for Disease Control and Prevention (CDC). NHANES, a population-based cross-sectional survey, aims to collect information about the diet, nutrition, health, and health behavior of American adults and children (5). A representative sample of households across the United States is selected using multistage stratified random sampling. Since 1999, the NHANES program has surveyed a nationally representative sample in two-year cycles. Each year, NHANES investigators conduct home visits and in-person interviews with a nationally representative sample of about 5,000 people of all ages. They collect data on basic information, family structure, health status, and eating habits of the respondents. After the face-to-face survey, participants are invited to a temporary examination center for various physical measurements, physical function tests, and laboratory tests. Finally, the collected data are collated, coded, and anonymized before being stored in the NHANES database. The survey was approved by the Research Ethics Review Board of the National Center for Health Statistics (NCHS). Each participant was asked to sign a consent form covering all questionnaires and examinations. Participants younger than 18 years of age were required to complete data collection with informed consent from their parents or guardians (27, 28). In this study, the OSAHS population was selected based on participants’ self-report on the question “How often do you snort/stop breathing?” in the sleep questionnaire. Therefore, we excluded data cycles that did not include this question in the sleep questionnaire and finally selected data from the three 2-year cycles that had this question (2005–2006, 2007–2008, and 2015–2016). These data were used to construct a prediction model for depression in the OSAHS population. All data were downloaded from the official NHANES website.¹

Outcome variable

The 9-item Patient Health Questionnaire (PHQ-9) was used in this study to assess depression in patients with OSAHS. The questionnaire uses a four-point Likert scale, with options for each item including 0 (not at all), 1 (several days), 2 (more than half the days), and 3 (nearly every day). Each item is scored from 0 to 3, and the total score ranges from 0 to 27 (29). Patients with a PHQ-9 total score ≥ 5 were considered to have depression according to study criteria (30). It is worth noting that the ultimate purpose of this study is to estimate the probability of depression in OSAHS patients by selecting the best prediction model and constructing a nomogram based on the relevant influencing factors. The application of this nomogram in sleep testing centers or sleep clinics can help sleep technicians and medical staff quickly and easily identify whether OSAHS patients have depression and make necessary referrals. Therefore, patients were divided into only two groups, with and without depression. However, Kroenke noted that clinical significance was most often seen in moderate to severe cases and suggested concomitant antidepressants to improve sleep (31). Therefore, for practical application in psychiatric clinical practice, future research should grade the severity of depression in detail and construct prediction models with a multi-category dependent variable. Such studies are expected to improve the accuracy and predictive power of the model and thus better assist psychiatrists in clinical work.
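The scoring rule described above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' code: the function names are hypothetical, and the only values taken from the text are the nine items scored 0 to 3 and the study cutoff of a total score ≥ 5.

```python
def phq9_total(item_scores):
    """Sum the nine PHQ-9 item scores; each item must be 0, 1, 2, or 3."""
    if len(item_scores) != 9:
        raise ValueError("the PHQ-9 has exactly 9 items")
    if any(score not in (0, 1, 2, 3) for score in item_scores):
        raise ValueError("each item must be scored 0, 1, 2, or 3")
    return sum(item_scores)


def has_depression(item_scores, cutoff=5):
    """Binary outcome used in this study: total score >= cutoff (5 by default)."""
    return phq9_total(item_scores) >= cutoff


# Hypothetical respondent, for illustration only.
example = [1, 1, 1, 1, 1, 0, 0, 0, 0]
print(phq9_total(example), has_depression(example))
```

Keeping the cutoff as a parameter makes it straightforward to explore the graded severity categories that the text recommends for future work.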

Predictor variables

In this study, we categorized the OSAHS population based on participants’ responses to the question “How often do you snort/stop breathing?” on the sleep questionnaire. Responses of 0 (never) were taken to indicate the non-OSAHS population, while responses of 1 (rarely, 1–2 nights per week), 2 (occasionally, 3–4 nights per week), and 3 (often, five or more nights per week) were defined as indicating the OSAHS population.

Data on demographic characteristics of NHANES from 2005–2006, 2007–2008, and 2015–2016 were selected for this study. Data on age, gender, race, education level, marital status, ratio of family income to poverty (PIR), body mass index (BMI), and sleep hours were included. Of these, we selected adults aged 18 years and older for the study. For race, we categorized them into five categories: Mexican American, Other Hispanic, Non-Hispanic White, Non-Hispanic Black, and Other Race. Education level was categorized into five groups: Less than 9th Grade, 9th–11th Grade, High School Grad/GED or Equivalent, Some College or AA degree, and College Graduate or above. Marital status included married, widowed, divorced, separated, unmarried, and living with a partner. Income status was divided into two categories by using PIR: low-income (PIR ≤ 1.3) and non-low-income (PIR > 1.3). BMI was categorized into four types: underweight (BMI < 18.5), normal weight (18.5–24.99), overweight (25.0–29.99), and obese (BMI ≥ 30.0). Sleep hours were also categorized into three categories: short sleep hours (<7 h), normal sleep hours (7–9 h), and long sleep hours (>9 h).
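The cut points given above for PIR, BMI, and sleep hours can be expressed as simple binning functions. The sketch below is illustrative only: the function names and category labels are hypothetical, and values falling between the printed bounds (for example, a BMI between 24.99 and 25.0) are assigned by assumption to the higher category.

```python
def income_group(pir):
    """Low-income if PIR <= 1.3, otherwise non-low-income."""
    return "low-income" if pir <= 1.3 else "non-low-income"


def bmi_group(bmi):
    """Four BMI categories with cut points at 18.5, 25.0, and 30.0."""
    if bmi < 18.5:
        return "underweight"
    if bmi < 25.0:
        return "normal weight"
    if bmi < 30.0:
        return "overweight"
    return "obese"


def sleep_group(hours):
    """Short (<7 h), normal (7-9 h), or long (>9 h) sleep."""
    if hours < 7:
        return "short"
    if hours <= 9:
        return "normal"
    return "long"


# Hypothetical participant, for illustration only.
print(income_group(1.1), bmi_group(31.2), sleep_group(6))
```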

Lifestyle variables included smoking and alcohol drinking. Smoking status was determined based on respondents’ self-reports on two questions: “Have you ever smoked more than 100 cigarettes in your lifetime?” and “Do you currently smoke?” Smoking status was categorized into three categories: never smoker (never smoked more than 100 cigarettes in their lifetime and not currently smoking), former smoker (smoked more than 100 cigarettes in their lifetime but not currently smoking), and current smoker (smoked more than 100 cigarettes in their lifetime and currently smoking every day or on some days).

Alcohol drinking was determined by respondents’ self-report on the following question: “In the past 12 months, how often did you drink any type of alcoholic beverage (measured in days)?” Based on their responses, we defined three types of drinking: never drinking (0 days), low drinking (1–36 days), and heavy drinking (≥37 days).

Health information variables included general health condition, hypertension, diabetes, asthma, coronary heart disease, and OSAHS severity. General health condition was determined by self-report of respondents on the following questions: “I have some general questions about your health.” and “Would you say your health in general is?” Of these, “excellent,” “very good,” and “good” responses were redefined as “good general health condition.” “Fair” is defined as “General health.” “Poor” is defined as “bad general health condition.”

The presence of the four conditions (hypertension, diabetes, asthma, and coronary heart disease) was determined using a “yes” or “no” response. For diabetes, it is essential to note that a response of “Border” was also defined as no diabetes.

OSAHS severity was determined by the subject’s response to the question “How often do you snort/stop breathing?” Specifically, “1–2 nights per week” was defined as “mild,” “3–4 nights per week” was defined as “moderate,” and “5 or more nights per week” was defined as “severe.” Table 1 provides details of the assignment of each influence factor.
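The lifestyle and severity definitions above amount to small lookup rules, sketched below. The function names, argument names, and the response strings used for current smoking ("every day", "some days", "not at all") are assumptions made for illustration; the thresholds and labels are those stated in the text.

```python
def smoking_status(ever_100_cigarettes, current_smoking):
    """Classify smoking from the two self-report items.

    `current_smoking` is assumed to be "every day", "some days", or "not at all".
    """
    if not ever_100_cigarettes:
        return "never smoker"
    if current_smoking in ("every day", "some days"):
        return "current smoker"
    return "former smoker"


def drinking_status(days_past_12_months):
    """Never (0 days), low (1-36 days), or heavy (>= 37 days) drinking."""
    if days_past_12_months == 0:
        return "never drinking"
    if days_past_12_months <= 36:
        return "low drinking"
    return "heavy drinking"


def osahs_severity(response_code):
    """Map the sleep-questionnaire code to severity; 0 (never) means non-OSAHS."""
    mapping = {0: None, 1: "mild", 2: "moderate", 3: "severe"}
    if response_code not in mapping:
        raise ValueError("response code must be 0, 1, 2, or 3")
    return mapping[response_code]


# Hypothetical participant, for illustration only.
print(smoking_status(True, "not at all"), drinking_status(12), osahs_severity(3))
```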

Table 1. Predictor variable assignment.

Statistical analysis

Data description

Stata 17.0 software was applied to extract and clean the NHANES data, and SPSS 25.0 and R Studio software were used for statistical analysis and description. Measurements that conformed to a normal distribution were expressed as M ± SD (mean ± standard deviation), and comparisons between groups were made using the independent samples t-test. Measurements that did not conform to a normal distribution were expressed as M (P25, P75), that is, the median with the 25th and 75th percentiles, and comparisons between groups were made using the Mann–Whitney U test. Count data were expressed as n (%), and comparisons between groups were made using the χ² test, with p < 0.05 being considered statistically significant.

ML models

Before ML model training and evaluation, we fixed the random seed with the set.seed(123) function in R Studio so that the partition would be reproducible, and then split the dataset into training and validation sets at a 7:3 ratio. The training set was used for training the models, while the validation set was used to verify the performance and generalization ability of each model.
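A seeded 7:3 partition of this kind can be sketched as follows. This is an illustration in Python rather than the R code used in the study, so the same seed value does not reproduce the study's actual partition. The function name is hypothetical, and rounding the training-set size up is an assumption chosen because it matches the reported sizes (1,718 and 735 out of 2,453).

```python
import random


def split_train_validation(records, seed=123):
    """Shuffle a copy of `records` with a fixed seed and split it about 7:3."""
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    # Integer form of ceil(0.7 * n); rounding up is an assumption (see lead-in).
    n_train = (7 * len(shuffled) + 9) // 10
    return shuffled[:n_train], shuffled[n_train:]


# Stand-in participant IDs; the real data are not reproduced here.
train, validation = split_train_validation(range(2453))
print(len(train), len(validation))
```

Fixing the seed matters because the baseline comparison between the two sets and every downstream model depend on which participants land in each partition.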

Logistic regression model

In R Studio software, the functions of the mlr package were used for univariate logistic regression, followed by a multicollinearity diagnosis in SPSS software; variables with statistically significant differences were then included in the multivariate logistic regression analysis. The final selected variables were used in R Studio software to draw the nomogram and to establish the logistic regression prediction model, with the plotLearnerPrediction() function used to visualize it.

LASSO model

In this study, we used the mlr package and the glmnet package in R Studio software to train and fit the lasso model. We performed 10-fold cross-validation with the cv.glmnet() function to select the best lambda value. Then, we retrained the lasso model based on the best lambda value and used the coef() function to obtain the coefficients of the model. Given that there may be some collinearity and correlation between the independent variables, we performed dimensionality reduction on the independent variables to avoid overfitting and to screen out the influencing factors related to depression in OSAHS. For this dimension reduction, the lasso method was applied to all the independent variables included in the model, as shown in Figure 1. In this process, the coefficients of the independent variables initially entered into the model are gradually shrunk until some of them are compressed to exactly 0, which helps avoid overfitting.
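For reference, the shrinkage described above corresponds to the standard L1-penalized logistic regression objective, written here in the form documented for glmnet with a binary outcome. The notation is introduced for illustration and does not appear in the original text: $y_i \in \{0, 1\}$ is the depression indicator, $x_i$ the vector of $p$ predictors for participant $i$, $N$ the number of participants, and $\lambda$ the penalty chosen by cross-validation.

```latex
\min_{\beta_0,\,\beta}\;
-\frac{1}{N}\sum_{i=1}^{N}
\Big[\, y_i\,(\beta_0 + x_i^{\top}\beta)
      - \log\!\big(1 + e^{\beta_0 + x_i^{\top}\beta}\big) \Big]
\;+\; \lambda \sum_{j=1}^{p} \lvert \beta_j \rvert
```

As $\lambda$ increases, the absolute-value penalty drives more coefficients $\beta_j$ to exactly zero, which is why the lasso performs variable selection and shrinkage at the same time.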

Figure 1. Lasso screening variable dynamic plot.

Random forest model

The randomForest package in R Studio was used to train the random forest model. Based on MeanDecreaseGini, we ranked the 16 independent variables and used the random forest feature importance assessment algorithm to derive the importance of each influencing factor (32), selecting the important variables with a high impact on depression in OSAHS patients. The optimal number of features of the random forest model was chosen according to the out-of-bag error rate, both to better understand the relationship between variables and to improve the model’s prediction performance. Among the model parameters, two are key: the number of candidate predictors sampled at each split (mtry) and the number of trees (ntree). mtry is the number of randomly selected predictors considered when building each tree, usually the square root of the total number of predictors, and ntree is the number of trees built in the model. The out-of-bag error rate was lowest when mtry = 5. When ntree = 500, the error was basically stable; the dynamic relationship between the prediction error of the random forest and the number of trees is shown in Figure 2. Therefore, the parameters of the optimal model were mtry = 5 and ntree = 500. The final selected variables were included in the multivariate logistic regression analysis, and the random forest model was finally constructed.
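Two quantities mentioned above can be made concrete with a short sketch: the square-root rule for mtry and the Gini impurity decrease that underlies MeanDecreaseGini. The functions below use the textbook definitions and hypothetical names; they are not the randomForest package's internal implementation, which accumulates such decreases over all splits and trees.

```python
import math
from collections import Counter


def default_mtry(n_predictors):
    """Square-root rule for classification: floor(sqrt(p)), at least 1."""
    return max(1, math.isqrt(n_predictors))


def gini_impurity(labels):
    """Gini impurity 1 - sum(p_k^2) of a collection of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((count / n) ** 2 for count in Counter(labels).values())


def gini_decrease(parent, left, right):
    """Impurity of the parent node minus the size-weighted impurity of its children."""
    n = len(parent)
    weighted_children = (len(left) / n) * gini_impurity(left) \
        + (len(right) / n) * gini_impurity(right)
    return gini_impurity(parent) - weighted_children


# With 16 predictors the square-root rule gives 4; the study's tuned value was 5.
print(default_mtry(16))
```

The square-root rule is only a starting point, which is consistent with the text choosing mtry from the out-of-bag error rate instead.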

Figure 2. Dynamic relationship between prediction error of random forest and the number of random trees.

Model comparison

The receiver operating characteristic (ROC) curve, area under the curve (AUC), specificity, sensitivity, Youden index, and DCA were used to evaluate and compare the performance of the prediction models. Specifically, this study used the pROC package in R Studio software to draw the ROC curve of each prediction model and to calculate the AUC, specificity, sensitivity, and Youden index. Subsequently, the ROC curves of the three models were compared using the DeLong test to judge whether they were significantly different. Finally, the decision_curve function of the rmda package was used to draw the DCA curves of the different models and compare them.
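The evaluation metrics listed above have simple definitions that can be sketched directly. The code below is a minimal Python illustration with hypothetical function names and toy data, not the pROC implementation. AUC is computed as the probability that a randomly chosen depressed patient receives a higher predicted probability than a randomly chosen non-depressed patient, with ties counted as one half.

```python
def sensitivity_specificity_youden(y_true, y_prob, threshold):
    """Sensitivity, specificity, and Youden index at one probability threshold."""
    tp = fp = tn = fn = 0
    for label, prob in zip(y_true, y_prob):
        predicted = 1 if prob >= threshold else 0
        if label == 1 and predicted == 1:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted == 1:
            fp += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity, sensitivity + specificity - 1


def auc(y_true, y_prob):
    """AUC as the fraction of positive/negative pairs ranked correctly."""
    positives = [p for y, p in zip(y_true, y_prob) if y == 1]
    negatives = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not positives or not negatives:
        raise ValueError("both classes are required to compute AUC")
    wins = sum(1.0 if pp > pn else 0.5 if pp == pn else 0.0
               for pp in positives for pn in negatives)
    return wins / (len(positives) * len(negatives))


# Toy labels and predicted probabilities, for illustration only.
labels = [1, 1, 0, 0]
probabilities = [0.9, 0.4, 0.6, 0.1]
print(sensitivity_specificity_youden(labels, probabilities, 0.5), auc(labels, probabilities))
```

Sensitivity, specificity, and the Youden index depend on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds, which is one reason it is commonly used as the headline metric.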

Results

Patient screening and statistical analysis process

After strict data cleaning, we selected three NHANES data cycles: 2005–2006, 2007–2008, and 2015–2016. In total, 2,453 patients who met the inclusion criteria were included. All eligible patients were randomly divided into a training set and a validation set at a ratio of 7:3, with 1,718 patients in the training set and 735 in the validation set. This partitioning is consistent with the approach Yalong Zhang et al. adopted in their study of ML prediction models (33). Jianping Lv et al. partitioned their dataset in the same way when building an ML model to predict the risk of bullying victimization among adolescents (34). The detailed screening process is shown in Figure 3.

Figure 3. Researchers and statistical flowchart.

Comparison of baseline information

Through the statistical analysis, this study found no significant difference in baseline characteristics between the training and validation sets (p > 0.05). This shows no deviation between the two groups caused by the uneven distribution of the dependent variable, as shown in Table 2. In addition, we divided the training set into non-depressed and depressed groups and compared baseline information between the two groups. Specific comparative results can be found in Table 3.

Table 2. Comparison of baseline data between the two groups.

Table 3. Comparison of baseline data between depression group and non-depression group in training data.

Model prediction performance for depression in patients with OSAHS

Logistic regression model

In the training set, we divided the 1,718 OSAHS patients into depressed and non-depressed groups. Univariate analysis identified 12 statistically significant factors (p < 0.05): gender, age, education level, marital status, PIR, general health condition, BMI, smoking, hypertension, diabetes, asthma, and OSAHS severity. Subsequently, the variables that were statistically significant in the univariate analysis were included in the multicollinearity diagnosis. According to the analysis results, all variance inflation factors (VIF) were less than 5, and all tolerance values were greater than 0.1. This indicates that there was no multicollinearity between covariates, so all of these variables could be entered into the logistic model as predictors. The results of the multicollinearity diagnosis are shown in Table 4. Variables that were statistically significant in the univariate analysis and showed no multicollinearity were entered into the binary logistic regression analysis. Forward stepwise selection based on the likelihood ratio test was used to remove confounding factors and determine the variables retained in the final model. The results showed that gender, general health condition, BMI, smoking, OSAHS severity, age, education level, PIR, and asthma were significant influencing factors for depression in OSAHS patients. Among these influencing factors, gender, general health condition, BMI, smoking, and OSAHS severity were identified as independent risk factors for depression in OSAHS patients. The factors associated with depression in univariate and multivariate analyses are shown in Table 5.
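The two collinearity diagnostics used above are directly related. Using notation introduced here for illustration, let $R_j^2$ be the coefficient of determination obtained by regressing predictor $j$ on all the other predictors. Then, under the standard definitions:

```latex
\mathrm{Tolerance}_j = 1 - R_j^{2},
\qquad
\mathrm{VIF}_j = \frac{1}{1 - R_j^{2}} = \frac{1}{\mathrm{Tolerance}_j}
```

A VIF below 5 therefore corresponds to a tolerance above 0.2, so any predictor satisfying the VIF criterion also satisfies the tolerance criterion of 0.1 quoted in the text.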

Table 4. Multi-collinearity analysis results of predictive variables of depression of OSAHS patients.

Table 5. Factors associated with depression in univariable and multivariable analyses in the training set.

Based on the factors included in the above regression analysis and the corresponding regression coefficient of each factor, a risk prediction model for depression in OSAHS patients was constructed, and a nomogram was drawn. To use the nomogram, the score corresponding to each influencing factor is read off and the scores are summed; the predicted probability corresponding to the total score is the probability of depression for that OSAHS patient. In the nomogram, "Points" gives the score for each individual variable, "Total points" gives the sum of these scores, and "Risk of depression" gives the probability of depression corresponding to the total points, as shown in Figure 4. The nomogram assignment method of relevant factors is shown in Table 6.
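A nomogram is a graphical rendering of the fitted logistic model: the point scale is a rescaling of each term in the linear predictor, and the probability axis applies the logistic function to their sum. The sketch below shows that underlying calculation. The intercept, coefficient values, and variable names are hypothetical placeholders and are not the fitted values reported in Table 5.

```python
import math


def predicted_probability(intercept, coefficients, features):
    """Logistic model: p = 1 / (1 + exp(-(b0 + sum_j b_j * x_j)))."""
    linear_predictor = intercept + sum(
        coefficients[name] * value for name, value in features.items()
    )
    return 1.0 / (1.0 + math.exp(-linear_predictor))


# Hypothetical coefficients for two dummy-coded predictors (illustration only).
coefficients = {"female": 0.5, "severe_osahs": 0.8}
patient = {"female": 1, "severe_osahs": 1}
print(round(predicted_probability(-1.0, coefficients, patient), 3))
```

Because the mapping from total points to probability is monotonic, a higher total score on the nomogram always corresponds to a higher predicted risk.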

Figure 4. Nomogram prediction model for logistic risk of depression in OSAHS patients. PIR, ratio of family income to poverty; BMI, body mass index.

Table 6. Nomogram of relevant factors in the assignment method.

LASSO model

Depression was used as the dependent variable, with a total of 16 independent variables: gender, age, race, education level, marital status, PIR, general health condition, sleep hours, BMI, alcohol drinking, smoking, hypertension, diabetes, asthma, coronary heart disease, and OSAHS severity. As shown in Figure 5, the optimal model was obtained by selecting the λ value with the minimum error (0.005586744) through ten-fold cross-validation. On this basis, the lasso retained 14 independent variables associated with depression in OSAHS: gender, age, education level, marital status, PIR, general health condition, sleep hours, BMI, alcohol drinking, smoking, hypertension, asthma, coronary heart disease, and OSAHS severity. We then conducted a binary logistic regression analysis on the selected variables. The analysis found that marital status, PIR, general health condition, sleep hours, and smoking were independent influencing factors of depression in OSAHS patients (p < 0.05).

Figure 5. Lasso model ten-fold cross-validation method to screen predictors process diagram.

Random forest model

According to the results, age, general health condition, race, education level, marital status, OSAHS severity, BMI, smoking, and sleep hours were the top nine critical factors for predicting depression in OSAHS patients. Figure 6 shows the importance ranking of these indicators. Subsequently, these nine variables were entered into a binary logistic regression analysis, and the final results showed that marital status, general health condition, and smoking were independent influencing factors for depression in OSAHS patients (p < 0.05).

Figure 6. Variable importance plot. PIR, ratio of family income to poverty; BMI, body mass index.

Comparison of model prediction performance

Comparison of ROC curve prediction performance

To compare the performance of the three models in predicting depression in OSAHS patients, we used the validation set data for evaluation. The results show that, compared with the lasso model, the logistic regression model has lower sensitivity but higher specificity and AUC. This means that the logistic regression model performs better in accurately identifying non-depressed patients, while the lasso model is more sensitive in capturing depressed patients. AUC was used as the preferred index to judge the models’ prediction performance. Therefore, the prediction performance of the logistic regression model was better than that of the lasso and random forest models. The comparative results are shown in Table 7. The ROC curves are shown in Figure 7.

Table 7. Comparison of prediction performance of three kinds of models.

Figure 7. Comparison of ROC curve prediction performance of three prediction models for OSAHS patients with depression (The x-axis indicates the false positive rate, and the y-axis represents sensitivity.).

Comparison of DCA prediction performance

Clinical decision curve analysis of the prediction models found that when the threshold probability was in the range of 0.19 to 0.82, the prediction models had an excellent net benefit in predicting depression in OSAHS patients. The decision curve analysis results show that the net benefits of the three models were similar for threshold probabilities ranging from 0.25 to 0.45. When the threshold probability was in the ranges 0.19–0.25 and 0.45–0.82, the net benefit of the logistic regression model was the largest. Therefore, the logistic regression model showed better clinical utility than the random forest and lasso models, as shown in Figure 8.
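The net benefit plotted in a decision curve has a compact standard definition: at threshold probability pt, net benefit equals the true-positive rate per patient minus the false-positive rate per patient weighted by pt / (1 − pt). The sketch below uses that definition with hypothetical function names and toy data; it illustrates the quantity being compared and is not the rmda implementation.

```python
def net_benefit(y_true, y_prob, threshold):
    """Net benefit of treating patients whose predicted risk is >= threshold."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(y_true)
    true_pos = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    false_pos = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    return true_pos / n - (false_pos / n) * threshold / (1.0 - threshold)


def net_benefit_treat_all(y_true, threshold):
    """Reference strategy: treat every patient regardless of predicted risk."""
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must lie strictly between 0 and 1")
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)


# Toy labels and predicted probabilities, for illustration only.
labels = [1, 1, 0, 0]
probabilities = [0.9, 0.4, 0.6, 0.1]
for pt in (0.2, 0.5):
    print(pt, net_benefit(labels, probabilities, pt), net_benefit_treat_all(labels, pt))
```

A model is clinically useful at a given threshold only if its net benefit exceeds both the treat-all curve and the treat-none line at zero, which is the comparison a decision curve displays across the threshold range.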

Figure 8. Decision curve analysis (DCA) comparing the predictive performance of the three prediction models (The x-axis indicates the high risk threshold, and the y-axis represents net benefit.).

Considering the above indicators, the logistic regression model has better predictive performance than the lasso and random forest models in predicting depression in OSAHS patients. The analysis also identified the related influencing factors: gender, general health condition, BMI, smoking, OSAHS severity, age, education level, PIR, and asthma.

Clinical utility

Figure 9 shows an example of applying the nomogram to a patient. The patient is a 35-year-old woman with a bachelor’s degree and low income who has a general (fair) health condition, is obese, is a former smoker who has since quit, denies a history of asthma, and suffers from severe OSAHS. According to the nomogram, her total score of 163.5 points corresponds to a probability of depression of about 57%.

Figure 9. Example of nomogram. PIR, ratio of family income to poverty; BMI, body mass index.

Discussion

The main strength of this study is the use of the large NHANES sample database and of ML models to predict depression and identify its potential influencing factors in OSAHS patients. In this study, we constructed and trained models of depression in US adults with OSAHS. We considered variables such as demographic characteristics, lifestyle, and health factors and weighted them during the construction of the model. The selected model outputs the probability of depression for adults with OSAHS in the United States. These results can make the mental health care system more intelligent and have a positive impact. Sleep-related healthcare providers can use these ML algorithms to identify OSAHS patients who are potentially at risk for depression, which in turn allows depression to be detected early and supports early intervention. The research also further increases OSAHS patients’ chances of accessing mental health services. The following sections discuss the incidence of depression in American adults with OSAHS, the influencing factors, and the predictive power of the risk prediction models, as well as the practical application of these findings.

Depression among OSAHS adults in the US

The present study was based on three cycles of NHANES data (2005–2006, 2007–2008, and 2015–2016) and included variables covering demographic characteristics, lifestyle, and health information. A total of 2,453 OSAHS patients were screened, including 1,671 non-depressed patients and 782 depressed patients. The incidence of depression was 31.9%, slightly lower than the findings of Houda Gharsalli and Siddharth Bajpai et al. (4, 13). The difference may be due to the different inclusion criteria used for the OSAHS population. The studies by Houda Gharsalli and Siddharth Bajpai et al. used specialized diagnostic equipment, such as polysomnography (PSG), with an apnea-hypopnea index (AHI) ≥5 as the diagnostic criterion for OSAHS. In contrast, inclusion in this study was based only on patients’ self-reported answers to “How often do you snore/stop breathing?” Temporal and geographic differences may also contribute to the lower rate of depression in this OSAHS population compared with other studies.
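
The reported rate follows directly from the counts given above. A short Python check, using only the numbers stated in the text:

```python
# Counts reported in the text.
n_depressed = 782
n_not_depressed = 1671
n_total = n_depressed + n_not_depressed  # 2,453 screened OSAHS patients

# Share of screened patients classified as depressed.
prevalence = n_depressed / n_total
print(f"{n_total} patients, {prevalence:.1%} with depression")
```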

Factors influencing the development of depression in OSAHS adults in the US

This study used three ML algorithms, logistic regression, lasso, and random forest, to construct predictive models of depression in the US OSAHS population. The results show that the logistic regression model is better than the random forest and lasso models in terms of specificity, Youden index, and AUC, although its sensitivity is lower. Therefore, according to the results of this study, the logistic regression model performs best overall among the three prediction models. The predictors of depression in the OSAHS population retained in the model are gender, age, education level, PIR, general health condition, BMI, smoking, asthma, and OSAHS severity. The model results are visually presented in Figure 4.
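
For readers less familiar with the comparison metrics, the sketch below shows how sensitivity, specificity, the Youden index, and the AUC are computed from true labels and predicted probabilities. It is a minimal pure-Python illustration on made-up toy data; the labels, scores, and the 0.5 cut-off are assumptions for the example and are unrelated to the study data.

```python
def confusion_counts(y_true, y_prob, threshold=0.5):
    """Count true/false positives and negatives at a probability cut-off."""
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, y_prob):
        predicted = 1 if p >= threshold else 0
        if predicted == 1 and y == 1:
            tp += 1
        elif predicted == 1 and y == 0:
            fp += 1
        elif predicted == 0 and y == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn

def sensitivity_specificity_youden(y_true, y_prob, threshold=0.5):
    """Sensitivity = TP/(TP+FN); specificity = TN/(TN+FP); Youden J = sens + spec - 1."""
    tp, fp, tn, fn = confusion_counts(y_true, y_prob, threshold)
    sens = tp / (tp + fn)
    spec = tn / (tn + fp)
    return sens, spec, sens + spec - 1

def auc(y_true, y_prob):
    """Probability that a random positive scores above a random negative (ties count half)."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in pos for b in neg)
    return wins / (len(pos) * len(neg))

# Toy data (hypothetical): 1 = depressed, 0 = not depressed.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_prob = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1, 0.8]

sens, spec, youden = sensitivity_specificity_youden(y_true, y_prob)
print(sens, spec, youden, auc(y_true, y_prob))
```

Sensitivity, specificity, and the Youden index depend on the chosen cut-off, whereas the AUC summarizes discrimination across all cut-offs, which is why a model can lead on AUC and specificity while trailing on sensitivity.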

In terms of sociodemographic characteristics, according to the findings of Alimohamad Asghari, Magali Saint Martin, and Yaozhang Dai, as well as Rachel H. Salk, women are more prone to depression than men (35–37). This difference exists not only in China but also in most countries and cultures worldwide. In general, women are about twice as likely as men to suffer from depression (OR = 1.95), which may be related to frequent hormonal disturbances due to genetic and physiological factors. Some studies have shown that women are more likely to experience depression during menopause (38, 39). According to the results of this study, obese individuals are more likely to suffer from depression than normal-weight or underweight individuals, which is consistent with the findings of Ashley Wendell Kranjac and Tuula H. Heiskanen (40, 41). According to Ashley Wendell Kranjac et al., after taking into account the combined effects of gender and BMI, obese women were 43% more likely to experience depression than normal-weight women. Tuula H. Heiskanen’s 6-year prospective study of outpatients found that subjects with significant weight gain were more likely to develop major depression (41). This may be because obese people often suffer from associated chronic inflammation. Immune cells in adipose tissue produce signaling proteins related to inflammation, and some of these proteins, such as cytokines, are strongly related to mental health problems and have even been used as biomarkers for depression (42).

Compared to older people, younger people are at higher risk of depression. This is consistent with the findings of Stefanie Wagner (43), whose study divided hospitalized patients with depression into four age groups. The results showed that young patients aged 18 to 29 years were more likely to show extreme behavior, such as suicide and drug abuse. In contrast, middle-aged and older patients aged between 50 and 65 years were more likely to show milder depressive symptoms, such as decreased sexual interest. This may be related to hormonal changes during adolescence and lower mental toughness, leading to mental instability (44).

The higher the level of education, the lower the risk of depression. Early studies have pointed out that depression is associated with low levels of education (45) and that education has a significant effect on the development of depression: people who are illiterate are more likely to have more severe depression (46). A low level of education may lead to narrower social contacts and fewer avenues for problem solving when experiencing negative life events, and thus to heavier negative emotions, which can lead to depression.

The findings suggest that low-income people are more likely to suffer from depression than non-low-income people, which was confirmed in the study of Glaesmer (47). It is estimated that while around half of people with depression in high-income countries are not diagnosed or treated, in low-income and middle-income countries the proportion may be as high as 80%–90%. According to a December 2020 commentary in the journal Science, there is a causal interaction between poverty and mental illness: people with low incomes are more vulnerable to mental illness, and mental illness is in turn one of the important causes of low income (13). This may be because low-income people usually face financial difficulties in their daily lives and lack sufficient funds to meet basic needs such as food, housing, and healthcare. This financial pressure may leave them feeling helpless and anxious and make them more prone to depression.

The poorer the general health condition, the greater the risk of depression. This finding is consistent with results from the World Health Survey published by Saba Moussavi in The Lancet, which showed that depression was associated with the lowest health scores, both in isolation and in co-occurrence with other chronic diseases (48). A study by Nicolas Zdanowicz also indicated that physical health and its improvement are related to the level of depression (49), and Érica Dorigatti de Ávila reported that patients without depression have a higher level of mental health (50). One possible cause is physical: poor health is often accompanied by chronic illness, pain, or discomfort that may affect an individual’s emotional and psychological state, thereby increasing the risk of depression. In addition, limited physical function and lower quality of life may damage self-worth and self-esteem, and this psychological stress may further aggravate depression. Chronic physical illnesses and health problems may cause individuals to develop negative feelings and increase the risk of depression. Overall, poor health also increases the personal burden of life. People may need more time and money to treat their diseases, which may lead to economic stress and anxiety, thus increasing the risk of depression.

According to the study, smoking is one of the factors that predict a high risk of depression in OSAHS patients. Several studies have shown higher rates of depression among current and former smokers compared with never smokers. In a survey of the U.S. population, Luis G. Escobedo found that former smokers are more likely to develop depression, especially those who have a history of major depression (51). The cross-sectional study conducted by Tana M. Luger also noted that current smokers were more likely to develop depression than both never-smokers and former smokers (52). This may be because nicotine intake from smoking brings short-term pleasure and relaxation, whereas long-term smoking may lead to neurotransmitter disorders, thereby affecting emotional stability and increasing the risk of depression.

People with asthma are more likely to suffer from depression than people without asthma. This idea is supported by a biological linkage study by Mingdi Jiang et al., which implies that the inflammatory response may be a critical factor in regulating the common pathways of depression and asthma (53). The results of Mahima Akula’s study also confirmed the correlation between the two (54). Furthermore, a bidirectional association between asthma and depression was observed in Hyo Geun Choi’s study (55). In a clinical practice study of adolescents with asthma, 11.5% had depression (42). People with asthma may feel negative emotions such as low self-esteem, anxiety, and fear. Because asthma symptoms fluctuate, some patients may need to avoid social situations and activities, which may leave them socially isolated and affect their psychological health. In addition, patients with asthma often face physical discomfort such as dyspnea and chest tightness, which may affect their emotional and psychological state, exacerbate negative emotions, and increase the risk of depression.

As the severity of OSAHS increases, so does the risk of depression. Research by Cass Edwards et al. confirmed this result and found that as the severity of OSAHS rose, PHQ scores and the incidence of depression also gradually increased (56). This may be because oxygen supply is insufficient during apnea, causing hypoxemia. As OSAHS worsens, hypoxemia has a more serious negative impact on brain function and emotion regulation, which can lead to depression. In addition, patients with OSAHS may experience symptoms such as fatigue, lethargy, and difficulty concentrating during the day due to decreased sleep quality. These limitations in daily functioning may negatively affect an individual’s psychological state and increase the risk of depression.

Severe OSAHS can also cause sleep disturbances, which in turn can lead to a decline in social activities and work ability. This situation is further exacerbated by negative emotions such as anxiety, low self-esteem, and depression. Therefore, it can be concluded that there is a strong correlation between OSAHS and depression and that its severity is positively related to the risk of depression.

Evaluation and application of risk prediction models for depression

The results show that the logistic regression model is better than the random forest and lasso models in terms of specificity, Youden index, and AUC. To assess the clinical application value of the models, DCA was used to compare their net benefit. The logistic regression model had the greatest net benefit over the threshold probability ranges of 0.19 to 0.25 and 0.45 to 0.82, indicating good clinical applicability. Taken together, the comparison shows that the overall predictive ability of the logistic regression model is superior to that of the lasso and random forest models. It should be noted that the lasso model may ignore some relevant features when features are highly correlated. In addition, selecting an appropriate regularization parameter requires experience and methods such as cross-validation, which increases the complexity of model tuning (57). The random forest model is composed of multiple decision trees. Although feature importance can be used to understand the contribution of each feature to the model, the overall model is less interpretable than the logistic regression model. In addition, because the random forest model builds many decision trees, each with repeated feature selection, and then aggregates them, its training time is relatively long (58). In contrast, the logistic regression model is a generalized linear model, typically fitted by maximum likelihood (in practice often through iteratively reweighted least squares), and has high accuracy (59). Logistic regression models can make predictions and explore the direction and degree of influence of independent variables on the dependent variable, so they have better explanatory power (49). Logistic regression models can also be quantified and visualized by nomograms, which have outstanding advantages in auxiliary diagnosis in the medical field (60).
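
The net benefit compared in DCA can be made concrete with a short sketch. The article does not state the formula, so the code below assumes the usual decision-curve definition: at threshold probability pt, net benefit = TP/n − (FP/n) × pt/(1 − pt), with a "treat all" strategy as the reference. The toy labels and scores are hypothetical and unrelated to the study data.

```python
def net_benefit(y_true, y_prob, threshold):
    """Net benefit of treating everyone whose predicted risk is >= threshold.
    False positives are weighted by the odds of the threshold probability."""
    n = len(y_true)
    tp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 1)
    fp = sum(1 for y, p in zip(y_true, y_prob) if p >= threshold and y == 0)
    weight = threshold / (1.0 - threshold)
    return tp / n - (fp / n) * weight

def net_benefit_treat_all(y_true, threshold):
    """Reference strategy: treat every patient regardless of predicted risk."""
    prevalence = sum(y_true) / len(y_true)
    return prevalence - (1.0 - prevalence) * threshold / (1.0 - threshold)

# Toy data (hypothetical): 1 = depressed, 0 = not depressed.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_prob = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1, 0.8]

for pt in (0.25, 0.5):
    print(pt, round(net_benefit(y_true, y_prob, pt), 4),
          round(net_benefit_treat_all(y_true, pt), 4))
```

A decision curve plots these values over a range of thresholds for each model. A model is preferred at thresholds where its curve lies above those of the other models and above the "treat all" and "treat none" (net benefit 0) references.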

Before applying a risk prediction model for depression in the OSAHS population, it is necessary to select the most appropriate model and tune its parameters so as to improve the accuracy and sensitivity of identifying OSAHS patients at high risk for depression. Subsequently, predictive models need to be translated into forms applicable to the community and clinic, such as nomograms or mobile applications that allow physicians and sleep technologists to calculate the probability of depression easily and quickly. This will help to identify at-risk OSAHS patients in advance and support the prevention of depression.

In this study, ML models, especially the logistic regression model, demonstrated excellent depression prediction and recognition capabilities in a large dataset. In contrast to traditional statistical methods, ML methods do not require the researcher to specify the relevant variables subjectively but can automatically identify the variables associated with the outcome in the dataset. This is one of the advantages of ML in building clinical prediction models. Future research could build and compare ML models in longitudinal studies to obtain basic information such as the prevalence of depression; the longitudinal study of the older adult population in China by Dai Su et al. is a good example (20). In addition, multiple ML models could be combined into a hybrid model to verify whether it outperforms a traditional single ML model in predictive performance (61). Such work would help provide appropriate information and services to mental health professionals and health care providers at all levels.

In a clinical sense, physicians can assess whether a patient is at risk for depression based on gender, general health condition, BMI, smoking, OSAHS severity, age, education level, PIR, and asthma. Once a patient is identified as being at risk for depression, interventions can be implemented, including medication, psychotherapy, and behavioral changes. Early intervention and treatment can help patients reduce the symptoms of depression, improve their sleep quality and quality of life, and improve the effect of OSAHS treatment.

When using the logistic regression model to predict depression in OSAHS patients, this study suggests that specificity, sensitivity, and the Youden index should be considered together, and that the trade-off between specificity and sensitivity should be weighed according to the specific situation. The usefulness of these indicators also needs to be demonstrated in clinical practice. To ensure the applicability and reliability of the model, it is recommended to continue collecting larger and more diverse data and to use these data to validate and replicate the findings. By expanding the dataset’s scope, the model’s predictive performance in different populations and contexts can be more fully evaluated. Such efforts can help improve the accuracy and generalization ability of the model and provide a more reliable basis for future clinical practice. When the model is applied in clinical practice, its output needs to be evaluated in combination with clinical experience and individual patient differences. This involves interpreting the model predictions and considering the patient’s overall situation. At the same time, the model should be continuously updated and optimized to improve its predictive performance and clinical utility. Finally, the specific mechanism by which each influencing factor affects the occurrence of depression can be explored in more depth, including detailed studies of biological, psychosocial, and other factors. How to prevent and treat depression by intervening in these factors can also be studied.
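
One way to weigh sensitivity against specificity "according to the specific situation" is to choose the probability cut-off explicitly. The sketch below scans candidate cut-offs and returns the one with the highest Youden index among those that meet a minimum sensitivity. The function names, the `min_sensitivity` parameter, and the toy data are assumptions made for illustration; the study does not describe this procedure.

```python
def sens_spec(y_true, y_prob, threshold):
    """Sensitivity and specificity when risk >= threshold is called positive."""
    tp = sum(1 for y, p in zip(y_true, y_prob) if y == 1 and p >= threshold)
    fn = sum(1 for y, p in zip(y_true, y_prob) if y == 1 and p < threshold)
    tn = sum(1 for y, p in zip(y_true, y_prob) if y == 0 and p < threshold)
    fp = sum(1 for y, p in zip(y_true, y_prob) if y == 0 and p >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

def best_threshold(y_true, y_prob, min_sensitivity=0.0):
    """Use each observed score as a cut-off and return (threshold, J, sens, spec)
    for the highest Youden index J among cut-offs whose sensitivity is at least
    min_sensitivity. Returns None if no cut-off qualifies; ties keep the lowest cut-off."""
    best = None
    for t in sorted(set(y_prob)):
        sens, spec = sens_spec(y_true, y_prob, t)
        if sens < min_sensitivity:
            continue
        j = sens + spec - 1
        if best is None or j > best[1]:
            best = (t, j, sens, spec)
    return best

# Toy data (hypothetical): 1 = depressed, 0 = not depressed.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_prob = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1, 0.8]

print(best_threshold(y_true, y_prob))
print(best_threshold(y_true, y_prob, min_sensitivity=0.9))
```

For a screening use, where missing a depressed patient is the costlier error, a higher `min_sensitivity` would be a natural choice; a confirmatory use would instead constrain specificity.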

The results of this study can be incorporated into the development and implementation of relevant public health policies. Government departments can develop prevention and intervention strategies for depression in OSAHS patients according to the logistic regression model and influencing factors identified in this study, which may improve patients’ mental health. Public health departments and medical institutions can allocate resources to strengthen the prevention and treatment of depression in patients with OSAHS. This could include making more counselors or psychotherapists available and improving screening and diagnostic facilities for depression, among others. In related health education activities, the findings of this study should be disseminated to improve the public’s awareness of depression in OSAHS patients. This helps reduce social discrimination against depression and promotes social support and understanding. The results can also guide medical practice. By applying the logistic regression model, medical personnel can more accurately identify the risk of depression in patients with OSAHS and intervene and treat in a timely manner. This helps to improve the early diagnosis rate of depression and to provide more effective, personalized treatment options that enhance patients’ mental health. The results also provide an essential basis for developing depression prevention strategies for OSAHS patients. Based on the influencing factors in the logistic regression model, medical personnel can carry out targeted interventions, including strengthening psychological health education and psychological support services for OSAHS patients and establishing health management programs.

It should be noted that our results may have been affected by various potential biases in the NHANES database. One of the main biases is the low participation rate of specific populations, which may introduce sampling bias. The data may therefore be poorly representative and may not fully reflect the OSAHS population, which may affect the generalizability of the findings. A multicenter study is suggested to expand sample representativeness and increase external validity and generalization ability. Second, some of the data in the NHANES database rely on participant self-reports, such as diagnostic information for patients with OSAHS. Self-reports are subject to subjective evaluation and recall bias, so the assessed severity of OSAHS symptoms may be inaccurate. This may affect the reliability and accuracy of the findings. Therefore, to improve the objectivity and accuracy of diagnosis, we suggest introducing objective measurement tools, such as polysomnography (PSG). Using these tools can reduce the dependence on self-reports and help assess the symptoms and severity of OSAHS more accurately. Future studies could also consider integrating other data sources, such as clinical records and medical insurance databases, to obtain more comprehensive, multi-angle OSAHS data. Combining these diverse data sources can increase the reliability and generalizability of the findings.

In addition, future research should further optimize the ML model’s performance to improve the prediction accuracy of depression in patients with OSAHS. The introduction of other machine learning algorithms or deep learning methods can also be considered to explore better predictive models. These methods can model and analyze the data from different perspectives and improve the stability and accuracy of the model. By exploring effective intervention strategies and conducting intervention trials, the effects of other methods can be evaluated, and their long-term effects can be followed up to reduce the incidence of depression and improve the quality of life of patients with OSAHS.

Conclusion

This research used the NHANES database to establish three ML models, the logistic regression model, the lasso model, and the random forest model, to predict depression in the OSAHS population and identify the related factors. Among them, the logistic regression model was superior to the lasso and random forest models in overall prediction performance. By drawing the nomogram and applying it in sleep testing centers or sleep clinics, sleep technicians and medical staff can quickly and easily identify OSAHS patients at risk of depression and arrange the necessary referral and psychological treatment.

Limitations

The data used in this study were obtained from the NHANES database, which includes a variety of relevant variables, such as smoking history and sleep disorders. Most of these variables are based on patient self-reporting and may have subjective bias, affecting the data’s objective accuracy.

The NHANES database lacks some relevant variables, such as the AHI and minimum oxygen saturation. Including these predictors might allow for better model prediction performance.

In this study, due to the limited size of the screened sample, we did not perform external validation of the predictive model. We hope that future studies will externally validate this model through a multicenter study with an increased sample size and apply it to a community population for screening, to further test the generalization ability and robustness of the model.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The NCHS Research Ethics Review Board (ERB) reviewed and approved the studies involving human participants. Written informed consent for participation was not required for this study by the national legislation and the institutional requirements.

Author contributions

EL: Writing – original draft, Conceptualization, Data curation, Formal analysis, Resources, Visualization. FA: Writing – review & editing, Investigation, Methodology, Software. CL: Writing – review & editing, Funding acquisition, Project administration, Supervision, Validation.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This research was supported by the Social Science Planning Foundation of Liaoning Province (L21CSH005).

Acknowledgments

Thanks to all the authors for their hard work on this study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2023.1348803/full#supplementary-material

Footnotes

1. ^ https://www.cdc.gov/nchs/nhanes/index.htm

References

1. Aalbers, S, Fusar-Poli, L, Freeman, RE, Spreen, M, Ket, JCF, Vink, AC, et al. Music therapy for depression. Cochrane Database Syst Rev. (2017) 2017:CD004517. doi: 10.1002/14651858.CD004517.pub3

2. Prince, M, Patel, V, Saxena, S, Maj, M, Maselko, J, Phillips, MR, et al. No health without mental health. Lancet. (2007) 370:859–77. doi: 10.1016/s0140-6736(07)61238-0

3. World Health Organization. Depression and other common mental disorders: global health estimates. Geneva: World Health Organization (2017).

4. Douglas, N, Young, A, Roebuck, T, Ho, S, Miller, BR, Kee, K, et al. Prevalence of depression in patients referred with snoring and obstructive sleep apnoea. Intern Med J. (2013) 43:630–4. doi: 10.1111/imj.12108

5. Barak-Corren, Y, Castro, VM, Javitt, S, Hoffnagle, AG, Dai, Y, Perlis, RH, et al. Predicting suicidal behavior from longitudinal electronic health records. Am J Psychiatr. (2017) 174:154–62. doi: 10.1176/appi.ajp.2016.16010077

6. Herrman, H, Patel, V, Kieling, C, Berk, M, Buchweitz, C, Cuijpers, P, et al. Time for united action on depression: a Lancet–World Psychiatric Association Commission. Lancet. (2022) 399:957–1022. doi: 10.1016/s0140-6736(21)02141-3

7. Rundo, JV. Obstructive sleep apnea basics. Cleve Clin J Med. (2019) 86:2–9. doi: 10.3949/ccjm.86.s1.02

8. Veasey, SC, Solomon, CG, and Rosen, IM. Obstructive sleep apnea in adults. N Engl J Med. (2019) 380:1442–9. doi: 10.1056/NEJMcp1816152

9. Edwards, C, Almeida, OP, and Ford, AH. Obstructive sleep apnea and depression: a systematic review and meta-analysis. Maturitas. (2020) 142:45–54. doi: 10.1016/j.maturitas.2020.06.002

10. Peppard, PE, Szklo-Coxe, M, Hla, KM, and Young, T. Longitudinal association of sleep-related breathing disorder and depression. Arch Intern Med. (2006) 166:1709–15. doi: 10.1001/archinte.166.16.1709

11. Harris, M, Glozier, N, Ratnavadivel, R, and Grunstein, RR. Obstructive sleep apnea and depression. Sleep Med Rev. (2009) 13:437–44. doi: 10.1016/j.smrv.2009.04.001

12. Chen, Y-H, Keller, JK, Kang, J-H, Hsieh, H-J, and Lin, H-C. Obstructive sleep apnea and the subsequent risk of depressive disorder: a population-based follow-up study. J Clin Sleep Med. (2013) 09:417–23. doi: 10.5664/jcsm.2652

13. Gharsalli, H, Harizi, C, Zaouche, R, Sahnoun, I, Saffar, F, Maalej, S, et al. Prevalence of depression and anxiety in obstructive sleep apnea. Tunis Med. (2022) 100:525–33.

14. Mihalj, M. Depression and fatigue are due to obstructive sleep apnea in multiple sclerosis. Acta Clin Croat. (2022) 61:599–604. doi: 10.20471/acc.2022.61.04.05

15. Lang, CJ, Appleton, SL, Vakulin, A, McEvoy, RD, Wittert, GA, Martin, SA, et al. Co-morbid OSA and insomnia increases depression prevalence and severity in men. Respirology. (2017) 22:1407–15. doi: 10.1111/resp.13064

16. Haddock, N, and Wells, ME. The association between treated and untreated obstructive sleep apnea and depression. Neurodiagn J. (2018) 58:30–9. doi: 10.1080/21646821.2018.1428462

17. Lee, Y, Ragguett, R-M, Mansur, RB, Boutilier, JJ, Rosenblat, JD, Trevizol, A, et al. Applications of machine learning algorithms to predict therapeutic outcomes in depression: a meta-analysis and systematic review. J Affect Disord. (2018) 241:519–32. doi: 10.1016/j.jad.2018.08.073

18. Hatton, CM, Paton, LW, McMillan, D, Cussens, J, Gilbody, S, and Tiffin, PA. Predicting persistent depressive symptoms in older adults: a machine learning approach to personalised mental healthcare. J Affect Disord. (2019) 246:857–60. doi: 10.1016/j.jad.2018.12.095

19. Shatte, ABR, Hutchinson, DM, and Teague, SJ. Machine learning in mental health: a scoping review of methods and applications. Psychol Med. (2019) 49:1426–48. doi: 10.1017/s0033291719000151

20. Su, D, Zhang, X, He, K, and Chen, Y. Use of machine learning approach to predict depression in the elderly in China: a longitudinal study. J Affect Disord. (2021) 282:289–98. doi: 10.1016/j.jad.2020.12.160

21. Xia, F, Li, Q, Luo, X, and Wu, J. Machine learning model for depression based on heavy metals among aging people: a study with National Health and Nutrition Examination Survey 2017–2018. Front Public Health. (2022) 10:10. doi: 10.3389/fpubh.2022.939758

22. Li, M, Zou, X, Lu, H, Li, F, Xin, Y, Zhang, W, et al. Association of sleep apnea and depressive symptoms among US adults: a cross-sectional study. BMC Public Health. (2023) 23:427. doi: 10.1186/s12889-023-15358-8

23. Yan, X, Wang, L, Liang, C, Zhang, H, Zhao, Y, Zhang, H, et al. Development and assessment of a risk prediction model for moderate-to-severe obstructive sleep apnea. Front Neurosci. (2022) 16:936946. doi: 10.3389/fnins.2022.936946

24. Mazzotti, DR, Keenan, BT, Lim, DC, Gottlieb, DJ, Kim, J, and Pack, AI. Symptom subtypes of obstructive sleep apnea predict incidence of cardiovascular. Am J Respir Crit Care Med. (2019) 200:493–506. doi: 10.1164/rccm.201808-1509OC

25. Schmickl, CN, Orr, JE, and Kim, P. Point-of-care prediction model of loop gain in patients with obstructive sleep. BMC Pulm Med. (2022) 22:158. doi: 10.1186/s12890-022-01950-y

26. Keshavarz, Z, and Rezaee, R. Obstructive sleep apnea: a prediction model using supervised machine learning. Stud Health Technol Inform. (2020) 272:387–90. doi: 10.3233/SHTI200576

27. Curtin, LR, Mohadjer, LK, Dohrmann, SM, Kruszon-Moran, D, Mirel, LB, Carroll, MD, et al. National Health and Nutrition Examination Survey: sample design, 2007–2010. Vital Health Stat 2. (2013) 160:1–23.

28. Johnson, CL, Dohrmann, SM, Burt, VL, and Mohadjer, LK. National health and nutrition examination survey: sample design, 2011-2014. Vital Health Stat 2. (2014) 162:1–33.

29. Kroenke, K, Spitzer, RL, and Williams, JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. (2001) 19:708–9. doi: 10.1046/j.1525-1497.2001.016009606.x

30. McIntyre, RS, Lee, Y, Rong, C, Rosenblat, JD, Brietzke, E, Pan, Z, et al. Ecological momentary assessment of depressive symptoms using the mind.me application: convergence with the Patient Health Questionnaire-9 (PHQ-9). J Psychiatr Res. (2021) 135:311–7. doi: 10.1016/j.jpsychires.2021.01.012

31. Kroenke, K. PHQ-9: global uptake of a depression scale. World Psychiatry. (2021) 20:135–6. doi: 10.1002/wps.20821

32. Luo, L, Sun, W, Han, Y, Zhang, W, Liu, C, and Yin, S. Importance evaluation based on random forest algorithms: insights into the relationship between negative air ions variability and environmental factors in urban green spaces. Atmos. (2020) 11:706. doi: 10.3390/atmos11070706

33. Zhang, Y, and Zhang, Z. Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer. Front Public Health. (2022) 10:1008137. doi: 10.3389/fpubh.2022.1008137

34. Lv, J, Ren, H, Guo, X, Meng, C, Fei, J, Mei, H, et al. Nomogram predicting bullying victimization in adolescents. J Affect Disord. (2022) 303:264–72. doi: 10.1016/j.jad.2022.02.037

35. Asghari, A, Mohammadi, F, Kamrava, SK, Tavakoli, S, and Farhadi, M. Severity of depression and anxiety in obstructive sleep apnea syndrome. Eur Arch Otorhinolaryngol. (2012) 269:2549–53. doi: 10.1007/s00405-012-1942-6

36. Sforza, E, Saint Martin, M, Barthélémy, JC, and Roche, F. Mood disorders in healthy elderly with obstructive sleep apnea: a gender effect. Sleep Med. (2016) 19:57–62. doi: 10.1016/j.sleep.2015.11.007

37. Arias-Carrion, O, Dai, Y, Li, X, Zhang, X, Wang, S, Sang, J, et al. Prevalence and predisposing factors for depressive status in Chinese patients with obstructive sleep apnoea: a large-sample survey. PLoS One. (2016) 11:e0149939. doi: 10.1371/journal.pone.0149939

38. Salk, RH, Hyde, JS, and Abramson, LY. Gender differences in depression in representative national samples: meta-analyses of diagnoses and symptoms. Psychol Bull. (2017) 143:783–822. doi: 10.1037/bul0000102

39. Saunamäki, T, and Jehkonen, M. Depression and anxiety in obstructive sleep apnea syndrome: a review. Acta Neurol Scand. (2007) 116:277–88. doi: 10.1111/j.1600-0404.2007.00901.x

40. Kranjac, AW, Nie, J, Trevisan, M, and Freudenheim, JL. Depression and body mass index, differences by education: evidence from a population-based study of adult women in the U.S. Buffalo-Niagara region. Obes Res Clin Pract. (2017) 11:63–71. doi: 10.1016/j.orcp.2016.03.002

41. Heiskanen, TH, Koivumaa-Honkanen, HT, Niskanen, LK, Lehto, SM, Honkalampi, KM, Hintikka, JJ, et al. Depression and major weight gain: a 6-year prospective follow-up of outpatients. Compr Psychiatry. (2013) 54:599–604. doi: 10.1016/j.comppsych.2013.02.001

42. Licari, A, Castagnoli, R, Ciprandi, R, Brambilla, I, Guasti, E, Marseglia, GL, et al. Anxiety and depression in adolescents with asthma: a study in clinical practice. Acta Biomed. (2022) 93:e2022021. doi: 10.23750/abm.v93i1.10731

43. Wagner, S, Wollschläger, D, Dreimüller, N, Engelmann, J, Herzog, DP, Roll, SC, et al. Effects of age on depressive symptomatology and response to antidepressant treatment in patients with major depressive disorder aged 18 to 65 years. Compr Psychiatry. (2020) 99:152170. doi: 10.1016/j.comppsych.2020.152170

44. Luo, Y, Wang, A, Zeng, Y, and Zhang, J. A latent class analysis of resilience and its relationship with depressive symptoms in the parents of children with cancer. Support Care Cancer. (2022) 30:4379–87. doi: 10.1007/s00520-022-06860-7

45. Wickersham, A, Sugg, HVR, Epstein, S, Stewart, R, Ford, T, and Downs, J. Systematic review and meta-analysis: the association between child and adolescent depression and later educational attainment. J Am Acad Child Adolesc Psychiatry. (2021) 60:105–18. doi: 10.1016/j.jaac.2020.10.008

46. da Costa Dias, FL, Teixeira, AL, Guimarães, HC, Santos, APB, Resende, EPF, Machado, JCB, et al. The influence of age, sex and education on the phenomenology of depressive symptoms in a population-based sample aged 75+ years with major depression: the Pietà study. Aging Ment Health. (2019) 25:462–7. doi: 10.1080/13607863.2019.1698517

47. Glaesmer, H, Riedel-Heller, S, Braehler, E, Spangenberg, L, and Luppa, M. Age- and gender-specific prevalence and risk factors for depressive symptoms in the elderly: a population-based study. Int Psychogeriatr. (2011) 23:1294–300. doi: 10.1017/s1041610211000780

48. Moussavi, S, Chatterji, S, Verdes, E, Tandon, A, Patel, V, and Ustun, B. Depression, chronic diseases, and decrements in health: results from the World Health Surveys. Lancet. (2007) 370:851–8. doi: 10.1016/s0140-6736(07)61415-9

49. AJ, VD, Pasupathy, KS, Huschka, TR, Heaton, HA, Hellmich, TR, and Sir, MY. Extended patient alone time in emergency department leads to increased risk of 30-day hospitalization. J Patient Saf. (2021) 17:e1458–64. doi: 10.1097/PTS.0000000000000545

50. de Ávila, ÉD, de Molon, RS, Loffredo, LCM, Massucato, EMS, and Hochuli-Vieira, E. Health-related quality of life and depression in patients with dentofacial deformity. Oral Maxillofac Surg. (2012) 17:187–91. doi: 10.1007/s10006-012-0338-5

51. Escobedo, LG, and Kirch, DG. Depression and smoking initiation among US Latinos. Addiction. (1996) 91:113–9.

52. Weinberger, AH, Mazure, CM, Morlett, A, and McKee, SA. Two decades of smoking cessation treatment research on smokers with depression: 1990–2010. Nicotine Tob Res. (2012) 15:1014–31. doi: 10.1093/ntr/nts213

53. Jiang, M, Qin, P, and Yang, X. Comorbidity between depression and asthma via immune-inflammatory pathways: a meta-analysis. J Affect Disord. (2014) 166:22–9. doi: 10.1016/j.jad.2014.04.027

54. Akula, M, Kulikova, A, Khan, DA, and Brown, ES. The relationship between asthma and depression in a community-based sample. J Asthma. (2018) 55:1271–7. doi: 10.1080/02770903.2017.1418885

55. Choi, HG, Kim, J-H, Park, J-Y, Hwang, YI, Jang, SH, and Jung, K-S. Association between asthma and depression: a national cohort study. J Allergy Clin Immunol Pract. (2019) 7:1239–1245.e1. doi: 10.1016/j.jaip.2018.10.046

56. Edwards, C, Mukherjee, S, Simpson, L, Palmer, LJ, Almeida, OP, and Hillman, DR. Depressive symptoms before and after treatment of obstructive sleep apnea in men and women. J Clin Sleep Med. (2015) 11:1029–38. doi: 10.5664/jcsm.5020

57. Dai, P, Chang, W, Xin, Z, Cheng, H, Ouyang, W, and Luo, A. Retrospective study on the influencing factors and prediction of hospitalization expenses for chronic renal failure in China based on random forest and LASSO regression. Front Public Health. (2021) 9:678276. doi: 10.3389/fpubh.2021.678276

58. Zhang, C, and Ma, Y. Ensemble machine learning. Berlin: Springer Science & Business Media (2012).

59. Nguyen, PTT, Hoang, DV, Pham, KM, and Nguyen, HT. A multiple logistic regression model based on gamma-glutamyl transferase as a biomarker for early prediction of drug-induced liver injury in Vietnamese patients. J Clin Pharmacol. (2021) 62:110–7. doi: 10.1002/jcph.1955

60. Hu, H, Lai, X, Tan, C, Yao, N, and Yan, L. Factors associated with in-patient mortality in the rapid assessment of adult earthquake trauma patients. Prehosp Disaster Med. (2022) 37:299–305. doi: 10.1017/s1049023x22000693

61. Liu, Y. Prediction of depression in the elderly based on machine learning [bachelor]: Shandong University (2023).

Keywords: machine learning, depression, OSAHS, prediction models, NHANES

Citation: Li E, Ai F and Liang C (2024) A machine learning model to predict the risk of depression in US adults with obstructive sleep apnea hypopnea syndrome: a cross-sectional study. Front. Public Health. 11:1348803. doi: 10.3389/fpubh.2023.1348803

Received: 03 December 2023; Accepted: 22 December 2023; Published: 08 January 2024.

Edited by:

Peter ten Klooster, University of Twente, Netherlands

Reviewed by:

Seyed-Ali Sadegh-Zadeh, Staffordshire University, United Kingdom
Zhiang Niu, West China Hospital, Sichuan University, China

Copyright © 2024 Li, Ai and Liang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Chunguang Liang, liangchunguang@jzmu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.