- 1Shenzhen Hospital of Integrated Traditional Chinese and Western Medicine, Shenzhen, China
- 2Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China
- 3Clinical Medical of Acupuncture, Moxibustion and Rehabilitation, Guangzhou University of Chinese Medicine, Guangzhou, Guangdong, China
Background: Stroke is the second leading cause of death worldwide and a major cause of long-term neurological disability, imposing an enormous financial burden on families and society. This study aimed to identify the predictors in stroke patients and construct a nomogram prediction model based on these predictors.
Methods: This retrospective study included 11,435 participants aged >20 years who were selected from the NHANES 2011–2018. Randomly selected subjects (n = 8531; 75%) and the remaining subjects comprised the development and validation groups, respectively. The least absolute shrinkage and selection operator (LASSO) binomial and logistic regression models were used to select the optimal predictive variables. The stroke probability was calculated using a predictor-based nomogram. Nomogram performance was assessed by the area under the receiver operating characteristic curve (AUC) and the calibration curve with 1000 bootstrap resample validations. Decision curve analysis (DCA) was performed to evaluate the clinical utility of the nomogram.
Results: According to the minimum criteria of non-zero coefficients of Lasso and logistic regression screening, older age, lower education level, lower family income, hypertension, depression status, diabetes, heavy smoking, heavy drinking, trouble sleeping, congestive heart failure (CHF), coronary heart disease (CHD), angina pectoris and myocardial infarction were independently associated with a higher stroke risk. A nomogram model for stroke patient risk was established based on these predictors. The AUC (C statistic) of the nomogram was 0.843 (95% CI: 0.8186–0.8430) in the development group and 0.826 (95% CI: 0.7811, 0.8716) in the validation group. The calibration curves after 1000 bootstraps displayed a good fit between the actual and predicted probabilities in both the development and validation groups. DCA showed that the model in the development and validation groups had a net benefit when the risk thresholds were 0–0.2 and 0–0.25, respectively.
Discussion: This study effectively established a nomogram including demographic characteristics, vascular risk factors, emotional factors and lifestyle behaviors to predict stroke risk. This nomogram is helpful for screening high-risk stroke individuals and could assist physicians in making better treatment decisions to reduce stroke occurrence.
Introduction
The previous stroke incidence of 12.2 million was high, and it increased to 101 million by 2019 according to the latest Global Burden of Disease Study (GBD). In addition, the GBD 2019 systematic analysis also showed that stroke was still the second leading cause of death worldwide and a major cause of long-term neurological disability, which brought an enormous financial burden to families and society (GBD 2019 Stroke Collaborators, 2021). Therefore, prevention and treatment are vital to curb the stroke pandemic. However, because of the limited effective therapies of current medical treatment and the high incidence of neuropsychiatric sequelae of stroke, stroke prevention is an effective strategy to decrease the incidence of stroke and health consequences (Boehme et al., 2017; Kleindorfer et al., 2021). Previous studies have indicated that body mass index (BMI), high blood pressure, diabetes, cardiovascular disease, and dyslipidemia are strongly correlated with the incidence of stroke (Kurth et al., 2002; Li et al., 2005; Chen et al., 2016; Du et al., 2019; Alloubani et al., 2021; Shiozawa et al., 2021). Lifestyle behaviors, including smoking, alcohol consumption, sleep quality, and depression, were also observed to be related to the occurrence of stroke (Dong et al., 2012; Christensen et al., 2018; Zhou L. et al., 2020; Luo et al., 2021). In addition, studies have also reported that basic demographic information, such as sex, age, educational level, marital status, and income level, are important factors associated with stroke (Gan et al., 2017; Jackson et al., 2018; Liu et al., 2018; Zhou W. et al., 2020). Most of the previous studies only reported one risk factor connected with stroke. The comprehensive combination of all of the above potential risk factors to predict the risk of stroke is still lacking. The occurrence of stroke is an etiologically complex disease influenced by a variety of risk factors that act and interact together and is thus not influenced by only one risk factor (Boehme et al., 2017). Therefore, constructing a predictive model that combines all potential risk factors together to detect the risk of stroke in populations is important, and it will help to detect stroke risk early and accurately and help with the adoption of appropriate preventive strategies in a timely manner.
Currently, nomograms are used as predictive tools by integrating various predictors to create graphical calculation instruments of statistical predictive models, which provide a predicted probability of a clinical event or certain endpoint outcome (Cahlon et al., 2012; Makkouk et al., 2017). Hence, constructing a nomogram is helpful to forecast the probability of stroke and provide corresponding and comprehensive preventive suggestions for individual conditions in a timely manner. However, effective and sensitive calculation models for predicting the occurrence of stroke have been scarce until now.
Therefore, in this study, we investigated the risk factors for stroke and constructed a predictive model based on the National Health and Nutrition Examination Survey (NHANES) database. First, we selected highly fitted predictor variables by least absolute shrinkage and selection operator (LASSO) and regression analysis, and then a nomogram including all potential predictors was developed and validated to forecast the probability of stroke risk. This prediction model will help to screen and identify high-risk populations for suffering stroke and provide them with optimal and timely clinical decision-making and preventive recommendations.
Materials and methods
Study design
This was a retrospective study, obtaining data from the NHANES (2011–2018) 8-year dataset. NHANES is an open database, with approximately 5,000 people surveyed every 2 years, represented the United States population of all ages. Prior to NHANES data collection, NHANES was approved by the Institutional Review Board (IRB) of the National Center for Health Statistics Ethics, and written informed consent was obtained from all participants.
Study population
A total of 22,230 participants aged >20 years were selected from the NHANES 2011–2018. The inclusion criteria include the following: (1) patients with stroke (defined as participants had ever been told by a doctor or professional that they had a stroke); and (2) patients were over 20 years old. People lost clinical data or incomplete clinical information of predictor variables and outcome were excluded. Then, we cleaned the data and removed the missing data of demographic characteristics and predictor variables. Ultimately, a total of 11,435 participants were selected for our final analyses. Among them, 8531 (75%) were randomly selected using EmpowerStats software as the development group, and the remaining subjects comprised the validation group.
Candidate predictor variables
According to the clinical experience and literature review, some variables were selected as risk predictors that might influence the occurrence of stroke, including basic demographic information (such as sex, age, educational level, marital status, income level), lifestyle behaviors (smoking, alcohol drinking, sleeping quality, and depression mood), and other highly correlated risk factors [such as BMI (underweight <18.5, normal weight > = 18.5, <25, overweight > = 25, <30, obesity > = 30), high blood pressure, diabetes, dyslipidemia and cardiovascular disease]. The basic demographic information was extracted from the self-report questionnaire and mainly included age, sex (male or female), marital status (married, divorced, separated, widowed, single, never married), educational level (less than high school, high school and more than high school), and family income level (high income > = 55,000$, 5,5000$ >mediate income > = 20,000$, lower income <20,000$).
Lifestyle behavior indexes were obtained through a questionnaire and included smoking (smoker or non-smoker), alcohol drinking status (light < = 5 alcohol drinks/day–past 12 months, 5 alcohol drinks/days <moderate < = 10 alcohol drinks/day–past 12 months, heavy >10 alcohol drinks/day-past 12 months), trouble sleeping (told doctor that they had trouble sleeping or did not tell doctor that they have trouble sleeping), and depression status [a score of > = 10 of the PHQ-9 defined as depression status, PHQ-9 < 10 defined as normal status (Kroenke et al., 2001)]. The other potential risk factors included high blood pressure (hypertension or normal), diabetes (have you ever been told by a doctor or health professional that you have diabetes, yes or no), cardiovascular disease [including ever told by a doctor had congestive heart failure (CHF), coronary heart disease (CHD), angina pectoris, myocardial infarction, yes or no] were also obtained from the questionnaire. Blood lipid data, including high-density lipoprotein [HDL, <1.00 mmol/L (low), 1.00–1.6 mmol/L (moderate), > = 1.6 mmol/L (high)], total cholesterol [TC, 0 < 5.2 mmol/L (normal), 5.2–6.2 mmol/L (borderline), > = 6.2 mmol/L (high)], were extracted from the NHANES laboratory data.
Outcome (stroke) assessment
The endpoint outcome of our study was defined as the occurrence of stroke. The predefined stroke groups were based on the Medical Condition Questionnaire of NHANES. Participants were asked whether they had ever been told by a doctor or professional that they had a stroke. If the answer was “Yes,” then we classified these participants as the predefined stroke group. The remaining population was defined as the non-stroke group.
Statistical analysis
Demographic data, including continuous variables and categorical variables, were extracted from the NHANES database by R 4.2.0. All continuous variables were compared between groups using the Kruskal-Wallis rank sum test and are presented as the mean ± standard deviation (mean ± SD). The chi-square test or Fisher’s exact test (if the theoretical frequency was less than 10) was performed to compare categorical variables and displayed as frequencies and proportions. Missing variable data were excluded from the analysis. A risk prediction model for stroke was constructed based on the logistic regression method. First, to avoid the collinearity of inclusion covariates and filter the optimal predicted risk factors, we screened the potential risk factors for stroke by least absolute shrinkage and selection operator (Lasso) regression. This method was helpful to avoid the collinearity of inclusion covariates and select the optimal predicted risk factors (Sauerbrei et al., 2007). The optimal features with non-zero coefficients were selected by Lasso regression and then included in a multivariable logistic regression to screen out all significant risk factors. Risk features with P < 0.05 were selected, and the nomogram prediction model was established. Receiver operator characteristic (ROC) curves were used to evaluate the sensitivity and specificity of the nomogram (Heagerty et al., 2000). A concordance index (C-index), calibration curves and a decision curve analysis (DCA) were used to measure the predictive performance of the nomogram. A decision curve analysis (DCA) was performed to evaluate the clinical utility of the nomogram by calculating the net benefits at different threshold probabilities (Vickers et al., 2008). To improve the accuracy and stability of the model, the 1000 bootstrap resample validation method was also conducted for internal validation. For testing validation, we performed ROC, C-index and calibration curve analyses and DCA using the same methods as mentioned above. All statistical analyses were conducted with EmpowerStats and R-4.12 software. A p-value of less than 0.05 indicated statistical significance.
Results
Baseline characteristics
A total of 11,435 participants were enrolled in this study based on the pre-specified inclusion and exclusion criteria. All included subjects had a mean age of 46.8 ± 16.9 years and included 5,959 males (52.1%) and 5,476 females (47.9%). Seventy-five percent of the eligible subjects were randomly divided into the development group (n = 8531), and the rest were divided into the validation group (n = 2904). Among the 11,435 individual subjects, 305 patients were diagnosed with stroke, 223 in the training group and 82 in the validation group. Only age, family income and hypertension status variables had significant differences between the development group and validation group (p < 0.05). The specific demographic information and clinical characteristics are shown in Table 1.
Clinical predictor selection
According to clinical experience, expert opinion and previous literature, we included 17 potential predictors in the Lasso logistics analysis to avoid overfitting of the variables and enhance the accuracy of the prediction model. Finally, a total of 16 variables were left based on the minimum criteria of non-zero coefficients (Figure 1). Twelve risk factors with p-values less than 0.05 were further selected by logistic regression [including age, education, family income, hypertension, depression status, diabetes, alcohol, smoking, trouble sleeping, coronary heart disease (CHD), angina pectoris and myocardial infarction; Table 2].
Figure 1. Clinical feature selection using the Lasso logistic regression model. (A) The partial likelihood deviance (binomial deviance) curve was plotted vs. log (lambda). Dotted vertical lines were presented at the optimal predictors using the minimum criteria (min.criteria) and the 1 SE of the minimum criteria (1-SE criteria). (B) Lasso coefficients of a total 17 clinical features.
Development and assessment of the predictive nomogram
Model development
We developed a predictive full nomogram containing age, educational level, family income, hypertension, depression status, diabetes, cardiovascular disease [including congestive heart failure (CHF), coronary heart disease (CHD), angina pectoris, myocardial infarction], trouble sleeping, smoking, and drinking based on the minimum criteria of non-zero coefficients of Lasso regression and significant logistic regression screening. Each predictor was calculated as a specific score on a rating scale, the total points of each variable were summed, and a vertical line was drawn downward at the total points to correspond to the probability of stroke. A higher score indicated a higher probability of stroke (shown in Figure 2). The area under the ROC curve [AUC (C statistic)] of this full nomogram was 0.843 (95% CI: 0.8186–0.8430; Figure 3A). The calibration curves after 1000 bootstraps displayed a good fit between the actual and predicted probabilities in the nomogram, which indicated that the predictive models were stable and accurate (Figure 4A). The curves showed that the model had a net benefit when the risk threshold was between 0 and 0.2 (Figure 5A).
Figure 3. Receiver operator characteristic (ROC) curves of the nomogram. (A) ROC curves of the nomogram in the training group. (B) ROC curve of the nomogram in the validation group.
Figure 4. Calibration curves of the nomogram. (A) ROC curves of the nomogram in the training group. (B) ROC curve of the nomogram in the validation group.
Figure 5. Decision curve of the nomogram. (A) Decision curves of the nomogram in the training group. (B) Decision curves of the nomogram in the validation group.
Model validation
The 25% random internal validation indicated that these nomogram models had good predictive performance and good stability. The AUC (C statistic) of the model in the validation cohort was 0.826 (95% CI: 0.7811, 0.8716) (Figure 3B). The nomogram calibration curve of the validation cohort also demonstrated that the model had good calibration (Figure 4B). The decision curves showed that the model had a net benefit when the risk threshold was between 0 and 0.25 (Figure 5B).
Discussion
Given the high occurrence rate of stroke and heavy burden of post-stroke sequelae, an optimal prediction model for stroke prevention is clinically meaningful. Therefore, in this study, we screened the potential risk factors by Lasso and logistic regression and constructed and validated a novel nomogram-based score including these competing risk factors to predict the risk of suffering stroke.
Our results revealed that patients with older age, lower education level, lower family income, hypertension, depression status, diabetes, heavy smoking, heavy drinking, trouble sleeping, myocardial infarction congestive heart failure (CHF), coronary heart disease (CHD), angina pectoris and myocardial infarction had a higher risk of developing stroke. Similar to the reports of previous studies, older age, hypertension and diabetes were significantly and highly associated with stroke (Hu et al., 2005; Alloubani et al., 2018; Yousufuddin and Young, 2019).
Aging increases susceptibility to cerebrovascular diseases and increases the complications of stroke with the decline of physical function (Boehme et al., 2017; Yousufuddin and Young, 2019). Hypertension and diabetes mainly affect vascular blood flow and vascular function, which are positively associated with stroke mortality and morbidity (Sowers et al., 2001; Chang et al., 2022). Heart disease, including coronary heart disease (CHD) and myocardial infarction, was a confirmed risk factor and was considered to be 3–4 times more prevalent than the absence of heart disease (Arboix, 2015; Virani et al., 2021). These results further validated that aging, hypertension, diabetes and heart disease were associated with a higher risk for developing stroke. This study also indicated that individuals with lower family income had a higher risk for stroke than those with higher income. Related studies were in line with our study and suggested that lower income was strongly related to stroke occurrence and its risk factors, including hypertension and smoking (Virani et al., 2021). Heavy smoking is a dangerous risk factor for stroke, as previously reported. Most of the studies suggested that an increment of five cigarettes would increase the risk of stroke by 12% according to the dose-response analysis or that heavy cigarette consumption (5 or more cigarettes per day) would increase the risk of developing stroke (Pan et al., 2019; Luo et al., 2021). Regarding trouble sleeping, most studies have demonstrated that long-term sleep disturbance causes endocrine abnormalities that directly or indirectly influence blood pressure, glucose, and lipids, resulting in cerebrovascular pathology of stroke (Grandner et al., 2016; Phua et al., 2017). Emotional behavior also influences the occurrence of stroke, and our study indicated that depression had a positive association with a high risk of developing stroke, which was consistent with previous study analyses (Li et al., 2021). Depression might increase stroke risk by affecting neuroendocrine and inflammatory responses and unhealthy behaviors (heavy smoking and drinking, irregulated diet) (Strine et al., 2008; Pan et al., 2011; Shen et al., 2022).
We used Lasso regression to build an optimized nomogram-based model. The AUC results in the training and validation groups were 0.843 and 0.825, respectively, indicating that the nomogram predictive model had good accuracy and stability. This nomogram model was convenient to use because the number of predictors was smaller and the calculation methods were easy to master. This nomogram, including demographic characteristics, vascular risk factors, emotional factors and lifestyle behaviors, is comprehensive and sensitive for identifying high-risk individuals who have not yet developed stroke. Previous studies indicated that stroke occurrence was not only one risk factor but also influenced by multiple factors that interact with each other (Boehme et al., 2017). Nomograms are helpful for predicting the probability of suffering from disease (Park, 2018). Previous studies have shown that nomograms are widely used in the prediction of stroke, including establishing prediction models to identify blood biomarkers or medical imaging data associated with stroke and stroke implications (Liu J. et al., 2022; Liu L. et al., 2022; Yang et al., 2022). All these results indicated that the nomogram is stable and accurate in predicting the probability of suffering stroke, which was in line with our study.
Our nomogram model can obtain the total score of risk factors and the risk probability of developing stroke, which can help physicians make more helpful suggestions for patients. For instance, for an individual aged 60 years who had a lower education and family income, hypertension, depression status, diabetes, heavy smoking, heavy drinking, trouble sleeping, coronary heart disease (CHD), congestive heart failure (CHF) and myocardial infarction (angina pectoris), the total score of risk factors provided by the nomogram model is 355, which corresponds to a 75% probability of stroke. Therefore, the physician could judge that this patient has a high risk of developing stroke and suggest that this patient actively control risk factors by lifestyle and behavior change and take corresponding treatment measures to reduce stroke risk.
To our knowledge, previous studies also established predictive model to predict the stroke risk. All previous prediction models including Framingham Stroke Risk Score, CHADS2 and CHA2DS2-VASc Score, were stable and accurate in predicting the probability of suffering stroke, which were consistent with our results. However, the Framingham Stroke Risk Profile (FSRP) that only includes predictors such as age, systolic blood pressure, antihypertensive treatment, diabetes history, smoking, and cardiovascular disease history to predict the stroke risk in future 10 years (Wolf et al., 1991). Therefore, the model may have some limitations in that it lacks some important stroke predictors. Other prediction models including CHADS2 and CHA2DS2-VASc Score mainly assessed the incidence of stroke in patients with non-valvular atrial fibrillation, which could better guide stroke risk stratification and anticoagulant drug application in patients with atrial fibrillation (Gage et al., 2001; Lip et al., 2010). These models could only apply in atrial fibrillation people which might narrow the scope of application of the population compared with our prediction model. Therefore, in this model, we comprehensively included potential risk factors and established a predictive model by a visual graphical calculation instrument to forecast the probability of stroke.
However, there are still several limitations of our prediction model. First, our nomogram model was based on the NHANES database, indicating that the data were retrospective, and some variables were based on self-report, which increased the selection bias and reduced the accuracy of our model. We established the prediction model from random and different time points. Second, because of the limited data of NHANES, some potential pathogenic gene statuses, etiological subtypes, imaging results and other important risk factors were not investigated to further establish the comprehensive prediction model. Third, this model was mainly established based on the American population, and multicenter clinical validation is needed to further evaluate the external applicability of this nomogram model.
Conclusion
This study established a nomogram that included demographic characteristics, vascular risk factors, emotional factors and lifestyle behaviors to predict the risk of developing stroke. The validations also showed the accurate and stable predictive performance of this nomogram. This nomogram is helpful to screen high-risk stroke individuals and could assist physicians in making better treatment decisions to reduce stroke occurrence.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.cdc.gov/nchs/nhanes/.
Ethics statement
The studies involving human participants were reviewed and approved by the Institutional Review Board (IRB) of the National Center for Health Statistics Ethics. The patients/participants provided their written informed consent to participate in this study.
Author contributions
CW and CT designed the study. ZX, SZ, and ML acquired the data. CW and QW analysis and interpreted the data. CW drafted the manuscript. QW and CT revised the manuscript for important intellectual content. All authors read and approved the final manuscript.
Funding
This study was supported by the National Natural Science Foundation of China (No. 82104978), Natural Science Foundation of Guangdong Province (No. 2023A1515012234), Bao’an TCM Development Foundation (No. 2022KJCX-ZJZL-7), and Sanming Project of Medicine in Shenzhen (No. SZZYSM202106009).
Acknowledgments
We are very grateful to all the researchers involved in this study. We also thank the staff of the National Health and Nutrition Examination Survey Team for providing the data of NHANES available on their website.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Alloubani, A., Nimer, R., and Samara, R. (2021). Relationship between Hyperlipidemia, Cardiovascular disease and stroke: A systematic review. Curr. Cardiol. Rev. 17:e051121189015. doi: 10.2174/1573403X16999201210200342
Alloubani, A., Saleh, A., and Abdelhafiz, I. (2018). Hypertension and diabetes mellitus as a predictive risk factors for stroke. Diabetes Metab. Syndr. 12, 577–584. doi: 10.1016/j.dsx.2018.03.009
Arboix, A. (2015). Cardiovascular risk factors for acute stroke: Risk profiles in the different subtypes of ischemic stroke. World J. Clin. Cases 3, 418–429. doi: 10.12998/wjcc.v3.i5.418
Boehme, A., Esenwa, C., and Elkind, M. (2017). Stroke risk factors, genetics, and prevention. Circ. Res. 120, 472–495. doi: 10.1161/CIRCRESAHA.116.308398
Cahlon, O., Brennan, M., Jia, X., Qin, L., Singer, S., and Alektiar, K. (2012). A postoperative nomogram for local recurrence risk in extremity soft tissue sarcomas after limb-sparing surgery without adjuvant radiation. Ann. Surg. 255, 343–347. doi: 10.1097/SLA.0b013e3182367aa7
Chang, W., Fei, S., Pan, N., Yao, Y., and Jin, Y. (2022). Incident stroke and its influencing factors in patients with Type 2 diabetes mellitus and/or hypertension: A prospective cohort study. Front. Cardiovasc. Med. 9:770025. doi: 10.3389/fcvm.2022.770025
Chen, R., Ovbiagele, B., and Feng, W. (2016). Diabetes and stroke: Epidemiology, pathophysiology, pharmaceuticals and outcomes. Am. J. Med. Sci. 351, 380–386. doi: 10.1016/j.amjms.2016.01.011
Christensen, A., Nordestgaard, B., and Tolstrup, J. (2018). Alcohol intake and risk of ischemic and Haemorrhagic stroke: Results from a Mendelian Randomisation study. J. Stroke 20, 218–227. doi: 10.5853/jos.2017.01466
Dong, J., Zhang, Y., Tong, J., and Qin, L. (2012). Depression and risk of stroke: A meta-analysis of prospective studies. Stroke 43, 32–37. doi: 10.1161/STROKEAHA.111.630871
Du, X., Wang, C., Ni, J., Gu, H., Liu, J., Pan, J., et al. (2019). Association of blood pressure with stroke risk, stratified by age and stroke type, in a low-income population in China: A 27-year prospective cohort study. Front. Neurol. 10:564. doi: 10.3389/fneur.2019.00564
Gage, B., Waterman, A., Shannon, W., Boechler, M., Rich, M., and Radford, M. (2001). Validation of clinical classification schemes for predicting stroke: Results from the National Registry of Atrial Fibrillation. JAMA 285, 2864–2870. doi: 10.1001/jama.285.22.2864
Gan, Y., Wu, J., Zhang, S., Li, L., Yin, X., Gong, Y., et al. (2017). Prevalence and risk factors associated with stroke in middle-aged and older Chinese: A community-based cross-sectional study. Sci. Rep. 7:9501. doi: 10.1038/s41598-017-09849-z
GBD 2019 Stroke Collaborators (2021). Global, regional, and national burden of stroke and its risk factors, 1990-2019: A systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 20, 795–820.
Grandner, M., Alfonso-Miller, P., Fernandez-Mendoza, J., Shetty, S., Shenoy, S., and Combs, D. (2016). Sleep: Important considerations for the prevention of cardiovascular disease. Curr. Opin. Cardiol. 31, 551–565. doi: 10.1097/HCO.0000000000000324
Heagerty, P., Lumley, T., and Pepe, M. (2000). Time-dependent ROC curves for censored survival data and a diagnostic marker. Biometrics 56, 337–344. doi: 10.1111/j.0006-341x.2000.00337.x
Hu, G., Sarti, C., Jousilahti, P., Peltonen, M., Qiao, Q., Antikainen, R., et al. (2005). The impact of history of hypertension and type 2 diabetes at baseline on the incidence of stroke and stroke mortality. Stroke 36, 2538–2543. doi: 10.1161/01.STR.0000190894.30964.75
Jackson, C., Sudlow, C., and Mishra, G. (2018). Education, sex and risk of stroke: A prospective cohort study in New South Wales. Australia. BMJ Open 8:e024070. doi: 10.1136/bmjopen-2018-024070
Kleindorfer, D., Towfighi, A., Chaturvedi, S., Cockroft, K., Gutierrez, J., Lombardi-Hill, D., et al. (2021). 2021 guideline for the prevention of stroke in patients with stroke and transient ischemic attack: A guideline from the American Heart Association/American Stroke Association. Stroke 52, e364–e467. doi: 10.1161/STR.0000000000000375
Kroenke, K., Spitzer, R., and Williams, J. (2001). The PHQ-9: Validity of a brief depression severity measure. J. Gen. Intern. Med. 16, 606–613. doi: 10.1046/j.1525-1497.2001.016009606.x
Kurth, T., Gaziano, J., Berger, K., Kase, C., Rexrode, K., Cook, N., et al. (2002). Body mass index and the risk of stroke in men. Arch. Intern. Med. 162, 2557–2562. doi: 10.1001/archinte.162.22.2557
Li, C., Engstrom, G., Hedblad, B., Berglund, G., and Janzon, L. (2005). Blood pressure control and risk of stroke: A population-based prospective cohort study. Stroke 36, 725–730. doi: 10.1161/01.STR.0000158925.12740.87
Li, J., Li, L., Lv, Y., Kang, Y., Zhu, M., and Wang, W. (2021). Effect of the interaction between depression and sleep disorders on the stroke occurrence: An analysis based on National Health and Nutritional Examination Survey. Behav. Neurol. 2021:6333618. doi: 10.1155/2021/6333618
Lip, G., Nieuwlaat, R., Pisters, R., Lane, D., and Crijns, H. (2010). Refining clinical risk stratification for predicting stroke and thromboembolism in atrial fibrillation using a novel risk factor-based approach: The euro heart survey on atrial fibrillation. Chest 137, 263–272. doi: 10.1378/chest.09-1584
Liu, J., Chou, E., Lau, K., Woo, P., Li, J., and Chan, K. (2022). Machine learning algorithms identify demographics, dietary features, and blood biomarkers associated with stroke records. J. Neurol. Sci. 440:120335. doi: 10.1016/j.jns.2022.120335
Liu, L., Zhou, J., Zhang, Y., Lu, J., Gan, Z., Ye, Q., et al. (2022). A nomogram for individualized prediction of calf muscular vein thrombosis in stroke patients during rehabilitation: A retrospective study. Clin. Appl. Thromb. Hemost. 28:10760296221117991. doi: 10.1177/10760296221117991
Liu, Q., Wang, X., Wang, Y., Wang, C., Zhao, X., Liu, L., et al. (2018). Association between marriage and outcomes in patients with acute ischemic stroke. J. Neurol. 265, 942–948. doi: 10.1007/s00415-018-8793-z
Luo, J., Tang, X., Li, F., Wen, H., Wang, L., Ge, S., et al. (2021). Cigarette smoking and risk of different pathologic types of stroke: A systematic review and dose-response meta-analysis. Front. Neurol. 12:772373. doi: 10.3389/fneur.2021.772373
Makkouk, A., Sundaram, V., Chester, C., Chang, S., Colevas, A., Sunwoo, J., et al. (2017). Characterizing CD137 upregulation on NK cells in patients receiving monoclonal antibody therapy. Ann. Oncol. 28, 415–420. doi: 10.1093/annonc/mdw570
Pan, A., Sun, Q., Okereke, O., Rexrode, K., and Hu, F. (2011). Depression and risk of stroke morbidity and mortality: A meta-analysis and systematic review. JAMA 306, 1241–1249. doi: 10.1001/jama.2011.1282
Pan, B., Jin, X., Jun, L., Qiu, S., Zheng, Q., and Pan, M. (2019). The relationship between smoking and stroke: A meta-analysis. Medicine 98:e14872. doi: 10.1097/MD.0000000000014872
Park, S. (2018). Nomogram: An analogue tool to deliver digital knowledge. J. Thorac. Cardiovasc. Surg. 155:1793. doi: 10.1016/j.jtcvs.2017.12.107
Phua, C., Jayaram, L., and Wijeratne, T. (2017). Relationship between sleep duration and risk factors for stroke. Front. Neurol. 8:392. doi: 10.3389/fneur.2017.00392
Sauerbrei, W., Royston, P., and Binder, H. (2007). Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Stat. Med. 26, 5512–5528. doi: 10.1002/sim.3148
Shen, R., Zhao, N., Wang, J., Guo, P., Shen, S., Liu, D., et al. (2022). Association between level of depression and coronary heart disease, stroke risk and all-cause and cardiovascular mortality: Data from the 2005-2018 National Health and Nutrition Examination Survey. Front. Cardiovasc. Med. 9:954563. doi: 10.3389/fcvm.2022.954563
Shiozawa, M., Kaneko, H., Itoh, H., Morita, K., Okada, A., Matsuoka, S., et al. (2021). Association of body mass index with ischemic and hemorrhagic stroke. Nutrients 13:2343. doi: 10.3390/nu13072343
Sowers, J., Epstein, M., and Frohlich, E. D. (2001). Diabetes, hypertension, and cardiovascular disease: An update. Hypertension 37, 1053–1059. doi: 10.1161/01.HYP.37.4.1053
Strine, T., Mokdad, A., Dube, S., Balluz, L., Gonzalez, O., Berry, J., et al. (2008). The association of depression and anxiety with obesity and unhealthy behaviors among community-dwelling US adults. Gen. Hosp. Psychiatry. 30, 127–137. doi: 10.1016/j.genhosppsych.2007.12.008
Vickers, A., Cronin, A., Elkin, E., and Gonen, M. (2008). Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med. Inform. Decis. Mak. 8:53. doi: 10.1186/1472-6947-8-53
Virani, S., Alonso, A., Aparicio, H., Benjamin, E., Bittencourt, M., Callaway, C., et al. (2021). Heart disease and stroke statistics-2021 update: A report from the American Heart Association. Circulation 143, e254–e743. doi: 10.1161/CIR.0000000000000950
Wolf, P., D’Agostino, R., Belanger, A., and Kannel, W. (1991). Probability of stroke: A risk profile from the Framingham Study. Stroke 22, 312–318. doi: 10.1161/01.STR.22.3.312
Yang, J., Yang, X., Wen, J., Huang, J., Jiang, L., Liao, S., et al. (2022). Development of a nomogram for predicting asymptomatic coronary artery disease in patients with ischemic stroke. Curr. Neurovasc. Res. 19, 188–195. doi: 10.2174/1574887117666220513104303
Yousufuddin, M., and Young, N. (2019). Aging and ischemic stroke. Aging 11, 2542–2544. doi: 10.18632/aging.101931
Zhou, L., Yu, K., Yang, L., Wang, H., Xiao, Y., Qiu, G., et al. (2020). Sleep duration, midday napping, and sleep quality and incident stroke: The Dongfeng-Tongji cohort. Neurology 94, e345–e356. doi: 10.1212/WNL.0000000000008739
Keywords: stroke, risk factors, nomogram, prediction model, NHANES
Citation: Wu C, Xu Z, Wang Q, Zhu S, Li M and Tang C (2023) Development, validation, and visualization of a novel nomogram to predict stroke risk in patients. Front. Aging Neurosci. 15:1200810. doi: 10.3389/fnagi.2023.1200810
Received: 05 April 2023; Accepted: 17 July 2023;
Published: 07 August 2023.
Edited by:
Mario Bernardo-Filho, Rio de Janeiro State University, BrazilCopyright © 2023 Wu, Xu, Wang, Zhu, Li and Tang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chunxiao Wu, wcxnyd@126.com
†These authors have contributed equally to this work and share first authorship