Skip to main content

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 29 November 2023
Sec. Coronary Artery Disease
This article is part of the Research Topic Periodontitis and Cardiovascular Disease: Shared Clinical Challenges in Patient Care View all 4 articles

Application of machine learning algorithms to construct and validate a prediction model for coronary heart disease risk in patients with periodontitis: a population-based study

\r\nYicheng Wang,,,Yicheng Wang1,2,3,4Binghang Ni,,,Binghang Ni1,2,3,4Yuan Xiao,,,Yuan Xiao1,2,3,4Yichang Lin,,,Yichang Lin1,2,3,4Yu Jiang,,,Yu Jiang1,2,3,4Yan Zhang,,,
\r\nYan Zhang1,2,3,4*
  • 1Department of Cardiovascular Medicine, Affiliated Fuzhou First Hospital of Fujian Medical University, Fuzhou, Fujian, China
  • 2The Third Clinical Medical College, Fujian Medical University, Fuzhou, Fujian, China
  • 3Cardiovascular Disease Research Institute of Fuzhou City, Fuzhou, Fujian, China
  • 4The Graduate School of Fujian Medical University, Fuzhou, Fujian, China

Background: The association between periodontitis and cardiovascular disease is increasingly recognized. In this research, a prediction model utilizing machine learning (ML) was created and verified to evaluate the likelihood of coronary heart disease in individuals affected by periodontitis.

Methods: We conducted a comprehensive analysis of data obtained from the National Health and Nutrition Examination Survey (NHANES) database, encompassing the period between 2009 and 2014.This dataset comprised detailed information on a total of 3,245 individuals who had received a confirmed diagnosis of periodontitis. Subsequently, the dataset was randomly partitioned into a training set and a validation set at a ratio of 6:4. As part of this study, we conducted weighted logistic regression analyses, both univariate and multivariate, to identify risk factors that are independent predictors for coronary heart disease in individuals who have periodontitis. Five different machine learning algorithms, namely Logistic Regression (LR), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Classification and Regression Tree (CART), were utilized to develop the model on the training set. The evaluation of the prediction models’ performance was conducted on both the training set and validation set, utilizing metrics including AUC (Area under the receiver operating characteristic curve), Brier score, calibration plot, and decision curve analysis (DCA). Additionally, a graphical representation called a nomogram was created using logistic regression to visually depict the predictive model.

Results: The factors that were found to independently contribute to the risk, as determined by both univariate and multivariate logistic regression analyses, encompassed age, race, presence of myocardial infarction, chest pain status, utilization of lipid-lowering medications, levels of serum uric acid and serum creatinine. Among the five evaluated machine learning models, the KNN model exhibited exceptional accuracy, achieving an AUC value of 0.977. The calibration plot and brier score illustrated the model's ability to accurately estimate probabilities. Furthermore, the model's clinical applicability was confirmed by DCA.

Conclusion: Our research showcases the effectiveness of machine learning algorithms in forecasting the likelihood of coronary heart disease in individuals with periodontitis, thereby aiding healthcare professionals in tailoring treatment plans and making well-informed clinical decisions.

Introduction

Periodontitis is an inflammatory response that affecting the periodontal tissues, leading to progressive degradation of the tooth-supporting structures. This pathological process is initiated by a consortium of pathogenic microorganisms presenting within dental plaque (1, 2). As a highly prevalent chronic condition among the global adult population, periodontitis impacts more than 50% of individuals worldwide (3). According to statistics, periodontitis affects many American individuals over the age of 30 (4). However, there has been a recent surge in the prevalence of periodontitis among younger individuals (5).

In the United States and throughout the world, coronary heart disease (CHD) is a chronic cardiovascular illness that exhibits significant morbidity and mortality rates (6, 7). The underlying pathophysiology is that the narrowing or blockage of coronary arteries due to atherosclerosis leads to decreased blood and oxygen supply to myocardial tissues, ultimately resulting in tissue necrosis and the development of the disease (8). Any age can get coronary heart disease, and as people get older, their chances of getting it increase (9). Therefore, the recognition and control of traditional risk elements associated with CHD, including smoking, alcohol intake, high blood pressure, obesity, diabetes mellitus, and lack of physical activity, are pivotal in both preventing and managing CHD (10).

Prior research has firmly established a strong association between the development of cardiovascular diseases, such as CHD, and the presence of elevated inflammation levels (6, 11). Multiple research studies have provided evidence indicating a correlation between periodontitis and an elevated likelihood of developing cardiovascular disease (1215). Periodontitis also has an impact on the outcome of individuals with cardiovascular disease and raises the likelihood of mortality (16). This association may be explained by inflammatory mediators since periodontitis can alter the amounts of inflammatory markers in the blood (17). The bacteremia and systemic inflammatory state associated with periodontitis plays a significant role in the development of vascular endothelial lesions and the enhancement of inflammatory processes in the vascular wall (18). Through the induction of a systemic inflammatory response and autoimmune illness, a chronic infection brought on by periodontitis progresses to atherosclerosis and, ultimately, coronary heart disease (19).

In recent years, machine learning has become a powerful computerized method for analyzing data and has been widely embraced in the field of medicine as an effective tool for predicting disease risks (2025). Several studies have shown that clinical prediction models utilize the powerful predictive power of machine learning algorithms to outperform traditional statistical methods (26, 27). Considering the increasing association between periodontitis and cardiovascular diseases, including CHD, no studies have been reported on prediction model for CHD risk in patients with periodontitis. Therefore, a prediction model that integrates risk factors is needed to assess the risk of CHD in patients with periodontitis. Our study utilizes data from NHANES spanning 2009–2014, in order to construct a prediction model for assessing the risk of CHD among individuals with periodontitis. By employing machine learning algorithms and comparing their performance, our aim is to facilitate personalized clinical decision-making for healthcare professionals. The identification of high-risk patient groups will enable targeted interventions, thereby reducing hospitalization rates and enhancing overall clinical outcomes.

Methods

Study design and participants

NHANES is a research initiative that aims to conduct a comprehensive assessment of the health and nutritional well-being of individuals, including both adults and children, who are living within the borders of the United States. Annually, a representative sample of approximately 5,000 individuals is selected, ensuring national coverage, while the database undergoes updates every two years. The study evaluated the intricate health status of Americans by employing a series of sophisticated stratified, multi-stage sampling designs. Our study used data from a total of 30,434 subjects from three consecutive cycles of NHANES 2009–2014. The exclusion criteria were: (1) Age less than or less than 30 years. (2) Those who were not diagnosed with periodontitis. (3) Missing values for at least one of all variables included in the participants of this study. After inclusion and exclusion, a total of 3,245 subjects aged 30 years and above who participated in a demographic survey, physical examination, and questionnaire with a diagnosis of periodontitis were finally included in our analyses. All individuals involved in the study provided their consent after signing an informed consent document, and the survey protocol received approval from the Research Ethics Review Board at the National Center for Health Statistics. All procedures were carried out following applicable guidelines and regulations. The screening flowchart of the study population is shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. ROC curve analysis of 5 ML algorithms in the validating set.

Ethics of approval statement

The authors assume complete accountability for all aspects of the research, ensuring thorough investigation and resolution of any inquiries regarding the accuracy or integrity of any segment of the study. The study adhered to the revised 2013 Declaration of Helsinki. Since all data from the NHANES program is publicly accessible and free, there was no necessity for medical ethics committee board approval. Prior to participation, written informed consent was acquired from all participants.

The definition of periodontitis

Comprehensive periodontal examinations were conducted by a dental hygienist to evaluate the periodontal status of participants. Participants aged 30 years and above were eligible for inclusion in the periodontal assessment if they possessed at least one tooth (excluding the third molar) and did not meet any of the health exclusion criteria. According to the AAP/CDC criteria, periodontitis was assessed and classified into non-existent, mild, moderate, and severe categories based on its severity (28). The total number of periodontitis cases was calculated by aggregating the incidences of mild, moderate, and severe cases.

The definition of coronary heart disease

The assessment of CHD status primarily relied on questionnaires, wherein participants were asked if a healthcare professional had ever diagnosed them with CHD. Affirmative responses indicated the presence of CHD in the subject.

Other data selection and measurements

The demographic characteristics can be categorized according to sex (female, male), race (non-Hispanic white individuals, non-Hispanic black individuals, Mexican American, Hispanic American, other races), marital status [unmarried, married or cohabiting with a partner, married but currently living alone (separated, divorced, or widowed)], and educational attainment level (below 9th grade, 9th-11th grade, high school graduate, partial college or AA graduate or higher).

To compute the poverty-to-income ratio (PIR), household (or individual) income was divided by state-specific poverty thresholds corresponding to the survey year. Waist circumference (WC) and body mass index (BMI) were measured by medical professionals at mobile screening stations, with BMI calculated as weight in kilograms divided by height in meters squared (kg/m2). Sleep duration on workdays, sedentary behavior, and weekly physical activity were self-reported through a questionnaire. Participants were asked about their daily sitting time to assess sedentary behavior. Sleep duration on workdays was determined by querying participants about their typical amount of sleep received during such days. Physical activity time was obtained by asking subjects about the duration of exercise performed per week.

Smoking status of each participant was assessed through self-report and categorized into three groups: non-smokers, former smokers who are no longer smoking, and current smokers. Drinking status was classified as follows: never drinkers, abstainers after previous drinking, heavy drinkers (≥3 drinks per day for women/≥4 drinks per day for men/five or more days of binge drinking per month), moderate drinkers (≥2 drinks per day for women/≥3 drinks per day for men/≥2 days of binge drinking per puff), and light drinkers (excluding the above). The ACC/AHA proposes a two-stage classification system for blood pressure, wherein hypertension is defined as having a systolic blood pressure (SBP) exceeding 130 mmHg and/or diastolic blood pressure (DBP) below 80 mmHg (29). The question “Have you ever received a diagnosis of high blood pressure, also known as hypertension, from a medical professional?” was employed to ascertain the presence of hypertension in the study participants. Diabetes conditions are categorized into three groups: no diabetes, prediabetes, and diabetes. To assess prediabetes, participants were asked the following question: ‘Have you ever received a diagnosis of pre-diabetes, impaired fasting glucose, impaired glucose tolerance, borderline diabetes or been informed by your doctor or healthcare provider that your blood glucose levels are higher than normal but not high enough to be classified as diabetes mellitus or glucose diabetes?’ Participants who responded affirmatively were considered prediabetic. Similarly, individuals diagnosed with diabetes were identified by asking the question: ‘Have you ever been told by a doctor or health professional that you have diabetes?'“ The use of glucose-lowering drugs can be divided into three types: no use of antihyperglycemic/ antihypertensive/ lipid-lowering, use of antihyperglycemic/ antihypertensive/ lipid-lowering, taking medications prescribed other than antihyperglycemic/ antihypertensive/ lipid-lowering. Various blood biochemical markers such as total cholesterol (TCHOL), albumin (ALB), high density lipoprotein (HDL), uric acid (UA), triglycerides (TG), creatinine (CR), blood urea nitrogen (BUN), and HbA1c are measured from the laboratory.

Development and validation of five machine learning prediction models

The data was randomly partitioned into training set and validation set in a 6:4 ratio. Five machine learning algorithms, namely Gradient Boosting Machine (GBM), Support Vector Machine (SVM), Logistic Regression (LR), Classification and Regression Tree (CART), and K-Nearest Neighbor (KNN), were employed to construct models on the training set. The performance of each model was assessed on the validation set and compared accordingly. The optimal choice of the machine algorithm is determined by selecting the model with the highest AUC.

The predictive performance of the five machine learning models was assessed by plotting the receiver operating characteristic curve (ROC) in both the training set and validation set, and calculating the AUC. The model with the highest AUC was selected as the optimal choice for each algorithm. Calibration curve analysis and Brier score were employed to examine the correlation between actual probabilities and predicted probabilities for each model. Additionally, DCA was utilized to evaluate the clinical applicability of these five machine learning algorithms.

Statistical analysis

NHANES employs a stratified multistage probability sampling technique, wherein specific sampling weights are assigned to each participant based on the primary sampling unit, thereby facilitating the generation of nationally representative estimates (30). Following the NHANES analysis guidelines, new sample weights were generated by combining continuous data from three two-year cycles (original two-year sample weights divided by 2). NHANES sample weights were utilized for baseline descriptions and logistic regression analyses. In the baseline information, means and standard errors were used to represent continuous variables, whereas categorical variables were presented as percentages (%) and numbers (n). Between-group disparities were assessed by employing t-tests for continuous variables and Fisher's exact tests or chi-square tests for categorical variables. Univariate and multivariate logistic regression models were employed to identify independent risk factors for CHD, with odds ratio (OR) and corresponding 95% confidence intervals (CI) used as effect estimates. Statistical analyses were conducted using R software (version 4.3.0) and SPSS version 28.0, considering P < 0.05 as statistically significant.

Results

The baseline characteristics of study population

The study included a total of 3,245 participants aged 30 years and older, who had previously been diagnosed with periodontitis, based on data extracted from the NHANES database spanning from 2009 to 2014. The mean age was 54.13 years and included 2,016 males and 1,229 females. There were 41.33% non-Hispanic white persons, 23.11% non-Hispanic black persons, 15.93% Mexican Americans, 9.18% Hispanic Americans, and 10.45% Americans of other races. Regarding the marital status of the participants, 10.63% were categorized as unmarried, while 63.05% were identified as married or cohabiting, and 26.32% reported being divorced or separated. The study population of 3,245 was partitioned into training set and validation set in a ratio of 6:4. The study population was stratified into two groups based on the presence or absence of CHD in both the training set and validation set. In the training set, there were significant statistical differences observed between the two groups with respect to age, race, marital status, smoking and alcohol consumption, waist circumference, uric acid, creatinine, urea nitrogen, albumin, total cholesterol, high-density lipoprotein, HbA1c, time spent in physical activity, myocardial infarction, chest pain, diabetes mellitus, hyperlipidemia, and use of antihyperglycemic/antihypertensive/lipid-lowering medication (P < 0.05). In the validation set, there was a notable dissimilarity observed between the two groups concerning age, gender, race, uric acid, creatinine, urea nitrogen, smoking status, hypertension status, myocardial infarction status, chest pain status, diabetes mellitus status, and the use of antihyperglycemic/antihypertensive/lipid-lowering medication(P < 0.05). The demographic profile of the study sample is presented in Table 1.

TABLE 1
www.frontiersin.org

Table 1. Weighted baseline characteristics.

Univariate and multivariate regression analysis

Table 2 presents the outcomes of employing logistic regression analysis to identify the factors contributing to CHD risk. Univariate regression analysis demonstrated that age, race, smoking status, myocardial infarction status, hypertension status, chest pain status, hyperlipidemia status, diabetes mellitus status, waist circumference, UA, CR, BUN, ALB, TCHOL, HDL, HbA1c, and the use of antihyperglycemic/antihypertensive/lipid-lowering medication had a notable association between the emergence of CHD risk in individuals with periodontitis (P < 0.05). The variables with a significance level of P < 0.05 in the univariate regression analysis were included in the multivariate regression analysis. This analysis revealed that age, race, myocardial infarction status, chest pain status, lipid-lowering medication use, UA levels, and CR levels emerged as the final predictors utilized for constructing the model assessing coronary heart disease risk among patients with periodontitis(P < 0.05).

TABLE 2
www.frontiersin.org

Table 2. Weighted univariate and multivariate regression analysis.

Development and validation of five machine learning models

The training set incorporates five machine learning algorithms, namely LR, CART, GBM, SVM, and KNN, to construct the prediction model. Subsequently, the predictive performance of these models is assessed through ROC curve analysis (Figure 2A). The K-nearest neighbor algorithm model demonstrated the highest predictive performance for assessing the risk of coronary heart disease in the periodontitis population (AUC = 0.977), followed by the support vector machine model (AUC = 0.932), gradient boosting machine model (AUC = 0.911), logistic regression model (AUC = 0.886), and classification and regression tree model (AUC = 0.849). As depicted in Figure 2B, among the five machine learning models in the validation set, the k-nearest neighbor model exhibits superior performance in ROC curve analysis with an AUC value of 0.938. Meanwhile, the accuracy of the prediction results relative to the actual occurrence of events was evaluated by the calibration curves of the training set and validation set. The calibration curve for the training set show that the predictive ability of the k-nearest neighbor model is very similar to the actual results (Figure 3A). Of course, the calibration curve for the validation set show that the k-nearest neighbor model also performs well (Figure 3B). Additionally, the discriminative power of the model was assessed by calculating the Brier score in both the training and validation sets. Amongst the five machine learning models, the k-nearest neighbor algorithm exhibited a superior discrimination with a Brier score of 0.019 in the training set, compared to 0.024 for support vector machine model, 0.024 for gradient boosting machine model, 0.022 for classification and regression tree model, and 0.024 for logistic regression model respectively. Consequently, it can be concluded that the k-nearest neighbor algorithm demonstrates optimal discrimination ability. Furthermore, in the validation set, this algorithm also outperformed others with a Brier score of 0.022 as shown in Table 3. The DCA of the training set demonstrates that among the five machine learning models, the k-nearest neighbor model exhibits superior performance, thereby confirming its qualified clinical utility (Figure 4A). Furthermore, the DCA of the validation set reveals a significant positive net benefit in predicting risk associated with the implementation of the k-nearest neighbor model (Figure 4B). Therefore, the KNN model is selected as the ultimate prediction model.

FIGURE 2
www.frontiersin.org

Figure 2. (A) ROC curve analysis of 5 ML algorithms in the training set. (B) ROC curve analysis of 5 ML algorithms in the validating set.

FIGURE 3
www.frontiersin.org

Figure 3. (A) Calibration curve analysis of 5 ML algorithms in the training set. (B) Calibration curve analysis of 5 ML algorithms in the validating set.

TABLE 3
www.frontiersin.org

Table 3. Brier scores for training set and validating set.

FIGURE 4
www.frontiersin.org

Figure 4. (A) DCA curve analysis of 5 ML algorithms in the training set. (B) DCA curve analysis of 5 ML algorithms in the validating set.

Development of nomogram

Given the satisfactory performance of logistic regression predictions, we constructed a nomogram (Figure 5) on the training set to show the practicality and visualization of our model for predicting CHD risk in individuals with periodontitis.

FIGURE 5
www.frontiersin.org

Figure 5. Nomogram for the risk of coronary heart disease for patients with periodontitis.

Relative significance of factors in machine learning algorithms

Based on the final results, we have identified the KNN model as the ultimate predictive model. The relative importance of the variables in the five machine algorithm models is shown in Figure 6. The predictor variables in the KNN model were ranked in descending order of importance, age, CR, myocardial infarction status, race, chest pain status, UA, and lipid-lowering medication use.

FIGURE 6
www.frontiersin.org

Figure 6. Importance ranking of variables in KNN model.

Discussion

By analyzing a total of 3,245 NHANES 2009–2014 participants, we developed and validated five distinct machine learning algorithms (LR, CART, GBM, SVM, and KNN) to accurately predict the risk of CHD in individuals suffering from. The NHANES study used a large stratified, multi-stage sampling design, which weighted the data to more accurately reflect overall population characteristics than unweighted results. Through weighted logistic regression analysis, we identified seven variables: age, race, myocardial infarction status, chest pain status, usage of lipid-lowering medication, UA levels, and CR levels. Notably, each machine learning model exhibited distinct characteristics in terms of identification accuracy, calibration performance, and clinical utility; among them all, the KNN model demonstrated superior predictive ability. Consequently, this machine learning-based approach holds promise for clinicians to estimate disease prevalence within specific populations.

The age factor emerged as the foremost risk determinant in our investigation. Significantly, the mean age of patients diagnosed with periodontitis and concurrent coronary heart disease exceeded that of patients without coronary heart disease by more than a decade. In physiology, aging is an irreversible process characterized by a gradual decline in physiological functions (31). The risk of CHD escalates significantly with advancing age (32). Therefore, the early diagnosis and treatment of chronic diseases such as CHD are crucial, necessitating the development of strategies to prevent coronary heart disease at an early stage and enhance the quality of life among middle-aged and elderly individuals.

Despite notable advancements in reducing the burden of cardiovascular disease within the general population of the United States, persistent racial and ethnic disparities in cardiovascular disease mortality remain evident. Specifically, individuals of black ethnicity in the United States continue to exhibit a heightened susceptibility to cardiovascular disease compared to other racial and ethnic groups (33). The genetic basis may underlie the observed racial disparities. Further research is necessary to comprehensively understand the heterogeneous distribution of cardiovascular disease based on race and ethnicity, as well as to elucidate the underlying factors contributing to racial and ethnic disparities.

Numerous studies have consistently demonstrated a robust correlation among elevated levels of UA and the pathogenesis and progression of coronary atherosclerosis, along with the severity of CHD, cardiovascular mortality, and all-cause mortality (34, 35). Furthermore, logistic regression analysis in our study revealed a significant association between uric acid levels and the risk of CHD. This relationship may be attributed to the induction of oxidative stress, endothelial dysfunction and inflammatory mechanisms triggered by elevated uric acid concentrations, thereby increasing the susceptibility to coronary heart disease (36, 37). Therefore, active uric acid-lowering therapy is imperative in the presence of hyperuricemia concomitant with cardiovascular disorders such as coronary heart disease, aiming to retard the progression of cardiovascular disease and optimize patient prognosis.

The occurrence of myocardial infarction serves as the principal manifestation of coronary artery disease, representing a severe and critical condition (38). Immediate surgical intervention is typically necessary for the management of acute myocardial infarction (39). Chest pain serves as a clinical manifestation of CHD and frequently acts as a precursor to acute myocardial infarction (40).

According to our findings, the positive association between creatinine and the risk of CHD remained robust even after controlling for all potential confounding factors. Previous studies have demonstrated a positive correlation between elevated serum creatinine levels and an augmented risk of cardiovascular diseases, including coronary heart disease (41, 42). The presence of elevated serum creatinine levels typically indicates renal impairment, which is commonly associated with an augmented cardiovascular risk (43). Therefore, it is imperative to reduce serum creatinine levels in order to achieve optimal cardiovascular risk management in patients diagnosed with coronary heart disease.

Our study employed machine learning algorithms to specifically predict the risk of coronary heart disease in individuals with periodontitis, a factor that has been rarely explored in previous research endeavors, despite the extensive evaluation of CHD risk within the general population. To the best of our knowledge, this study presents the pioneering application of a machine learning-based predictive model to evaluate the risk of CHD in participants diagnosed with periodontitis.

Naturally, our study is subject to certain limitations. Firstly, NHANES is based on cross-sectional features of the survey, which makes it difficult to determine causality for the diseases under discussion because of the unclear sequence of events. Secondly, owing to inherent limitations in the NHANES study design, this study was unable to provide prognostic insights into the timing and severity of coronary heart disease. Thirdly, although we partitioned the NHANES dataset into a training set and a validation set in a 6:4 ratio, we did not incorporate external data for assessing the predictive model's validity. Additionally, our chosen population solely consisted of adult individuals residing in the United States, thereby limiting its direct applicability to populations in other countries. Consequently, there is an imperative need for conducting multi-center studies across diverse nations. Lastly, the data we used were all from the NHANES database, including home interviews and mobile examination center (MEC) health checks. This may cause some interference with the accuracy of our data and affect the objectivity of the results.

Conclusion

In this study, we developed a machine learning-based prediction model to assess the risk of coronary heart disease in patients with periodontitis. Our findings demonstrate that among five machine learning models, the KNN model exhibited superior predictive performance. The implementation of our prediction model enables healthcare professionals to provide early and personalized diagnosis and treatment plans for patients with periodontitis, thereby facilitating effective management of coronary heart disease risk.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). All information from the NHANES program is available and free for public, so the agreement of the medical Ethics Committee board was not necessary. All study participants, in the NHANES data we utilized, provided informed consent prior to their participation in the NHANES survey, as per the NHANES protocol. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

YW: Conceptualization, Formal Analysis, Methodology, Software, Visualization, Writing – original draft. BN: Writing – review & editing. YX: Writing – review & editing. YL: Writing – review & editing. YJ: Writing – review & editing. YZ: Funding acquisition, Writing – original draft, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article.

This work was supported by the Fuzhou Key Specialty Project (Grant number 20191005), and the Fuzhou “14th Five-Year Plan” Clinical Specialty Training and Cultivation Construction Project (Grant number 20220103).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Winning L, Patterson CC, Linden K, Evans A, Yarnel J, McKeown PP, et al. Periodontitis and risk of prevalent and incident coronary heart disease events. J Clin Periodontol. (2020) 47(12):1446–56. doi: 10.1111/jcpe.13377

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Ma C, Wu M, Gao J, Liu C, Xie Y, Lv Q, et al. Periodontitis and stroke: a Mendelian randomization study. Brain Behav. (2023) 13(2):e2888. doi: 10.1002/brb3.2888

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Marruganti C, Baima G, Aimetti M, Grandini S, Sanz M, Romandini M. Periodontitis and low cognitive performance: a population-based study. J Clin Periodontol. (2023) 50(4):418–29. doi: 10.1111/jcpe.13779

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Eke PI, Thornton-Evans GO, Wei L, Borgnakke WS, Dye BA, Genco RJ. Periodontitis in US adults: national health and nutrition examination survey 2009-2014. J Am Dent Assoc. (2018) 149(7):576–588.e6. doi: 10.1016/j.adaj.2018.04.023

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Wu L, Zhang SQ, Zhao L, Ren ZH, Hu CY. Global, regional, and national burden of periodontitis from 1990 to 2019: results from the global burden of disease study 2019. J Periodontol. (2022) 93(10):1445–54. doi: 10.1002/JPER.21-0469

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Wu L, Shi Y, Kong C, Zhang J, Chen S. Dietary inflammatory index and its association with the prevalence of coronary heart disease among 45,306 US adults. Nutrients. (2022) 14(21):4553. doi: 10.3390/nu14214553

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Ma J, Li K. Systemic immune-inflammation index is associated with coronary heart disease: a cross-sectional study of NHANES 2009-2018. Front Cardiovasc Med. (2023) 10:1199433. doi: 10.3389/fcvm.2023.1199433

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Huang M, Liu F, Li Z, Liu Y, Su J, Ma M, et al. Relationship between red cell distribution width/albumin ratio and carotid plaque in different glucose metabolic states in patients with coronary heart disease: a RCSCD-TCM study in China. Cardiovasc Diabetol. (2023) 22(1):39. doi: 10.1186/s12933-023-01768-w

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Huang Y, Lin Y, Zhai X, Cheng L. Association of beta-2-microglobulin with coronary heart disease and all-cause mortality in the United States general population. Front Cardiovasc Med. (2022) 9:834150. doi: 10.3389/fcvm.2022.834150

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Sun P, Weng H, Fan F, Zhang N, Liu Z, Chen P, et al. Association between plasma vitamin B5 and coronary heart disease: results from a case-control study. Front Cardiovasc Med. (2022) 9:906232. doi: 10.3389/fcvm.2022.906232

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Zhou N, Xie ZP, Liu Q, Xu Y, Dai SC, Lu J, et al. The dietary inflammatory index and its association with the prevalence of hypertension: a cross-sectional study. Front Immunol. (2022) 13:1097228. doi: 10.3389/fimmu.2022.1097228

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Walther C, Wenzel JP, Schnabel RB, Heydecke G, Seedorf U, Beikler T, et al. Association between periodontitis and heart failure in the general population. ESC heart Failure. (2022) 9(6):4189–97. doi: 10.1002/ehf2.14150

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Rosa RAC, Rodrigues JVS, Cláudio MM, Franciscon JPS, Mulinari-Santos G, Cirelli T, et al. The relationship between hypertension and periodontitis: a cross-sectional study. J Clin Med. (2023) 12(15):5140. doi: 10.3390/jcm12155140

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Wagner AK, D’Souza M, Bang CN, Holmstrup P, Blanche P, Fiehn NE, et al. Treated periodontitis and recurrent events after first-time myocardial infarction: a Danish nationwide cohort study. J Clin Periodontol. (2023) 50:1305–14. doi: 10.1111/jcpe.13853

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Czesnikiewicz-Guzik M, Osmenda G, Siedlinski M, Nosalski R, Pelka P, Nowakowski D, et al. Causal association between periodontitis and hypertension: evidence from Mendelian randomization and a randomized controlled trial of non-surgical periodontal therapy. Eur Heart J. (2019) 40(42):3459–70. doi: 10.1093/eurheartj/ehz646

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Chen F, Song Y, Li W, Xu H, Dan H, Chen Q. Association between periodontitis and mortality of patients with cardiovascular diseases: a cohort study based on NHANES. J Periodontol. (2023). doi: 10.1002/JPER.23-0276

CrossRef Full Text | Google Scholar

17. Gao S, Tian J, Li Y, Liu T, Li R, Yang L, et al. Periodontitis and number of teeth in the risk of coronary heart disease: an updated meta-analysis. Med Sci Monit. (2021) 27:e930112. doi: 10.12659/MSM.930112

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Cardoso EM, Reis C, Manzanares-Céspedes MC. Chronic periodontitis, inflammatory cytokines, and interrelationship with other chronic diseases. Postgrad Med. (2018) 130(1):98–104. doi: 10.1080/00325481.2018.1396876

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Gao K, Wu Z, Liu Y, Tao L, Luo Y, Yang X, et al. Risk of coronary heart disease in patients with periodontitis among the middled-aged and elderly in China: a cohort study. BMC oral Health. (2021) 21(1):621. doi: 10.1186/s12903-021-01951-z

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Handelman GS, Kok HK, Chandra RV, Razavi AH, Lee MJ, Asadi H. Edoctor: machine learning and the future of medicine. J Intern Med. (2018) 284(6):603–19. doi: 10.1111/joim.12822

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Li W, Wang J, Liu W, Xu C, Li W, Zhang K, et al. Machine learning applications for the prediction of bone cement leakage in percutaneous vertebroplasty. Front Public Health. (2021) 9:812023. doi: 10.3389/fpubh.2021.812023

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Nemati S, Holder A, Razmi F, Stanley MD, Clifford GD, Buchman TG. An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit Care Med. (2018) 46(4):547–53. doi: 10.1097/CCM.0000000000002936

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Bhardwaj V, Sharma A, Parambath SV, Gul I, Zhang X, Lobie PE, et al. Machine learning for endometrial cancer prediction and prognostication. Front Oncol. (2022) 12:852746. doi: 10.3389/fonc.2022.852746

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Shi Y, Ma L, Chen X, Li W, Feng Y, Zhang Y, et al. Prediction model of obstructive sleep apnea-related hypertension: machine learning-based development and interpretation study. Front Cardiovasc Med. (2022) 9:1042996. doi: 10.3389/fcvm.2022.1042996

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Li W, Hong T, Liu W, Dong S, Wang H, Tang ZR, et al. Development of a machine learning-based predictive model for lung metastasis in patients with ewing sarcoma. Front Med (Lausanne). (2022) 9:807382. doi: 10.3389/fmed.2022.807382

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Deo RC. Machine learning in medicine. Circulation. (2015) 132(20):1920–30. doi: 10.1161/CIRCULATIONAHA.115.001593

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Li W, Wang G, Wu R, Dong S, Wang H, Xu C, et al. Dynamic predictive models with visualized machine learning for assessing chondrosarcoma overall survival. Front Oncol. (2022) 12:880305. doi: 10.3389/fonc.2022.880305

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Eke PI, Page RC, Wei L, Thornton-Evans G, Genco RJ. Update of the case definitions for population-based surveillance of periodontitis. J Periodontol. (2012) 83(12):1449–54. doi: 10.1902/jop.2012.110664

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Whelton PK, Carey RM, Mancia G, Kreutz R, Bundy JD, Williams B. Harmonization of the American college of cardiology/American heart association and European society of cardiology/European society of hypertension blood pressure/hypertension guidelines: comparisons, reflections, and recommendations. J Am Coll Cardiol. (2022) 80(12):1192–201. doi: 10.1016/j.jacc.2022.07.005

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Wu LD, Chu P, Kong CH, Shi Y, Zhu MH, Xia YY, et al. Estimated pulse wave velocity is associated with all-cause mortality and cardiovascular mortality among adults with diabetes. Front Cardiovasc Med. (2023) 10:1157163. doi: 10.3389/fcvm.2023.1157163

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Li Z, Zhang Z, Ren Y, Wang Y, Fang J, Yue H, et al. Aging and age-related diseases: from mechanisms to therapeutic strategies. Biogerontology. (2021) 22(2):165–87. doi: 10.1007/s10522-021-09910-5

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Si J, Chen L, Yu C, Guo Y, Sun D, Pang Y, et al. Healthy lifestyle, DNA methylation age acceleration, and incident risk of coronary heart disease. Clin Epigenetics. (2023) 15(1):52. doi: 10.1186/s13148-023-01464-2

PubMed Abstract | CrossRef Full Text | Google Scholar

33. He J, Zhu Z, Bundy JD, Dorans KS, Chen J, Hamm LL. Trends in cardiovascular risk factors in US adults by race and ethnicity and socioeconomic Status, 1999-2018. JAMA. (2021) 326(13):1286–98. doi: 10.1001/jama.2021.15187

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Yu W, Cheng JD. Uric acid and cardiovascular disease: an update from molecular mechanism to clinical perspective. Front Pharmacol. (2020) 11:582680. doi: 10.3389/fphar.2020.582680

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Grossman C, Grossman E, Goldbourt U. Uric acid variability at midlife as an independent predictor of coronary heart disease and all-cause mortality. PLoS One. (2019) 14(8):e0220532. doi: 10.1371/journal.pone.0220532

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Maulana S, Nuraeni A, Aditya Nugraha B. The potential of prognostic biomarkers of uric acid levels in coronary heart disease among aged population: a scoping systematic review of the latest cohort evidence. J Multidiscip Healthc. (2022) 15:161–73. doi: 10.2147/JMDH.S340596

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Liang L, Hou X, Bainey KR, Zhang Y, Tymchak W, Qi Z, et al. The association between hyperuricemia and coronary artery calcification development: a systematic review and meta-analysis. Clin Cardiol. (2019) 42(11):1079–86. doi: 10.1002/clc.23266

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Wu M, Huang Z, Zeng L, Wang C, Wang D. Programmed cell death of endothelial cells in myocardial infarction and its potential therapeutic strategy. Cardiol Res Pract. (2022) 2022:6558060. doi: 10.1155/2022/6558060

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Saito Y, Oyama K, Tsujita K, Yasuda S, Kobayashi Y. Treatment strategies of acute myocardial infarction: updates on revascularization, pharmacological therapy, and beyond. J Cardiol. (2023) 81(2):168–78. doi: 10.1016/j.jjcc.2022.07.003

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Aa N, Lu Y, Yu M, Tang H, Lu Z, Sun R, et al. Plasma metabolites alert patients with chest pain to occurrence of myocardial infarction. Front Cardiovasc Med. (2021) 8:652746. doi: 10.3389/fcvm.2021.652746

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Casiglia E, Tikhonoff V, Virdis A, Grassi G, Angeli F, Barbagallo CM, et al. Serum uric acid / serum creatinine ratio as a predictor of cardiovascular events. Detection of prognostic cardiovascular cut-off values. J Hypertens. (2023) 41(1):180–6. doi: 10.1097/HJH.0000000000003319

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Schytz PA, Nissen AB, Torp-Pedersen C, Gislason GH, Nelveg-Kristensen KE, Hommel K, et al. Creatinine increase following initiation of antihypertensives is associated with cardiovascular risk: a nationwide cohort study. J Hypertens. (2020) 38(12):2519–26. doi: 10.1097/HJH.0000000000002573

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Chen X, Jin H, Wang D, Liu J, Qin Y, Zhang Y, et al. Serum creatinine levels, traditional cardiovascular risk factors and 10-year cardiovascular risk in Chinese patients with hypertension. Front Endocrinol (Lausanne). (2023) 14:1140093. doi: 10.3389/fendo.2023.1140093

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: periodontitis, coronary heart disease, machine learning, prediction model, NHANES, national health and nutrition examination survey

Citation: Wang Y, Ni B, Xiao Y, Lin Y, Jiang Y and Zhang Y (2023) Application of machine learning algorithms to construct and validate a prediction model for coronary heart disease risk in patients with periodontitis: a population-based study. Front. Cardiovasc. Med. 10:1296405. doi: 10.3389/fcvm.2023.1296405

Received: 18 September 2023; Accepted: 17 November 2023;
Published: 29 November 2023.

Edited by:

Alessandro Polizzi, University of Catania, Italy

Reviewed by:

Li-Da Wu, Nanjing Medical University, China
Simona Santonocito, Università degli Studi di Catania, Italy

© 2023 Wang, Ni, Xiao, Lin, Jiang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yan Zhang MTg1NTg3NTA2MDBAMTYzLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.