- 1Center for Biomedical Engineering, School of Information Science and Technology, Fudan University, Shanghai, China
- 2Key Laboratory of Medical Imaging Computing and Computer Assisted Intervention of Shanghai, Shanghai, China
- 3Department of Cardiology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Introduction: Atrial fibrillation (AF) is prone to heart failure and stroke. Early management can effectively reduce the stroke rate and mortality. Current clinical guidelines screen high-risk individuals based solely on age, while this study aims to explore the possibility of other AF risk predictors.
Methods: A total of 18,738 elderly people (aged over 60 years old) in Chinese communities were enrolled in this study. The baseline characteristics were mainly based on the diagnosis results of electrocardiogram (ECG) machine during follow up, accompanied by some auxiliary physical examination basic data. After the analysis of both independent and combined baseline characteristics, AF risk predictors were obtained and prioritized according to the results. Independent characteristics were studied from three aspects: Chi-square test, Mann–Whitney U test and Cox univariate regression analysis. Combined characteristics were studied from two aspects: machine learning models and Cox multivariate regression analysis, and the former was combined with recursive feature elimination method and voting decision.
Results: The resulted optimal combination of risk predictors included age, atrial premature beats, atrial flutter, left ventricular hypertrophy, hypertension and heart disease.
Conclusion: Patients diagnosed by short-time ECG machines with the occurrence of the above events had a higher probability of AF episodes, who are suggested to be included in the focus of long-term ECG monitoring or increased screening density. The incidence of risk predictors in different age ranges of AF patients suggests differences in age-specific patient management. This can help improve the detection rate of AF, standardize the management of patients, and slow down the progression of AF.
1. Introduction
Atrial fibrillation (AF) is the most common type of supraventricular arrhythmia in clinical practice. Its hazards include: stroke and thromboembolism (1), heart failure (2), myocardial infarction (3), cognitive decline (4), renal function injury (5), and decreased quality of life (6). The incidence of AF in the general population is about 0.4–1% (7), and increases gradually with age for individuals. This is consistent with the content in the 2020 ESC guidelines (8): “Common AF screening strategies include opportunistic or systematic screening of individuals over a certain age (usually > 65 years) or with other characteristics suggestive of increased stroke risk”. Besides, some studies (9–12) have shown that the prevalence and incidence of AF are relevant to other factors such as gender and regions, which may contribute to the assessment of AF risk.
The risk prediction of AF is conducive to early detection, diagnosis, intervention, and standardized treatment of AF, which can avoid complications and further deterioration of the condition; otherwise, paroxysmal AF at the initial stage will progress to permanent AF (13). To reduce AF related mortality through risk prediction, AF risk factors, not just age, must be identified in order to develop effective and targeted interventions. Since ECG machines with analytical and diagnostic capability are an indispensable monitoring means in the medical field (14), multi-year follow-up studies that combine their diagnosis with some physical examination baseline characteristics will help identify AF predictors in addition to age.
The remainder of the paper is organized as follows. Section 2 describes study organization, inclusion criteria of participants, data collection and AF risk prediction methods. Specifically, a total of 18,738 elderly people (aged over 60 years) were enrolled and followed up for 1–4 years. The assessment of baseline characteristics related to AF risk prediction corresponding to each follow-up was recorded as a single task that included independent and combined risk predictors. The obtained results are summarized in Section 3 and next discussed in Section 4. Finally, Section 5 concludes the paper.
2. Materials and methods
2.1. Organization and participants
Since July 2015, we have provided annual physical examinations for residents aged 60 years and older at four community health centers (Shihudang, Maogang, Xinbang and Dongjing) in Songjiang District, Shanghai, China. The data inclusion criteria of this study were as follows: (1) aged over 60 years old, (2) registered residents in the four above-mentioned towns, and (3) diagnosed without AF by the resting12-lead ECG obtained during the physical examination in 2015. Those who did not agree to undergo the medical examination funded by the local government were excluded. A total of 18,738 participants were examined from July 2015 to December 2020. This is a prospective cohort study approved by the Ethical Review Board of Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China (No. 201508).
2.2. Data collection
The data used in this study includes physical examination basic data (age, sex, blood pressure, body mass index (BMI), hypertension, diabetes mellitus and cardiovascular disease history) and diagnosis data of ECG machine (sinus arrhythmia, atrial premature beats (APBs), atrial flutter, ventricular premature beat, left ventricular hypertrophy (LVH), ST segment change, etc.). Specific data list is shown in Supplementary material 1. Hypertension was defined as systolic blood pressure (SBP) ≥ 140 mmHg, diastolic blood pressure (DBP) ≥ 90 mmHg, or current antihypertensive therapy. Diabetes mellitus was defined as a previous diagnosis of diabetes mellitus, treatment with oral hypoglycemic agents or insulin, or having a fasting plasma glucose ≥ 126 mg/dL (7.0 mmol/L) or hemoglobin A1c level ≥ 6.5% (15). Heart disease is defined as a previous diagnosis of heart failure or coronary heart disease.
Before the physical examination and ECG monitoring, the subjects should avoid strenuous exercise, radiation examination and other matters. They are expected to keep the whole body relaxed and keep the supine state during the data measurement. The 12-lead ECG recordings were obtained using a MAC 2000 resting ECG machine (GE Healthcare, Milwaukee, Wisconsin, USA) and analyzed by the Marquette 12SL ECG analysis program. The program can label arrhythmia, measure standard intervals, and perform waveform analysis. The diagnosis of the ECG machine was checked by an experienced clinical physiologist. All ECG abnormalities were confirmed and coded according to the Minnesota ECG Coding Classification System by two cardiologists who were unaware of the clinical data. In addition, the three factors including hypertension, diabetes mellitus and cardiovascular disease history were coded 0-1, with 1 indicating a related history. All subjects were expected to undergo an annual follow-up after enrollment, but some participants failed to continue their appointments.
There were more subjects without AF within a fixed time frame, which was denoted as pre_NAF group. In contrast, the subjects with AF episodes detected were denoted as the pre_AF group, and the data distribution is shown in Table 1. The pre_NAF group made up a large proportion, which was also consistent with the fact that non-AF population accounted for a higher proportion in the overall distribution. The 1st year in Table 1 meant that the subjects were followed up for just one year, and similarly, the 4th year indicated that the subjects were followed up for four years. Subjects in pre_NAF group were different from healthy population because they may have other types of heart disease besides AF.
2.3. Preprocessing
NAN values (966 in total) and outliers need to be removed from the data due to biases introduced during manual data registration and measurement. The interquartile ranges of the boxplot were used to detect outliers in the remaining 17772 cases, and the formulae are as follows:
where Q1 and Q3 represent the lower and upper quartiles, respectively, and LW and UW represent the lower and upper edges of the boxplot, respectively.
Sample points outside this range were judged as outliers. The height of the box reflects the degree of data fluctuation to some extent, and the upper and lower edges represent the maximum and minimum values of the data group. Removal of outliers was operated on only three factors (SBP, DBP and BMI) because all indicators except Age, SBP, DBP and BMI were discrete variables coded 0-1 and it is believed that there was no measurement error in age. Then the total data is standardized with max-min normalization method to eliminate the influence of dimension. The formula is as follows:
where xi,j and are data in row i and column j before and after normalization, respectively. max(xj) and min(xj) are the maximum and minimum data in column j, respectively.
2.4. Assessment tasks of AF risk predictors
Four assessment tasks corresponding to the follow-up were constructed for the AF risk predictors from the data of the pre_NAF group and the pre_AF group. Repeated random sampling (20 times) was adopted to minimize the bias caused by unbalanced samples.
2.4.1. Machine learning models
Logistic regression (LR) assigns estimation coefficients to the linear model such that the sum of squared residuals between the observed target and the predicted target of the linear approximation in the dataset is minimized.
Support vector machine (SVM) is often used for bivariate classification of data, and its decision boundary is the maximum margin hyperplane solved for the learning samples, which can be transformed into a convex quadratic programming problem. In this study, radial basis functions (RBF) are used as kernel functions.
Random forest (RF) is composed of a set of tree classifiers {h(x,Θk),k = 1,…}, where {Θk} is an independent identically distributed random vector generated in conformity with the kth tree. Each tree votes on the most popular class corresponding to the input vector x, which is essentially an integration of multiple decision trees. In this paper, Gini impurity is chosen as the selection criterion of decision trees, which indicates the probability of a randomly selected sample being misclassified in a subset. Supposing the probability that a node is estimated as a different class at position t is p(k|t),k = 1,2,…,Q, and Q is the number of sample types, then the Gini index G(t) is defined as:
After preprocessing, the data sampled from the pre_NAF group and the fixed pre_AF group were subjected to 10-fold cross-validation each time, and there was no data overlap between the training set and the test set in the same run. Finally, the results of multiple machine learning models were averaged after feature selection optimization.
2.4.2. Feature selection (FS)
Recursive feature elimination (RFE) uses the backward selection method to compute the feature subset recursively. The steps of the algorithm are as follows:(1) the initial set is trained and the importance of each feature is obtained by an external estimator; (2) the least important features are removed and the remainder are put into machine learning models again for filtering; (3) the above elimination steps are repeated recursively to receive the optimal feature combination. The importance of features is calculated by different supervised learning estimators. In LR and SVM, it is measured by the absolute value of the feature coefficients, that is, the weight w corresponding to the independent variable x, while RF uses Gini index to estimate the feature importance.
2.4.3. Statistical test
For numerical variables, the normality test was conducted. If each group met the normality, the t-test was performed for inter-group comparison. Otherwise, the median, minimum and maximum were used for statistical description, and the non-parametric test was used for inter-group comparison. Categorical data were compared between two groups with Chi-Square test. And Mann–Whitney U test was used to determine whether there are statistically significant differences between medians of independent sample groups (pattern of AF and NAF).
Cox proportional-hazards model is a semi-parametric regression model, which can simultaneously study the relationship between multiple risk factors and the occurrence time of events. Cox regression has low requirement to data distribution, while multiple linear regression and logistic regression require data distribution to be approximately normal and binomial, respectively. The arguments of Cox regression model can be continuous numerical variables or discrete categorical variables, and stepwise regression method is used to screen effective characteristics from multiple influencing factors. We calculated the hazard ratio (HR) values, confidence intervals (CI) and p-values using both Cox univariate and multivariate regression models to investigate the effect of baseline characteristics on the prediction of AF and to check disruptive factors. p < 0.05 was considered of statistical significance.
2.5. Evaluation
The evaluation criterions of the classification results are accuracy (ACC), sensitivity (SEN), specificity (SPEC), F1 score and positive predictive value (PPV). The F1 score is defined as the harmonic mean of precision and sensitivity. Their formulas are as follows:
where TP, TN, FP, FN and N stand for true positive, true negative, false positive, false negative and the total number of samples respectively.
3. Results
3.1. Removal of outliers
The outliers of SBP, DBP and BMI were eliminated respectively. The boxplot in Figure 1 reflects the central location and distribution range of three groups of discrete quantitative data. The number of outliers and the number of remaining samples in the 4 tasks are shown in Table 2.
3.2. Independent AF risk factors
For numerical variables (age, SBP, DBP, and BMI), the normality tests were conducted by normal probability diagrams and Figure 2 shows the examples of the pre_NAF group in task1. The non-parametric Mann–Whitney U tests were performed for inter-group comparison on all samples, since two groups of numerical variables in the four tasks cannot meet the normal distribution at the same time. Mean, error bars and p-values of 4 numerical variables shown in Figure 3 indicated that only age was of significant difference in 4 separate tasks between patients who were detected positive for AF and subjects without AF symptoms during the follow-up.
Figure 2. Normal probability diagrams of 4 factors in the pre_NAF group in task1. The probabilities for different values of the factors are shown in blue. If all the sample points are close to the red line, it is reasonable to assume that the samples follow a normal distribution. (A) Age; (B) SBP; (C) DBP; (D) BMI.
Figure 3. Mean, error bars and p-values of 4 numerical variables in the pre_NAF group and pre_AF group in 4 tasks. The height of the columns represents the mean, and the error bar reflects the standard deviation. p values are obtained by the Mann–Whitney U test and indicate the existence of significant differences. (A) Age; (B) SBP; (C) DBP; (D) BMI.
The baseline characteristics of specific subjects for tasks 1 to 4 were shown in Tables 3, 4. Among various categorical variables, only APBs and heart disease were significant under the Chi-square test during the follow-up in at least 3 separate tasks. Due to the small number of subjects recruited in the pre_AF group, the statistical number of indicators coded as 1 tended to be smaller than that in the general elderly population (more than 60 years old).
To investigate whether these baseline variables were appropriate independent AF risk predictors with comprehensive significance over a four-year period, we performed Chi-square test and Mann-Whitney U test for categorical and numerical variables respectively, and Cox regression models were applicable for both variables. Cox regression models required random sampling to avoid unbalanced data distribution. It should be noted that for the inclusion of negative samples, only subjects who successfully completed 4 years of follow-up (12997 negative samples) were considered due to the unstable contact status of the pre_NAF group in the previous 3 years, while all 313 positive samples were included. Considering that some variables appeared more frequently in pre_NAF patients, we excluded them to keep consistent with the objective of this study to find positive AF risk predictors. Table 5 shows baseline characteristics with p-values less than 0.05 under the Chi-square test and Mann-Whitney U test in this case. APBs, LVH and hypertension had an incidence of more than 20% in AF patients. Based on results in Table 5, there were 8 baseline characteristics with HR > 1 and p-value < 0.05 obtained by univariate Cox regression analysis (Table 6), including age, APBs, atrial flutter, junctional premature beat, LVH, ST and T change, hypertension and heart disease. They were identified as independent AF risk predictors finally and supposed to have higher priority in search for independent AF risk predictors.
Table 5. Baseline characteristics of 12997 negative samples and 313 positive samples with p-values less than 0.05 under the Chi-square test and Mann-Whitney U test.
Table 6. Independent predictors for subjects with and without AF (cox proportional hazard model, univariate analysis).
3.3. Combined AF risk factors
The combined AF risk related baseline characteristics were investigated from two aspects: machine learning models and multivariate Cox regression analysis.
3.3.1. Results of machine learning models
Three typical machine learning models are used to measure predictive ability of all baseline characteristics in tasks 1 to 4 (Table 7). LR models had comparable high predictive performance after parameter optimization in the overall data distribution. Their results were positively correlated with time evolution and the gap between SEN and SPEC narrowed in the last two tasks.
Table 7. The results of three machine learning models used for all baseline characteristics in 4 tasks.
By reducing feature dimension and redundancy, RFE method combined with LR models contributed to the establishment of AF risk prediction models (Table 8). The final variable subset was decided by more than half of the votes, named as unified feature selection (unified FS). A slightly lower predictive performance of unified optimal feature collection is acceptable, provided that it had some generalization ability for various optimal sets obtained under different data distributions.
Table 9 shows important variables selected from the combination of RFE method and unified FS in 4 tasks based on LR models. Variables that appeared more frequently in pre_NAF patients were excluded, even though they were valid for machine learning models. In this case, factors that played a role in all tasks included age, APBs, hypertension and heart disease. Variables valid in three tasks included SBP, DBP, BMI, sinus bradycardia, atrial flutter, junctional premature beat, ventricular premature beat, LVH and ST and T change. There were no variables valid only for tasks 1 or 2. Variables valid only for tasks 3 or 4 included ST segment change. Unstable factors other than the above were ignored due to their seemingly weak association with AF risk prediction.
3.3.2. Results of multivariate Cox regression analysis
We conducted multivariate Cox regression analysis (Table 10) to measure whether the important independent predictors in Table 6 (univariate Cox regression analysis) were simultaneously positively associated with AF risk prediction and had future practicability. All HR values were more than 1, indicating that they were risk factors and can promote positive outcomes. P-values of variables including age, APBs, atrial flutter, LVH, hypertension and heart disease were less than 0.05, indicating that these independent variables were significant for the interpretation of the whole model. Compared with the four risk predictors obtained by the machine learning models for all tasks, Cox multivariate analysis suggested that LVH and atrial flutter were also important variables. These baseline characteristics are supposed to have higher priority in this cohort study. Besides, we calculated the p-values for the 4 tasks (0.0310, 0.0191, 0.0016, and 0.0015, respectively) by comparing single age factor and combined characteristics using the Mann–Whitney U test. The results indicated a gap between AF risk predictors combination and the single age element suggested by traditional guidelines (8, 16).
Table 10. Combined predictors for subjects with and without AF (cox proportional hazard model, multivariate analysis).
3.4. Baseline characteristics of subjects at different age ranges
Subjects in the following 7 age ranges were studied separately (Table 11) and it is clear that the incidence of APBs increased substantially with age. The incidence of AF events and heart disease increased in the first six and five intervals, respectively. The probability of LVH increased in the first three ranges, then stabilized at about 21% in people aged 75–89 and only 10.7% in people aged 90 and older. The trend in hypertension was relatively erratic and peaked among participants aged 85–89 years. Similarly, we calculated the incidence of baseline characteristics across age ranges in AF patients alone (Table 12).
Table 11. The number and proportion of baseline characteristics for all subjects of different age ranges.
Table 12. The number and proportion of baseline characteristics for subjects with AF of different age ranges.
4. Discussion
AF is the most common supraventricular arrhythmia in clinic. Previous works have reported that approximately 15% to 31% of paroxysmal AF patients at the early-stage progress to persistent or permanent AF during a time period between 4 and 8 years (17). Although AF itself poses little threat to life, ischemic stroke caused by AF is one of the main causes of death (17%) in Chinese community patients with AF (15). To optimize the early management of AF patients and slow down the progression of AF, Chinese guideline (16) provides suggestions for AF risk prediction by age and this has been demonstrated by many studies that there is a positive association between the incidence of AF and advancing age (18–22).
However, AF risk prediction only based on age can fail to screen positive cases and conduct clinical evaluation in a timely and effective manner due to the high proportion of asymptomatic AF patients (23). This study aims to help the progress of AF risk prediction through the diagnosis results of ECG machine that can reflect the changes of cardiac electrophysiology. The principal contributions of this study were as follows: (1) Diagnosis data of ECG machine and some physical examination basic data were explored to create more possibilities for AF risk prediction; (2) Both independent and combined risk predictors of positive correlation with AF were obtained and analyzed in detail, and combined risk predictors are more valuable in clinical applications; (3) This study functions as a preliminary step to reduce the target population for long-term ECG monitoring, which is beneficial to optimize the management of high AF risk population and improve the detection rate of AF.
4.1. Advantage of ECG machine for AF risk prediction
As a widely used automated algorithm for computer-based interpretation, GE Healthcare 12SL ECG Analysis Programs used in this study refines itself through regular clinical input and clinically relevant gold standard databases. Although there are small systematic differences between the measurements obtained with automated electrocardiographs from different manufacturers (24, 25), their diagnostic results are similar in differentiating between specific individuals and populations. Nowadays, most of the efficacy tests are skewed to pathologic rhythms with much emphasis on AF (26), which accords with its increasing prevalence and the topic of this study. In the absence of absolute medical definition of waveform fiducial points, the stability of the “gold standard” of human judgment is subject to uncertainty (25). This makes the absolute acceptance of any “gold standard” controversial, even if it is quantifiable. Therefore, the integrated stable analysis algorithms of ECG machines are suitable for clinical applications due to its simplicity, reliability and repeatability. As an extension of analysis algorithms of ECG machines, this study aims to explore effective AF risk predictors in the existing clinical experiment circumstances, and in turn serve the clinical diagnosis. Since the ECG monitoring systems used in this community-based cohort study aims to provide service applied to medical fields instead of non-medical applications (such as sports and elderly activities), performance improvements such as cost, energy efficiency, and battery life are not considered.
It should be pointed out that our focus is on the value of the diagnosis results of ECG machines in the AF risk prediction and there were some limitations in the experimental environment of data acquisition. Therefore, not all AF related factors such as hyperthyroidism were taken into account, but they can be included in the future work.
4.2. Evaluation of results
Tables 7, 8 show the average predictive performance of the machine learning models in four tasks. These evaluation indexes basically did not reach 70% but mostly exceed 60%, and there are two possible reasons. On the one hand, this study performed dichotomy tasks between AF and non-AF rather than AF and healthy subjects, and many subjects in the non-AF group actually had other cardiovascular diseases and related complications. The similarity of symptoms may bias the results. On the other hand, the data provided by the ECG machines were the simplified 0-1 code instead of specific values, so some valuable information can be lost in the process. Besides, ACC, SEN and SPEC in task 3 and task 4 were higher than those in task 1 and task 2, indicating that individuals can more possibly have AF attack caused by risk accumulation. Furthermore, some variables may show temporal change, with different distributions in various tasks (e.g., age increases with the task).
4.3. AF risk predictors
The independent AF predictors were analyzed by Chi-square test, Mann–Whitney U test and Cox proportional-hazards models. The chi-square test dealt with categorical variables and Mann-Whitney U test dealt with numerical variables, while Cox proportional-hazards models accepted both types of variables. There were 8 baseline characteristics in the intersection of their significance variables sets, including age, APBs, atrial flutter, junctional premature beat, LVH, ST and T change, hypertension and heart disease.
The combined AF risk predictors were determined by Cox regression analysis and LR models with RFE method. The Cox regression models simultaneously assessed the effect of several variables on events, allowing us to examine how specific factors affected the incidence of AF occurring at a given time point. The resulting covariates with HR values greater than 1 and p values less than 0.05 were considered to be significantly positively associated with increased AF risk. The combined baseline characteristics included age, APBs, atrial flutter, LVH, hypertension and heart disease. The p-values (less than 0.05) obtained with the Mann-Whitney U test indicated a gap between combined AF risk predictors and the single age element. Although age is associated with higher AF sensitivity, the consequent sacrifice of specificity may cause anxiety and overdiagnosis. In clinical practice, independent variables cannot fully reflect the outcome variables (positive or negative), and are susceptible to the interference of other variables when they are not completely independent. In particular, AF risk prediction using dozens of variables or a single variable is difficult to implement. Therefore, we focus on the combined AF risk predictors that evolved from univariate analysis.
APBs and AF are arrhythmias of atrial origin. If the large number of APBs indicates atrial fibrosis or electrical activity disorder, it can easily develop into AF in the future. In the community-based Chinese cohort study, we found that the presence of ECG machine-diagnosed APBs was a strong independent and combined predictor of AF risk in the elderly population (≥ 60 years). During the follow-up of 1 to 4 years, the APBs detection rate in this cohort was 22% in patients with AF versus 6.9% in patients without AF. Many studies (27–35) used 24-hour Holters to analyze the relationship between APBs and AF, which can count baseline APBs and use different thresholds to define its frequency of occurrence in the 24-hour recordings. However, the relationship between the count of APBs and the probability of developing AF highly depends on baseline information (such as relevant medical history, medication history, etc.) of the AF-risk population, indicating the limitations of thresholds setting. While in this study, only the occurrence events of APBs reflected by ECG machine with short-term data collection were required to perform AF risk prediction, which could be an effective pre-step before long-term ECG monitoring for high-risk groups. In addition, many strokes occurred without a temporal association with the AF episodes (36, 37), suggesting that frequent APBs may be a stroke risk marker independent of the causal mechanism of AF.
Atrial flutter and AF are proposed to be related entities and may transform into one another (38). Since they usually co-exist before and after medication or ablation, most studies (39–42) have explored the incidence of new AF in patients undergoing ablation. In this study, 16552 patients without AF and 313 patients with AF who had no previous history of successful ablation of atrial flutter were tested by ECG machines. However, the incidence of atrial flutter was only 1.6% in positive samples and only 0.05% in negative samples during the follow-up. This may be affected by the short monitoring period and insufficient follow-up time, and more data is expected to be included.
Left ventricular hypertrophy is usually a compensatory hypertrophy of the heart caused by hypertension. Studies (43–48) investigating the AF predictive role of ECG-based LVH were mainly based the population including both hypertension and normotension. The follow-up period ranged from 3.2 to 11.9 years and the mean age of subjects was 55.4 ± 11 years (49). On this basis, LVH was observed in 258 of 3,235 AF events (8%), compared to 72 of 313 AF events (23%) in the elderly population during the follow-up of 1-4 years in this study. Differences in LVH criteria may limit the confidence of the results and more large randomized controlled studies are necessary. But the differences of results may indicate a higher incidence of LVH in the elderly over 60 years. In addition, there was an apparent correlation between hypertension and LVH in this study, especially for subjects suffering from AF. Among 313 patients with AF, 43 out of 187 (23%) hypertensive patients had LVH and 43 of the 72 patients (60%) with LVH had hypertension. Although LVH was determined to be an independent AF risk predictor, its high occurrence rate in people without AF (2844 cases out of 16552, 17%) may result in inevitable high misdiagnosis rate, as does hypertension (7306 cases out of 16552, 44%). We suggest that LVH and hypertension should be diagnosed in conjunction with other AF risk predictors, rather than independently, in the elderly population.
Heart disease in this study refers to a previous diagnosis of heart failure or coronary heart disease. Recording of histories of heart disease is essential before AF risk prediction in the elderly population, but it is supposed to be combined with other risk predictors for diagnosis to avoid low specificity.
In this study, the positive association between the baseline characteristics above and AF was confirmed in residents aged 60 years and older at four community health centers in China. Their combination is beneficial to reduce the misdiagnosis rate caused by single factor diagnosis for AF risk prediction. Besides, the prioritization of risk predictors can help physicians to specify relevant strategies and help the hierarchical management of AF in specific applications. For patients with a large number of abnormal primary risk factors, the density of AF screening should be strengthened, and long-term monitoring should be performed if necessary. It reflects the nature of this study’s use of short-time ECG recordings as a pre-step to reduce the target population for long-term ECG monitoring. For patients with abnormal range of only secondary risk factors, follow-up, regular physical examination and health education should be carried out to guide patients’ self-health management. For AF patients with relatively stable diagnosis and treatment, routine treatment, rehabilitation and long-term follow-up are expected.
4.4. Association of AF risk predictors with age ranges
Given the small sample size of participants over 90, we focused on the first six age ranges. Traditional guidelines consider individuals over 65 years of age to be at high AF risk, and this was confirmed according to the increasing incidence of AF in Table 11. The incidence of APBs was positively correlated with the age range, both for all samples and for AF participants. The incidence of LVH, hypertension and heart disease in all samples increased in the first few age ranges and then stabilized or fluctuated slightly. But in patients with AF, there was a small decline in their incidence in subjects older than 74. Compared with the probability of occurrence in the total sample, the incidence of APBs increased in AF patients (85–89 years old) by up to 34.9% and by an average of 16.5%. The average risk of hypertension in AF patients increased by 9.8%, followed by heart disease (5.9%) and LVH (2.35%). Considering that these baseline characteristics were independent and combined risk predictors, we ranked their importance in terms of their increased probability of AF episodes: APBs, hypertension, heart disease, and LVH. This can provide suggestions for the increase of weights when constructing prediction models with other characteristics that were not covered in this study, or managing AF-related age-specific populations. In addition, APBs was only 1 and 1.4% less common in AF patients and total samples aged 60–64 than in those aged 65–69, respectively (Tables 11, 12). This suggests that the age range for screening high-risk groups can be appropriately extended to over 60 years of age.
5. Conclusion
In this community-based cohort study, independent and combined AF risk predictors based on the diagnosis results of ECG machine and some basic physical examination data were explored and analyzed. On the basis of univariate analysis, the recommended combined characteristics included age, APBs, atrial flutter, LVH, hypertension and heart disease, and they were verified superior to the single age factor. The combined AF risk predictors are beneficial to reduce the misdiagnosis rate caused by independent factor diagnosis for AF risk prediction. As a pre-step to reduce the target population for long-term ECG monitoring, the positive association between the baseline characteristics above and AF can provide suggestions for people to be included in the focus of AF screening. In this case, the enhancement of screening density and the arrangement of long-term monitoring for these high-risk population can improve the detection rate of AF, standardize the management of patients, and slow down the progression of AF. Besides, AF risk predictors had different incidence rates in different age ranges, which can provide suggestions for the setting of model weights for the management of AF in specific age groups. Additional data especially in the AF group will be collected in the future, and the study will be expanded to include people aged 30 to 60 years. Furthermore, other influencing factors like Hyperthyroidism will be taken into account.
Data availability statement
The original contributions presented in this study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving human participants were reviewed and approved by Ethical Review Board of Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China and Songjiang Central Hospital, Shanghai, China. The patients/participants provided their written informed consent to participate in this study.
Author contributions
JH designed research, performed research, analyzed data, and wrote and edited the manuscript. SL and CY designed research and edited the manuscript. YW designed research, edited the manuscript, and had full access to all of the data in the study. All authors contributed to the article and approved the submitted version.
Funding
This study was supported by Shanghai Science and Technology Support Project (18441900900) and Shanghai Municipal Special Project of Industry Transformation and Upgrading (GYQJ-2020-1-31).
Acknowledgments
Thanks to Dr. Gaoyan Zhong of Biomedical Engineering Center of Fudan University and the subjects who cooperated with the relevant experiments.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fcvm.2023.1068562/full#supplementary-material
References
1. Paciaroni M, Agnelli G, Micheli S, Caso V. Efficacy and safety of anticoagulant treatment in acute cardioembolic stroke: a meta-analysis of randomized controlled trials. Stroke. (2007) 38:423–30. doi: 10.1161/01.STR.0000254600.92975.1f
2. Chiang CE, Naditch-Brûl L, Murin J, Goethals M, Inoue H, Neill J, et al. Distribution and risk profile of paroxysmal, persistent, and permanent atrial fibrillation in routine clinical practice: insight from the real-life global survey evaluating patients with atrial fibrillation international registry. Circ Arrhythm Electrophysiol. (2012) 5:632–9. doi: 10.1161/CIRCEP.112.970749
3. Soliman EZ, Safford MM, Muntner P, Khodneva Y, Dawood FZ, Zakai NA, et al. Atrial fibrillation and the risk of myocardial infarction. JAMA Intern Med. (2014) 174:107–14. doi: 10.1001/jamainternmed.2013.11912
4. Staerk L, Sherer JA, Ko D, Benjamin EJ, Helm RH. Atrial fibrillation: epidemiology, pathophysiology, and clinical outcomes. Circ Res. (2017) 120:1501–17. doi: 10.1161/CIRCRESAHA.117.309732
5. Böhm M, Ezekowitz MD, Connolly SJ, Eikelboom JW, Hohnloser SH, Reilly PA, et al. Changes in renal function in patients with atrial fibrillation: an analysis from the RE-LY trial. J Am Coll Cardiol. (2015) 65:2481–93. doi: 10.1016/j.jacc.2015.03.577
6. Walters TE, Wick K, Tan G, Mearns M, Joseph SA, Morton JB, et al. Symptom severity and quality of life in patients with atrial fibrillation: psychological function outweighs clinical predictors. Int J Cardiol. (2019) 279:84–9. doi: 10.1016/j.ijcard.2018.10.101
7. Go AS, Hylek EM, Phillips KA, Chang Y, Henault LE, Selby JV, et al. Prevalence of diagnosed atrial fibrillation in adults. JAMA. (2001) 285:2370–5. doi: 10.1001/jama.285.18.2370
8. Hindricks G, Potpara T, Dagres N, Arbelo E, Bax JJ, Blomström-Lundqvist C, et al. 2020 ESC guidelines for the diagnosis and management of atrial fibrillation developed in collaboration with the European association for cardio-thoracic surgery (EACTS): the task force for the diagnosis and management of atrial fibrillation of the European society of cardiology (ESC) developed with the special contribution of the European heart rhythm association (EHRA) of the ESC. Eur Heart J. (2021) 42:373–498. doi: 10.1093/eurheartj/ehaa612
9. Chugh SS, Havmoeller R, Narayanan K, Singh D, Rienstra M, Benjamin EJ, et al. Worldwide epidemiology of atrial fibrillation: a global burden of disease 2010 Study. Circulation. (2014) 129:837–47. doi: 10.1161/CIRCULATIONAHA.113.005119
10. Schnabel RB, Yin X, Gona P, Larson MG, Beiser AS, McManus DD, et al. 50 year trends in atrial fibrillation prevalence, incidence, risk factors, and mortality in the Framingham heart study: a cohort study. Lancet. (2015) 386:154–62. doi: 10.1016/S0140-6736(14)61774-8
11. Zhang S. Atrial fibrillation in mainland China: epidemiology and current management. Heart. (2009) 95:1052–5. doi: 10.1136/hrt.2008.146589
12. Kirchhof P, Benussi S, Kotecha D, Ahlsson A, Atar D, Casadei B, et al. 2016 ESC guidelines for the management of atrial fibrillation developed in collaboration with EACTS. Eur Heart J. (2016) 37:2893–962. doi: 10.1093/eurheartj/ehw210
13. European Heart Rhythm Association, European Association for Cardio-Thoracic Surgery, Camm AJ, Kirchhof P, Lip GY, Schotten U, et al. Guidelines for the management of atrial fibrillation: the task force for the management of atrial fibrillation of the European society of cardiology. Eur Heart J. (2010) 31:2369–429. doi: 10.1093/eurheartj/ehq278
14. Cardiovascular Non-Invasive Testing Group, Cardiovascular Professional Committee, Expert Group For The Compilation Of-Lead ECG Automatic Diagnosis Function Evaluation Specification. Clinical evaluation criteria for automatic diagnosis of 12-lead electrocardiograph. Chin J Med Instr. (2021) 45:563–7. doi: 10.3969/j.issn.1671-7104.2021.05.020
15. Wei Y, Zeng Q, Cai L, Wang X, Wang B, Shen C, et al. Contemporary survival and anticoagulation of patients with atrial fibrillation: a community based cohort study in China. Front Cardiovasc. (2022) 9:911393. doi: 10.3389/fcvm.2022.911393
16. Chinese Society of Pacing and Electrophysiology, Chinese Society of Arrhythmias, Atrial Fibrillation Center Union of China. Current knowledge and management of atrial fibrillation: consensus of Chinese experts 2021. Chin J Cardiac Arrhyth. (2022) 26:15–88.
17. de Vos CB, Pisters R, Nieuwlaat R, Prins MH, Tieleman RG, Coelen RJ, et al. Progression from paroxysmal to persistent atrial fibrillation clinical correlates and prognosis. J Am Coll Cardiol. (2010) 55:725–31. doi: 10.1016/j.jacc.2009.11.040
18. Kannel WB, Wolf PA, Benjamin EJ, Levy D. Prevalence, incidence, prognosis, and predisposing conditions for atrial fibrillation: population-based estimates. Am J Cardiol. (1998) 82:2N–9. doi: 10.1016/S0002-9149(98)00583-9
19. Feinberg WM, Blackshear JL, Laupacis A, Kronmal R, Hart RG. Prevalence, age distribution, and gender of patients with atrial fibrillation: analysis and implications. Arch Intern Med. (1995) 155:469–73. doi: 10.1001/archinte.1995.00430050045005
20. Krahn AD, Manfreda J, Tate RB, Mathewson FA, Cuddy TE. The natural history of atrial fibrillation: incidence, risk factors, and prognosis in Manitoba follow-up study. Am J Med. (1995) 98:476–84. doi: 10.1016/S0002-9343(99)80348-9
21. Wolf P, Abbott R, Kannel W. Atrial fibrillation as an independent risk factor for stroke: the Framingham study. Stroke. (1991) 22:983–8. doi: 10.1161/01.STR.22.8.983
22. Furberg CD, Psaty BM, Manolio TA, Gardin JM, Smith VE, Rautaharju PM. Prevalence of atrial fibrillation in elderly subjects: the cardiovascular health study. Am J Cardiol. (1994) 74:236–41. doi: 10.1016/0002-9149(94)90363-8
23. Boriani G, Laroche C, Diemberger I, Fantecchi E, Popescu MI, Rasmussen LH, et al. Asymptomatic atrial fibrillation: clinical correlates, management, and outcomes in the EORP-AF pilot general registry. Am J Med. (2015) 128:509–18.e2. doi: 10.1016/j.amjmed.2014.11.026
24. Kligfield P, Badilini F, Rowlandson I, Xue J, Clark E, Devine B, et al. Comparison of automated measurements of electrocardiographic intervals and durations by computer-based algorithms of digital electrocardiographs. Am Heart J. (2014) 167:150–9. doi: 10.1016/j.ahj.2013.10.004
25. Kligfield P, Badilini F, Denjoy I, Babaeizadeh S, Clark E, De Bie J, et al. Comparison of automated interval measurements by widely used algorithms in digital electrocardiographs. Am Heart J. (2018) 200:1–10. doi: 10.1016/j.ahj.2018.02.014
26. Faruk N, Abdulkarim A, Emmanuel I, Folawiyo YY, Adewole KS, Mojeed HA, et al. A comprehensive survey on low-cost ECG acquisition systems: advances on design specifications, challenges and future direction. Biocybern Biomed Eng. (2021) 41:474–502. doi: 10.1016/j.bbe.2021.02.007
27. Wallmann D, Tüller D, Kucher N, Fuhrer J, Arnold M, Delacretaz E. Frequent atrial premature contractions as a surrogate marker for paroxysmal atrial fibrillation in patients with acute ischaemic stroke. Heart. (2003) 89:1247–8. doi: 10.1136/heart.89.10.1247
28. Wallmann D, Tüller D, Wustmann K, Meier P, Isenegger J, Arnold M, et al. Frequent atrial premature beats predict paroxysmal atrial fibrillation in stroke patients: an opportunity for a new diagnostic strategy. Stroke. (2007) 38:2292–4. doi: 10.1161/STROKEAHA.107.485110
29. Engström G, Hedblad B, Juul-Möller S, Tydén P, Janzon L. Cardiac arrhythmias and stroke: increased risk in men with high frequency of atrial ectopic beats. Stroke. (2000) 31:2925–9. doi: 10.1161/01.STR.31.12.2925
30. Binici Z, Intzilakis T, Nielsen OW, Køber L, Sajadieh A. Excessive supraventricular ectopic activity and increased risk of atrial fibrillation and stroke. Circulation. (2010) 121:1904–11.
31. Chong BH, Pong V, Lam KF, Liu S, Zuo ML, Lau YF, et al. Frequent premature atrial complexes predict new occurrence of atrial fibrillation and adverse cardiovascular events. Europace. (2012) 14:942–7. doi: 10.1093/europace/eur389
32. Dewland TA, Vittinghoff E, Mandyam MC, Heckbert SR, Siscovick DS, Stein PK, et al. Atrial ectopy as a predictor of incident atrial fibrillation: a cohort study. Ann Intern Med. (2013) 159:721–8. doi: 10.7326/0003-4819-159-11-201312030-00004
33. Gladstone DJ, Dorian P, Spring M, Panzov V, Mamdani M, Healey JS, et al. Atrial premature beats predict atrial fibrillation in cryptogenic stroke. Stroke. (2015) 46:936–41. doi: 10.1161/STROKEAHA.115.008714
34. Acharya T, Tringali S, Bhullar M, Nalbandyan M, Ilineni VK, Carbajal E, et al. Frequent atrial premature complexes and their association with risk of atrial fibrillation. Am J Cardiol. (2015) 116:1852–7. doi: 10.1016/j.amjcard.2015.09.025
35. Himmelreich JCL, Lucassen WAM, Heugen M, Bossuyt PMM, Tan Hanno L, Harskamp RE, et al. Frequent premature atrial contractions are associated with atrial fibrillation, brain ischaemia, and mortality: a systematic review and meta-analysis. Europace. (2019) 21:698–707. doi: 10.1093/europace/euy276
36. Daoud EG, Glotzer TV, Wyse DG, Ezekowitz MD, Hilker C, Koehler J, et al. Temporal relationship of atrial tachyarrhythmias, cerebrovascular events, and systemic emboli based on stored device data: a subgroup analysis of TRENDS. Heart Rhythm. (2011) 8:1416–23. doi: 10.1016/j.hrthm.2011.04.022
37. Brambatti M, Connolly SJ, Gold MR, Morillo CA, Capucci A, Muto C, et al. Temporal relationship between subclinical atrial fibrillation and embolic events. Circulation. (2014) 129:2094–9.
38. Manolis A. Contemporary diagnosis and management of atrial flutter: a continuum of atrial fibrillation and vice versa? Cardiol Rev. (2017) 25:289–97. doi: 10.1097/CRD.0000000000000162
39. Maskoun W, Pino MI, Ayoub K, Llanos OL, Almomani A, Nairooz R, et al. Incidence of atrial fibrillation after atrial flutter ablation. JACC Clin Electrophysiol. (2016) 2:682–90. doi: 10.1016/j.jacep.2016.03.014
40. Henmi R, Ejima K, Shoda M, Yagishita D, Hagiwara N. Interatrial conduction time can predict new-onset atrial fibrillation after radiofrequency ablation of isolated, typical atrial flutter. J Cardiovasc Electrophysiol. (2016) 27:1293–7. doi: 10.1111/jce.13040
41. Enriquez A, Sarrias A, Villuendas R, Ali Fariha S, Conde D, Hopman WM, et al. New-onset atrial fibrillation after cavotricuspid isthmus ablation: identification of advanced interatrial block is key. Europace. (2015) 17:1289–93. doi: 10.1093/europace/euu379
42. Da Costa A, Romeyer C, Mourot S, Messier M, Cerisier A, Faure E, et al. Factors associated with early atrial fibrillation after ablation of common atrial flutter. A single centre prospective study. Eur Heart J. (2002) 23:498–506. doi: 10.1053/euhj.2001.2819
43. Hamada R, Muto S. Simple risk model and score for predicting of incident atrial fibrillation in Japanese. J Cardiol. (2019) 73:65–72. doi: 10.1016/j.jjcc.2018.06.005
44. Schnabel RB, Sullivan LM, Levy D, Pencina MJ, Massaro JM, D’Agostino RB Sr, et al. Development of a risk score for atrial fibrillation (Framingham heart study): a community-based cohort study. Lancet. (2009) 373:739–45. doi: 10.1016/S0140-6736(09)60443-8
45. Macfarlane PW, Murray H, Sattar N, Stott DJ, Ford I, Buckley B, et al. The incidence and risk factors for new onset atrial fibrillation in the PROSPER study. Europace. (2011) 13:634–9. doi: 10.1093/europace/eur016
46. Chamberlain AM, Agarwal SK, Folsom AR, Soliman EZ, Chambless LE, Crow R, et al. A clinical risk score for atrial fibrillation in a biracial prospective cohort (from the atherosclerosis risk in communities [ARIC] Study). Am J Cardiol. (2011) 107:85–91.
47. Patel N, Neal WT, Whalen SP, Soliman EZ. Electrocardiographic left ventricular hypertrophy predicts atrial fibrillation independent of left ventricular mass. Ann Noninvasive Electrocardiol. (2017) 22:1–5. doi: 10.1111/anec.12419
48. Chrispin J, Jain A, Soliman EZ, Guallar E, Alonso A, Heckbert SR, et al. Association of electrocardiographic and imaging surrogates of left ventricular hypertrophy with incident atrial fibrillation: MESA (multi-ethnic study of atherosclerosis). J Am Coll Cardiol. (2014) 63:2007–13. doi: 10.1016/j.jacc.2014.01.066
Keywords: atrial fibrillation, statistical test, baseline characteristics, risk prediction, electrocardiogram machine
Citation: He J, Liu S, Yang C and Wei Y (2023) Value of baseline characteristics in the risk prediction of atrial fibrillation. Front. Cardiovasc. Med. 10:1068562. doi: 10.3389/fcvm.2023.1068562
Received: 13 October 2022; Accepted: 16 January 2023;
Published: 01 February 2023.
Edited by:
Wilber Su, Banner - University Medical Center Phoenix, United StatesReviewed by:
Yongnan Li, Lanzhou University, ChinaChia-Tung Wu, Linkou Chang Gung Memorial Hospital, Taiwan
Copyright © 2023 He, Liu, Yang and Wei. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Cuiwei Yang, yangcw@fudan.edu.cn; Yong Wei, weiyong202@qq.com