- 1School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
- 2Center for HTA, China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China
- 3Shanghai Municipal Health Commission, Shanghai, China
- 4Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, United States
- 5Icahn School of Medicine at Mount Sinai, Institute for Healthcare Delivery Science, New York, NY, United States
- 6China Hospital Development Institute, Shanghai Jiao Tong University, Shanghai, China
Background: Cardiovascular disease (CVD) causes substantial financial burden to patients with the condition, their households, and the healthcare system in China. Health care costs for treating patients with CVD vary significantly, but little is known about the factors associated with the cost variation. This study aims to identify and rank key determinants of health care costs in patients with CVD in China and to assess their effects on health care costs.
Methods: Data were from a survey of patients with CVD from 14 large tertiary grade-A general hospitals in S City, China, between 2018 and 2020. The survey included information on demographic characteristics, health conditions and comorbidities, medical service utilization, and health care costs. We used re-centered influence function regression to examine health care cost concentration, decomposing and estimating the effects of relevant factors on the distribution of costs. We also applied quantile regression forests—a machine learning approach—to identify the key factors for predicting the 10th (low), 50th (median), and 90th (high) quantiles of health care costs associated with CVD treatment.
Results: Our sample included 28,213 patients with CVD. The 10th, 50th and 90th quantiles of health care cost for patients with CVD were 6,103 CNY, 18,105 CNY, and 98,637 CNY, respectively. Patients with high health care costs were more likely to be older, male, and have a longer length of hospital stay, more comorbidities, more complex medical procedures, and emergency admissions. Higher health care costs were also associated with specific CVD types such as cardiomyopathy, heart failure, and stroke.
Conclusion: Machine learning methods are useful tools to identify determinants of health care costs for patients with CVD in China. Findings may help improve policymaking to alleviate the financial burden of CVD, particularly among patients with high health care costs.
Introduction
Cardiovascular disease (CVD) refers to a group of disorders of the heart and blood vessels, including coronary heart disease (CHD), cerebrovascular disease, peripheral arterial disease, rheumatic heart disease, congenital heart disease, deep vein thrombosis, and pulmonary embolism. CVD is the leading cause of death globally (1). The World Health Organization (WHO) has reported that an estimated 17.9 million people died from CVD in 2019, representing 32% of all global deaths (2). Over three quarters of CVD deaths take place in developing countries (2). In China, it was estimated that about 330 million patients suffer from CVD and two out of every five deaths were due to CVD (3).
Besides the burden of morbidity and mortality, CVD results in substantial financial burden to patients and their families in China (4, 5). There is emerging evidence that CVD and other noncommunicable diseases can lead to poverty due to the high health care cost of treating the disease and high out-of-pocket expenditure among those who are uninsured or underinsured (6). Previous studies have assessed the health care cost of treating one type of CVD such as hypertension (7–10) and CHD (11–13) and mostly focused on developed countries such as the United States (6, 14–16). No studies, to the best of our knowledge, have assessed the health care costs of treating all types of CVD in China. The current study aims to fill this research gap by assessing the health care costs in patients with all types of CVD in China.
In previous health care cost analyses of CVD, mean-based models (e.g., generalized linear models with certain link functions for specific distributions) are often used to identify factors associated with the mean health care costs of CVD (17–19). These models evaluate the relationship between covariates and the mean outcome, assuming a uniform relationship across different percentiles of the cost distribution (20). However, this assumption does not always hold because the determinants of high health care costs may be different from those of low costs, and the effects of the determinants may vary across different parts of the cost distribution. The current study, instead, uses a machine learning approach to identify key determinants of health care costs in patients with CVD in China. We hypothesize that the determinants of health care cost vary from patients with high costs to those with low costs. The results would help healthcare professionals and policymakers design targeted interventions that may alleviate the financial burden of patients with CVD, their households, and the health care system.
Methods
Data sources and study population
This study used a cross-sectional cohort design. We exacted data from a survey of patients with CVD from 14 tertiary grade-A general hospitals in S City between 2018 and 2020. The list of hospitals is described in Supplementary Table S1. The survey collected patient information on their demography (e.g., age, sex), health conditions and comorbidities, medical service utilization, and health care costs.
Our sample included 28,213 adult patients ≥18 years. Excluding 591 patients with missing values on key variables, the final sample included 27,622 patients (Supplementary Figure S1). The International Classification of Diseases, 10th revision (ICD-10) codes I00-I99 and Q20-Q28 were used to identify patients with CVD (21). Specific types of CVD included congenital heart disease (ICD-10 codes Q20-Q28), coronary artery disease (CAD) (ICD-10 codes I25), heart failure (ICD-10 codes I50), myocardial infarction (MI) (ICD-10 codes I21), cardiomyopathy (ICD-10 codes I42-I43), hypertensive disease (ICD-10 codes I10-I15), and stroke (ICD-10 codes I60, I61, I63, I64) (Supplementary Table S2) (22). The survey data are de-identified. An exemption from ethical review has been approved by the Institutional Review Boards of School of Public Health at Shanghai Jiao Tong University School of Medicine.
Outcomes
The outcomes of interest included total health care cost, which consists of insurance-covered cost and out-of-pocket cost. We also considered health care costs by treatment category, including the cost of comprehensive medical service, diagnosis, treatment, medication, and medical consumables.
Covariates
The covariates included age, sex, marital status, residency (living in S City, living outside of S City), employment status (retired, employed, unemployed), common diagnosis (CHD, CAD, heart failure, MI, cardiomyopathy, hypertensive disease and stroke), cardiogenic comorbidity (yes, no), non-cardiac comorbidity (yes, no), number of comorbidities (0, 1, 2, 3, 4, ≥5), number of medical procedures, level of operation (I, II, III, IV), length of hospital stay, payment method (Urban Employee Basic Medical Insurance (UEBMI), New Rural Cooperative Medical Scheme/Urban Resident Basic Medical Insurance (NRCMS/URBMI), full self-payment and others), admission type (emergency and outpatient), and proportion of out-of-pocket.
Statistical analysis
To describe the distribution of health care costs, we first calculated the means of different quantiles of annual health care costs by demographic characteristics and compared the differences. We also calculated the Gini coefficient, a measure of inequality, for health care costs, the proportion of high health care costs (top 10%) in total costs. A high Gini coefficient suggests inequality in health care costs. We examined the statistical differences between high- (top 10%) and low-cost patients (bottom 10%) by between-group difference tests. We used recentered influence functions (RIFs) to estimate the small changes in the distribution of independent variables on the distributional measure of interest such as Gini coefficient (23).
We used quantile regression forests (QRFs) to identify determinants of health care costs across patients with different quantiles of CVD costs. Quantile regression (QR) can estimate how specific quantiles or percentiles of the distribution of the outcome variable vary with covariates. It is robust against outliers and is more informative for a skewed distribution than mean-based regression (24). QRFs is an extension of QR and has been applied in health care research for its prediction accuracy (25). QRFs utilizes the infrastructure of random forests and is a nonparametric model for conditional quantile estimation.
Specifically, we implemented a backward stepwise variable selection algorithm developed by Hu et al. (26), based on the variable importance scores generated by QRFs to determine the key factors for the 10th, 50th, and 90th quantiles, respectively, representing the low-, and high cost threshold of health care costs in patients with CVD (25). We computed the importance score of a QRFs for each covariate based on the “mean decrease in accuracy” (20). In a QRFs ensemble, each tree has an out-of-bag (OOB) sample that was left out from tree construction for assessing the predictive performance of the tree model (25). An iterative process was carried out for variable selection. Each time, we removed the least important variable and rebuilt a QRFs model with the remaining variables and recorded the out-of-bag average quantile loss (AQL) until no variable was left. AQL was used for the evaluation of model performance. Finally, we performed a weighted quantile regression (QR) to quantify the effects of each selected factor on different quantiles of health care costs in patients with CVD. Taking into account the variance across the specific CVD diseases, we performed subgroup analyses of CAD and stroke, the two diseases with the highest prevalence. As a sensitivity analysis, we also used GLM to estimate the effect of the key factors on the medical expenditures by the mean-based approach.
All statistical analyses were performed by R version 4.2.2. QRFs models were built using the “quantregForest” R package. A p value from two-sided test <0.05 was considered statistically significant.
Results
Population characteristics and distribution of health care costs
Out of 27,622 patients with CVD, 10,614 (38%) were female and 19,427 (70%) lived in S City. The mean age of the patients was 64.14 (14.09) years. The proportion of males among the top 10% of patients was higher than among the bottom 10% of patients (64% vs. 56%), and the same applied to the proportion of non-native (46% vs. 31%). Other population characteristics were summarized in Table 1.
Table 1. Demographic, disease characteristics and medical service utilization of inpatients with CVDs (n = 27,622).
Nearly 38% of the health care costs for treating CVD were generated by the top 10% of patients and 1% by the bottom 10%. The Gini coefficient of health care costs in patients with CVD was 0.56, that of out-of-pocket cost was 0.74, and that of medical insurance cost was 0.65, indicating a very high concentration. The Gini coefficient of health care costs for additional factors were showed in Supplementary Table S3. Among all types of CVD, the Gini coefficients of the costs exceeded 0.4 except for MI. The Gini coefficient of cardiomyopathy was the largest, reaching 0.68 in total cost, 0.82 in out-of-pocket, and 0.76 in medical insurance, which was much higher than the average (Figure 1).
The average health care cost was 41,282 CNY, while the 10th, 50th, and 90th of the costs were 6,103 CNY, 18,105 CNY, and 98,637 CNY, respectively. The average health care costs for each type of CVD were presented in Supplementary Table S4. Among them, myocardial infarction (MI) had the highest average annual per-person cost (56,118 CNY). Hypertensive disease had the highest cost (4,533 CNY) in the low-cost group, while cardiomyopathy was highest in the high-cost group. Coronary artery disease (CAD) accounted for the highest proportion (21.04%) of total costs (Supplementary Table S4). The bottom 10% of inpatients had a higher proportion of insurance coverage, accounting for 93.8%, while the top 10% had 72.3%. Among the specific categories of health care costs, the bottom 10% of patients had the highest proportion of diagnostic costs, reaching 62.8%, while the top 10% of patients had the highest proportion of medical consumables, reaching 62.9% (Supplementary Table S5).
Variable selection and rank of importance in different quantiles
Figures 2A,C,E showed the estimated out-of-bag AOL from every QRFs model built at each iteration in the backward stepwise algorithm for the 10th, 50th, and 90th quantiles of health care costs in patients with CVD. Number of operations, level of operation, inpatient length of stay, admission type and residence were selected as important factors for patients with both the 10th percentile, 50th percentile, and 90th percentile healthcare costs. Number of comorbidities was selected as an important factor only for patients with the 10th percentile cost, while sex, age, medical payment and proportion of self-payment were selected only for patients with the 90th percentiles costs. Figures 2B,D,F plotted the importance scores for the selected key factors for the 10th, 50th, and 90th quantiles of health care costs in patients with CVD. The variable selection of the identified key factors and the estimated effect of selected key factors for the 10th, 50th, and 90th percentile of health care costs with CAD and stroke were presented in Supplementary Figures S2, S3 and Supplementary Tables S7, S8.
Figure 2. (A), (C) and (E) plotted the estimated out-of-bag AOL from every QRFs model built at each iteration in the backward stepwise algorithm for the 10th, 50th and 90th quantiles of health care costs in patients with CVD. (B), (D) and (F) plotted the importance scores for the selected key factors for the 10th, 50th and 90th quantiles of health care costs in patients with CVD.
Estimated effect of selected key factors
Table 2 presented the estimated effects of the identified key determinants on total health care costs in patients with CVD. Compared to females, males were associated with higher costs among the high-cost groups (1,615 CNY, 95%CI: 348 CNY, 2,883 CNY). The cost would increase by 162 CNY (95% CI: 117 CNY, 207 CNY) per year of age among the high-cost group, but age was not significantly associated with health care costs in the low-cost group. The number of comorbidities was significantly associated with health care costs among the low-cost group. Patients with one, two, three, four or more comorbidities would have additional costs of 2,026 CNY (95% CI: 1,582 CNY, 2,471 CNY), 2,232 CNY (95% CI: 1,842 CNY, 2,622 CNY), 2,564 CNY (95% CI: 2,171 CNY, 2,957 CNY), and 2,635 CNY (95% CI: 2,267 CNY, 3,002 CNY), respectively, compared to those with no comorbidities. However, the number of comorbidities was not associated with costs among the high-cost groups. Payment methods were associated with costs in the high-cost groups, but not in the low-cost groups. Compared to UEBMI, NRCMS/URBMI was associated with higher costs (5,545 CNY, 95% CI: 660 CNY, 10,430 CNY), while this association was opposite in fully self-payment (−5,673 CNY, 95% CI: −7,841 CNY, −3,505 CNY). For every 1% increase in the proportion of self-payment, the costs increased by 134 CNY (95% CI: 100 CNY, 167 CNY) among the high-cost groups. Results from the sensitivity analysis using mean-based approach (GLM) were presented in Supplementary Table S9.
Table 2. Estimated effect of selected key factors on the medical expenditures using quantile regressions.
Discussion
The determinants of health care costs in patients with CVD in China are poorly known. Using data from a survey of patients with CVD from 14 large hospitals in China, we identified the determinants of health care costs in patients CVD and assessed their effects on the costs. The results showed that the health care costs in urban patients with CVD in China were highly concentrated in small groups of patients, similar to the empirical evidence (4, 27–29). The top 10% of inpatients accounted for approximately 38% of annual health care costs, and only 8% of the insured (UEBMI/NRCMS/URBMI) consumed close to 61% of annual inpatient medical insurance expenditures, which was generally consistent with the conclusions of some studies about other diseases in China (4, 30). Our study found that there was no difference in the concentration of total costs among age groups, but the concentration of out-of-pocket expenses and medical insurance expenses was completely opposite. The older adults have the highest concentration degree of out-of-pocket expenses and the lowest concentration degree of medical insurance expenses, although some evidence suggesting that concentration decreased with age (4, 31). Sex differences existed in the concentration of inpatient medical expenditures, with females having a greater concentration than males at all expense categories. Among the seven common diseases, patients with cardiomyopathy had the highest concentration and ones with MI had the lowest. Our results showed that high-cost inpatients were male and older, which explained the increase in the concentration of inpatient health care costs with population aging.
Employing the rigorous methodology to identify determinants for health care costs is important to informing the debate regarding how to improve health care value overall and address variation in health care costs in patients with CVD among low-cost and high-cost groups. It is particularly critical given the highly skewed distribution of health care costs with CVD in China. In previous studies, key factors of health care costs with CVD were often found using mean-based regression or for one of common diseases (9, 15, 19). To address the limitations, we used a machine learning with a principled backward stepwise algorithm (26) to identify determinants for varied levels of health care costs with CVD through a large survey with many covariates. We found that most of the key factors were the same across the quantiles, however, number of comorbidities was selected only the 10th percentile, while sex, age, medical payment and proportion of self-payment were selected only the 90th percentiles. Through selecting the key factors based on various quantiles of health care costs and ranking the relative importance, our study showcased the a more appropriate method for a detailed understanding of how determinants explain the variability in different parts of the health care cost distribution.
One major contribution of our study was to quantify the drivers for health care costs in patient with CVD in different cost groups. Previous studies have confirmed the basic consensus that some factors that elevated the probability of being a high-cost users (28, 30, 32, 33). However, with only qualitative assumptions, the impact of potential drivers on changes in concentration remains ambiguous, and a rigorous quantitative assessment lacking. To bridge this gap, we examined the drivers of health care costs in patient with CVD using the quantile regression forests. Our results showed that if the proportion of inpatients with cardiomyopathy increased by 10%, the predicted Gini coefficient would increase by 2.11%, and the costs gap would increase 2.08%. This suggested that the low prevalence of serious diseases, represented by cardiomyopathy, but accompanied by high-cost characteristics, was an important driver of health care cost concentration. In recent years, the burden of disease in China has shifted considerably, with the epidemiological transition from acute diseases, to chronic disease, such as cardiovascular diseases (34). Furthermore, the prevalence of chronic diseases, and trends of specific chronic diseases, has increased (35). For example, cardiovascular hospitalization costs increased by more than 20% annually since 2004 (1), stroke prevalence increased by 155% and the incidence increased by 31.6% in rural areas from 1980s to 2013 (36). The prevalence and spending on cardiovascular diseases will continue to rise as China’s demographics reflect population aging, prolonged life expectancy, increased expectation of medical care, and declining mortality rates, as well as the accumulation of risk factors.
We also found that these determinants did not uniformly impact the health care costs with linking the selected key determinants to health care costs using a weighted quantile regression. For example, number of comorbidities was selected only the 10th percentile. This finding might be surprising, given that it had been well documented that the high burden of comorbidities (37, 38). However, the magnitude of the relative difference was most profound at the lower percentile. It might be that the high cost of health care for those with CVD is more prominent among individuals with lower health care costs and has less of a differential impact on costs among those with higher costs and more complex conditions and care needs. In addition, medical payment method was selected only the 90th percentile. Compared with UEBMI, hospitalization costs were higher using the NRCMS/URBMI and lower with the full self-payment approach. This disproportionality in the effect estimates was often ignored in frequently used mean-based methods, potentially leading to biased conclusions.
Results from our study might also provide important insights for the development of tailored interventions to reduce potentially inappropriately high health care costs of CVD while maintaining or improving the care quality. For example, residence was significantly associated with higher percentile of health care costs of CVD. Due to the rich medial resources and top-notch medical technology, patients who seek medical treatment outside the city often suffer from more complex or severe diseases. Among the patients hospitalized in other provinces, 9.8% had Level-I medical procedure and 17.5% had Level-II medical procedure. These patients may be able to receive medical treatment locally. In addition, patients hospitalized in other provinces also incur more indirect costs, such as accommodation and transportation. Developing strategies by policymakers to reduce unnecessary or undesired treatments and related spending for out-of-town patients is warranted.
There are several limitations in the study. First, we conducted the large survey in almost all “AAA” general hospitals in S City, China. The findings generalized to the cities with rich medical resources but not the whole country. Second, we were not able to build causality in the relationships between health care costs and demographics, disease characteristics, and medical service utilization due to the cross-sectional nature of the survey data (39). However, our study identified determinants important for different quantiles of health care costs with CVD and can serve as a groundwork for future causal inference research in cost analysis. Third, we cannot evaluate other important variables that were not included in the study, either not measured or not collected in the survey, such as treatment quality or prognosis due to the lack of the follow-up. Despite the potential omitted variables, by using machine learning approach on a large sample that included individuals across demographics, residential information, comorbidity, surgery or operation information, insurance types, we believed our study deepens the understanding of the complex web of drivers and expands current research on CVD health care costs.
Conclusion
This study assessed the health care costs of treating all types of CVD in China and identified key determinants of high health care costs. To our knowledge, this is the first study assessing the economic burden of all types of CVD in China. Patients with high health care costs were more likely to be older, male, and have a longer length of hospital stay, more comorbidities, more complex medical procedures, and emergency admissions. Higher health care costs were also associated with specific CVD types such as cardiomyopathy, heart failure, and stroke. All of these findings may provide important insights for the development of tailored interventions to alleviate the financial burden of CVD in China, particularly among patients with high health care costs.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the institutional review board of School of Public Health at Shanghai Jiao Tong University School of Medicine on February 20, 2020 (IRB# SJUPN-202008). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
ML: Methodology, Validation, Writing – original draft. HG: Data curation, Resources, Writing – review & editing. CS: Writing – original draft. YX: Supervision, Writing – original draft. XL: Data curation, Writing – review & editing. LL: Writing – review & editing. YL: Conceptualization, Methodology, Writing – review & editing. GL: Funding acquisition, Project administration, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was supported by the National Natural Science Foundation of China (Nos. 72074147, 72293585).
Acknowledgments
We want to thank all respondents for voluntarily participating in the survey.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2023.1301276/full#supplementary-material
References
1. Liu, S, Li, Y, Zeng, X, Wang, H, Yin, P, Wang, L, et al. Burden of cardiovascular diseases in China, 1990-2016: findings from the 2016 global burden of disease study. JAMA Cardiol. (2019) 4:342–52. doi: 10.1001/jamacardio.2019.0295
2. Organization WH, Centre WM. (2021). Cardiovascular diseases (CVDs). Available at: https://www.who.int/en/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) (Accessed April 3 2023).
3. Writing Committee of the Report on Cardiovascular Health and Diseases in China. Report on cardiovascular health and diseases in China 2021: an updated summary. Chinese Circ J. (2022) 37:553–78. doi: 10.3969/j.issn.1000-3614.2022.06.001
4. Fan, Q, Wang, J, Nicholas, S, and Maitland, E. High-cost users: drivers of inpatient healthcare expenditure concentration in urban China. BMC Health Serv Res. (2022) 22:1348. doi: 10.1186/s12913-022-08775-9
5. Wang, S, Petzold, M, Cao, J, Zhang, Y, and Wang, W. Direct medical costs of hospitalizations for cardiovascular diseases in Shanghai, China: trends and projections. Medicine (Baltimore). (2015) 94:e837. doi: 10.1097/MD.0000000000000837
6. Birger, M, Kaldjian, AS, Roth, GA, Moran, AE, Dieleman, JL, and Bellows, BK. Spending on cardiovascular disease and cardiovascular risk factors in the United States: 1996 to 2016. Circulation. (2021) 144:271–82. doi: 10.1161/CIRCULATIONAHA.120.053216
7. Sheng, H, Dong, W, He, Y, Sui, M, Li, H, Liu, Z, et al. Regional variation of medical expenditures attributable to hypertension in China’s middle-aged and elderly population. Medicine (Baltimore). (2022) 101:e32395. doi: 10.1097/MD.0000000000032395
8. Lekoubou, A, Bishu, KG, and Ovbiagele, B. Naitonwide healthcare expenditures among hypertensive individuals with stroke 2003-2014. J Stroke. (2018) 27:1760–9. doi: 10.1016/j.jstrokecerebrovasdis.2018.02.003
9. Liu, G, Fang, Q, Ji, X, OuYang, J, Zhu, Y, Wang, L, et al. Medical expenditure and its influencing factors of patients with hypertension in Shanxi Province, China: a study based on ‘System of health accounts 2011′ and multiple-layer perceptron neural network. BMJ Open. (2022) 12:e048396. doi: 10.1136/bmjopen-2020-048396
10. Zhang, D, Wang, G, Zhang, P, Fang, J, and Ayala, C. Medical expenditures associated with hypertension in the U.S., 2000-2013. Am J Prev Med. (2017) 53:S164–71. doi: 10.1016/j.amepre.2017.05.014
11. Tromp, J, Ouwerkerk, W, Cleland, JGF, Angermann, CE, Dahlstrom, U, Tiew-Hwa Teng, K, et al. Global differences in burden and treatment of ischemic heart disease in acute heart failure: REPORT-HF. JACC Heart Fail. (2021) 9:349–59. doi: 10.1016/j.jchf.2020.12.015
12. Cornhill, AK, Dykstra, S, Satriano, A, Labib, D, Mikami, Y, Flewitt, J, et al. Machine learning patient-specific prediction of heart failure hospitalization using cardiac MRI-based phenotype and electronic health information. Front Cardiovasc Med. (2022) 9:890904. doi: 10.3389/fcvm.2022.890904
13. John, KJ, Turaka, VP, Muruga Bharathy, K, Vignesh Kumar, C, Jayaseelan, L, Visalakshi, J, et al. Predictors of mortality, strategies to reduce readmission, and economic impact of acute decompensated heart failure: results of the Vellore heart failure registry. Indian Heart J. (2020) 72:20–6. doi: 10.1016/j.ihj.2020.03.005
14. Grandhi, GR, Valero-Elizondo, J, Mszar, R, Brandt, EJ, Annapureddy, A, Khera, R, et al. Association of cardiovascular risk factor profile and financial hardship from medical bills among non-elderly adults in the United States. Am J Prev Cardiol. (2020) 2:100034. doi: 10.1016/j.ajpc.2020.100034
15. Shah, CH, and Dave, CV. Healthcare costs associated with comorbid cardiovascular and renal conditions among persons with diabetes, 2008-2019. Diabetol Metab Syndr. (2022) 14:179. doi: 10.1186/s13098-022-00957-z
16. Mekhaimar, M, Dargham, S, El-Shazly, M, Al Suwaidi, J, Jneid, H, and Abi, KC. Diabetes-related cardiovascular and economic burden in patients hospitalized for heart failure in the US: a recent temporal trend analysis from the National Inpatient Sample. Heart Fail Rev. (2021) 26:289–300. doi: 10.1007/s10741-020-10012-6
17. Tapias, FS, Otani, VHO, Vasques, DAC, Otani, TZS, and Uchida, RR. Costs associated with depression and obesity among cardiovascular patients: medical expenditure panel survey analysis. BMC Health Serv Res. (2021) 21:433. doi: 10.1186/s12913-021-06428-x
18. Ebrahimoghli, R, Janati, A, Sadeghi-Bazargani, H, Hamishehkar, H, and Khalili-Azimi, A. Incremental healthcare resource utilization and expenditures associated with cardiovascular diseases in patients with diabetes: a cross-sectional study. Iran J Med Sci. (2022) 47:53–62. doi: 10.30476/ijms.2020.87284.1742
19. He, Q, Dou, Z, Su, Z, Shen, H, Mok, TN, Zhang, CJP, et al. Inpatient costs of congenital heart surgery in China: results from the National Centre for cardiovascular diseases. Lancet Reg Health West Pac. (2023) 31:100623. doi: 10.1016/j.lanwpc.2022.100623
20. Li, L, Hu, L, Ji, J, Mckendrick, K, Moreno, J, Kelley, AS, et al. Determinants of Total end-of-life health care costs of Medicare beneficiaries: a quantile regression forests analysis. J Gerontol A Biol Sci Med Sci. (2022) 77:1065–71. doi: 10.1093/gerona/glab176
21. Luengo-Fernandez, R, Leal, J, Gray, A, Petersen, S, and Rayner, M. Cost of cardiovascular diseases in the United Kingdom. Heart. (2006) 92:1384–9. doi: 10.1136/hrt.2005.072173
22. Kokotailo, RA, and Hill, MD. Coding of stroke and stroke risk factors using international classification of diseases, revisions 9 and 10. Stroke. (2005) 36:1776–81. doi: 10.1161/01.STR.0000174293.17959.a1
23. Firpo, S, Fortin, N, and Lemieux, T. Unconditional Quantile Regressions. Econometrica. (2009) 77:953–73. doi: 10.3982/ECTA6822
24. Hu, L, and Hogan, JW. Causal comparative effectiveness analysis of dynamic continuous-time treatment initiation rules with sparsely measured outcomes and death. Biometrics. (2019) 75:695–707. doi: 10.1111/biom.13018
25. Hu, L, Ji, J, Li, Y, Liu, B, and Zhang, Y. Quantile regression forests to identify determinants of neighborhood stroke prevalence in 500 cities in the USA: implications for neighborhoods with high prevalence. J Urban Health. (2021) 98:259–70. doi: 10.1007/s11524-020-00478-y
26. Hu, L, Li, L, Ji, J, and Sanderson, M. Identifying and understanding determinants of high healthcare costs for breast cancer: a quantile regression machine learning approach. BMC Health Serv Res. (2020) 20:1066. doi: 10.1186/s12913-020-05936-6
27. Cohen, S. The concentration of health care expenditures in the U.S. and predictions of future spending. J Econ Soc Meas. (2016) 41:167–89. doi: 10.3233/JEM-160427
28. Holle, M, Wolff, T, and Herant, M. Trends in the concentration and distribution of health care expenditures in the US, 2001-2018. JAMA Netw Open. (2021) 4:e2125179. doi: 10.1001/jamanetworkopen.2021.25179
29. Ibuka, Y, Chen, S, Ohtsu, Y, and Izumida, N. Medical spending in Japan: an analysis using administrative data from a citizen’s health insurance plan. Fisc Stud. (2016) 37:561–92. doi: 10.1111/j.1475-5890.2016.12110
30. Chen, H, Peng, X, and Shen, M. Concentration and persistence of healthcare spending: evidence from China. Sustainability. (2021) 13:5761. doi: 10.3390/su13115761
31. Pashchenko, S, and Porapakkarm, P. Medical spending in the US: facts from the medical expenditure panel survey data set. Fisc Stud. (2016) 37:689–716. doi: 10.1111/j.1475-5890.2016.12100
32. Hanley, GE, and Morgan, S. Chronic catastrophes: exploring the concentration and sustained nature of ambulatory prescription drug expenditures in the population of British Columbia. Canada Soc Sci Med. (2009) 68:919–24. doi: 10.1016/j.socscimed.2008.12.008
33. Joynt, KE, Gawande, AA, Orav, EJ, and Jha, AK. Contribution of preventable acute care spending to total spending for high-cost Medicare patients. JAMA. (2013) 309:2572–8. doi: 10.1001/jama.2013.7103
34. Zhou, M, Wang, H, Zeng, X, Yin, P, Zhu, J, Chen, W, et al. Mortality, morbidity, and risk factors in China and its provinces, 1990-2017: a systematic analysis for the global burden of disease study 2017. Lancet. (2019) 394:1145–58. doi: 10.1016/S0140-6736(19)30427-1
35. Yang, G, Kong, L, Zhao, W, Wan, X, Zhai, Y, Chen, LC, et al. Emergence of chronic non-communicable diseases in China. Lancet. (2008) 372:1697–705. doi: 10.1016/S0140-6736(08)61366-5
36. Sun, H, and Wang, W. A nationwide epidemiological sample survey on cerebrovascular disease in China. Chin J Contemp Neurol Neurosury. (2018) 18:83–8. doi: 10.3969/j.issn.1672-6731.2018.02.002
37. Calver, J, Brameld, K, Preen, D, Alexia, S, Boldy, D, and McCaul, K. High-cost users of hospital beds in Western Australia: a population-based record linkage study. Med J Aust. (2006) 184:393–7. doi: 10.5694/j.1326-5377.2006.tb00289.x
38. Rosella, L, Fitzpatrick, T, Wodchis, W, Calzavara, A, Manson, H, and Goel, V. High-cost health care users in Ontario, Canada: demographic, socio-economic, and health status characteristics. BMC Health Serv Res. (2014) 14:532. doi: 10.1186/s12913-014-0532-2
Keywords: health care costs, cardiovascular disease, quantile regression forest, machine learning, financial burden
Citation: Lu M, Gao H, Shi C, Xiao Y, Li X, Li L, Li Y and Li G (2023) Health care costs of cardiovascular disease in China: a machine learning-based cross-sectional study. Front. Public Health. 11:1301276. doi: 10.3389/fpubh.2023.1301276
Edited by:
Thomas T. H. Wan, University of Central Florida, United StatesReviewed by:
Varadraj Prabhu Gurupur, University of Central Florida, United StatesJay J. Shen, University of Nevada, Las Vegas, United States
Copyright © 2023 Lu, Gao, Shi, Xiao, Li, Li, Li and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yan Li, yanliacademic@gmail.com; Guohong Li, guohongli@sjtu.edu.cn
†These authors have contributed equally to this work and share first authorship