Explanatory deep learning to predict elevated pulmonary artery pressure in children with ventricular septal defects using standard chest x-rays: a novel approach

Li, Zhixin; Luo, Gang; Ji, Zhixian; Wang, Sibao; Pan, Silin

doi:10.3389/fcvm.2024.1330685

ORIGINAL RESEARCH article

Front. Cardiovasc. Med., 12 January 2024

Sec. Pediatric Cardiology

Volume 11 - 2024 | https://doi.org/10.3389/fcvm.2024.1330685

Zhixin Li^†

Gang Luo^†

Zhixian Ji

Sibao Wang

Silin Pan*

Heart Center, Women and Children’s Hospital, Qingdao University, Qingdao, China

Objective: Early risk assessment of pulmonary arterial hypertension (PAH) in patients with congenital heart disease (CHD) is crucial to ensure timely treatment. We hypothesize that applying artificial intelligence (AI) to chest x-rays (CXRs) could identify the future risk of PAH in patients with ventricular septal defect (VSD).

Methods: A total of 831 VSD patients (161 PAH-VSD, 670 nonPAH-VSD) were retrospectively included. A residual neural network (ResNet) was trained to classify VSD patients with different outcomes based on chest radiographs. The endpoint of this study was the occurrence of PAH in VSD children before or after surgery.

Results: In the validation set, the AI algorithm achieved an area under the curve (AUC) of 0.82. In an independent test set, the AI algorithm significantly outperformed human observers in terms of AUC (0.81 vs. 0.65). Class Activation Mapping (CAM) images demonstrated the model's attention focused on the pulmonary artery segment.

Conclusion: The preliminary findings of this study suggest that the application of artificial intelligence to chest x-rays in VSD patients can effectively identify the risk of PAH.

Graphical Abstract

Introduction

Among the growing population of adults with congenital heart disease, pulmonary arterial hypertension associated with congenital heart disease (PAH-CHD) is a major cause of increased mortality (1, 2). Ventricular septal defect (VSD) is the most common congenital heart anomaly (3). Complications such as PAH are common and require significant medical resources (4). The surgical outcomes of VSD repair are closely related to the age of the patient, with children under 2 years often experiencing a return to normal function post-surgery, while the long-term efficacy for those over 2 years with severe PAH remains uncertain (2, 5). It is widely accepted that repairing significant volume overload across the shunt between the systemic and the pulmonary circulations can lead to notable benefits and improve life expectancy in young patients (6). However, for VSD patients without early detection of PAH, even if the defect is successfully corrected, the mid to long-term outcomes after repair are unfavorable (7). Therefore, early detection of VSD patients with associated PAH (PAH-VSD) becomes a major clinical concern.

In terms of pathophysiology, the development of PAH requires a genetic predisposition or other triggering factors that activate increased pulmonary blood flow, along with a series of mediators that cause vasoconstriction and vascular remodeling. Based on these pathological processes, researchers have identified various markers and tests that can indicate the risk of developing PAH-CHD, such as circulating endothelial cells (8), a specific hyperoxia test (5), acoustic diagnosis (9), and the surface electrocardiogram (10). However, current methods do not adequately use the information available from convenient standard chest radiographs, such as increased lung markings and dilatation of the pulmonary artery segment, to assist in diagnosis.

To fully explore the information derived from chest radiographs, machine learning and computer vision techniques offer methods to enhance insights, improve accuracy, and optimize the workload and time required for interpretation. The objective of this study is to develop and validate an AI-based interpretable prediction model that can predict the likelihood of PAH occurrence in children with VSD based on preoperative chest x-ray examinations. This research aims to leverage the power of artificial intelligence to identify high-risk children with PAH-VSD at an early stage, allowing for early prevention and better treatment, ultimately leading to improved patient outcomes.

Materials and methods

Data partition

This study is a retrospective analysis, and the chest x-ray images were collected from the Affiliated Hospital of Qingdao University for Women and Children. The inclusion criteria were as follows: (1) children primarily diagnosed with ventricular septal defects upon admission; (2) children without significant medical history; (3) children who underwent both chest x-ray and echocardiography examinations after their initial hospitalization. The exclusion criteria were: (1) low-quality data (lung lesions or air trapping, or a known history of lung disease or surgery); (2) outpatient cases with inadequate clinical information; (3) patients with ventricular septal defects who underwent only chest x-ray or only echocardiography during their initial hospitalization; (4) readmitted patients. To ensure the accuracy and reliability of the model, non-standard data were excluded. A total of 831 children who received a first diagnosis of ventricular septal defect and subsequently underwent ventricular septal defect repair surgery between February 2015 and February 2023 were included, and their chest x-ray images were used for training and testing the predictive model.

According to two pediatric cardiologists with over 10 years of experience, the children were divided into two groups based on their initial diagnosis confirmed by echocardiography reports (95%) or right heart catheterization (5%): the non-PAH group (670) and the PAH group (161). In this study, echocardiography was used to assess pulmonary arterial hypertension, primarily through the pulmonary artery Doppler flow spectrum and the tricuspid regurgitation pressure gradient method, which correlates highly with right heart catheterization, the clinically recognized “gold standard” for measuring pulmonary artery pressure. The study protocol was reviewed and approved by our Institutional Review Board. As the data were obtained from patients who agreed to have their data comprehensively studied in routine clinical practice, informed consent was obtained from all patients. All methods adhered to the relevant guidelines and regulations (Figure 1).
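The tricuspid regurgitation pressure gradient method mentioned above is commonly based on the simplified Bernoulli equation, in which the pressure gradient in mmHg is four times the square of the peak regurgitant jet velocity in m/s, and an estimate of right atrial pressure is added to approximate systolic pulmonary artery pressure. The sketch below illustrates that arithmetic only. The function names and the right atrial pressure value are illustrative assumptions, and the text does not state which exact formula or cut-off the authors applied.

```python
def tr_pressure_gradient(tr_velocity_m_s: float) -> float:
    """Simplified Bernoulli equation: gradient (mmHg) = 4 * v^2, with v in m/s."""
    if tr_velocity_m_s < 0:
        raise ValueError("velocity must be non-negative")
    return 4.0 * tr_velocity_m_s ** 2


def estimate_spap(tr_velocity_m_s: float, right_atrial_pressure_mmhg: float) -> float:
    """Estimated systolic pulmonary artery pressure = TR gradient + RAP (mmHg).

    The right atrial pressure is supplied by the caller; how it is estimated
    is an assumption outside this sketch.
    """
    return tr_pressure_gradient(tr_velocity_m_s) + right_atrial_pressure_mmhg


# Illustrative values only: a 3.0 m/s jet and an assumed RAP of 5 mmHg.
print(estimate_spap(3.0, 5.0))  # 4 * 3.0^2 + 5 = 41.0
```

This estimate equates right ventricular and pulmonary artery systolic pressure, which holds only when there is no right ventricular outflow tract obstruction.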

Figure 1. Flowchart of this study.

Image acquisition

In this study, conventional radiographs were acquired under standard conditions using the DRX Evolution Plus (Carestream Health, USA). All DICOM images were downsampled and converted to JPG images with a resolution of 256 × 256. The dataset was randomly divided into a training set (80%) and a test set (20%). Additionally, image augmentation techniques such as gamma correction, horizontal flipping, rotation, and pixel shifting were applied to the images in the training dataset. Subsequently, resampling was performed to balance the data between the two groups, and hyperparameters were tuned using grid search.
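The split and class-balancing steps described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the authors' pipeline: the text says only that the split was random and that resampling balanced the groups, so the per-class (stratified) split, the choice of random oversampling of the minority class, the restriction of oversampling to the training set, and the helper names `stratified_split` and `oversample_minority` are all assumptions.

```python
import random


def stratified_split(labels, train_fraction=0.8, seed=0):
    """Return (train_idx, test_idx), sampling each class separately."""
    rng = random.Random(seed)
    train_idx, test_idx = [], []
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        cut = int(train_fraction * len(idx))  # floor of the per-class share
        train_idx += idx[:cut]
        test_idx += idx[cut:]
    return train_idx, test_idx


def oversample_minority(indices, labels, seed=0):
    """Duplicate random samples of smaller classes until every class
    has as many samples as the largest one."""
    rng = random.Random(seed)
    by_class = {}
    for i in indices:
        by_class.setdefault(labels[i], []).append(i)
    target = max(len(v) for v in by_class.values())
    balanced = []
    for cls_idx in by_class.values():
        extra = [rng.choice(cls_idx) for _ in range(target - len(cls_idx))]
        balanced += cls_idx + extra
    return balanced


# Cohort sizes from the text: 161 PAH (label 1) and 670 non-PAH (label 0).
labels = [1] * 161 + [0] * 670
train_idx, test_idx = stratified_split(labels)
balanced_idx = oversample_minority(train_idx, labels)

print(len(train_idx), len(test_idx))          # size of each partition
print(sum(labels[i] for i in train_idx))      # PAH cases in the training set
print(len(balanced_idx))                      # training size after balancing
```

Under these assumptions an 80% per-class split yields 128 PAH and 536 non-PAH training cases. Oversampling only the training indices keeps duplicated images out of the held-out set.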

Model development

We constructed a model based on ResNet50 to detect PAH in VSD patients. Residual neural networks help alleviate the vanishing gradient problem, allowing for the training of deeper networks (11). The network consists of 50 layers, divided into 5 blocks, each containing a set of residual blocks. Each residual block (RB) consists of two batch normalization layers, two rectified linear unit (ReLU) layers and two 3 × 3 convolutional layers. We then fine-tuned the pretrained model and performed nested ten-fold cross-validation. A batch size of 32 was set, and training was performed using the Adam optimizer. The network model was built using the PyTorch (version 1.6.0) deep learning framework on a computer equipped with a 16 vCPU AMD EPYC 9654 96-Core Processor and an RTX 4090 24GB GPU from NVIDIA Corp.
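The way a residual connection eases gradient flow can be seen from the generic residual formulation. The expression below is a textbook sketch rather than a description of the authors' exact network: here $x$ is the block input, $F(x; W)$ stands for the stacked convolution, batch normalization and ReLU layers inside one block, and $\mathcal{L}$ is the training loss. These symbols are introduced only for illustration.

```latex
y = x + F(x; W)
\qquad\Longrightarrow\qquad
\frac{\partial \mathcal{L}}{\partial x}
  = \frac{\partial \mathcal{L}}{\partial y}
    \left( I + \frac{\partial F(x; W)}{\partial x} \right)
```

Because of the identity term $I$, part of the gradient reaches earlier layers unchanged even when $\partial F / \partial x$ is small, which is why deeper networks remain trainable.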

Class activation mapping

We generated heat maps using class activation mapping (CAM) to identify regions with high activation levels (12–14). A global feature vector representation is obtained after the last convolutional layer, and CAM is used to generate class-discriminative heat maps from that layer. These heat maps are superimposed on the chest x-rays of patients diagnosed with VSD, highlighting the regions that contribute most to predicting the PAH category. With CAM, decision makers can make better-informed decisions and gain a deeper understanding of the model (12).
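The core computation behind such a heat map can be sketched as a weighted sum of the final convolutional feature maps followed by rescaling. The example below is a toy illustration in plain Python with hypothetical feature maps and weights. In the original CAM formulation the weights come from the final fully connected layer after global average pooling, whereas Grad-CAM variants derive them from gradients, and the text does not specify beyond the cited references which variant was used.

```python
def class_activation_map(feature_maps, class_weights):
    """Weighted sum of K feature maps (each H x W), min-max scaled to [0, 1]."""
    height, width = len(feature_maps[0]), len(feature_maps[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for fmap, weight in zip(feature_maps, class_weights):
        for r in range(height):
            for c in range(width):
                cam[r][c] += weight * fmap[r][c]
    lowest = min(min(row) for row in cam)
    highest = max(max(row) for row in cam)
    span = highest - lowest
    if span == 0:  # flat map: nothing to highlight
        return [[0.0] * width for _ in range(height)]
    return [[(v - lowest) / span for v in row] for row in cam]


# Toy example: two 2 x 2 feature maps and hypothetical class weights.
feature_maps = [
    [[0.0, 1.0], [2.0, 3.0]],
    [[4.0, 0.0], [0.0, 2.0]],
]
class_weights = [1.0, 0.5]
heatmap = class_activation_map(feature_maps, class_weights)
print(heatmap)
```

In practice the low-resolution map would be upsampled to the input size (256 × 256 here) before being overlaid on the radiograph.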

Role of the funding source

The study's funding source did not contribute to the study's design, data collection, analysis, interpretation, or manuscript preparation. The corresponding author had unrestricted access to all the data generated throughout the study and ultimately determined whether the manuscript would be published.

Results

Datasets

The training cohort consisted of 664 VSD patients. Participants were divided into the two groups: those with PAH (PAH group: 128 patients) and those without PAH (non-PAH group: 536 patients). There were no differences in the features between the training and validation cohorts. Clinical characteristics of the participants included in our study are presented in Table 1.

Table 1. Baseline characteristics of the study population.

Model evaluation

The trained classification model assigns unlabeled chest radiographs to one of two groups: non-PAH or PAH. To evaluate the performance of the proposed model, we computed several evaluation metrics, including accuracy, precision, sensitivity, and F1 score.
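These metrics follow directly from the four cells of a binary confusion matrix. The sketch below shows the standard definitions. The counts are invented for illustration and are not the study's results, and the function name `classification_metrics` is hypothetical.

```python
def classification_metrics(tp, fp, fn, tn):
    """Standard binary metrics from confusion-matrix counts.

    Assumes every denominator is non-zero (at least one predicted
    positive, one actual positive and one actual negative).
    """
    total = tp + fp + fn + tn
    precision = tp / (tp + fp)       # share of predicted PAH that is truly PAH
    sensitivity = tp / (tp + fn)     # share of true PAH that is detected (recall)
    return {
        "accuracy": (tp + tn) / total,
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": tn / (tn + fp),
        "f1": 2 * precision * sensitivity / (precision + sensitivity),
    }


# Made-up counts, chosen only to make the arithmetic easy to follow.
metrics = classification_metrics(tp=30, fp=10, fn=20, tn=40)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

With imbalanced groups, as in this cohort, accuracy alone can look favorable even when few PAH cases are detected, so sensitivity and F1 score are the more informative figures.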

Figure 2 displays the receiver operating characteristic (ROC) curve for the validation dataset, which plots the true positive rate against the false positive rate across decision thresholds and thus shows how well the model separates the two classes. To evaluate the generalizability of our results, we collected an external test cohort of 50 patients, for which the AUC of the DL algorithm was 0.81. The AUC in this cohort is similar to that in the original validation cohort, but significantly higher than the AUC of chest x-rays evaluated by human observers (0.65) (Figure 3). Figure 4 shows the confusion matrix for the model's performance on the external test group, which summarizes the model's classification accuracy across all classes. The rows represent the predicted labels, while the columns represent the actual labels. For a more comprehensive assessment of the model, performance metrics such as precision, accuracy, and F1 score were calculated and are presented in Table 2.
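The AUC reported above has a simple interpretation: it equals the probability that a randomly chosen PAH case receives a higher model score than a randomly chosen non-PAH case, with ties counted as one half. The sketch below computes it by direct pairwise comparison. The labels and scores are toy values, not study data, and `roc_auc` is a hypothetical helper name.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    labels: iterable of 0/1 ground-truth classes.
    scores: model outputs, higher meaning more likely positive.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a correct ranking
    return wins / (len(pos) * len(neg))


# Toy data: three of the four positive/negative pairs are ordered correctly.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(labels, scores))  # 0.75
```

The pairwise loop is quadratic in the number of samples, which is acceptable for small cohorts such as a 50-patient test set. A rank-based implementation would be preferable for larger data.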

Figure 2. ROC curve of the ResNet50 model in the validation group.

Figure 3. (A) ROC curve of the ResNet50 model in the external test group. (B) ROC curve of risk assessment by human observers.

Figure 4. Confusion matrix of the ResNet50 model in the external test group.

Table 2. Performance of the ResNet50 model in the external test group.

Heatmap evaluation

In order to enhance the interpretability of clinical models and increase physicians' confidence in the models, we used Class Activation Mapping (CAM) to examine the attention focus areas in the images recognized by the artificial intelligence, which helped us determine the areas in CXRs that our model emphasized. Figure 5 exhibits heatmap overlays applied to 12 cases of true positive and true negative instances for the purpose of illustrating these regions of interest. Within the PAH group, the AI model demonstrates a tendency to concentrate on the pulmonary artery region as well as the right side of the heart. Meanwhile, in the nonPAH-VSD group, the AI model primarily focuses on the pulmonary artery. It is worth noting that characteristic features in chest radiographs of PAH-CHD patients include a prominent pulmonary artery segment and deepened pulmonary vascular shadow, indicating possible enlargement of the right atrium and right ventricle due to elevated pulmonary artery pressure. Hence, based on these observations, we are confident that our final AI model effectively discerns the variations in CXR images, which aligns well with our existing knowledge.

Figure 5. (A) Attention map of chest x-ray model for nonPAH-VSD patients. (B) Attention map of chest x-ray model for PAH-VSD patients.

Discussion

In this study, deep learning techniques were used to develop a model for estimating the likelihood of PAH occurrence in patients with VSD from chest x-ray screenings. In the external test dataset, the best model achieved an AUC of 0.81, indicating good overall discrimination between the PAH and non-PAH groups. Chest x-rays may therefore be a crucial tool for detecting PAH patients, especially in settings where alternative imaging modalities are unavailable. Furthermore, these findings highlight the value of machine learning-based approaches in healthcare, as they can improve efficiency and accuracy. Additionally, these results suggest that standardized and cost-effective examination methods can yield additional clinical information through AI algorithms. We believe this research represents the first attempt to predict PAH occurrence in VSD patients based on chest x-ray images using AI algorithms. Because CAM is incorporated into the model, clinicians can see the specific areas on chest x-ray images that indicate the likelihood of this disease, which should increase their confidence in the model.

Clinical implications

CXR has long been recognized as a useful tool in the examination of patients with elevated pulmonary artery pressure, and its simplicity and cost-effectiveness make it widely available worldwide. A recent study has shown that CXR measurements can identify a larger number of subjects with undiagnosed PAH. Furthermore, there are various methods for detecting PAH, including laboratory data, electrocardiography, and physical examinations, with AUC values reaching a maximum of 0.65. Because invasive data from right heart catheterization, the gold standard, are of limited availability, a CXR-based model such as ours has potential advantages. From a reproducibility standpoint, automated evaluation that yields quantitative results, including measurements, without any user interaction is needed. Our results demonstrate that AI models can be trained to predict the occurrence of PAH in VSD patients based on CXR images. We believe that this study serves as a pilot investigation exploring the feasibility of applying deep learning algorithms to the clinical assessment of pulmonary arterial hypertension in VSD patients.

Our model can provide a highly explanatory and insightful tool due to the visual evidence supporting the classification results, which is an inevitable part of clinical diagnosis. By visually and accurately displaying significant regions of cardiac dysfunction, it can serve as an excellent artificial intelligence tool for radiologists in medical diagnosis, and can be extensively applied in clinical practices where comprehensive annotations are challenging to obtain.

Limitations

Firstly, the sample size of patients in this study is limited. Deep learning algorithms require a large volume of data, typically thousands of patients, to achieve better generalization performance. Moreover, due to the small number of patients, we were unable to create models to predict specific types of pulmonary hypertension (such as preoperative PAH or postoperative PAH). Therefore, the development of viable AI models after classification was not feasible. Further evaluation through echocardiography and cardiac catheterization at referral centers is necessary. Additionally, as this study is retrospective in nature and the data collected were obtained from hospitals in the Eastern region of China, it is likely to have inherent biases. To address this issue, we plan to conduct a multicenter study involving multiple hospitals in the future. Given these limitations, this study is considered preliminary, and we believe that this report can serve as a motivating factor for future large-scale multicenter research.

Conclusions

Applying artificial intelligence to CXR (a conventional, universal, and cost-effective test) is a potential tool for assessing the future risk of PAH in the VSD patient population. This preliminary study suggests that the use of artificial intelligence in CXR for predicting PAH risk in the VSD population is more effective than subjective judgments made by human observers. Nevertheless, the interpretability and stability of the results are still inadequate, and it is premature to incorporate this technology into current guidelines.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Materials; further inquiries can be directed to the corresponding author.

Ethics statement

The studies involving humans were approved by Qingdao Women and Children's Hospital Clinical Trial Ethics Committee. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

ZL: Data curation, Validation, Visualization, Writing – original draft, Writing – review & editing. GL: Validation, Writing – original draft, Writing – review & editing. ZJ: Writing – review & editing. SW: Writing – review & editing. SP: Investigation, Methodology, Project administration, Resources, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Constantine A, Dimopoulos K. Evaluating a strategy of PAH therapy pre-treatment in patients with atrial septal defects and pulmonary arterial hypertension to permit safe repair (“treat-and-repair”). Int J Cardiol. (2019) 291:142–4. doi: 10.1016/j.ijcard.2019.05.039

2. Goldstein SA, Krasuski RA. Pulmonary hypertension in adults with congenital heart disease. Cardiol Clin. (2022) 40(1):55–67. doi: 10.1016/j.ccl.2021.08.006

3. Hoffman JIE. Incidence of congenital heart disease: I. Postnatal incidence. Pediatr Cardiol. (1995) 16:103–13. doi: 10.1007/BF00801907

4. Farber HW, Foreman AJ, Miller DP, McGoon MD. REVEAL registry: correlation of right heart catheterization and echocardiography in patients with pulmonary arterial hypertension. Congest Heart Fail. (2011) 17(2):56–63. doi: 10.1111/j.1751-7133.2010.00202.x

5. Provencher S, Sitbon O, Humbert M, Cabrol S, Jaïs X, Simonneau G. Long-term outcome with first-line bosentan therapy in idiopathic pulmonary arterial hypertension. Eur Heart J. (2006) 27(5):589–95. doi: 10.1093/eurheartj/ehi728

6. Roos-Hesselink J. Outcome of patients after surgical closure of ventricular septal defect at young age: longitudinal follow-up of 22–34 years. Eur Heart J. (2004) 25(12):1057–62. doi: 10.1016/j.ehj.2004.04.012

7. Mendeloff EN, Meyers BF, Sundt TM, Guthrie TJ, Sweet SC, de la Morena M, et al. Lung transplantation for pulmonary vascular disease. Ann Thorac Surg. (2002) 73(1):209–17. doi: 10.1016/s0003-4975(01)03082-x

8. Smadja DM, Gaussem P, Mauge L, Israël-Biet D, Dignat-George F, Peyrard S, et al. Circulating endothelial cells: a new candidate biomarker of irreversible pulmonary hypertension secondary to congenital heart disease. Circulation. (2009) 119(3):374–81. doi: 10.1161/CIRCULATIONAHA.108.808246

9. Kaddoura T, Vadlamudi K, Kumar S, Bobhate P, Guo L, Jain S, et al. Acoustic diagnosis of pulmonary hypertension: automated speech-recognition-inspired classification algorithm outperforms physicians. Sci Rep. (2016) 6(1):33182. doi: 10.1038/srep33182

10. Whitman IR, Patel VV, Soliman EZ, Bluemke DA, Praestgaard A, Jain A, et al. Validity of the surface electrocardiogram criteria for right ventricular hypertrophy: the MESA-RV study (multi-ethnic study of atherosclerosis-right ventricle). J Am Coll Cardiol. (2014) 63(7):672–81. doi: 10.1016/j.jacc.2013.08.1633

11. Targ S, Almeida D, Lyman K. Resnet in resnet: generalizing residual architectures. arXiv [preprint]. arXiv:1603.08029 (2016). doi: 10.48550/arXiv.1603.08029

12. Marmolejo-Saucedo AJ, Kose U. Numerical grad-cam based explainable convolutional neural network for brain tumor diagnosis. Mob Netw Appl. (2022):1–10. doi: 10.1007/s11036-022-02021-6

13. Borg M, Jabangwe R, Aberg S, Ekblom A, Hedlund L, Lidfeldt A. Test automation with grad-CAM heatmaps—a future pipe segment in MLOps for vision AI? In: 2021 IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW). IEEE (2021). 175–81. doi: 10.1109/ICSTW52544.2021.00039

14. Chattopadhay A, Sarkar A, Howlader P, Balasubramanian VN. Grad-CAM plus plus: generalized gradient-based visual explanations for deep convolutional networks. In: 2018 IEEE Winter Conference on Applications of Computer Vision (WACV 2018). IEEE (2018). 839–47. doi: 10.1109/WACV.2018.00097

Keywords: artificial intelligence, pulmonary arterial hypertension, chest x-ray, ventricular septal defect, deep learning—artificial intelligence

Citation: Li Z, Luo G, Ji Z, Wang S and Pan S (2024) Explanatory deep learning to predict elevated pulmonary artery pressure in children with ventricular septal defects using standard chest x-rays: a novel approach. Front. Cardiovasc. Med. 11:1330685. doi: 10.3389/fcvm.2024.1330685

Received: 16 November 2023; Accepted: 3 January 2024;
Published: 12 January 2024.

Edited by:

Inga Voges, University Medical Center Schleswig-Holstein, Germany

Reviewed by:

Zhuoming Xu, Shanghai Children’s Medical Center, China
Federico Gutierrez-Larraya, University Hospital La Paz, Spain
Nazmi Narin, Izmir Katip Celebi University, Türkiye

© 2024 Li, Luo, Ji, Wang and Pan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Pan Silin, silinpan@126.com

^†These authors have contributed equally to this work and share first authorship
