- 1School of Computer Science and Technology, Qilu University of Technology (Shandong Academy of Sciences), Jinan, China
- 2Cancer Center, The Second Hospital of Shandong University, Jinan, China
- 3Department of Gastrointestinal Surgery, Central Hospital Affiliated to Shandong First Medical University, Jinan, China
Attention convolutional neural networks (ATT-CNNs) have got a huge gain in picture operating and nature language processing. Shortage of interpretability cannot remain the adoption of deep neural networks. It is very conspicuous that is shown in the prediction model of disease aftermath. Biological data are commonly revealed in a nominal grid data structured pattern. ATT-CNN cannot be applied directly. In order to figure out these issues, a novel method which is called the Path-ATT-CNN is proposed by us, making an explicable ATT-CNN model based on united omics data by making use of a recently characterized pathway image. Path-ATT-CNN shows brilliant predictive demonstration difference in primary lung tumor symptom (PLTS) and non-primary lung tumor symptom (non-PLTS) when applied to lung adenocarcinomas (LADCs). The imaginational tool adoption which is linked with statistical analysis enables the status of essential pathways which finally exist in LADCs. In conclusion, Path-ATT-CNN shows that it can be effectively put into use elucidating omics data in an interpretable mode. When people start to figure out key biological correlates of disease, this mode makes promising power in predicting illness.
Introduction
Processing complicated structures and dependencies in data is a difficult job (Ampie et al., 2015). Deep neural network methods offer ways to process the nonlinear model (Bengio et al., 2017). Our goal was to learn many insightful levels of abstraction (Cai and Wei, 2020). Particularly, the attention convolution neural network (ATT-CNN) has obtained achievement (Chen et al., 2020). ATT-CNN has been put into used to solve biological problems because of its nominal grid-structured version in biological collection, and it is difficult for people to solve it (Du et al., 2019). Lately, many articles have used traditional neural networks to interpret omics data collection (Evans et al., 2018). Also, data are offered by the National Cancer Center Research Institute (NCCRI) program which is in the analysis of many molecular parts (Goldstein et al., 2017). Shortage of interpretability is a key part which makes the medical neural networks adoption limited (Golestaneh and Karam, 2017). The trained models or results by biological interpretation are used to understand the complicated human diseases’ biological mechanisms better (Learning, 2020). A method named class activation mapping (CAM) is developed which uses global average pooling. Its task is to increase the interpretability (LeCun et al., 2015). CAM, which makes the image regions visual, is applied by the CNN which precisely generates a target map (Li et al., 2021). There are some limitations to CNN: (i) the CNN architecture that is shown in modeling should be modified, which can be altered by fully connected layers. In addition, the average pooling layer can also alter it. Its goal is to employ CAM, and (ii) CAM of the modified network should be fine-tuned. A more common method was lately developed named as X-gradient-weighted class activation mapping (X-Grad-CAM) (Liu et al., 2019). It uses any target class gradient information which immerses into the convolutional layer. Its task is to generate class activation maps. We created another novel method (Lomonaco et al., 2022) named as Path-ATT-CNN (Nandhini Abirami et al., 2021). This can generate an interpretable ATT-CNN model which uses omics data in cancer outcomes (Wang et al., 2019). Data are input into the ATT-CNN model, and biological pathway images are applied (Wirsching and Weller, 2017). This is made in use of integrated omics data with the low dimensional space including molecular characteristics of lung adenocarcinomas (LADCs) (Wu, 2017). Our goal was to find the biological pathways which are linked with outcomes after modeling. We showed that the method can be used for predicting primary lung tumor symptom in diagnosed patients (Wu et al., 2020) and for differentiating between primary lung tumor symptom (PLTS) and non-primary lung tumor symptom (non-PLTS). The key biomarker identification that is linked with survival in LADC patients can explain the implicit biological performance (Yuan et al., 2018). It plays a vital role in LADCs. In this article, we showed that the ATT-CNN model is better than other methods (Yuan and Huang, 2019), and then, the model applying X-Grad-CAM is interpretable (Yuan et al., 2021a). It makes influential biological pathways visual in identification (Yuan et al., 2017).
Materials and Methods
Data
G,C separately show data for LADCs, RNA expression, and M ∈ Rn×r originate in the Gene Expression Omnibus. Here, n and r refer to the samples and genes numbers, respectively. Primary lung tumor symptom (PLTS) is defined as presence of lung adenocarcinomas in patients, and non-primary lung tumor symptom (non-PLTS) is defined as absence of lung adenocarcinomas in patients (Yuan et al., 2021b).
Pathway Image
In order to figure out this problem, data are input into pathway-level profiles, and pathway information is drawn first. The pathways that are enriched by genes of lung adenocarcinomas (LADCs) are identified. A total of 2,200 pathways are shown. Pathway pi, the RNA expression which is connected with gene data is drawn from the LADC expression matrix (Yun et al., 2019). The pathway produces a midterm matrix
Sequence of Pathways
X-Grad-CAM is used to sequence the pathways. Its goal was to figure out the key PLTO pathways in LADCs. It can find target regions. So, when reciprocal pathways are linked with pathway images, the possibilities of X-Grad-CAM identifying key pathways are wide. It is also in line with the nature of ATT-CNNs. ATT-CNN is able to localize primary pathways in input position. It is originally rational to make connected pathways. Pathways are in nearness on the pathway images. In this way, people can better find important regions. Therefore, 2,200 pathways are ranked in the conduct: person connection between pathways which is figured out on a matrix of 2,200q is generated. Combining all resultant pathway images makes up the matrix. This process occurs in 2,200 pathways. Pathways lie in among unselected situation. It was the most similar to the pathway in the row. Finally, all pathway images have the identical sequence of pathways.
ATT-CNN Architecture
We collect spatial content of a feature map. We make it by applying average pooling and max pooling. This process generates two totally different spatial context terms:
Cross Validation
The model which uses a five-fold cross validation will be trained under 40 repeats. Data which are randomly assigned as follows undergo five-fold cross validation: 20% of data are designed as a test set. Also, the unexhausted data are allotted as a training set, and the validation set is applied to adjust hyper-parameters. The model performs by using the training collection and optimizing hyper-parameters which are in the ATT-CNN model. It is performed in each cross validation which is assessed on the test collection. Also, the area under the curve is used to test the performance.
Biological Interpretation Using X-Grad-CAM
Thinking of an input image, I, l means the scale which is in the target visual layer. Fl is a score set. Also, it shows the manner of the target layer.
In Eq. 2, Mc means a sum of weighted feature diagram. This motivation is strength pixels. Its intensities are good for increasing the score. A localization diagram for every class is regulated to exist in between 0 and 1. The class activation diagrams have the identical size, which is on the feature diagram. It is put into the figure of the input:
Because of biological interpretation which applies X-Grad-CAM, the ATT-CNN model, which is unsimilar with the model that has cross validation, is made by applying all samples. Every sample, which is input into the model, produces two activation diagrams (PLTO and non-PLTO). Figure 2 explains the process of generating the pathway images.
So, the difference map is produced. A statistical analysis, which duplicates this process for all samples, is applied. Assessing the statistical difference between the PLTO and non-PLTO groups is not an easy job. We use the Wilcoxon rank-sum test to figure out this problem for a given pixel.
Comparison With Benchmark Methods
Several machine learning methods are compared with the predictive performance of Path-ATT-CNN. The same settings, which are used as described, are shown for all the experiments. Also, the setting is also in the aforementioned cross validation operation. There is a neural network architecture which is empirically adjusted. It composes an import layer. It comprises five hidden layers. It makes up an output layer. There are nodes in each hidden layer whose numbers are 10 k, 7.5 k, 5 k, 2.5 k, and 500. ReLU which acts as an activation function is located in the hidden layers. The number of dropout rate is calculated using the hidden layers and is set at 80%. The manner of MiNet which initially plans to figure out the identical index that is altered is identified by this method. These several famous methods which are trained by the Adam optimizer act as comparison with Path-ATT-CNN.
Result
Data
Our goal was to explain molecular characteristics of lung adenocarcinomas (LADCs). A total of 226 primary lung LADCs of medical stages I–II are evaluated. They undergo genome-wide expression identification. In total, 174 genes are chosen as upregulated specifically in 79 lung LADCs. A total of 79 cases are split into: 11 ALK-positive patients, 36 A triple-negative LADC patients, and 32 patients of B triple-negative LADC patients. They are unattended clustering according to the demonstration of 174 genes. Group A triple-negative LADC patients show remarkably worse prognosis for relapse than LADC patients and group B triple-negative LADC cases. A total of 9 genes are authenticated to remarkably put in group A triple-negative LADC cases which are good for finishing their prognoses. Genes, which distinguish this group of LADCs, are suggested to be helpful for patients chosen for additive chemotherapy after operation resection of stages I–II triple-negative LADCs and provide useful insights into the process of molecular target therapies for these patients. The data of LACDs are composed of RNA expression which is assessed at the gene level: RNA expression for 843 genes in 578 cases. There are patients who survived in the last follow-up ≤ 10 months. Also, those patients are removed for finishing the task and receiving better outcomes. Also, PLTO and non-PLTO groups have 155 and 432 cases, respectively. Class weights which are set in the model are important. This method can figure out imbalance data in the light of the ratio of the quantity of samples in the two groups. In total, out of 489 unusual genes we selected 346 TP53. PCA tests have shown individual pathways, respectively, on data type which transfers gene-level information to pathway-level information. The average smoking age in PLTO and non-PLTO groups is 20 and 30 years, respectively. The difference in smoking age in the two groups is remarkable with a p-value < 0.001. This method is conducted by applying a two-sample t-test. Smoking age plays a key role in the survival. The variable is in the ATT-CNN model. Also, it will be put into the fully connected layer.
Modeling Performance
The ATT-CNN model which is used to identify PLTO and non-PLTO groups in GSE, which uses pathway images is evaluated in a five-fold CV scheme. In total, 2,200 rows and 3 × q columns are generated by a pathway image representation. Here, q refers to the number of PCs. Also, each row represents the same pathway. In the experiment, the size of the experiment is valued with q = 1. In every experiment, which has an average AUC over 30 iterations of the five-fold CV, is reported by us. The performance of PC (q = 1) is shown in Figure3, which reaches an average AUC of 0.763. The performance of q = 2, a model without smoking age, and the average AUC is 0.697. This situation shows the importance of smoking age. Otherwise, the ATT-CNN model on pathway images with q = 2, which uses a group of two omics types, is conducted. In conclusion, the application of omics types in modeling shows better performance.
Comparison With Benchmark Methods
There are four different machine learning methods mentioned by us in this article. Also, four methods are compared with the performance of Path-ATT-CNN. The ATT-CNN model is used to identify PLTO and non-PLTO groups in GSE. Class weights are put into the model because the ratio of the quantity of samples is in the two groups. With regard to most cancers, Path-ATT-CNN performed better than any other methods (Yuan et al., 2021a). The details are shown in Table1.
Identification of Key Pathways
The ATT-CNN model, which applies samples lying in pathway images, is produced. To finish this, we should use the first two PCs for patient omics types, which is able to make helpful insights into biological interpretations. Mechanisms are linked with survival in LADC patients and self-reliant from smoking age. This is used to classify the biological mechanisms (Yuan et al., 2021a). Individual pathway images are integrated into the experiment. X-Grad-CAM produced two activation maps (LADCs and non-LADCs).
All 20 pixels in the four hot spots are observed in PCs of RNA demonstration. Several pathways which are loaded in both PC 1 and PC 2 are found. Kaplan–Meier analysis is used to analyze the existing time of two groups which are classified by a median separation of PC values which are in the key pixels.
Discussion
Lung cancer has become a primary cause of cancer deaths, which annually leads to over one million deaths worldwide. There are over 1.2 million new cases which are diagnosed every year. Lung adenocarcinoma reaches an average five-year survival rate of 15%. Lung adenocarcinoma is the most common type of lung cancer. The primary reason is late examination and the scarcity of late treatments. In this situation, smoking undoubtedly becomes the primary reason why many people have been suffering from lung cancer. About 10% of cases occur in patients who have never smoked. LADCs are becoming a major public health concern. The identification of biomarkers can understand the implicit aggressiveness. It plays a key role in the biology of LADCs. Many articles have found assumed survival-associated biomarkers in LADCs. They make it by using the omics platform. The analysis of data which come from individual articles nowadays lacks the capability to find strong biomarkers that are able to make medical decisions in LADCs. United analysis which are about omics data is able to offer insightful information. This information is about the complicated biology and molecular nonuniformity of cancer. Lately, ATT-CNNs have got a huge achievement. Second, applying ATT-CNNs in bioinformatics problems and biological data shown in non-grid structures is a limitation. The application of deep learning techniques of biological data is another limitation. It is put into the interpretability which is in the trained model. Also, it is seen as a “black box.” We recently have generated a conception of “pathway image.” Our goal was to show omics data which made us to adapt ATT-CNNs in biological pathway analysis. Pathways, which are in a connected activity, are put more strongly in the pixel. It can fully leverage ATT-CNN capabilities skillfully which is connected to pixel activity. Also, the ATT-CNN model which is on the united omics data is at the pixel level that remarkably improves predictive performance. Also, it adds co-complemented message to the model. Possibility of the pathway-level analysis that removes noisy information is high. It may lie in individual omics type. This represents every pathway which is seen as a linear collection of related genes. The adoption of X-Grad-CAM and the ATT-CNN model skillfully make biological interpretations, which identify pathways linked with survival in LADC patients. The simplicity and efficiency of the ATT-CNN modeling and X-Grad-CAM lie in biological explanation (Zheng et al., 2017). In this article, the pathway image is for every specimen. Also, its size is linked with 2,200 pathways which are in rows and a number of PCs that are predefined in columns (Zhou et al., 2020). There are missing genes in the omics data because of the dimensional reduction of each pathway. PCA, which releases the difficulty, figures out the missing data shown without those genes. The resulting pathway which is based on the ATT-CNN model effectively predicts PLTO for LADC patients. When omics types are used, it reaches an average AUC of 0.783. This process is applied at the first two PCs for omics type. When omics type is applied, similar performance is performed with an average AUC of 0.769. In conclusion, the application of omics data has a better performance. The biological explanation of the class activation maps is produced. It shows the intensity difference in the two activation diagrams for every sample. Otherwise, the map which applies an X-Grad-CAM technique is derived. There are differences between PLTO and non-PLTO in statistical analysis. Its goal was to find a special pixel on the difference diagram. Although Path-ATT-CNN has demonstrated effective performances in predicting performance, there are some biological problems in processing biological data by the ATT-CNN model. The speed of processing biological issues would be smaller because of too many network layers. Applying the back-propagation method will make adjusting parameters of the input layer slowly. First, applying SGD methods will make trained data restrain local minimum and not global minimum. Second, pooling layers may lose much valuable information and ignore connection of locality and entirety. Third, because of feature extraction, it put a black box in an improvement of network function. This can be further developed in the future. It will be a future extension which includes more closer configuration. It is an essential impact to predict illness.
Conclusions
In this article, Path-ATT-CNN is designed. The method starts from the concept of “pathway image,” that generates application of omics data. Through this model, we can remarkably learn about metabolic pathways. The good pathway is helpful to patients who can learn about their body mechanism. It is able to predict patient’s lung cancer situation better than any other methods. Then, the use of X-Grad-CAM which is directly applied on the pathway image makes the identification of a plausible pathway. In conclusion, this article shows the future potential to use ATT-CNNs on omics data and X-Grad-CAM. In this way, we can detect more complicated biological correlates of disease disadvantages. We also found a key pathway. There is a limitation to the model, that is, the identification of identical pathways that need closed alignment of the pathways. This can be improved in the next 10 years.
Data Availability Statement
The original contributions presented in the study are included in the article/supplementary material; further inquiries can be directed to the corresponding author.
Author Contributions
LiY and ZY conceived the method. CH and JL designed the method. GY and LaY conducted the experiments and wrote the main manuscript text. TS and JZ prepared figures. All authors reviewed the manuscript.
Funding
This work was supported by the grant from the Natural Science Foundation of Shandong Province, China (No. ZR2020QF038), the National Natural Science Foundation of China (Grant Nos. 62002189 and 62002187), the Young Taishan Scholars Program, Shandong, China (No. tsqn201909178), and partly supported by the Natural Science Foundation of Shandong Province, China (Grant Nos. ZR2021MH104 and ZR2020MH208).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We thank the funding supports from the Natural Science Foundation of Shandong Province, China (Nos. ZR2020QF038, ZR2021MH104, and ZR2020MH208), NSFC (Nos. 62002189 and 62002187), and Young Taishan Scholars Program, Shandong, China (No. tsqn201909178).
References
Ampie, L., Woolf, E. C., and Dardis, C. (2015). Immunotherapeutic Advancements for Glioblastoma. Front. Oncol. 5, 12. doi:10.3389/fonc.2015.00012
Cai, W., and Wei, Z. (2020). Remote Sensing Image Classification Based on a Cross-Attention Mechanism and Graph Convolution. IEEE Geoscience Remote Sens. Lett. 19. doi:10.1109/LGRS.2020.3026587
Chen, L., Chen, J., Hajimirsadeghi, H., and Mori, G. (2020). Adapting Grad-CAM for Embedding Networks. arxiv, 2783–2792.
Du, J., Gui, L., He, Y., Xu, R., and Wang, X. (2019). Convolution-Based Neural Attention with Applications to Sentiment Classification. IEEE Access 7, 27983–27992. doi:10.1109/access.2019.2900335
Evans, R., Jumper, J., Kirkpatrick, J., Sifre, L., Green, T., Qinv, C., et al. (2018). De Novo structure Prediction with Deeplearning Based Scoring. Annu. Rev. Biochem. 77, 363–382.
Goldstein, A., Veres, P., Burns, E., Briggs, M. S., Hamburg, R., Kocevski, D., et al. (2017). An Ordinary Short Gamma-Ray Burst with Extraordinary Implications: Fermi -GBM Detection of GRB 170817A. Astrophysical J. Lett. 848 (2), L14. doi:10.3847/2041-8213/aa8f41
Golestaneh, S. A., and Karam, L. J. “Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, July 2017, 596–605.
Learning, D. (2020). Deep Learning,” High-Dimensional Fuzzy Clustering. Chicago: Chicago International Breast Course.
LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep Learning. Nature 521 (7553), 436–444. doi:10.1038/nature14539
Li, Z., Liu, F., Yang, W., Peng, S., and Zhou, J. “A Survey of Convolutional Neural Networks: Analysis, Applications, and Prospects,” in Proceedings of the IEEE Transactions on Neural Networks and Learning Systems, June 2021. doi:10.1109/tnnls.2021.3084827
Liu, J., Zha, Z.-J., Hong, R., Wang, M., and Zhang, Y. “Deep Adversarial Graph Attention Convolution Network for Text-Based Person Search,” in Proceedings of the 27th ACM International Conference on Multimedia, October 2019, 665–673.
Lomonaco, V., Pellegrini, L., Rodriguez, P., Caccia, M., She, Q., Chen, Y., et al. (2022). CVPR 2020 Continual Learning in Computer Vision Competition: Approaches, Results, Current Challenges and Future Directions. Artif. Intell. 303, 103635. doi:10.1016/j.artint.2021.103635
Nandhini Abirami, R., Durai Raj Vincent, P., Srinivasan, K., Tariq, U., and Chang, C.-Y. (2021). Deep CNN and Deep GAN in Computational Visual Perception-Driven Image Analysis. Complexity 2021. doi:10.1155/2021/5541134
Wang, L., Huang, Y., Hou, Y., Zhang, S., and Shan, J. “Graph Attention Convolution for Point Cloud Semantic Segmentation,” in Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, June 2019, 10296–10305.
Wirsching, H.-G., and Weller, M. (2017). Glioblastoma,” Malignant Brain Tumors. Hong Kong: Glioblastoma, 265–288. doi:10.1007/978-3-319-49864-5_18
Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., and Yu, P. S. (2020). A Comprehensive Survey on Graph Neural Networks. IEEE Trans. Neural Netw. Learn Syst. 32 (1), 4–24. doi:10.1109/TNNLS.2020.2978386
Wu, J. (2017). Introduction to Convolutional Neural Networks. Natl. Key Lab Nov. Softw. Technol. 5, 495.
Yuan, L., and Huang, D.-S. (2019). A Network-Guided Association Mapping Approach from DNA Methylation to Disease. Sci. Rep. 9 (1), 1–16. doi:10.1038/s41598-019-42010-6
Yuan, L., Zheng, C.-H., Xia, J.-F., and Huang, D.-S. (2015). Module Based Differential Coexpression Analysis Method for Type 2 Diabetes. BioMed Res. Int. 2015, 836929. doi:10.1155/2015/836929
Yuan, L., Zhu, L., Guo, W. L., Zhou, X., Zhang, Y., Huang, Z., et al. (2016). Nonconvex Penalty Based Low-Rank Representation and Sparse Regression for eQTL Mapping. IEEE/ACM Trans. Comput. Biol. Bioinform 14 (5), 1154–1164. doi:10.1109/TCBB.2016.2609420
Yuan, L., Yuan, C.-A., and Huang, D.-S. (2017). FAACOSE: A Fast Adaptive Ant Colony Optimization Algorithm for Detecting SNP Epistasis. Complexity 2017. doi:10.1155/2017/5024867
Yuan, L., Guo, L. H., Yuan, C. A., Zhang, Y. H., Han, K., Nandi, A., et al. (2018). Integration of Multi-Omics Data for Gene Regulatory Network Inference and Application to Breast Cancer. IEEE/ACM Trans. Comput. Biol. Bioinform 16 (3), 782–791. doi:10.1109/TCBB.2018.2866836
Yuan, L., Sun, T., Zhao, J., and Shen, Z. (2021). A Novel Computational Framework to Predict Disease-Related Copy Number Variations by Integrating Multiple Data Sources. Front. Genet. 12, 696956. doi:10.3389/fgene.2021.696956
Yuan, L., Zhao, J., Sun, T., and Shen, Z. (2021). A Machine Learning Framework that Integrates Multi-Omics Data Predicts Cancer-Related LncRNAs. BMC Bioinforma. 22 (1), 1–18. doi:10.1186/s12859-021-04256-8
Yun, S., Jeong, M., Kim, R., Kang, J., and Kim, H. J. (2019). Graph Transformer Networks. Adv. neural Inf. Process. Syst. 32.
Zheng, H., Fu, J., Mei, T., and Luo, J. “Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition,” in Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, October 2017, 5209–5217.
Keywords: ATT-CNN, Path-ATT-CNN, primary lung tumor symptom, pathways, neural network
Citation: Yuan L, Lai J, Zhao J, Sun T, Hu C, Ye L, Yu G and Yang Z (2022) Path-ATT-CNN: A Novel Deep Neural Network Method for Key Pathway Identification of Lung Cancer. Front. Genet. 13:896884. doi: 10.3389/fgene.2022.896884
Received: 15 March 2022; Accepted: 09 May 2022;
Published: 16 June 2022.
Edited by:
Arpit Tandon, Sciome LLC, United StatesReviewed by:
Durai Raj Vincent P. M, VIT University, IndiaYipeng Song, Yantai Yuhuangding Hospital, China
Hanyao Huang, Nanyang Institute of Technology, China
Copyright © 2022 Yuan, Lai, Zhao, Sun, Hu, Ye, Yu and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhenyu Yang, eXp5QHFsdS5lZHUuY24=