Skip to main content

ORIGINAL RESEARCH article

Front. Immunol., 23 May 2023
Sec. Cancer Immunity and Immunotherapy
This article is part of the Research Topic Community Series in Unveiling the Tumor Microenvironment by Machine Learning to Develop New Immunotherapeutic Strategies, volume II View all 26 articles

Combining WGCNA and machine learning to construct basement membrane-related gene index helps to predict the prognosis and tumor microenvironment of HCC patients and verifies the carcinogenesis of key gene CTSA

Weijie Sun,&#x;Weijie Sun1,2†Jue Wang&#x;Jue Wang1†Zhiqiang WangZhiqiang Wang1Ming XuMing Xu1Quanjun LinQuanjun Lin1Peng Sun*Peng Sun1*Yihang Yuan*Yihang Yuan1*
  • 1Department of General Surgery, Tongren Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China
  • 2Department of Infectious Diseases, The First Affiliated Hospital of Anhui Medical University, Hefei, China

Hepatocellular carcinoma (HCC) is a malignant tumor with high recurrence and metastasis rates and poor prognosis. Basement membrane is a ubiquitous extracellular matrix and is a key physical factor in cancer metastasis. Therefore, basement membrane-related genes may be new targets for the diagnosis and treatment of HCC. We systematically analyzed the expression pattern and prognostic value of basement membrane-related genes in HCC using the TCGA-HCC dataset, and constructed a new BMRGI based on WGCNA and machine learning. We used the HCC single-cell RNA-sequencing data in GSE146115 to describe the single-cell map of HCC, analyzed the interaction between different cell types, and explored the expression of model genes in different cell types. BMRGI can accurately predict the prognosis of HCC patients and was validated in the ICGC cohort. In addition, we also explored the underlying molecular mechanisms and tumor immune infiltration in different BMRGI subgroups, and confirmed the differences in response to immunotherapy in different BMRGI subgroups based on the TIDE algorithm. Then, we assessed the sensitivity of HCC patients to common drugs. In conclusion, our study provides a theoretical basis for the selection of immunotherapy and sensitive drugs in HCC patients. Finally, we also considered CTSA as the most critical basement membrane-related gene affecting HCC progression. In vitro experiments showed that the proliferation, migration and invasion abilities of HCC cells were significantly impaired when CTSA was knocked down.

1 Introduction

Hepatocellular carcinoma (HCC) is responsible for about 90% of primary liver cancers (1). It is also one of the most fatal malignant tumors worldwide, with high morbidity and mortality rates (2, 3). The frequent occurrence of metastasis and recurrence is a major contributing factor to the poor prognosis of HCC patients (4). Despite the development of numerous drug combination strategies for the treatment of HCC, the current level of patient survival time has not yet met satisfactory standards. Consequently, there is an urgent need to identify new biomarkers that can more accurately predict the prognosis of HCC.

Basement membranes (BM) are a ubiquitous and unique type of extracellular matrix that plays a key role in cancer cell metastasis (5). In the case of HCC and the surrounding uninvolved liver tissue, the BM is primarily made up of three components: fibronectin (FN), laminin (LAM), and collagen IV (Coll IV) (6). BM is known to affect numerous physiological and pathological activities of cells including cell proliferation, adhesion, migration, and vascular remodeling (7, 8). As a result, in most cancers, BM plays a crucial role in driving cell metastasis (5, 9, 10). Due to the significant role of BM in cancer metastasis, it is an ideal target for anticancer drugs. Previous studies have found that stable markers can be created using different gene sets such as cuproptosis and necroptosis to predict the prognosis of HCC patients (11, 12). Recently, Jayadev et al. have redefined 222 BM-related genes (BMRG) and proteins (13), but a robust prognostic model based on BMRG in HCC is yet to be developed.

In this study, we screened 222 BMRG and identified 4 that were used to construct the basement membrane-related gene prognostic index (BMRGI). This index helps to more accurately predict the prognosis of patients with hepatocellular carcinoma (HCC). Furthermore, we evaluated the clinical relevance and impact of BMRGI on the tumor microenvironment. More importantly, we identified CTSA as a key BMRG in HCC, and comprehensively analyzed the expression differences of CTSA in HCC, and have confirmed that the expression of CTSA has a significant impact on the proliferation, migration, and metastasis of HCC cells.

2 Methods and materials

2.1 Data download and processing

The mRNA expression data of HCC patients with corresponding clinical information and somatic mutation data were downloaded from The Cancer Genome Atlas (TCGA, https://portal.gdc.cancer.gov/) database, and the mRNA expression data and clinical information from the Japan-HCC cohort were downloaded at International Cancer Genome Consortium (ICGC, https://dcc.icgc.org/). When performing correlation data analysis, we excluded cases with missing data. Finally, when performing prognostic analysis, we chose to exclude cases with a survival time ≤30. The single-cell RNA seq (scRNA-seq) data of 4 HCC patients were obtained from GSE146115 in the GEO database (https://www.ncbi.nlm.nih.gov/geo/), with a total of 27227 genes and 3200 cells obtained.

2.2 Screening of WGCNA and differential BMRG

The “WGCNA” package was used to construct the gene co-expression network of BMRG in the TCGA-HCC dataset (14). The core module was considered the one with the highest Pearson coefficient and also the one most associated with clinical traits. Furthermore, we analyzed differentially expressed genes (|log2FC| > 0.585, False Discovery Rate (FDR)< 0.05) in the TCGA-HCC dataset using the “Limma” package. Finally, we further investigated the 47 common genes.

2.3 Construction and verification of BMRGI

47 common genes were analyzed by univariate Cox analysis based on the “survival” package, and potential BMRG affecting the overall survival of HCC patients were screened out (p<0.05). Then, these candidate genes were analyzed by using the least absolute shrinkage and selection operator (LASSO). Based on the analysis results, we established a four-gene optimal prognostic model. The calculation formula of BMRGI for each HCC patient is as follows:

BMRGI=i=1nExpression(i)*Coefficient(i)

Where X refers to the expression level of the selected gene, and Coef is the coefficient of the selected gene. In addition, the same calculation method is applied to the verification queue ICGC. According to the median BMRGI, HCC patients were divided into high BMRGI group and low BMRGI group. Kaplan–Meier curves were used to assess differences in OS between different BMRGI groups.

2.4 scRNA-seq data processing and analysis

The Seurat package is used for preprocessing and filtering of scRNA-seq (15). The PercentageFeatureSet function is used to calculate the mitochondrial gene content in cells. We further analyzed the cells in which the number of genes was >200 and the proportion of mitochondrial genes was<10%. We set the number of principal components (PC) to 20, the resolution to 0.4, and the 1500 genes with the largest variation between cells to cluster the cells.

2.5 CellChat analysis

We use CellChat to quantify and infer the communication links between different cell types from scRNA-seq data, and identify the signal input and output among them. In this study, we filtered for cell communication of less than 10 cells.

2.6 Gene ontology analysis, kyoto encyclopedia of genes and genomes analysis and gene set enrichment analysis

The “Limma” package was used to analyze differentially expressed genes (DEGs) between high and low BMRGI groups (|log2FC| > 1, FDR< 0.05). GO analysis was performed based on the “ clusterProfiler” package. KEGG analysis was performed based on the “clusterProfiler”, “org.Hs.eg.db”, “enrichplot” package. In addition, we also performed GSEA using the “clusterProfiler” package to explore biological differences among different BMRGI groups.

2.7 Analysis of immunological properties

The enrichment scores of 16 immune-related cells and 13 immune-related terms in HCC samples were calculated using the ssGSEA algorithm based on the R packages “GSVA” and “GSEABase”.

We also summarized common immune checkpoint molecules and HLA family genes, and analyzed the correlation between BMRGI and the expression of each gene, and displayed it with a radar map. Furthermore, we explored the somatic mutation profile of TCGA-HCC samples and listed the top10 mutation-prone genes in different BMRGI subgroups. In addition, we also compared the difference of TMB in the high BMRGI group and the low BMRGI group. Finally, the TIDE (http://tide.dfci.harvard.edu/) algorithm was used to predict and evaluate the response of HCC patients to immunotherapy.

2.8 Sensitivity analysis of common drugs

We use the R package “oncoPredict” (16) for the evaluation of common drug sensitivities.

2.9 Identification of core BMRG

SVM-REF (17), LASSO (18)and RandomForest (19) are commonly used machine learning methods with excellent classification performance. In biology-related research, it is often used for the screening of characteristic genes (20). In this study, we use these three types of machine learning to filter out characteristic BMRG, and use intersection to filter out the most critical BMRG.

2.10 Multilevel expression verification of CTSA

We analyzed the differential expression of CTSA at the mRNA level of HCC tissues online from the GEPIA2 database (http://gepia2.cancer-pku.cn/#index) (combined samples from TCGA and GTEx databases) (21). In addition, we analyzed the expression differences of CTSA at the protein level in the CPTAC database online using the UCLCAN database (http://ualcan.path.uab.edu/index.html) (22). Finally, the HPA database obtained the immunohistochemical images of CTSA in normal liver tissues and HCC tissues (23), and obtained the basic information of the corresponding tissue samples.

2.11 RNA extraction, and real-time quantitative PCR

Cell total RNA was extracted using Trizol reagent (Invitrogen, USA)). RNA extraction and RT-qPCR as previously described (24). Briefly, RNA was reversed to cDNA using PrimeScript™ RT Master Mix (Takara Bio, JAPAN). Fluorescence quantification was performed by TB-Green qPCR (Takara Bio, JAPAN) and normalized to β-actin. The information of all designed primers is listed in Supplementary Table 1.

2.12 Cell culture, transient transfection

All cell lines used in this study (including normal liver cell line LO2 and HCC cell lines HEPG2, BEL7402 and HCCLM3) were donated by Dr. Dai (25). All cell lines were cultured in complete DMEM medium (DMEM medium with 10% fetal bovine serum and 1% penicillin-streptomycin). Transient transfections were performed using jetPRIME Transfection Reagent (Polyplus, China) and followed the manufacturer’s instructions. siRNA sequences were designed by Tsingke Biotechnology Co., Ltd. The SiCTSA sequence is as follows, SiCTSA-1: sense-GCCUCUUUCCGGAGUACAA; antisense-UUGUACUCCGGAAAGAGGC. SiCTSA-2: sense-CUGCUUAGCUCACAGAAAU; antisense-AUUUCUGUGAGCUAAGCAG.

2.13 Cell counting kit-8 (CCK8) experiment

We planted 2×103 cells in a 96-well plate, and set 5 replicate wells in each group, cultured them for 0 hour, 24 hours and 48 hours, respectively, and then added 10ul CCK8 reagent (Targetmol, USA) and incubated at 37°C for 2 hours. Absorbance was then measured at 450 nm using a microplate reader (TECAN, Switzerland).

2.14 Transwell experiment

We planted 5×104 cells in the upper chamber (Corning, USA) containing 250ul serum-free medium. The upper chamber was without Matrigel (Corning, USA) for migration experiments, with Matrigel for invasion experiments, and the lower chamber add 800ul complete medium. After 24 hours of incubation, the cells were fixed with 4% paraformaldehyde and stained with 0.1% crystal violet. The cells on the upper surface of the upper chamber were wiped with a cotton swab, photographed under a microscope (Leica, Germany) at 100 times, and then counted.

2.15 Statistical analysis

All bioinformatics analyzes were performed on R software (version 4.1.2). Continuous variables that were not normally distributed were tested using the Wilcoxon test. Correlation analysis between BMRGI and drug IC50 was performed using the spearman method. The Kaplan-Meier method was used to draw the survival curves of different subgroups. All experimental data were analyzed for variance using Student’s T-test. p or FDR< 0.05 represents a statistical difference.

3 Result

3.1 WGCNA identified BM key module genes

According to the Materials and methods section, we identified 222 BMRG. First, we conducted WGCNA on 222 BMRG. By setting a minimum of 25 genes within a module, module connectivity (Figure 1A), 4 modules were finally identified (Figure 1B). According to the correlation thermograms of the modules, we found that the blue modules had the highest correlations with clinical traits (Figure 1C). Therefore, we choose the blue module for further analysis. Second, we performed differential analysis on 222 genes, and the results showed a total of 131 DEGs, of which 122 BMRG were up-regulated and 9 BMRG were down-regulated (Figure 1D). Furthermore, we showed the correlation of blue module genes with DEGs by Venn diagram, and finally obtained 47 common genes (Figure 1E).

FIGURE 1
www.frontiersin.org

Figure 1 DEG screening of key module genes. (A) Scale independence and mean connectivity. (B) Gene clustering dendrogram, a total of 4 modules were identified. (C) Correlation heatmap of modules and clinical traits. (D) Volcano plot for BMRG differential analysis. (E) Venn diagram showing common genes of key module genes and differential genes.

3.2 Construction of BMRGI for HCC patients

We first performed univariate Cox analysis on 47 common genes, and the results showed that 18 BMRG were risk factors affecting OS in HCC patients (Figure 2A). LASSO analysis was further performed on 18 prognostic genes, and finally we identified 4 BMRG (Figure 2B). In addition, we also analyzed the expression differences and prognostic value of the 4 BMRG. Differential analysis showed that CTSA, ADAM9, LAMB3, and SPON2 were highly expressed in HCC (Figure 2C), and kaplan-meier analysis showed that high expression usually means poor prognosis (Figure 2D). Finally, we constructed the basement membrane-related gene prognostic index BMRGI based on the results of LASSO analysis.

FIGURE 2
www.frontiersin.org

Figure 2 Construction of BMRGI and verification of its prognostic value. (A) Univariate Cox regression analysis of common genes. (B) LASSO analysis. (C) Expression difference analysis of CTSA, ADAM9, LAMB3 and SPON2. (D) Kaplan-Meier survival curves of CTSA, ADAM9, LAMB3 and SPON2. (E) Visual distribution of risk scores for TCGA cohort and ICGC cohort. (F) Survival status and time of TCGA and ICGC cohort. (G) Kaplan-Meier survival curves of BMRGI in TCGA and ICGC cohort.

According to the median BMRGI, HCC patients were divided into high BMRGI group and low BMRGI group. We use TCGA-HCC as the training cohort and ICGC as the validation cohort. First, we showed the risk scores of the training cohort and validation cohort more intuitively (Figure 2E). Furthermore, in both the TCGA-HCC cohort and the ICGC cohort, the higher the BMRGI, the shorter the survival time of HCC patients (Figure 2F). Finally, we performed a kaplan-meier analysis, and the results showed that the OS of the high BMRGI group was significantly lower than that of the low BMRGI group in both the TCGA cohort and the ICGC cohort (Figure 2G).

3.3 Single-cell transcriptional profiling and cell-cell interactions in HCC tissue

We used tSNE to perform dimensionality reduction and clustering on the preprocessed scRNA-seq data, and finally obtained 12 clusters (Figure 3A). In addition, we also displayed the most significantly expressed genes in the 12 clusters using a heatmap (Figure 3B). Cell types were automatically annotated by the SingleR package, and these 12 clusters were clustered into 5 cell types, including Hepatocytes, T cells, Macrophage, B cell and NK cell (Figure 3C).

FIGURE 3
www.frontiersin.org

Figure 3 scRNA-seq analysis and CellChat Analysis (A) tSNE analysis to classify cell clusters. (B) Heatmap showing highly expressed genes in cell clusters. (C) The “SingleR” package annotates cell clusters into 5 cell types. (D) Network diagram of the number and weight of connections between different cell types. (E) Diagram of the communication network between cells and other cells. (F) Bubble diagram of receptor ligand molecules involved in cell communication. (G, H) Distribution and expression levels of model genes in 5 cell types.

In addition, we further evaluated the interactions between different cells using the “CellChat” package. Figure 3D shows the number and weight of interactions among the five cell types. Furthermore, we present these results separately for a clearer picture of the strength of cell-cell interactions (Figure 3E). Overall, Hepatocytes rarely act as receptors for signals from the other four types of immune cells, but they can communicate with immune cells by emitting signals. Immune cells interact and receive signals frequently. In addition, receptor ligand molecules when mediating cell-cell interactions are shown in Figure 3F.

Finally, we explored the distribution of model genes in different cell types and showed the expression levels of the model in different cell types in bubble plots (Figures 3G, H). In brief, CTSA was highly expressed in hepatocytes, NK cells, and macrophages, SPON2 was highly expressed only in hepatocytes, whereas ADAM9 and LAMB1 were expressed at low levels in all cell types.

3.4 Comprehensive analysis of clinical parameters in HCC patients

As shown in Figure 4A, the heat map of BMRGI and common clinicopathological parameters, the results showed that the tumor stages of HCC patients with different BMRGI groups had statistical differences. In addition, we further determined the prognostic value of BMRGI in patients with different pathological features. The results showed that the high BMRGI group had significantly worse OS than the low BMRGI group in HCC patients with different clinicopathological parameters (age, gender, tumor grade and stage) (p< 0.1, Figures 4B–I). These results suggest that our BMRGI can effectively predict the prognosis of HCC patients with different clinicopathological features.

FIGURE 4
www.frontiersin.org

Figure 4 Correlation analysis between BMRGI and clinicopathological features. (A) Heatmap of the distribution of clinical case characteristics of patients in the high BMRGI group and low BMRGI group. (B, C) Kaplan-meier survival curves of high BMRGI group and low BMRGI group in different age groups. (D, E) Kaplan-meier survival curves of high BMRGI group and low BMRGI group in different gender groups. (F, G) Kaplan-meier survival curves of high BMRGI group and low BMRGI group under different grading groups. (H, I) Kaplan-meier survival curves of high BMRGI group and low BMRGI group in different stages.

3.5 Construction and evaluation of clinical nomogram based on BMRGI

In order to construct a more practical and stable nomogram, we incorporated several common clinicopathological parameters (age, gender, tumor grade and stage). Univariate Cox analysis showed that tumor stage and BMRGI were risk factors affecting the prognosis of HCC patients (Figure 5A). Multivariate Cox analysis confirmed that tumor stage and BMRGI were independent risk factors affecting the prognosis of HCC patients after adjusting for other clinicopathological parameters (Figure 5B). Given the high correlation between BMRGI and prognosis of HCC patients. We constructed a new nomogram combining common clinicopathological parameters and BMRGI (Figure 5C). We first evaluated the AUC value of various indicators to predict the prognosis of HCC patients using the ROC curve, and the results showed that the ability of BMRGI to predict the prognosis of HCC patients was significantly better than other clinicopathological features (including the classic indicator tumor stage), and the nomogram constructed based on this further improved the accuracy of predicting the prognosis of HCC patients (Figure 5D). In addition, the excellent accuracy and robustness of the nomogram in assessing the 1-year, 3-year, and 5-year survival of patients was further illustrated by ROC curves and calibration curves (Figures 5E, F).

FIGURE 5
www.frontiersin.org

Figure 5 Construction of clinical nomogram. (A, B) Forest plots for univariate and multivariate Cox regression analysis. (C) Nomogram combining common clinical parameters and BMRGI. (D) ROC curves for clinical parameters, BMRGI and nomogram. (E) ROC curves of nomograms predicting 1-year, 3-year and 5-year survival rates. (F) Calibration curves for nomograms.

3.6 GO, KEGG and GSEA analysis related to BMRGI

First, we analyzed the genetic differences between the high BMRGI group and the low BMRGI group (|log2FC| > 1, FDR< 0.05). Based on these differential genes, we further performed GO analysis and KEGG analysis to explore their biological characteristics. GO analysis results showed that, in terms of biological process, DEGs were mainly enriched in “ membrane invagination, phagocytosis, engulfment, phagocytosis, recognition, plasma membrane invagination, phagocytosis, humoral immune response mediated by circulating immunoglobulin, humoral immune response, B cell receptor signaling pathway, cell chemotaxis, leukocyte migration”. In terms of cellular composition, DEGs were mainly enriched in “immunoglobulin complex, collagen−containing extracellular matrix, immunoglobulin complex, circulating, external side of plasma membrane, basal plasma membrane, Golgi lumen, basal part of cell, apical plasma membrane, basolateral plasma membrane, apical part of cell”. and in terms of molecular functions, DEGs were mainly enriched in “antigen binding, immunoglobulin receptor binding, extracellular matrix structural constituent, collagen binding, glycosaminoglycan binding, fibronectin binding, sulfur compound binding, heparin binding, growth factor binding, insulin−like growth factor binding” (Figure 6A). The results of KEGG analysis showed that DEGs were only enriched in the Focal adhesion signaling pathway (Figure 6B).

FIGURE 6
www.frontiersin.org

Figure 6 GO analysis, KEGG analysis and GSEA analysis. (A) Circle and bubble charts for GO analysis. (B) Barplot and bubble charts for KEGG analysis. (C) GSEA analysis of high BMRGI group and low BMRGI group.

In addition, GSEA analysis was further carried out in this study. The results showed that the signal pathways affected by the high BMRGI group were mainly enriched in “ KEGG CELL ADHESION MOLECULES CAMS, KEGG CYTOKINE CYTOKINE RECEPTOR INTERACTION, KEGG ECM RECEPTOR INTERACTION, KEGG FOCAL ADHESION, KEGG NEUROACTIVE LIGAND RECEPTOR INTERACTION”. The signal pathways affected by the low BMRGI group were mainly enriched in “KEGG DRUG METABOLISM CYTOCHROME P450, KEGG FATTY ACID METABOLISM, KEGG GLYCINE SERINE AND THREONINE METABOLISM, KEGG METABOLISM OF XENOBIOTICS BY CYTOCHROME P450, KEGG RETINOL METABOLISM”(Figure 6C).

3.7 Comprehensive analysis of the correlation between BMRGI and tumor microenvironment

In view of the important guiding significance of immune checkpoint molecules and HLA family molecules in immunotherapy. We analyzed the correlation between BMRGI and 48 common immune checkpoint molecules and 24 HLA family molecules. The results showed that BMRGI was significantly positively correlated with 41 immune checkpoint molecules as well as 23 HLA family molecules (Figures 7A, B). In addition, we assessed the levels of 16 immune-related cells and 13 immune-related terms in tissue samples from HCC patients using ssGSEA. In terms of immune-related cells: Compared with the low BMRGI group, aDCs, DCs, iDCs, Macrophages, pDCs, Th1_cells, Th2_cells, and Treg were significantly increased in the high BMRGI group, while NK_cells were significantly decreased (Figure 7C). In terms of immune-related terms, compared with the low BMRGI group, the levels of APC_co_Stimulation, CCR, Check-point, HLA, MHC_class_I, and Parainflammation were significantly increased, while Type_II_IFN_Reponse was significantly decreased (Figure 7D). Then, we analyzed the somatic mutation profile of HCC patients and identified the top 10 mutated genes in the high and low BMRGI groups. The results showed that TP53 mutations were significantly lower in the high BMRGI group than in the BMRGI group (Figures 7E, F). However, there was no statistical difference in TMB between the high and low BMRGI groups (Figure 7G). Finally, we assessed the sensitivity to immunotherapy in the high and low BMRGI groups. The results showed that TIDE was lower in the high BMRGI group, indicating that the lower the possibility of immune escape, the better the effect of immunotherapy (Figure 7H).

FIGURE 7
www.frontiersin.org

Figure 7 Correlation analysis between BMRGI and tumor microenvironment and common drug sensitivity. (A) Correlation between BMRGI and immune checkpoint molecules. (B) Correlation between BMRGI and HLA family molecules. (C) Differences in immune cell infiltration between high and low BMRGI groups. (D) Differences in immune-related terms between high and low BMRGI groups. (E) TOP10 mutated genes in high BMRGI group. (F) TOP10 mutated genes in low BMRGI group. (G) Difference analysis of TMB between high BMRGI group and low BMRGI group. (H) Difference analysis of TIDE scores between high BMRGI group and low BMRGI group. (I) Difference analysis of IC50 values of 6 commonly used drugs in high BMRGI group and low BMRGI group. (J) Correlation analysis between IC50 values of 6 commonly used drugs and BMRGI. * represents p < 0.05, ** represents p < 0.01, and *** represents p < 0.001.

3.8 Screening for sensitive drugs in HCC patients

We evaluated and observed the differences in sensitivity to 6 common drugs in HCC patients between the two groups. The lower the IC50 value, the higher the sensitivity to the drug. The results showed that patients in the low BMRGI group were more sensitive to sorafenib, oxaliplatin, cytarabine and fludarabine, whereas patients in the high BMRGI group were more sensitive to 5-fluorouracil and gefitinib higher (Figures 7I, J). All in all, these results provide a good reference for clinical medication.

3.9 Identification of key BMRG

We conducted a more refined analysis based on the 47 common genes screened above. First, we identified marker molecules of HCC by 3 machine learning methods (LASSO, SVM-REF, and RandomForest) (Figures 8A–C). Among them, CSTA, ITGA6, ITGB8 and LAMC1 are common marker molecules (Figure 8D). Then we evaluated the diagnostic value of the four marker molecules through the ROC curve, and the results showed that CSTA (AUC = 0.952), ITGA6(AUC = 0.942), ITGB8(AUC = 0.756) and LAMC1(AUC = 0.936) all had high diagnostic value (Figures 8E–H). At the same time, we found that CTSA not only has the highest diagnostic value, but also constitutes one of the members of BMRPI. Therefore, we considered CTSA as the most critical BMRG in HCC for further study.

FIGURE 8
www.frontiersin.org

Figure 8 Screening for Feature BMRG. Characteristic genes in DEGs selected by LASSO (A), SVM-SEF (B) and RandomForst (C). (D) The Venn diagram shows the common genes of the three algorithms. (E–H) ROC curve of common genes.

3.10 Multilevel expression verification and in vitro functional exploration of CTSA

We further investigated the differential expression of CTSA in HCC. First, we searched through GEPIA2.0, and the results showed that CTSA was highly expressed in HCC (Figure 9A). Second, we explored the differential expression of CTSA at the protein level. The UALCAN database (https://ualcan.path.uab.edu/index.html) showed that the protein level of CTSA in HCC was significantly higher than that in the normal group (Figure 9B). Likewise, the HPA database showed that CTSA was highly expressed in HCC tissues compared with normal liver tissues (Figure 9C). Finally, we detected the expression of CTSA in normal liver cell lines (LO2) and liver cancer cell lines (BEL7402, HEPG2, HCCLM3), and the results showed that the expression level of CTSA in liver cancer cell lines was significantly higher than that in normal liver cell lines (Figure 9D).

FIGURE 9
www.frontiersin.org

Figure 9 Multidimensional expression validation of CTSA and modulation of HCC oncogenic capacity in vitro. (A) Differential analysis of mRNA expression of CTSA in HCC in GEPIA2.0 database. (B) Differential analysis of protein expression levels of CTSA in HCC from UALCAN database. (C) IHC images of CTSA in HCC tissues and normal liver tissues from HPA database. (D) Expression levels of CTSA in normal liver cell lines and HCC cell lines. (E) Knockdown efficiency of siCTSA in BEL7402 and HCCLM3 cells. (F, G) CCK8 assay detects the effect of knocking down CTSA on the proliferation ability of BEL-7402 (Left) and HCCLM3 (Right). (H) Transwell assay was used to detect the effect of knocking down CTSA on the migration and invasion of BEL7402 cells. (I) Transwell assay was used to detect the effect of knocking down CTSA on the migration and invasion of HCCLM3 cells. *** represents p < 0.001.

To gain insight into the in vitro function of CTSA in HCC, we characterized the oncogenic phenotype of HCCLM3 and BEL-7402 cells (SiCTSA-1 and SiCTSA-2) with CTSA knockdown. The RT-qPCR results showed that siCTSA-1 and siCTSA-2 could significantly inhibit the CTSA expression of HCC cells (BEL-7402 and HCCLM3 cells) (Figure 9E). We studied the role of CTSA in HCC cell proliferation by CCK8 assay, and the role of CTSA in HCC cell migration and invasion using Transwell assay. CCK8 assay and Transwell assay analysis showed that the reduction of CTSA impaired the proliferation (Figures 9F, G), migration (Figure 9H) and invasion (Figure 9I) abilities of HCC cells (BEL7402 and HCCLM3).

4 Discussion

HCC is the most prevalent histological type of primary liver cancer, known for its high metastatic and recurrence characteristics (4). Unfortunately, most HCC patients are diagnosed at an advanced stage, which significantly reduces the chance of curative treatment and leads to a poor prognosis (26). BM, as an important component of the extracellular matrix, is an important barrier that cancer cells must overcome to form metastasis (5, 27). Numerous studies have demonstrated the association between the main components of BM and HCC tumor metastasis, as well as poor prognosis (2830). Current study shows that systematic analysis of specific gene sets achieves promising results in predicting cancer prognosis (31, 32). Despite advancements in research, there is still a lack of reliable prognostic models for HCC based on basement membrane genes. To address this gap, our study utilized WGCNA and machine learning to develop a strong prognostic index based on BMRG. Our model has demonstrated high accuracy in predicting the prognosis of HCC patients. This study utilized the TCGA-HCC dataset to identify 4 BMRG (CTSA, ADAM9, LAMB1, and SPON2) through WGCNA and machine learning techniques. These BMRG were used to construct BMRGI. Previous research has shown that all four BMRG are closely associated with HCC. Wang et al. discovered that CTSA has potential as a diagnostic and prognostic marker for HCC patients (33). In HCC, ADAM9 is known to be overexpressed and is responsible for inducing ROS generation, which in turn promotes HCC cell invasion (34). Additionally, LAMB1 has been shown to be regulated by the RNA helicase DDX24, which contributes to the malignant progression of HCC (35). However, the role of SPON2 in HCC is still a matter of debate. While high expression of SPON2 has been linked to poor prognosis in HCC patients (36), it has also been found to inhibit tumor metastasis by promoting the infiltration of M1-like macrophages (37). Taken together, these studies showed that the four BMRG were closely related to HCC and its prognosis, which indicated the correctness of our BMRGI based on them.

By analyzing the single-cell atlas of HCC tissue, we identified five types of cells present. Our findings suggest that hepatocytes are capable of acting as ligands to send signals to immune cells, while immune cells exhibit a weaker ability to send signals to liver cells. We conducted an analysis of the expression of model genes across various cell types and found that CTSA expression was particularly high in hepatocytes, NK cells, and macrophages. This suggests that any abnormal expression of CTSA could potentially impact the progression of HCC by influencing the immune microenvironment of HCC.

We conducted related validation on BMRGI. Survival analysis revealed that the prognosis of the high BMRGI group was significantly worse than that of the lower BMRGI group in the TCGA-HCC cohort. Furthermore, we validated the ability of BMRGI to predict the prognosis of HCC patients in the ICGC cohort. The study found that BMRGI is an independent risk factor for the prognosis of HCC patients, as determined by the results of multivariate Cox analysis. Additionally, ROC curve analysis revealed that BMRGI is a better predictor of HCC patient prognosis compared to other clinicopathological parameters. Subgroup analysis based on clinical characteristics demonstrated that BMRGI has a strong ability to predict prognosis for HCC patients with varying clinical characteristics. In order to facilitate clinical application and improve the accuracy of predicting the prognosis of HCC patients, we combined common clinicopathological parameters with BMRGI to construct a nomogram.

We conducted further analysis of the differentially expressed genes (DEGs) between the high and low BMRGI groups to investigate the biological properties of these subgroups. Our analysis, which included Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Gene Set Enrichment Analysis (GSEA), revealed significant differences in biological processes related to immunity and cell adhesion (BM is closely related) between the different BMRGI subgroups.

This study demonstrates the accuracy of BMGPI construction (closely related to BM-related biological characteristics). Additionally, BMGPI effectively recognizes differences in the tumor immune microenvironment. Further analysis was conducted to determine the correlation between BMRGI and the tumor immune microenvironment. Previous research has shown that immune checkpoint molecules and HLA family molecules are strong predictors of response to immunotherapy (3841). Therefore, we analyzed the correlation between BMRPI and immune checkpoint molecules and HLA family molecules. The results showed that BMRGI was positively correlated with most immune checkpoint molecules and HLA molecules, suggesting that BMRGI may also be a good biomarker for predicting immunotherapy response. Second, our results showed significant differences in terms of immune-related cells between high and low BMRGI groups, implying that different BMRGI subgroups may differ in response to immunotherapy. Interestingly, the mutation rate of TP53 in the high BMRGI group was significantly higher than that in the low BMRGI group, which may be one of the reasons for the poor prognosis in the high BMRGI group (42). However, overall TMB levels were not statistically different between the two groups. In this study, we used the TIDE algorithm to analyze the response to immunotherapy in various BMRGI subgroups of HCC patients. Our findings indicate that patients in the high BMRGI group had a lower TIDE score, suggesting that they may be less prone to immune escape and therefore have a better response to immunotherapy. Additionally, we evaluated the sensitivity of different BMRGI subgroups to six commonly used therapeutic drugs, providing valuable insights for clinical decision-making.

Finally, we screened 47 common gene by multiple machine learning methods and finally identified 4 BMRG: CTSA, ITGA6, ITGA8, and LAMC1. ROC analysis showed that these genes have high diagnostic value for distinguishing HCC. We found that CTSA not only had the highest diagnostic value (AUC:0.952), but also constituted one of the core members of BMRGI. Therefore, we regard CTSA as the most critical member of BMRG and conduct in-depth research. We verified that the expression of CTSA in HCC was significantly higher than that in normal tissues at the mRNA level and protein level by GEPIA2.0 database, UALCAN database and HPA database. In addition, we also verified by RT-qPCR that the expression of CTSA in HCC cell lines was significantly higher than that in normal liver cell lines. Since the oncogenic role of CTSA in HCC is still unclear, this prompted us to further explore the role of CTSA in HCC progression. More importantly, our in vitro cell experiments showed that the proliferation, migration and invasion abilities of HCC cell lines (BEL7402 and HCCLM3) were significantly reduced after CTSA knockdown.

Like other studies, even this study has some limitations and shortcomings. First, when we validated the prognostic value of BMRGI, we did not validate it in real cohorts. Second, the carcinogenesis of CTSA has not been explored by in vivo experiments. Finally, the specific molecular mechanism by which CTSA affects the progression of HCC was not elucidated in this study.

In conclusion, our study trained and validated a BMRGI that could effectively predict the prognosis of HCC patients based on 222 BMRG. Based on this, we also developed a nomogram for clinical application. The biological and immunological characteristics of BMRGI in HCC were explored through a series of bioinformatics methods, and some insights were provided for clinical immunotherapy and targeted therapy. Finally, we also verified the role of the key BMRG CTSA in HCC progression through in vivo functional experiments.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Author contributions

PS and YY have constructed and devised the research. WS and JW performed data analysis and wrote the manuscript. ZW, MX and QL analyzed the data. WS performed the experiments. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by National Natural Science Foundation of China (82070634), Shanghai Municipal Science and Technology Commission (20ZR1451700), and Shanghai Collaborative Innovation Center for Translational Medicine (TM201731).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1185916/full#supplementary-material

References

1. Llovet JM, Kelley RK, Villanueva A, Singal AG, Pikarsky E, Roayaie S, et al. Hepatocellular carcinoma. Nat Rev Dis Primers (2021) 7(1):6. doi: 10.1038/s41572-020-00240-3

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Forner A, Llovet JM, Bruix J. Hepatocellular carcinoma. Lancet (London England) (2012) 379(9822):1245–55. doi: 10.1016/s0140-6736(11)61347-0

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Yin Z, Dong C, Jiang K, Xu Z, Li R, Guo K, et al. Heterogeneity of cancer-associated fibroblasts and roles in the progression, prognosis, and therapy of hepatocellular carcinoma. J Hematol Oncol (2019) 12(1):101. doi: 10.1186/s13045-019-0782-x

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Reuten R, Zendehroud S, Nicolau M, Fleischhauer L, Laitala A, Kiderlen S, et al. Basement membrane stiffness determines metastases formation. Nat materials (2021) 20(6):892–903. doi: 10.1038/s41563-020-00894-0

CrossRef Full Text | Google Scholar

6. Donato MF, Colombo M, Matarazzo M, Paronetto F. Distribution of basement membrane components in human hepatocellular carcinoma. Cancer (1989) 63(2):272–9.

PubMed Abstract | Google Scholar

7. Mutgan AC, Jandl K, Kwapiszewska G. Endothelial basement membrane components and their products, matrikines: active drivers of pulmonary hypertension? Cells (2020) 9(9):2029. doi: 10.3390/cells9092029

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Sherwood DR. Basement membrane remodeling guides cell migration and cell morphogenesis during development. Curr Opin Cell Biol (2021) 72:19–27. doi: 10.1016/j.ceb.2021.04.003

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Banerjee S, Lo WC, Majumder P, Roy D, Ghorai M, Shaikh NK, et al. Multiple roles for basement membrane proteins in cancer progression and emt. Eur J Cell Biol (2022) 101(2):151220. doi: 10.1016/j.ejcb.2022.151220

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Najafi M, Farhood B, Mortezaee K. Extracellular matrix (Ecm) stiffness and degradation as cancer drivers. J Cell Biochem (2019) 120(3):2782–90. doi: 10.1002/jcb.27681

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Chen S, Liu P, Zhao L, Han P, Liu J, Yang H, et al. A novel cuproptosis-related prognostic lncrna signature for predicting immune and drug therapy response in hepatocellular carcinoma. Front Immunol (2022) 13:954653. doi: 10.3389/fimmu.2022.954653

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Lu J, Yu C, Bao Q, Zhang X, Wang J. Identification and analysis of necroptosis-associated signatures for prognostic and immune microenvironment evaluation in hepatocellular carcinoma. Front Immunol (2022) 13:973649. doi: 10.3389/fimmu.2022.973649

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Jayadev R, Morais M, Ellingford JM, Srinivasan S, Naylor RW, Lawless C, et al. A basement membrane discovery pipeline uncovers network complexity, regulators, and human disease associations. Sci Adv (2022) 8(20):eabn2265. doi: 10.1126/sciadv.abn2265

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Langfelder P, Horvath S. Wgcna: an r package for weighted correlation network analysis. BMC Bioinf (2008) 9:559. doi: 10.1186/1471-2105-9-559

CrossRef Full Text | Google Scholar

15. Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol (2018) 36(5):411–20. doi: 10.1038/nbt.4096

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Maeser D, Gruener RF, Huang RS. Oncopredict: an r package for predicting in vivo or cancer patient drug response and biomarkers from cell line screening data. Briefings Bioinf (2021) 22(6):bbab260. doi: 10.1093/bib/bbab260

CrossRef Full Text | Google Scholar

17. Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn (2002) 46(1):389–422. doi: 10.1023/A:1012487302797

CrossRef Full Text | Google Scholar

18. Duan J, Soussen C, Brie D, Idier J, Wan M, Wang YP. Generalized lasso with under-determined regularization matrices. Signal Process (2016) 127:239–46. doi: 10.1016/j.sigpro.2016.03.001

CrossRef Full Text | Google Scholar

19. Ishwaran H, Kogalur UB. Consistency of random survival forests. Stat probability Lett (2010) 80(13-14):1056–64. doi: 10.1016/j.spl.2010.02.020

CrossRef Full Text | Google Scholar

20. Zhao Z, He S, Yu X, Lai X, Tang S, Mariya ME, et al. Analysis and experimental validation of rheumatoid arthritis innate immunity gene Cyfip2 and pan-cancer. Front Immunol (2022) 13:954848. doi: 10.3389/fimmu.2022.954848

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Tang Z, Kang B, Li C, Chen T, Zhang Z. Gepia2: an enhanced web server for Large-scale expression profiling and interactive analysis. Nucleic Acids Res (2019) 47(W1):W556–w60. doi: 10.1093/nar/gkz430

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Chandrashekar DS, Karthikeyan SK, Korla PK, Patel H, Shovon AR, Athar M, et al. Ualcan: an update to the integrated cancer data analysis platform. Neoplasia (New York NY) (2022) 25:18–27. doi: 10.1016/j.neo.2022.01.001

CrossRef Full Text | Google Scholar

23. Colwill K, Gräslund S. A roadmap to generate renewable protein binders to the human proteome. Nat Methods (2011) 8(7):551–8. doi: 10.1038/nmeth.1607

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Sun W, Shen J, Liu J, Han K, Liang L, Gao Y. Gene signature and prognostic value of ubiquitin-specific proteases members in hepatocellular carcinoma and explored the immunological role of Usp36. Front bioscience (Landmark edition) (2022) 27(6):190. doi: 10.31083/j.fbl2706190

CrossRef Full Text | Google Scholar

25. Dai W, Xu L, Yu X, Zhang G, Guo H, Liu H, et al. Ogdhl silencing promotes hepatocellular carcinoma by reprogramming glutamine metabolism. J Hepatol (2020) 72(5):909–23. doi: 10.1016/j.jhep.2019.12.015

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Forner A, Reig M, Bruix J. Hepatocellular carcinoma. Lancet (London England) (2018) 391(10127):1301–14. doi: 10.1016/s0140-6736(18)30010-2

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Mierke CT. The matrix environmental and cell mechanical properties regulate cell migration and contribute to the invasive phenotype of cancer cells. Rep Prog Phys Phys Soc (Great Britain) (2019) 82(6):064602. doi: 10.1088/1361-6633/ab1628

CrossRef Full Text | Google Scholar

28. Yamashita T, Koshikawa N, Shimakami T, Terashima T, Nakagawa M, Nio K, et al. Serum laminin Γ2 monomer as a diagnostic and predictive biomarker for hepatocellular carcinoma. Hepatol (Baltimore Md) (2021) 74(2):760–75. doi: 10.1002/hep.31758

CrossRef Full Text | Google Scholar

29. Wang T, Jin H, Hu J, Li X, Ruan H, Xu H, et al. Col4a1 promotes the growth and metastasis of hepatocellular carcinoma cells by activating fak-src signaling. J Exp Clin Cancer Res CR (2020) 39(1):148. doi: 10.1186/s13046-020-01650-7

CrossRef Full Text | Google Scholar

30. Krishnan MS, Rajan Kd A, Park J, Arjunan V, Garcia Marques FJ, Bermudez A, et al. Genomic analysis of vascular invasion in hcc reveals molecular drivers and predictive biomarkers. Hepatol (Baltimore Md) (2021) 73(6):2342–60. doi: 10.1002/hep.31614

CrossRef Full Text | Google Scholar

31. Sun W, Xu Y, Zhao B, Zhao M, Chen J, Chu Y, et al. The prognostic value and immunological role of angiogenesis-related patterns in colon adenocarcinoma. Front Oncol (2022) 12:1003440. doi: 10.3389/fonc.2022.1003440

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Ye Y, Zhao Q, Wu Y, Wang G, Huang Y, Sun W, et al. Construction of a cancer-associated fibroblasts-related long non-coding rna signature to predict prognosis and immune landscape in pancreatic adenocarcinoma. Front Genet (2022) 13:989719. doi: 10.3389/fgene.2022.989719

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Wang H, Xu F, Yang F, Lv L, Jiang Y. Prognostic significance and oncogene function of cathepsin a in hepatocellular carcinoma. Sci Rep (2021) 11(1):14611. doi: 10.1038/s41598-021-93998-9

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Dong Y, Wu Z, He M, Chen Y, Chen Y, Shen X, et al. Adam9 mediates the interleukin-6-Induced epithelial-mesenchymal transition and metastasis through ros production in hepatoma cells. Cancer Lett (2018) 421:1–14. doi: 10.1016/j.canlet.2018.02.010

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Liu T, Gan H, He S, Deng J, Hu X, Li L, et al. Rna helicase Ddx24 stabilizes Lamb1 to promote hepatocellular carcinoma progression. Cancer Res (2022) 82(17):3074–87. doi: 10.1158/0008-5472.can-21-3748

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Feng Y, Hu Y, Mao Q, Guo Y, Liu Y, Xue W, et al. Upregulation of spondin-2 protein expression correlates with poor prognosis in hepatocellular carcinoma. J Int Med Res (2019) 47(2):569–79. doi: 10.1177/0300060518803232

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Zhang YL, Li Q, Yang XM, Fang F, Li J, Wang YH, et al. Spon2 promotes M1-like macrophage recruitment and inhibits hepatocellular carcinoma metastasis by distinct integrin-rho gtpase-hippo pathways. Cancer Res (2018) 78(9):2305–17. doi: 10.1158/0008-5472.can-17-2867

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Gibney GT, Weiner LM, Atkins MB. Predictive biomarkers for checkpoint inhibitor-based immunotherapy. Lancet Oncol (2016) 17(12):e542–e51. doi: 10.1016/s1470-2045(16)30406-5

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Darvin P, Toor SM, Sasidharan Nair V, Elkord E. Immune checkpoint inhibitors: recent progress and potential biomarkers. Exp Mol Med (2018) 50(12):1–11. doi: 10.1038/s12276-018-0191-1

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Schaafsma E, Fugle CM, Wang X, Cheng C. Pan-cancer association of hla gene expression with cancer prognosis and immunotherapy efficacy. Br J Cancer (2021) 125(3):422–32. doi: 10.1038/s41416-021-01400-2

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Loustau M, Anna F, Dréan R, Lecomte M, Langlade-Demoyen P, Caumartin J. Hla-G neo-expression on tumors. Front Immunol (2020) 11:1685. doi: 10.3389/fimmu.2020.01685

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Long J, Wang A, Bai Y, Lin J, Yang X, Wang D, et al. Development and validation of a Tp53-associated immune prognostic model for hepatocellular carcinoma. EBioMedicine (2019) 42:363–74. doi: 10.1016/j.ebiom.2019.03.022

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: hepatocellular carcinoma, basement membranes, prognosis, immunotherapy, machine learning, ScRNA-seq, CTSA, vitro experiment

Citation: Sun W, Wang J, Wang Z, Xu M, Lin Q, Sun P and Yuan Y (2023) Combining WGCNA and machine learning to construct basement membrane-related gene index helps to predict the prognosis and tumor microenvironment of HCC patients and verifies the carcinogenesis of key gene CTSA. Front. Immunol. 14:1185916. doi: 10.3389/fimmu.2023.1185916

Received: 14 March 2023; Accepted: 10 May 2023;
Published: 23 May 2023.

Edited by:

Jun Liu, Yuebei People’s Hospital, China

Reviewed by:

Bufu Tang, Fudan University, China
Jiahao Gao, Fudan University, China

Copyright © 2023 Sun, Wang, Wang, Xu, Lin, Sun and Yuan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Yihang Yuan, c3VyZ2VyeXVhbkBvdXRsb29rLmNvbQ==; Peng Sun, c3AyMDgyQHNodHJob3NwaXRhbC5jb20=

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.