- 1Medical Biochemistry and Molecular Biology Department, Faculty of Medicine, Ain Shams University, Cairo, Egypt
- 2Faculty of Oral and Dental Medicine, Misr International University (MIU), Cairo, Egypt
- 3Biochemistry Department, Faculty of Medicine, Umm Al-Qura University, Makkah, Saudi Arabia
- 4Bioinformatics Group, Center of Informatics Sciences (CIS), School of Information Technology and Computer Sciences, Nile University, Giza, Egypt
- 5Biotechnology/Biomolecular Chemistry Department, Faculty of Science, Cairo University, Giza, Egypt
- 6Physiology Department, Faculty of Medicine, Ain Shams University, Cairo, Egypt
- 7Department of Internal Medicine, Badr University in Cairo, Badr, Egypt
- 8Department of Anatomy and Cell Biology, Faculty of Medicine, Ain Shams University, Cairo, Egypt
- 9Department of Anatomy and Cell Biology, Faculty of Medicine, Galala University, Suez, Egypt
- 10Systems and Biomedical Engineering Department, Faculty of Engineering, Cairo University, Giza, Egypt
Introduction: Liver cancer, particularly Hepatocellular carcinoma (HCC), remains a significant global health concern due to its high prevalence and heterogeneous nature. Despite the existence of approved drugs for HCC treatment, the scarcity of predictive biomarkers limits their effective utilization. Integrating diverse data types to revolutionize drug response prediction, ultimately enabling personalized HCC management.
Method: In this study, we developed multiple supervised machine learning models to predict treatment response. These models utilized classifiers such as logistic regression (LR), k-nearest neighbors (kNN), neural networks (NN), support vector machines (SVM), and random forests (RF) using a comprehensive set of molecular, biochemical, and immunohistochemical features as targets of three drugs: Pantoprazole, Cyanidin 3-glycoside (Cyan), and Hesperidin. A set of performance metrics for the complete and reduced models were reported including accuracy, precision, recall (sensitivity), specificity, and the Matthews Correlation Coefficient (MCC).
Results and Discussion: Notably, (NN) achieved the best prediction accuracy where the combined model using molecular and biochemical features exhibited exceptional predictive power, achieving solid accuracy of 0.9693 ∓ 0.0105 and average area under the ROC curve (AUC) of 0.94 ∓ 0.06 coming from three cross-validation iterations. Also, found seven molecular features, seven biochemical features, and one immunohistochemistry feature as promising biomarkers of treatment response. This comprehensive method has the potential to significantly advance personalized HCC therapy by allowing for more precise drug response estimation and assisting in the identification of effective treatment strategies.
1 Introduction
Hepatocellular carcinoma (HCC) is a deadly form of liver cancer, ranking fifth among the most commonly diagnosed malignancies worldwide and being the third leading cause of cancer-related deaths (Sung et al., 2021). In Egypt specifically, HCC remains the most prevalent cancer and the primary cause of cancer-related mortality, resulting in approximately 26,000 deaths annually (Omar et al., 2023). HCC arises due to repeated cycles of inflammation, leading to the progression of fibrosis, cirrhosis, and ultimately, the development of cancer (Piñero et al., 2020). Patients with chronic liver conditions, such as viral hepatitis, and even those with alcoholic and nonalcoholic fatty liver disease or metabolic syndromes, are at a higher risk of developing HCC (Chidambaranathan-Reghupaty et al., 2021). Unfortunately, the lack of effective biomarkers for early detection, limited understanding of HCC’s heterogeneity, and therapeutic resistance contribute to the high mortality rates (Tunissiolli et al., 2017). Despite the availability of various treatment options such as surgical resection, targeted immune therapies, and radiofrequency ablation, patients with unresectable HCC face significant unmet medical needs and experience a poor prognosis (Xing et al., 2021; Llovet et al., 2022).
Numerous RNA biomarker subtypes, such as Circular RNAs (circRNAs), microRNAs (miRNAs), and long noncoding RNAs (lncRNAs), have demonstrated their potential as diagnostic and prognostic biomarkers in HCC (Tan et al., 2019; Jiang et al., 2019). Particularly, these biomarkers are involved in many HCC-related pathways such as autophagy, apoptosis, cell cycle regulation, and immune checkpoints (Takamura et al., 2011; Lee et al., 2018). CircRNAs, predominantly located in the cytoplasm and exhibiting relative stability, function as miRNA sponges or protein scaffolds, thereby facilitating protein-protein interactions (Shi et al., 2021). They play a significant role in the initiation and progression of HCC (Su et al., 2019). Dysregulation of lncRNA has been observed in various cancer types, where it can either suppress or promote tumorigenesis and tumor development (Kang et al., 2019). LncRNA exerts its influence through interactions with DNA, RNA, and proteins, as well as by acting as microRNA sponges (Huang et al., 2020). Moreover, they play a crucial role in tumor progression and contribute to the malignant phenotypes observed in HCC, including proliferation, invasion, and migration (Unfried et al., 2021). Considering the complexity of the RNA landscape, which involves various splicing forms, alternative polyadenylation, and chimeric RNAs, the incorporation of multiple RNA types could enhance the performance of diagnostic and prognostic panels (Eun et al., 2023; Fu et al., 2022; Schlosser et al., 2022).
RNA extracted from malignant growths has great potential as a valuable resource for determining drug efficacy in various cancer forms (Fang et al., 2023). Despite previous endeavors falling short of desired outcomes for clinical implementation, there remains a persistent interest in exploring novel avenues (Zhang H. Y. et al., 2023; Zhang J. et al., 2023). Presently, the technique of immunohistochemistry (IHC) has emerged as a promising tool for assessing the presence of HCC such as glutathione S-transferase pi (GSTP), proliferating cell nuclear antigen (PCNA), and tumor necrosis factor (TNF), indicating that integrating protein expression data with mRNA may offer a more effective means of predicting drug response (Márquez-Quiroga et al., 2022; Stärkel et al., 2015; Feng et al., 2022).
Our research endeavors were extensively conducted on animal models, aiming to identify effective drugs for HCC. During our studies, we explored diverse therapeutic approaches, including the utilization of proton-pump inhibitors (PPIs) such as Pantoprazole. Pantoprazole has garnered attention as a promising therapeutic strategy for gastric cancer, showing the ability to enhance sensitivity to antitumor drugs, attenuate liver tumorigenesis, and modulate autophagy in rat models (Zhang et al., 2019; Bridoux et al., 2022; Kim et al., 2022; Matboli et al., 2019). Furthermore, our investigations delved into the anticancer effects of natural compounds, with a specific focus on triggering cancer cell death (Zaghloul et al., 2017; Kooti et al., 2017). Hesperidin, a flavanone glycoside commonly found in citrus fruits, emerged as a prominent candidate as it exhibited anti-inflammatory, anti-carcinogenic, antioxidative, and lipid-lowering effects (Banjerdpongchai et al., 2016). Moreover, it has demonstrated an antigenotoxic effect by counteracting DNA damage induced by hydrogen peroxide (Fernández-Bedmar et al., 2017). Importantly, hesperidin has exhibited inhibitory effects on the development of various cancer types, including tongue, esophageal, and colon cancers (Tanaka and Sugie, 2007; Aggarwal et al., 2020). Additionally, Cyanidin 3-glycoside (Cyan), a notable anthocyanidin found in plants and fruits, has been associated with multiple beneficial properties, such as anti-aging, anti-oxidative, anti-inflammatory, vascular relaxation, antiproliferative, and anti-inflammatory effects (Safdar et al., 2023; Mirmalek et al., 2023). Moreover, Cyan has suppressed the progression of breast, lung, and liver cancers (Yin et al., 2019; Chen et al., 2021). To gain comprehensive insights, we employed a panel of mRNA signatures involved in HCC pathogenesis and their epigenetic regulators complemented by biochemical, histological, and immunohistochemistry analyses, these comprehensive evaluations provided valuable indications of the biological actions and potential clinical benefits of our therapeutic candidates.
Recently, the advent of artificial intelligence (AI) has enhanced all possible aspects of clinical care for HCC. AI holds the potential to revolutionize HCC management by addressing key challenges in several crucial areas by enhancing the prediction of future HCC risk in patients, improving the accuracy of diagnosis, and refining prognostication for patients already diagnosed with HCC while predicting their response to specific drugs (Calderaro et al., 2022). The primary objective of our study is to develop a machine-learning (ML) model that utilizes a comprehensive set of molecular, biochemical, and IHC features derived from HCC rat models in order to predict drug response.
2 Materials and methods
2.1 Chemicals and drugs
Diethylnitrosamine (DEN) with a purity of 99.0% and acetamidofluorene (2-AAF) with a purity of 98% were acquired from Sigma Aldrich (St. Louis, MO, United States) with CAS numbers 55-18-5 and 53-96-3, respectively. Hesperidin was also obtained from Sigma Aldrich, while Cyan was purchased from (Earth Natural Supplements in Colorado Springs, United States). Pantoprazole was supplied from (Controloc; Takeda GmbH, Oranienburg, Germany).
2.2 Experimental protocol
One hundred and thirty male Wistar rats, weighing between 190 and 200 g, were procured from the animal house of Nile Pharmaceuticals in Cairo, Egypt. The rats were housed in a controlled environment with temperatures maintained at 22°C–24°C, following a twelve-hour light-dark cycle. They were provided with standard rat chow and had access to tap water, allowing them to acclimate for 1 week. Ethical approval was obtained from the Ain Shams Research Ethics Committee, Faculty of Medicine, Egypt, FMASU MD 32/2016. In accordance with the guidelines of the Declaration of Helsinki. Before each injection, the weight of each rat was measured to ensure accurate calculation of the drug dosage. To induce hepatic pre-cancerous legions (HPCL), the rats were intraperitoneally injected with DEN+2-AAF. DEN was administered intraperitoneally once a week for three consecutive weeks at a dose of 100 mg/kg, followed by a one-week rest period. Subsequently, 2-AAF was injected intraperitoneally at a dose of 300 mg/kg (Hasanin et al., 2021a). The rats were then divided randomly into eleven groups: I) The control group (10 rats) received intraperitoneal injections of 0.9% NaCl. II) The HPCL group (12 rats) received intraperitoneal injections of DEN+2-AAF. III) The Hesperidin groups (12 rats each) included Hesperidin-50, Hesperidin-100, and Hesperidin-200. These rats were injected with DEN+2-AAF for HPCL induction and then treated with hesperidin at doses of 50, 100, and 200 mg/kg/day, respectively, for four consecutive days weekly for 16 weeks. IV) The Cyan groups (12 rats each) included Cyan-10, Cyan-20, and Cyan-30. They were injected with DEN+2-AAF for PCL induction and subsequently treated with Cyan at doses of 10, 20, and 30 mg/kg/day, respectively, for four consecutive days weekly for 16 weeks. V) The Pantoprazole groups (12 rats each) included Pantoprazole-25, Pantoprazole-50, and Pantoprazole-100. These rats were injected with DEN+2-AAF for HPCL induction and then treated with Pantoprazole intraperitoneally at doses of 25, 50, and 100 mg/kg/day, respectively, for three consecutive days weekly for 4 weeks (Figure 1).
2.3 Blood sample and liver tissue assessment
At the end of the experiment, the rats were anesthetized with urethane (1.2 g/kg; intraperitoneal injection) dissolved in distilled water. The blood was obtained from the retroorbital vein and incubated for approximately 30 min to allow clotting. Subsequently, the samples were centrifuged at 5,000 rpm for 20 min to separate the serum. The livers were carefully dissected. Specifically, the right lobe of each liver was removed and sliced into longitudinal sections measuring 2–4 mm in thickness. These sections were preserved in 10% formalin for the histopathological and immunohistochemical examinations. The remaining liver lobes and sera samples were promptly frozen at −80°C for liver function tests and RNA extraction.
2.4 Biochemical analysis
Quantitative analysis of serum biomarkers was conducted to assess various parameters including alanine transaminase (ALT), aspartate transaminase (AST), total cholesterol (TC), triglycerides (TG), HDL cholesterol (HDL-C), and LDL cholesterol (LDL-C), alkaline phosphatase (ALP), gamma-glutamyl transferase (GGT), total bilirubin (T. Bilirubin), direct bilirubin (D. Bilirubin), albumin using commercially available kits and an automated Beckman Coulter AU680 autoanalyzer (Beckman Coulter Inc., CA). Additionally, the levels of alpha-fetoprotein (AFP) were determined using an ELISA kit (Abcam cat no. ab108838, Cambridge, MA, United States).
2.5 Molecular markers selection
We used the Gene Expression Omnibus (GEO) database (https://www.ncbi.nlm.nih.gov/gds/?term=) to find differentially expressed genes (DEGs) using the keyword “Hepatocellular Carcinoma,” We collected datasets that fulfilled our criteria, which involved having tissue samples related to hepatocellular carcinoma (HCC) along with normal tissue samples for comparison. Moreover, we verified that each dataset had an adequate sample size to ensure the reliability of our statistical analysis, and chose the GSE141090, GSE24600, GSE38199, GSE49515, and GSE41804 datasets (details in Supplementary Table S2), then we applied GEO2R/Limma R package tool to find significant DEGs, the P values less than 0.05 were considered significant with log twofold change (LogFC) value ≥ 1 or ≤ − 1, using the adjustment method of Benjamini and Hochberg (false discovery rate) (Supplementary Table S1). Based on our interest in cell cycle progression, autophagy, metastasis, and apoptosis, we selected BCL2 Associated X (BAX), Autophagy Related 16 Like 1 (ATG16-L1), Cyclin E mRNA, Tumor Protein P53 (P53), Ras-related in brain11 gene (RAB11), and Tubulin gamma 1 (TUBG1). The expression ATLAS (https://www.ebi.ac.uk/gxa/home) in addition to The Cancer Proteome Atlas (TCPA) (available at https://www.tcpaportal.org/) databases were used to confirm the differential expression of the selected genes in HCC (Supplementary Figure S2). Using the Kyoto Encyclopedia of Genes and Genomes (KEGG) and the Reactome databases we performed pathway analysis for the selected genes using the Enricher database (https://maayanlab.cloud/Enrichr/). Then we selected the epigenetic regulators of the DEGS, firstly, we retrieved the miRNAs (miR-125b-1-3p, miR-1262, miR-1298, and miR-106b) that interact with the selected DEGs using the mirwalk database (available at http://mirwalk.umm.uni-heidelberg.de/) and the RNA22 database (available at https://cm.jefferson.edu/rna22/Interactive/) (Supplementary Figure S3). Moreover, we chose the LncRNAs (lncRNA-RP11-513I15.6, lncRNA-RP11-583F2.2, and lncRNA-MALAT) based on our previous studies on HCC, (Matboli et al., 2019; Hasanin et al., 2020; Zabady et al., 2022; Matboli et al., 2021; Hasanin et al., 2021b) their differential expression in HCC confirmed by the TANTRIC database (available at https://www.tanric.org/), and their interaction with miRNA verified by the RNA22 database (available at https://cm.jefferson.edu/rna22/Interactive/) (Supplementary Figures S4, S5). The hsa_circ_0001345 was chosen based on our previous study that validated its role in autophagy in HCC (Zabady et al., 2022).
2.6 Total RNA extraction and real-time PCR
The mRNA expression of TUBG1, RAB11A, ATG16-L1, BAX, and P53, as well as the lncRNAs lncRNA-RP11-513I15.6, lncRNA-RP11-583F2.2, and lncRNA-MALAT, was determined using the RT (Omar et al., 2023) SYBR Green ROX real-time quantitative polymerase chain reaction (qPCR) Mastermix and Quantitect SYBR Green Mastermix Kit (Qiagen, Düsseldorf, Germany), specific primers provided by Qiagen (QT00176519, PPR42379A, PM00597373, UPFH0540166, UPFR1074538, SBRN-027Z, LPH05247A, LPH24879A, and SBH0655633) were used in conjunction with the Step One Plus™ System (Applied Biosystems Inc., Foster City, CA, United States). β-actin (PM00480207) was utilized as the endogenous control. For miRNA expression analysis in the liver tissue, we followed the miScript primer assay and miScript SYBR Green kit protocol from Qiagen (Düsseldorf, Germany) to investigate the expression of miR-125b-1-3p, miR-1262, miR-1298, and miR-106b_1 (YI04102304, SBH0386397, SBR1207111, and SI05465957). The endogenous control used was RNU-6 (SI03956260). To obtain the targeted circRNA junction sequence, we referred to the CircInteractome database, and a custom-designed primer assay from the primer-blast database was used for amplification (Zabady et al., 2022; Dudekula et al., 2016; Ye et al., 2012). The PCR program consisted of an initial denaturation step at 95°C for 15 min, followed by 40 cycles of denaturation at 94°C for 10 s, annealing at 55°C for 30 s, and extension at 70°C for 34 s. Each reaction was performed in duplicate. To quantify the expression of the target molecules, we employed the 2−ΔΔCt method (Livak and Schmittgen, 2001). The expression levels of the target genes were normalized against the housekeeping gene for each sample and compared to a reference sample.
2.7 Histological and immunohistochemical examination
Liver samples were obtained from all animal groups and were treated with 10% neutral formaldehyde for 24 h for fixation. The samples were then dehydrated and embedded in paraffin blocks. Thin sections of 5 μm thickness were prepared and subjected to hematoxylin-eosin (H&E) staining to examine potential histopathological alterations. To capture the images, an Olympus BX50 Light microscope from Japan was utilized. Following routine dewaxing with xylene and hydration through a graded ethanol series, the sections underwent a 15-min incubation with a 3% hydrogen peroxide solution at room temperature to neutralize endogenous peroxidase activity. Subsequently, the sections were washed with gently running tap water, followed by rinsing with phosphate-buffered saline (PBS). Primary antibodies were then applied to the sections and incubated overnight at 4°C. The specific antibodies used included a rabbit polyclonal antibody against GST-P (catalog AB106268; Abcam, Cambridge, MA) at a dilution of 1/250, anti-PCNA (Santa Cruz Biotechnology, Santa Cruz, CA) at a dilution of 1/400, and anti-TNF-α (Cat. No. NB600–587, Novus Biologicals, Littleton, CO, United States) at a dilution of 1/200, with incubation conducted at room temperature for 2 h. After rinsing with PBS, the sections were incubated with a biotinylated secondary antibody for 30 min at room temperature. The peroxidase activity was visualized using a 3,3′-Diaminobenzidine solution (Vector Laboratories, Inc., Burlingame, CA), and counterstaining was performed with hematoxylin. To quantify the GST-P-positive area, images were captured using a charge-coupled device (CCD) camera connected to a Windows computer. For PCNA expression level calculation, the number of PCNA-positive hepatocytes was determined by counting 1,000 cells in each case, and the results were presented as a percentage (number of PCNA-positive hepatocytes/1,000 hepatocytes) at × 100 magnification. The area percentage of immunostaining for TNF-α was measured using the Leica Qwin 500 C image analyzer. Measurements were obtained from 10 non-overlapping low-power fields per section in each group.
2.8 Statistical analysis
The statistical analysis was conducted using SPSS 26.0 software. Continuous variables were reported as mean ± standard deviation (SD), following the verification of their normal distribution using the Shapiro-Wilk test. One-way analysis of variance (ANOVA) with Tukey post hoc test was employed to compare differences between groups. A p-value below 0.05 was considered statistically significant. The Chi-square test was employed to compare the categorical data of inflammation grades and fibrosis stages as the grade of 0 indicates no foci, grade 1 signifies less than 2 foci per 200x field of view, grade 2 indicates 2-4 foci per 200x field of view, and grade 3 represents more than 4 foci per 200x field of view. On the other hand, the fibrosis stage is assessed based on the location and extent of fibrotic changes. A stage of 0 indicates no fibrosis, stage 1 represents fibrosis around the sinusoids or portal areas, stage 2 signifies the presence of fibrosis around both the sinusoids and portal/periportal regions, and stage 3 indicates bridging fibrosis (Kleiner et al., 2005).
3 Machine learning models
3.1 Data
One of the goals of this study is to build a machine-learning predictive model to estimate the treatment response based on a set of observational measurements of the treated samples. Let
3.2 Predictive models
We built several supervised machine learning models using the classifiers logistic regression (LR), k-nearest neighbors (kNN), neural networks (NN), support vector machines (SVM), and random forests (RF) to predict the treatment response. For each classifier, we tested seven models (Table 3), where the main difference among them is the data type of the features used as predictors. Such distinctions in predictors can assess the differential predictive power of these models based on the utilized data type. In addition to the seven models listed in Table 3 per classifier, we constructed another seven corresponding models, which we designated as “reduced” models. We used the greedy forward sequential feature selection (SFS) approach with random forests as the estimator to pick up the smallest set of features that are the most representative of the model. This reduction in the number of features can reduce model complexity and computation time while enhancing model interpretability.
The LR classifier had the maximum number of iterations for convergence set to 1,00,000 with a stopping tolerance of 0.0001. The number of neighbors for the kNN classifier was 5 with uniform weights. The NN classification was done using the multi-layer perceptron (MLP) classifier with two hidden layers of a few numbers of neurons (5, 3) to avoid overfitting, the maximum number of convergence iterations to be 200, and the “lbfgs” optimizer as the solver. The parameters for the SVM classifier were the regularization parameter (C) with a value of 1 and a tolerance of 0.0001 for the stopping criterion. The RF classifier had 100 trees with a maximum depth of 10 for each tree. The SFS approach used an RF estimator in a forward direction with parameters of 0.00001 for the stopping criterion and cross-validation of 5 iterations.
3.3 Cross-validation and class balancing
We split data into two disjoint, exclusive sets. The first set was used to train the machine learning model, while the second set was used to test the model’s predictive performance. The test set represented unseen samples for the trained model to avoid overfitting. This splitting process was repeated k times to perform a k-fold cross-validation (kCV) assessment, where every single sample was used once for testing the model’s ability to classify the unseen data. This study contains groups of data representing different treatment drugs, one or more controls, and a disease model (no treatment). We did not follow the strategy of leave-one-group-out to mitigate the problem of under-representing a group which would increase the misclassification rate. Hence, we divided each group into k subsets, where at each CV iteration, only one subset is used for testing. To do so, assume that si is the index of sample s and i = {1, 2, 3, … , n}, where n is the total number of the study samples. The function mod (si,k) uses the mod operator to get the integer remainder of dividing si by k, where the output of (mod (si,k) + 1)
Our data contains two classes in accordance with the treatment response: non-responsive and responsive, with a relative ratio of 1:3.56, respectively, as shown in Figure 2. We used the Synthetic Minority Oversampling Technique (SMOTE) to sample up the undersampled class to have classes of equal size in order to minimize the classifier bias. The upsampling process was performed only in the training phase, and not on the tested samples.
3.4 Identification of potential treatment biomarkers
To study the potential for a certain feature to be a treatment response biomarker for a specific drug, we screened each feature against the control and against the disease model for each drug separately. We used the non-parametric, two-sided Wilcoxon rank-sum test for independent samples to test whether two sets of data come from the same population. The null hypothesis, Ho, is that both sets come from the same population, while the alternative hypothesis, H1, is that they come from two different populations. For comparing the data of one feature following a certain treatment against the data of the same feature for the control case, rejecting Ho is a bad sign for that feature to be a potential biomarker for using that treatment drug, but failure to reject Ho could be a good sign for a potential biomarker since in the latter case the treatment made the values of this feature post-treatment close to those values for the control. On the other hand, in the case of comparing against the disease model, rejecting Ho is a good sign for a potential biomarker since the test indicates two different populations: the disease model population and the post-treatment population.
3.5 Software packages
Python 3.7 was used as the programming language for processing the data of this study. We used many Python-based packages and modules as well to ease the processing pipeline. “NumPy” (version 1.20.3) and “pandas” (version 1.3.5) were used to read data from files, store them in data structures such as DataFrames, and manipulate data. The machine learning models were built using “scikit-learn” (version 1.0.2) besides splitting the data into iterations of disjoint training and testing samples for cross-validating the results. The package “statsmodels” (version 0.13.2) was incorporated to get the statistical significance for univariate regression models and to correct for the multiple hypothesis tests via the false discovery rate (FDR) approach. Under-sampled classes were up-sampled using the Synthetic Minority Oversampling Technique (SMOTE) implemented in the package “imbalanced-learn” (version 0.11.0). The latter package also was used to construct the processing pipeline across all the predictive models. The Wilcoxon rank-sum test was employed from the package “SciPy” (version 1.7.3). Figures were generated with the help of the package “matplotlib” (version 3.5.0).
4 Results
4.1 Biochemical analysis of serum samples
We first assessed the expression of hepatic damage markers in the sera. The HPCL group had a significant increase in ALP, ALT, AST, GGT, T. Bilirubin, D. Bilirubin, and Albumin, in addition to the lipid profile (TC, TG, HDL-C, and LDL-C) and the AFP, the tumor marker, in comparison to the control group (p < 0.05). However, the nine treatment groups showed a remarkable decrease in the levels of hepatic damage markers (ALT, AST, ALP, GGT, T. Bilirubin, and D. Bilirubin) and improving the lipid profile markers (TC, TG, HDL-C, and LDL-C) in addition to decrease the AFP compared to the HPLC group. This indicates that the treatments were effective in mitigating hepatic damage and normalizing the lipid profile (Table 4).
Table 4. The effect of HPCL induction and drugs’ administration on serum hepatic damage markers and lipid profile.
4.2 Differential expression analysis of hepatic molecular markers
The relative expression of BAX, ATG16-L1, P53, and RAB11 mRNAs significantly declined after HPCL induction while TUBG and Cyclin E mRNAs significantly elevated compared to the normal group. Moreover, there was an obvious dysregulation in the relative expression of various non-coding RNA molecules in the HPCL group compared to the Normal group, including microRNAs; as miR-125b, miR-1289, and miR-1262 were significantly decreased, in contrast, miR-106b was significantly increased compared to the normal group. Interestingly, all lncRNAs displayed a significant increase in relative expression in the HPCL group. However, the relative expression of circ_0001345 showed a significant decrease. The treatments with either Hesperidin, Cyan, or Pantoprazole demonstrated a modulation effect on the expression pattern of mRNA molecules (Table 5).
Table 5. The effect of HPCL induction and drugs’ administration on the relative expression of the RNA panel.
4.3 Histopathological and immunohistochemical evaluation
The histopathological analysis of liver sections depicted distinct histological features among the various experimental groups. The control group exhibited a normal liver structure with the absence of any pathological changes (Figure 3A). However, the HPCL group displayed significant liver damage, characterized by extensive destruction of liver architecture, infiltration of inflammatory cells, and the presence of hemorrhagic areas (Figure 3B). In groups treated with hesperidin and cyan, there were confluent lesions of inflammatory cells infiltrating different areas, including the intralobular, periductal, and perivascular regions (Figures 3C–H). Focal lymphocytic infiltration was observed around portal triads and as individual lesions in some sections treated with hesperidin-100 and Pantoprazole-50 (Figures 3G, J). Conversely, no notable inflammatory lesions were observed in the sections treated with hesperidin-200 and cyan-30 (Figures 3E, H) and Pantoprazole-100 (Figure 3K). The immunohistochemical analysis of liver sections from rats revealed distinct patterns of staining when GST-P antibody was applied. In the control group, no positive reaction was observed. However, in the HPCL group, a substantial distribution of large areas stained positive was evident, occupying a significant portion of the liver section. Upon treatment, the different groups displayed varying results. In the hesperidin-50 group, multiple scattered lesions with positive staining were observed. Similar patterns were seen in the treated groups. These lesions varied in size, ranging from large positive stained nodules in Hesperidin-50 and Pantoprazole-25 groups to small positive hepatic nodules of different sizes in Hesperidin-100, Cyan-10, and Pantoprazole-50 groups. Additionally, some groups displayed foci and collections of brown-stained cells scattered within the negatively stained hepatic parenchyma in Hesperidin-200, Cyan-20, Cyan-30, Pantoprazole-50, and Pantoprazole-100 groups (Figure 4) (Table 6). The results of the PCNA immunohistochemical analysis indicated that there was a higher expression of PCNA in the HPCL group when compared to the control group (Figures 5A, B). Furthermore, when examining liver sections from rats treated with different doses, it was observed that there was a decrease in the number of nuclei that exhibited positive staining for PCNA (Figures 5C–L).
Figure 3. Photomicrographs of liver sections stained with H&E. (A) The Control group shows normal hepatic architecture as hepatocytes are arranged radially around the central vein, the structure of the hepatic sinus was clear and there was no pathological change. (B) HPCL Group shows massive destruction of liver architecture with infiltration of inflammatory cells and hemorrhagic area. (C–K) Treated groups; (C) hesperidin-50 Group, (D) hesperidin-100 Group, (E) hesperidin-200 Group, (F) Cyan-10 Group, (G) Cyan-20 Group, (H) Cyan-30 Group, (I) Pantoprazole-25 Group, (J) Pantoprazole-50 Group, and (K) Pantoprazole-100 Group. (magnification: (A–K): × 100).
Figure 4. Photomicrographs of rats’ liver sections immunohistochemically stained with glutathione S transferase-P (GST-P) antibody. (A) The control group shows a negative reaction. (B, C) HPCL Group shows the massive distribution of large positive stained areas occupying most of the section. (D–L) Treated groups; (D) hesperidin-50 Group, (E) hesperidin-100 Group, (F) hesperidin-200 Group, (G) Cyan-10 Group, (H) Cyan-20 Group, (I) Cyan-30 Group, (J) Pantoprazole-25 Group, (K) Pantoprazole-50 Group, and (L) Pantoprazole-100 Group. (M) A bar chart depicts the levels of expression of GST-P across the different animal groups. (magnification: (A–L):× 40).
Figure 5. Photomicrographs of rats’ liver sections immunohistochemically stained with PCNA show brown stained nucleus indicating a positive immune reaction of hepatic cells scattered in between negatively stained hepatic parenchymal cells. (A) The control group shows a negative reaction. (B) HPCL Group shows a massive distribution of positively stained cells occupying most of the section. (C) Hesperidin-50 Group, (D) hesperidin-100 Group, (E) hesperidin-200 Group, (F) Cyan-10 Group, (G) Cyan-20 Group, (H) Cyan-30 Group, (I) Pantoprazole-25 Group, (J) Pantoprazole-50 Group, (K) Pantoprazole-100 Group. (magnification: (A–K): × 100). (L) A bar chart depicts the levels of expression of PCNA across the different animal groups.
5 Machine learning-based analysis
5.1 Prediction of the treatment response using the machine learning models
Table 7 shows the features included in and excluded from the reduced models based on their importance to the classification process using the greedy SFS approach. Only two molecular features (miR-1289 and TUBG mRNA), two biochemical features (ALT and TG), and one IHC feature (GSTP) are kept.
Table 7. Features kept for the reduced models are listed under the “Included features” column, while the discarded features are listed under the “Excluded features” column.
A set of performance metrics for the complete and reduced models can be found in Tables 8–12 for the LR, kNN, NN, RF, and SVM classifiers; respectively. These metrics are accuracy, precision, recall (sensitivity), specificity, and the Matthews Correlation Coefficient (MCC). The MCC provides unbiased, more accurate evaluation for binary classification problems, especially for the cases of unbalanced data (Chicco and Jurman, 2020). Although we used the SMOTE technique to even the number of samples between the two classification classes, it is advised to still use the MCC to compare performance between the different machine learning models. We used k = 3 for this study. Complete results for each iteration for each classifier can be found in Supplementary Files S2–S6.
Table 8. Model prediction using multivariate LR for both the complete models and the reduced models.
Table 12. Model prediction using SVM classification for both the complete models and the reduced models.
Using each data type alone (Molecular, Biochemical, IHC) in both the complete and reduced models, and generally speaking for the different classifiers, the molecular model gave the best MCC values, while the worst performance came from the biochemical data for the LR and kNN classifiers, and from the IHC data for the NN, RF, and SVM classifiers. For the reduced models, the MCC values for the molecular models were close to those from the complete models, and they were better than those of the complete models for the biochemical and IHC data for all the classifiers except the SVM classifier.
Integrating two or all the data types in one model for the case of the complete models, generally, degraded the performance compared to the best accuracy by any of the single-type models. On the other hand, adding the molecular data to either the biochemical data or the IHC data enhanced the performance compared to either of their individual cases. A few cases of integrating more than one data type demonstrated being slightly superior to the performance of the best individual data type such as combining the molecular and biochemical or molecular and IHC data types with the NN classifier.
In most cases, the reduced models combining the molecular data type showed consistent performance comparable to the reduced molecular model, because the greedy SFS approach favored the important molecular features over those from the biochemical and IHC features, emphasizing the stronger relationship between the phenotype and the genotypic features.
Figure 6 shows the receiver operating characteristics (ROC) curve (Figure 2A) and the confusion matrix (Figure 2B) for the reduced model using the RF classifier and the molecular features. The average area under the ROC curve (AUC) is 0.96 ∓ 0.04 coming from three cross-validation iterations. Moreover, this model successfully predicted 138 responsive (positive) samples out of 139 samples (true positives) and 36 non-responsive (negative) samples out of 39 samples (true negatives). All the ROC curves and confusion matrices for all the models can be found in the Supplementary Figures S6, S7 in Supplementary 1.
Figure 6. (A) The Receiver Operating Characteristics (ROC) curve, and (B) the confusion matrix for the reduced model with the random forest (RF) classifier using the molecular data type.
5.2 Potential biomarkers identification
Tables 13–15 show the hypothesis testing results for the molecular, biochemical, and IHC data types, respectively, comparing the individual features of each data type against the control and against the disease model. These tables do not show the results for all the features, but the results for features with positive or negative promise to be biomarkers. Complete results can be found in the Supplementary Files S7–S12. To summarize the number of occurrences for each treatment drug as either an effective or ineffective biomarker, Table 16 shows these numbers across the three data types. Generally speaking, increasing the dose enhances the chance for a drug to be effective.
Table 13. Molecular data: Statistically assessing the difference between values of a feature post-treatment against the control and against the disease model. Only statistically insignificant differences are shown. Similar behavior to the control case is a positive sign, while similarity to the disease model is not favorable. q-values are after using the false discovery rate (FDR) approach to correct for multiple hypotheses.
Table 14. Biochemical data: Statistically assessing the difference between values of a feature post-treatment against the control and against the disease model. Only statistically insignificant differences are shown. Similar behavior to the control case is a positive sign, while similarity to the disease model is not favorable. q-values are after using the false discovery rate (FDR) approach to correct for multiple hypotheses.
Table 15. IHC data: Statistically assessing the difference between values of a feature post-treatment against the control and against the disease model. Only statistically insignificant differences are shown. Similar behavior to the control case is a positive sign, while similarity to the disease model is not favorable. q-values are after using the false discovery rate (FDR) approach to correct for multiple hypotheses.
Table 16. Summary for how many times a treatment drug showed potential for being a treatment response biomarker and how many times it showed incapability. Numbers are displayed in the format (molecular, biochemical, IHC).
6 Discussion
The field of precision medicine has been enriched by recent advancements in pharmacogenomics, providing a wealth of diverse data types. Analyzing these data has aided in identifying unique cellular sensitivity and resistance patterns of different targets to numerous chemical compounds. One of the key goals of cancer research is to uncover the genomic and molecular characteristics responsible for specific clinical outcomes (Aubrecht et al., 2013). Predictive machine learning models with gene expression features offer an opportunity to explore the impact of molecular features on treatment response, enhancing our understanding of cancer vulnerabilities and enabling the development of predictive models for drug responsiveness. In this study, we employed bioinformatics and machine learning techniques to select robust features from a set of 14 molecular, 12 biochemical, and 3 IHC markers to predict the response to three different drugs (Hesperidin, Cyan, and Pantoprazole) at varying dosages in an HCC rat model.
In our study, we chose three drugs (hesperidin, Pantoprazole, and cyan) due to their established hepatoprotective properties, which include anti-inflammatory and antiapoptotic effects. Hesperidin exhibits anti-cancer effects by inducing cell death in hepatocellular carcinoma HepG2 cells through caspase-independent pathways, primarily by activating the ERK1/2 pathway (Yumnam et al., 2014). Moreover, it induces apoptosis by increasing the accumulation of reactive oxygen species (ROS) and activating the apoptosis signal-regulating kinase 1/Jun N-terminal kinase (ASK1/JNK) pathway (Xia et al., 2018). Additionally, hesperidin inhibits calcium/calmodulin-dependent protein kinase IV (CAMKIV), thereby activating the caspase-3-dependent intrinsic pathway and upregulating the pro-apoptotic protein Bax, which contributes to its anti-apoptotic and anticancer properties (Naz et al., 2019). Recent evidence suggests that hesperidin induces apoptosis in HeLa cells by involving the endoplasmic reticulum stress pathway and arresting the cell cycle at the G0/G1 phase through the downregulation of cyclin D1, cyclin E1, and cyclin-dependent kinase 2 (Cdk2) at the protein level (Wang et al., 2015). Cyan effectively inhibits hepatic gluconeogenesis by reducing the expression of gluconeogenic genes. This inhibition is achieved through the phosphorylation and inactivation of coactivators CRTC2 and HDAC5 by AMPK. Although Cyan does not directly interact with AMPK, it activates AMPK via the adiponectin receptor signaling pathway, as evidenced by experiments involving the knockdown of adiponectin receptor genes. Furthermore, Cyan increases cellular AMP levels in hepatocytes, and oral administration of Cyan in mice leads to elevated plasma adiponectin concentrations, collectively contributing to AMPK activation (Jia et al., 2022). Additionally, Cyan exhibits potent antioxidant properties and induces cellular senescence and apoptosis in hepatocarcinoma cells under oxidative stress conditions. Cyan enhances the expression of senescence-associated β-galactosidase and key markers of cellular senescence, namely P16, P21, and P53. Tan et al. (2020) found that Cyan exerted a protective effect on HepG2 cells against oxidative damage induced by H2O2 by reducing reactive radicals. Additionally, Cyan enhanced the expression of important antioxidant enzymes such as SOD, CAT, and GPx. Moreover, Cyan demonstrates a regulatory effect on cell survival and apoptosis by decreasing the levels of the pro-apoptotic protein Bax and increasing the levels of the anti-apoptotic protein Bcl-2 in HepG2 cells (Panritdum et al., 2024). In experiments using LX-2 cells and mouse hepatic stellate cells (pHSCs), Lu et al. demonstrated that pantoprazole dose-dependently notably alleviated liver damage, reduced collagen buildup and inflammation, and inhibited the expression of fibrosis-related genes like Col1a1, Acta2, Tgfβ1, and Mmp-2. Through transcriptome analysis and subsequent validation in pantoprazole-treated LX-2 cells, they found that pantoprazole hindered the expression of Yes-associated protein (YAP) and its downstream targets, such as CTGF, ID1, survivin, CYR61, and GLI2. By manipulating YAP expression levels, they demonstrated that PPZ downregulated hepatic fibrogenic gene expression via YAP. Furthermore, pantoprazole facilitated the degradation and ubiquitination of YAP via the proteasome, inhibiting HSC activation. PPZ also disrupted the interaction between the deubiquitinating enzyme OTUB2 and YAP, leading to YAP destabilization and impeding hepatic fibrosis progression (Lu et al., 2021).
Additionally, these drugs have demonstrated efficacy in regulating exosome production and autophagy in HCC. Exosomes are now acknowledged as vital facilitators of communication between cells, and their miRNAs, which are involved in cell proliferation and metastasis, play essential roles in the progression of HCC (Mathieu et al., 2019; Zhang et al., 2021). Exosomes stimulate intercellular autophagy to eliminate the negative effects caused by cancer-causing agents, while autophagy, in turn, regulates both the biogenesis and degradation of exosomes (Bunggulawa et al., 2018). Previous research conducted by our team has shown that hesperidin treatment induces the formation of multiple hepatic vacuoles at various stages of autophagosome development. Our findings revealed the gradual emergence of multilamellar bodies, which are specific autophagic vacuoles composed of concentric membrane layers enclosing cellular organelles. Hesperidin has been found to enhance the activity of autophagy in hepatocytes. Additionally, it downregulates exosomal RAB11A mRNA and upregulates exosomal miR-1298, resulting in protective effects on the liver in rat models (Hasanin et al., 2020). Pantoprazole primarily targets gastric H+/K+ ATPases and V-ATPases (Spugnini et al., 2015). Its anti-neoplastic effects are attributed to the inhibition of V-ATPase activity, leading to cytosolic acidification and alkalization of endosomal and lysosomal compartments (De Milito et al., 2012). Our previous study proved that Pantoprazole administration reduces the expression of GST-P and PCNA, significantly decreases exosomal RAB11A mRNA levels, along with downregulating exosomal Lnc-RNA-RP11-513I15.6, which were found to be elevated in the HCC animal model (Matboli et al., 2019). The beneficial effects of Anthocyanins, including cyan, involve various signaling pathways such as MAPK, NF-κB, AMPK, and Wnt/β-catenin, as well as critical cellular processes including apoptosis and autophagy (Li et al., 2017). Previous research has demonstrated that cyan exhibits a protective effect against high glucose-induced podocyte dysfunction by improving autophagy, reducing apoptosis, and suppressing epithelial-mesenchymal transition through the activation of the SIRT1/AMPK pathway (Wang et al., 2021). Moreover, cyan-enhanced autophagy reduces apoptosis in primary human dermal fibroblasts by mitigating oxidative stress, suggesting a potential mechanism for protection against UVA light-induced damage (Wu et al., 2019).
Most cases of HCC arise from prolonged inflammation in the liver, leading to disruptions in various cellular signaling pathways. These changes in the liver environment disrupt the equilibrium between cell proliferation and cell death, ultimately increasing the likelihood of malignant transformation. Over 90% of HCC cases arise in the presence of hepatic injury and inflammation. The risk factors associated with HCC trigger an unresolved inflammatory response characterized by the infiltration of macrophages and immature myeloid cells, as well as dysregulated cytokine production, and this perpetuates a wound-healing response, leading to the progressive development of fibrosis, cirrhosis, and ultimately HCC (Keenan et al., 2019). During the premalignant stage of hepatocarcinogenesis, chronic activation of inflammatory signaling pathways generates reactive oxygen species (ROS) and reactive nitrogen species (NOS) (Refolo et al., 2020). This chronic inflammation, characterized by the infiltration of macrophages and immature myeloid cells and dysregulated cytokine production, is considered a primary trigger for the development and progression of HCC. In the premalignant environment, inflammatory cells, including stromal cells, produce various molecules such as cytokines, growth factors, chemokines, prostaglandins, and proangiogenic factors (Yu et al., 2018). Autophagy has a significant relationship with inflammation in HCC (Yu et al., 2017). In the early stages of hepatocarcinogenesis, autophagy helps alleviate oxidative stress, prevents genomic instability, and limits uncontrolled inflammation, thereby playing a crucial role in tumor suppression, however, as HCC progresses and autophagy increases, it becomes relevant to poor prognosis and is associated with tumor cell survival under conditions that typically induce cell death, such as hypoxia and nutrient deprivation (Sun et al., 2013). In addition, autophagy deficiency in macrophages can lead to the production of inflammatory and fibrogenic factors (Sun et al., 2021). The depletion of Kupffer cells has been shown to rescue the tumor-promoting effect of autophagy deficiency during the preneoplastic stage. This suggests that autophagy regulates inflammation and fibrosis-promoting effects, potentially through the modulation of NF-κB-associated pathways and the production of cytokines such as IL1α/β (Sun et al., 2017). Therefore, the interplay between autophagy and inflammation is a critical aspect of HCC progression and may offer new insights for targeted therapeutic strategies.
ATG16-L1, an essential protein in autophagy, plays a crucial role in HCC, its deficiency leads to decreased bacterial clearance and abnormal interleukin-1β production, fostering inflammation and carcinogenesis (Peantum et al., 2018). The ATG16-L1 p.T300A polymorphism emerged as a significant risk factor for HCC in cirrhotic patients (Reuken et al., 2019). Studies using a mouse model showed that increased ATG16L1 activation is required to prevent the progression of steatohepatitis and the onset of HCC. Furthermore, the overexpression of ATG16-L1 suppressed NF-κB and IL6 signaling pathways, suggesting its potential role in mitigating HCC development (Okada et al., 2017). RAB11A serves as a target molecule involved in various microRNA-mediated tumor suppression processes. Zhang et al. proved that RAB11A is involved in HCC progression by regulating the expression of MMP2 by activating the PI3K/AKT signaling pathway (Zhang et al., 2020). The overexpression of cyclin E in HCC is believed to play a crucial role in promoting cell proliferation and survival (Zhou et al., 2003). Cyclin E1 and cyclin E2, which interact with cyclin-dependent kinase 2 (Cdk2), are known to stimulate cell cycle progression and initiation of HCC (Sonntag et al., 2018). Chen et al. provided evidence showing that TUBG1 exhibited substantial upregulation in both non-alcoholic fatty liver disease (NAFLD) and HCC tissues. This suggests that TUBG1 serves as a carcinogenic factor contributing to the development of NAFLD and HCC (Chen et al., 2022). Apoptosis induction triggered by oncogene activation, DNA damage, and senescence is a recognized mechanism during chronic liver damage, crucial for cancer prevention (Zhang et al., 2016). The proteins BAX and BAK control the choice between cell survival and mitochondrial apoptosis. Thus, assessing the regulation of BAX/BAK can anticipate cellular predisposition to apoptosis (Funk et al., 2020). p53 plays an essential role in promoting growth arrest and inducing cell death. The inactivation or loss of functional p53 is a necessary condition for oncogenesis, as it allows for uncontrolled growth and proliferation. This is a prevalent abnormality observed in various human cancers, including HCC (Luo et al., 2021).
Furthermore, our investigation revealed dysregulated expression levels of various mRNA fragments in the HCC rat model. Notably, Cyclin E and TUBG mRNAs showed increased expression, while BAX, ATG16-L1, P53, and RAB11 mRNA exhibited downregulation. The administration of the drugs resulted in the modulation of these expression disturbances. Importantly, previous research, including our own, has highlighted the significance of these biomarkers in HCC pathogenesis. For instance, RAB11A and ATG16L1 are core genes involved in autophagy, and their interaction suggests a novel model for autophagosome biogenesis. The BAX mRNA serves as a central regulator of cell death and mitochondrial dysfunction. The loss of tumor suppressor p53, often observed in tumors, leads to defects in the cell cycle and hampers the ability to respond to DNA damage or oncogene dysregulation, ultimately inducing apoptosis or cellular senescence. Cyclin E contributes to increased proliferative drive favoring hepatocarcinogenesis. Moreover, cyclin E and p53 have been reported to inversely regulate each other’s expression, possibly mediated by miR-34 (Pok et al., 2013). The overexpression of γ-TUBG has been identified as a characteristic feature of thyroid, breast, and liver cancers it also can induce carcinogenesis (Niu et al., 2009; Hořejší et al., 2012).
Abnormal expression of non-coding RNAs (such as miRNAs, lncRNAs, and cicRNA) has been detected in HCC regulation and linked to critical aspects of cell invasion, metastasis, and drug resistance. In our study, we observed significant disruptions in the expression of non-coding RNAs following the induction of HCC. Notably, circ_0001345 showed a pronounced decrease in expression, accompanied by an upregulation of miR-106b, while other miRNAs (miR-1262, miR-125b, and miR-1289) exhibited downregulation. Additionally, all examined lncRNAs (lncRNA-MALAT, lncRNA-RP11-513I15.6, and lncRNA-RP11-583F2.2) displayed downregulation. Interestingly, the administration of the drugs resulted in the restoration of these expression disruptions.
Regarding the biochemical analysis, we observed in the HPCL considerable elevation in liver destructive markers, indicating liver damage. Additionally, the serum lipid profile displayed significant alterations, particularly a significant decrease in HDLC levels. The AFP, a well-established biomarker used for HCC screening, diagnosis, prognostication, and therapeutic evaluation, (Hu et al., 2022) showed a striking increase. However, following drug administration, we observed enhanced biochemical and AFP levels. This was accompanied by a decrease in fibrotic and inflammatory states, as well as a significant reduction in the percentage area of GSTP foci, PCNA expression, and TNF levels, particularly with higher drug doses.
Machine learning algorithms a branch of AI are increasingly being employed for personalized predictions of drug responses (Vadapalli et al., 2022). These algorithms facilitate the integration of data from diverse sources in a statistically meaningful manner, allowing for the identification of predictive biomarkers (Yamanishi et al., 2012; Rampášek et al., 2019). A critical aspect of this process is how data from multiple sources are harmoniously integrated to enhance the overall prediction performance of drug responses. In alignment with this concept, various approaches have been developed for drug response prediction by leveraging prior knowledge based on genomic and molecular profiles (Stanfield et al., 2017; Cichonska et al., 2018). These approaches aim to exploit the wealth of information embedded in biological systems to refine predictions. This is vital for aiding clinicians in determining the most efficient and least harmful therapeutic choices, facilitating a more intelligent selection and monitoring of patients participating in clinical trials, and contributing to the ongoing evolution of personalized medicine (Simon, 2013; Relling and Evans, 2015; Aronson and Rehm, 2015). In this context, we built predictive models using three different types of signatures, encompassing molecular, biochemical, and IHC features. One of the primary objectives was to assess which signature type might be predictive of treatment response in ML models. In addition to the complete models, we constructed several reduced models by employing the greedy forward sequential feature selection (SFS) approach with random forests as the estimator, we aimed to identify the smallest set of features that best represent each model. The selected molecular features were miR-1289 and TUBG mRNA, while the biochemical features included ALT and TG, and the IHC features comprised GSTP.
Our study showed that the molecular model consistently achieved the highest MCC values when considering each data type individually—Molecular, Biochemical, and IHC—in both complete and reduced models across various classifiers. In reduced models, MCC values for molecular models closely resemble those of complete models and outperform them in the case of biochemical and IHC data across all classifiers except SVM. Integrating multiple data types in complete models generally diminishes performance compared to single-type models. However, integrating molecular data with either biochemical or IHC data enhanced performance compared to individual cases, and in some instances, combining multiple data types slightly improves performance over the best individual data type, such as with the NN classifier. Reduced models combining molecular data consistently perform well, as the greedy SFS approach favored the important molecular features over IHC and biochemical features. Also, the ROC curve and confusion matrix for the reduced model employing the RF classifier with molecular features showed an average AUC of 0.96 ± 0.04 from three cross-validation iterations. Additionally, this model accurately predicts 138 responsive and 36 non-responsive samples out of the total samples highlighting its robustness.
Furthermore, we employed a rigorous analysis approach to select potential biomarkers that exhibit promising treatment responses, closely resembling the normal response. We focused on features with p-values exceeding the predetermined alpha threshold when comparing individual features of each data type against the control group, meaning that the control samples and the feature samples might come from the same population. These features were considered highly promising due to their ability to bring the treatment response closer to that of the normal response. The results revealed a total of seven highly promising molecular responses within our drug spectrum that exhibited statistically insignificant results when compared to the control model. Among these responses, three were lncRNAs, two were miRNAs, and two were mRNAs. Specifically, lncRNA-RP11-513I15.6 demonstrated its potential as a marker for Hesperidin-200, Cyan (at 10 and 20 mg doses), and Pantoprazole-100. MiR-125b showcased promise for Hesperidin-50 and Pantoprazole-25, while LNC-RP11-583F2.2 exhibited potential for Hesperidin-100 and pantoprazole-50. Furthermore, lncRNA-MALAT displayed significant associations with Hesperidin-200 and Cyan-30, miR-125b was linked to Hesperidin-50 and Pantoprazole-25, miR-106b showed potential with Cyan-30, and TUBG mRNA displayed promise with Cyan-30. Notably, RAB11 mRNA exhibited an interesting response when treated with Hesperidin-50.
In addition to these molecular markers, seven biochemical markers showed as highly predictive candidates (AST, ALP, Albumin, TC, TG, HDL-C, LDL-C) and TNF as important IHC biomarkers. These findings highlight the potential significance of a comprehensive approach that combines multiple types of markers to better understand drug response. A particularly noteworthy finding is that increasing the dosage of the administered drug enhances the likelihood of achieving effective results.
Prior studies have examined the connections between genomic profiles and drug response (Barretina et al., 2012; Garnett et al., 2012). Numerous algorithms for predicting drug sensitivity have been put forth like logistic regression that recognized as a comparative method in multiple studies in drug sensitivity predictions (Daemen et al., 2013; Menden et al., 2013; Costello et al., 2014; Kim et al., 2012). Zeng et al. (2022) developed artificial intelligence models utilizing digital histological images that demonstrate the capability to predict the activation of multiple immune and inflammatory gene signatures with an AUC > 0.8. Riddick et al. (2011) developed a multistep algorithm to predict in vitro drug sensitivity using gene expression data from the NCI60 panel. Their approach, tested on 19 breast cancer cell lines, involved feature selection, elimination of outlying cell lines, and training random forest regression models. The authors concluded that their algorithm surpassed existing techniques in predicting drug sensitivity. Daemen et al. (2013) drug sensitivity data for 138 drugs and molecular profiles from 70 breast cancer cell lines. This data was utilized to train models that classify samples into two categories: those responding well to treatment and those responding poorly. Subsequently, the models were validated using independent patient-derived data, focusing on two drugs commonly employed in breast cancer treatment tamoxifen and valproic acid. In ovarian cancer, Chen et al. (2015) demonstrated the possible significance of a 61-transcript expression pattern in anticipating a patient’s reaction to platinum-taxane chemotherapy. Importantly, when this expression pattern was integrated with the BRCA1/2 mutation status, a conventional prognostic marker in this context, it resulted in improved patient stratification based on clinical response.
While current models cannot fully replicate the complexities of human disease, significant progress has been made in developing HCC animal models (Fornari et al., 2022). These models are essential for foundational research, enabling more convenient and effective studies of disease mechanisms, therapeutic targets, and drug screening (Liao et al., 2024). The creation of models capable of monitoring the onset, progression, and reversibility of drug-induced toxicity remains a crucial objective, as they are invaluable for designing safer and more effective drugs (Blidisel et al., 2021).
This study has limitations due to the small sample size, and our investigation concentrated on a restricted selection of drugs, potentially not capturing the full array of therapeutic options for HCC. Future inquiries encompassing a wider variety of medications and treatment approaches could offer a more thorough insight into effective treatment methodologies for this intricate condition. Further studies are imperative to evaluate the efficacy of the chosen mRNA-miRNA-LncRNA-Circ-RNA in HCC with diverse etiologies. Since we solely examined these molecular markers in animal specimens, it’s essential to corroborate these findings through multicenter clinical trials involving human subjects. Expanding our research to include human samples will significantly enhance the relevance and applicability of our findings. We recommend more studies to be held in this context to significantly enhance the relevance and applicability of our findings. Training our current machine learning models on preclinical datasets and validating them using human clinical data will allow us to assess and refine the models’ predictive performance in a real-world setting.
7 Conclusion
In conclusion, our study introduces a machine-learning algorithm capable of accurately predicting the treatment response to three distinct drugs: Pantoprazole, Cyan, and Hesperidin. Our approach involved a comprehensive panel that incorporated traditional HCC biochemical, molecular, and IHC features. This integration of multiple markers not only enhanced the reliability and accuracy of the panel but also ensured cost-effectiveness and speed by utilizing RT-PCR in addition to conventional testing methods instead of more expensive high-profile approaches (Cohen et al., 2018).
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Ethics statement
The animal study was approved by the study was approved by the Ain Shams Research Ethics Committee, Faculty of Medicine, Egypt, FMASU-R 111/2022, FWA 000017585. In accordance with the guidelines of the Declaration of Helsinki. The study was conducted in accordance with the local legislation and institutional requirements.
Author contributions
MM: Writing–review and editing, Writing–original draft, Validation, Supervision, Project administration, Methodology, Funding acquisition, Conceptualization. HA-A: Writing–original draft, Methodology. AK: Writing–original draft, Investigation, Formal Analysis. RK: Writing–original draft, Visualization, Validation. MA: Writing–review and editing, Methodology. HK: Validation, Writing–review and editing. MH: Writing–original draft, Methodology, Investigation. HE: Writing–review and editing, Software. EH: Visualization, Writing–original draft. IY: Supervision, Software, Resources, Formal Analysis, Data curation, Writing–original draft, Visualization.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This study was supported by Mounir Armanious Research Center call 1, 2021. (M. AR. C.) 2021-1 and Science and Technology Development Fund (STDF), BARG call 7, Project ID: 38135.
Acknowledgments
We acknowledge Marian M. S. Roushdy, Laila M. Farid, and Mariam B. Abouelkhair for their effort in this study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmolb.2024.1430794/full#supplementary-material
References
Aggarwal, V., Tuli, H. S., Thakral, F., Singhal, P., Aggarwal, D., Srivastava, S., et al. (2020). Molecular mechanisms of action of hesperidin in cancer: recent trends and advancements. Exp. Biol. Med. 245 (5), 486–497. doi:10.1177/1535370220903671
Aronson, S. J., and Rehm, H. L. (2015). Building the foundation for genomics in precision medicine. Nature 526 (7573), 336–342. doi:10.1038/nature15816
Aubrecht, J., Schomaker, S. J., and Amacher, D. E. (2013). Emerging hepatotoxicity biomarkers and their potential to improve understanding and management of drug-induced liver injury. Genome Med. 5, 85–93. doi:10.1186/gm489
Banjerdpongchai, R., Wudtiwai, B., Khaw-On, P., Rachakhom, W., Duangnil, N., and Kongtawelert, P. (2016). Hesperidin from citrus seed induces human hepatocellular carcinoma HepG2 cell apoptosis via both mitochondrial and death receptor pathways. Tumor Biol. 37, 227–237. doi:10.1007/s13277-015-3774-7
Barretina, J., Caponigro, G., Stransky, N., Venkatesan, K., Margolin, A. A., Kim, S., et al. (2012). The Cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature 483 (7391), 603–607. doi:10.1038/nature11003
Blidisel, A., Marcovici, I., Coricovac, D., Hut, F., Dehelean, C. A., and Cretu, O. M. (2021). Experimental models of hepatocellular carcinoma—a preclinical perspective. Cancers 13 (15), 3651. doi:10.3390/cancers13153651
Bridoux, M., Simon, N., and Turpin, A. (2022). Proton pump inhibitors and cancer: current state of play. Front. Pharmacol. 13, 798272. doi:10.3389/fphar.2022.798272
Bunggulawa, E. J., Wang, W., Yin, T., Wang, N., Durkan, C., Wang, Y., et al. (2018). Recent advancements in the use of exosomes as drug delivery systems. J. Nanobiotechnology 16 (1), 81–83. doi:10.1186/s12951-018-0403-9
Calderaro, J., Seraphin, T. P., Luedde, T., and Simon, T. G. (2022). Artificial intelligence for the prevention and clinical management of hepatocellular carcinoma. J. Hepatology 76 (6), 1348–1361. doi:10.1016/j.jhep.2022.01.014
Castro-Gil, M. P., Sanchez-Rodriguez, R., Torres-Mena, J. E., Lopez-Torres, C. D., Quintanar-Jurado, V., Gabino-Lopez, N. B., et al. (2021). Enrichment of progenitor cells by 2-acetylaminofluorene accelerates liver carcinogenesis induced by diethylnitrosamine in vivo. Mol. Carcinog. 60 (6), 377–390. doi:10.1002/mc.23298
Chen, P., Huhtinen, K., Kaipio, K., Mikkonen, P., Aittomäki, V., Lindell, R., et al. (2015). Identification of prognostic groups in high-grade serous ovarian cancer treated with platinum–taxane chemotherapy. Cancer Res. 75 (15), 2987–2998. doi:10.1158/0008-5472.CAN-14-3242
Chen, T., Zhang, S., Zhou, D., Lu, P., Mo, X., Tamrakar, R., et al. (2022). Screening of co-pathogenic genes of non-alcoholic fatty liver disease and hepatocellular carcinoma. Front. Oncol. 12, 911808. doi:10.3389/fonc.2022.911808
Chen, X., Zhang, W., and Xu, X. (2021). Cyanidin-3-glucoside suppresses the progression of lung adenocarcinoma by downregulating TP53I3 and inhibiting PI3K/AKT/mTOR pathway. World J. Surg. Oncol. 19, 232–2. doi:10.1186/s12957-021-02339-7
Chicco, D., and Jurman, G. (2020). The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation. BMC genomics 21, 6–3. doi:10.1186/s12864-019-6413-7
Chidambaranathan-Reghupaty, S., Fisher, P. B., and Sarkar, D. (2021). Hepatocellular carcinoma (HCC): epidemiology, etiology and molecular classification. Adv. cancer Res. 149, 1–61. doi:10.1016/bs.acr.2020.10.001
Cichonska, A., Pahikkala, T., Szedmak, S., Julkunen, H., Airola, A., Heinonen, M., et al. (2018). Learning with multiple pairwise kernels for drug bioactivity prediction. Bioinformatics 34 (13), i509–i518. doi:10.1093/bioinformatics/bty277
Cohen, J. D., Li, L., Wang, Y., Thoburn, C., Afsari, B., Danilova, L., et al. (2018). Detection and localization of surgically resectable cancers with a multi-analyte blood test. Science 359 (6378), 926–930. doi:10.1126/science.aar3247
Costello, J. C., Heiser, L. M., Georgii, E., Gönen, M., Menden, M. P., Wang, N. J., et al. (2014). A community effort to assess and improve drug sensitivity prediction algorithms. Nat. Biotechnol. 32 (12), 1202–1212. doi:10.1038/nbt.2877
Daemen, A., Griffith, O. L., Heiser, L. M., Wang, N. J., Enache, O. M., Sanborn, Z., et al. (2013). Modeling precision treatment of breast cancer. Genome Biol. 14, R110–R114. doi:10.1186/gb-2013-14-10-r110
De Milito, A., Lucia, M. M., and Fais, S. (2012). A rationale for the use of proton pump inhibitors as antineoplastic agents. Curr. Pharm. Des. 18 (10), 1395–1406. doi:10.2174/138161212799504911
Dudekula, D. B., Panda, A. C., Grammatikakis, I., De, S., Abdelmohsen, K., and Gorospe, M. (2016). CircInteractome: a web tool for exploring circular RNAs and their interacting proteins and microRNAs. RNA Biol. 13 (1), 34–42. doi:10.1080/15476286.2015.1128065
Eun, J. W., Cheong, J. Y., Jeong, J. Y., and Kim, H. S. (2023). A New understanding of long non-coding RNA in hepatocellular carcinoma—from m6A modification to blood biomarkers. Cells 12 (18), 2272. doi:10.3390/cells12182272
Fang, Y., Zhang, X., Huang, H., and Zeng, Z. (2023). The interplay between noncoding RNAs and drug resistance in hepatocellular carcinoma: the big impact of little things. J. Transl. Med. 21 (1), 369–374. doi:10.1186/s12967-023-04238-9
Feng, G. Y., Cheng, Y., Chen, K., and Shi, Z. R. (2022). Correlation between immunohistochemical markers in hepatocellular carcinoma cells and in vitro high-throughput drug sensitivity screening. Can. J. Gastroenterology Hepatology. doi:10.1155/2022/5969716
Fernández-Bedmar, Z., Anter, J., Alonso-Moraga, A., Martín de las Mulas, J., Millán-Ruiz, Y., and Guil-Luna, S. (2017). Demethylating and anti-hepatocarcinogenic potential of hesperidin, a natural polyphenol of Citrus juices. Mol. Carcinog. 56 (6), 1653–1662. doi:10.1002/mc.22621
Fornari, F., Giovannini, C., Piscaglia, F., and Gramantieri, L. (2022). Animal models of hepatocellular carcinoma: current applications in clinical research. J. Hepatocell. Carcinoma 31, 1263–1278. doi:10.2147/JHC.S347946
Fu, P., Gong, B., Li, H., Luo, Q., Huang, Z., Shan, R., et al. (2022). Combined identification of three lncRNAs in serum as effective diagnostic and prognostic biomarkers for hepatitis B virus-related hepatocellular carcinoma. Int. J. Cancer 151 (10), 1824–1834. doi:10.1002/ijc.34201
Funk, K., Czauderna, C., Klesse, R., Becker, D., Hajduk, J., Oelgeklaus, A., et al. (2020). BAX redistribution induces apoptosis resistance and selective stress sensitivity in human HCC. Cancers 12 (6), 1437. doi:10.3390/cancers12061437
Garnett, M. J., Edelman, E. J., Heidorn, S. J., Greenman, C. D., Dastur, A., Lau, K. W., et al. (2012). Systematic identification of genomic markers of drug sensitivity in cancer cells. Nature 483 (7391), 570–575. doi:10.1038/nature11005
Hasanin, A. H., Habib, E. K., El Gayar, N., and Matboli, M. (2021a). Promotive action of 2-acetylaminofluorene on hepatic precancerous lesions initiated by diethylnitrosamine in rats: molecular study. World J. Hepatology 13 (3), 328–342. doi:10.4254/wjh.v13.i3.328
Hasanin, A. H., Habib, E. K., El Gayar, N., and Matboli, M. (2021b). Promotive action of 2-acetylaminofluorene on hepatic precancerous lesions initiated by diethylnitrosamine in rats: molecular study. World J. Hepatology 13 (3), 328–342. doi:10.4254/wjh.v13.i3.328
Hasanin, A. H., Matboli, M., and Seleem, H. S. (2020). Hesperidin suppressed hepatic precancerous lesions via modulation of exophagy in rats. J. Cell. Biochem. 121 (2), 1295–1306. doi:10.1002/jcb.29363
Hořejší, B., Vinopal, S., Sládková, V., Dráberová, E., Sulimenko, V., Sulimenko, T., et al. (2012). Nuclear γ-tubulin associates with nucleoli and interacts with tumor suppressor protein C53. J. Cell. physiology 227 (1), 367–382. doi:10.1002/jcp.22772
Hu, X., Chen, R., Wei, Q., and Xu, X. (2022). The landscape of alpha fetoprotein in hepatocellular carcinoma: where are we? Int. J. Biol. Sci. 18 (2), 536–551. doi:10.7150/ijbs.64537
Huang, Z., Zhou, J. K., Peng, Y., He, W., and Huang, C. (2020). The role of long noncoding RNAs in hepatocellular carcinoma. Mol. cancer 19, 77–78. doi:10.1186/s12943-020-01188-4
Jia, Y., Wu, C., Rivera-Piza, A., Kim, Y. J., Lee, J. H., and Lee, S. J. (2022). Mechanism of action of cyanidin 3-O-glucoside in gluconeogenesis and oxidative stress-induced cancer cell senescence. Antioxidants 11 (4), 749. doi:10.3390/antiox11040749
Jiang, W., Zhang, L., Guo, Q., Wang, H., Ma, M., Sun, J., et al. (2019). Identification of the pathogenic biomarkers for hepatocellular carcinoma based on RNA-seq analyses. Pathology and Oncol. Res. 25, 1207–1213. doi:10.1007/s12253-019-00596-2
Kang, C. L., Qi, B., Cai, Q. Q., Fu, L. S., Yang, Y., Tang, C., et al. (2019). LncRNA AY promotes hepatocellular carcinoma metastasis by stimulating ITGAV transcription. Theranostics 9 (15), 4421–4436. doi:10.7150/thno.32854
Keenan, B. P., Fong, L., and Kelley, R. K. (2019). Immunotherapy in hepatocellular carcinoma: the complex interface between inflammation, fibrosis, and the immune response. J. Immunother. cancer 7, 267–273. doi:10.1186/s40425-019-0749-z
Kim, D. C., Wang, X., Yang, C. R., and Gao, J. X. (2012). A framework for personalized medicine: prediction of drug sensitivity in cancer by proteomic profiling. InProteome Sci. 10, S13–S18. doi:10.1186/1477-5956-10-S1-S13
Kim, S., Jeong, S., Park, S. J., Chang, J., Choi, S., Cho, Y., et al. (2022). Association between proton pump inhibitor use and risk of hepatocellular carcinoma: a Korean nationally representative cohort study. J. Clin. Med. 11 (10), 2865. doi:10.3390/jcm11102865
Kleiner, D. E., Brunt, E. M., Van Natta, M., Behling, C., Contos, M. J., Cummings, O. W., et al. (2005). Design and validation of a histological scoring system for nonalcoholic fatty liver disease. Hepatology 41 (6), 1313–1321. doi:10.1002/hep.20701
Kooti, W., Servatyari, K., Behzadifar, M., Asadi-Samani, M., Sadeghi, F., Nouri, B., et al. (2017). Effective medicinal plant in cancer treatment, part 2: review study. J. evidence-based complementary and Altern. Med. 22 (4), 982–995. doi:10.1177/2156587217696927
Lee, Y. A., Noon, L. A., Akat, K. M., Ybanez, M. D., Lee, T. F., Berres, M. L., et al. (2018). Autophagy is a gatekeeper of hepatic differentiation and carcinogenesis by controlling the degradation of Yap. Nat. Commun. 9 (1), 4962. doi:10.1038/s41467-018-07338-z
Li, D., Wang, P., Luo, Y., Zhao, M., and Chen, F. (2017). Health benefits of anthocyanins and molecular mechanisms: update from recent decade. Crit. Rev. food Sci. Nutr. 57 (8), 1729–1741. doi:10.1080/10408398.2015.1030064
Liao, Y., Wei, F., He, Z., He, J., Ai, Y., Guo, C., et al. (2024). Animal-derived natural products for hepatocellular carcinoma therapy: current evidence and future perspectives. Front. Pharmacol. 15, 1399882. doi:10.3389/fphar.2024.1399882
Livak, K. J., and Schmittgen, T. D. (2001). Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. methods 25 (4), 402–408. doi:10.1006/meth.2001.1262
Llovet, J. M., Castet, F., Heikenwalder, M., Maini, M. K., Mazzaferro, V., Pinato, D. J., et al. (2022). Immunotherapies for hepatocellular carcinoma. Nat. Rev. Clin. Oncol. 19 (3), 151–172. doi:10.1038/s41571-021-00573-2
Lu, Z. N., Niu, W. X., Zhang, N., Ge, M. X., Bao, Y. Y., Ren, Y., et al. (2021). Pantoprazole ameliorates liver fibrosis and suppresses hepatic stellate cell activation in bile duct ligation rats by promoting YAP degradation. Acta Pharmacol. Sin. 42 (11), 1808–1820. doi:10.1038/s41401-021-00754-w
Luo, Y. D., Fang, L., Yu, H. Q., Zhang, J., Lin, X. T., Liu, X. Y., et al. (2021). p53 haploinsufficiency and increased mTOR signalling define a subset of aggressive hepatocellular carcinoma. J. hepatology 74 (1), 96–108. doi:10.1016/j.jhep.2020.07.036
Mak, K. M., and Mei, R. (2017). Basement membrane type IV collagen and laminin: an overview of their biology and value as fibrosis biomarkers of liver disease. Anatomical Rec. 300 (8), 1371–1390. doi:10.1002/ar.23567
Márquez-Quiroga, L. V., Arellanes-Robledo, J., Vásquez-Garzón, V. R., Villa-Treviño, S., and Muriel, P. (2022). Models of nonalcoholic steatohepatitis potentiated by chemical inducers leading to hepatocellular carcinoma. Biochem. Pharmacol. 195, 114845. doi:10.1016/j.bcp.2021.114845
Matboli, M., ElGwad, A. A., Hasanin, A. H., El-Tawdi, A., Habib, E. K., Elmansy, R. A., et al. (2019). Pantoprazole attenuates tumorigenesis via inhibition of exosomal secretion in a rat model of hepatic precancerous lesion induced by diethylnitrosamine and 2-acetamidofluorene. J. Cell. Biochem. 120 (9), 14946–14959. doi:10.1002/jcb.28757
Matboli, M., Hasanin, A. H., Hussein, R., El-Nakeep, S., Habib, E. K., Ellackany, R., et al. (2021). Cyanidin 3-glucoside modulated cell cycle progression in liver precancerous lesion, in vivo study. World J. Gastroenterology 27 (14), 1435–1450. doi:10.3748/wjg.v27.i14.1435
Mathieu, M., Martin-Jaular, L., Lavieu, G., and Théry, C. (2019). Specificities of secretion and uptake of exosomes and other extracellular vesicles for cell-to-cell communication. Nat. cell Biol. 21 (1), 9–17. doi:10.1038/s41556-018-0250-9
Menden, M. P., Iorio, F., Garnett, M., McDermott, U., Benes, C. H., Ballester, P. J., et al. (2013). Machine learning prediction of cancer cell sensitivity to drugs based on genomic and chemical properties. PLoS one 8 (4), e61318. doi:10.1371/journal.pone.0061318
Mirmalek, S. A., Faraji, S., Ranjbaran, S., Aryan, H., Arani, H. Z., Jangholi, E., et al. (2023). Cyanidin 3-glycoside induced apoptosis in MCF-7 breast cancer cell line. Archives Med. Sci. AMS 19 (4), 1092–1098. doi:10.5114/aoms.2020.93789
Naz, H., Tarique, M., Ahamad, S., Alajmi, M. F., Hussain, A., Rehman, M. T., et al. (2019). Hesperidin-CAMKIV interaction and its impact on cell proliferation and apoptosis in the human hepatic carcinoma and neuroblastoma cells. J. Cell. Biochem. 120 (9), 15119–15130. doi:10.1002/jcb.28774
Niu, Y., Liu, T., Tse, G. M., Sun, B., Niu, R., Li, H. M., et al. (2009). Increased expression of centrosomal alpha, gamma-tubulin in atypical ductal hyperplasia and carcinoma of the breast. Cancer Sci. 100 (4), 580–587. doi:10.1111/j.1349-7006.2008.01075.x
Okada, H., Takabatake, R., Honda, M., Takegoshi, K., Yamashita, T., Nakamura, M., et al. (2017). Peretinoin, an acyclic retinoid, suppresses steatohepatitis and tumorigenesis by activating autophagy in mice fed an atherogenic high-fat diet. Oncotarget 8 (25), 39978–39993. doi:10.18632/oncotarget.18116
Omar, A., Kaseb, A., Elbaz, T., El-Kassas, M., El Fouly, A., Hanno, A. F., et al. (2023). Egyptian society of liver cancer recommendation guidelines for the management of hepatocellular carcinoma [corrigendum]. J. Hepatocell. Carcinoma 10, 2101–2102. doi:10.2147/jhc.s442441
Panritdum, P., Muangnoi, C., Tuntipopipat, S., Charoenkiatkul, S., and Sukprasansap, M. (2024). Cleistocalyx nervosum var. paniala berry extract and cyanidin-3-glucoside inhibit hepatotoxicity and apoptosis. Food Sci. and Nutr. 12, 2947–2962. doi:10.1002/fsn3.3975
Peantum, J., Kunanopparat, A., Hirankarn, N., Tangkijvanich, P., and Kimkong, I. (2018). Autophagy related-protein 16-1 Up-regulated in hepatitis B virus-related hepatocellular carcinoma and impaired apoptosis. Gastroenterology Res. 11 (6), 404–410. doi:10.14740/gr1075w
Piñero, F., Dirchwolf, M., and Pessôa, M. G. (2020). Biomarkers in hepatocellular carcinoma: diagnosis, prognosis and treatment response assessment. Cells 9 (6), 1370. doi:10.3390/cells9061370
Pok, S., Wen, V., Shackel, N., Alsop, A., Pyakurel, P., Fahrer, A., et al. (2013). Cyclin E facilitates dysplastic hepatocytes to bypass G1/S checkpoint in hepatocarcinogenesis. J. Gastroenterology Hepatology 28 (9), 1545–1554. doi:10.1111/jgh.12216
Rampášek, L., Hidru, D., Smirnov, P., Haibe-Kains, B., and Goldenberg, A. (2019). Dr. VAE: improving drug response prediction via modeling of drug perturbation effects. Bioinformatics 35 (19), 3743–3751. doi:10.1093/bioinformatics/btz158
Refolo, M. G., Messa, C., Guerra, V., Carr, B. I., and D’Alessandro, R. (2020). Inflammatory mechanisms of HCC development. Cancers 12 (3), 641. doi:10.3390/cancers12030641
Relling, M. V., and Evans, W. E. (2015). Pharmacogenomics in the clinic. Nature 526 (7573), 343–350. doi:10.1038/nature15817
Reuken, P. A., Lutz, P., Casper, M., Al-Herwi, E., Stengel, S., Spengler, U., et al. (2019). The ATG16L1 gene variant rs2241880 (p. T300A) is associated with susceptibility to HCC in patients with cirrhosis. Liver Int. 39 (12), 2360–2367. doi:10.1111/liv.14239
Riddick, G., Song, H., Ahn, S., Walling, J., Borges-Rivera, D., Zhang, W., et al. (2011). Predicting in vitro drug sensitivity using Random Forests. Bioinformatics 27 (2), 220–224. doi:10.1093/bioinformatics/btq628
Safdar, M. A., Aslam, R. M., Shakeel, A., Shiza, W. M., Jmail, A., Mehmood, M. H., et al. (2023). Cyanidin as potential anticancer agent targeting various proliferative pathways. Chem. Biol. and Drug Des. 101 (2), 438–452. doi:10.1111/cbdd.14173
Schlosser, S., Tümen, D., Volz, B., Neumeyer, K., Egler, N., Kunst, C., et al. (2022). HCC biomarkers–state of the old and outlook to future promising biomarkers and their potential in everyday clinical practice. Front. Oncol. 12, 1016952. doi:10.3389/fonc.2022.1016952
Shi, L., Liu, B., Shen, D. D., Yan, P., Zhang, Y., Tian, Y., et al. (2021). A tumor-suppressive circular RNA mediates uncanonical integrin degradation by the proteasome in liver cancer. Sci. Adv. 7 (13), eabe5043. doi:10.1126/sciadv.abe5043
Simon, R. (2013). Drug-diagnostics co-development in oncology. Front. Oncol. 3, 315. doi:10.3389/fonc.2013.00315
Sonntag, R., Giebeler, N., Nevzorova, Y. A., Bangen, J. M., Fahrenkamp, D., Lambertz, D., et al. (2018). Cyclin E1 and cyclin-dependent kinase 2 are critical for initiation, but not for progression of hepatocellular carcinoma. Proc. Natl. Acad. Sci. 115 (37), 9282–9287. doi:10.1073/pnas.1807155115
Spugnini, E. P., Sonveaux, P., Stock, C., Perez-Sayans, M., De Milito, A., Avnet, S., et al. (2015). Proton channels and exchangers in cancer. Biochimica Biophysica Acta (BBA)-Biomembranes. 1848 (10), 2715–2726. doi:10.1016/j.bbamem.2014.10.015
Stanfield, Z., Coşkun, M., and Koyutürk, M. (2017). Drug response prediction as a link prediction problem. Sci. Rep. 7 (1), 40321. doi:10.1038/srep40321
Stärkel, P., De Saeger, C., Delire, B., Magat, J., Jordan, B., Konda, V. R., et al. (2015). Tetrahydro iso-alpha acids and hexahydro iso-alpha acids from hops inhibit proliferation of human hepatocarcinoma cell lines and reduce diethylnitrosamine induced liver tumor formation in rats. Nutr. cancer 67 (5), 748–760. doi:10.1080/01635581.2015.1032429
Su, Y., Lv, X., Yin, W., Zhou, L., Hu, Y., Zhou, A., et al. (2019). CircRNA Cdr1as functions as a competitive endogenous RNA to promote hepatocellular carcinoma progression. Aging (Albany NY) 11 (19), 8182–8203. doi:10.18632/aging.102312
Sun, K., Guo, X. L., Zhao, Q. D., Kou, X., Xie, X., Zhou, Y., et al. (2013). Paradoxical role of autophagy in the dysplastic and tumor-forming stages of hepatocarcinoma development in rats. Cell death and Dis. 4 (2), e501. doi:10.1038/cddis.2013.35
Sun, K., Xu, L., Jing, Y., Han, Z., Chen, X., Cai, C., et al. (2017). Autophagy-deficient Kupffer cells promote tumorigenesis by enhancing mtROS-NF-κB-IL1α/β-dependent inflammation and fibrosis during the preneoplastic stage of hepatocarcinogenesis. Cancer Lett. 388, 198–207. doi:10.1016/j.canlet.2016.12.004
Sun, M., Tan, L., and Hu, M. (2021). The role of autophagy in hepatic fibrosis. Am. J. Transl. Res. 13 (6), 5747–5757.
Sung, H., Ferlay, J., Siegel, R. L., Laversanne, M., Soerjomataram, I., Jemal, A., et al. (2021). Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA a cancer J. Clin. 71 (3), 209–249. doi:10.3322/caac.21660
Takamura, A., Komatsu, M., Hara, T., Sakamoto, A., Kishi, C., Waguri, S., et al. (2011). Autophagy-deficient mice develop multiple liver tumors. Genes and Dev. 25 (8), 795–800. doi:10.1101/gad.2016211
Tan, C., Cao, J., Chen, L., Xi, X., Wang, S., Zhu, Y., et al. (2019). Noncoding RNAs serve as diagnosis and prognosis biomarkers for hepatocellular carcinoma. Clin. Chem. 65 (7), 905–915. doi:10.1373/clinchem.2018.301150
Tan, J., Li, P., Xue, H., and Li, Q. (2020). Cyanidin-3-glucoside prevents hydrogen peroxide (H 2 O 2)-induced oxidative damage in HepG2 cells. Biotechnol. Lett. 42, 2453–2466. doi:10.1007/s10529-020-02982-2
Tanaka, T., and Sugie, S. (2007). Inhibition of colon carcinogenesis by dietary non-nutritive compounds. J. Toxicol. pathology 20 (4), 215–235. doi:10.1293/tox.20.215
Tunissiolli, N. M., Castanhole-Nunes, M. M., Biselli-Chicote, P. M., Pavarino, É. C., da Silva, R. F., Goloni-Bertollo, E. M., et al. (2017). Hepatocellular carcinoma: a comprehensive review of biomarkers, clinical aspects, and therapy. Asian Pac. J. cancer Prev. APJCP 18 (4), 863–872. doi:10.22034/APJCP.2017.18.4.863
Unfried, J. P., Marín-Baquero, M., Rivera-Calzada, Á., Razquin, N., Martín-Cuevas, E. M., de Bragança, S., et al. (2021). Long noncoding RNA NIHCOLE promotes ligation efficiency of DNA double-strand breaks in hepatocellular carcinoma. Cancer Res. 81 (19), 4910–4925. doi:10.1158/0008-5472.CAN-21-0463
Vadapalli, S., Abdelhalim, H., Zeeshan, S., and Ahmed, Z. (2022). Artificial intelligence and machine learning approaches using gene expression and variant data for personalized medicine. Briefings Bioinforma. 23 (5), bbac191. doi:10.1093/bib/bbac191
Wang, S., Huang, Y., Luo, G., Yang, X., and Huang, W. (2021). Cyanidin-3-O-glucoside attenuates high glucose–induced podocyte dysfunction by inhibiting apoptosis and promoting autophagy via activation of SIRT1/AMPK pathway. Can. J. Physiology Pharmacol. 99 (6), 589–598. doi:10.1139/cjpp-2020-0341
Wang, Y., Yu, H., Zhang, J., Gao, J., Ge, X., and Lou, G. (2015). Hesperidin inhibits HeLa cell proliferation through apoptosis mediated by endoplasmic reticulum stress pathways and cell cycle arrest. BMC cancer 15, 682–691. doi:10.1186/s12885-015-1706-y
Wu, S., Hu, Y., Bai, W., Zhao, J., Huang, C., Wen, C., et al. (2019). Cyanidin-3-o-glucoside inhibits UVA-induced human dermal fibroblast injury by upregulating autophagy. Photodermatol. Photoimmunol. and Photomed. 35 (5), 360–368. doi:10.1111/phpp.12493
Xia, R., Sheng, X., Xu, X., Yu, C., and Lu, H. (2018). Hesperidin induces apoptosis and G0/G1 arrest in human non-small cell lung cancer A549 cells. Int. J. Mol. Med. 41 (1), 464–472. doi:10.3892/ijmm.2017.3250
Xing, M., Wang, X., Kirken, R. A., He, L., and Zhang, J. Y. (2021). Immunodiagnostic biomarkers for hepatocellular carcinoma (HCC): the first step in detection and treatment. Int. J. Mol. Sci. 22 (11), 6139. doi:10.3390/ijms22116139
Yamanishi, Y., Pauwels, E., and Kotera, M. (2012). Drug side-effect prediction based on the integration of chemical and biological spaces. J. Chem. Inf. Model. 52 (12), 3284–3292. doi:10.1021/ci2005548
Ye, J., Coulouris, G., Zaretskaya, I., Cutcutache, I., Rozen, S., and Madden, T. L. (2012). Primer-BLAST: a tool to design target-specific primers for polymerase chain reaction. BMC Bioinforma. 13, 134–141. doi:10.1186/1471-2105-13-134
Yin, H., Wang, L., Wu, M., Liu, Y., Li, N., and Chen, T. (2019). Cyanidin-3-O-glucoside chloride acts synergistically with luteolin to inhibit the growth of colon and breast carcinoma cells. Die Pharmazie-An Int. J. Pharm. Sci. 74 (1), 54–61. doi:10.1691/ph.2019.8746
Yu, L. X., Ling, Y., and Wang, H. Y. (2018). Role of nonresolving inflammation in hepatocellular carcinoma development and progression. NPJ Precis. Oncol. 2 (1), 6. doi:10.1038/s41698-018-0048-z
Yu, S., Wang, Y., Jing, L., Claret, F. X., Li, Q., Tian, T., et al. (2017). Autophagy in the “inflammation-carcinogenesis” pathway of liver and HCC immunotherapy. Cancer Lett. 411, 82–89. doi:10.1016/j.canlet.2017.09.049
Yumnam, S., Park, H. S., Kim, M. K., Nagappan, A., Hong, G. E., Lee, H. J., et al. (2014). Hesperidin induces paraptosis like cell death in hepatoblastoma, HepG2 Cells: involvement of ERK1/2 MAPK. PloS one 9 (6), e101321. doi:10.1371/journal.pone.0101321
Zabady, S., Mahran, N., Soltan, M. A., Alaa Eldeen, M., Eid, R. A., Albogami, S., et al. (2022). Cyanidin-3-Glucoside modulates hsa_circ_0001345/miRNA106b/ATG16L1 Axis expression as a potential protective mechanism against hepatocellular carcinoma. Curr. Issues Mol. Biol. 44 (4), 1677–1687. doi:10.3390/cimb44040115
Zaghloul, R. A., Elsherbiny, N. M., Kenawy, H. I., El-Karef, A., Eissa, L. A., and El-Shishtawy, M. M. (2017). Hepatoprotective effect of hesperidin in hepatocellular carcinoma: involvement of Wnt signaling pathways. Life Sci. 185, 114–125. doi:10.1016/j.lfs.2017.07.026
Zeng, Q., Klein, C., Caruso, S., Maille, P., Laleh, N. G., Sommacale, D., et al. (2022). Artificial intelligence predicts immune and inflammatory gene signatures directly from hepatocellular carcinoma histology. J. Hepatology 77 (1), 116–127. doi:10.1016/j.jhep.2022.01.018
Zhang, B., Ling, T., Zhaxi, P., Cao, Y., Qian, L., Zhao, D., et al. (2019). Proton pump inhibitor pantoprazole inhibits gastric cancer metastasis via suppression of telomerase reverse transcriptase gene expression. Cancer Lett. 452, 23–30. doi:10.1016/j.canlet.2019.03.029
Zhang, G., Zeng, X., Zhang, R., Liu, J., Zhang, W., Zhao, Y., et al. (2016). Dioscin suppresses hepatocellular carcinoma tumor growth by inducing apoptosis and regulation of TP53, BAX, BCL2 and cleaved CASP3. Phytomedicine 23 (12), 1329–1336. doi:10.1016/j.phymed.2016.07.003
Zhang, H. Y., Zhu, J. J., Liu, Z. M., Zhang, Y. X., Chen, J. J., and Chen, K. D. (2023). A prognostic four-gene signature and a therapeutic strategy for hepatocellular carcinoma: construction and analysis of a circRNA-mediated competing endogenous RNA network. Hepatobiliary and Pancreat. Dis. Int. 23, 272–287. doi:10.1016/j.hbpd.2023.06.009
Zhang, J., Gao, J., Hu, M., Xu, S., Cheng, C., Zheng, W., et al. (2023). Integrated investigation of the clinical implications and targeted landscape for RNA methylation modifications in hepatocellular carcinoma. Eur. J. Med. Res. 28 (1), 46–55. doi:10.1186/s40001-023-01016-7
Zhang, J., Song, Q., Wu, M., and Zheng, W. (2021). The emerging roles of exosomes in the chemoresistance of hepatocellular carcinoma. Curr. Med. Chem. 28 (1), 93–109. doi:10.2174/0929867327666200130103206
Zhang, Z. Y., Lu, M., Liu, Z. K., Li, H., Yong, Y. L., Zhang, R. Y., et al. (2020). Rab11a regulates MMP2 expression by activating the PI3K/AKT pathway in human hepatocellular carcinoma cells. Pathology-Research Pract. 216 (9), 153046. doi:10.1016/j.prp.2020.153046
Keywords: hepatocellular carcinoma, drug response, predictive biomarkers, machine learning, rats
Citation: Matboli M, Al-Amodi HS, Khaled A, Khaled R, Ali M, Kamel HFM, Hamid MSAE, ELsawi HA, Habib EK and Youssef I (2024) Integrating molecular, biochemical, and immunohistochemical features as predictors of hepatocellular carcinoma drug response using machine-learning algorithms. Front. Mol. Biosci. 11:1430794. doi: 10.3389/fmolb.2024.1430794
Received: 10 May 2024; Accepted: 27 September 2024;
Published: 16 October 2024.
Edited by:
Mar Requena Mullor, University of Almeria, SpainReviewed by:
Nahid Arghiani, University of Massachusetts Medical School, United StatesSherif Shawky, Zewail City of Science and Technology, Egypt
Reda Elbadawy, Benha University, Egypt
Copyright © 2024 Matboli, Al-Amodi, Khaled, Khaled, Ali, Kamel, Hamid, ELsawi, Habib and Youssef. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Marwa Matboli, drmarwa_matboly@med.asu.edu.eg; Ibrahim Youssef, ibrahim.youssef@eng1.cu.edu.eg