Skip to main content

ORIGINAL RESEARCH article

Front. Genet., 09 December 2022
Sec. Cancer Genetics and Oncogenomics

The molecular basis of the associations between non-alcoholic fatty liver disease and colorectal cancer

  • 1Department of Gastroenterology, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China
  • 2Department of Rheumatology, The Second Affiliated Hospital of Fujian Medical University, Quanzhou, China

Background: Given the ongoing research on non-alcoholic fatty liver disease (NAFLD) and colorectal cancer (CRC), the number of studies suggesting a strong link between NAFLD and CRC is on the rise, while its underlying pathological mechanisms remain uncertain. This study aims to explore the shared genes and mechanisms and to reveal the molecular basis of the association between CRC and NAFLD through bioinformatics approaches.

Methods: The Gene Expression Omnibus (GEO) dataset GSE89632 is downloaded for NAFLD cases and healthy controls. Additionally, the GSE4107 and GSE9348 datasets are obtained for CRC cases and healthy controls. Differentially expressed genes (DEGs) are obtained for NAFLD and CRC datasets, as well as shared genes between the two disorders. GO and KEGG enrichment analyses are further conducted. Subsequently, the STRING database and Cytoscape software are utilized to establish the PPI network and identify the hub genes. Then, co-expression analysis is performed using GeneMANIA. Subsequently, ROC curves and external datasets validation were applied to further screen the candidate markers. Finally, NetworkAnalyst is available as a means to construct a miRNA-gene regulatory network.

Results: Under the threshold of FDR ≤ 0.01, 147 common genes are obtained in NAFLD and CRC. Categorization of GO functions shows that DEGs are predominantly enriched in “response to organic substance”, “cellular response to chemical stimulus”, and “response to external stimulus”. The predominant KEGG pathways in DEGs are the “IL-17 signaling pathway”, the “TNF signaling pathway”, “Viral protein interaction with cytokine and cytokine receptor”, “Cytokine-cytokine receptor interaction”, and the “Toll-like receptor signaling pathway”. Additionally, MYC, IL1B, FOS, CXCL8, PTGS2, MMP9, JUN, and IL6 are identified as hub genes by the evaluation of 7 algorithms. With the construction of miRNA-gene networks, 2 miRNAs, including miR-106a-5p, and miR-204-5p are predicted to be potential key miRNAs.

Conclusion: This study identifies possible hub genes acting in the co-morbidity of NAFLD and CRC and discovers the interaction of miRNAs and hub genes, providing a novel understanding of the molecular basis for the relevance of CRC and NAFLD, thus contributing to the development of new therapeutic strategies to combat NAFLD and CRC.

Introduction

Non-alcoholic fatty liver disease (NAFLD), or more precisely, metabolism-related fatty liver disease, is a chronic and increasingly prevalent liver disease associated with insulin resistance (IR), obesity, hypertension, hyperlipidemia, and type 2 diabetes (T2DM) (Younossi, 2019). NAFLD, which affects one in four adults, is an umbrella term that encompasses a heterogeneous spectrum of diseases that initially evolves from hepatic steatosis leading to non-alcoholic steatohepatitis (NASH), cirrhosis, and ultimately hepatocellular carcinoma (HCC) (Younossi, 2019). NAFLD imposes substantial morbidity and mortality on those affected, with cardiovascular disease (CVD) being the most common contributor to death, followed by liver-related complications and extrahepatic malignancies (Angulo et al., 2015). As a clinicopathological syndrome that is challenging to diagnose and treat, NAFLD is a critical threat to global health and the economy.

Colorectal cancer (CRC) is a widespread malignancy that scares everyone. With the third highest incidence rate, CRC is the second leading cause of cancer-related deaths as per the Global Cancer Statistics 2020 (GLOBOCAN 2020) (Sung et al., 2021). CRC is not easily detectable in its early stages and, more frighteningly, once it progresses to advanced stages, the five-year survival rate is less than 10% (Parizadeh et al., 2019). Such high incidence, high mortality rate, and poor prognosis impose a heavy health and economic toll on families and society. This highlights the importance of identifying at-risk populations. Limiting the evolution of NAFLD may be a sensible method to prevent CRC.

Nowadays, the correlation between NAFLD and CRC has been confirmed by several studies. Risk factors for NAFLD, such as hyperlipidemia, obesity, and diabetes, have been also reported to be the predictable risk of CRC (Jimba et al., 2005). Individuals with NAFLD have a higher opportunity to suffer from CRC and a worse prognosis (Wu et al., 2019). Evidence from a meta-analysis suggests that both the existence and severity of NAFLD may be closely linked to an elevated risk of suffering from cancer (Chen et al., 2019). Similarly, a large cross-sectional study in Korea noted an elevated risk of CRC in patients with NAFLD who possessed higher non-invasive fibrosis scores (Ahn et al., 2017). To date, although a growing number of studies have verified the strong correlation between NAFLD and CRC, related pathogenetic and genetic studies are still limited. Further exploration of CRC in the setting of NAFLD, with potential therapeutic options, is needed.

Bioinformatics, as a discipline that can be applied to help people analyze the complex mechanisms of disease, has shown great ability in the study of a wide variety of diseases, including the identification of novel cancers (Yan et al., 2018; HA et al., 2020) and the prediction of novel biomarkers for diseases (Wang and Liotta, 2011; Tsai, 2019). Driven by the Gene Expression Omnibus (GEO) database, an accessible database offering genetic information that can be utilized for bioinformatic analysis to define new disease targets, a gene expression profile for NAFLD, and two gene expression profiles for CRC are downloaded. Differentially expressed genes (DEGs), as well as shared genes between NAFLD and CRC are ascertained following analysis. Clustering analysis via Gene Ontology (GO) terminology enrichment, Kyoto Gene and Genome Encyclopedia (KEGG) is performed to ascertain the functions and pathways of genes. Protein-protein interaction (PPI) network is applied to analyze the shared gene interactions and locate hub genes. Eventually, a predictive miRNA-gene regulatory network is constructed using NetworkAnalyst to predict key miRNAs associated with DEGs. This study aims to explore the pathological mechanisms and potential therapeutic targets of CRC in the context of NAFLD.

Materials and methods

Microarray data collection

From the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo) (Edgar et al., 2002), microarray datasets for NAFLD and CRC are downloaded. The GSE89632 dataset (Arendt et al., 2015) contains RNA data from liver biopsies of patients with NAFLD (39 samples) and normal controls (24 samples). The GSE4107 dataset (Hong et al., 2007) contains colonic mucosal sample data from 12 CRC patients and 10 healthy controls. Information on colonic mucosal samples of 70 CRC patients and 12 non-CRC controls is in GSE9348 (Hong et al., 2010). Since these gene expression profiles are extracted from free and open databases available on the internet, Ethics Committee approval is not required for our study. The design block diagram of this study is shown in Figure 1.

FIGURE 1
www.frontiersin.org

FIGURE 1. The specific flow map of this study.

Differentially expressed genes selection

The two datasets GSE4107 and GSE9348 are normalized by log2-transformed [log2 (x + 1)] and removed from the batch effect for further analysis. The “limma” package from R software is used to identify the differentially expressed genes (DEGs) in the NAFLD and CRC datasets respectively. Probes without matching gene symbols and genes presenting multiple probes are excluded. For the purpose of reducing the false positive rate, the adjusted p-value (FDR) is taken. The selection criteria is set to FDR≤0.01 and |log2 FC|≥0.8. The screened DEGs for both diseases are put into the online analysis tool Venn (http://bioinformatics.psb.ugent.be/webtools/Venn/) to obtain intersecting genes. The shared genes are screened out for the next step of the analysis.

Functional classification and pathway enrichment of differentially expressed genes

Gene ontology (GO) (Pomaznoy et al., 2018) and Kyoto Encyclopedia of Genes and Genomes (KEGG) (Kanehisa et al., 2017)Pathway analyses are performed using the analysis website Metascape (https://metascape.org/gp/index.html#/main/step1) (Zhou et al., 2019) to explore DEGs’ potential biological functions both in NAFLD and CRC. Values of p < 0.01 is assumed to be statistically significant.

Establishment of protein-protein interaction and identification of hub genes

The PPI network is constructed through STRING (http://string-db.org/) (Franceschini et al., 2013) to further investigate the interactions between the shared genes which are obtained above. Interaction scores over 0.4 as significant are considered. Following this, it is visualized through the application of Cytoscape software to the PPI network (Franceschini et al., 2013). And then, the Cytoscape plugin, MCODE, is performed to filter out pivotal protein expression molecules. Set the selection criteria as: degree cut-off = 2, node score cut-off = 0.2, and max depth = 100. Subsequently, the cytoHubba plugin is applied for evaluation, and genes with at least 2 recurrences according to 7 algorithms (MCC, MNC, EPC, Degree, BottleNeck, Closeness, and Radiality) are taken as hub genes. Finally, GeneMANIA (http://www.genemania.org/) (Warde-Farley et al., 2010) is utilized to identify intra-gene associations and construct co-expression networks.

Evaluation and verification of hub genes

The sensitivity and specificity of these hub genes in predicting NAFLD and CRC disease are assessed by the receiver operating characteristic (ROC) curve and the area under the ROC curve in NAFLD and CRC datasets. Additional publicly available datasets GSE163211 and GSE184093 are used to further validate the expression levels of these hub genes in both diseases. The two-sample t-test was used to compare the differential expression levels, and p-values <0.05 were regarded as statistically significant. The box plots were used to represent the expression levels of hub genes in different groups by the R package ggplot2. Statistical significance was indicated as follows: *(p < 0.05), **(p < 0.01), and ns (p > 0.05, no significance).

MiRNAs associated with hub genes

A miRNA-gene regulatory network of hub genes is to be built with the NetworkAnalyst tool (version 3.0, https://www.networkanalyst.ca/), followed by the application of Cytoscape software for collation and visualization.

Results

Differential expression analysis of differentially expressed genes in non-alcoholic fatty liver disease and colorectal cancer

One dataset containing NAFLD gene information (GSE89632) and two datasets containing CRC gene information (GSE4107, GSE9348) are retrieved from the NCBI GEO database. Details about the three datasets are shown in Table 1. After preprocessing, we obtained 686 DEGs in NAFLD group and 2,157 DEGs in CRC samples. The DEGs are visualized with volcanos (Figures 2A, 3A) and heat maps (Figures 2B, 3B). Further screening of these genes is performed and Venn diagrams are plotted. 147 intersecting genes common to both diseases are obtained, as shown in Figure 4.

TABLE 1
www.frontiersin.org

TABLE 1. Information on the datasets included in the current study.

FIGURE 2
www.frontiersin.org

FIGURE 2. Differentially expressed genes (DEGs) for the GSE89632 dataset. (A), volcano plot depicting the status of gene expression between patients with NAFLD and normal controls. (B), heatmap of the top 40 DEGs. Regions in red and blue are displayed to distinguish between the up- and down-regulation of genes.

FIGURE 3
www.frontiersin.org

FIGURE 3. Differentially expressed genes (DEGs) for the dataset derived by integrating GSE4107 and GSE9348. Panel (A), volcano plot depicting the status of gene expression between patients with CRC and normal controls. Panel (B), heatmap of the top 40 DEGs. Regions in red and blue are displayed to distinguish between the up- and down-regulation of genes.

FIGURE 4
www.frontiersin.org

FIGURE 4. Venn diagram depicting shared genes between NAFLD and CRC.

GO and KEGG enrichment pathway analysis of overlapping differentially expressed genes

The occurrence of diseases usually relates to multiple biological functions. GO analysis shows that the overlapping genes are mainly enriched in “response to organic substance”, “cellular response to chemical stimulus”, and “response to external stimulus” (Figure 5A). The predominant KEGG pathways in DEGs are the “IL-17 signaling pathway”, the “TNF signaling pathway”, “Viral protein interaction with cytokine and cytokine receptor”, “Cytokine-cytokine receptor interaction”, and the “Toll-like receptor signaling pathway” (Figure 5B). The results indicate an important role of inflammation in these two diseases.

FIGURE 5
www.frontiersin.org

FIGURE 5. Analysis of functional enrichment for intersecting common genes. (A), bubble diagram of GO analyses for common genes. (B), bubble diagram of KEGG analyses for common genes. The number of genes is indicated by the size of the circle, whereas the p-value is shown in gradient color.

Establishment of protein-protein interaction and identification of hub genes

Primarily, PPI networks are conducted to determine the interactions among DEGs via the online Search Tool for the Retrieval of Interacting Genes (STRING). Then, visual analysis is performed using Cytoscape (Figure 6A). Based on Cytoscape’s plugin MCODE, The most tightly attached region (22 nodes, 161 edges) of the PPI network is identified (Figure 6B). Table 2 shows the ranking of the top 9 hub genes in the network according to 7 algorithms of cytoHubba. After taking the intersection of the Venn diagrams, the top 8 genes are determined as potential hub genes, including IL6 (Interleukin-6), JUN (Transcription factor AP-1), PTGS2 (Prostaglandin G/H synthase 2), MMP9 (Matrix metalloproteinase-9), CXCL8 (Interleukin-8), FOS (Proto-oncogene c-Fos), IL1B (Interleukin-1 beta), and MYC (Myc proto-oncogene protein) (Figures 6C, 7A). GO analysis indicates that the 8 hub genes are mainly enriched in “neuroinflammatory response”, “cytokine-mediated signaling pathway”, “inflammatory response”, “regulation of neuroinflammatory response”, “positive regulation of cell migration”, and “cellular response to cytokine stimulus” (Figure 8A). In the KEGG analysis, the mainly enriched pathways are the “IL-17 signaling pathway”, the “TNF signaling pathway”, “Kaposi sarcoma-associated herpesvirus infection”, “Hepatitis B”, and the “Toll-like receptor signaling pathway” (Figure 8B). Majority of hub genes are involved in the IL-17 and TNF signaling pathways, as well as inflammatory responses. The details of each hub gene and its enrichment pathways are depicted in Figures 8C,D.

FIGURE 6
www.frontiersin.org

FIGURE 6. Protein-protein interaction (PPI) networks. (A), PPI of 147 shared genes between NAFLD and CRC identified. (B), the regions most densely connected as identified by the MCODE plugin. (C), 8 hub genes were ascertained with 7 different algorithms of cytoHubba. Corresponding to the nodes, the size and color of each gene is based on the degree of interaction.

TABLE 2
www.frontiersin.org

TABLE 2. List of the top 9 genes ranked by 7 algorithms in cytoHubba.

FIGURE 7
www.frontiersin.org

FIGURE 7. Venn diagram and co-expression network. (A), Venn diagram illustrates the situation of 7 algorithms to identify hub genes. Each algorithm selects the top 9 ranked genes. (B), Co-expression networks of hub genes are constructed by GeneMANIA. The networks are represented by different color blocks inside the circles and the functions are represented by various line colors.

FIGURE 8
www.frontiersin.org

FIGURE 8. Functional enrichment analysis. (A), bubble diagram that shows GO analysis of 8 hub genes. (B), bubble diagram that shows the KEGG analysis of 8 hub genes. (C), circle diagram to display the GO terms exactly enriched for each of the 8 hub genes. (D), circle diagram to display the KEGG pathways exactly enriched for each of the 8 hub genes. Distinguish different enrichment terms with various colors.

Evaluation and verification of hub genes

To evaluate whether the 8 hub genes are of predictive significance, ROC curve analysis is performed on the NAFLD and CRC datasets. It is shown that the AUCs of all hub genes are greater than 0.7 (Figures 9A,B). Microarray datasets retrieved from the GEO database for NAFLD and CRC (GSE163211: 242 NAFLD samples vs. 76 healthy controls; GSE184093: 9 CRC samples vs. 9 healthy controls) are used to validate the expression levels of hub genes after normalization. In GSE163211, FOS and PTGS2 are not involved in the quantification of expression from NAFLD patients. The expression profiles of the other 6 hub genes in GSE163211 are shown in Figure 9C. Compared with the normal group, IL6, MMP9, CXCL8, and IL1B in GSE163211 are significantly increased. The expression profiles of 8 hub genes in CRC patients are validated by the GSE184093 dataset. It is observed that the expression levels of IL6, MMP9, PTGS2, CXCL8, IL1B, and MYC are significantly higher than those in the normal group (Figure 9D). Therefore, combining the expression levels of hub genes and ROC results, we conclude that IL6, CXCL8, MMP9 and IL1B may be the most potential candidate markers.

FIGURE 9
www.frontiersin.org

FIGURE 9. ROC curve and box plot of hub genes. (A), The AUC of hub genes for the diagnosis of NAFLD in the ROC analysis of the GSE89632 dataset. (B), The AUC of hub genes for the diagnosis of CRC in the ROC analysis of the dataset derived by integrating GSE4107 and GSE9348. (C), The expression of hub genes in NAFLD vs. Normal group in the GSE163211 dataset. (D), The expression of hub genes in CRC vs. Normal group in the GSE184093 dataset.

The miRNA-gene regulatory network

Prediction of target miRNAs for hub genes is performed via the NetworkAnalyst database. As shown in Figure 10, in the miRNA-gene regulatory network, miR-106a-5p interacts with IL6, CXCL8 and IL1B. In addition, miR-204-5p is a target miRNA for MMP9, CXCL8 and IL1B.

FIGURE 10
www.frontiersin.org

FIGURE 10. The top 4 hub genes in the integrated miRNA-gene network. The red circles indicate these hub genes. MiRNAs with linkage to more central genes are highlighted exclusively with larger orange diamonds, and the rest are indicated with smaller yellowish triangles.

Discussion

NAFLD has been characterized as a chronic and ongoing pathological process evolving from hepatic steatosis to marked liver damage, affecting not only the liver itself but also being a promoter of carcinogenesis in other distant organs (Yang et al., 2017). A cohort study involving 1314 CRC patients who underwent surgical resection first confirmed that NAFLD is probably an independent factor impacting the overall survival (OS) of CRC victims (You et al., 2015). With an increasingly widespread impact, NAFLD has also emerged as a risk factor for CRC, which carries the second-highest global cancer mortality rate. Therefore, it would be worthwhile to address the underlying molecular mechanisms between NAFLD and CRC as well as to identify early targets for preventing disease progression.

In this study, the GEO dataset for NAFLD and CRC is searched and 147 common DEGs are identified. Categorization of GO functions reveals that DEGs are mainly enriched in “response to organic substance”, “cellular response to chemical stimulus”, and “response to external stimulus”. DEGs are mainly enriched in the “IL-17 signaling pathway”, the “TNF signaling pathway”, “Viral protein interaction with cytokine and cytokine receptor”, “Cytokine-cytokine receptor interaction”, and the “Toll-like receptor signaling pathway” concerning the KEGG pathway. The 8 hub genes identified with the application of 7 algorithms include MYC, IL1B, FOS, CXCL8, PTGS2, MMP9, JUN, and IL6. GO analysis indicates that the 8 hub genes are mainly enriched in “neuroinflammatory response”, “cytokine-mediated signaling pathway”, “inflammatory response”, “regulation of neuroinflammatory response”, “positive regulation of cell migration”, and “cellular response to cytokine stimulus”. In the KEGG analysis, the mainly enriched pathways are the “IL-17 signaling pathway”, the “TNF signaling pathway”, “Kaposi sarcoma-associated herpesvirus infection”, “Hepatitis B″, and the “Toll-like receptor signaling pathway”. The above analysis reveals that inflammatory processes may play an important role in the co-pathogenesis of NAFLD and CRC. Combining the expression of hub genes in validation datasets and ROC curves, IL6, MMP9, CXCL8, and IL1B are further identified as potential genes for the development of CRC based on NAFLD. Persistent lipid deposition in NAFLD induces a sustained inflammatory response with infiltration of lymphocytes (Chakraborty and Wang, 2020), macrophages, and neutrophils in the liver, which may provide a beneficial microenvironment for the initiation and further development of the metastatic CRC. Lymphocytes and macrophages that infiltrate may secrete pro-inflammatory cytokines like TNF-α(25), which evokes epithelial-to-mesenchymal transition (EMT) (Bates and Mercurio, 2003) and contributes to the metastasis of CRC. Infiltrating neutrophils secrete IL-17, which increases the inflammatory response in the hepatic ductal tract (Gao and Tsukamoto. -, 2016). Interleukin 17 (IL-17) is considered to be a pro-inflammatory cytokine and has been demonstrated to promote the transition from simple steatosis to steatohepatitis (Tang et al., 2011). IL-17 is able to promote tumor angiogenesis (Numasaki et al., 2003) by inducing the expression of vascular endothelial growth factor (VEGF), which acts as a pathogenic role in CRC formation (Li et al., 2019a). Hepatic TLR signaling is activated in NAFLD, and TLR2, TLR4, TLR5, TLR9 are all implicated in the pathogenesis of NAFLD. Therefore, Kouichi Miura et al. suggested that TLRs are targets for NAFLD treatment (Miura and Ohnishi, 2014). Toll like receptor signaling pathway can influence CRC progression through multiple pathways and is critical in the treatment of CRC (Moradi-Marjaneh et al., 2018). These enrichment results for GO terms and the KEGG pathway indicate that hub genes found in our study might be participating in the disease progression of NAFLD and CRC by the aforementioned means.

MMP9 is one of the zinc-ion-dependent metalloproteinase Family. It was revealed that overexpression of MMP9 could accelerate CRC progression and metastasis through the MKK-3/p38/NF-κB pathway (Chen et al., 2020). A cross-sectional study found that lower MMP9 levels and higher IL6 levels were associated with worse fibrosis by measuring 12 serum markers in 105 adult patients with NAFLD (Goyale et al., 2021). Overexpression of IL6 favors the progression of liver inflammation and NAFLD (Fontes-Cal et al., 2021). IL-6 contributes to the progression of CRC via the activation of STAT3 and NF-kB (De Simone et al., 2015). CXCL8, also known as IL-8, is necessary for the linkage between tumors and inflammation. CXCL8 plays an essential role in promoting angiogenesis, proliferation, invasion, migration and epithelial-mesenchymal transition (EMT) of CRC cells (Bie et al., 2019). The serum levels of CXCL8 were significantly higher in NASH patients compared to healthy group (Bahcecioglu et al., 2005). It was confirmed in a study involving 604 patients with NAFLD that common variants of IL6 and IL1B may increase susceptibility for NASH (Nelson et al., 2016). Higher expression of IL1B was detected in the visceral adipose tissue of CRC patients than in healthy controls (Castellano-Castillo et al., 2018). IL6, MMP9, CXCL8 and IL1B are all involved in the IL17 signaling pathway and are inextricably linked to the inflammatory response. It is speculated in this study that IL6, MMP9, CXCL8 and IL1B may contribute to the occurrence and development of CRC and NAFLD, and may be potentially candidate markers. Inflammation may be the access pathway that connects the co-occurrence of NAFLD and cancer.

The miRNA-gene regulatory network is constructed in this study, then the top 2 miRNAs interacting with DEGs are selected (miR-106a-5p, and miR-204-5p). The aberrant miRNA expression is tightly associated with cancer development and progression. MiR-204-5p exerts a potent tumor suppressor function in colorectal cancer by repressing RAB22A (a member of the RAS oncogene family) (Yin et al., 2014). Whether miR-204-5p affects NAFLD has not been studied. Research confirms that miR-204-5p may inhibit inflammation and chemokine production by regulating the IL6/IL6R axis (Li et al., 2019b). The possibility that miR-204-5p can act on NAFLD through the inhibition of inflammatory factors still needs further exploration. MiR-106a-5p can exert an influence on NAFLD progression through the NF-κB signaling pathway (Li et al., 2021). It was found that miR-106a-5p expression was elevated in CRC tissues, and patients with high miR-106a-5p expression tended to have a poor prognosis (Yue et al., 2015). The miR-204-5p obtained in the miRNA-gene regulatory network inhibited CRC and inflammation, while the overexpression of miR-106a-5p worsened CRC and NAFLD. Therefore, it was suggested that miR-204-5p and miR-106a-5p might be used as targets against NAFLD and CRC. NAFLD is often accompanied by inflammatory environment, and overexpression of inflammatory and chemokines has a pro-tumor role in CRC development. The 4 candidate markers screened in this study were all enriched in inflammatory responses and inflammation-related signaling pathways such as IL17 signaling pathway. Elevated IL6, MMP9, CXCL8 and IL1B in NAFLD may act on the IL17 signaling pathway and miR-106a-5p to promote the development and progression of CRC.

In conclusion, the current study identifies 8 hub genes by bioinformatics analysis and constructs a miRNA-gene regulatory network that helps to understand the pathophysiological processes of CRC and NAFLD as well as provides the potential early diagnostic and therapeutic targets for CRC and NAFLD. Nevertheless, there are several limitations to our study. One limitation of the study is that only the possible hub genes and miRNAs that act together in both NAFLD and CRC diseases are analyzed by bioinformatics, but the specific mechanisms are still lacking and need to be further explored. In addition, the finding is only based on predictions derived from public databases. Therefore, further biological and clinical data are needed to clarify the mechanism of interaction between these hub genes and miRNAs between CRC and NAFLD diseases.

Conclusion

This study identifies a potential miRNA-gene regulatory network that may be relevant to CRC and NAFLD. 8 hub genes (including MYC, IL1B, FOS, CXCL8, PTGS2, MMP9, JUN, and IL6) are likely to exert an essential impact on the pathophysiological mechanisms of the two diseases. IL6, MMP9, CXCL8 and IL1B were further identified as candidate markers. Several possible target miRNAs (miR-106a-5p, and miR-204-5p) are also predicted. This study provides new insights into the occurrence of NAFLD and CRC at the molecular level, which may be helpful in the early prevention of CRC and the development of treatment strategies for NAFLD and CRC. It also reminds us that patients with NAFLD or those with corresponding risk factors should be screened or alerted to the occurrence of CRC.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Author contributions

TQ conceived and designed the study, performed the statistical analysis, and drafted the manuscript. WH revised the manuscript and provided suggestions. ZR helped to carry out the study. TF reviewed and edited the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was support by Quanzhou City Science & Technology Program of China (No. 2022NS080).

Acknowledgments

The authors thank all the people in our laboratory for their discussions and suggestions.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Ahn, J. S., Sinn, D. H., Min, Y. W., Hong, S. N., Kim, H. S., Jung, S. H., et al. (2017). - non-alcoholic fatty liver diseases and risk of colorectal neoplasia. Aliment. Pharmacol. Ther. 45 (2), 345–353. doi:10.1111/apt.13866

PubMed Abstract | CrossRef Full Text | Google Scholar

Angulo, P., De Kleiner, , Dam-Larsen, S., Adams, L. A., Bjornsson, E. S., Charatcharoenwitthaya, P., et al. (2015). Liver fibrosis, but No other histologic features, is associated with long-term outcomes of patients with nonalcoholic fatty liver disease. Gastroenterology 149 (2), 389–397. doi:10.1053/j.gastro.2015.04.043

PubMed Abstract | CrossRef Full Text | Google Scholar

Arendt, B. M., Comelli, E. M., Ma, D. W., Lou, W., Teterina, A., Kim, T., et al. (2015). Altered hepatic gene expression in nonalcoholic fatty liver disease is associated with lower hepatic n-3 and n-6 polyunsaturated fatty acids. Hepatology 61 (5), 1565–1578. doi:10.1002/hep.27695

PubMed Abstract | CrossRef Full Text | Google Scholar

Bahcecioglu, I. H., Yalniz, M., Ataseven, H., Ilhan, N., Ozercan, I. H., Seckin, D., et al. (2005). Levels of serum hyaluronic acid, tnf-alpha and il-8 in patients with nonalcoholic steatohepatitis. Hepatogastroenterology. 52 (65), 1549–1553. Epub 2005/10/06.

PubMed Abstract | Google Scholar

Bates, R. C., and Mercurio, A. M. (2003). Tumor necrosis factor-alpha stimulates the epithelial-to-mesenchymal transition of human colonic organoids. Mol. Biol. Cell 14 (5), 1790–1800. doi:10.1091/mbc.e02-09-0583

PubMed Abstract | CrossRef Full Text | Google Scholar

Bie, Y., Ge, W., Yang, Z., Cheng, X., Zhao, Z., Li, S., et al. (2019). The crucial role of Cxcl8 and its receptors in colorectal liver metastasis. Dis. Markers 2019, 8023460. Epub 2019/12/13. doi:10.1155/2019/8023460

PubMed Abstract | CrossRef Full Text | Google Scholar

Castellano-Castillo, D., Morcillo, S., Clemente-Postigo, M., Crujeiras, A. B., Fernandez-García, J. C., Torres, E., et al. (2018). Adipose tissue inflammation and vdr expression and methylation in colorectal cancer. Clin. Epigenetics 10, 60. Epub 2018/05/03. doi:10.1186/s13148-018-0493-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Chakraborty, D., and Wang, J. (2020). Nonalcoholic fatty liver disease and colorectal cancer: Correlation and missing links. Life Sci. 262, 118507. doi:10.1016/j.lfs.2020.118507

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, H., Ye, Y., Yang, Y., Zhong, M., Gu, L., Han, Z., et al. (2020). Tipe-mediated up-regulation of mmp-9 promotes colorectal cancer invasion and metastasis through mkk-3/P38/nf-?b pro-oncogenic signaling pathway. Signal Transduct. Target. Ther. 5 (1), 163. Epub 2020/08/28. doi:10.1038/s41392-020-00276-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Chen, J., Orcid, I., Bian, D., Zang, S., Yang, Z., Tian, G., et al. (2019). The association between nonalcoholic fatty liver disease and risk of colorectal adenoma and cancer incident and recurrence: A meta-analysis of observational studies. Expert Rev. Gastroenterol. Hepatol. 13 (4), 385–395. doi:10.1080/17474124.2019.1580143

PubMed Abstract | CrossRef Full Text | Google Scholar

De Simone, V., Franzè, E., Ronchetti, G., Colantoni, A., Fantini, M. C., Di Fusco, D., et al. (2015). Th17-Type cytokines, il-6 and tnf-Α synergistically activate Stat3 and nf-kb to promote colorectal cancer cell growth. Oncogene 34 (27), 3493–3503. Epub 2014/09/02. doi:10.1038/onc.2014.286

PubMed Abstract | CrossRef Full Text | Google Scholar

Edgar, R., Domrachev, M., and Lash, A. E. (2002). Gene expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 30 (1), 207–210. doi:10.1093/nar/30.1.207

PubMed Abstract | CrossRef Full Text | Google Scholar

Fontes-Cal, T. C. M., Mattos, R. T., Medeiros, N. I., Pinto, B. F., Belchior-Bezerra, M., Roque-Souza, B., et al. (2021). Crosstalk between plasma cytokines, inflammation, and liver damage as a new strategy to monitoring nafld progression. Front. Immunol. 12, 708959. Epub 2021/08/28. doi:10.3389/fimmu.2021.708959

PubMed Abstract | CrossRef Full Text | Google Scholar

Franceschini, A., Szklarczyk, D., Frankild, S., Kuhn, M., Simonovic, M., Roth, A., et al. (2013). STRING v9.1: Protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res. 41, D808–D815. doi:10.1093/nar/gks1094

PubMed Abstract | CrossRef Full Text | Google Scholar

Gao, B., and Tsukamoto. -, H. (2016). Inflammation in alcoholic and nonalcoholic fatty liver disease: Friend or foe? Gastroenterology 150 (8), 1704–1709. doi:10.1053/j.gastro.2016.01.025

PubMed Abstract | CrossRef Full Text | Google Scholar

Goyale, A., Jain, A., Smith, C., Papatheodoridi, M., Misas, M. G., Roccarina, D., et al. (2021). Assessment of non-alcoholic fatty liver disease (nafld) severity with novel serum-based markers: A pilot study. PLoS One 16 (11), e0260313. Epub 2021/11/24. doi:10.1371/journal.pone.0260313

PubMed Abstract | CrossRef Full Text | Google Scholar

Ha, Ç., Orcid, I., and Demir, M. (2020). - microrna and cardiovascular diseases. Balk. Med. J. 37 (2), 60–71. doi:10.4274/balkanmedj.galenos.2020.2020.1.94

CrossRef Full Text | Google Scholar

Hong, Y., Downey, T., Eu, K. W., Koh, P. K., and Cheah, P. Y. (2010). A 'metastasis-prone' signature for early-stage mismatch-repair proficient sporadic colorectal cancer patients and its implications for possible therapeutics. Clin. Exp. Metastasis 27 (2), 83–90. doi:10.1007/s10585-010-9305-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Hong, Y., Ho, K. S., Eu, K. W., and Cheah, P. Y. (2007). A susceptibility gene set for early onset colorectal cancer that integrates diverse signaling pathways: Implication for tumorigenesis. Clin. Cancer Res. 13 (4), 1107–1114. doi:10.1158/1078-0432.CCR-06-1633

PubMed Abstract | CrossRef Full Text | Google Scholar

Jimba, S., Nakagami, T., Takahashi, M., Wakamatsu, T., Hirota, Y., Iwamoto, Y., et al. (2005). Prevalence of non-alcoholic fatty liver disease and its association with impaired glucose metabolism in Japanese adults. Diabet. Med. 22 (9), 1141–1145. doi:10.1111/j.1464-5491.2005.01582.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y., and Morishima, K. (2017). Kegg: New perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45 (D1), D353–D361. doi:10.1093/nar/gkw1092

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, H., Wang, J., Liu, X., and Cheng, Q. (2019). Microrna-204-5p suppresses il6-mediated inflammatory response and chemokine generation in hk-2 renal tubular epithelial cells by targeting Il6r. Biochem. Cell Biol. = Biochimie Biol. Cell. 97 (2), 109–117. Epub 2018/08/16. doi:10.1139/bcb-2018-0141

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, J., Qi, J., Tang, Y., Liu, H., Zhou, K., Dai, Z., et al. (2021). A nanodrug system overexpressed Circrna_0001805 alleviates nonalcoholic fatty liver disease via mir-106a-5p/mir-320a and abca1/cpt1 Axis. J. Nanobiotechnology 19 (1), 363. Epub 2021/11/19. doi:10.1186/s12951-021-01108-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, X., Orcid, I., Bechara, R., Zhao, J., McGeachy, M. J., Orcid, I., et al. (2019). - il-17 receptor-based signaling and implications for disease. Nat. Immunol. 20 (12), 1594–1602. doi:10.1038/s41590-019-0514-y

PubMed Abstract | CrossRef Full Text | Google Scholar

Miura, K., and Ohnishi, H. (2014). Role of gut microbiota and toll-like receptors in nonalcoholic fatty liver disease. World J. Gastroenterol. 20 (23), 7381–7391. Epub 2014/06/27. doi:10.3748/wjg.v20.i23.7381

PubMed Abstract | CrossRef Full Text | Google Scholar

Moradi-Marjaneh, R., Hassanian, S. M., Fiuji, H., Soleimanpour, S., Ferns, G. A., Avan, A., et al. (2018). Toll like receptor signaling pathway as a potential therapeutic target in colorectal cancer. J. Cell. Physiol. 233 (8), 5613–5622. Epub 2017/11/19. doi:10.1002/jcp.26273

PubMed Abstract | CrossRef Full Text | Google Scholar

Nelson, J. E., Handa, P., Aouizerat, B., Wilson, L., Vemulakonda, L. A., Yeh, M. M., et al. (2016). Increased parenchymal damage and steatohepatitis in caucasian non-alcoholic fatty liver disease patients with common Il1b and Il6 polymorphisms. Aliment. Pharmacol. Ther. 44 (11-12), 1253–1264. Epub 2016/10/13. doi:10.1111/apt.13824

PubMed Abstract | CrossRef Full Text | Google Scholar

Numasaki, M., Fukushi, J., Ono, M., Narula, S. K., Zavodny, P. J., Kudo, T., et al. (2003). - interleukin-17 promotes angiogenesis and tumor growth. Blood 101 (7), 2620–2627. doi:10.1182/blood-2002-05-1461

PubMed Abstract | CrossRef Full Text | Google Scholar

Parizadeh, S. M., Orcid, I., Parizadeh, S. A., Alizade-Noghani, M., Jafarzadeh-Esfehani, R., Ghandehari, M., et al. (2019). Association between non-alcoholic fatty liver disease and colorectal cancer. Expert Rev. Gastroenterol. Hepatol. 13 (7), 633–641. doi:10.1080/17474124.2019.1617696

PubMed Abstract | CrossRef Full Text | Google Scholar

Pomaznoy, M., Id- Orcid, X., Ha, B., and Peters, B. (2018). Gonet: A tool for interactive gene ontology analysis. BMC Bioinforma. 19 (1), 470. doi:10.1186/s12859-018-2533-3

PubMed Abstract | CrossRef Full Text | Google Scholar

Sung, H., Orcid, I., Ferlay, J., Siegel, R. L., Orcid, I., Laversanne, M., et al. (2021). Global cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. Ca. A Cancer J. Clin. 71 (3), 209–249. doi:10.3322/caac.21660

PubMed Abstract | CrossRef Full Text | Google Scholar

Tang, Y., Bian, Z., Zhao, L., Liu, Y., Liang, S., Wang, Q., et al. (2011). Interleukin-17 exacerbates hepatic steatosis and inflammation in non-alcoholic fatty liver disease. Clin. Exp. Immunol. 166 (2), 281–290. doi:10.1111/j.1365-2249.2011.04471.x

PubMed Abstract | CrossRef Full Text | Google Scholar

Tsai, S. (2019). Molecular characteristics of biliary tract and primary liver tumors. Surg. Oncol. Clin. N. Am. 28 (4), 685–693. doi:10.1016/j.soc.2019.06.004

PubMed Abstract | CrossRef Full Text | Google Scholar

Wang, X., and Liotta, L. (2011). Clinical bioinformatics: A new emerging science. J. Clin. Bioinforma. 1 (1), 1–9113. doi:10.1186/2043-9113-1-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Warde-Farley, D., Donaldson, S. L., Comes, O., Zuberi, K., Badrawi, R., Chao, P., et al. (2010). The GeneMANIA prediction server: Biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 38, W214–W220. doi:10.1093/nar/gkq537

PubMed Abstract | CrossRef Full Text | Google Scholar

Wu, K., Orcid, I., Zhai, M. Z., Weltzien, E. K., Feliciano, E. M. C., Meyerhardt, J. A., et al. (2019). Non-alcoholic fatty liver disease and colorectal cancer survival. Cancer Causes Control 30 (2), 165–168. doi:10.1007/s10552-018-1095-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Yan, P., He, Y., Xie, K., Kong, S., and Zhao. -, W. (2018). In silico analyses for potential key genes associated with gastric cancer. PeerJ 7 (6), e6092. doi:10.7717/peerj.6092

PubMed Abstract | CrossRef Full Text | Google Scholar

Yang, Y. J., Bang, C. S., Orcid, I., Shin, S. P., and Baik, G. H. (2017). Clinical impact of non-alcoholic fatty liver disease on the occurrence of colorectal neoplasm: Propensity score matching analysis. PLoS One 12 (8), 0182014. doi:10.1371/journal.pone.0182014

PubMed Abstract | CrossRef Full Text | Google Scholar

Yin, Y., Zhang, B., Wang, W., Fei, B., Quan, C., Zhang, J., et al. (2014). Mir-204-5p inhibits proliferation and invasion and enhances chemotherapeutic sensitivity of colorectal cancer cells by downregulating Rab22a. Clin. Cancer Res. 20 (23), 6187–6199. Epub 2014/10/09. doi:10.1158/1078-0432.Ccr-14-1030

PubMed Abstract | CrossRef Full Text | Google Scholar

You, J., Huang, S., Huang, G. Q., Zhu, G. Q., Ma, R. M., Liu, W. Y., et al. (2015). Nonalcoholic fatty liver disease: A negative risk factor for colorectal cancer prognosis. Medicine 94 (5), 0000000000000479. doi:10.1097/MD.0000000000000479

PubMed Abstract | CrossRef Full Text | Google Scholar

Younossi, Z. M. (2019). Non-alcoholic fatty liver disease - a global public health perspective. J. Hepatol. 70 (3), 531–544. Epub 2018/11/12. doi:10.1016/j.jhep.2018.10.033

PubMed Abstract | CrossRef Full Text | Google Scholar

Yue, B., Sun, B., Liu, C., Zhao, S., Zhang, D., Yu, F., et al. (2015). Long non-coding rna fer-1-like protein 4 suppresses oncogenesis and exhibits prognostic value by associating with mir-106a-5p in colon cancer. Cancer Sci. 106 (10), 1323–1332. Epub 2015/08/01. doi:10.1111/cas.12759

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhou, Y., Zhou, B., Pache, L., Chang, M., Id- Orcid, X., Khodabakhshi, A. H., et al. (2019). Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat. Commun. 10 (1), 1523. doi:10.1038/s41467-019-09234-6

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: non-alcoholic fatty liver disease (NAFLD), colorectal cancer (CRC), differentially expressed genes (DEGs), bioinformatics analysis, miRNA-gene regulatory network

Citation: Qiu T, Hu W, Rao Z and Fang T (2022) The molecular basis of the associations between non-alcoholic fatty liver disease and colorectal cancer. Front. Genet. 13:1007337. doi: 10.3389/fgene.2022.1007337

Received: 30 July 2022; Accepted: 28 November 2022;
Published: 09 December 2022.

Edited by:

Samrat Roy Choudhury, University of Arkansas for Medical Sciences, United States

Reviewed by:

Meiyi Li, The Chinese University of Hong Kong, China
Olfat Ali Hammam, Theodor Bilharz Research Institute, Egypt

Copyright © 2022 Qiu, Hu, Rao and Fang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Taiyong Fang, fangtaiyong@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.