Gene-Microbiome Co-expression Networks in Colon Cancer

Uriarte-Navarrete, Irving; Hernández-Lemus, Enrique; de Anda-Jáuregui, Guillermo

doi:10.3389/fgene.2021.617505

ORIGINAL RESEARCH article

Front. Genet., 15 February 2021

Sec. Systems Biology Archive

Volume 12 - 2021 | https://doi.org/10.3389/fgene.2021.617505

This article is part of the Research TopicApplications and Methods in Genomic NetworksView all 18 articles

Gene-Microbiome Co-expression Networks in Colon Cancer

Irving Uriarte-Navarrete¹

Enrique Hernández-Lemus^1,2^*

Guillermo de Anda-Jáuregui^1,2,3^*

¹Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico
²Centro de Ciencias de la Complejidad, Universidad Nacional Autónoma de México, Mexico City, Mexico
³Conacyt Research Chairs, National Council on Science and Technology, Mexico City, Mexico

It is known that cancer onset and development arise from complex, multi-factorial phenomena spanning from the molecular, functional, micro-environmental, and cellular up to the tissular and organismal levels. Important advances have been made in the systematic analysis of the molecular (mostly genomic and transcriptomic) within large studies of high throughput data such as The Cancer Genome Atlas collaboration. However, the role of the microbiome in the induction of biological changes needed to reach these pathological states remains to be explored, largely because of scarce experimental data. In recent work a non-standard bioinformatics strategy was used to indirectly quantify microbial abundance from TCGA RNA-seq data, allowing the evaluation of the microbiome in well-characterized cancer patients, thus opening the way to studies incorporating the molecular and microbiome dimensions altogether. In this work, we used such recently described approaches for the quantification of microbial species alongside with gene expression. With this, we will reconstruct bipartite networks linking microbial abundance and gene expression in the context of colon cancer, by resorting to network reconstruction based on measures from information theory. The rationale is that microbial communities may induce biological changes important for the cancerous state. We analyzed changes in microbiome-gene interactions in the context of early (stages I and II) and late (stages III and IV) colon cancer, studied changes in network descriptors, and identify key discriminating features for early and late stage colon cancer. We found that early stage bipartite network is associated with the establishment of structural features in the tumor cells, whereas late stage is related to more advance signaling and metabolic features. This functional divergence thus arise as a consequence of changes in the organization of the corresponding gene-microorganism co-expression networks.

Introduction

Colon cancer is consistently ranked among the top five contributors to cancer deaths worldwide (Bray et al., 2018). Its incidence and mortality are rapidly rising in developing countries, possibly influenced by changes in lifestyle and socioeconomic conditions. It is expected that this trend will actually further increase according to recent studies (Arnold et al., 2017).

As with many other cancers, colon cancer is known to have a genetic component as well as environmental factors which further modulate or increase the risks. Its molecular determinants include genomic, regulatory, and epigenomic components (Raskov et al., 2020) whereas the environmental component is also multifactorial, ranging from toxicological exposure (Fernández-Martínez et al., 2020), physical activity (Friedenreich et al., 2020), dietary habits and more. A more recent factor that is an important research topic is the role that microbiome interactions may be playing at the molecular and patho-physiological levels.

Recent findings have pointed out to different, sometimes disparate phenomena, such as the influence of bacterial protein toxins (Fiorentini et al., 2020), altered microbiome composition (Xu et al., 2020), and the non-rational use of antibiotics (Simin et al., 2020). Among these, microbome-host interactions are hypothesized to modulate and integrate these diverse signals (Yang et al., 2020). For instance, experimental evidence has been found for functional alterations mediated by microorganisms involved in colon cancer progression (Yu et al., 2020). It is currently accepted that these complex biomolecular and organismal interactions can be better understood using a systems biology approach (Peñalver Bernabé et al., 2018).

In the context of oncology, network biology has proven to be a powerful tool for the integration of multiple high throughput technologies (de Anda-Jáuregui and Hernández-Lemus, 2020). Networks provide flexible frameworks to represent the relevant physio-pathological interactions present in the tumor environments. For instance, bipartite networks have been used to represent gene expression control by micro-RNAs; a strategy that allows not only to describe statistical associations, but also to identify putative functional associations (de Anda-Jáuregui et al., 2018, 2019).

In this work, we reconstructed bipartite networks that capture the statistical dependence between microorganism abundance and gene expression in early (stages I and II) and late (stages III and IV) colon cancer, using data from The Cancer Genome Atlas (TCGA). We analyze these networks to identify changes in the relative relevance of microorganisms between these conditions, in terms of their topological role in their respective networks. We analyzed genes associated to the highest ranked microorganisms in each network as a means to identify changes in associated biological functions. This work hence aims to provide novel insights into microorganism-mediated functional alterations potentially involved in colon cancer progression.

Materials and Methods

For this work, we collected gene expression data from TCGA, along with microorganism quantification data that was generated by Poore et al. (2020), for the same 269 samples. We classified these samples into early (n = 150) and late (n = 119) colon cancer based on tumor stages as provided by TCGA metadata.

Interactions between each pair of measured microorganism and gene were detected using mutual information (MI) as a measure for statistical dependence. The highest ranked interactions were kept in order to reconstruct bipartite networks for each group. Downstream analyses included topological characterization and functional enrichment analysis. In Figure 1, we present a schematic representation of our analysis pipeline.

FIGURE 1

Figure 1. Network analysis pipeline.

Gene Expression Data

We used data from TCGA, obtained through the Genome Data Commons portal. We used level three pre-processed gene expression data; the full analysis pipeline is documented at https://docs.gdc.cancer.gov/Data/Bioinformatics_Pipelines/Expression_mRNA_Pipeline/; briefly, RNA-seq data is aligned using STAR (Dobin et al., 2012), and reads mapped to each gene are counted using HT-SEQ (Anders et al., 2014); Read counts are normalized using the Fragments per Kilobase of transcript per Million mapped reads (FPKM) calculation, which divides counts by the gene length and the total number of reads mapped to protein-coding genes.

Based on the available metadata, samples with tumor stages I and II were grouped as early colon cancer, while samples with tumor stages III and IV were grouped as late colon cancer. Due to some samples being discarded from the microbiome quantification pipeline by the original authors (Poore et al., 2020, see next section), we ended up using 137 early stage and 64 late stage samples (see Supplementary File 1 for the TCGA identificators of the used samples).

Microorganism Abundance Data

We used the public dataset generated in Poore et al. (2020) as our source for microorganism abundance data. Briefly, in said work the authors were able to quantify microorganism abundance in TCGA tumor samples via a novel bioinformatics approach. Briefly, They took raw whole genome sequencing (WGS) data and analyzed the nearly 0.9% of total sequencing reads were classified as non-human and assigned to bacteria, archaea, or viruses at the genus level using Kraken (Wood and Salzberg, 2014); which matches k-mers to taxa in a reference database. Normalization was performed considering sample number within a cancer type and sample type. To correct for batch effects, discrete taxonomical counts are converted to log-counts per million per sample using Voom (Law et al., 2014), and a secondary supervised normalization was performed to remove significant batch effects. Additionally, contamination concerns were addressed using the Bayesian source tracking model SourceTracker2 (Knights et al., 2011). Based on their quantification, we crossed microorganism abundance and gene expression data at the aliquot level, to ensure biological comparability between the datasets.

Microbiome-Gene Co-expression Quantification

Having matched gene expression and microorganism abundance data organized into expression matrices, we calculated mutual information for each pair of microorganism × gene. Mutual information is the maximum likelihood information theoretic measure of statistical dependence. Since it is capable to capture non-linear relationships between features, it has been successfully used for gene co-expression network reconstruction (de Anda-Jáuregui et al., 2016; He et al., 2017). It has also been previously used for bipartite network reconstruction of multiomic data (de Anda-Jáuregui et al., 2018, 2019). In this work, we calculated MI using the infotheo package in R.

Once MI values were calculated, we selected those interactions above the 99.5 quantile to be considered as links on a bipartite network: $B (m i c r o o r g a n i s m, g e n e)$ ; A bipartite graph (or bigraph) is a network whose nodes can be divided into two disjoint sets U and V such that each link connects a U-node to a V-node. Importantly, no links are found between two nodes belonging to the same set (Barabási et al., 2016). For mutual information calculation, data is discretized using the equal frequency method (Meyer 2008), which assigns each observation to one of N bins, with N being the number of observations. The discretized vectors are then used as the input for proper mutual information calculation, using an entropy estimation of the empirical probability distribution. Both of these calculations were performed using the infotheo package for R.

For completeness, the reconstructed networks contained all measured microorganisms (N = 4, 450) and protein-coding genes (N = 16, 593), even if they do not participate in any link (that is, they have connectivity degree k = 0). The threshold was selected based on previous analyses of multi-omic bipartite networks (de Anda-Jáuregui et al., 2018, 2019); we must acknowledge that by using this threshold we guarantee fair comparisons between the reconstructed networks; however, the structure and composition of these networks will not be comparable to networks generated through other methods (including the selection of a different threshold).

Network Analyses

We characterized the topology of each of the generated using a combination of the igraph (Csardi and Nepusz, 2006) in R and networkx (Hagberg et al., 2008) in Python. Additionally, we used Cytoscape (Shannon, 2003) to generate network visualizations. In this work, we focused mainly on centrality measures including degree, bipartite clustering coefficient, and redundancy coefficients (Latapy et al., 2008). Comparisons between appropriate distributions were evaluated using the Kolmogorov-Smirnov test.

Functional Enrichment of High-Degree Microorganism Gene Neighborhoods

We analyzed the neighborhoods of the highest ranked microorganisms (based on their degree) to identify host biological functions associated to these microorganisms. To do so, we performed over-representation analysis (ORA) via FDR-corrected hypergeometric tests for biological processes and molecular functions (as annotated in the Gene Ontology database) using the WebgestaltR (Liao et al., 2019) package. Parameters for ORA considered the full genome as the reference set, and a false discovery rate (FDR) threshold of 0.05. It should be noted that the enrichment is performed over the set of genes that conform the neighborhood of each microorganism; this is to identify biological functions from the host that can be associated to microorganisms through their co-expressed genes (see Figure 2). We further used natural language processing tools from the tm package in R (Meyer et al., 2008) to compare identified functions and processes, by tokenizing their names and descriptions and identifying the most mentioned keywords or tokens.

FIGURE 2

Figure 2. Enrichment of host biological functions associated to microorganisms through their gene neighborhoods. Each microorganism has a set of neighbor genes in the bipartite network. This gene set is tested against a set of known biological functions (as annotated in the GO database) through the hypergeometric test. Through these procedure, we can associate known biological functions from the host to each of the measured microorganisms.

Results and Discussion

Microorganism-Gene Co-expression Networks Are Topologically Similar in Early and Late Colon Cancer

By studying bipartite networks, we wanted to know what are some possible ways in which the presence of microorganisms may affect the host's response (as proxied by changes in gene expression highly correlated with microbial abundance) and vice versa. Clues to this may be provided by the microbe-gene links. The reconstructed microorganism-gene co-expression networks for the early and late stages of colon cancer exhibit a similar global topology. They are both dominated by a giant connected component that contains all detected links. This giant connected component is composed of all measured microorganisms, and over 80% of measured genes. It should be noted that in the case of both genes and microorganisms, presence in the network is not directly correlated by the abundance in the original measurements, nor biased due to zero-inflation effects (see Supplementary File 2). Figure 3 depicts these networks. Table 1 presents the global topological features of these networks.

FIGURE 3

Figure 3. Microorganism-gene co-expression networks for early (A) and late (B) colon cancer. In this visualizations, microorganisms are colored purple and genes are colored blue. Nodes with degree k = 0 are removed for visualization purposes, highlighting that in both networks, connected nodes form a single giant component. (C,D) Show a subset of the early (C) or late (D) networks, highlighting the most connected microorganisms.

TABLE 1

Table 1. General network descriptors.

The bipartite degree distributions of these networks (seen in Figure 4) are quite similar between early and late stage. In this context, it is more informative to assess the degree distributions for each type of nodes (microorganisms and genes) separately. In this regard, we observe that in both networks, genes follow a heavy-tailed distribution (blue dots in Figure 4); that is, most genes are connected to few microorganisms, whereas a few genes are connected to many microorganisms. Meanwhile, microorganism nodes (red dots in Figure 4) exhibit a different pattern: a curve with no low-degree nodes; indicating that every detected microorganism has putative effects on the expression of a relatively large set of genes. In any case, the distributions for both genes and microorganisms are similar between early and late stages cancer networks.

FIGURE 4

Figure 4. Degree distributions for the early and late colon cancer networks. Values for microorganisms are shown in red, and values for genes are shown in blue. Notice how genes exhibit a heavy-tailed distribution, whereas a different behavior is observed for microorganisms in both networks.

We evaluated two other topological properties of the nodes in these networks: the clustering and redundancy coefficients (see Figure 5).

FIGURE 5

Figure 5. Density plots for redundancy (top) and clustering (bottom) coefficients. We can see how microorganisms (red lines) are significantly less redundant and clustered than genes within these networks.

Network redundancy (sometimes called path degeneracy) is related to how many different paths or trajectories can be taken to go from one node to another. Unlike trees or loosely connected networks, complex networks (such as the ones discussed here) are characterized by being highly redundant. This means that there are multiple (sometimes many) different paths connecting two given nodes. For probabilistic networks this implies that the Markov blanket (the subset of the network with the useful connectivity information) spans much of the network. This in turn implies that to break up (percolate, in technical terms) the network to pieces, one must remove a large number of links. In the case of bipartite networks, the concept of redundancy has to be adapted, since neighborhood overlaps correspond to links obtained in several ways during projection which are not distinguishable. Then redundancy is caused by nodes that when removed from the bipartite graph, do not cause significant changes in the projection (Latapy et al., 2008).

The clustering coefficient is a quantitative measure of the tendency of nodes in a graph to cluster together. It is calculated for a node (local clustering coefficient), as the ratio of the number of “triangles” (technically “closed triplets”) formed by links connected to this node, to all possible triangles that can be formed with this node and its immediate neighbors. The global clustering coefficient is a network quantity, which is indeed the average of the local clustering coefficient of all the nodes in connected components of the network. In the case of clustering coefficients for bipartite networks, these measure the probability that given four nodes with three links, they are actually all connected with four links (all the possible links in a bipartite configuration of four nodes) (Latapy et al., 2008).

In bipartite networks, these are measures of the contribution of a given node to the connectivity of nodes of the opposite type (Latapy et al., 2008). We observe that in the case of microorganisms (red curves in Figure 5), these exhibit low values: this indicates that there is no single microorganism through which most genes could interact. Meanwhile, genes (blue curves in Figure 5) exhibit higher values, meaning that gene-mediated connections between microorganisms are, on average, more likely to be redundant. Table 2 shows the statistical differences between the evaluated distributions.

TABLE 2

Table 2. Distribution comparison (Kolmogorov-Smirnov test).

Despite these overall similarities, networks for early and late colon cancer exhibit notable differences in terms of their connections. Although the composition of the GCC is fundamentally similar in terms of the microorganisms and genes found in it, the way in which this are connected is completely dissimilar, with a Jaccard similarity for edges of only 0.28%.

This differences in connectivity in turn explain the different degree ranking of both microorganisms and genes. The ranked list of microorganisms and genes show poor correlation between the early and late stages (Spearman ρ of 0.015 for microorganisms and 0.269 for genes). Due to these differences, the highest ranked microorganisms are (a) different in the early and late stages of colon cancer and (b) have a different set of associated genes. With this in mind, we explored how these facts change the set of host biological functions associated to the most connected microorganisms.

Regarding microorganisms, Tables 3, 4 present the top 10 highly connected microorganism (at the genus level) in the gene microorganism bipartite networks for early and late stage colon cancer, respectively.

TABLE 3

Table 3. Early colon cancer: top 10 highest ranked microorganism by degree.

TABLE 4

Table 4. Late colon cancer: top 10 highest ranked microorganism by degree.

By examining Tables 3, 4, it may be surprising that most of the microbial species themselves have not been reported to be related with the onset and development of colon cancer. This of course may be explained by the fact that systematic high-throughput studies of the relationship between cancer and microbial dysbiosis are indeed still being developed. So the absence of evidence may not (yet) be taken as evidence of absence. However, in the next subsection we will see how, even though the organisms themselves may not sound that familiar, the statistically dependent gene neighborhoods of such microorganisms will recapitulate relevant functional features known in the biology of colon cancer.

Host Biological Functions Associated to Highly Connected Microorganisms Change With Colon Cancer Progression

We set to identify functions that could be linked to microorganisms detected in the early and late stage tumors. Since there is no annotation of human biological functions associated to microorganisms, we performed ORA on the gene neighborhoods of the top 10 highest ranked microorganisms by degree, searching for enrichment of biological processes and molecular functions annotated in Gene Ontology.

Enrichment Results for Biological Processes

The biological processes branch of the Gene Ontology is devoted to biologically relevant functional processes, some of these have clearly understood biomolecular mechanisms and some others are yet to be fully dissected. However, they allow for an advancement in our understanding of the molecular and cellular physiology behind gene and protein interactions.

Statistically enriched biological processes may represent functional processes in which the host-microbiome interactions are manifested. As we will see, some of these actually correspond to well-known hallmarks of cancer.

In Figures 6, 7, we present the results of these enrichment analyses as a heatmap. Notice that even if we performed the analyses for the 10 highest ranked microorganisms, only five genus were significantly associated to functions through their gene neighborhoods in each network.

FIGURE 6

Figure 6. Functional enrichment of highly connected microorganisms in the early colon cancer network—biological processes.

FIGURE 7

Figure 7. Functional enrichment of highly connected microorganisms in the late colon cancer network—biological processes.

Notably, higher enrichment values (in terms of FDR) are found in the early stage (Figure 6) than in the late stage (Figure 7). The interpretation is that biological functions are perhaps better mapped to the gene neighborhoods in the early colon cancer network—possibly indicating a more coordinated response to these microorganisms.

We identified only two biological processes appearing both in the early and late networks. These are protein-containing complex localization and nuclear transport. To better understand the functional differences identified, we tokenized the names of the detected biological processes and compared them between the early and late networks.

In Figure 8, we compare and contrast the terms associated to these biological processes. We observe in the early stages concepts associated to tumorigenesis such as proliferation, biogenesis, and (cell) cycle; as well as nucleic acids. Meanwhile, in the late stages, we observe terms that could be associated to late-stage cancer such as migration and angiogenesis. Concepts shared between both stages include regulation, muscle, and protein. For the full set of enrichment results, please refer to Supplementary File 3.

FIGURE 8

Figure 8. Venn diagram of top 20 most mentioned concepts in biological processes associated to early and late colon cancer.

Enrichment Results for Molecular Functions

By recognizing that our understanding of the way microbiome-host interactions may be playing a role on the onset and development of cancer-associated biological processes is still quite incipient, we decided to also examine the molecular functions dimension of the Gene Ontology. This is so since molecular function refers to specific chemical and biochemical interactions of a more general nature that may be related to one, or more commonly to a large set of biological processes.

The rationale is that molecular species related to the entangled multi-microbial metabolism are possible interacting with the molecules involved in human (and in particular tumor and tumor micro-environment) cells.

Figures 9, 10 present the molecular function enrichment analysis for the early and late colon cancer networks. As in the case of biological processes, molecular functions are enriched on different microbial genus in the early and late stage networks. It is worth noticing that the more significant physico-chemical functions in the early stage correspond to structural features (particularly enriched for the gene-neighborhood of the Nitrosospira genus, see Figure 9) whereas the more enriched molecular functions in the late stage network corresponds to actin binding for genes in the network vicinity of the Pelomonas genus (Figure 10).

FIGURE 9

Figure 9. Functional enrichment of highly connected microorganisms in the early colon cancer network—molecular functions.

FIGURE 10

Figure 10. Functional enrichment of highly connected microorganisms in the late colon cancer network—molecular functions.

We can also notice in Figure 10 that other microbial genuses' gene neighborhoods are highly enriched for molecular functions, such is the case of Jeotgallicoccus for actin binding, and to several types of oxido-reductase, as well as cytochrome-oxidase activity; and the case of Nitriliruptor for GTP-ase and nucleotide binding, and Desulphurella for ubiquitin and thyroid receptor activity.

As in the case of the Biological Processes enrichment analysis, Figure 11 presents the results of natural language processing and tokenization of terms resulting in the statistically significant enrichment GO-categories. As it was mentioned, early stage molecular functions are somehow related to structural cellular features, whereas late stage are related to cellular metabolism and transport processes, being binding phenomena the common function at the intersection of both stage networks. For the full set of enrichment results, please refer to Supplementary File 3.

FIGURE 11

Figure 11. Venn diagram of top 20 most mentioned concepts in molecular functions associated to early and late colon cancer.

Discussion

Topology of the Microbiome-Gene Co-expression Networks

Complex networks are characterized by their composition and global topological structure, that is by what are their elements and how are these connected in the networks. As presented in Figures 2–4 and Table 1 in results, the global topological structures of early and late stage colorectal cancer bipartite networks are indeed quite similar. Approximately equal sizes in terms of number of nodes and edges. Similar size of their giant connected components and even a very high value of node similarity in their GCCs. However, as it can be seen in Table 1 the edge similarity (a quantity proportional to the number of shared edges between the two networks) is actually extremely small (0.28%). This means that even if the elementary components of the networks (i.e., the genes and microorganisms) are almost the same and the global network features are so similar, the actual networks are indeed quite different, something unsurprising given that they represent two different biological scenarios.

Also noteworthy is the fact that by examining Figure 4 we could notice that the two different types of nodes (genes and microorganisms) present striking differences in their degree connectivity probability distributions (blue dots representing genes and red dots microorganisms) and that the same patterns is observed for early and late stage colorectal cancer. The degree distributions for genes present long-tailed distributions that have been thoroughly characterized in complex biomolecular networks. In those long-tailed distributions one can notice how most genes have a relatively low number of connections whereas a few hub genes are densely connected in the networks.

Microorganisms, on the other hand present a rather different degree distribution scenario. In both networks, microorganisms show a more symmetric short-tailed distribution in which a most microorganisms are highly connected and present narrower variability in their connectivity degree. This difference perhaps represent that microbial communities somehow serve as integrating entities in the bipartite network. This, in turn, may be related with the low redundancy coefficients displayed by microorganisms in both networks as it can be seen in Figure 5 (top row). Low redundancy of the specific microbial agents may prove later to have relevance for the design of microbiome-driven therapeutic strategies, though it is still very early to further speculate on this.

One relevant and complementary aspect to consider on the role that gene-microbial interactions may play can be glimpsed by looking at the probability density distributions for the clustering coefficient (Figure 5 bottom row). We can see that in both networks (early and late stage) microorganisms present low values of clustering coefficient, whereas for genes there are wider probability distributions. Microorganisms are highly connected but not so-clustered. This in turn contributes to their being less redundant. This also may imply that the gene-microbiome co-expression program in the cancer networks is shaped by the full set of gene-microbial interactions and is not dominated by a few central players. This fact has been already discussed in the literature: physio-pathological phenomena related to microbial activity is, in general, influenced by microbiome dysbiosis rather than by the activity of a single or a few microorganisms.

Changes in Network Composition and Relative Importance

The latter points led us to discuss on how, even if the whole set of microorganisms is present in both, early and late stage colorectal cancer networks, their connectivity and importance in information processing within the networks vastly differ.

Consider Tables 3, 4, for instance. There, we can see that the top 10 highly ranked microorganisms (that is, those with higher statistical dependencies and connectivity in the gene-microbial co-expression networks) are quite different. Indeed, no microorganism is present simultaneously at the top 10 of both networks, even at the, somewhat general, genus level presented here. This points out to a possible reprogramming of the gene-microbiome regulatory structure associated with the phenotypic differences between early and late stage colorectal cancer.

Regarding the highest ranked microorganisms associated to early stage colon cancer (Table 3), we have found that, in the case of Rhodospirillum, for instance, it is known to be able to produce molecules such as L-asparaginase which is a regulator of telomerase activity that has been found able to act on human cancer and immune cells (Zhdanov et al., 2017a,b; Plyasova et al., 2020). Nitrosospira is associated with processes related to ammonia oxidation (Kowalchuk and Stephen, 2001) in connection with colon cancer (Bingham et al., 1996; Bruce et al., 2000; Davis and Milner, 2009; O'keefe, 2016). Pontibacter has been found enriched in patients with gastric cancer and correlated with TNM severity (Dong et al., 2019).

In the case of Shinella, significant abundance has been found in mucosal associated microbiota in patients with severe irritable bowel syndrome (Li et al., 2018), and also is known to be involved in the production of N-nitrosonornicotine, a strong (group 1) carcinogen (Qiu et al., 2016). Vogesella dysbiosis has been recently found associated with gastric cancer (Coker et al., 2018; Rajilic-Stojanovic et al., 2020), as well as with changes in the endometrial microbiota associated with inflammatory cytokines in endometrial cancer (Lu et al., 2020), and with esophageal squamous cell carcinoma (Lv et al., 2020).

As regards to Rubrivivax, it is able to produce a molecule rubrivivaxin that is a cytotoxic agent and a COX-1 inhibitor (Kumavath et al., 2011). As is known COX-1 and COX-2 are relevant players in human colorectal cancer (Sano et al., 1995; Sinicrope and Gill, 2004; Pannunzio and Coluccia, 2018). Rubrivivax dysbiosis has also been found present in connection to lung cancer (Greathouse et al., 2018).

Thermodesulfovibrio has been recently discussed to play a role in the modulation of FOXP3 and IL-17 involved in immune tolerance in colon cancer (Bergsten et al., 2020). Sulfate reducing bacteria, also including Desulphurella are known to be associated with the pathogenesis of colorectal cancer (Kováč et al., 2017; Suri et al., 2019). Nitriliruptor has been reported to be involved colorectal cancer (Marzban et al., 2020), its dysbiosis has been mentioned also in connection to renal carcinomas (Wang et al., 2020) and severe cases of irritable bowel syndrome (Zhuang et al., 2018).

In connection with microorganisms associated with late stage colon cancer (Table 4), Jeotgalicoccus abundance has been found to be abnormal in the urinary microbiome in connection with bladder cancer (Hussein et al., 2021). It also has been included in a metagenomic panel screening for the diagnosis of ovarian cancer (Kim et al., 2020) and associated with antibiotic perturbation leading to accelerated tumor growth in breast cancer (Kirkup et al., 2019). Interestingly, Cryocola has been found to be increasingly abundant after H. pylori eradication in gastric cancer cells (Figueiredo and Castaño-Rodŕıguez, 2020) which may point out to second order competition effects. Dactylosporangium produces molecules such as macrolides that disrupt the mitochondrial membrane potentials in colorectal cancer cells HCT116 and HT29 (Tan et al., 2018) and belong to a class of microorganisms that are being considered as source of bioactive metabolites with pharmaceutical interest (Rangseekaew and Pathom-Aree, 2019).

In the case of Pelomonas, it has been recognized as involved in the onset of multifocal atrophic gastritis with intestinal metaplasia, a likely pre-malignant gastric lesion (Yang et al., 2016). It is also abundant in the tumor microenvironment of up to fifty percent of colorectal tumors in one study (Pierce et al., 2018). Pelomonas also has been found as one of the disrupted genera associated with bladder cancer (Liu et al., 2019; Mansour et al., 2020).

Zymomonas have been recognized to play several roles in cancer. Zymomonas' levan is involved in MMP-9 activation and extracellular matrix remodeling and inflammation (Sturzoiu et al., 2011) and also to induce changes in oxidative states leading to antiproliferative and proapoptotic effects in MCF7 breast cancer cells (Queiroz et al., 2017). Similarly, Methylomonas have been found to be involved in the production of toxin genes that are functional drivers in human colorectal cancer (Dutilh et al., 2013) and in the production of azurin, a known cytotoxic factor regulating cell death (Chakrabarty et al., 2008).

It should be noticed, however, that confirmation studies, in particular functional intervention assays, are needed to establish more clearly the actual role of microbiome dysbiosis in connection with the onset and development of human malignancies in general and specially colon cancer.

Biological Functionality Associated to the Microbiome Changes With Progression

The concerted study of gene-microbial interactions is still at its infancy. It results challenging thus to ascertain or even hypothesize on the role that microbial communities play in the already complex and incomplete panorama of biomolecular interactions inside human cells and tissues. In order to advance, if just a little, in our understanding of how microorganisms and their joint metabolic fluxes and ecological interactions influence the molecular and cellular composition and functions, we have resorted to analyse the gene-microorganism co-expression networks. By looking at the known molecular players (genes) that present strong statistical dependencies with specific microbial species we may start by assigning those (via guilt-by-association schemes) a putative functional role in human (in this case, tumor) biology.

Gene enrichment analysis was used to indirectly probe associations with the microbiome by looking at the gene-neighborhood of highly connected microorganisms in early and late stage colorectal cancer bipartite networks. Gene Ontology Biological Processes (BP) and Molecular Function (MF) branches were considered as target databases for the statistical over-representation enrichment analysis as presented in Figures 6, 7 for BP, and Figures 9, 10 for MF in early/late colorectal tumor networks, respectively.

As presented in results, we were able to find functional differences between the early and late stage gene-microbiome co-expression programmes. A number of statistical significant processes and molecular functions are presented in the heatmaps in Figures 5, 6, 8, 9. To present a summary of these findings, we used natural language processing tools on tokenized versions of the enrichment tables. Figures 8, 11 present Venn diagrams depicting highly mentioned tokens. We can see that in the case of BP (Figure 8), early stage networks are enriched for terms related to proliferation and cell growth, including structural elements and synthesis of biomaterials, whereas late stage is characterized by terms related to signaling and transport processes. Biochemical and physical regulation mechanisms are present in processes at the intersection of both networks.

Following a similar approach, tokens related to molecular functions associated with early and late stage colorectal cancer are presented in Figure 11. As in the case of biological processes, molecular functions associated with early tumors are related with structural features, late stage contains terms related to signaling and metabolic interactions, whereas the only molecular functions at the intersection of stages are related to binding.

By integrating these results some preliminary ideas may be drawn: first of all, it is becoming possible to analyse (albeit still in a somehow rudimentary way) the combined effect that the microbiome plays in conjunction with human tumor cells in the onset, establishment and development of colorectal cancer. These initial analyses, reveal differences in the functional features of the gene-microbiome bipartite co-expression networks, as inferred from probabilistic modeling of high-throughput genomic and transcriptomic experiments in large datasets. These differences, when supplemented with statistical enrichment analyses point out to a plausible scenario in which early stage colon cancer presents features related to the establishment of distinctive physical structures in the cells, that start to couple with biomolecular interactions at the cellular level, whereas advanced stages present an image of more complex signaling and metabolic processes occurring as the tumor keeps evolving to more advanced, malignant stages.

Scope and Limitations

In this work we identify changes in the co-expression/co-presence network connectivity found between colon cancer microbiome and its gene expression as the disease progresses. This type of studies are admittedly at their preliminary stages, but the integrative view they aim to provide seems promissory toward a better understanding of complex disease phenotypes. It is relevant, however, to acknowledge some limitations and assumptions of our current approach, in order to properly contextualize our findings and convey a balanced message.

One worth-mentioning constraint that may restrict the scope of our assertions is the following: Our work is based on experimental data coming from the TCGA colon cancer cohort. The volume of this cohort, as well as the availability of proper, well-curated, clinical metadata, makes it suitable for our (high-throughput, probabilistic-based) analyses. Furthermore, the open microbiome quantification strategy and the resulting data from Poore et al. (2020) allowed for a (relatively) high-confident network reconstruction. This is, however, the only cohort for which such suitable data is available, thus limiting our ability to replicate our findings in an independent cohort. While the sample size is adequate for probabilistic network reconstruction purposes, it can only capture as much of the microbiome heterogeneity as what was captured by the original authors. On a related topic, since access to the TCGA raw data required for the microbiome quantification data described in Poore et al. (2020) is controlled, we must rely on the quantification strategy as performed by the original authors—which is in turn influenced by sequencing depth and wet-lab procedure constraints from the original work.

Aside from these specific issues, some additional, general limitations should also be mentioned: although the methods used both in our work and in Poore et al. (2020) and even those in the TCGA original approach are all in the state of the art, there are still challenges. Even though the TCGA data has both, excellent depth and high quality sequencing, it was not intended as a metagenomic sequencing assay. Also, even the best metagenomic approaches rely on currently incomplete annotations. Pre-processing stages to consider multi-omic approaches, including metagenomic data are being developed so, these may not be as optimized and standardized as it will be desirable.

In spite of these clear limitations, we are convinced of the value of approaches such as the one presented here to start trying to answer these questions from an integrative data-centered view.

Conclusions

The progression of colon cancer involves changes in the interactions between cancer tissue and microbiome. In this work, we integrated microbiome quantification data with gene expression data using network models. These models describe the aforementioned changes in this interactions. We found that indeed, the set of microorganisms with a higher connectivity with host genes changes from the early to the late stages of colon cancer. Furthermore, reorganization is accompanied by changes in the associated set of biological functions, showing physiological adaptations associated to the tumor-microbiome relationships. To better understand and validate this findings, future experimental work is needed to properly characterize the mechanisms through which the microbiome may be mediating the observed tumor adaptations.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Author Contributions

IU-N organized data, performed calculations, and analyzed the data. EH-L co-designed the study, contributed to the methodological approach, analyzed data, discussed results, reviewed the manuscript, and co-supervised the project. GdA-J envisioned the project, devised the methodological strategy, developed code, performed calculations, analyzed data, discuss results, drafted the manuscript, and co-supervised the project. All authors read and approved the final manuscript.

Funding

This work was supported by the Consejo Nacional de Ciencia y Tecnología [SEP-CONACYT-2016-285544 and FRONTERAS-2017-2115], and the National Institute of Genomic Medicine, México. Additional support has been granted by the Laboratorio Nacional de Ciencias de la Complejidad, from the Universidad Nacional Autónoma de México. EH-L is recipient of the 2016 Marcos Moshinsky Fellowship in the Physical Sciences.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors want to thank Gabriela Graham for her support with language editing and proof-reading of this manuscript.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.617505/full#supplementary-material

References

Anders, S., Pyl, P. T., and Huber, W. (2014). HTSeq-a Python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169. doi: 10.1093/bioinformatics/btu638

PubMed Abstract | CrossRef Full Text | Google Scholar

Arnold, M., Sierra, M. S., Laversanne, M., Soerjomataram, I., Jemal, A., and Bray, F. (2017). Global patterns and trends in colorectal cancer incidence and mortality. Gut 66, 683–691. doi: 10.1136/gutjnl-2015-310912

PubMed Abstract | CrossRef Full Text | Google Scholar

Barabási, A.-L. (2016). Network Science. Cambridge: Cambridge University Press.

Google Scholar

Bergsten, E., Mestivier, D., Amiot, A., DeAngelis, N., Khazaie, K., and Sobhani, I. (2020). Immune tolerance to colon cancer is mediated by colon dysbiosis: human results and experimental in vivo validation. J. Clin. Oncol. 38:1. doi: 10.1200/JCO.2020.38.15_suppl.e16062

CrossRef Full Text | Google Scholar

Bingham, S., Pignatelli, B., Pollock, J., Ellul, A., Malaveille, C., Gross, G., et al. (1996). Does increased endogenous formation of n-nitroso compounds in the human colon explain the association between red meat and colon cancer? Carcinogenesis 17, 515–523. doi: 10.1093/carcin/17.3.515

PubMed Abstract | CrossRef Full Text | Google Scholar

Bray, F., Ferlay, J., Soerjomataram, I., Siegel, R. L., Torre, L. A., and Jemal, A. (2018). Global cancer statistics 2018: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. Cancer J. Clin. 68, 394–424. doi: 10.3322/caac.21492

PubMed Abstract | CrossRef Full Text | Google Scholar

Bruce, W. R., Giacca, A., and Medline, A. (2000). Possible mechanisms relating diet and risk of colon cancer. Cancer Epidemiol. Prevent. Biomark. 9, 1271–1279.

PubMed Abstract | Google Scholar

Chakrabarty, A. M., Gupta, T. K. D., Punj, V., Zaborina, O., Hiraoka, Y., and Yamada, T. (2008). Cytotoxic Factors for Modulating Cell Death. US Patent App. 11/509,682. Boston, MA: Google Patents.

Google Scholar

Coker, O. O., Dai, Z., Nie, Y., Zhao, G., Cao, L., Nakatsu, G., et al. (2018). Mucosal microbiome dysbiosis in gastric carcinogenesis. Gut 67, 1024–1032. doi: 10.1136/gutjnl-2017-314281

PubMed Abstract | CrossRef Full Text | Google Scholar

Csardi, G., and Nepusz, T. (2006). The igraph software package for complex network research. InterJ. Complex Syst. 1695, 1–9.

Google Scholar

Davis, C. D., and Milner, J. A. (2009). Gastrointestinal microflora, food components and colon cancer prevention. J. Nutr. Biochem. 20, 743–752. doi: 10.1016/j.jnutbio.2009.06.001

PubMed Abstract | CrossRef Full Text | Google Scholar

de Anda-Jáuregui, G., Espinal-Enríquez, J., Drago-García, D., and Hernández-Lemus, E. (2018). Nonredundant, highly connected microRNAs control functionality in breast cancer networks. Int. J. Genomics 2018:9585383. doi: 10.1155/2018/9585383

PubMed Abstract | CrossRef Full Text | Google Scholar

de Anda-Jáuregui, G., Espinal-Enríquez, J., and Hernández-Lemus, E. (2019). Highly-connected, non-redundant micrornas functional control in breast cancer molecular subtypes. BiorXiv. 1–10. doi: 10.1101/652354

CrossRef Full Text | Google Scholar

de Anda-Jáuregui, G., and Hernández-Lemus, E. (2020). Computational oncology in the multi-omics era: state of the art. Front. Oncol. 10:423. doi: 10.3389/fonc.2020.00423

PubMed Abstract | CrossRef Full Text | Google Scholar

de Anda-Jáuregui, G., Velázquez-Caldelas, T. E., Espinal-Enríquez, J., and Hernández-Lemus, E. (2016). Transcriptional network architecture of breast cancer molecular subtypes. Front. Physiol. 7:568. doi: 10.3389/fphys.2016.00568

PubMed Abstract | CrossRef Full Text | Google Scholar

Dobin, A., Davis, C. A., Schlesinger, F., Drenkow, J., Zaleski, C., Jha, S., et al. (2012). STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21. doi: 10.1093/bioinformatics/bts635

PubMed Abstract | CrossRef Full Text | Google Scholar

Dong, Z., Chen, B., Pan, H., Wang, D., Liu, M., Yang, Y., et al. (2019). Detection of microbial 16s rRNA gene in the serum of patients with gastric cancer. Front. Oncol. 9:608. doi: 10.3389/fonc.2019.00608

PubMed Abstract | CrossRef Full Text | Google Scholar

Dutilh, B. E., Backus, L., van Hijum, S. A., and Tjalsma, H. (2013). Screening metatranscriptomes for toxin genes as functional drivers of human colorectal cancer. Best Pract. Res. Clin. Gastroenterol. 27, 85–99. doi: 10.1016/j.bpg.2013.03.008

PubMed Abstract | CrossRef Full Text | Google Scholar

Fernández-Martínez, N. F., Ching-López, A., Olry de Labry Lima, A., Salamanca-Fernández, E., Pérez-Gómez, B., Jiménez-Moleón, J. J., et al. (2020). Relationship between exposure to mixtures of persistent, bioaccumulative, and toxic chemicals and cancer risk: a systematic review. Environ. Res. 188:109787. doi: 10.1016/j.envres.2020.109787

PubMed Abstract | CrossRef Full Text | Google Scholar

Figueiredo, C., and Casta no-Rodríguez, N. (2020). The microbiome and gastric cancer: an update. Microb. Health Dis. 2:e627. doi: 10.26355/mhd_20206_267

CrossRef Full Text | Google Scholar

Fiorentini, C., Carlini, F., Germinario, E. A. P., Maroccia, Z., Travaglione, S., and Fabbri, A. (2020). Gut microbiota and colon cancer: a role for bacterial protein toxins? Int. J. Mol. Sci. 21:6201. doi: 10.3390/ijms21176201

PubMed Abstract | CrossRef Full Text | Google Scholar

Friedenreich, C. M., Ryder-Burbidge, C., and McNeil, J. (2020). Physical activity, obesity and sedentary behavior in cancer etiology: epidemiologic evidence and biologic mechanisms. Mol. Oncol. 1–11. doi: 10.1002/1878-0261.12772

PubMed Abstract | CrossRef Full Text | Google Scholar

Greathouse, K. L., White, J. R., Vargas, A. J., Bliskovsky, V. V., Beck, J. A., von Muhlinen, N., et al. (2018). Microbiome-TP53 gene interaction in human lung cancer. bioRxiv. 273524. doi: 10.1101/273524

CrossRef Full Text | Google Scholar

Hagberg, A. A., Schult, D. A., and Swart, P. J. (2008). “Exploring network structure, dynamics, and function using networkX,” in Proceedings of the 7th Python in Science Conference, eds G. Varoquaux, T. Vaught, and J. Millman (Pasadena, CA), 11–15.

Google Scholar

He, J., Zhou, Z., Reed, M., and Califano, A. (2017). Accelerated parallel algorithm for gene network reverse engineering. BMC Syst. Biol. 11:5. doi: 10.1186/s12918-017-0458-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Hussein, A. A., Elsayed, A. S., Durrani, M., Jing, Z., Iqbal, U., Gomez, E. C., et al. (2021). Investigating the association between the urinary microbiome and bladder cancer: an exploratory study. Urol. Oncol. doi: 10.1016/j.urolonc.2020.12.011

PubMed Abstract | CrossRef Full Text | Google Scholar

Kim, Y.-K., Taesung, P., Song, Y. S., and Kim, S. I. (2020). Method for Diagnosing Ovarian Cancer Through Microbial Metagenome Analysis. US Patent App. 16/629,360. Boston, MA: Google Patents.

Google Scholar

Kirkup, B. M., McKee, A., Makin, K. A., Paveley, J., Caim, S., Alcon-Giner, C., et al. (2019). Perturbation of the gut microbiota by antibiotics results in accelerated breast tumour growth and metabolic dysregulation. BioRxiv. 553602. doi: 10.1101/553602

CrossRef Full Text | Google Scholar

Knights, D., Kuczynski, J., Charlson, E. S., Zaneveld, J., Mozer, M. C., Collman, R. G., Bushman, F. D., et al. (2011). Bayesian community-wide culture-independent microbial source tracking. Nat. Methods 8, 761–763. doi: 10.1038/nmeth.1650

PubMed Abstract | CrossRef Full Text | Google Scholar

Kováč, J.Kushkevych, I., et al. (2017). “New modification of cultivation medium for isolation and growth of intestinal sulfate-reducing bacteria,” in Proceeding of International PhD Students Conference MendelNet (Brno), 702–707.

Google Scholar

Kowalchuk, G. A., and Stephen, J. R. (2001). Ammonia-oxidizing bacteria: a model for molecular microbial ecology. Annu. Rev. Microbiol. 55, 485–529. doi: 10.1146/annurev.micro.55.1.485

PubMed Abstract | CrossRef Full Text | Google Scholar

Kumavath, R. N., Ramana, C. V., and Sasikala, C. (2011). Rubrivivaxin, a new cytotoxic and cyclooxygenase-i inhibitory metabolite from rubrivivax benzoatilyticus ja2. World J. Microbiol. Biotechnol. 27, 11–16. doi: 10.1007/s11274-010-0420-9

CrossRef Full Text | Google Scholar

Latapy, M., Magnien, C., and Vecchio, N. D. (2008). Basic notions for the analysis of large two-mode networks. Soc. Netw. 30, 31–48. doi: 10.1016/j.socnet.2007.04.006

CrossRef Full Text | Google Scholar

Law, C. W., Chen, Y., Shi, W., and Smyth, G. K. (2014). voom: precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 15:R29. doi: 10.1186/gb-2014-15-2-r29

PubMed Abstract | CrossRef Full Text

Li, G., Yang, M., Jin, Y., Li, Y., Qian, W., Xiong, H., et al. (2018). Involvement of shared mucosal-associated microbiota in the duodenum and rectum in diarrhea-predominant irritable bowel syndrome. J. Gastroenterol. Hepatol. 33, 1220–1226. doi: 10.1111/jgh.14059

PubMed Abstract | CrossRef Full Text | Google Scholar

Liao, Y., Wang, J., Jaehnig, E. J., Shi, Z., and Zhang, B. (2019). Webgestalt 2019: gene set analysis toolkit with revamped UIS and apis. Nucleic Acids Res. 47, W199–W205. doi: 10.1093/nar/gkz401

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, F., Liu, A., Lu, X., Zhang, Z., Xue, Y., Xu, J., et al. (2019). Dysbiosis signatures of the microbial profile in tissue from bladder cancer. Cancer Med. 8, 6904–6914. doi: 10.1002/cam4.2419

PubMed Abstract | CrossRef Full Text | Google Scholar

Lu, W., He, F., Lin, Z., Liu, S., Tang, L., Huang, Y., et al. (2020). Dysbiosis of the endometrial microbiota and its association with inflammatory cytokines in endometrial cancer. Int. J. Cancer. 1–12. doi: 10.1002/ijc.33428

PubMed Abstract | CrossRef Full Text | Google Scholar

Lv, W., Zuo, J., Wang, Y., Fan, Z., Feng, L., Wang, L., et al. (2020). The microbial characteristics of esophageal squamous cell carcinoma (ESCC) and healthy subjects. J. Clin. Oncol. 38:e16546. doi: 10.1200/JCO.2020.38.15_suppl.e16546

CrossRef Full Text | Google Scholar

Mansour, B., Monyók, Á., Makra, N., Gajdács, M., Vadnay, I., Ligeti, B., et al. (2020). Bladder cancer-related microbiota: examining differences in urine and tissue samples. Sci. Rep. 10, 1–10. doi: 10.1038/s41598-020-67443-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Marzban, M., Kashefian Naeeini, S., Ghazbani, A., and Karimi, Z. (2020). Systematic review of fecal and mucosa-associated microbiota compositional shifts in colorectal cancer. Ann. Colorect. Res. 8, 1–13. doi: 10.30476/ACRR.2020.46747

CrossRef Full Text | Google Scholar

Meyer, D., Hornik, K., and Feinerer, I. (2008). Text mining infrastructure in R. J. Stat. Softw. 25, 1–54. doi: 10.18637/jss.v025.i05

CrossRef Full Text | Google Scholar

O'keefe, S. J. (2016). Diet, microorganisms and their metabolites, and colon cancer. Nat. Rev. Gastroenterol. Hepatol. 13:691. doi: 10.1038/nrgastro.2016.165

PubMed Abstract | CrossRef Full Text | Google Scholar

Pannunzio, A., and Coluccia, M. (2018). Cyclooxygenase-1 (cox-1) and cox-1 inhibitors in cancer: a review of oncology and medicinal chemistry literature. Pharmaceuticals 11:101. doi: 10.3390/ph11040101

PubMed Abstract | CrossRef Full Text | Google Scholar

Peñalver Bernabé, B., Cralle, L., and Gilbert, J. A. (2018). Systems biology of the human microbiome. Curr. Opin. Biotechnol. 51, 146–153. doi: 10.1016/j.copbio.2018.01.018

CrossRef Full Text | Google Scholar

Pierce, C. M., Hong, B. Y., Hoehn, H. J., Gomez, M. F., Melas, M., McDonnell, K., et al. (2018). Microbes in the tumor microenvironment: Bacterial influences on host immunity in colorectal cancer [abstract]. Cancer Res. 78(13 Suppl):Abstract nr 4746. doi: 10.1158/1538-7445.AM2018-4746

CrossRef Full Text | Google Scholar

Plyasova, A. A., Pokrovskaya, M. V., Lisitsyna, O. M., Pokrovsky, V. S., Alexandrova, S. S., Hilal, A., et al. (2020). Penetration into cancer cells via clathrin-dependent mechanism allows l-asparaginase from rhodospirillum rubrum to inhibit telomerase. Pharmaceuticals 13:286. doi: 10.3390/ph13100286

PubMed Abstract | CrossRef Full Text | Google Scholar

Poore, G. D., Kopylova, E., Zhu, Q., Carpenter, C., Fraraccio, S., Wandro, S., et al. (2020). Microbiome analyses of blood and tissues suggest cancer diagnostic approach. Nature 579, 567–574. doi: 10.1038/s41586-020-2095-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Qiu, J., Li, N., Lu, Z., Yang, Y., Ma, Y., Niu, L., et al. (2016). Conversion of nornicotine to 6-hydroxy-nornicotine and 6-hydroxy-myosmine by Shinella sp. strain HZN7. Appl. Microbiol. Biotechnol. 100, 10019–10029. doi: 10.1007/s00253-016-7805-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Queiroz, E. A., Fortes, Z. B., da Cunha, M. A., Sarilmiser, H. K., Dekker, A. M. B., Öner, E. T., et al. (2017). Levan promotes antiproliferative and pro-apoptotic effects in MCF-7 breast cancer cells mediated by oxidative stress. Int. J. Biol. Macromol. 102, 565–570. doi: 10.1016/j.ijbiomac.2017.04.035

PubMed Abstract | CrossRef Full Text | Google Scholar

Rajilic-Stojanovic, M., Figueiredo, C., Smet, A., Hansen, R., Kupcinskas, J., Rokkas, T., et al. (2020). Systematic review: gastric microbiota in health and disease. Aliment. Pharmacol. Therap. 51, 582–602. doi: 10.1111/apt.15650

PubMed Abstract | CrossRef Full Text | Google Scholar

Rangseekaew, P., and Pathom-Aree, W. (2019). Cave actinobacteria as producers of bioactive metabolites. Front. Microbiol. 10:387. doi: 10.3389/fmicb.2019.00387

PubMed Abstract | CrossRef Full Text | Google Scholar

Raskov, H., Søby, J. H., Troelsen, J., Bojesen, R. D., and Ggenur, I. (2020). Driver gene mutations and epigenetics in colorectal cancer. Ann. Surg. 271, 75–85. doi: 10.1097/SLA.0000000000003393

PubMed Abstract | CrossRef Full Text | Google Scholar

Sano, H., Kawahito, Y., Wilder, R. L., Hashiramoto, A., Mukai, S., Asai, K., et al. (1995). Expression of cyclooxygenase-1 and-2 in human colorectal cancer. Cancer Res. 55, 3785–3789. doi: 10.1016/0928-4680(94)90594-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Shannon, P. (2003). Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi: 10.1101/gr.1239303

PubMed Abstract | CrossRef Full Text | Google Scholar

Simin, J., Fornes, R., Liu, Q., Olsen, R. S., Callens, S., Engstrand, L., et al. (2020). Antibiotic use and risk of colorectal cancer: a systematic review and dose-response meta-analysis. Br. J. Cancer. 1825–1832. doi: 10.1038/s41416-020-01082-2

PubMed Abstract | CrossRef Full Text | Google Scholar

Sinicrope, F. A., and Gill, S. (2004). Role of cyclooxygenase-2 in colorectal cancer. Cancer Metast. Rev. 23, 63–75. doi: 10.1023/A:1025863029529

CrossRef Full Text | Google Scholar

Sturzoiu, C., Petrescu, M., Galateanu, B., Anton, M., Nica, C., Simionca, G. I., et al. (2011). Zymomonas mobilis levan is involved in metalloproteinases activation in healing of wounded and burned tissues. Sci. Pap. Anim. Sci. Biotechnol. 44, 453–458.

Google Scholar

Suri, A., BAnSAl, S. K., Ammalli, P., and Karunanand, B. (2019). Role of microbiota in aetiopathogenesis of colorectal cancer. J. Clin. Diagnost. Res. 13, 1–5. doi: 10.7860/JCDR/2019/42445.13169

CrossRef Full Text | Google Scholar

Tan, P. J., Lau, B. F., Krishnasamy, G., Ng, M. F., Husin, L. S., Ruslan, N., et al. (2018). Zebrafish embryonic development-interfering macrolides from streptomyces californicus impact growth and mitochondrial function in human colorectal cancer cells. Process Biochem. 74, 164–174. doi: 10.1016/j.procbio.2018.07.007

CrossRef Full Text | Google Scholar

Wang, J., Li, X., Wu, X., Wang, Z., Zhang, C., Cao, G., et al. (2020). Uncovering the microbiota in renal cell carcinoma tissue using 16s rRNA gene sequencing. J. Cancer Res. Clin. Oncol. 481–491. doi: 10.1007/s00432-020-03462-w

PubMed Abstract | CrossRef Full Text | Google Scholar

Wood, D. E., and Salzberg, S. L. (2014). Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol. 15:R46. doi: 10.1186/gb-2014-15-3-r46

CrossRef Full Text | Google Scholar

Xu, A. A., Hoffman, K., Gurwara, S., White, D. L., Kanwal, F., El-Serag, H. B., et al. (2020). Oral health and the altered colonic mucosa-associated gut microbiota. Digest. Dis. Sci. doi: 10.1007/s10620-020-06612-9

CrossRef Full Text | Google Scholar

Yang, I., Woltemate, S., Piazuelo, M. B., Bravo, L. E., Yepez, M. C., Romero-Gallo, J., et al. (2016). Different gastric microbiota compositions in two human populations with high and low gastric cancer risk in colombia. Sci. Rep. 6:18594. doi: 10.1038/srep18594

CrossRef Full Text

Yang, Q., Wang, Y., Jia, A., Wang, Y., Bi, Y., and Liu, G. (2020). The crosstalk between gut bacteria and host immunity in intestinal inflammation. J. Cell. Physiol. 1–16. doi: 10.1002/jcp.30024

PubMed Abstract | CrossRef Full Text | Google Scholar

Yu, M. R., Kim, H. J., and Park, H. R. (2020). Fusobacterium nucleatum accelerates the progression of colitis-associated colorectal cancer by promoting EMT. Cancers 12:2728. doi: 10.3390/cancers12102728

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhdanov, D. D., Pokrovsky, V. S., Pokrovskaya, M. V., Alexandrova, S. S., Eldarov, M. A., Grishin, D. V., et al. (2017a). Inhibition of telomerase activity and induction of apoptosis by rhodospirillum rubrum l-asparaginase in cancer jurkat cell line and normal human CD4+ t lymphocytes. Cancer Med. 6, 2697–2712. doi: 10.1002/cam4.1218

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhdanov, D. D., Pokrovsky, V. S., Pokrovskaya, M. V., Alexandrova, S. S., Eldarov, M. A., Grishin, D. V., et al. (2017b). Rhodospirillum rubrum l-asparaginase targets tumor growth by a dual mechanism involving telomerase inhibition. Biochem. Biophys. Res. Commun. 492, 282–288. doi: 10.1016/j.bbrc.2017.08.078

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhuang, X., Tian, Z., Li, L., Zeng, Z., Chen, M., and Xiong, L. (2018). Fecal microbiota alterations associated with diarrhea-predominant irritable bowel syndrome. Front. Microbiol. 9:1600. doi: 10.3389/fmicb.2018.01600

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: colorectal cancer, microbiome, tumor progression, probabilistic multilayer networks, information theory

Citation: Uriarte-Navarrete I, Hernández-Lemus E and de Anda-Jáuregui G (2021) Gene-Microbiome Co-expression Networks in Colon Cancer. Front. Genet. 12:617505. doi: 10.3389/fgene.2021.617505

Received: 14 October 2020; Accepted: 22 January 2021;
Published: 15 February 2021.

Edited by:

Maud Fagny, UMR7206 Eco Anthropologie et Ethnobiologie (EAE), France

Reviewed by:

Joseph Nathaniel Paulson, Dana-Farber Cancer Institute, United States
Xiaowei Zhan, University of Texas Southwestern Medical Center, United States

Copyright © 2021 Uriarte-Navarrete, Hernández-Lemus and de Anda-Jáuregui. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Enrique Hernández-Lemus, ZWhlcm5hbmRlekBpbm1lZ2VuLmdvYi5teA==; Guillermo de Anda-Jáuregui, Z2RlYW5kYUBpbm1lZ2VuLmVkdS5teA==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.