Inferring causal gene regulatory network via GreyNet: From dynamic grey association to causation

Chen, Guangyi; Liu, Zhi-Ping

doi:10.3389/fbioe.2022.954610

ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol., 27 September 2022

Sec. Bioprocess Engineering

Volume 10 - 2022 | https://doi.org/10.3389/fbioe.2022.954610

Inferring causal gene regulatory network via GreyNet: From dynamic grey association to causation

Guangyi Chen¹

Zhi-Ping Liu^1,2*

¹Department of Biomedical Engineering, School of Control Science and Engineering, Shandong University, Jinan, Shandong, China
²Center for Intelligent Medicine, Shandong University, Jinan, Shandong, China

Gene regulatory network (GRN) provides abundant information on gene interactions, which contributes to demonstrating pathology, predicting clinical outcomes, and identifying drug targets. Existing high-throughput experiments provide rich time-series gene expression data to reconstruct the GRN to further gain insights into the mechanism of organisms responding to external stimuli. Numerous machine-learning methods have been proposed to infer gene regulatory networks. Nevertheless, machine learning, especially deep learning, is generally a “black box,” which lacks interpretability. The causality has not been well recognized in GRN inference procedures. In this article, we introduce grey theory integrated with the adaptive sliding window technique to flexibly capture instant gene–gene interactions in the uncertain regulatory system. Then, we incorporate generalized multivariate Granger causality regression methods to transform the dynamic grey association into causation to generate directional regulatory links. We evaluate our model on the DREAM4 in silico benchmark dataset and real-world hepatocellular carcinoma (HCC) time-series data. We achieved competitive results on the DREAM4 compared with other state-of-the-art algorithms and gained meaningful GRN structure on HCC data respectively.

1 Introduction

The gene regulatory network plays a central role in understanding the mechanisms of gene expression regulation, complex diseases, and cellular heterogeneity (Xiang et al., 2019; Fang et al., 2020; Liao et al., 2022). Compared to the genomes between human Homo sapiens and yeast Saccharomyces cerevisiae, we can conclude that the complexity in life does not result from the number of genes, but the essence and dynamics of the interactions between genes (Huynh-Thu and Sanguinetti, 2019; Freyre-González et al., 2022; Jansen et al., 2022), i.e., gene regulatory network.

Diverse methods have been proposed to infer gene regulatory networks. Recently, the emergence and rise of machine-learning methods to infer gene regulatory networks have dated back to GENIE3 (Vân et al., 2010), which won the DERAM4 (Dialog on Reverse Engineering Assessment and Methods) in silico multifactorial challenge. Then, Jump3 (Huynh-Thu and Sanguinetti, 2015) was proposed to learn the promoter state of the target gene from candidate regulators based on the decision tree. SWING (Finkle et al., 2018) introduced sliding windows to address heterogeneous time delays in the network structure inference. To further improve the inference accuracy, BTNET (Sungjoon et al., 2018) and BiXGBoost (Zheng et al., 2018) transformed the random forest into gradient boosting algorithms. Regularization-based regression introduced different constraints for de novo GRN reconstruction (Phan and Rosemary, 2018; Ghosh Roy et al., 2020; Zhang et al., 2021). BETS applied bootstrap elastic net regression based on Granger causality to infer the GRN (Lu et al., 2021). Recurrent neural network (RNN) was utilized to model gene interactions due to the superior capability of tracking complicated temporal behaviors in the real underlying regulatory system (Cheng et al., 2011; Biswas and Acharyya, 2018). Although machine-learning methods achieved great success, the internal procedures are unknown to us or they are known but hard to be understood by observers (Guidotti et al., 2018).

In this article, we propose an interpretable machine-learning method named GreyNet, i.e., dynamic grey association and regression, to infer the gene regulatory network from time-course gene expression data. We first apply dynamic grey association to model intricated underlying the regulatory system. Different from the static grey association, we assimilate the adaptive sliding window technique to conduct dynamic analysis which can better capture instant interactions over time. The dynamic grey association takes advantage of local temporal information to search for candidate regulators. Then, we embed the Granger causality framework (Arnold et al., 2007; Finkle et al., 2018; Li et al., 2020) based on regression models which can find causal and directional regulatory links. Through this hybrid strategy, GreyNet can better model gene interactions in real scenarios and is easier to be understood in network inference.

2 Materials and methods

In this section, we demonstrate GreyNet to reconstruct the directional graph of GRN G (v, e) from time-series gene expression data. Time-series can effectively disclose the dynamic interactions of genes with time (Haonan et al., 2020). As for the data D{G|G ∈ R^m×n}, v represents the nodes or vertexes and e represents the regulatory links between pairwise vertexes. For the edge e_ij, it represents gene j under the upstream regulation of gene i. The arrangement of the whole gene expression time-series is as follows:

G (i, j) = |\begin{matrix} g_{11} & g_{12} & \dots & g_{1 n} \\ g_{21} & g_{22} & \dots & g_{2 n} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ g_{m 1} & g_{m 2} & \dots & g_{m n} \end{matrix}| (1)

where each column represents an instance and each row represents the value of the selected instance at a specific timestamp. The overall framework of GreyNet is illustrated in Figure 1. We will demonstrate the major steps in the following sections.

FIGURE 1

FIGURE 1. The overview of the GreyNet framework. (A) is the expression matrix of genes. (B,C) is the procedure of dynamic grey association. The window length is automatically adjusted by information entropy (IE). We firstly sample the time points by the sliding window. Then, we input the sampled data into grey relational analysis to get the dynamic grey association coefficient as (D). (E) is the weight matrix generated by regression methods that transform the dynamic grey association to causal directional regulatory link as (F).

2.1 Dynamic grey association

The biological system is very complicated. Although we have revealed important mechanisms of gene regulation, we are still far from being fully clear about it. In the gene regulatory system, genes generally are under the regulation of various types of regulators and most of them are still unknown or unobserved (Ming et al., 2020). With poor data and limited knowledge at present, the GRN inference works on uncertain systems, namely the grey system between black and white. In other words, reconstructing the GRN is with partially known and unknown information, but we want to draw out the valuable GRN structure from observed gene expression data. In this condition, we propose the dynamic grey association to model gene interactions. The dynamic grey association consists of grey relational analysis (Deng, 1989; Sallehuddin et al., 2008; Yuansheng et al., 2019) and the adaptive sliding window (Papadimitriou et al., 2006). Under the circumstance of the relationships between two components are usually variational over time in the biological system, we integrate grey association with the adaptive sliding window which endows the capacity of flexibly tracking instant interactions in time-series data. Therefore, the dynamic grey association is much more interpretable than the “black box” and conforms to current knowledge.

We formally introduce how to obtain the dynamic grey association coefficient here. Firstly, we demonstrate the way to design the adaptive sliding window (Figure 1B). For the gene expression time-series $D {\{G | G \in R^{m \times n}}$ , we take the first-order difference to get the time derivative. We select one gene as a target or reference node y = {g₁, g₂, …, g_n} and the rest of the genes are comparative nodes X_k = {G₁, …, G_k−1, G_k+1, …, G_m}, where G_k is a vector that represents the expression data of gene k.

\nabla y = \{\nabla g_{1}, \nabla g_{2}, \dots, \nabla g_{n}\} (2)

\nabla g_{n} = g_{n} - g_{n - 1} (3)

Then, we normalize the time derivative to transform it into probability by using the softmax function. Information entropy is a good choice to evaluate the amount of information.

p_{i} = s o f t m a x (| \nabla y |) = \frac{e x p (| \nabla g_{i} |)}{\sum_{\nabla g_{i} \in \nabla y} e x p (| \nabla g_{i} |)} (4)

E_{i} = - \sum p_{i} \times l o g_{2} (p_{i}) (5)

Finally, we divide the present information entropy by the previous one to obtain the window coefficient. The preceding window length multiplies the window coefficient to get the current sliding window. The window length will be adjusted by information entropy automatically.

L_{i} = \frac{E_{i}}{E_{i - 1}} \times L_{i - 1} (6)

In every adaptive sliding window, we get the individual grey relational grade. Initially, the target node subtracts the comparative nodes’ corresponding elements to get the absolute value of the first-order norm residual ∇.

\nabla = | y (i) - x_{k} (i) | (7)

From the maximum and minimum of the residual ∇, the association coefficient r_i is given by:

ξ_{k} (i) = \frac{\min_{k} \min_{i} \nabla + ρ * \max_{k} \max_{i} \nabla}{\nabla + ρ * \max_{k} \max_{i} \nabla} (8)

r_{i} = \frac{1}{L} \sum_{i = 1}^{L} ξ_{k} (i) (9)

where L is the length of the adaptive sliding window. ρ is the distinguished coefficient which is positively related to distinguishing the difference (Kuo et al., 2008). Finally, we average all the sliding windows to get the dynamic grey association score DGA:

D G A = \frac{1}{n} \sum_{i = 1}^{n} r_{i} (10)

DGA determines the dominant factors between the multivariable and target gene based on the geometric curve. The higher the value of DGA, the higher the association between the two variables.

2.2 Causation

GRN represents directional causal regulatory relationships among genes (Leng et al., 2019). However, the dynamic grey association cannot depict causality for pairwise genes. In this case, we incorporate the Granger causality framework to turn the dynamic grey association into causation. The Granger causality is intuitive and defined that the past values of the cause make a larger contribution to predicting the future values of the effect than auto-regression. However, it is time-consuming and ignores the possible interactions between features. Inspired by LASSO Granger (Arnold et al., 2007), we apply multivariate regression strategies to identify the subset of features on which the feature is conditionally dependent, namely, we formulate it to a problem of feature selection, given the fact that the best estimator for the target variable is the one with the least error or the maximum gain (Arnold et al., 2007; Li et al., 2020).

In this study, we mainly introduce four regressors to infer GRN. The four regressors we selected are: bagging algorithm random forest (RF) (Friedman, 2001), gradient boosting algorithm XGBoost (Chen and Guestrin, 2016), L₁-penalty LASSO (Tibshirani, 1996), and L₂-penalty Ridge (Hoerl and Kennard, 1970). We regress target variable y_t in terms of X^lag. For different regressors, the objective function of the regressor and optimizing strategy are different. With regards to RF, it is similar to GENIE3 (Vân et al., 2010). The goal of the tree-based regressor is to build a fine decision tree structure in terms of splitting nodes. The criterion for measuring the quality of splitting is the mean squared error, which is equal to variance reduction. The evaluation of the objective function for the target is as:

I (N) = # S V a r (N) - # S_{p t} Var (S_{p t}) - # S_{p f} Var (S_{p f}) (11)

where # is the number of the samples; S is the sample sets that reach the node N is a single tree; S_pt is the sample sets predicted true; and S_pf is the sample sets that predicted false. Nevertheless, the criterion of Xgboost regression is modified by:

\begin{align} L_{split} = \frac{1}{2} [\frac{G_{left}^{2}}{H_{left} + λ} + \frac{G_{right}^{2}}{H_{right} + λ} - \frac{G^{2}}{H + λ}] - γ \\ G_{left} = \sum_{i \in I_{left}} g_{i} H_{left} = \sum_{j \in I_{left}} h_{j} \\ I = I_{left} \cup I_{right} \end{align} (12)

where g_i and h_i are the first-order and second-order gradient statistics. λ and γ are both complexity parameters. I_left and I_right are the sample sets after splitting. Regularization-based regression introduces different regularization terms to prevent overfitting and get optimal reconstruction. The objective function of regularized regression is:

O b j = \frac{1}{m} \sum_{i = 1}^{m} {(y_{i} - X^{lag} w)}^{2} + λ \sum_{j = 1}^{n} ‖ w_{j} ‖^{p} (13)

where ‖w_j‖^p is the regularization term. λ is the regularization coefficient. When p equals one or two, it represents the LASSO regression or Ridge regression.

Based on the aforementioned regression criteria, we can obtain weight matrix w from w^y = regressor (y, X^lag) (as shown in Figure 1E). To confirm the causal regulatory direction, we summarize LASSO Granger as an example that other regression methods are similar to it. If x regulates y, x^t ∈ w^y for some t but y_t∉w^x; if y regulates x, y^t ∈ w^x for some t but x_t∉w^y; and if x and y regulate each other, then x^t ∈ w^y and y^t ∈ w^x.

3 Results

3.1 Gene expression data

GreyNet focuses on time-course bulk gene expression data. Our method is not suitable for single-cell RNA-seq data due to dropout events. In the past decade, the DREAM challenge has been the standard benchmark dataset to evaluate the quality of the reconstructing algorithm (Marbach et al., 2009; Marbach et al., 2010). Therefore, we firstly validate GreyNet on the DREAM4 time-series dataset which contains two sizes of networks, size10 and size100, and each network includes five subnetworks. To further test the performance of our model, we evaluate it on a real-world hepatocellular carcinoma dataset (Yang et al., 2019a). HCC expression profiles are detected from 105 samples represented stepwise from pre-neoplastic lesions to HCC. 105 samples cover the nine development stages of HCC. To preprocess HCC data from NCBI GEO, we map probeset IDs to NCBI official gene symbols through the GEO annotation file. If one gene has multiple probeset mappings, the probeset with the maximum inter quartile expression range (IQR) is selected (Liu et al., 2014). Finally, we employ prior knowledge to separate the TF expression and target expression (Lambert, 2018) to conduct transcriptomic GRN inference by GreyNet. The detailed descriptions of the DREAM4 and HCC time-series expression data are shown in Table 1. The development stages of HCC from normal to hepatocellular carcinoma are shown in Table 2.

TABLE 1

TABLE 1. The dscription of the datasets used in the experiments.

TABLE 2

TABLE 2. The development stages of HCC.

3.2 Evaluation metrics

We mainly use two common metrics, AUROC and AUPRC, to evaluate our model. AUROC is calculated from the ROC curve, showing the trade-off between true positive rate (TPR) and false positive rate (FPR) across different thresholds. AUPRC is just the area under the PR curve, where the x-axis is Precision and the y-axis is Recall. The other measurement metrics, such as Precision, Matthews correlation coefficient (MCC), and Accuracy, are shown in Supplementary Table S1 (Tng et al., 2021; Le and Ho, 2022).

T P R = R e c a l l = \frac{T P}{T P + F N} (14)

F P R = \frac{F P}{F P + T N} (15)

P r e c i s i o n = \frac{T P}{T P + F P} (16)

3.3 Performance on DREAM4 data

In the DREAM4 challenge, we implement four different regression strategies on the DREAM4 dataset combined with the dynamic grey association. We select one typical model from each bagging and gradient boosting algorithm, i.e., random forest (RF) and Xgboost. Analogously, we also incorporate two different regularized regression methods L₁-norm LASSO and L₂-norm Ridge. For each regression method, we run them 100 times to reduce randomness. To further validate the capacity of the grey technique, we compare the four different regression methods with and without the dynamic grey association. The performances of bagging and gradient boosting comparisons are shown in Figure 2. As shown, GreyNet-RF and GreyNet-Xgboost are significantly superior to the corresponding regressors without the dynamic grey association in both AUROC [P_RF − value = 3.93–18; P_Xgboost − value = 6.57e − 18, Wilcoxon test] and AUPRC [P_RF − value = 3.96e − 18; P_Xgboost − value = 3.96e − 18]. In the 100 trials, two regularized regression methods have little changes in AUROC and AUPRC. The results of the two regularized regression comparisons are shown in Figure 3. From Figure 3, we can see that GreyNet-LASSO and GreyNet-Ridge significantly outperform LASSO and Ridge regressors with respect to both AUROC [P_LASSO − value = 1.86e − 18; P_Ridge − value = 1.55e − 23] and AUPRC [P_LASSO − value = 2.10e − 18; P_Ridge − value = 1.55e − 23]. Therefore, the dynamic grey association is effective and efficient to improve the GRN structure inference.

FIGURE 2

FIGURE 2. The comparison of RF, GreyNet-RF, Xgboost, and GreyNet-Xgboost on DREAM4 insilico datasets. (A) is the results of AUROC on DREAM4 size10 networks. (B) is the results of AUPRC on DREAM4 size10 networks. (C) is the results of AUROC on DREAM4 size100 networks. (D) is the results of AUPRC on DREAM4 size100 networks.

FIGURE 3

FIGURE 3. The comparison of LASSO, GreyNet-LASSO, Ridge, and GreyNet-Ridge on DREAM4 in in silico datasets. (A) is the results of AUROC on size10 networks. (B) is the results of AUPRC on size10 networks. (C) is the results of AUROC on size100 networks. (D) is the results of AUPRC on size100 networks.

We further compare our model with other seven state-of-the-art methods, including GENIE3-lag (Huynh-Thu, 2011), Jump3 (Huynh-Thu and Sanguinetti, 2015), SWING (Finkle et al., 2018), BTNET (Sungjoon et al., 2018), BiXGBoost (Zheng et al., 2018), BETS (Lu et al., 2021), and TIGRESS(Anne-Claire et al., 2012). In the five subnetworks, the average AUROC and AUPRC of GreyNet achieve 0.854 ± 0.032, and 0.622 ± 0.108 in size10 and 0.768 ± 0.036, 0.222 ± 0.039 in size100, respectively. The detailed results of AUROC and AUPRC in each subnetwork are shown in Table 3. The ROC and PR curves of GreyNet are shown in Supplementary Figure S1. From Table 3, we can see that our model achieves the highest AUROC in networks 2, 3, and 4 of size10 among the comparing methods. Other than network 1, our model gets the highest AUPRC. In DREAM4 in-silico size100, GreyNet achieves all of the highest AUROC and AUPRC in the five networks other than AUROC in network 4. The complete performances of GreyNet with four different regression strategies are shown in Supplementary Tables S2, S3.

TABLE 3

TABLE 3. The comparative results of models on DREAM4 data.

3.4 Performance on hepatocellular carcinoma data

Hepatocellular carcinoma (HCC) accounts for $> 90 %$ of liver cancers with a five-year survival of only 18 % and the fourth leading cause of cancer-related deaths (Yang et al., 2019b; Villanueva, 2019). It is estimated that more than one million individuals will be affected by liver cancer annually by 2025 and the World Health Organization (WHO) predicted that the mortality of liver cancer will also arrive at one million in 2030 (Yang et al., 2019b; Villanueva, 2019; Llovet et al., 2021). It is imperative to search for important biomarkers of molecular and immune classes to guide therapy. A GRN of HCC will significantly benefit this kind of search.

In this article, we apply GreyNet to the HCC time-course expression data. We select the top 500 regulatory links in HCC GRN by the score of the inference. The inferred HCC GRN is shown in Figure 4. From Figure 4, we can see that, TP53, PIK3CA, AXIN1, MET, APC, CTNNB1, and TERT (all aforementioned genes are protein-coding genes) are elite genes highly related to HCC. TP53, TERT (promoter), and CTNNB1 are dominant mutational driver cancer genes, which account for 21–31 %, 44–65 %, and 27–40 % of patients with HCC (Yang et al., 2019b; Llovet et al., 2021). In terms of TF genes, ARID2 is correlated with the initiation and progression of HCC(Schulze et al., 2015); NFE2L2 is involved in hepatocarcinogenesis and progression (Nault et al., 2014; Niu et al., 2016); HNF1A is related to promoting genetic liver adenomatosis occurrence and possibly further malignant transformation to HCC(Zucman-Rossi et al., 2015).

FIGURE 4

FIGURE 4. The HCC GRN reconstructed by GreyNet. In the network, the larger blue hexagon nodes represent TFs. The circle orange nodes represent the target genes. The diamond green nodes represent some elite disease genes in HCC.

Then, we enrich the HCC GRN by NOA (Wang et al., 2011). The result of the enriched Gene Ontology (GO) biological process and documented pathways in KEGG are shown in Table 4. From Table 4, we can see that the deregulations of the multiple signal pathways in HCC affect cell proliferation, RNA, nucleobase, nucleoside, nucleotide, nucleic acid metabolic process, and liver development. The ‘Wnt signaling pathway’ has a significant impact on cancer development and cancer mechanism evolution (Polakis, 2012; Zhan et al., 2017). The dysregulation of Wnt−/−β-catenin (a key component of the Wnt pathway) brings out the aberrant activation of signaling in HCC (Waisberg and Saba, 2015). Activated β-catenin translocates to the nucleus, interacting with TCF (T cell factor) and LEF (lymphoid enhancer-binding factor), and activates the transcription of the target genes which participate in CSC maintenance and EMT (Zhan et al., 2017; Farzaneh et al., 2021). Ultimately, it will lead to cell proliferation, angiogenesis, and anti-apoptosis. The “JNK pathway” is implicated in multiple cancers, including the regulation of liver tumorigenesis. In the mice model, it is shown that the increased expression of p21 (a cell-cycle inhibitor) can cause impaired proliferation. In human HCC, the activity of JNK can affect liver cell proliferation via p21 and c-Myc (a negative regulator of p21). It is found that the growth of xenografted human HCC cells can be reduced by pharmacologic inhibition of JNK (Hui et al., 2008; Dimri and Satyanarayana, 2020). The ‘receptor tyrosine kinase pathways’ implicate in activating multiple downstream signals, including the epidermal growth factor (EGF) receptor, the fibroblast growth factor (FGF) receptor, the hepatocyte growth factor (HGF/c-MET), the stem cell growth factor receptor c-KIT, the platelet-derived growth factor (PDGF) receptor, and the vascular endothelial growth factor (VEGF) receptor (Dimri and Satyanarayana, 2020). The “transforming growth factor-beta” (TGF-β) involves multiple stages of HCC development from liver injury toward fibrosis, cirrhosis, and cancer. In hepatocarcinogenesis, TGF-β performs as a suppressor factor in the early stages. However, TGF-β contributes to tumor progression latterly (Fabregat and Caballero-Díaz, 2018). The consistency between these enriched functions and the prior knowledge about HCC implies the effectiveness of GreyNet.

TABLE 4

TABLE 4. The enrichment of GO biological process and KEGG pathway in HCC GRN.

4 Discussion

Limited by current technology and knowledge, the underlying gene regulatory mechanisms in cells are not very clear to us. It is reasonable to assume that gene interactions behave as a grey system. Moreover, the similarity and association of the two components are variational, evolving the time in the biological systems. It is less useful to assign a single static score fraction to two variables over an entire time-series. In this condition, we turned the static coefficient to the dynamic grey association by incorporating the adaptive sliding window technique to capture the dynamic evolution, which is much more aligned with real and known gene regulations.

The dynamic grey association is not enough to mine the causality in GRN. Thus, we further introduced both linear and non-linear regression methods to search for causal links by temporal information. Causal relationships between the variables can disclose the origin of the outcome and contribute to decision making. The Granger causal regression model is easy to be understood and explained.

Reconstructing the causal gene regulatory network is a preliminary step to finding out the internal mechanism of the biological procedure and facilitating our understanding of the basic pathology of tumors and other diseases (Lesage et al., 2018; Femerling et al., 2022). However, current biological datasets generated by the facilities are usually accompanied by a low rate of signal-to-noise ratio. Simultaneously, GRN inference is an ill-posed problem with sparsity. A purely data-driven model will find it hard to accurately find real and key regulatory links (Santra, 2014). Fortunately, many databases (Liu et al., 2015; Fang et al., 2020; Zhang et al., 2020) have been established to provide adequate regulatory prior knowledge. Fusing prior knowledge in the model may be an anticipated solution to improve the quality of the GRN topology (Ideker et al., 2011; Abdelzaher et al., 2015; Dandan et al., 2020). It is expected to investigate their effects on GreyNet in the future.

5 Conclusion

In this article, we proposed an interpretable machine-learning framework to infer the gene regulatory network from time-series expression data. We applied grey theory with the adaptive sliding window technique to model internal interactions in real regulatory procedures in the condition of limited information and knowledge. We further incorporated the Granger causality framework to search for causal regulations between genes. In DREAM4 in silico datasets, our model got competitive performances on AUROC and AUPRC compared with other state-of-the-art models. In the real HCC dataset, GreyNet can find meaningful pathways in HCC development from the functional enrichment results of HCC GRN. In the future, we will provide an update to make our model applicable to single-cell RNA-seq data.

Data availability statement

The original contributions presented in the study are publicly available. The data and code can be found here: https://github.com/zpliulab/GreyNet.

Author contributions

GY performed the experiments, analyzed the data, and wrote the manuscript. Z-PL conceived and designed the experiments, and wrote the manuscript. Both authors read and approved the final manuscript.

Funding

National Key Research and Development Program of China under Grant Number 2020YFA0712402; National Natural Science Foundation of China (NSFC) under Grant Number 61973190; Shandong Provincial Key Research and Development Program (Major Scientific and Technological Innovation Project, 2019JZZY010423); Natural Science Foundation of Shandong Province of China under Grant Number ZR2020ZD25; the Innovation Method Fund of China (Ministry of Science and Technology of China) under Grant Number 2018IM020200; the Fundamental Research Funds for the Central Universities (No. 2022JC008); the Program of Qilu Young Scholars of Shandong University. Publication costs are funded by NSFC. The funding bodies had no role in the design of the study, collection, the interpretation of data and in writing the manuscript.

Acknowledgments

Thanks are due to the members of our lab in Shandong University for their assistance in this work.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2022.954610/full#supplementary-material

References

Abdelzaher, A. F., Al-Musawi, A. F., Ghosh, P., Mayo, M. L., and Perkins, E. J. (2015). Transcriptional network growing models using motif-based preferential attachment. Front. Bioeng. Biotechnol. 3, 157. doi:10.3389/fbioe.2015.00157

PubMed Abstract | CrossRef Full Text | Google Scholar

Anne-Claire, H., Fantine, M., Paola, V.-L., and Jean-Philippe, V. (2012). Tigress: Trustful inference of gene REgulation using stability selection. BMC Syst. Biol. 6, 145. doi:10.1186/1752-0509-6-145

PubMed Abstract | CrossRef Full Text | Google Scholar

Arnold, A., Liu, Y., and Abe, N. (2007). “Temporal causal modeling with graphical granger methods,” in Proceedings of the 13th ACM SIGKDD international conference on knowledge discovery and data mining (New York, NY, USA: Association for Computing Machinery), 66–75. KDD ’07. doi:10.1145/1281192.1281203