Identification of Boolean Network Models From Time Series Data Incorporating Prior Knowledge

Leifeld, Thomas; Zhang, Zhihua; Zhang, Ping

doi:10.3389/fphys.2018.00695

METHODS article

Front. Physiol., 08 June 2018

Sec. Systems Biology Archive

Volume 9 - 2018 | https://doi.org/10.3389/fphys.2018.00695

This article is part of the Research TopicLogical Modeling of Cellular Processes: From Software Development to Network DynamicsView all 23 articles

Identification of Boolean Network Models From Time Series Data Incorporating Prior Knowledge

Thomas Leifeld

Zhihua Zhang

Ping Zhang^*

Institute of Automatic Control, Technische Universität Kaiserslautern, Kaiserslautern, Germany

Motivation: Mathematical models take an important place in science and engineering. A model can help scientists to explain dynamic behavior of a system and to understand the functionality of system components. Since length of a time series and number of replicates is limited by the cost of experiments, Boolean networks as a structurally simple and parameter-free logical model for gene regulatory networks have attracted interests of many scientists. In order to fit into the biological contexts and to lower the data requirements, biological prior knowledge is taken into consideration during the inference procedure. In the literature, the existing identification approaches can only deal with a subset of possible types of prior knowledge.

Results: We propose a new approach to identify Boolean networks from time series data incorporating prior knowledge, such as partial network structure, canalizing property, positive and negative unateness. Using vector form of Boolean variables and applying a generalized matrix multiplication called the semi-tensor product (STP), each Boolean function can be equivalently converted into a matrix expression. Based on this, the identification problem is reformulated as an integer linear programming problem to reveal the system matrix of Boolean model in a computationally efficient way, whose dynamics are consistent with the important dynamics captured in the data. By using prior knowledge the number of candidate functions can be reduced during the inference. Hence, identification incorporating prior knowledge is especially suitable for the case of small size time series data and data without sufficient stimuli. The proposed approach is illustrated with the help of a biological model of the network of oxidative stress response.

Conclusions: The combination of efficient reformulation of the identification problem with the possibility to incorporate various types of prior knowledge enables the application of computational model inference to systems with limited amount of time series data. The general applicability of this methodological approach makes it suitable for a variety of biological systems and of general interest for biological and medical research.

1. Introduction

Boolean networks (BNs) are discrete-time systems, whose variables can take only two possible values (i.e., 0 and 1). Since Stuart Kaufman firstly introduced BNs in Kauffman (1969) for qualitative description of gene regulatory interactions, BNs have attracted great attention from many scientists and several results have been proposed, for instance, analysis (Albert and Barabási, 2000) and control (Fauré et al., 2006). An overview can be found in Wang et al. (2012) and a database for established models and compatible tools has been introduced (Naldi et al., 2015).

Mathematical models are important to explain dynamic behavior of a system and to understand the functionality of system components (Grieb et al., 2015) and can help scientists to design model-based targeted therapy and diagnosis (Fumia and Martins, 2013). Hence, the inference of models capturing the relevant behavior of the system is an important topic. The inference can be based on the connection of known biochemical reactions, like BN model for the yeast cell cycle in Davidich and Bornholdt (2008), or on experimental data, if the latter is the case it is also called the identification problem. One of the first approaches to identify a BN was REVEAL which is based on mutual information (Liang et al., 1998). In Akutsu et al. (1999) a similar but less complex approach is presented. Both cannot handle errors in the dataset which was solved in Lähdesmäki et al. (2003). The modeled quantities are not Boolean in the experimental data and need to be binarized first. For the binarization several approaches can be found in the literature ranging from mixture model based clustering (Zhou et al., 2003) to more complex methods where the significance of a jump in the time series is estimated in Hopfensitz et al. (2012). A comparison of some identification and binarization approaches and their combinations can be found in Berestovsky and Nakhleh (2013). Most identification approaches are based on previously binarized data, but there also exist approaches directly based on continuous data (e.g., Karlebach and Shamir, 2012). In Higa et al. (2011) the data is considered as given constraint and the set of systems fulfilling the constraints is searched. This approach was then further improved by reducing the sensitivity to noise in Ouyang et al. (2014). An example of recent research is the identification of Boolean models for transient dynamics after perturbations from time course data with answer set programming (Ostrowski et al., 2016). A BN can simply be extended to a Boolean control network (BCN) by considering manipulated external stimuli as control signal of the network. Recently, a powerful tool called semi-tensor product (STP) of matrices has been proposed in Cheng (2001), which can convert the dynamics of BCNs into a model where all information of the dynamics and the structure of the BCN is contained in two matrices (Cheng et al., 2011a). Using the STP based matrix description of BCN several approaches for identifying BCN have been proposed (Cheng and Zhao, 2011; Fornasini and Valcher, 2014; Zhang et al., 2017a).

However, in general, in order to identify the dynamical model of a BCN from its input and output data, a huge number of data is required (Cheng and Zhao, 2011; Cheng et al., 2011b). Though, in practice, data size is limited by the cost of experiments (Geier et al., 2007). In order to reduce the search space and improve the accuracy of the model, the benefit of biological prior knowledge should be taken into consideration. The prior knowledge is used in different ways either by introducing additional constraints in the optimization (Breindl et al., 2013), or reducing the number of parameters in the optimization (Cheng and Zhao, 2011). In Dorier et al. (2016) and Terfve et al. (2012) genetic algorithms are used to handle the complexity problem of large networks while satisfying prior knowledge network graphs as constraints. However, these approaches to handle prior knowledge are not compatible and the advantages of different types of prior knowledge can not be combined. In the approach proposed in this paper, all different types of prior knowledge can be utilized simultaneously and it can additionally handle hypotheses for interactions, which could be used for researcher bias free distinction between alternative hypotheses. Furthermore existing approaches can not handle the case that at some time instances some measurement values are missing, which cannot be avoided in practice due to the limitation of measuring techniques like mass spectrometry-based proteomics.

In this paper, we consider the identification problem of BCNs utilizing biological prior knowledge. A part of the results was presented at the 56th IEEE Conference on Decision and Control in Melbourne (Zhang et al., 2017b). However, the BCN model considered in Zhang et al. (2017b) contains a general output equation. By applying prediction error method (PEM), a high-dimensional BCN (i.e., 2ⁿ × 2^n+m) cannot be avoided. Different from that, although the handling of unmeasurable processes is considered in this paper, the proposed approach leads to a low-dimensional matrix for PEM. Besides, more prior biological knowledge is considered in the paper, like potential interactions, known attractors and limit cycles. Moreover, it is discussed how to deal with alternative hypotheses for interactions and missing measurement points. The main contributions of this paper are as follows:

• A suitable way to handle the prior knowledge such as known network graph, hypotheses for interactions, canalizing and unateness properties or attractor is introduced. For this purpose the BCN is described by two matrices with unknown parameters as entries. If possible, some parameters are inferred directly. Otherwise, relationships between the parameters are set up.

• An approach to deal with the identification of BCNs, in particular, from noisy measurements and missing data points is proposed. The identification problem of BCNs is formulated as a nonlinear pseudo-Boolean optimization, which can be equivalently transformed into a linear binary optimization problem and then solved efficiently.

The remainder of the paper is organized as follows. Section 2 introduces some fundamental definitions and notations. In Section 3, the identification problem of BCNs addressed in this paper will be formulated. Section 4 introduces a way to utilize prior knowledge in identification procedure. The formulation of identification problem of BCNs as an integer linear programming problem is derived and an example is given in Section 5 to illustrate the approach. Finally, a short discussion on the advantages and limitations of the proposed approach is given in Section 6.

2. Preliminaries

In this part, we list some necessary notations, which will be used in the subsequent sections.

1. ¬, ∧ and ∨ denote the logical negation (not), conjunction (and) and disjunction (or), respectively.

2. $D : = {1, 0}$ and $D^{n} = \underset{n}{\underset{︸}{D \times D \times \dots \times D}}$ .

3. $Δ_{n} : = {δ_{n}^{k} | 1 \leq k \leq n}$ , where $δ_{n}^{k}$ denotes the k-th column of the identity matrix I_n.

4. For a vector v ∈ ℝ^m, its j-th entry is denoted by [v]_j, j = 1, 2, ⋯, m.

5. An n × t matrix L is called a logical matrix, if $L = [δ_{n}^{i_{1}} δ_{n}^{i_{2}} \dots δ_{n}^{i_{t}}]$ , where i₁, i₂, ⋯, i_t ∈ {1, 2, ⋯, n}, and we express L briefly as L = δ_n[i₁i₂ ⋯ i_t]. Denote the set of n × t logical matrices by $L_{n \times t}$ . Col_i(M) denotes the i-th column of the matrix M.

6. $0_{n} : = {[\underset{n}{\underset{︸}{0 0 \dots 0}}]}^{T}$ , where the superscript T denotes the transpose.

The concept of the semi-tensor product of matrices (STP) has been introduced by Cheng et al. (2011a). The STP of two matrices A ∈ ℝ^{m × n} and B ∈ ℝ^{p × q} is defined as

\begin{array}{rcl} A ⋉ B = (A \otimes I_{l / n}) \cdot (B \otimes I_{l / p}) & (1) \end{array}

where ⊗ is the Kronecker product and l = lcm{n, p} is the least common multiple of n and p. The following property of the STP will be used in the subsequent sections.

Lemma 1. Let X ∈ ℝ^{m × 1} and Y ∈ ℝ^{n × 1}. Then Y⋉X = W_{[m, n]} ⋉ X⋉Y, where W_{[m, n]} is the swap matrix (Cheng et al., 2011a).

So the order of two vectors which are multiplied can be altered by multiplying a suitable matrix from the left, this is also called the pseudo-commutativity of the STP. In the following parts the symbol ⋉ will be omitted.

3. Problem Formulation

System identification is the determination of a model describing the dynamic behavior of a system based on measured data and known perturbations. In the context of Boolean modeling it is assumed that the transient behavior of the system can be qualitatively described by a finite number of Boolean states and that the interaction of these states can be described by Boolean functions. The perturbations are inputs to the system and cause transient behavior of the interacting states in the system. A measured time series of inputs and states form together the data basis for the identification. Depending on the system which is to be modeled, the states might represent the activity of genes or the abundance of proteins and the perturbations could be a stress like heat or oxygen or a chemical substance. In the following the identification process will be formulated as mathematical optimization problem. Therefore the mathematical model of a BCN needs to be defined first. A Boolean control network (BCN) can be described by the following equations (Cheng and Qi, 2010):

\begin{array}{rcl} {\begin{array}{l} X_{1} (t + 1) & = f_{1} (X_{1} (t), \dots, X_{n} (t), U_{1} (t), \dots, U_{m} (t)) \\ ⋮ \\ X_{n} (t + 1) & = f_{n} (X_{1} (t), \dots, X_{n} (t), U_{1} (t), \dots, U_{m} (t)) & (2) \end{array} \end{array}

where $X (t) = {[X_{1} (t) X_{2} (t) \dots X_{n} (t)]}^{T} \in D^{n}$ , $U (t) = {[U_{1} (t) U_{2} (t) \dots U_{m} (t)]}^{T} \in D^{m}$ are, respectively, the state vector, input vector at time t, f_i are logic functions. At the discrete time instances t the state variables are updated synchronously according to the logic functions f_i. As shown in Cheng and Qi (2010), a vector form of Boolean variable X_i, i = 1, 2, ⋯, n can be simply expressed as

\begin{array}{rcl} x_{i} = [\begin{matrix} X_{i} \\ \neg X_{i} \end{matrix}] . & (3) \end{array}

Let $x = ⋉_{i = 1}^{n} x_{i} \in Δ_{2^{n}}, u = ⋉_{i = 1}^{m} u_{i} \in Δ_{2^{m}}$ . According to Cheng and Qi (2010), (2) can be equivalently represented in a vector form:

\begin{array}{rcl} {\begin{array}{l} x_{1} (t + 1) = S_{1} u (t) x (t) \\ ⋮ \\ x_{n} (t + 1) = S_{n} u (t) x (t) \end{array}, & (4) \end{array}

where $S_{i} \in L_{2 \times 2^{n + m}}, i = 1, 2, \dots, n$ are logical matrices. Multiplying all Equations in (4) together, there is

\begin{array}{rcl} x (t + 1) = L u (t) x (t) & (5) \end{array}

where $L \in L_{2^{n} \times 2^{n + m}}$ is a logical matrix and $C o l_{i} (L) = ⋉_{j = 1}^{n} C o l_{i} (S_{j}), i = 1, 2, \dots, 2^{n + m}$ .

A polynomial $P_{m l} : ℝ^{k} \to ℝ$ with k variables {θ₁, θ₂, ⋯, θ_k} is called multi-linear polynomial, if its degree in each variable is at most 1 (Alon et al., 1991). So, a multi-linear polynomial can be generally expressed as

\begin{array}{rcl} P_{m l} (θ_{1}, θ_{2}, \dots, θ_{k}) = c + \sum_{i = 1}^{k} c_{i} θ_{i} + \sum_{α = 1}^{q} c_{I_{α}} \prod_{j \in I_{α}} θ_{j} & (6) \end{array}

where $c, c_{i}, c_{I_{α}} \in ℝ$ for $I_{α} \subset V = {1, 2, \dots, k}$ and the set $I_{α}$ has a cardinality of at least 2, i.e., $| I_{α} | \geq 2, α = 1, 2, \dots, q$ .

Generally, the identification problem of BCNs can be described as reconstruction of Boolean functions f_i, i = 1, 2, ⋯, n that explain the experimental data as well as possible. Because of equivalent representation of a Boolean function by a logical matrix, the identification problem is reformulated as searching for logical matrices $S_{i} \in L_{2 \times 2^{n + m}}, i = 1, 2, \dots, n$ based on the input and measurement state data.

Note that any logical matrix in $L_{2^{a} \times 2^{b}}$ can be expressed by multi-linear polynomials in a binary parameter vector θ of dimension a · 2^b. For example, any logical matrix in $L_{4 \times 8}$ can be expressed by a binary parameter vector $θ = {[θ_{1} θ_{2} \dots θ_{16}]}^{T}$ as

\begin{array}{l} {[\begin{matrix} θ_{1} \cdot θ_{2} & θ_{1} \cdot (1 - θ_{2}) & (1 - θ_{1}) \cdot θ_{2} & (1 - θ_{1}) \cdot (1 - θ_{2}) \\ θ_{3} \cdot θ_{4} & θ_{3} \cdot (1 - θ_{4}) & (1 - θ_{3}) \cdot θ_{4} & (1 - θ_{3}) \cdot (1 - θ_{4}) \\ θ_{5} \cdot θ_{6} & θ_{5} \cdot (1 - θ_{6}) & (1 - θ_{5}) \cdot θ_{6} & (1 - θ_{5}) \cdot (1 - θ_{6}) \\ θ_{7} \cdot θ_{8} & θ_{7} \cdot (1 - θ_{8}) & (1 - θ_{7}) \cdot θ_{8} & (1 - θ_{7}) \cdot (1 - θ_{8}) \\ θ_{9} \cdot θ_{10} & θ_{9} \cdot (1 - θ_{10}) & (1 - θ_{9}) \cdot θ_{10} & (1 - θ_{9}) \cdot (1 - θ_{10}) \\ θ_{11} \cdot θ_{12} & θ_{11} \cdot (1 - θ_{12}) & (1 - θ_{11}) \cdot θ_{12} & (1 - θ_{11}) \cdot (1 - θ_{12}) \\ θ_{13} \cdot θ_{14} & θ_{13} \cdot (1 - θ_{14}) & (1 - θ_{13}) \cdot θ_{14} & (1 - θ_{13}) \cdot (1 - θ_{14}) \\ θ_{15} \cdot θ_{16} & θ_{15} \cdot (1 - θ_{16}) & (1 - θ_{15}) \cdot θ_{16} & (1 - θ_{15}) \cdot (1 - θ_{16}) \end{matrix}]}^{T} \end{array}

where the superscript T denotes the transpose. In this way, each realization of the binary parameter vector $θ \in D^{a 2^{b}}$ corresponds to a unique logical matrix. It is straightforward to equivalently convert this logical matrix into readable logical equations. Based on this, the objective of the paper is to find a binary parameter vector θ, such that dynamic behavior of the BCN (5) is consistent with the important dynamics captured in the observed input-state data.

4. Incorporation of Prior Knowledge

In this section, we shall show how to utilize known network graph, potential interactions, canalizing and unateness properties and attractors in the identification procedure.

4.1. Known or Potential Interactions

Often some or all interaction partners are known in a biological system which is subject of identification. This knowledge can come from databases or can be constructed based knowledge about the underlying biochemical reactions. In some cases a known signaling network is to be complemented and different hypothesis for potential interactions shall be evaluated. If all interaction partners and the direction of the interactions are known, the underlying directed network graph of the BN is known.

In graph theory, a directed graph can be denoted by $G = {V, E}$ , where $V$ is a finite set of nodes and $E \subset V \times V$ is a finite set of edges (Bollobas, 2012). If $(v_{i}, v_{j}) \in E$ , then there is an edge from v_i → v_j. According to Cheng et al. (2011a), a BCN can be represented by a directed graph, where each gene is considered as a node. If there is an edge from X_i → X_j, then X_j is affected by X_i.

Assume that a directed graph for a BCN $G = {V, E}$ is known. Then we have the following result.

Lemma 2. If the node X_i is affected by w nodes, then 2^w binary parameters are enough to describe the corresponding logical matrix S_i.

Proof. As the node x_i is affected by w nodes, then the Boolean function can be represented in a vector form as

\begin{array}{l} x_{i} (t + 1) = S_{i} x_{i_{1}} (t) x_{i_{2}} (t) \dots x_{i_{w}} (t) \end{array}

where the matrix S_i is a logical matrix of dimension 2 × 2^w. Recall that the logical matrix S_i is a matrix containing only columns belonging to Δ₂ (Cheng et al., 2011a). Hence, 2^w binary parameters are enough for the description of the logical matrix S_i.

An example is given below to express logical matrices of a BCN with a known network graph with the help of binary parameters.

Example 1. Consider a BCN as follows.

\begin{matrix} {\begin{array}{l} X_{1} (t + 1) & = f_{1} (X_{2} (t), U_{1} (t)) \\ X_{2} (t + 1) & = f_{2} (X_{1} (t), U_{2} (t)) \end{array} & (7) \end{matrix}

where the network graph of the BCN is shown in Figure 1 (Cheng and Zhao, 2011). According to Cheng and Qi (2010), the algebraic form of the BCN is obtained,

\begin{matrix} {\begin{array}{l} x_{1} (t + 1) = S_{1} u_{1} (t) x_{2} (t) \\ x_{2} (t + 1) = S_{2} u_{2} (t) x_{1} (t) \end{array} & (8) \end{matrix}

where the logical matrices $S_{1}, S_{2} \in L_{2 \times 4}$ can be expressed by the binary parameter vector $θ = {[θ_{1} θ_{2} \dots θ_{8}]}^{T}$ in the following form:

\begin{array}{l} S_{1} & = [\begin{matrix} θ_{1} & θ_{2} & θ_{3} & θ_{4} \\ 1 - θ_{1} & 1 - θ_{2} & 1 - θ_{3} & 1 - θ_{4} \end{matrix}], \\ S_{2} & = [\begin{matrix} θ_{5} & θ_{6} & θ_{7} & θ_{8} \\ 1 - θ_{5} & 1 - θ_{6} & 1 - θ_{7} & 1 - θ_{8} \end{matrix}] . \end{array}

FIGURE 1

Figure 1. Network graph.

Potential interactions can be treated in the same way as known interactions as long as all of them could potentially be simultaneously true. If there are two alternative hypotheses and the question is which fits better to the data, then this can be done by introducing a constraint on the parameters θ.

Example 2. Assume that X₁ is influenced either by X₂ or by U₁, this could be ensured by imposing the constraint

\begin{array}{rcl} λ (θ_{1} - θ_{2}) \cdot (θ_{3} - θ_{4}) + (1 - λ) (θ_{1} - θ_{3}) \cdot (θ_{2} - θ_{4}) = 0, λ \in {0, 1}, & (9) \end{array}

4.2. Canalizing Boolean Functions

The concept of “canalizing” values in Boolean functions was introduced in developmental biology in 1940s (Waddington, 1942). The idea is, that one input is dominant and if it takes a certain value it determines the output. After that, in order to explain the phenomenon that absence of repressor or high levels of allolactose assures the operator cannot bind repressor in lac operon of the bacterium Escherichia coli, Kauffman applied this concept to BN modeling of gene regulatory networks (Kauffman, 1974).

Canalizing functions are defined as follows.

Definition 1. A Boolean function $f : D^{n} \to_{}^{f} D$ is canalizing if there exist a variable X_i, i ∈ {1, 2, ⋯, n} and a Boolean function g(X₁, ⋯, X_{i − 1}, X_{i + 1}, ⋯, X_n) and $a, b \in D$ , such that

\begin{matrix} f (X_{1}, \dots, X_{n}) = {\begin{array}{l} b, & if X_{i} = a, \\ g \neq b, & if X_{i} \neq a \end{array} & (10) \end{matrix}

where a is called the canalizing value for the variable X_i and b is the canalizing output value (Kauffman, 1974).

Based on Definition 1, this prior knowledge can be translated into imposing a specified value in the corresponding logical matrix. Assume that the logical matrix for the canalizing function (10) is denoted as S and the canalizing value a and canalizing output b can, respectively, be expressed in a vector form as $δ_{2}^{2 - a}$ and $δ_{2}^{2 - b}$ . Then, we can get the following result.

Theorem 1. Given a canalizing function (10). The corresponding logical matrix $S \in L_{2 \times 2^{n}}$ satisfies

\begin{matrix} S W_{[2, 2^{i - 1}]} δ_{2}^{2 - a} = δ_{2} \underset{2^{n - 1}}{\underset{︸}{[2 - b 2 - b \dots 2 - b] .}} & (11) \end{matrix}

where $W_{[2, 2^{i - 1}]}$ is the swap matrix.

Proof. According to Lemma 1, it is easy to obtain $S x_{1} x_{2} \dots x_{n} = S W_{[2, 2^{i - 1}]} x_{i} x_{1} x_{2} \dots x_{i - 1} x_{i + 1} \dots x_{n}$ . Applying (11), we have

\begin{array}{l} S W_{[2, 2^{i - 1}]} δ_{2}^{2 - a} x_{1} x_{2} \dots x_{i - 1} x_{i + 1} \dots x_{n} \\ = δ_{2} \underset{2^{n - 1}}{\underset{︸}{[2 - b 2 - b \dots 2 - b]}} x_{1} x_{2} \dots x_{i - 1} x_{i + 1} \dots x_{n} = δ_{2}^{2 - b} \end{array}

which corresponds to f(X₁, ⋯, X_i−1, a, X_i+1, ⋯, X_n) = b for any X₁, ⋯, X_i−1, X_i+1, ⋯, X_n ∈ {0, 1}.

Let's take an example to illustrate the result of Theorem 1.

Example 3. Consider the BCN (7). Assume that the Boolean function f₁ is a canalizing function in x₂ for a canalizing value $δ_{2}^{2}$ and the corresponding canalizing output is $δ_{2}^{1}$ . Due to the canalizing property, the logical matrix S₁ can be reduced to

\begin{array}{l} S_{1} W_{[2, 2]} δ_{2}^{2} = [\begin{matrix} 1 & 1 \\ 0 & 0 \end{matrix}] \Rightarrow S_{1} = [\begin{matrix} θ_{1} & 1 & θ_{3} & 1 \\ 1 - θ_{1} & 0 & 1 - θ_{3} & 0 \end{matrix}] . \end{array}

It can be checked that $S_{1} u_{1} δ_{2}^{2} = δ_{2}^{1}$ , no matter whether $u_{1} = δ_{2}^{1}$ or $u_{1} = δ_{2}^{2}$ . Note that the logical matrix S₁ contains only two binary parameters (i.e., θ₁ and θ₃). It shows that using canalizing property can reduce the number of binary parameters.

As an important subclass of canalizing function, k-canalizing function is defined as follows.

Definition 2. Let σ be a permutation on the set {1, 2, ⋯, n}. A Boolean function $f : D^{n} \to_{}^{f} D$ is k-canalizing in the variable order X_σ(1), X_σ(2), ⋯, X_σ(k) with canalizing input values a₁, a₂, ⋯, a_k and canalizing output values b₁, b₂, ⋯, b_k, if it can be represented in the form (Kauffman et al., 2003).

\begin{matrix} f (X_{1}, \dots, X_{n}) ​ = ​​ {\begin{array}{l} b_{1}, & if X_{σ (1)} = a_{1}, \\ b_{2}, & if X_{σ (1)} \neq a_{1}, X_{σ (2)} = a_{2}, \\ ⋮ \\ b_{k}, & if X_{σ (1)} \neq a_{1}, X_{σ (2)} \neq a_{2} \dots \\ X_{σ (k)} = a_{k}, \\ g \neq b_{k}, & if X_{σ (1)} \neq a_{1}, X_{σ (2)} \neq a_{2} \dots \\ X_{σ (k)} \neq a_{k} . \end{array} & (12) \end{matrix}

Note that if all variables have certain canalizing values, then the function is called nested canalizing function (Kauffman et al., 2003).

As a Boolean variable can only take two values, i.e., {0, 1}, (12) can be equivalently expressed as f(X₁, ⋯, X_n) = b_i, if X_σ(1) = 1−a₁, X_σ(2) = 1−a₂, ⋯, X_σ(i) = a_k, i = 1, 2, ⋯, k. Using the Boolean variables ${[X_{σ (1)} X_{σ (2)} \dots X_{σ (i)}]}^{T}$ to represent a multi-valued logic variable, it is straightforward to recognize that a k-canalizing function can be equivalently formulated as a canalizing function in a multi-valued logic variable. Therefore, Theorem 1 can be applied to specify the logical matrix for k-canalizing or nested canalizing function (12).

It is necessary to point out that different from the approaches proposed in Breindl et al. (2013) and Faisal et al. (2010), some binary parameters can be directly inferred, no matter which canalizing value the canalizing variable takes.

Example 4. Consider the BCN (7). Assume that the Boolean function f₂ is nested canalizing function, which can be represented as

f_{2} (U_{2}, X_{1}) = {\begin{array}{l} 1, & if U_{2} = 1, \\ 0, & if U_{2} \neq 1, X_{1} = 1. \end{array}

Because f₂(1, X₁) = 1 for X₁ ∈ {0, 1}, we have

\begin{array}{l} S_{2} δ_{2}^{1} = [\begin{matrix} 1 & 1 \\ 0 & 0 \end{matrix}] \Rightarrow S_{2} = [\begin{matrix} 1 & 1 & θ_{7} & θ_{8} \\ 0 & 0 & 1 - θ_{7} & 1 - θ_{8} \end{matrix}] . \end{array}

Moreover, due to f₂(0, 1) = 0, there is

\begin{array}{l} S_{2} δ_{2}^{2} δ_{2}^{1} = [\begin{matrix} 0 \\ 1 \end{matrix}] \Rightarrow S_{2} = [\begin{matrix} 1 & 1 & 0 & θ_{8} \\ 0 & 0 & 1 & 1 - θ_{8} \end{matrix}] . \end{array}

Remark 1. Theorem 1 implies that considering canalizing property of a Boolean function, the corresponding logical matrix can be expressed with fewer binary parameters. For instance, if a Boolean function f(X₁, X₂, ⋯, X_n) is a k-canalizing function, then 2^n−k different binary parameters are enough to represent the corresponding logical matrix.

4.3. Unate Boolean Functions

The behavior of some substances or genes are well studied and it is known that they act as suppressing or activating in all reactions they are involved. If they always act inhibiting they have the so called negative unateness property. For the case that a quantity exclusively induces the expression of another gene or substance it has the positive unateness property (Porreca et al., 2010).

For the mathematical modeling of the unatess properties let us consider another important type of Boolean functions, which is called the unate function (Breindl et al., 2013).

Definition 3. (Breindl et al., 2013) A Boolean function $f : D^{n} \to_{}^{f} D$ is unate in x_i, if for any ${[X_{1} X_{2} \dots X_{i - 1} X_{i + 1} \dots X_{n}]}^{T} \in D^{n - 1}$ it holds for positive unateness that

\begin{array}{rcl} f (\dots, X_{i - 1}, 0, X_{i + 1}, \dots) \leq f (\dots, X_{i - 1}, 1, X_{i + 1}, \dots) & (13) \end{array}

or it always holds for negative unateness that

\begin{array}{rcl} f (\dots, X_{i - 1}, 0, X_{i + 1}, \dots) \geq f (\dots, X_{i - 1}, 1, X_{i + 1}, \dots) & (14) \end{array}

In the same way as Breindl et al. (2013), unateness can be equivalently represented as linear formulation. Afterwards, this linear formulation can be seen as additional inequality constraints in the optimization problem. As Boolean function can be rewritten as a vector form (4) and according to Lemma 1, there is

\begin{array}{rcl} \begin{aligned} S x_{1} x_{2} \dots x_{i - 1} x_{i} x_{i + 1} \dots x_{n} = S W_{[2, 2^{i - 1}]} x_{i} x_{1} x_{2} \dots x_{i - 1} x_{i + 1} \dots x_{n} & (15) \end{aligned} \end{array}

where S is the logical matrix corresponding to the Boolean function f. Hence, f(⋯, X_i−1, 0, X_i+1, ⋯) and f(⋯, X_i−1, 1, X_i+1, ⋯) can, respectively, be represented in a vector form as

\begin{array}{rcl} S W_{[2, 2^{i - 1}]} δ_{2}^{2} x_{1} x_{2} \dots x_{i - 1} x_{i + 1} \dots x_{n} & (16) \end{array}

and

\begin{array}{rcl} S W_{[2, 2^{i - 1}]} δ_{2}^{1} x_{1} x_{2} \dots x_{i - 1} x_{i + 1} \dots x_{n} & (17) \end{array}

Furthermore, based on the vector form of Boolean variable (3) and according to (13) or (14), for each x₁, x₂, ⋯, x_i−1, x_i+1, ⋯, x_n ∈ Δ₂ an inequality can be set up. Putting all inequality constraints together, we can find a matrix A for the following expression.

\begin{array}{rcl} A \cdot θ \leq 0_{n} & (18) \end{array}

Example 5. Consider the Boolean function x₁ = f₁(x₂), this function f₁ is defined by two unknown parameters θ₁ and θ₂. Assume that the Boolean function f₁ is a unate function with respect to x₂, which satisfies (13). As the first step, the matrix $S_{1} δ_{2}^{1}$ and $S_{1} δ_{2}^{2}$ are calculated, which yields

\begin{array}{l} S_{1} δ_{2}^{1} = [\begin{matrix} θ_{1} \\ 1 - θ_{1} \end{matrix}], S_{1} δ_{2}^{2} = [\begin{matrix} θ_{2} \\ 1 - θ_{2} \end{matrix}] . \end{array}

Then, the inequality constraint is

\begin{array}{l} θ_{2} \leq θ_{1} \Leftrightarrow [\begin{matrix} - 1 & 1 \end{matrix}] \cdot [\begin{matrix} θ_{1} \\ θ_{2} \end{matrix}] \leq 0 . \end{array}

4.4. Known Attractors or Limit Cycles

When the BCN is not perturbed for a sufficiently long time it reaches the steady state. The steady state of a BCN can be exactly one state (i.e., attractor) or a fix cycle of some states (i.e., limit cycle). Attractors or limit cycles are assumed to determine the phenotype in the cell differentiation (Huang and Ingber, 2000). The experimental setup to measure the steady state of a system is simpler and measurements are easier to reproduce compared with transient dynamics. As a result, the steady state of the BN is often already known when the perturbation experiments for identification of the transient behavior are carried out. This knowledge can be utilized as follows.

An attractor corresponds to a self loop in the reachability graph. For a given input combination this fixes one specific coulumn in the matrix L. For the constant input $u (t) = δ_{2^{m}}^{i}$ and the constant state $x (t) = δ_{2^{n}}^{j}$ the k-th column is known to be $C o l_{k} (L) = δ_{2^{n}}^{j}$ with k = (i − 1)2ⁿ + j. A limit cycle can be analyzed in a similar manner. For the given state sequence of the limit cycle of length T and the constant input $u (t) = δ_{2^{m}}^{i}$ one can calculate T columns of L. For each time instant t of the cycle the actual state $x (t) = δ_{2^{n}}^{j}$ and the next state $x (t + 1) = δ_{2^{n}}^{w}$ is known. The information of this known transition is used by setting the k-th column to $C o l_{k} (L) = δ_{2^{n}}^{w}$ with k = (i−1)2ⁿ + j.

5. Identification Approach

In this part, the identification problem of BCNs will be studied. At first, it will be shown that the identification problem can be reformulated as a nonlinear pseudo-Boolean optimization problem by applying the idea of the prediction error method. The pseudo-Boolean optimization can be transformed into an equivalent linear binary integer programming problem that can be solved more efficiently. Then, we give a way to deal with missing measurement values. Finally, we discuss how dependencies between measured substances can be handled.

5.1. Optimization Problem

The prediction error method (PEM) is one of the most widely used identification methods (Isermann and Münchhof, 2011). The basic idea behind this method is to choose parameters to make the difference between a prediction based on the model and the measured values as small as possible. As the PEM minimizes the prediction error in the identified system, errors in the data set due to noise need no special treatment. Obviously the more noise is expected in the data set the more data should be acquired for identification of a reliable model.

Before applying PEM, it is necessary to specify a measure of prediction error. In information theory, the Hamming distance d(X, Y) between two vectors $X, Y \in D^{n}$ is defined as the number of positions, in which the entries differ (Hamming, 1950).

\begin{array}{rcl} d (X, Y) = | {j \in {1, 2, \dots, n} | {[X]}_{j} \neq {[Y]}_{j}} | & (19) \end{array}

As each entry in the vectors X and Y belongs to the Boolean domain {0, 1}, (19) can be equivalently written as

\begin{array}{rcl} d (X, Y) = \sum_{i = 1}^{n} {| [X]}_{i} - {[Y] |}_{i} & (20) \end{array}

Furthermore, let x_i, y_i be, respectively, the vector form of [X]_i and [Y]_i. Then, it is straightforward to get

\begin{array}{rcl} {| [X]}_{i} - {[Y]}_{i} | = (1 - x_{i}^{T} \cdot y_{i}) & (21) \end{array}

Based on this, the Hamming distance d(X, Y) can be rewritten as

\begin{array}{rcl} d (X, Y) = \sum_{i = 1}^{n} (1 - x_{i}^{T} \cdot y_{i}) & (22) \end{array}

Assume that the observed input and state data is {(U(t), X(t)), t = 0, 1, ⋯, T}. The vector form of the input data {U₁(t), U₂(t), ⋯, U_m(t)} and state data {X₁(t), X₂(t), ⋯, X_n(t)} are denoted, respectively, as u₁(t), u₂(t), ⋯, u_m(t) and x₁(t), x₂(t), ⋯, x_n(t). Since the logical matrix S_i for the state variable X_i can be represented by the parameter vector θ, we simply denote them as S_i(θ). Suppose that the state variable X_i can be influenced by the variables X_j₁, X_j₂, ⋯, X_{j_k}. According to (5), it is easy to get expression of the prediction ${\hat{x}}_{i} (θ, t)$ :

\begin{array}{rcl} {\hat{x}}_{i} (θ, t) = S_{i} (θ) u (t - 1) ⋉_{i = 1}^{k} x_{j_{i}} (t - 1) & (23) \end{array}

Recalling (21) and (22), the PEM method will estimate the binary parameters by minimizing the prediction error, i.e.,

\begin{array}{rcl} min_{θ \in D^{k}} \sum_{t = 0}^{T} d (X (t), \hat{X} (θ, t)) = min_{θ \in D^{k}} \sum_{t = 0}^{T} \sum_{i = 1}^{n} (1 - x_{i}^{T} (t) \cdot {\hat{x}}_{i} (θ, t)) & (24) \end{array}

Furthermore, the optimization problem (24) can be equivalently rewritten as

\min_{θ \in D^{k}} (T \cdot n - \sum_{t = 0}^{T} \sum_{i = 1}^{n} x_{i}^{T} (t) \cdot {\hat{x}}_{i} (θ, t))

which is actually equivalent to

\begin{array}{rcl} max_{θ \in D^{k}} \sum_{t = 0}^{T} \sum_{i = 1}^{n} x_{i}^{T} (t) \cdot {\hat{x}}_{i} (θ, t) & (25) \end{array}

Next, it will be shown that the optimization problem (25) can be formulated as a pseudo-Boolean optimization (i.e., optimization of pseudo-Boolean functions). A pseudo-Boolean function is a mapping from a finite number of Boolean variables to a real number and can be uniquely represented by a multi-linear polynomial (Boros and Hammer, 2002).

As mentioned before, any logical matrix can be expressed by a multi-linear polynomial. After calculation, the term $\sum_{t = 0}^{T} \sum_{i = 1}^{n} x_{i}^{T} (t) {\hat{x}}_{i} (θ, t)$ can be represented by a multivariate polynomial.

\begin{array}{rcl} P_{m v} (θ) = c + \sum_{Q_{β} \subset V} c_{Q_{β}} \prod_{j \in Q_{β}} θ_{j}^{r_{Q_{β}, j}} & (26) \end{array}

where $c, c_{Q_{β}} \in ℝ$ for $Q_{β} \subset V = {1, 2, \dots, k}$ and the factor $r_{Q_{β}, j}, \forall β, j$ is a natural number. In addition, using the property of Boolean variables $θ_{i}^{r} = θ_{i}, \forall r \in ℤ_{+}$ , the multivariate polynomial (26) is easily transformed into a multi-linear polynomial. Consequently, the term $\sum_{t = 0}^{T} \sum_{i = 1}^{n} x_{i}^{T} (t) \cdot {\hat{x}}_{i} (θ, t)$ can be described by a multi-linear polynomial (6) and the optimization problem (25) is transformed into a pseudo-Boolean optimization problem

\begin{array}{rcl} max_{θ \in D^{k}} P_{m l} (θ) = max_{θ \in D^{k}} c + \sum_{i = 1}^{k} c_{i} θ_{i} + \sum_{α = 1}^{q} c_{I_{α}} \prod_{j \in I_{α}} θ_{j} & (27) \end{array}

So far, several different ways to handle the nonlinear pseudo-Boolean optimization problems (27) exist, such as reduction to an equivalent linear or quadratic binary programming problem, branch-and-bound method, linear approximations (Boros and Hammer, 2002; Crama and Rodrí-guez-Heck, 2017). As the linear programming relaxation of an integer linear program can be solved efficiently and based on the solution integer solutions can be found, in this paper we consider “linearization”, so that nonlinear binary optimization can be reduced to integer linear program (Crama and Rodrí-guez-Heck, 2017). The key is to introduce auxiliary Boolean variables $z = {[z_{1} z_{2} \dots]}^{T}$ to replace the nonlinear monomial $\prod_{j \in I_{α}} θ_{j}$ in (6) by means of the AND-expression $z_{α} = \prod_{j \in I_{α}} θ_{j}$ . Simultaneously to satisfy the AND-expression, linear inequalities as constraints are considered to get feasible value of the nonlinear monomial $\prod_{j \in I_{α}} θ_{j}$ . Finally, an optimization problem equivalent to (27) is obtained as

\begin{matrix} \begin{array}{l} \max_{θ, z} L_{P} (θ, z) = \max_{θ, z} c + \sum_{i = 1}^{k} c_{i} θ_{i} + \sum_{α} c_{ℐ_{α}} z_{α} \\ s . t . z_{α} \leq θ_{j}, \forall j \in ℐ_{α}, \\ z_{α} \geq \sum_{j \in ℐ_{α}} θ_{j} - (| ℐ_{α} | - 1), \\ z_{α} \in D, θ \in D^{k} . \end{array} & (28) \end{matrix}

The constraints in the optimization problem in (27) can be complemented by additional constraints representing the prior knowledge of alternative hypotheses or unateness as shown in Section 4.1 and Section 4.3, respectively.

Remark 2. It is important to note that minimizing or maximizing a pseudo-Boolean function is known to be $NP$ -hard (Crama and Rodrí-guez-Heck, 2017). However, Breindl et al. (2013) shows that the optimization problem (28) can be solved using a relaxed problem, i.e., linear programming solver based on the simplex method, which requires less computational effort than mixed integer linear program. The relaxed problem delivers an integer as optimal solution, which is also an optimal solution of the optimization problem (28).

5.2. Handling of Large Scale Networks

With modern measurement techniques it is possible to quantify a huge amount of substances simultaneously. A Boolean network which describes the observed interactions is then also of large scale. But the number of substances which are direct relevant for the regulation of certain substance is usually limited, in other words the connectivity inside the network is bounded. For instance, as pointed out by Arnone and Davidson (1997), the connectivity is bounded by 8. Without prior knowledge the complexity of the algorithm is $O = 2^{n + m}$ as all state and input combinations have to be considered as potential regulators for all states, even though only some of them are relevant in the end. This would limit the applicability of the approach to rather small networks. If one has hypotheses about potential interaction partners and the number of potential regulators per state is limited by a set of k variables, then the complexity of the algorithm is $O = 2^{k}$ , as the regulative functions for each state can be inferred separately. The hypotheses for the interaction partners are not necessarily based on prior-knowledge, but could also be computed based on the data set. In Margolin et al. (2006) an approach is presented, which is based on the information theoretic concept of mutual information ranking and the restriction to pairwise interactions that leads to a very good scaling with big data sets.

5.3. Handling of Missing Measurement Values

Dependent on the measurement technique it is sometimes not possible to measure all states at all time instances and the missing values must be handled in the data analysis. There are approaches in the literature to compute an imputation e.g., for microarrays in Gan et al. (2006) and gel-based proteomics in Albrecht et al. (2010). These approaches are based on interpolation or heuristics. An alternative is to use a data analysis approach which can deal with incomplete data matrices.

A missing measurement value can be estimated during the identification by adding additional binary parameters in the identification process. Because of vector expression of states, all possible states belong to the set $Δ_{2^{n}}$ . In this way, n binary parameters are enough for vector expression of a completely unknown state at time k. For example, if n = 2, then we can generally express the unknown state as

\begin{array}{rcl} x (k) = [\begin{matrix} γ_{1} \cdot γ_{2} \\ γ_{1} \cdot (1 - γ_{2}) \\ (1 - γ_{1}) \cdot γ_{2} \\ (1 - γ_{1}) \cdot (1 - γ_{2}) \end{matrix}] . & (29) \end{array}

Furthermore, as the states of the system are known partially, then the number of binary parameters can be reduced accordingly. So for each missing value one parameter is added to the optimization and the imputation for this value is calculated which fits best to the other dynamic behavior of the system.

5.4. Handling of Unmeasurable Processes

In some systems post transcriptional protein-protein interactions induce dependencies between the measured abundances similar to the transcriptional regulation. This leads to the situation that the transcriptional regulation can not be observed directly and the identification procedure needs to be adapted accordingly (Geier et al., 2007). The dependencies between the states and the measured outputs can be included in boolean models easily by adding Boolean functions mapping from the actual stats X(t) to the measured outputs Y(t):

\begin{array}{rcl} Y_{j} (t) = h_{j} (X (t)), j = 1, 2, \dots, p & (30) \end{array}

where ${[Y (t) = Y_{1} (t) Y_{2} (t) \dots Y_{p} (t)]}^{T} \in D^{p}$ is the output vector at time t, h_i are logic functions. All structural information on the logic functions can be expressed with a logical matrix H

\begin{array}{rcl} y (t) = H x (t) & (31) \end{array}

which can be derived analogous to Equations (2–5). All approaches presented in this paper can be extended for the BN model with output mapping. As additional logic functions are to be identified, additional unknown parameters are added and these parameters cannot be separately identified from the parameters of the regulative functions, which impacts the computational burden drastically (Zhang et al., 2017b).

5.5. Influence of Noise

In real world experiments measurement noise is unavoidable. With a sophisticated binarization method the influence of additive noise can often be suppressed (Hopfensitz et al., 2012). But noise can still lead to wrong binarized values in some cases and consequently errors in the input to the identification method cannot be totally avoided. As the presented approach is based on an optimization, the network which optimally fits to the observed data is found. Inconsistent transitions caused by noise in the data set can be handled directly and lead to an identification result with a non-zero prediction error. If, due to noise, the observed transitions would lead to an identification result which is contradictory to prior knowledge, the identification approach ignores these transitions directly.

Example 6. Consider the BCN for oxidative stress response pathways with the PI3-Kinase-Akt pathway given in Sridharan et al. (2012).

{\begin{array}{l} X_{1} (t + 1) & = U (t) \land \neg X_{6} (t) \\ X_{2} (t + 1) & = \neg X_{1} (t) \\ X_{3} (t + 1) & = \neg X_{1} (t) \land (X_{5} (t) \lor X_{3} (t) \\ X_{4} (t + 1) & = X_{1} (t) \land \neg X_{6} (t) \\ X_{5} (t + 1) & = X_{4} (t) \lor \neg X_{3} (t) \\ X_{6} (t + 1) & = X_{5} (t) \land (\neg X_{6} (t) \lor \neg X_{2} (t)) & (32) \end{array}

In the model, X₁ represents stress reactive intermediaries, X₂ transcription factor A, X₃ key protein, X₄ protein kinase, X₅ transcription factor B, X₆ anti-stress response element, U stress signal. Using STP, (32) can be converted into the algebraic form (5) with $x (t) = ⋉_{i = 1}^{6} x_{i} (t) \in Δ_{64}, u (t) \in Δ_{2}$ .

Assume that two experiments have been executed starting in steady state with two different stimuli, the corresponding input-state data is obtained as shown in Figures 2A,B. Assume further that as prior knowledge the candidates of regulative interactions (see the dashed lines in Figure 3A) and the attractor are given. The attractor of the BCN without stress is X₁ = 0, X₂ = 1, X₃ = 1, X₄ = 0, X₅ = 0, X₆ = 0.

FIGURE 2

Figure 2. Perturbation and state measurement. (A) First experiment. (B) Second experiment.

FIGURE 3

Figure 3. Hypothesis, partially identified and fully identified network graph. (A) Hypotheses for regulative interactions. (B) Identified Boolean network without canalizing information. (C) Identified Boolean network with prior knowlege.

Based on the candidates of regulative interactions, the number of unknown binary parameters θ representing the logical matrices of the Boolean functions can be reduced from 6·2⁷ = 768 to 40 as described in Section 4.1. For instance, since the variable X₂ is connected with the variables X₁, X₃ and X₅, it means that the Boolean function of the variable X₂ can be described by f₂(X₁, X₅, X₅). Accordingly, 8 binary parameters are enough to represent the logical matrix S₂ of the Boolean function f₂, i.e.,

\begin{array}{rcl} S_{2} = [\begin{matrix} θ_{1} & θ_{2} & θ_{3} & θ_{4} & θ_{5} & θ_{6} & θ_{7} & θ_{8} \\ 1 - θ_{1} & 1 - θ_{2} & 1 - θ_{3} & 1 - θ_{4} & 1 - θ_{5} & 1 - θ_{6} & 1 - θ_{7} & 1 - θ_{8} \end{matrix}] . & (33) \end{array}

The information about the steady state is used as described in Section 4.4 to determine one parameter in each matrix, which reduces the number of unknown variables to 34. In the next, we apply the proposed approach to identify the model of the BCN from the given input-state data. Solving the optimization problem (28), in total, 31 unknown binary parameters can be determined. The identification result is depicted in Figure 3B and the identified matrices are as follows,

\begin{matrix} \begin{array}{l} S_{1} = [\begin{matrix} 0 & 1 & 0 & 0 \\ 1 & 0 & 1 & 1 \end{matrix}], S_{2} = [\begin{matrix} 0 & 0 & 0 & 0 & 1 & 1 & 1 & 1 \\ 1 & 1 & 1 & 1 & 0 & 0 & 0 & 0 \end{matrix}], \\ S_{3} = [\begin{matrix} 0 & 0 & 0 & 0 & 1 & 1 & 1 & 0 \\ 1 & 1 & 1 & 1 & 0 & 0 & 0 & 1 \end{matrix}], \\ S_{4} = [\begin{matrix} 0 & 1 & 0 & 1 & 0 & 0 & 0 & 0 \\ 1 & 0 & 1 & 0 & 1 & 1 & 1 & 1 \end{matrix}], S_{5} = [\begin{matrix} θ_{29} & 0 & 1 & 1 \\ 1 - θ_{29} & 1 & 0 & 0 \end{matrix}], \\ S_{6} = [\begin{matrix} 0 & 1 & θ_{35} & 0 & 1 & 1 & θ_{39} & 0 \\ 1 & 0 & 1 - θ_{35} & 1 & 0 & 0 & 1 - θ_{39} & 1 \end{matrix}] . \end{array} & (34) \end{matrix}

It can be seen that the logical matrices of the Boolean functions for X₅ and X₆ can not be uniquely determined. Combined with an additional information about activating or suppressing properties of the states, for instance, X₄ and X₅ are, respectively, activator to X₅ and X₆, the complete model can be uniquely reconstructed. The canalizing property of X₄ and X₅ can be utilized as described in Section 4.2. If this information is not available, one could conduct additional experiments with different stimuli and combine the data to have full reconstruction of the model as depicted in Figure 3C.

6. Discussion

The proposed method facilitates the incorporation of various types of prior knowledge. The optimization problem can be solved by efficient linear programming solvers. By using the simplex method one can guarantee to find the network which optimally fits to the observed data. In comparison, the genetic algorithms based approaches may not guarantee the optimal solution. The proposed method is developed for synchronous Boolean networks. It can be applied to large scale networks, if the connectivity of the network to be identified is limited with aid of prior knowledge or application of information theory.

In future we plan to investigate data-based approaches to infer the connections in large networks and automated partitioning into smaller subsystems (e.g., with an adapted approach from discrete event systems like Saives et al., 2018). We also work on a new method for the binarization based on the idea that the qualitative system behavior before and after the binarization shall be the same.

Author Contributions

TL, ZZ, and PZ conception and design of research. TL and ZZ performed simulation and analyzed data. TL, ZZ, and PZ interpreted simulation results. TL and ZZ prepared figures. TL, ZZ, and PZ drafted manuscript and approved final version of manuscript.

Funding

This work is supported by the Federal State of Rhineland-Palatinate, Germany in the framework of the project Complex Data Analysis in Life Sciences and Biotechnology (BioComp).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors would like to thank Michael Schroda and Timo Mühlhaus for the discussion on the data characteristics in biological systems.

References

Akutsu, T., Miyano, S., and Kuhara, S. (1999). “Identification of genetic networks from a small number of gene expression patterns under the boolean network model,” in Proceedings of the Pacific Symposium on Biocomputing (Mauna Lani), 17–28.

Google Scholar

Albert, R., and Barabási, A.-L. (2000). Dynamics of complex systems: Scaling laws for the period of boolean networks. Phys. Rev. Lett. 84, 5660–5663. doi: 10.1103/PhysRevLett.84.5660

PubMed Abstract | CrossRef Full Text | Google Scholar

Albrecht, D., Kniemeyer, O., Brakhage, A. A., and Guthke, R. (2010). Missing values in gel-based proteomics. Proteomics 10, 1202–1211. doi: 10.1002/pmic.200800576

PubMed Abstract | CrossRef Full Text | Google Scholar

Alon, N., Babai, L., and Suzuki, H. (1991). Multilinear polynomials and frankl-ray-chaudhuri-wilson type intersection theorems. J. Combinat. Theory A 58, 165–180. doi: 10.1016/0097-3165(91)90058-O

CrossRef Full Text | Google Scholar

Arnone, M., and Davidson, E. (1997). The hardwiring of development: organization and function of genomic regulatory systems. Development 124, 1851–1864.

PubMed Abstract | Google Scholar

Berestovsky, N., and Nakhleh, L. (2013). An evaluation of methods for inferring boolean networks from time-series data. PLoS ONE 8:e66031. doi: 10.1371/journal.pone.0066031

PubMed Abstract | CrossRef Full Text | Google Scholar

Bollobas, B. (2012). Graph Theory: An Introductory Course, Vol. 63. New York, NY: Springer Science & Business Media.

Google Scholar

Boros, E., and Hammer, P. L. (2002). Pseudo-boolean optimization. Discrete Appl. Math. 123, 155–225. doi: 10.1016/S0166-218X(01)00341-9

CrossRef Full Text | Google Scholar

Breindl, C., Chaves, M., and Allgöwer, F. (2013). “A linear reformulation of boolean optimization problems and structure identification of gene regulation networks,” in Proceedings of the 52th IEEE Conference on Decision and Control (Florence), 733–738.

Google Scholar

Cheng, D. (2001). Semi-tensor product of matrices and its application to morgen's problem. Sci. China Ser. Informat. Sci. 2001, 195–212. doi: 10.1007/BF02714570

CrossRef Full Text | Google Scholar

Cheng, D., and Qi, H. (2010). A linear representation of dynamics of boolean networks. IEEE Trans. Automat. Cont. 55, 2251–2258. doi: 10.1109/TAC.2010.2043294

CrossRef Full Text | Google Scholar

Cheng, D., Qi, H., and Li, Z. (2011a). Analysis and Control of Boolean Networks: A Semi-Tensor Product Approach. London: Springer.

Google Scholar

Cheng, D., Qi, H., and Li, Z. (2011b). Model construction of boolean network via observed data. IEEE Trans. Neural Netw. 22, 525–536. doi: 10.1109/TNN.2011.2106512

PubMed Abstract | CrossRef Full Text | Google Scholar

Cheng, D., and Zhao, Y. (2011). Identification of boolean control networks. Automatica 47, 702–710. doi: 10.1016/j.automatica.2011.01.083

CrossRef Full Text | Google Scholar

Crama, Y., and Rodrí-guez-Heck, E. (2017). A class of valid inequalities for multilinear 0-1 optimization problems. Discrete Optimizat. 25, 28–47. doi: 10.1016/j.disopt.2017.02.001

CrossRef Full Text | Google Scholar

Davidich, M. I., and Bornholdt, S. (2008). Boolean network model predicts cell cycle sequence of fission yeast. PLoS ONE 3:e1672. doi: 10.1371/journal.pone.0001672

PubMed Abstract | CrossRef Full Text | Google Scholar

Dorier, J., Crespo, I., Niknejad, A., Liechti, R., Ebeling, M., and Xenarios, I. (2016). Boolean regulatory network reconstruction using literature based knowledge with a genetic algorithm optimization method. BMC Bioinform. 17:410. doi: 10.1186/s12859-016-1287-z

PubMed Abstract | CrossRef Full Text | Google Scholar

Faisal, S., Lichtenberg, G., Trump, S., and Attinger, S. (2010). Structural properties of continuous representations of boolean functions for gene network modelling. Automatica 46, 2047–2052. doi: 10.1016/j.automatica.2010.09.001

CrossRef Full Text | Google Scholar

Fauré, A., Naldi, A., Chaouiya, C., and Thieffry, D. (2006). Dynamical analysis of a generic boolean model for the control of the mammalian cell cycle. Bioinformatics 22, e124–e131. doi: 10.1093/bioinformatics/btl210

PubMed Abstract | CrossRef Full Text | Google Scholar

Fornasini, E., and Valcher, M. E. (2014). “Identification problems for boolean networks and boolean control networks,” in Proceedings of the 19th IFAC World Congress (Cape Town), 5399–5404.

Google Scholar

Fumia, H. F., and Martins, M. L. (2013). Boolean network model for cancer pathways: predicting carcinogenesis and targeted therapy outcomes. PLoS ONE 8:e69008. doi: 10.1371/journal.pone.0069008

PubMed Abstract | CrossRef Full Text | Google Scholar

Gan, X., Liew, A. W.-C., and Yan, H. (2006). Microarray missing data imputation based on a set theoretic framework and biological knowledge. Nucleic Acids Res. 34, 1608–1619. doi: 10.1093/nar/gkl047

PubMed Abstract | CrossRef Full Text | Google Scholar

Geier, F., Timmer, J., and Fleck, C. (2007). Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge. BMC Sys. Biol. 1:11. doi: 10.1186/1752-0509-1-11

PubMed Abstract | CrossRef Full Text | Google Scholar

Grieb, M., Burkovski, A., Sträng, J. E., Kraus, J. M., Groß, A., Palm, G., et al. (2015). Predicting variabilities in cardiac gene expression with a boolean network incorporating uncertainty. PLoS ONE 10:e0131832. doi: 10.1371/journal.pone.0131832

PubMed Abstract | CrossRef Full Text | Google Scholar

Hamming, R. W. (1950). Error detecting and error correcting codes. Bell Labs Techn. J. 29, 147–160. doi: 10.1002/j.1538-7305.1950.tb00463.x

CrossRef Full Text | Google Scholar

Higa, C. H., Louzada, V. H., Andrade, T. P., and Hashimoto, R. F. (2011). Constraint-based analysis of gene interactions using restricted boolean networks and time-series data. BMC Proc. 5:S5. doi: 10.1186/1753-6561-5-S2-S5

PubMed Abstract | CrossRef Full Text | Google Scholar

Hopfensitz, M., Müssel, C., Wawra, C., Maucher, M., Kühl, M., Neumann, H., and Kestler, H. A. (2012). Multiscale binarization of gene expression data for reconstructing boolean networks. IEEE/ACM Trans. Comput. Biol. Bioinform. 9, 487–498. doi: 10.1109/TCBB.2011.62

PubMed Abstract | CrossRef Full Text | Google Scholar

Huang, S., and Ingber, D. E. (2000). Shape-dependent control of cell growth, differentiation, and apoptosis: switching between attractors in cell regulatory networks. Exp. Cell Res. 261, 91–103. doi: 10.1006/excr.2000.5044

PubMed Abstract | CrossRef Full Text | Google Scholar

Isermann, R., and Münchhof, M. (2011). Identification of Dynamic Systems: An Introduction With Applications. Berlin;Heidelberg: Springer.

Google Scholar

Karlebach, G., and Shamir, R. (2012). Constructing logical models of gene regulatory networks by integrating transcription factor-dna interactions with expression data: an entropy-based approach. J. Comput. Biol. 19, 30–41. doi: 10.1089/cmb.2011.0100

PubMed Abstract | CrossRef Full Text | Google Scholar

Kauffman, S. (1974). The large scale structure and dynamics of gene control circuits: an ensemble approach. J. Theor. Biol. 44, 167–190. doi: 10.1016/S0022-5193(74)80037-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Kauffman, S., Peterson, C., Samuelsson, B., and Troein, C. (2003). Random boolean network models and the yeast transcriptional network. Proc. Natl. Acad. Sci. U.S.A. 100, 14796–14799. doi: 10.1073/pnas.2036429100

PubMed Abstract | CrossRef Full Text | Google Scholar

Kauffman, S. A. (1969). Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theor. Biol. 22, 437–467. doi: 10.1016/0022-5193(69)90015-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Lähdesmäki, H., Shmulevich, I., and Yli-Harja, O. (2003). On learning gene regulatory networks under the boolean network model. Mach. Learn. 52, 147–167. doi: 10.1023/A:1023905711304

CrossRef Full Text | Google Scholar

Liang, S., Fuhrman, S., and Somogyi, R. (1998). “Reveal: a general reverse engineering algorithm for inference of genetic network architectures,” in Proceedings of the Pacific Symposium on Biocomputing (Hawaii), 18–29.

Google Scholar

Margolin, A. A., Nemenman, I., Basso, K., Wiggins, C., Stolovitzky, G., Favera, R. D., et al. (2006). Aracne: an algorithm for the reconstruction of gene regulatory networks in a mammalian cellular context. BMC Bioinformat. 7:S7. doi: 10.1186/1471-2105-7-S1-S7

PubMed Abstract | CrossRef Full Text | Google Scholar

Naldi, A., Monteiro, P. T., Müssel, C., Kestler, H. A., Thieffry, D., et al. (2015). Cooperative development of logical modelling standards and tools with colomoto. Bioinformatics 31, 1154–1159. doi: 10.1093/bioinformatics/btv013

PubMed Abstract | CrossRef Full Text | Google Scholar

Ostrowski, M., Paulevé, L., Schaub, T., Siegel, A., and Guziolowski, C. (2016). Boolean network identification from perturbation time series data combining dynamics abstraction and logic programming. Biosystems 149, 139–153. doi: 10.1016/j.biosystems.2016.07.009

PubMed Abstract | CrossRef Full Text | Google Scholar

Ouyang, H., Fang, J., Shen, L., Dougherty, E. R., and Liu, W. (2014). Learning restricted boolean network model by time-series data. EURASIP J. Bioinform. Sys. Biol. 2014:10. doi: 10.1186/s13637-014-0010-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Porreca, R., Cinquemani, E., Lygeros, J., and Ferrari-Trecate, G. (2010). Identification of genetic network dynamics with unate structure. Bioinformatics 26, 1239–1245. doi: 10.1093/bioinformatics/btq120

PubMed Abstract | CrossRef Full Text | Google Scholar

Saives, J., Faraut, G., and Lesage, J. J. (2018). Automated partitioning of concurrent discrete-event systems for distributed behavioral identification. IEEE Trans. Autom Sci. Eng. 15, 832–841. doi: 10.1109/TASE.2017.2718244

CrossRef Full Text | Google Scholar

Sridharan, S., Layek, R., Datta, A., and Venkatraj, J. (2012). Boolean modeling and fault diagnosis in oxidative stress response. BMC Genomics 13(Suppl. 6):S4. doi: 10.1186/1471-2164-13-S6-S4

PubMed Abstract | CrossRef Full Text | Google Scholar

Terfve, C., Cokelaer, T., Henriques, D., MacNamara, A., Goncalves, E., Morris, M. K., et al. (2012). Cellnoptr: a flexible toolkit to train protein signaling networks to data using multiple logic formalisms. BMC Sys. Biol. 6:133. doi: 10.1186/1752-0509-6-133

PubMed Abstract | CrossRef Full Text | Google Scholar

Videla, S., Guziolowski, C., Eduati, F., Thiele, S., Gebser, M., Nicolas, J., et al. (2015). Learning boolean logic models of signaling networks with asp. Theor. Comput. Sci. 599, 79–101. doi: 10.1016/j.tcs.2014.06.022

CrossRef Full Text | Google Scholar

Waddington, C. H. (1942). Canalization of development and the inheritance of acquired characters. Nature 150, 563–565. doi: 10.1038/150563a0

CrossRef Full Text | Google Scholar

Wang, R.-S., Saadatpour, A., and Albert, R. (2012). Boolean modeling in systems biology: an overview of methodology and applications. Phys. Biol. 9:055001. doi: 10.1088/1478-3975/9/5/055001

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, X., Han, H., and Zhang, W. (2017a). Identification of boolean networks using premined network topology information. IEEE Trans. Neural Netw. Learn. Sys. 28, 464–469. doi: 10.1109/TNNLS.2016.2514841

PubMed Abstract | CrossRef Full Text | Google Scholar

Zhang, Z., Leifeld, T., and Zhang, P. (2017b). “Identification of boolean control networks incorporating prior knowledge,” in IEEE 56th Annual Conference on Decision and Control (Melbourne, VIC), 5839–5844.

Google Scholar

Zhou, X., Wang, X., and Dougherty, E. R. (2003). Binarization of microarray data on the basis of a mixture model. Mol. Cancer Therapeut. 2, 679–684.

PubMed Abstract | Google Scholar

Keywords: Boolean networks, identification, prior knowledge, time series data, network inference

Citation: Leifeld T, Zhang Z and Zhang P (2018) Identification of Boolean Network Models From Time Series Data Incorporating Prior Knowledge. Front. Physiol. 9:695. doi: 10.3389/fphys.2018.00695

Received: 01 February 2018; Accepted: 18 May 2018;
Published: 08 June 2018.

Edited by:

Tomáš Helikar, University of Nebraska-Lincoln, United States

Reviewed by:

Aurélien Naldi, École Normale Supérieure, France
Ruisheng Wang, Department of Medicine, Brigham and Women's Hospital, United States

Copyright © 2018 Leifeld, Zhang and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ping Zhang, cHpoYW5nQGVpdC51bmkta2wuZGU=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.