Forecasting real-world complex networks’ robustness to node attack using network structure indexes

Bellingeri, Michele; Turchetto, Massimiliano; Scotognella, Francesco; Alfieri, Roberto; Nguyen, Ngoc-Kim-Khanh; Nguyen, Quang; Cassi, Davide

doi:10.3389/fphy.2023.1245564

ORIGINAL RESEARCH article

Front. Phys., 11 October 2023

Sec. Statistical and Computational Physics

Volume 11 - 2023 | https://doi.org/10.3389/fphy.2023.1245564

Forecasting real-world complex networks’ robustness to node attack using network structure indexes

Michele Bellingeri^1,2*

Massimiliano Turchetto^1,2

Francesco Scotognella^3,4

Roberto Alfieri^1,2

Ngoc-Kim-Khanh Nguyen⁵

Quang Nguyen^6,7

Davide Cassi^1,2

¹Dipartimento di Scienze Matematiche, Fisiche e Informatiche, Università di Parma, Parma, Italy
²Istituto Nazionale di Fisica Nucleare, Gruppo Collegato di Parma, Parma, Italy
³Dipartimento di Fisica, Politecnico di Milano, Milano, Italy
⁴Center for Nano Science and Technology@PoliMi, Istituto Italiano di Tecnologia, Milan, Italy
⁵Faculty of Fundamental Sciences, Van Lang University, Ho Chi Minh City, Vietnam
⁶Department of Physics, International University, Ho Chi Minh City, Vietnam
⁷Vietnam National University, Ho Chi Minh City, Vietnam

In this study, we simulate the degree and betweenness node attack over a large set of 200 real-world networks from different areas of science. We perform an initial node attack approach, where the node centrality rank is computed at the beginning of the simulation, and it is not updated along the node removal process. We quantify the network damage by tracing the largest connected component ( $L C C$ ) and evaluate the network robustness with the “percolation threshold $q_{c}$ ,” i.e., the fraction of nodes removed, for which the size of the $L C C$ is quasi-zero. We correlate $q_{c}$ with 20 network structural indicators (NSIs) from the literature using single linear regression (SLR), multiple linear regression (MLR) models, and the Pearson correlation coefficient test. The NSIs cover most of the essential structural features proposed in network science to describe real-world networks. We find that the Estrada heterogeneity ( $E H$ ) index, evaluating the degree difference of connected nodes, best predicts $q_{c}$ . The $E H$ index measures the network node degree heterogeneity based on the difference of functions of node degrees for all pairs of linked nodes. We find that the $q_{c}$ value decreases as a function of the $E H$ index, unveiling that heterogeneous real-world networks with a higher variance in the degree of connected nodes are more vulnerable to node attacks.

1 Introduction

Networks can model many real-world complex systems, where nodes (vertices) represent the constituent components and links (edges) describe the relationships among the node components [1, 2]. A paramount issue in complex network science is to determine the robustness of the overall system to the failure or attack of its nodes [3–10]. On the other hand, the robustness in complex networks is a problem closely related to understanding which kind of node removal (attack) strategy is the most effective in damaging the network [3, 11–14]. The node attack may model different real-world problems of high interest, such as the nodes/species extinction in ecological networks [15–17], the aging of nodes/chromophores in the photosynthetic network [18], the vaccination of nodes/individuals in social networks [19–22], or the malfunctioning of nodes/routers in computer networks [23, 24].

Network robustness to node attack may change in real-world networks with different structures [11]. Iyer et al. [3] studied network robustness as a function of the node clustering coefficient (or node transitivity). This study demonstrates that networks with higher clustering coefficients are more robust, with the most critical effect for the node degree and node betweenness attack. Nguyen and Trang [25] studied the Facebook social network. They found that those networks with higher modularity, i.e., networks presenting communities of nodes that are highly connected among them, have lower robustness to node removal. Zhou et al. [26] observed that increasing the assortativity of a network makes the network more robust against node removal and the network less stable. Nguyen et al. [27] showed that machine learning approaches unveil the degree assortativity, global closeness, and average node degree as the most critical factors in predicting the robustness ( $R$ ) of real-world social networks.

Network science research shows contrasting outcomes about the role of the network structure in affecting its robustness to node attacks. On one hand, these studies are often based on small datasets of real-world networks, and they need more (robust) statistical analyses. On the other hand, research outcomes generally restrict the investigation, focusing on a few structural features of the networks, thus lacking a wide comparison of network structural indicators (NSIs) to forecast network robustness. For these reasons, understanding which structural features of real-world networks affect their robustness to node removal is still an urgent problem in network science.

In this research, we implement two well-known node attack strategies, i.e., the degree and betweenness node removal over a large set of 200 real-world networks from different areas of science.

We quantify the network functioning damage along the node attack sequence using the largest connected component ( $L C C$ ) indicator [3, 11, 28]. To evaluate the network robustness against the node attack, we adopt the “percolation threshold” ( $q_{c}$ ), i.e., the fraction of nodes removed at which the network becomes disconnected or, in other terms, the fraction of nodes removed for which the size of the $L C C$ is quasi-zero [29].

Then, to understand how the network structure affects the network robustness (and the node attack efficacy), we correlate $q_{c}$ with 20 NSIs from the literature. To study this correlation, we performed regression analysis, single linear regression (SLR), multiple linear regression (MLR) models, and the Pearson correlation coefficient test to find the best NSI predictors of the target variable $q_{c}$ .

We find that the Estrada heterogeneity ( $E H$ ) index [30] best predicts $q_{c}$ in both the SLR and MLR models. The $q_{c}$ value decreases as a function of the $E H$ index. The $E H$ index measures network degree heterogeneity based on the difference in functions of node degrees for all pairs of linked nodes [30]. This result indicates that the degree heterogeneity of linked nodes may negatively affect the real-world network robustness to node attack, specifically the network robustness against removing the most connected and highest betweenness nodes. Our outcomes shed light on the role of the real-world network structure in shaping their robustness and can help assemble more robust network structures.

2 Methods

2.1 The node attack strategies

We simulated two classic node attack (removal) strategies. The first is the removal of nodes according to their degree (DEG), i.e., the number of links to the node [3, 4, 31]. The DEG strategy removes nodes in decreasing order of connectivity, i.e., the most connected nodes (hubs) are removed first. The second node attack strategy removes nodes in decreasing order of betweenness centrality (BET) [3, 7, 32]. The betweenness centrality is a node centrality based on the shortest paths between node pairs (also called geodesic paths). The shortest path between two nodes is the minimum number of links required to travel from one node to another [33]. The betweenness centrality of a node returns the number of shortest paths from every node pair of the network passing along that node. The betweenness $g (i)$ of the node $i$ is $(i) = \sum_{s, t = 1}^{N} \frac{σ_{s t} (i)}{σ_{s t}}$ , where $σ_{s t}$ is the total number of shortest paths between nodes $s$ and $t$ and $σ_{s t} (i)$ is the number of these shortest paths passing through the node $i$ , and $N$ is the number of nodes.

We perform an “initial node attack approach,” i.e., the node centrality rank is computed at the beginning of the simulation, and it is not updated along the node removal process [11]. The “initial node attack approach” differs from the recalculated (also named adaptive) node attack, in which node centralities are updated after node removals [11, 28]. The initial node attack describes the case where it is not possible to collect information about node features during the node removal process, such as vaccinating nodes/individuals in a social contact network with limited resources (limited time or vaccines) [34] or attacking nodes/routers in a computer network with a simultaneous node attack [28].

For both the node attack strategies, in the case of ties, i.e., nodes with equal ranking, we randomly sort their sequence. We perform 10³ simulations for each node attack strategy. We implemented the node attack simulations using the igraph package of the R program. The simulations are carried out on the high-performance computing (HPC) cluster of the “Università degli Studi di Parma.”

2.2 Real-world networks

We analyzed a large dataset of real-world network systems composed of 200 networks from different fields of science. The real-world networks analyzed here come from social, biological, Internet, road, transportation, neuronal, and ecological networks. The networks analyzed here are undirected (i.e., do not account for link directionality) and unweighted (do not account for link weight). The number of network nodes ranges from $N$ = 25 to $N$ = 75,811; the average is $\bar{N}$ = 4,955.6. The real-world network datasets analyzed in this study are available in the “Netzschleuder” repository [https://networks.skewed.de/], in the “Stanford Large Network Dataset Collection” repository [https://snap.stanford.edu/data/index.html], and in “the Colorado Index of Complex Networks (ICON)” repository [https://icon.colorado.edu/#!/]. The complete list of the real-world networks is provided in Supplementary Table Al in Supplementary Appendix A1.

2.3 Network structure indexes

We considered 20 different NSIs from the network science literature, graph theory, and chemical graph theory to predict $q_{c}$ in a large real-world network dataset. NSI adopted in this work covers most of the salient structural features of the real-world networks proposed in the network science literature, such as the node connectivity level [35], presence of a community structure [36, 37], degree heterogeneity [30, 38], node assortativity [39], node transitivity (or clustering) [3, 40], distance among nodes [41], and different notions of node centrality [42]. The list of NSIs is provided in Table 1.

TABLE 1

TABLE 1. Network structural indicator (NSI) list with a short definition and reference.

2.4 The network robustness

To evaluate the networks’ response to node attack, we trace $L C C$ as a function of the fraction of nodes removed $q$ . $L C C$ (also named the giant component) is the maximum number of connected nodes [1]. In other terms, $L C C$ is the maximal set of nodes in the network such that a path connects each node pair. $L C C$ is the most commonly used measure to evaluate the network response to node removal [11]. Then, to evaluate the network robustness to node attack, we use $q_{c}$ that represents the fraction of nodes to remove for reducing $L C C$ to quasi-zero [29]. This work defines $q_{c}$ as the fraction $q$ of nodes removed to reduce the $L C C$ value equal to or lower to 0.05 of its initial size. The lower the $q_{c}$ value, the lower the network robustness (Figure 1). Furthermore, the lower the $q_{c}$ value, the higher the efficacy of the node attack strategies to dismantle the network [29].

FIGURE 1

FIGURE 1. $L C C$ as a function of the node removal fraction ( $q$ ). The percolation threshold $q_{c}$ value corresponds to the q-value at which $L C C$ is quasi-zero. A higher percolation threshold $q_{c}$ denotes a slower $L C C$ decrease. Consequently, a higher percolation threshold $q_{c}$ denotes a more robust network. The red line presents lower $q_{c},$ describing a more vulnerable network response to node attack than the black strategy. In other words, the black line denotes a more robust network response to a node attack.

2.5 The linear regression models

We perform regression model analyses to understand the relationship between NSI and the $q_{c}$ value of the real-world networks. First, we perform SLR. The SLR model between $q_{c}$ and an NSI x is expressed by the following linear equation:

q_{c} = a + b \cdot N S I, (1)

where a is the intercept and $b$ is the slope. We choose the one with the highest R-squared among the significant SLRs to evaluate the best SLR model and, consequently, the best predictor. In linear regression, R-squared (R²), also named the coefficient of determination, measures how close the data points are to the fitted line. Higher R² denotes better regression fitting models [48].

Then, we perform MLR models. MLR is an extension of SLR for multi-dimension variables $x = (x_{1}, x_{2}, \dots, x_{n})$ . The linear equation between the $q_{c}$ value and NSIs becomes

q_{c} = a_{0} + a_{1} x_{1} + a_{2} x_{2} + . . . + a_{n} x_{n}, (2)

where $a_{i}$ are coefficients obtained performing the ordinary least square (OLS) method and $x_{1}, x_{2}, \dots, x_{n}$ are NSIs. The $a_{i}$ coefficients quantify the association between NSI (variable) and $q_{c}$ (response). We interpret $a_{i}$ as the average effect on $q_{c}$ of a one-unit increase in NSI, holding all other NSI predictors fixed [48]. In practice, we often have more than one predictor, and the MLR model, differently from SLR, can directly accommodate multiple predictors. To evaluate the best predictor carried out by the MLR model, we choose the significant NSI with the highest absolute t-value. The t-value used in MLR is the t di-student statistic value from a two-sided t-test. The larger the absolute value of the t-test statistic, the less likely the results occurred by chance [48]. For this, larger absolute t-values are associated with better predictors (NSIs).

We use the lm function of the R program to perform the SLR and MLR models. The fitting process is computed using the OLS method, which estimates the coefficients by minimizing an appropriate loss function [49].

Last, we perform the Pearson correlation coefficient ( $r$ ) to test the goodness of the correlation between NSI and $q_{c}$ . The $r$ coefficient is the most common way of measuring the strength of a linear correlation [50]. It is a number between −1 and 1 that measures the strength and direction of the relationship between two variables. To evaluate the best correlation performed by the $r$ coefficient, we choose the significant NSI with the highest absolute t-value. Last, we furnish the p-value to show the statistical significance of each model.

3 Results

Figure 2 shows the scatterplots of q_c vs. NSIs for the DEG node attack strategy. Figure 3 shows the scatterplots of q_c vs. NSIs for the BET node attack strategy.

FIGURE 2

FIGURE 2. Scatterplots of the percolation threshold (q_c) vs. the network structural indicators (NSIs) for the DEG node attack strategy, removing nodes with higher degrees first.

FIGURE 3

FIGURE 3. Scatterplots of the percolation threshold (q_c) vs. the network structural indicators (NSIs) for the BET node attack strategy, removing nodes with higher betweenness first.

Table 2 shows the outcomes of the SLR model. The best NSI to fit an SLR model with $q_{c}$ is the $E H$ index for both DEG (p-value <10^–4, R² = 0.567) and BET (p-value <10^–4, R² = 0.671) strategies. SLR $q_{c} \sim E H$ returns the lowest p-values and the highest R² for both node attack strategies (Table 2). The $q_{c} \sim E H$ fitting slopes are negative, indicating that q_c decreases as a function of $E H$ , i.e., the robustness of the network is negatively correlated with $E H$ for both node attack strategies (Figures 2, 3).

TABLE 2

TABLE 2. Single linear regression model outcomes. The best significant predictor with the highest R² value is in bold.

Table 3 shows the outcomes of the MLR model. The best NSI to predict $q_{c}$ with the MLR model is the $E H$ index for both DEG (t-value = −11.9, p-value <10^–23) and BET (t-value = −11.8, p-value <10^–23) strategies. MLR estimates a negative correlation between $q_{c}$ and $E H$ for both node attack strategies (negative correlation estimate, Table 3).

TABLE 3

TABLE 3. Multiple linear regression model outcomes. The best significant predictor with the highest absolute t-value is in bold.

Table 4 summarizes the $r$ coefficient test outcomes. The best NSI to correlate $q_{c}$ is the $E H$ index for both DEG (t-value = −11.9, p-value <10^–23) and BET (t-value = −11.8, p-value <10^–23) strategies. The $r$ coefficient estimates a negative correlation between $q_{c}$ and $E H$ for both node attack strategies (−16.063 for DEG and −20.035 for BET, Table 4).

TABLE 4

TABLE 4. Pearson correlation coefficient test outcomes. The best significant predictor with the highest absolute t-value is in bold.

4 Discussion

The $E H$ index is the best predictor of n $q_{c}$ in our NSI set. Estrada [30] proposed the $E H$ index as a unique characterization of network degree heterogeneity based on the difference in functions of node degrees for all pairs of linked nodes. $E H$ quantifies the degree heterogeneity of the network as a quadratic form of the Laplacian matrix of the network. It takes the value of zero if all nodes have the same degree as it happens in regular networks, and it is maximized when the difference of both degrees increases. The $E H$ index has two bound or limit structures, i.e., it is equal to zero for any regular network (where all nodes present the same degree) and equal to one only for star graphs, i.e., networks in which $N$ -1 nodes are directly connected to a single central node [30]. We find that $q_{c}$ decreases as a function of the $E H$ index (Figures 2, 3). This finding indicates that heterogeneous real-world networks with a higher variance in the degree of connected nodes are more vulnerable to node attacks.

$E H$ is conceived as a refining of the Albertson index ( $A H$ ), which computes the sum of the absolute value of the degree difference of the connected nodes [44]. The $A H$ index, its normalized version $n A H$ , and the node degree standard deviation $σ_{k}$ are all indicators we used to quantify the network degree heterogeneity. The statistical analyses we performed, both SLR and MLR and the $r$ coefficient test, indicate that these NSIs are not good predictors of $q_{c}$ . $σ_{k}$ did not return significant fittings for all statistical models (Tables 2–4). $σ_{k}$ evaluates the whole node degree heterogeneity, neglecting whether the node degree variance is among connected nodes. Differently, the $E H$ index measures the degree difference among connected nodes [30]. For this reason, we can argue that the node degree heterogeneity would play a significant role in affecting the network robustness only if the node degree heterogeneity is located (and evaluated) among connected nodes.

The third and fourth ring roads of Beijing City, the capital of China, are the real-world networks of the lowest $E H$ in our dataset ( $E H$ = 0.008 and 0.009). In these networks, nodes represent the road intersections and links depict the roads connecting nodes [51]. The connected nodes present homogenous degrees, and for this reason, removing higher-degree road intersections would cause a slower network fragmentation with very high $q_{c}$ values ( $q_{c}$ = 0.6 and 0.56), indicating lower network damage. On the contrary, the academia US faculty hiring network shows the highest $E H$ value ( $E H$ = 0.73). In this network, a node is a Ph.D.-granting institution, and a link from node $i$ to node $j$ indicates that a person received their Ph.D. from node $i$ and was tenure-track faculty at node $j$ [52]. This network presents the highest degree heterogeneity of connected nodes, i.e., famous higher-degree nodes/institutions are connected with many lower-degree institutions. Therefore, the removal of the highest degree nodes, i.e., the removal of famous institutions sending many Ph.D. to other institutions, can cause a quick network disconnection. Therefore, the academia US faculty hiring network returns a lower $q_{c}$ value ( $q_{c}$ = 0.13), indicating more significant network damage.

The $n A H$ index is computed by averaging the original $A H$ index over the number of links in the network. It can be viewed as the average degree difference among connected nodes [43]. $n A H$ shows significant fitting for SLR (Table 2). Nonetheless, R² of SLR is much higher for $E H$ than that for the $n A H$ index (0.567 for $E H$ and 0.200 for $n A H$ , Table 2), indicating that the $E H$ index can better explain the data. $A H$ returns significant fitting for MLR (Table 3), but the absolute t-value for $A H$ is much lower than that for $E H$ (Table 3). Furthermore, $A H$ did not return a significant $r$ coefficient test (Table 4). These statistical results indicate that only $E H$ correlates the nodes’ degree heterogeneity of the networks with their robustness to the attack of connected nodes. On the other hand, these results suggest that networks presenting, on average, similar node degrees of the connected nodes should be robust to node attack. For this reason, networks of lower $E H$ should show higher robustness to node attack and higher $q_{c}$ .

The assortativity coefficient $A$ (Table 1) measures how nodes tend to be connected with nodes of similar degrees [39]. “Assortative networks” present a preference for a network’s nodes to attach to others with similar node degrees [39, 53]. On the contrary, a network is “disassortative” when, on average, high-degree nodes are connected to nodes with a lower degree, and on average, low-degree nodes are connected to nodes with a higher degree. Positive values of $A$ indicate a correlation between nodes of similar degrees, while negative values indicate relationships between nodes of different degrees [39].

Given a certain node degree heterogeneity, assortative networks should have, on average, lower $E H$ than disassortative networks. The linear regression $A \sim E H$ indicates a negative correlation (p-value < 0.001) in our real-world network dataset and confirms this hypothesis, i.e., higher values of $A$ are associated with lower $E H$ (Figure 4).

FIGURE 4

FIGURE 4. Scatterplot of the assortativity coefficient (A) vs. the Estrada heterogeneity ( $E H$ ) index. The black line represents the significant linear regression $A = 0.2 - 0.84 \cdot E H$ (p-value < 0.001).

Consequently, assortative networks should show higher robustness to node attack and higher $q_{c}$ . According to this hypothesis, we find that $q_{c}$ increases as a function of A (Figures 2, 3), and all models SLR, MLR (Tables 2, 3 respectively), and the $r$ coefficient (Table 4) return a positive significant fitting between $A$ and $q_{c}$ . The literature research results corroborate this finding, unveiling that increasing the assortativity of a network makes the network more robust against node removal [26], and a moderate assortativity increase positively affects the network’s robustness against targeted node attacks [54]. Therefore, real-world networks with higher-degree differences of connected nodes are likely to present lower $q_{c}$ .

To further investigate the relationship between node degree heterogeneity and network robustness, we perform an MLR model holding only $E H$ and $A$ as predictors of $q_{c}$ , i.e., we fit the model $q_{c} \sim E H + A$ . The outcomes of this analysis are shown in Table 5. $E H$ is highly significant for the DEG strategy and presents the lowest t-value, whereas $A$ is not a significant predictor. $E H$ is highly significant for the BET strategy and presents a smaller t-value than $A$ . This finding supports $E H$ as NSI that can correlate with the real-world networks $q_{c}$ .

TABLE 5

TABLE 5. Multiple linear regression model $q_{c} \sim E H + A s s o$

5 Conclusion

Investigating node attack strategies provides valuable insights into enhancing network robustness by anticipating potential threats and identifying components that need protection. On the other side of the coin, node attack research plays a crucial role when the aim is to perform a fast network disruption, such as halting the spread of a disease or stopping the diffusion of a computer virus. Here, we investigate the relationship between the network structure and its robustness to node attack in a large dataset of real-world networks. Our results indicate that the degree heterogeneity of connected nodes negatively affects the network robustness. Specifically, the $E H$ index evaluates the node degree heterogeneity, and it is the best predictor of $q_{c}$ in our NSI set. This result unveils that heterogeneous real-world networks presenting higher differences in the degree of connected nodes are more vulnerable to node attacks. These results may help quantify real-world networked systems’ robustness and build more robust networks.

This paper presents some limitations that may open new lines of research. First, we perform linear regression models only. The relationship between NSIs and the percolation threshold qc of the real-world networks may follow nonlinear models. Therefore, a natural extension of this research may consider nonlinear regression models, such as logistic, monomolecular, or exponential functions, to describe the relationship between the structure and the percolation threshold of real-world networks. Then, we adopt an initial node attack approach to study network robustness. Future research may analyze the robustness of real-world networks using recalculated node attacks, in which node ranking is updated after each node removal. Last, it would be interesting to investigate how NSIs correlate with other robustness indexes besides $q_{c}$ , such as, for example, the network robustness index $R$ robustness proposed by Schneider et al [55]. The $R$ measurement considers the size of $L C C$ during the whole node attack process not only at the point the network collapses. Therefore, adopting R may unveil a new correlation pattern between NSIs and network robustness.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.

Author contributions

MB, RA, and DC conceived the research. MB wrote the simulation codes. MB and RA performed the simulations. MB performed statistical analyses. All authors contributed to the article and approved the submitted version.

Funding

This research is funded by a grant from the Italian Ministry of Foreign Affairs and International Cooperation, by the Ecosister project, funded under the National Recovery and Resilience Plan (NRRP), and Mission 4 Component 2 Investment 1.5—Call for tender No. 3277 of 30/12/2021 of Italian Ministry of University and Research funded by the European Union—NextGenerationEU Award Number: Project code ECS00000033, Concession Decree No. 1052 of 23/06/2022 adopted by the Italian Ministry. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. (816313)). This work is supported by the Vietnam’s Ministry of Science and Technology (MOST) under the Vietnam-Italy scientific and technological cooperation program for the period 2021–2023. This work is supported by the Vietnam National University Ho Chi Minh City (VNU-HCM), Ho Chi Minh City, Vietnam, under grant number B2018-42-01. This research is funded by a grant from the Italian Ministry of Foreign Affairs and International Cooperation.

Acknowledgments

MB, MT, DC, and RA acknowledge the Italian Ministry of Foreign Affairs and International Cooperation. The authors are greatly thankful to Van Lang University, Vietnam, for providing the budget for this study. This research has benefited from the high-performance computing (HPC) cluster of the Università degli Studi di Parma. They thank Fabio Sartori for the revision of the first manuscript draft. They also thank Prof. Stefano Poletti for the intriguing discussions about this research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphy.2023.1245564/full#supplementary-material

References

1. Boccaletti S, Latora V, Moreno Y, Chavez M, Hwang DU. Complex networks: Structure and dynamics. Phys Rep (2006) 424:175–308. doi:10.1016/j.physrep.2005.10.009