| Literature DB >> 33265865 |
Matúš Medo1,2,3, Manuel Sebastian Mariani1,4, Linyuan Lü1,5.
Abstract
Real networks typically studied in various research fields-ecology and economic complexity, for example-often exhibit a nested topology, which means that the neighborhoods of high-degree nodes tend to include the neighborhoods of low-degree nodes. Focusing on nested networks, we study the problem of link prediction in complex networks, which aims at identifying likely candidates for missing links. We find that a new method that takes network nestedness into account outperforms well-established link-prediction methods not only when the input networks are sufficiently nested, but also for networks where the nested structure is imperfect. Our study paves the way to search for optimal methods for link prediction in nested networks, which might be beneficial for World Trade and ecological network analysis.Entities:
Keywords: bipartite networks; link prediction; nested networks
Year: 2018 PMID: 33265865 PMCID: PMC7512339 DOI: 10.3390/e20100777
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1A toy example of a bipartite network where and . Network links are shown with the solid lines. The dashed line shows the possible link between nodes i and whose likelihood is being evaluated. The thick lines highlight the only existing link between the neighbors of nodes i and (see the prediction method “Number of Local Community Links”). The preferential attachment index (PrefA) and LCL scores of the link between the highlighted nodes are 6 and 1, respectively.
Figure 2An illustration of synthetic nested networks. (A) Nestedness contours for various values of the parameter. (B–D) Nested networks with and : Perfectly nested network (B), nested network with low noise (C), and in-block nested network with three blocks and low noise (parameter values are specified in the panels). The results are averaged over 10 model realizations and 10 independently chosen probe sets for each realization.
Basic properties of the real datasets used to evaluate link prediction methods: Number of rows (), columns (), edges (E), and the density of edges [].
| Dataset |
|
|
|
|
|---|---|---|---|---|
| M_SD_022 | 207 | 110 | 1121 | 0.05 |
| M_PL_015 | 131 | 666 | 2933 | 0.03 |
| M_PL_021 | 91 | 677 | 1193 | 0.02 |
| M_PL_044 | 110 | 609 | 1125 | 0.02 |
| M_PL_057 | 114 | 883 | 1920 | 0.02 |
| M_PL_062 | 456 | 1044 | 15,255 | 0.03 |
| CP-2001 | 169 | 781 | 17,639 | 0.13 |
| CP-2009 | 168 | 774 | 17,739 | 0.14 |
Figure 3Link prediction results on model data with , , and no community structure. To remove the strong dependency of method performance on the randomization parameter p, the shown results are scaled with the results of the simplest PrefA method.
Figure 4Link prediction results on model data with , , , (two blocks, no links between the blocks). As in Figure 3, results are again scaled with the results of the simplest PrefA method.
Mean link prediction results on real datasets. Best performance values for a given method and metric are highlighted with bold. Results are averaged over 100 independently chosen probe sets. Standard error of the mean is less than 0.005 in all cases. If NViol produces best AUC on transposed data, it is labeled as NViol.
| method |
| AUC |
|
| method |
| AUC |
|
| |
|
| 0.20 | 0.80 | 0.11 | 0.13 |
| 0.38 | 0.61 | 0.12 | 0.10 | |
|
| 0.19 | 0.80 | 0.14 | 0.15 |
| 0.34 | 0.59 | 0.14 | 0.12 | |
|
|
|
|
|
|
| 0.33 | 0.61 |
|
| |
|
| 0.19 | 0.81 | 0.11 | 0.12 |
|
|
| 0.12 | 0.10 | |
| M_PL_015 | M_PL_062 | |||||||||
| method |
| AUC |
|
| method |
| AUC |
|
| |
|
| 0.23 | 0.77 | 0.12 | 0.09 |
| 0.20 | 0.80 | 0.05 | 0.05 | |
|
| 0.19 | 0.80 | 0.20 | 0.14 |
| 0.17 | 0.83 | 0.12 | 0.07 | |
|
|
|
|
|
|
|
|
|
|
| |
|
| 0.24 | 0.75 | 0.13 | 0.08 |
| 0.20 | 0.80 | 0.04 | 0.05 | |
| M_PL_021 | CP-2001 | |||||||||
| method |
| AUC |
|
| method |
| AUC |
|
| |
|
| 0.49 | 0.49 | 0.06 | 0.07 |
| 0.23 | 0.78 | 0.05 | 0.10 | |
|
| 0.41 | 0.46 |
| 0.09 |
| 0.21 | 0.79 |
|
| |
|
| 0.40 | 0.48 |
|
|
|
|
| 0.14 | 0.12 | |
|
|
|
| 0.06 | 0.05 |
| 0.24 | 0.76 | 0.06 | 0.10 | |
| M_PL_044 | CP-2009 | |||||||||
| method |
| AUC |
|
| method |
| AUC |
|
| |
|
| 0.47 | 0.52 | 0.05 | 0.06 |
| 0.24 | 0.77 | 0.07 | 0.10 | |
|
| 0.38 | 0.45 |
|
|
| 0.22 | 0.79 |
|
| |
|
| 0.37 | 0.46 | 0.06 |
|
|
|
| 0.13 | 0.12 | |
|
|
|
| 0.04 | 0.05 |
| 0.25 | 0.75 | 0.08 | 0.10 | |