Literature DB >> 32887799

Stacking models for nearly optimal link prediction in complex networks.

Amir Ghasemian1,2,3, Homa Hosseinmardi2, Aram Galstyan2, Edoardo M Airoldi3,4, Aaron Clauset1,5,6.   

Abstract

Most real-world networks are incompletely observed. Algorithms that can accurately predict which links are missing can dramatically speed up network data collection and improve network model validation. Many algorithms now exist for predicting missing links, given a partially observed network, but it has remained unknown whether a single best predictor exists, how link predictability varies across methods and networks from different domains, and how close to optimality current methods are. We answer these questions by systematically evaluating 203 individual link predictor algorithms, representing three popular families of methods, applied to a large corpus of 550 structurally diverse networks from six scientific domains. We first show that individual algorithms exhibit a broad diversity of prediction errors, such that no one predictor or family is best, or worst, across all realistic inputs. We then exploit this diversity using network-based metalearning to construct a series of "stacked" models that combine predictors into a single algorithm. Applied to a broad range of synthetic networks, for which we may analytically calculate optimal performance, these stacked models achieve optimal or nearly optimal levels of accuracy. Applied to real-world networks, stacked models are superior, but their accuracy varies strongly by domain, suggesting that link prediction may be fundamentally easier in social networks than in biological or technological networks. These results indicate that the state of the art for link prediction comes from combining individual algorithms, which can achieve nearly optimal predictions. We close with a brief discussion of limitations and opportunities for further improvements.

Keywords:  link prediction; metalearning; near optimality; networks; stacking

Mesh:

Year:  2020        PMID: 32887799      PMCID: PMC7519231          DOI: 10.1073/pnas.1914950117

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  9 in total

1.  Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach.

Authors:  Vasyl Pihur; Susmita Datta; Somnath Datta
Journal:  Bioinformatics       Date:  2007-05-05       Impact factor: 6.937

2.  Missing and spurious interactions and the reconstruction of complex networks.

Authors:  Roger Guimerà; Marta Sales-Pardo
Journal:  Proc Natl Acad Sci U S A       Date:  2009-12-14       Impact factor: 11.205

3.  Hierarchical structure and the prediction of missing links in networks.

Authors:  Aaron Clauset; Cristopher Moore; M E J Newman
Journal:  Nature       Date:  2008-05-01       Impact factor: 49.962

4.  Consistencies and inconsistencies between model selection and link prediction in networks.

Authors:  Toni Vallès-Català; Tiago P Peixoto; Marta Sales-Pardo; Roger Guimerà
Journal:  Phys Rev E       Date:  2018-06       Impact factor: 2.529

5.  Link-Prediction Enhanced Consensus Clustering for Complex Networks.

Authors:  Matthew Burgess; Eytan Adar; Michael Cafarella
Journal:  PLoS One       Date:  2016-05-20       Impact factor: 3.240

6.  node2vec: Scalable Feature Learning for Networks.

Authors:  Aditya Grover; Jure Leskovec
Journal:  KDD       Date:  2016-08

7.  Significant communities in large sparse networks.

Authors:  Atieh Mirshahvalad; Johan Lindholm; Mattias Derlén; Martin Rosvall
Journal:  PLoS One       Date:  2012-03-30       Impact factor: 3.240

8.  The ground truth about metadata and community detection in networks.

Authors:  Leto Peel; Daniel B Larremore; Aaron Clauset
Journal:  Sci Adv       Date:  2017-05-03       Impact factor: 14.136

9.  Statistical and Machine Learning forecasting methods: Concerns and ways forward.

Authors:  Spyros Makridakis; Evangelos Spiliotis; Vassilios Assimakopoulos
Journal:  PLoS One       Date:  2018-03-27       Impact factor: 3.240

  9 in total
  2 in total

1.  Link recommendation algorithms and dynamics of polarization in online social networks.

Authors:  Fernando P Santos; Yphtach Lelkes; Simon A Levin
Journal:  Proc Natl Acad Sci U S A       Date:  2021-12-14       Impact factor: 12.779

2.  Functional Structure in Production Networks.

Authors:  Carolina E S Mattsson; Frank W Takes; Eelke M Heemskerk; Cees Diks; Gert Buiten; Albert Faber; Peter M A Sloot
Journal:  Front Big Data       Date:  2021-05-21
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.