| Literature DB >> 34491104 |
Yury Orlovich1, Kirill Kukharenko2, Volker Kaibel2, Pavel Skums3.
Abstract
We study the algorithmic problem of finding the most "scale-free-like" spanning tree of a connected graph. This problem is motivated by the fundamental problem of genomic epidemiology: given viral genomes sampled from infected individuals, reconstruct the transmission network ("who infected whom"). We use two possible objective functions for this problem and introduce the corresponding algorithmic problems termed m-SF (-scale free) and s-SF Spanning Tree problems. We prove that those problems are APX- and NP-hard, respectively, even in the classes of cubic and bipartite graphs. We propose two integer linear programming (ILP) formulations for the s-SF Spanning Tree problem, and experimentally assess its performance using simulated and experimental data. In particular, we demonstrate that the ILP-based approach allows for accurate reconstruction of transmission histories of several hepatitis C outbreaks.Entities:
Keywords: computational complexity; genomic epidemiology; integer linear programming; scale-free network; transmission network
Mesh:
Year: 2021 PMID: 34491104 PMCID: PMC8670573 DOI: 10.1089/cmb.2020.0500
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479
FIG. 1.Neighbor switch.
FIG. 2.An example of the graph G for , , , , and . Here each vertex labeled represents a set .
FIG. 3.Running times of ILP solver and QUENTIN on Erdős–Rényi graphs (a) and grids (b). ILP, integer linear programming.
FIG. 4.Running times of ILP solver on Barabási–Albert (a) and NetworkX (b) scale-free graphs.
FIG. 5.Accuracy of s-SF ILP model compared with the phylogenetic trait inference algorithm.
Results on Experimental Data with Different Models
| Methods | Evaluation metric | ||
|---|---|---|---|
| (A) | (B) | (C) | |
| QUENTIN | 0.9 | 0.78 | 0.98 |
| 0.9 | 0.92 | 1.0 | |
| 0.9 | 0.92 | 1.0 | |
(A) Superspreader inference accuracy, (B) accuracy of transmission link inference, and (C) accuracy of transmission ancestry inference.