| Literature DB >> 28451057 |
Jaime Iranzo1, Mart Krupovic2, Eugene V Koonin1.
Abstract
Viral evolution is characterized by high rates of horizontal gene transfer and fast sequence divergence. Furthermore, there are no universal genes shared by all viruses. As a result, distant relationships among viruses are better represented by a network than by a tree. Here we discuss 3 network representations of the virus world with decreasing levels of complexity, from a multilayer network that integrates sequence conservation and patterns of gene sharing to a classic genome similarity network. As new tools for network analysis are developed, we expect that novel insights into virus evolution will result from the study of more complex representations of the virus world.Entities:
Keywords: bipartite network; gene sharing network; multilayer network; phylogenomics; viral evolution; viral taxonomy
Year: 2017 PMID: 28451057 PMCID: PMC5398231 DOI: 10.1080/19420889.2017.1296614
Source DB: PubMed Journal: Commun Integr Biol ISSN: 1942-0889
Figure 1.Three network representations of a toy virus world composed of 4 viral genomes (squares) and 12 genes (black circles) that belong to 5 gene families (white circles). (A) Two-layer network, with the gene layer on top and the genome layer at the bottom. Black edges of different thickness indicate the similarity between sequences in the gene layer. (B) Bipartite network, which results from clustering groups of homologous genes in the gene layer into gene family nodes. (C) Genome similarity network; the thickness of the links is proportional to the number of shared gene families.
Figure 2.Hierarchical structure of a portion of the bipartite network for tailed bacteriophages (order Caudovirales). On the small scale, sets of related viruses and their associated gene families form densely connected modules. Within a module, genome nodes are represented as colored circles, whereas gene family nodes are denoted by the points where the edges (gray and colored lines) join. Colored edges connect the genomes of a module with the module's signature genes. On the large scale, modules connect with each other through shared connector genes, represented here as small gray circles. The 4 hallmark genes that are shared by most members of the order Caudovirales occupy a central position in the network (small black circles). This portion of the network corresponds to modules 9a, 9d, 12, 13, and 18 from ref. 15. MCP, major capsid protein.