| Literature DB >> 23329414 |
François Chevenet1, Matthieu Jung, Martine Peeters, Tulio de Oliveira, Olivier Gascuel.
Abstract
MOTIVATION: Large phylogenies are being built today to study virus evolution, trace the origin of epidemics, establish the mode of transmission and survey the appearance of drug resistance. However, no tool is available to quickly inspect these phylogenies and combine them with extrinsic traits (e.g. geographic location, risk group, presence of a given resistance mutation), seeking to extract strain groups of specific interest or requiring surveillance.Entities:
Mesh:
Year: 2013 PMID: 23329414 PMCID: PMC3582263 DOI: 10.1093/bioinformatics/btt010
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Three examples of phylotypes, ranked by increasing complexity
Fig. 2.Global separation and support. The respective sizes of A and B (=1) are used to compute global criterion values (see text and Supplementary Material for detailed formula and algorithm)
Fig. 4.Phylotype map (ACCTRAN) of the worldwide study of HIV-1C. Some of the phylotypes (colored in red) have indirect origin; for example, 789: Eastern Europe, with Southern Africa annotation(s) along the path to 1: Central Africa
Detailed table of the significant phylotypes found with HIV-1A, Albania data set
| Pi | Anc | A | Cov (%) | Tt | Df | Sl | Sg | Dv | Sl/Dv | Sg/Dv | Spg | AnB | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | root | Africa | 88 | 152 | 16 | 0.009 | 0.009 | 0.172 | 0.051 | 0.051 | 1.000 | – | ||||
| 17 | 14 | Albania | 97 | 32 | 1 | 0.017 | 0.059 | 0.024 | 0.718 | 2.415 | 0.960 | – | ||||
| 251 | 1 | EastEurope | 80 | 9 | 1 | 0.057 | 0.065 | 0.035 | 1.646 | 1.886 | 1.000 | – | ||||
| 14 | 1 | Greece | 69 | 60 | 2 | 0.009 | 0.073 | 0.066 | 0.132 | 1.097 | 0.880 | – | ||||
Selection criteria (displayed with bold characters) are Size (Sz ≥5), Persistence (Ps ≥1), Size/Different (Sz/Df ≥1) and Support (Sp ≥70%). P-values (in italic) are given as fractions, where the denominator indicates the number of shuffles. The analysis was run with ACCTRAN option. Pi, identifier of phylotype root; Anc, phylotype origin; A, annotation; Cov (%), coverage, i.e. percentage of taxa annotated with A that belongs to the phylotype; Tt, Total; Sl, Local separation; Sg, Global separation; Dv, Diversity; Spg, Global support; AnB, list of ‘Breaking’ annotations when the origin is indirect (see text and examples in Fig. 4).
Fig. 3.Tree graphics obtained in the study of the epidemiological history of HIV-1A in Albania (Salemi ). (a) Phylogenetic tree in “background” format: selected phylotypes and their strains are colored; colored regions comprise all (uniquely annotated) nodes on the path from the phylotype root to the phylotype members; not colored (black) strains do not belong to any phylotype; the root node identifiers of phylotypes are provided, to be used in conjunction with the detailed table (Table 1). (b) Phylotype map, summarizing the information contained in phylogenetic tree (a); circle surface is proportional to the Size value (number of members) of the phylotype