| Literature DB >> 30343663 |
Finlay Campbell1, Xavier Didelot1, Rich Fitzjohn1, Neil Ferguson1, Anne Cori1, Thibaut Jombart2.
Abstract
BACKGROUND: Reconstructing individual transmission events in an infectious disease outbreak can provide valuable information and help inform infection control policy. Recent years have seen considerable progress in the development of methodologies for reconstructing transmission chains using both epidemiological and genetic data. However, only a few of these methods have been implemented in software packages, and with little consideration for customisability and interoperability. Users are therefore limited to a small number of alternatives, incompatible tools with fixed functionality, or forced to develop their own algorithms at considerable personal effort.Entities:
Keywords: Bayesian; Chain; Epidemics; Genomics; Likelihood; MCMC; Software; Transmission; Tree
Mesh:
Year: 2018 PMID: 30343663 PMCID: PMC6196407 DOI: 10.1186/s12859-018-2330-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Studies on outbreak reconstruction and their availability as software
| Study | Available as software | Study | Available as software |
|---|---|---|---|
| Cottam et al. [ | ✗ | Numinnen et al. [ | ✗ |
| Aldrin et al. [ | ✗ | Hall et al. [ | ✓ |
| Jombart et al. [ | ✓ | Worby et al. [ | ✓ |
| Ypma et al. [ | ✗ | Lau et al. [ | ✗ |
| Morelli et al. [ | ✗ | Soubeyrand [ | ✗ |
| Ypma et al. [ | ✗ | De Maio et al. [ | ✓ |
| Stadler et al. [ | ✓ | Kenah et al. [ | ✗ |
| Jombart et al. [ | ✓ | Klinkenberg et al. [ | ✓ |
| Didelot et al. [ | ✓ | Worby et al. [ | ✗ |
| Mollentze et al. [ | ✗ | Didelot et al. [ | ✓ |
Fig. 1Schematic representation of the code design of outbreaker2. Each disk represents a different component of the code. Disk size matches the size of the corresponding component, indicated by numbers (in lines of code, rounded to 50). Separate disks for likelihoods, priors and movements indicate independent C++ modules. Links represent flows of information between components, colored according to the input. Infrastructure and tests are globally connected to all components. Functions indicated within rectangles are entry points into the code, indicating possible customisation by the user
Epidemiological and evolutionary parameters for Ebola virus. When several studies are cited, the mean value weighted by the sample size of the study was used
| Parameter | Value | Reference |
|---|---|---|
| Generation time in days (SD) | 14.4 (8.9) | [ |
| Time-to-collection in days (SD) | 14.4 (8.9) | Assumed same as generation time |
| Basic reproduction number R0 | 1.8 | [ |
| Mutation rate (per site per day) | 3.1 × 10− 6 | [ |
| Genome length (bases) | 18,958 | [ |
Fig. 2MCMC traces of posterior likelihood for o2mod.TransPhylo and the original TransPhylo package
Fig. 3Posterior distribution of ancestry assignments using o2mod.TransPhylo and the original TransPhylo package. The size of each circle indicates the frequency of a given individual (“infector”) in the posterior distribution of infectors for a given case (“infectee”). An infector of 0 (bottom row) indicates that the individual is the index case. Black crosses represent the true simulated ancestries
Fig. 4Similarity of consensus trees inferred by o2mod.TransPhylo and outbreaker2 compared to TransPhylo. The consensus tree of a reconstructed outbreak is defined as the tree with the ancestor of the highest posterior probability for each case. The similarity between consensus trees is calculated as the proportion of identically assigned ancestries. The x-axis indicates individual simulated outbreaks. Each outbreak was reconstructed once using the default outbreaker2 model (white dots), and once using o2mod.TransPhylo (black dots). The colour of the line represents the change in similarity to the consensus tree returned by the original TransPhylo package