| Literature DB >> 34295510 |
Christopher H Remien1, Mariah J Eckwright2, Benjamin J Ridenhour1.
Abstract
Population dynamic models can be used in conjunction with time series of species abundances to infer interactions. Understanding microbial interactions is a prerequisite for numerous goals in microbiome research, including predicting how populations change over time, determining how manipulations of microbiomes affect dynamics and designing synthetic microbiomes to perform tasks. As such, there is great interest in adapting population dynamic theory for microbial systems. Despite the appeal, numerous hurdles exist. One hurdle is that the data commonly obtained from DNA sequencing yield estimates of relative abundances, while population dynamic models such as the generalized Lotka-Volterra model track absolute abundances or densities. It is not clear whether relative abundance data alone can be used to infer parameters of population dynamic models such as the Lotka-Volterra model. We used structural identifiability analyses to determine the extent to which a time series of relative abundances can be used to parametrize the generalized Lotka-Volterra model. We found that only with absolute abundance data to accompany relative abundance estimates from sequencing can all parameters be uniquely identified. However, relative abundance data alone do contain information on relative interaction strengths, which is sufficient for many studies where the goal is to estimate key interactions and their effects on dynamics. Using synthetic data of a simple community for which we know the underlying structure, local practical identifiability analysis showed that modest amounts of both process and measurement error do not fundamentally affect these identifiability properties.Entities:
Keywords: dynamics; generalized Lotka–Volterra; mathematical model; microbiome; parameter identifiability; time-series
Year: 2021 PMID: 34295510 PMCID: PMC8292772 DOI: 10.1098/rsos.201378
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Parameter values used in numerical simulations.
| parameter | value 1 | value 2 |
|---|---|---|
| 6 | 6 | |
| 4 | 4 | |
| 2 | 2 | |
| −0.05 | −5 | |
| 0.15 | 15 | |
| −0.20 | −20 | |
| −0.01 | −1.0 | |
| 0.05 | 5.0 | |
| 0.10 | 10 | |
| −0.10 | −10 | |
| 10 | 0.10 | |
| 14 | 0.14 | |
| 4 | 0.04 | |
| 28 | 0.28 |
Figure 1Multiple parameter sets lead to identical relative abundance dynamics. Comparison of two gLV systems with different parameters (table 1) that yield different absolute abundances (top panels) but identical relative abundances (bottom panels). The non-identifiable parameters can be scaled by a constant to produce infinitely many systems with identical dynamics in terms of the relative abundances. (The scaling constant chosen here was 100.)
Figure 2Fits of the synthetic three-species gLV system with process and observation error. The left-hand column shows the results for the ‘value 1’ set of parameters (table 1), while the right-hand column shows the results for the ‘value 2’ set of parameters. Note that the scales for absolute abundance vary for the absolute abundances generated from the two sets of parameters. Fitting was done by starting the algorithm near the correct parameters set and giving it the correct total population size at t = 0 (N0 = 0.28 and N0 = 28, respectively). The fits for the relative abundance data are quite similar (last row), as are the extrapolated fits of the absolute abundance data (middle row). Relative error of the parameter estimates was defined as relative error , where x is the estimated value and is the true value. The relative error was bounded on [−0.4, 0.4] by the assumed priors.
Figure 3Reconstruction of relative population size. If a model is fit where it is assumed that N0 = 1, then information is recovered about the fold-change in the population’s size over time. The mean estimate (solid red line) does a fairly good job of matching the stochastic population trajectory (blue line). The dashed red lines show the range of population values produced by numerically solving the three-species gLV system for all accepted parameter sets in the MCMC algorithm.