Literature DB >> 30726979

Robust Design for Coalescent Model Inference.

Kris V Parag1, Oliver G Pybus1.   

Abstract

The coalescent process describes how changes in the size or structure of a population influence the genealogical patterns of sequences sampled from that population. The estimation of (effective) population size changes from genealogies that are reconstructed from these sampled sequences is an important problem in many biological fields. Often, population size is characterized by a piecewise-constant function, with each piece serving as a population size parameter to be estimated. Estimation quality depends on both the statistical coalescent inference method employed, and on the experimental protocol, which controls variables such as the sampling of sequences through time and space, or the transformation of model parameters. While there is an extensive literature on coalescent inference methodology, there is comparatively little work on experimental design. The research that does exist is largely simulation-based, precluding the development of provable or general design theorems. We examine three key design problems: temporal sampling of sequences under the skyline demographic coalescent model, spatio-temporal sampling under the structured coalescent model, and time discretization for sequentially Markovian coalescent models. In all cases, we prove that 1) working in the logarithm of the parameters to be inferred (e.g., population size) and 2) distributing informative coalescent events uniformly among these log-parameters, is uniquely robust. "Robust" means that the total and maximum uncertainty of our parameter estimates are minimized, and made insensitive to their unknown (true) values. This robust design theorem provides rigorous justification for several existing coalescent experimental design decisions and leads to usable guidelines for future empirical or simulation-based investigations. Given its persistence among models, this theorem may form the basis of an experimental design paradigm for coalescent inference.
© The Author(s) 2019. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  Coalescent theory; experimental design; population genetic inference; sequential Markovian coalescent; skyline models; structured coalescent

Mesh:

Year:  2019        PMID: 30726979     DOI: 10.1093/sysbio/syz008

Source DB:  PubMed          Journal:  Syst Biol        ISSN: 1063-5157            Impact factor:   15.683


  8 in total

1.  Adaptive Estimation for Epidemic Renewal and Phylogenetic Skyline Models.

Authors:  Kris V Parag; Christl A Donnelly
Journal:  Syst Biol       Date:  2020-11-01       Impact factor: 15.683

2.  Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?

Authors:  Kris V Parag; Oliver G Pybus; Chieh-Hsi Wu
Journal:  Syst Biol       Date:  2021-12-16       Impact factor: 15.683

3.  Using information theory to optimise epidemic models for real-time prediction and estimation.

Authors:  Kris V Parag; Christl A Donnelly
Journal:  PLoS Comput Biol       Date:  2020-07-01       Impact factor: 4.475

4.  Inference of past demography, dormancy and self-fertilization rates from whole genome sequence data.

Authors:  Thibaut Paul Patrick Sellinger; Diala Abu Awad; Markus Moest; Aurélien Tellier
Journal:  PLoS Genet       Date:  2020-04-06       Impact factor: 5.917

5.  Jointly Inferring the Dynamics of Population Size and Sampling Intensity from Molecular Sequences.

Authors:  Kris V Parag; Louis du Plessis; Oliver G Pybus
Journal:  Mol Biol Evol       Date:  2020-08-01       Impact factor: 16.240

6.  Adaptive Preferential Sampling in Phylodynamics With an Application to SARS-CoV-2.

Authors:  Lorenzo Cappello; Julia A Palacios
Journal:  J Comput Graph Stat       Date:  2021-11-29       Impact factor: 1.884

7.  Robust inference of population size histories from genomic sequencing data.

Authors:  Gautam Upadhya; Matthias Steinrücken
Journal:  PLoS Comput Biol       Date:  2022-09-16       Impact factor: 4.779

8.  Demographic Histories and Genome-Wide Patterns of Divergence in Incipient Species of Shorebirds.

Authors:  Xuejing Wang; Kathryn H Maher; Nan Zhang; Pinjia Que; Chenqing Zheng; Simin Liu; Biao Wang; Qin Huang; Xu Yang; Zhengwang Zhang; Tamás Székely; Araxi O Urrutia; Yang Liu
Journal:  Front Genet       Date:  2019-11-08       Impact factor: 4.599

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.