Anastasia Deckard1, Ron C Anafi, John B Hogenesch, Steven B Haase, John Harer. 1. Program in Computational Biology and Bioinformatics, Department of Mathematics, Duke University, Durham, NC 27708, USA, Department of Medicine, Department of Pharmacology, Institute for Translational Medicine and Therapeutics, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA and Department of Biology, Duke University, Durham, NC 27708, USA.
Abstract
MOTIVATION: To discover and study periodic processes in biological systems, we sought to identify periodic patterns in their gene expression data. We surveyed a large number of available methods for identifying periodicity in time series data and chose representatives of different mathematical perspectives that performed well on both synthetic data and biological data. Synthetic data were used to evaluate how each algorithm responds to different curve shapes, periods, phase shifts, noise levels and sampling rates. The biological datasets we tested represent a variety of periodic processes from different organisms, including the cell cycle and metabolic cycle in Saccharomyces cerevisiae, circadian rhythms in Mus musculus and the root clock in Arabidopsis thaliana. RESULTS: From these results, we discovered that each algorithm had different strengths. Based on our findings, we make recommendations for selecting and applying these methods depending on the nature of the data and the periodic patterns of interest. Additionally, these results can also be used to inform the design of large-scale biological rhythm experiments so that the resulting data can be used with these algorithms to detect periodic signals more effectively.
MOTIVATION: To discover and study periodic processes in biological systems, we sought to identify periodic patterns in their gene expression data. We surveyed a large number of available methods for identifying periodicity in time series data and chose representatives of different mathematical perspectives that performed well on both synthetic data and biological data. Synthetic data were used to evaluate how each algorithm responds to different curve shapes, periods, phase shifts, noise levels and sampling rates. The biological datasets we tested represent a variety of periodic processes from different organisms, including the cell cycle and metabolic cycle in Saccharomyces cerevisiae, circadian rhythms in Mus musculus and the root clock in Arabidopsis thaliana. RESULTS: From these results, we discovered that each algorithm had different strengths. Based on our findings, we make recommendations for selecting and applying these methods depending on the nature of the data and the periodic patterns of interest. Additionally, these results can also be used to inform the design of large-scale biological rhythm experiments so that the resulting data can be used with these algorithms to detect periodic signals more effectively.
Authors: Ulrik de Lichtenberg; Lars Juhl Jensen; Anders Fausbøll; Thomas S Jensen; Peer Bork; Søren Brunak Journal: Bioinformatics Date: 2004-10-28 Impact factor: 6.937
Authors: P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher Journal: Mol Biol Cell Date: 1998-12 Impact factor: 4.138
Authors: Chun-Yi Cho; Francis C Motta; Christina M Kelliher; Anastasia Deckard; Steven B Haase Journal: Cell Cycle Date: 2017-09-21 Impact factor: 4.534
Authors: Christopher M Depner; Edward L Melanson; Andrew W McHill; Kenneth P Wright Journal: Proc Natl Acad Sci U S A Date: 2018-05-21 Impact factor: 11.205
Authors: Breschine Cummins; Francis C Motta; Robert C Moseley; Anastasia Deckard; Sophia Campione; Marcio Gameiro; Tomáš Gedeon; Konstantin Mischaikow; Steven B Haase Journal: PLoS Comput Biol Date: 2022-10-10 Impact factor: 4.779