Chieh Lin1, Ziv Bar-Joseph1,2. 1. Machine Learning Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, US. 2. Computational Biology Department, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, US.
Abstract
MOTIVATION: Methods for reconstructing developmental trajectories from time-series single-cell RNA-Seq (scRNA-Seq) data can be largely divided into two categories. The first, often referred to as pseudotime ordering methods are deterministic and rely on dimensionality reduction followed by an ordering step. The second learns a probabilistic branching model to represent the developmental process. While both types have been successful, each suffers from shortcomings that can impact their accuracy. RESULTS: We developed a new method based on continuous-state HMMs (CSHMMs) for representing and modeling time-series scRNA-Seq data. We define the CSHMM model and provide efficient learning and inference algorithms which allow the method to determine both the structure of the branching process and the assignment of cells to these branches. Analyzing several developmental single-cell datasets, we show that the CSHMM method accurately infers branching topology and correctly and continuously assign cells to paths, improving upon prior methods proposed for this task. Analysis of genes based on the continuous cell assignment identifies known and novel markers for different cell types. AVAILABILITY AND IMPLEMENTATION: Software and Supporting website: www.andrew.cmu.edu/user/chiehl1/CSHMM/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Methods for reconstructing developmental trajectories from time-series single-cell RNA-Seq (scRNA-Seq) data can be largely divided into two categories. The first, often referred to as pseudotime ordering methods are deterministic and rely on dimensionality reduction followed by an ordering step. The second learns a probabilistic branching model to represent the developmental process. While both types have been successful, each suffers from shortcomings that can impact their accuracy. RESULTS: We developed a new method based on continuous-state HMMs (CSHMMs) for representing and modeling time-series scRNA-Seq data. We define the CSHMM model and provide efficient learning and inference algorithms which allow the method to determine both the structure of the branching process and the assignment of cells to these branches. Analyzing several developmental single-cell datasets, we show that the CSHMM method accurately infers branching topology and correctly and continuously assign cells to paths, improving upon prior methods proposed for this task. Analysis of genes based on the continuous cell assignment identifies known and novel markers for different cell types. AVAILABILITY AND IMPLEMENTATION: Software and Supporting website: www.andrew.cmu.edu/user/chiehl1/CSHMM/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Sean C Bendall; Kara L Davis; El-Ad David Amir; Michelle D Tadmor; Erin F Simonds; Tiffany J Chen; Daniel K Shenfeld; Garry P Nolan; Dana Pe'er Journal: Cell Date: 2014-04-24 Impact factor: 41.582
Authors: Eugenio Marco; Robert L Karp; Guoji Guo; Paul Robson; Adam H Hart; Lorenzo Trippa; Guo-Cheng Yuan Journal: Proc Natl Acad Sci U S A Date: 2014-12-15 Impact factor: 11.205
Authors: Florian Buettner; Kedar N Natarajan; F Paolo Casale; Valentina Proserpio; Antonio Scialdone; Fabian J Theis; Sarah A Teichmann; John C Marioni; Oliver Stegle Journal: Nat Biotechnol Date: 2015-01-19 Impact factor: 54.908
Authors: Jun Ding; Bruce J Aronow; Naftali Kaminski; Joseph Kitzmiller; Jeffrey A Whitsett; Ziv Bar-Joseph Journal: Genome Res Date: 2018-01-09 Impact factor: 9.043
Authors: Killian Hurley; Jun Ding; Carlos Villacorta-Martin; Michael J Herriges; Anjali Jacob; Marall Vedaie; Konstantinos D Alysandratos; Yuliang L Sun; Chieh Lin; Rhiannon B Werder; Jessie Huang; Andrew A Wilson; Aditya Mithal; Gustavo Mostoslavsky; Irene Oglesby; Ignacio S Caballero; Susan H Guttentag; Farida Ahangari; Naftali Kaminski; Alejo Rodriguez-Fraticelli; Fernando Camargo; Ziv Bar-Joseph; Darrell N Kotton Journal: Cell Stem Cell Date: 2020-01-30 Impact factor: 24.633