Shuyi Zhang1,2, Jacob R Leistico1,2, Raymond J Cho3, Jeffrey B Cheng3, Jun S Song1,2. 1. Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA. 2. Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA. 3. Department of Dermatology, University of California, San Francisco, San Francisco, CA, USA.
Abstract
MOTIVATION: Single-cell sequencing technologies that simultaneously generate multimodal cellular profiles present opportunities for improved understanding of cell heterogeneity in tissues. How the multimodal information can be integrated to obtain a common cell type identification, however, poses a computational challenge. Multilayer graphs provide a natural representation of multi-omic single-cell sequencing datasets, and finding cell clusters may be understood as a multilayer graph partition problem. RESULTS: We introduce two spectral algorithms on multilayer graphs, spectral clustering on multilayer graphs (SCML) and the weighted locally linear (WLL) method, to cluster cells in multi-omic single-cell sequencing datasets. We connect these algorithms through a unifying mathematical framework that represents each layer using a Hamiltonian operator and a mixture of its eigenstates to integrate the multiple graph layers, demonstrating in the process that the WLL method is a rigorous multilayer spectral graph theoretic reformulation of the popular Seurat weighted nearest neighbor (WNN) algorithm. Implementing our algorithms and applying them to a CITE-seq dataset of cord blood mononuclear cells yields results similar to the Seurat WNN analysis. Our work thus extends spectral methods to multimodal single-cell data analysis. AVAILABILITY: The code used in this study can be found at https://github.com/jssong-lab/sc-spectrum. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Single-cell sequencing technologies that simultaneously generate multimodal cellular profiles present opportunities for improved understanding of cell heterogeneity in tissues. How the multimodal information can be integrated to obtain a common cell type identification, however, poses a computational challenge. Multilayer graphs provide a natural representation of multi-omic single-cell sequencing datasets, and finding cell clusters may be understood as a multilayer graph partition problem. RESULTS: We introduce two spectral algorithms on multilayer graphs, spectral clustering on multilayer graphs (SCML) and the weighted locally linear (WLL) method, to cluster cells in multi-omic single-cell sequencing datasets. We connect these algorithms through a unifying mathematical framework that represents each layer using a Hamiltonian operator and a mixture of its eigenstates to integrate the multiple graph layers, demonstrating in the process that the WLL method is a rigorous multilayer spectral graph theoretic reformulation of the popular Seurat weighted nearest neighbor (WNN) algorithm. Implementing our algorithms and applying them to a CITE-seq dataset of cord blood mononuclear cells yields results similar to the Seurat WNN analysis. Our work thus extends spectral methods to multimodal single-cell data analysis. AVAILABILITY: The code used in this study can be found at https://github.com/jssong-lab/sc-spectrum. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Tim Stuart; Andrew Butler; Paul Hoffman; Christoph Hafemeister; Efthymia Papalexi; William M Mauck; Yuhan Hao; Marlon Stoeckius; Peter Smibert; Rahul Satija Journal: Cell Date: 2019-06-06 Impact factor: 41.582
Authors: Bo Wang; Aziz M Mezlini; Feyyaz Demir; Marc Fiume; Zhuowen Tu; Michael Brudno; Benjamin Haibe-Kains; Anna Goldenberg Journal: Nat Methods Date: 2014-01-26 Impact factor: 28.547
Authors: David van Dijk; Roshan Sharma; Juozas Nainys; Kristina Yim; Pooja Kathail; Ambrose J Carr; Cassandra Burdziak; Kevin R Moon; Christine L Chaffer; Diwakar Pattabiraman; Brian Bierie; Linas Mazutis; Guy Wolf; Smita Krishnaswamy; Dana Pe'er Journal: Cell Date: 2018-06-28 Impact factor: 41.582
Authors: Yale Liu; Christopher Cook; Andrew J Sedgewick; Shuyi Zhang; Marlys S Fassett; Roberto R Ricardo-Gonzalez; Paymann Harirchian; Sakeen W Kashem; Sho Hanakawa; Jacob R Leistico; Jeffrey P North; Mark A Taylor; Wei Zhang; Mao-Qiang Man; Alexandra Charruyer; Nadejda Beliakova-Bethell; Stephen C Benz; Ruby Ghadially; Theodora M Mauro; Daniel H Kaplan; Kenji Kabashima; Jaehyuk Choi; Jun S Song; Raymond J Cho; Jeffrey B Cheng Journal: iScience Date: 2020-09-19