Literature DB >> 30174778

A UNIFIED STATISTICAL FRAMEWORK FOR SINGLE CELL AND BULK RNA SEQUENCING DATA.

Lingxue Zhu1, Jing Lei1, Bernie Devlin2, Kathryn Roeder1.   

Abstract

Recent advances in technology have enabled the measurement of RNA levels for individual cells. Compared to traditional tissue-level bulk RNA-seq data, single cell sequencing yields valuable insights about gene expression profiles for different cell types, which is potentially critical for understanding many complex human diseases. However, developing quantitative tools for such data remains challenging because of high levels of technical noise, especially the "dropout" events. A "dropout" happens when the RNA for a gene fails to be amplified prior to sequencing, producing a "false" zero in the observed data. In this paper, we propose a Unified RNA-Sequencing Model (URSM) for both single cell and bulk RNA-seq data, formulated as a hierarchical model. URSM borrows the strength from both data sources and carefully models the dropouts in single cell data, leading to a more accurate estimation of cell type specific gene expression profile. In addition, URSM naturally provides inference on the dropout entries in single cell data that need to be imputed for downstream analyses, as well as the mixing proportions of different cell types in bulk samples. We adopt an empirical Bayes' approach, where parameters are estimated using the EM algorithm and approximate inference is obtained by Gibbs sampling. Simulation results illustrate that URSM outperforms existing approaches both in correcting for dropouts in single cell data, as well as in deconvolving bulk samples. We also demonstrate an application to gene expression data on fetal brains, where our model successfully imputes the dropout genes and reveals cell type specific expression patterns.

Entities:  

Keywords:  EM algorithm; Gibbs sampling; Single cell RNA sequencing; empirical Bayes; hierarchical model

Year:  2018        PMID: 30174778      PMCID: PMC6114100          DOI: 10.1214/17-AOAS1110

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  32 in total

1.  Normalizing single-cell RNA sequencing data: challenges and opportunities.

Authors:  Catalina A Vallejos; Davide Risso; Antonio Scialdone; Sandrine Dudoit; John C Marioni
Journal:  Nat Methods       Date:  2017-05-15       Impact factor: 28.547

2.  Dirichlet Process Mixture Model for Correcting Technical Variation in Single-Cell Gene Expression Data.

Authors:  Sandhya Prabhakaran; Elham Azizi; Ambrose Carr; Dana Pe'er
Journal:  JMLR Workshop Conf Proc       Date:  2016

3.  Diverse Non-genetic, Allele-Specific Expression Effects Shape Genetic Architecture at the Cellular Level in the Mammalian Brain.

Authors:  Wei-Chao Huang; Elliott Ferris; Tong Cheng; Cornelia Stacher Hörndli; Kelly Gleason; Carol Tamminga; Janice D Wagner; Kenneth M Boucher; Jan L Christian; Christopher Gregg
Journal:  Neuron       Date:  2017-02-23       Impact factor: 17.173

4.  Gene expression elucidates functional impact of polygenic risk for schizophrenia.

Authors:  Menachem Fromer; Panos Roussos; Solveig K Sieberts; Jessica S Johnson; David H Kavanagh; Thanneer M Perumal; Douglas M Ruderfer; Edwin C Oh; Aaron Topol; Hardik R Shah; Lambertus L Klei; Robin Kramer; Dalila Pinto; Zeynep H Gümüş; A Ercument Cicek; Kristen K Dang; Andrew Browne; Cong Lu; Lu Xie; Ben Readhead; Eli A Stahl; Jianqiu Xiao; Mahsa Parvizi; Tymor Hamamsy; John F Fullard; Ying-Chih Wang; Milind C Mahajan; Jonathan M J Derry; Joel T Dudley; Scott E Hemby; Benjamin A Logsdon; Konrad Talbot; Towfique Raj; David A Bennett; Philip L De Jager; Jun Zhu; Bin Zhang; Patrick F Sullivan; Andrew Chess; Shaun M Purcell; Leslie A Shinobu; Lara M Mangravite; Hiroyoshi Toyoshiba; Raquel E Gur; Chang-Gyu Hahn; David A Lewis; Vahram Haroutunian; Mette A Peters; Barbara K Lipska; Joseph D Buxbaum; Eric E Schadt; Keisuke Hirai; Kathryn Roeder; Kristen J Brennand; Nicholas Katsanis; Enrico Domenici; Bernie Devlin; Pamela Sklar
Journal:  Nat Neurosci       Date:  2016-09-26       Impact factor: 24.884

5.  Spatio-temporal transcriptome of the human brain.

Authors:  Hyo Jung Kang; Yuka Imamura Kawasawa; Feng Cheng; Ying Zhu; Xuming Xu; Mingfeng Li; André M M Sousa; Mihovil Pletikos; Kyle A Meyer; Goran Sedmak; Tobias Guennel; Yurae Shin; Matthew B Johnson; Zeljka Krsnik; Simone Mayer; Sofia Fertuzinhos; Sheila Umlauf; Steven N Lisgo; Alexander Vortmeyer; Daniel R Weinberger; Shrikant Mane; Thomas M Hyde; Anita Huttner; Mark Reimers; Joel E Kleinman; Nenad Sestan
Journal:  Nature       Date:  2011-10-26       Impact factor: 49.962

6.  The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells.

Authors:  Cole Trapnell; Davide Cacchiarelli; Jonna Grimsby; Prapti Pokharel; Shuqiang Li; Michael Morse; Niall J Lennon; Kenneth J Livak; Tarjei S Mikkelsen; John L Rinn
Journal:  Nat Biotechnol       Date:  2014-03-23       Impact factor: 54.908

7.  Bayesian approach to single-cell differential expression analysis.

Authors:  Peter V Kharchenko; Lev Silberstein; David T Scadden
Journal:  Nat Methods       Date:  2014-05-18       Impact factor: 28.547

Review 8.  A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications.

Authors:  Ashraful Haque; Jessica Engel; Sarah A Teichmann; Tapio Lönnberg
Journal:  Genome Med       Date:  2017-08-18       Impact factor: 11.117

9.  Biomarker discovery in heterogeneous tissue samples -taking the in-silico deconfounding approach.

Authors:  Dirk Repsilber; Sabine Kern; Anna Telaar; Gerhard Walzl; Gillian F Black; Joachim Selbig; Shreemanta K Parida; Stefan H E Kaufmann; Marc Jacobsen
Journal:  BMC Bioinformatics       Date:  2010-01-14       Impact factor: 3.169

10.  MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data.

Authors:  Greg Finak; Andrew McDavid; Masanao Yajima; Jingyuan Deng; Vivian Gersuk; Alex K Shalek; Chloe K Slichter; Hannah W Miller; M Juliana McElrath; Martin Prlic; Peter S Linsley; Raphael Gottardo
Journal:  Genome Biol       Date:  2015-12-10       Impact factor: 13.583

View more
  20 in total

1.  SCDC: bulk gene expression deconvolution by multiple single-cell RNA sequencing references.

Authors:  Meichen Dong; Aatish Thennavan; Eugene Urrutia; Yun Li; Charles M Perou; Fei Zou; Yuchao Jiang
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

2.  Correlation Imputation in Single cell RNA-seq using Auxiliary Information and Ensemble Learning.

Authors:  Luqin Gan; Giuseppe Vinci; Genevera I Allen
Journal:  ACM BCB       Date:  2020-09

3.  Missing data and technical variability in single-cell RNA-sequencing experiments.

Authors:  Stephanie C Hicks; F William Townes; Mingxiang Teng; Rafael A Irizarry
Journal:  Biostatistics       Date:  2018-10-01       Impact factor: 5.899

4.  Quantile regression for challenging cases of eQTL mapping.

Authors:  Bo Sun; Liang Chen
Journal:  Brief Bioinform       Date:  2020-09-25       Impact factor: 11.622

5.  DURIAN: an integrative deconvolution and imputation method for robust signaling analysis of single-cell transcriptomics data.

Authors:  Matthew Karikomi; Peijie Zhou; Qing Nie
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

6.  HiCImpute: A Bayesian hierarchical model for identifying structural zeros and enhancing single cell Hi-C data.

Authors:  Qing Xie; Chenggong Han; Victor Jin; Shili Lin
Journal:  PLoS Comput Biol       Date:  2022-06-13       Impact factor: 4.779

7.  Consensus clustering of single-cell RNA-seq data by enhancing network affinity.

Authors:  Yaxuan Cui; Shaoqiang Zhang; Ying Liang; Xiangyun Wang; Thomas N Ferraro; Yong Chen
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

8.  iSMNN: batch effect correction for single-cell RNA-seq data via iterative supervised mutual nearest neighbor refinement.

Authors:  Yuchen Yang; Gang Li; Yifang Xie; Li Wang; Taylor M Lagler; Yingxi Yang; Jiandong Liu; Li Qian; Yun Li
Journal:  Brief Bioinform       Date:  2021-09-02       Impact factor: 11.622

9.  Bayesian estimation of cell type-specific gene expression with prior derived from single-cell data.

Authors:  Jiebiao Wang; Kathryn Roeder; Bernie Devlin
Journal:  Genome Res       Date:  2021-04-09       Impact factor: 9.043

Review 10.  Separating measurement and expression models clarifies confusion in single-cell RNA sequencing analysis.

Authors:  Abhishek Sarkar; Matthew Stephens
Journal:  Nat Genet       Date:  2021-05-24       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.