Literature DB >> 33758076

CellMixS: quantifying and visualizing batch effects in single-cell RNA-seq data.

Almut Lütge1,2, Joanna Zyprych-Walczak3, Urszula Brykczynska Kunzmann4, Helena L Crowell1,2, Daniela Calini5, Dheeraj Malhotra5, Charlotte Soneson2,4, Mark D Robinson6,2.   

Abstract

A key challenge in single-cell RNA-sequencing (scRNA-seq) data analysis is batch effects that can obscure the biological signal of interest. Although there are various tools and methods to correct for batch effects, their performance can vary. Therefore, it is important to understand how batch effects manifest to adjust for them. Here, we systematically explore batch effects across various scRNA-seq datasets according to magnitude, cell type specificity, and complexity. We developed a cell-specific mixing score (cms) that quantifies mixing of cells from multiple batches. By considering distance distributions, the score is able to detect local batch bias as well as differentiate between unbalanced batches and systematic differences between cells of the same cell type. We compare metrics in scRNA-seq data using real and synthetic datasets and whereas these metrics target the same question and are used interchangeably, we find differences in scalability, sensitivity, and ability to handle differentially abundant cell types. We find that cell-specific metrics outperform cell type-specific and global metrics and recommend them for both method benchmarks and batch exploration.
© 2021 Lütge et al.

Entities:  

Year:  2021        PMID: 33758076      PMCID: PMC7994321          DOI: 10.26508/lsa.202001004

Source DB:  PubMed          Journal:  Life Sci Alliance        ISSN: 2575-1077


  23 in total

Review 1.  Orchestrating high-throughput genomic analysis with Bioconductor.

Authors:  Wolfgang Huber; Vincent J Carey; Robert Gentleman; Simon Anders; Marc Carlson; Benilton S Carvalho; Hector Corrada Bravo; Sean Davis; Laurent Gatto; Thomas Girke; Raphael Gottardo; Florian Hahne; Kasper D Hansen; Rafael A Irizarry; Michael Lawrence; Michael I Love; James MacDonald; Valerie Obenchain; Andrzej K Oleś; Hervé Pagès; Alejandro Reyes; Paul Shannon; Gordon K Smyth; Dan Tenenbaum; Levi Waldron; Martin Morgan
Journal:  Nat Methods       Date:  2015-02       Impact factor: 28.547

2.  Snakemake--a scalable bioinformatics workflow engine.

Authors:  Johannes Köster; Sven Rahmann
Journal:  Bioinformatics       Date:  2012-08-20       Impact factor: 6.937

3.  A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor.

Authors:  Aaron T L Lun; Davis J McCarthy; John C Marioni
Journal:  F1000Res       Date:  2016-08-31

4.  Batch effects in single-cell RNA-sequencing data are corrected by matching mutual nearest neighbors.

Authors:  Laleh Haghverdi; Aaron T L Lun; Michael D Morgan; John C Marioni
Journal:  Nat Biotechnol       Date:  2018-04-02       Impact factor: 54.908

5.  A test metric for assessing single-cell RNA-seq batch correction.

Authors:  Maren Büttner; Zhichao Miao; F Alexander Wolf; Sarah A Teichmann; Fabian J Theis
Journal:  Nat Methods       Date:  2018-12-20       Impact factor: 28.547

6.  Multiplexed droplet single-cell RNA-sequencing using natural genetic variation.

Authors:  Hyun Min Kang; Meena Subramaniam; Sasha Targ; Michelle Nguyen; Lenka Maliskova; Elizabeth McCarthy; Eunice Wan; Simon Wong; Lauren Byrnes; Cristina M Lanata; Rachel E Gate; Sara Mostafavi; Alexander Marson; Noah Zaitlen; Lindsey A Criswell; Chun Jimmie Ye
Journal:  Nat Biotechnol       Date:  2017-12-11       Impact factor: 54.908

7.  Scater: pre-processing, quality control, normalization and visualization of single-cell RNA-seq data in R.

Authors:  Davis J McCarthy; Kieran R Campbell; Aaron T L Lun; Quin F Wills
Journal:  Bioinformatics       Date:  2017-04-15       Impact factor: 6.937

8.  Towards unified quality verification of synthetic count data with countsimQC.

Authors:  Charlotte Soneson; Mark D Robinson
Journal:  Bioinformatics       Date:  2018-02-15       Impact factor: 6.937

9.  Flexible comparison of batch correction methods for single-cell RNA-seq using BatchBench.

Authors:  Ruben Chazarra-Gil; Stijn van Dongen; Vladimir Yu Kiselev; Martin Hemberg
Journal:  Nucleic Acids Res       Date:  2021-04-19       Impact factor: 16.971

10.  Pooling across cells to normalize single-cell RNA sequencing data with many zero counts.

Authors:  Aaron T L Lun; Karsten Bach; John C Marioni
Journal:  Genome Biol       Date:  2016-04-27       Impact factor: 13.583

View more
  3 in total

1.  Doublet identification in single-cell sequencing data using scDblFinder.

Authors:  Pierre-Luc Germain; Aaron Lun; Carlos Garcia Meixide; Will Macnair; Mark D Robinson
Journal:  F1000Res       Date:  2021-09-28

2.  Single-cell transcriptional regulation and genetic evolution of neuroendocrine prostate cancer.

Authors:  Ziwei Wang; Tao Wang; Danni Hong; Baijun Dong; Yan Wang; Huaqiang Huang; Wenhui Zhang; Bijun Lian; Boyao Ji; Haoqing Shi; Min Qu; Xu Gao; Daofeng Li; Colin Collins; Gonghong Wei; Chuanliang Xu; Hyung Joo Lee; Jialiang Huang; Jing Li
Journal:  iScience       Date:  2022-06-13

3.  Single-cell eQTL analysis of activated T cell subsets reveals activation and cell type-dependent effects of disease-risk variants.

Authors:  Benjamin J Schmiedel; Cristian Gonzalez-Colin; Vicente Fajardo; Job Rocha; Ariel Madrigal; Ciro Ramírez-Suástegui; Sourya Bhattacharyya; Hayley Simon; Jason A Greenbaum; Bjoern Peters; Grégory Seumois; Ferhat Ay; Vivek Chandra; Pandurangan Vijayanand
Journal:  Sci Immunol       Date:  2022-02-25
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.