| Literature DB >> 33349700 |
Wanqiu Chen1, Yongmei Zhao2,3, Xin Chen1,4, Zhaowei Yang1,5, Xiaojiang Xu6, Yingtao Bi7, Vicky Chen2,3, Jing Li4,5, Hannah Choi1, Ben Ernest8, Bao Tran3, Monika Mehta3, Parimal Kumar3, Andrew Farmer9, Alain Mir9, Urvashi Ann Mehra8, Jian-Liang Li6, Malcolm Moos10, Wenming Xiao11, Charles Wang12,13.
Abstract
Comparing diverse single-cell RNA sequencing (scRNA-seq) datasets generated by different technologies and in different laboratories remains a major challenge. Here we address the need for guidance in choosing algorithms leading to accurate biological interpretations of varied data types acquired with different platforms. Using two well-characterized cellular reference samples (breast cancer cells and B cells), captured either separately or in mixtures, we compared different scRNA-seq platforms and several preprocessing, normalization and batch-effect correction methods at multiple centers. Although preprocessing and normalization contributed to variability in gene detection and cell classification, batch-effect correction was by far the most important factor in correctly classifying the cells. Moreover, scRNA-seq dataset characteristics (for example, sample and cellular heterogeneity and platform used) were critical in determining the optimal bioinformatic method. However, reproducibility across centers and platforms was high when appropriate bioinformatic methods were applied. Our findings offer practical guidance for optimizing platform and software selection when designing an scRNA-seq study.Entities:
Year: 2020 PMID: 33349700 DOI: 10.1038/s41587-020-00748-9
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908