| Literature DB >> 31501550 |
Allen W Zhang1,2,3, Ciara O'Flanagan1, Elizabeth A Chavez4, Jamie L P Lim1,2, Nicholas Ceglia2, Andrew McPherson1, Matt Wiens1, Pascale Walters1, Tim Chan1, Brittany Hewitson1, Daniel Lai1, Anja Mottok4,5, Clementine Sarkozy4, Lauren Chong4, Tomohiro Aoki4,6, Xuehai Wang7, Andrew P Weng7, Jessica N McAlpine8, Samuel Aparicio1,6, Christian Steidl4, Kieran R Campbell9,10,11, Sohrab P Shah12,13,14.
Abstract
Single-cell RNA sequencing has enabled the decomposition of complex tissues into functionally distinct cell types. Often, investigators wish to assign cells to cell types through unsupervised clustering followed by manual annotation or via 'mapping' to existing data. However, manual interpretation scales poorly to large datasets, mapping approaches require purified or pre-annotated data and both are prone to batch effects. To overcome these issues, we present CellAssign, a probabilistic model that leverages prior knowledge of cell-type marker genes to annotate single-cell RNA sequencing data into predefined or de novo cell types. CellAssign automates the process of assigning cells in a highly scalable manner across large datasets while controlling for batch and sample effects. We demonstrate the advantages of CellAssign through extensive simulations and analysis of tumor microenvironment composition in high-grade serous ovarian cancer and follicular lymphoma.Entities:
Mesh:
Year: 2019 PMID: 31501550 PMCID: PMC7485597 DOI: 10.1038/s41592-019-0529-1
Source DB: PubMed Journal: Nat Methods ISSN: 1548-7091 Impact factor: 28.547