| Literature DB >> 30514897 |
Jochen Singer1,2, Jack Kuipers1,2, Katharina Jahn1,2, Niko Beerenwinkel3,4.
Abstract
Reconstructing the evolution of tumors is a key aspect towards the identification of appropriate cancer therapies. The task is challenging because tumors evolve as heterogeneous cell populations. Single-cell sequencing holds the promise of resolving the heterogeneity of tumors; however, it has its own challenges including elevated error rates, allelic drop-out, and uneven coverage. Here, we develop a new approach to mutation detection in individual tumor cells by leveraging the evolutionary relationship among cells. Our method, called SCIΦ, jointly calls mutations in individual cells and estimates the tumor phylogeny among these cells. Employing a Markov Chain Monte Carlo scheme enables us to reliably call mutations in each single cell even in experiments with high drop-out rates and missing data. We show that SCIΦ outperforms existing methods on simulated data and applied it to different real-world datasets, namely a whole exome breast cancer as well as a panel acute lymphoblastic leukemia dataset.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30514897 PMCID: PMC6279798 DOI: 10.1038/s41467-018-07627-7
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Schematic overview of SCIΦ. First, candidate loci are identified. These loci are then used to infer the underlying phylogenetic tree and the parameters of the model. In a last step the mutation to cell assignment is sampled from the posterior distribution
Fig. 2Performance of SCIΦ and Monovar on simulated data with different number of cells. Performance measured as recall (a), precision (b), and F1 score (c)
Fig. 3Summary statistics of the F1 performance of SCIΦ and Monovar on simulated data. F1 performance depending on different levels of drop-out events (a), homozygosity (b), and copy number rates (c)
Fig. 4Summary of the mutation calls obtained with Monovar and SCIΦ on a breast cancer patient dataset[13] consisting of 16 single tumor cells and a control normal bulk sequencing dataset. a Cell lineage tree with average number of mutations per inner node as identified by SCIΦ. The area of a node is proportional to its number of assigned mutations. b Posterior probability of SCIΦ mutation calls clustered according to the tree in a. c Probability of Monovar mutation calls for loci identified as mutated by SCIΦ and clustered according to the tree in a. d Probability of Monovar mutation calls for loci identified as mutated by SCIΦ and clustered hierarchically
Fig. 5Summary of the mutation calls from SCIΦ and Monovar on a dataset consisting of 255 cells from a patient (number 3) with acute lymphoblastic leukemia[14]. a Monovar mutation calls for loci identified as mutated by SCIΦ clustered hierarchically. b Monovar mutation calls clustered according to the tree inferred by SCIΦ. c SCIΦ mutation calls clustered according to its inferred tree