| Literature DB >> 34992246 |
Yusuke Kijima1,2,3, Keito Watano1,4,5, Soh Ishiguro1,3, Naoki Konno1,4,5, Keiichiro Ono6, Mamoru Tanaka1, Hideto Mori1,7,8, Nanami Masuyama1,3,7,8, Dexter Pratt6, Trey Ideker6,9, Wataru Iwasaki4,5, Nozomu Yachie10,11,12,13.
Abstract
Phylogeny estimation (the reconstruction of evolutionary trees) has recently been applied to CRISPR-based cell lineage tracing, allowing the developmental history of an individual tissue or organism to be inferred from a large number of mutated sequences in somatic cells. However, current computational methods are not able to construct phylogenetic trees from extremely large numbers of input sequences. Here, we present a deep distributed computing framework to comprehensively trace accurate large lineages (FRACTAL) that substantially enhances the scalability of current lineage estimation software tools. FRACTAL first reconstructs only an upstream lineage of the input sequences and recursively iterates the same produce for its downstream lineages using independent computing nodes. We demonstrate the utility of FRACTAL by reconstructing lineages from >235 million simulated sequences and from >16 million cells from a simulated experiment with a CRISPR system that accumulates mutations during cell proliferation. We also successfully applied FRACTAL to evolutionary tree reconstructions and to an experiment using error-prone PCR (EP-PCR) for large-scale sequence diversification.Entities:
Mesh:
Year: 2022 PMID: 34992246 DOI: 10.1038/s41587-021-01111-2
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 68.164