| Literature DB >> 35298641 |
Ryan O Schenck1,2, Gabriel Brosula1, Jeffrey West1, Simon Leedham1, Darryl Shibata3, Alexander R A Anderson1.
Abstract
Research over the past two decades has made substantial inroads into our understanding of somatic mutations. Recently, these studies have focused on understanding their presence in homeostatic tissue. In parallel, agent-based mechanistic models have emerged as an important tool for understanding somatic mutation in tissue; yet no common methodology currently exists to provide base-pair resolution data for these models. Here, we present Gattaca as the first method for introducing and tracking somatic mutations at the base-pair resolution within agent-based models that typically lack nuclei. With nuclei that incorporate human reference genomes, mutational context, and sequence coverage/error information, Gattaca is able to realistically evolve sequence data, facilitating comparisons between in silico cell tissue modeling with experimental human somatic mutation data. This user-friendly method, incorporated into each in silico cell, allows us to fully capture somatic mutation spectra and evolution.Entities:
Keywords: computational tool; mechanistic modeling; somatic evolution
Mesh:
Year: 2022 PMID: 35298641 PMCID: PMC9034688 DOI: 10.1093/molbev/msac058
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 8.800
Fig. 1.Gattaca is a three-part workflow for simulating base-pair resolution mutations within the human genome for somatic evolution in silico studies. Gattaca consists of three parts: (i) user defines options (initialize), (ii) generate a java executable class for in silico simulations with base-pair resolution mutation tracking (execution), and (iii) analyze the output of these simulations for downstream analysis (analyze).
Fig. 2.Results comparing the fully seeded (left) and single-cell seeding (right) model types and their corresponding mutation profiles and clonal dynamics. For each case, the mutation proportion across the 96 mutation trinucleotides is shown for one of the simulation replicates. The values for each of the modeled dimensions are shown for three different population sizes, and the inset shows the distribution for that which would be within the limits of detection (a generous VAF at high depths). Beneath this, the same replicate that is shown in the mutation spectrum plot is used to highlight the differences in clonal dynamics using a Muller plot, produced using EvoFreq, with a VAF cutoff between fully seeded and single-cell seeding.
Fig. 3.Illustration of the repeated wounding of the single-cell model in where colors represent clones that differ by at least one mutation (a). For each of the dimensions, the cumulative and unique genomes is given over the course of simulations (b). R-squared values for the linear regression on distributions for mutations is plotted for all dimensions with and without wounding (c) for all replicate simulations.