| Literature DB >> 35534232 |
Arikuni Uchimura1,2, Hirotaka Matsumoto3,4, Yasunari Satoh1, Yohei Minakuchi5, Sayaka Wakayama6, Teruhiko Wakayama6,7, Mayumi Higuchi2, Masakazu Hashimoto8, Ryutaro Fukumura9, Atsushi Toyoda5, Yoichi Gondo9, Takeshi Yagi2.
Abstract
De novo mutations accumulate with zygotic cell divisions. However, the occurrence of these mutations and the way they are inherited by somatic cells and germ cells remain unclear. Here, we present a novel method to reconstruct cell lineages. We identified mosaic mutations in mice using deep whole-genome sequencing and reconstructed embryonic cell lineages based on the variant allele frequencies of the mutations. The reconstructed trees were confirmed using nuclear transfer experiments and the genotyping of approximately 50 offspring of each tree. The most detailed tree had 32 terminal nodes and showed cell divisions from the fertilized egg to germ cell- and somatic cell-specific lineages, indicating at least five independent cell lineages that would be selected as founders of the primordial germ cells. The contributions of each lineage to germ cells and offspring varied widely. At the emergence of the germ cell-specific lineages, 10-15 embryonic mutations had accumulated, suggesting that the pregastrulation mutation rate is 1.0 mutation per mitosis. Subsequent mutation rates were 0.7 for germ cells and 13.2 for tail fibroblasts. Our results show a new framework to assess embryonic lineages; further, we suggest an evolutionary strategy for preserving heterogeneity owing to postzygotic mutations in offspring.Entities:
Mesh:
Year: 2022 PMID: 35534232 PMCID: PMC9104692 DOI: 10.1101/gr.276363.121
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.438
Figure 1.Principle of cell lineage reconstruction. (A) Inferred tree of lineage relationships and genotypes of the first two postzygotic divisions, as well as the arithmetical rules of variant allele frequencies (VAFs) of mosaic mutations used for the lineage reconstruction. The red and black numbers represent independently arising mutations that were first fixed and subsequently inherited, respectively. Pie charts represent example contributions of the early embryonic cell to the adult samples. (B) Overall experimental procedure for reconstruction of lineages. (C,D) Examples of the arithmetical relationship of VAFs of mosaic mutations in individual tissues of a mouse (mouse ID: ConC31). Mutations that show the same frequency in all tissues are considered co-occurring. The sum of the VAFs of the two mutation groups is indicated by the black line. The sum of the average VAF of mutations #2, #19, and #33A (VAF#2/19/33A) and the average VAF of mutations #28 and #33C (VAF#28/33C) equals 50%, indicating the first cell cleavage after a fertilized egg (B). The sum of VAF#14/41 and VAF#51/80 is equal to the VAF#2/19/33A, indicating the subsequent cell division after the first cell cleavage (C). Other examples are shown in Supplemental Figure S1.
Figure 2.Reconstructed lineage trees based on de novo mutations occurring during the first several postzygotic divisions. Reconstructed lineage trees of five animals based on the mosaic mutations detected through whole-genome sequencing (WGS) with 100× (A–D) and 900× (E) coverages. The red numbers represent newly arising mutations. The underlined numbers represent mutations also observed in the offspring. Numbers with an asterisk represent mutations added using the genotyping results of the offspring. The numbers with a hash mark represent mutations with a GERP score of more than three (expected to have a harmful effect). The nuclear transfer embryonic stem (ntES) below the cell positions represents the lineages leading to each ntES cell line established in the mice ConB23 (A) and ConJ12 (E). The brown and green numbers (%) below each cell position represent the VAFs of labeled cells’ autosomal mutations (or the estimated values if there are no mutations) in somatic tissue averages and in a sperm sample, respectively. In the ConJ12 tree (E), in sperm or all somatic tissue samples, cells with VAF values comparable to the background were colored orange and light green as somatic cell–specific lineages and germ cell–specific lineages, respectively.
Figure 3.Contributions of each phylogeny to tissues. (A) A heatmap representing the contribution of the terminal nodes in the tree of mouse ConJ12 (Fig. 2E) for each tissue sample. The dendrogram represents the results of the hierarchical clustering. (B) The correlations of the average VAFs in tissues (somatic tissues or germ cell tissues) and the frequency of mutations inherited by the offspring. (C) The distance of the contribution of the terminal nodes between the values of each tissue and the average of all somatic tissues. (D) The contribution ratios of the two daughter cells after a single-cell division. VAF#H means the daughter cell had a higher average VAF relative to that of all samples, and VAF#L denotes the opposite condition. For comparison, the ratios of inheritance frequency of the mutations for the offspring are also shown.
Figure 4.Mutation rates and characteristics of the postzygotic mutations. (A) Schematic of the number of spontaneous mutations along with the developmental period. (B) The trinucleotide context of postzygotic mutations (validated mosaic mutations detected during deep-coverage WGS on five mice), germline mutations (de novo mutations accumulated in long-term breeding mouse lines) (see Supplemental Fig. S8), and somatic mutations (de novo mutations accumulated in ntES cell lines derived from mouse tail fibroblasts) (see Supplemental Fig. S7). (C) Hierarchical clustering of the mutation signatures. Human germline and postzygotic mutations were obtained by Jónsson et al. (2017) and Bizzotto et al. (2021), respectively. (D) Comparison of replication timing at the genome position where the mutation occurred. Significant differences were calculated using a chi-square test. Error bars indicate binomial 95% CIs. (E) Mutation occurrence frequency in genomic regions. Each nucleotide in the effective whole-genome coverage (EWC) regions was annotated with SnpEff software. “Upstream” and “Downstream” indicate 5000-bp regions of each gene. Significant differences were calculated using a chi-square test. Error bars indicate 95% CIs for the Poisson means.