| Literature DB >> 31992756 |
Merce Montoliu-Nerin1, Marisol Sánchez-García1, Claudia Bergin2, Manfred Grabherr3, Barbara Ellis1, Verena Esther Kutschera4, Marcin Kierczak3, Hanna Johannesson5, Anna Rosling6.
Abstract
The advent of novel sequencing techniques has unraveled a tremendous diversity on Earth. Genomic data allow us to understand ecology and function of organisms that we would not otherwise know existed. However, major methodological challenges remain, in particular for multicellular organisms with large genomes. Arbuscular mycorrhizal (AM) fungi are important plant symbionts with cryptic and complex multicellular life cycles, thus representing a suitable model system for method development. Here, we report a novel method for large scale, unbiased nuclear sorting, sequencing, and de novo assembling of AM fungal genomes. After comparative analyses of three assembly workflows we discuss how sequence data from single nuclei can best be used for different downstream analyses such as phylogenomics and comparative genomics of single nuclei. Based on analysis of completeness, we conclude that comprehensive de novo genome assemblies can be produced from six to seven nuclei. The method is highly applicable for a broad range of taxa, and will greatly improve our ability to study multicellular eukaryotes with complex life cycles.Entities:
Mesh:
Year: 2020 PMID: 31992756 PMCID: PMC6987183 DOI: 10.1038/s41598-020-58025-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) Schematic representation of the life-cycle in AM fungi. A spore detects a plant root in the vicinity and grows hyphae towards it. The hyphae penetrate the plant cell wall and form the characteristically branching haustoria with the shape of arbuscules. The arbuscules are used to exchange nutrients with the plant. New spores are produced in other hyphal terminations, bud off upon maturity and remain in dormant state until the cycle starts again, while the first spore dies and the fungi retracts from the plant cell. (b) Schematic representation of a spore containing nuclei, lipid vesicles and endosymbiotic bacteria. The hyphae have very reduced compartmentalization with incomplete septa and nuclei appear to move freely.
Figure 2From a soil sample to AM fungal genome assemblies. (a) Whole inoculum from the culture collection INVAM is blended with water and (b) poured into a set of sieves; the material stuck in the 38 μm sieve is placed into a (c) tube that contains a solution of 60% sucrose, then centrifuged for 1 min. The supernatant is run through a 38 μm sieve and washed with water. (d) The sieve content is placed in a Petri dish for the spores to be manually picked using a glass pipette. (e) After cleaning the spores with ddH2O, these are placed one-by-one into tubes and crushed with a pestle. (f) The DNA from a broken spore is stained with SYBR Green, giving a strong fluorescent signal for the nuclei and a lighter signal for the background, organelles and microbes. (g) The stained spore content is loaded on the FACS, where the sample moves inside a constant flow of buffer and crosses a laser beam. An excitation laser of 488 nm and 530/40 band pass filter was used for the SYBR Green fluorescence detection. In addition, scattered light, forward scatter (FSC) and side scatter (SSC) were used as proxy for size and granularity to identify the nuclei. (h) The signals can be interpreted in a scatterplot, and particles of a selected cloud (e.g., R1, blue-box) can be sorted individually or pooled (i) into individual wells of a 96-well plate by directing them with a charge. (j) The content of each well is whole genome amplified using MDA. (k) The amplified products are tested for fungi and bacteria by PCR screening with specific rDNA primers. The products confirmed to be from fungal nuclei are sequenced with (l) Illumina HiSeqX, for single nuclei; and (m) Oxford Nanopore, for pools of nuclei. (n) In workflow 1, Illumina reads are assembled separately for individual nuclei using MaSuRCA[39]. (o) In workflow 2, reads from individual nuclei are normalized and assembled with SPADES[40]. (q) In workflow 3 reads from all nuclei are combined, then normalized and finally assembled with SPADES[40]. (p) Lingon[38] is used to produce a consensus assembly from individual nuclei assemblies in both workflows 1 and 2. (r) Nanopore data is assembled with Canu[41], polished with Pilon[53] using the Illumina raw-reads and used to (s) scaffold the three assemblies generated with workflows 1, 2 and 3 using Chromosemble, of Satsuma[55].
Comparative assessment of the 3 assembly workflows.
| Assembly | Size (Mb) | # Contigs | N50 | Largest contig (Kb) | GC (%) | BUSCO (%)a | # Genes (Mb) | Repeats (Mb) | |
|---|---|---|---|---|---|---|---|---|---|
| 1 | Raw reads | 90.16 | 11077 | 12714 | 94.39 | 27.01 | C: 77 F: 10 | 18068 (49.42) | 40.39 |
| 1n | +Nanopore | 92.38 | 3899 | 37258 | 176.652 | 27.91 | C: 78 F: 9 | 16680 (69.54) | 41.32 |
| 2 | Normalized to 100× | 124.96 | 21934 | 16055 | 155.09 | 28.07 | C: 79 F: 8 | 24930 (69.79) | 57.77 |
| 2n | +Nanopore | 130.41 | 4632 | 60974 | 338.42 | 28.07 | C: 80 F: 7 | 22618 (105.48) | 58.57 |
| 3 | Combined, normalized to 100× | 68.31 | 11246 | 15947 | 199.90 | 28.08 | C: 88 F: 4 | 15882 (43.73) | 21.71 |
| 3n | +Nanopore | 70.81 | 3883 | 33135 | 220.22 | 28.08 | C: 89 F: 3 | 14662 (55.44) | 20.64 |
| Nanopore | polished with Pilon | 96.03 | 6409 | 20944 | 151.76 | 28.15 | C: 77 F: 6 | 15858 (57.47) | 47.31 |
aCompleteness estimated in % of 290 single copy genes in fungi, scored as complete (C) or fragmented (F).
Figure 3Summary statistics for different number of assembled nuclei (1–24) using three different assembly workflows. BUSCO estimates of completeness for (a) workflow 1: raw reads of individual nuclei assembled using MaSuRCa, consensus assembly using Lingon (b) workflow 2: normalized reads of individual nuclei assembled using SPADES, consensus assembly using Lingon and (c) workflow 3: reads from individual nuclei are pooled and normalized before assembling with SPADES. Percentage of single copy core genes detected as single copy (S: grey), duplicated (D: light grey) or fragmented (F: black). Average of 3–6 replicate assemblies up to 12 nuclei with error bars indicating SEM. In (d) assembly size (dashed lines) and N50 (solid lines) for the three methods 1 (black), 2 (grey) and 3 (light grey).