| Literature DB >> 33193577 |
George S Krasnov1, Elena N Pushkova1, Roman O Novakovskiy1, Ludmila P Kudryavtseva2, Tatiana A Rozhmina1,2, Ekaterina M Dvorianinova1,3, Liubov V Povkhova1,3, Anna V Kudryavtseva1, Alexey A Dmitriev1, Nataliya V Melnikova1.
Abstract
In the present work, a highly pathogenic isolate of Fusarium oxysporum f. sp. lini, which is the most harmful pathogen affecting flax (Linum usitatissimum L.), was sequenced for the first time. To achieve a high-quality genome assembly, we used the combination of two sequencing platforms - Oxford Nanopore Technologies (MinION system) with long noisy reads and Illumina (HiSeq 2500 instrument) with short accurate reads. Given the quality of DNA is crucial for Nanopore sequencing, we developed the protocol for extraction of pure high-molecular-weight DNA from fungi. Sequencing of DNA extracted using this protocol allowed us to obtain about 85x genome coverage with long (N50 = 29 kb) MinION reads and 30x coverage with 2 × 250 bp HiSeq reads. Several tools were developed for genome assembly; however, they provide different results depending on genome complexity, sequencing data volume, read length and quality. We benchmarked the most requested assemblers (Canu, Flye, Shasta, wtdbg2, and MaSuRCA), Nanopore polishers (Medaka and Racon), and Illumina polishers (Pilon and POLCA) on our sequencing data. The assembly performed with Canu and polished with Medaka and POLCA was considered the most full and accurate. After further elimination of redundant contigs using Purge Haplotigs, we achieved a high-quality genome of F. oxysporum f. sp. lini with a total length of 59 Mb, N50 of 3.3 Mb, and 99.5% completeness according to BUSCO. We also obtained a complete circular mitochondrial genome with a length of 38.7 kb. The achieved assembly expands studies on F. oxysporum and plant-pathogen interaction in flax.Entities:
Keywords: Fusarium oxysporum f. sp. lini; Illumina; Linum usitatissimum L.; Nanopore; assembly polishers; de novo genome assembly; genome assemblers; pure high-molecular-weight DNA
Year: 2020 PMID: 33193577 PMCID: PMC7481384 DOI: 10.3389/fgene.2020.00959
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The QUAST statistics across 5 assemblers.
| Feature | Canu 1.8 contigs | Canu 1.8 contigs + Purge Haplotigs | Canu 1.8 unitigs | Flye 2.6 | MaSuRCa-CA 3.3.9 | MaSuRCa-Flye 3.3.9 | Shasta 0.4.0 | wtdbg2 2.5 |
| Number of contigs (all contigs) | 101 | 35 | 192 | 90 | 87 | 388 | 1069 | 348 |
| Number of contigs (length > 5 kb) | 101 | 35 | 192 | 77 | 87 | 140 | 389 | 345 |
| Number of contigs (length > 50 kb) | 75 | 32 | 111 | 55 | 78 | 104 | 137 | 109 |
| Largest contig, Mb | 6.75 | 6.76 | 6.75 | 6.69 | 4.56 | 4.33 | 5.78 | 5.94 |
| Total length, Mb (all contigs) | 70.9 | 59.2 | 73.8 | 69.1 | 69.2 | 68.4 | 66.2 | 65.0 |
| Total length, Mb (contigs > 5 kb) | 70.9 | 59.2 | 73.8 | 69.1 | 69.2 | 67.8 | 65.0 | 65.0 |
| Total length, Mb (contigs > 50 kb) | 70.0 | 59.1 | 71.0 | 68.7 | 68.9 | 67.3 | 61.6 | 60.8 |
| N50, Mb | 3.13 | 3.35 | 2.19 | 3.77 | 2.07 | 1.61 | 2.19 | 2.93 |
| N75, Mb | 1.25 | 2.19 | 0.72 | 1.19 | 0.94 | 0.50 | 0.29 | 0.37 |
| L50 | 8 | 7 | 9 | 7 | 11 | 13 | 10 | 9 |
| L75 | 17 | 12 | 24 | 16 | 23 | 32 | 35 | 24 |
| GC, % | 47.95 | 47.98 | 47.91 | 47.99 | 48.13 | 48.08 | 48.03 | 47.99 |
FIGURE 1The Nx statistics for Fusarium oxysporum f. sp. lini assemblies. Results for the following assemblers are presented: Canu 1.8 contigs, Canu 1.8 contigs filtered with Purge Haplotigs (Canu_contigs__PH), Canu 1.8 unitigs, Flye 2.6, MaSuRCA 3.3.9 coupled with Celera Assembler (MaSuRCA_CA), MaSuRCA 3.3.9 coupled with Flye (MaSuRCA_Flye), Shasta 0.4.0, wtdbg2 2.5. Medaka, Racon, Pilon, POLCA polishing did not impact Nx statistics.
FIGURE 2Alignment of the largest Flye 2.6 contig to Canu 1.8, Shasta 0.4.0, wtdbg2 2.5 assemblies and five NCBI chromosome-level assemblies of Fusarium oxysporum strains. The chromosome-level assemblies of F. oxysporum strains can be found in the NCBI Genome database (https://www.ncbi.nlm.nih.gov/genome/genomes/707).
Results of BUSCO analysis for various combinations of Nanopore and/or Illumina polishers.
| Polisher (ONT reads) | Polisher (Illumina reads) | Rank | Complete, % | Complete | Fragmented | Missing | Single-copy | Duplicated |
| Medaka | Pilon-bt2 + POLCA | 8 | 99.46 | 3139 | 5 | 11 | 3094 | 45 |
| Medaka | Pilon-bt2 | 8 | 99.46 | 3139 | 5 | 12 | 3093 | 45 |
| Medaka | Pilon-BWA | 10 | 99.43 | 3138 | 6 | 12 | 3092 | 45 |
| Medaka | POLCA | 2 | 99.49 | 3140 | 5 | 11 | 3095 | 45 |
| Medaka | POLCA X2 | 2 | 99.49 | 3140 | 5 | 11 | 3094 | 45 |
| Medaka | – | 14 | 98.42 | 3106 | 23 | 27 | 3061 | 45 |
| Racon | – | 18 | 93.44 | 2949 | 109 | 97 | 2912 | 37 |
| Racon X2 | Pilon-bt2 + POLCA | 2 | 99.49 | 3140 | 5 | 11 | 3095 | 45 |
| Racon X2 | Pilon-bt2 | 12 | 99.33 | 3135 | 9 | 11 | 3091 | 45 |
| Racon X2 | Pilon-BWA | 13 | 99.18 | 3130 | 12 | 14 | 3087 | 43 |
| Racon X2 | POLCA | 2 | 99.49 | 3140 | 5 | 11 | 3095 | 45 |
| Racon X2 | POLCA X2 | 2 | 99.49 | 3140 | 5 | 11 | 3095 | 45 |
| Racon X2 | – | 16 | 93.60 | 2954 | 107 | 96 | 2916 | 38 |
| Racon X3 | – | 15 | 93.63 | 2955 | 108 | 93 | 2917 | 38 |
| Racon X4 | – | 17 | 93.54 | 2952 | 110 | 94 | 2915 | 37 |
| – | POLCA X2 | 10 | 99.43 | 3138 | 6 | 11 | 3094 | 44 |
| – | – | 19 | 88.53 | 2794 | 196 | 166 | 2760 | 34 |
| MaSuRCa-CA hybrid assembly | 1 | 99.56 | 3142 | 5 | 9 | 3083 | 59 | |
| MaSuRCa-Flye hybrid assembly | 2 | 99.49 | 3140 | 6 | 10 | 3096 | 44 | |