| Literature DB >> 33020495 |
Georgios D Koutsovoulos1, Marine Poullet1, Abdelnaser Elashry2, Djampa K L Kozlowski1, Erika Sallet3, Martine Da Rocha1, Laetitia Perfus-Barbeoch1, Cristina Martin-Jimenez1, Juerg Ernst Frey4, Christian H Ahrens4,5, Sebastian Kiewnick6, Etienne G J Danchin7.
Abstract
Root-knot nematodes (genus Meloidogyne) are plant parasites causing huge economic loss in the agricultural industry and affecting severely numerous developing countries. Control methods against these plant pests are sparse, the preferred one being the deployment of plant cultivars bearing resistance genes against Meloidogyne species. However, M. enterolobii is not controlled by the resistance genes deployed in the crop plants cultivated in Europe. The recent identification of this species in Europe is thus a major concern. Here, we sequenced the genome of M. enterolobii using short and long-read technologies. The genome assembly spans 240 Mbp with contig N50 size of 143 kbp, enabling high-quality annotations of 59,773 coding genes, 4,068 non-coding genes, and 10,944 transposable elements (spanning 8.7% of the genome). We validated the genome size by flow cytometry and the structure, quality and completeness by bioinformatics metrics. This ensemble of resources will fuel future projects aiming at pinpointing the genome singularities, the origin, diversity, and adaptive potential of this emerging plant pest.Entities:
Mesh:
Year: 2020 PMID: 33020495 PMCID: PMC7536185 DOI: 10.1038/s41597-020-00666-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Genomic Sequence Data Statistics.
| Dataset | Mean read length | Insert Size | Number of reads | Sum (bp) |
|---|---|---|---|---|
| Illumina Paired-end | 101 | 250 | 382,534,946 | 38,636,029,546 |
| Illumina Mate-pair | 51 | 3,000 | 430,206,972 | 21,940,555,572 |
| PacBio | 7,008 | NA | 1,854,399 | 12,996,274,301 |
Transcriptomic Sequence Data Statistics.
| Dataset | Mean read length | Insert Size (range) | Number of reads | Sum (bp) |
|---|---|---|---|---|
| Eggs Illumina Paired-end | 196 | 300 (56–336) | 54,424,802 | 10,660,505,616 |
| J2 Illumina Paired-end | 196 | 300 (57–347) | 56,468,708 | 11,083,687,102 |
Fig. 1BlobPlot of the genome assembly before removing contamination. Each circle is a contig proportionally scaled by contig length and coloured by taxonomic annotation based on BLAST similarity search results. Contigs are positioned based on the GC content (X-axis) and the coverage of PacBio reads (Y-axis). There are some contigs of Proteobacteria origin at high GC and variable coverage indicating possible contamination. These contigs were removed from the assembly.
Root-knot nematode genome features.
| Species/Feature | Assembly size (Mb) | Nuclear DNA content (Mb) | % complete CEGMA (C) (copies) %Partial (P) | % complete BUSCO (C) % fragment (F) % missing (M) | N50 (kb) | # contigs/ scaffolds | GC % |
|---|---|---|---|---|---|---|---|
| 240 | 275 ± 19 | C: 94.76 (3.30) P: 96.77 | C:87.5% F:3.6%; M:8.9% | 143 | 4,437 | 30 | |
| 162.4 | NA | C: 81.45 (2.66) P: 89.92 | C:79.9% F:8.9%; M:11.2% | 9.3 | 46,090 | 30.2 | |
| 183.5 | 189 ± 15 | C: 94.76 (2.93) P :96.77 | C:88.5% F:3.0%; M:8.5% | 38.6 | 12,091 | 29.8 | |
| 122 | NA | C: 82.66 (2.34) P :89.52 | C:80.2% F:7.9%; M:11.9% | 16.5 | 33,735 | 30.6 | |
| 86 | 189 ± 15 | C: 74.6 (1.77) P :78.63 | C:71.3% F:7.3%; M:21.4% | 82.8 | 2,817 | 31.4 | |
| 235.8 | 297 ± 27 | C: 92.74 (3.68) P :95.56 | C:90.1% F:2.3%; M:7.6% | 10.4 | 31,341 | 30 | |
| 142.6 | NA | C :89.52 (2.71) P: 95.16 | C:87.5% F:4.3%; M:8.2% | 14.1 | 34,394 | 30.2 | |
| 258.1 | 304 ± 9 | C :94.76 (3.66) P :95.56 | C:87.1% F:4.3%; M:8.6% | 16.5 | 26,196 | 30 | |
| 284.05 | NA | C: 94.76 (3.57) P :96.77 | C:87.1% F:2.6%; M:10.3% | 204.6 | 2,224 | 30 | |
| 163.8 | NA | C :91.53 (2.74) P :95.97 | C: 78.2 F :12.2; M:9.6 | 10.5 | 46,509 | 30.3 | |
| 53.6 | 121 ± 3 | C :93.55 (1.19) P: 95.56 | C:87.4% F:4.3%; M:8.3% | 83.6 | 1,523 | 27.4 | |
| 99.9 | NA | C: 56.45 (1.95) P: 76.61 | C:54.1%F:28.7%; M:17.2% | 3.5 | 81,111 | 29.7 | |
| 74.9 | NA | C: 77.42 (1.71) P: 83.87 | C:76.5% F:7.6%; M:15.9% | 13.3 | 9,134 | 30.2 | |
| 209.2 | NA | C: 95.56 (2.92) P :96.77 | C:87.8% F:4.0%; M:8.2% | 1,712 | 327 | 30.2 | |
| 38.18 | NA | C: 84.27 (1.34) P: 90.73 | C:73.6% F:15.2%; M:11.2% | 20.4 | 4,304 | 23.05 |
*This work.
Fig. 2Relative DNA staining in nuclei of M. enterolobii. Cytogram example obtained after gating on G0/G1 nuclei (arbitrary units) from M. enterolobii when processed mixed together with C. elegans, the internal standard (diploid genome size is 200 Mb).
Estimation of the protein set completeness with BUSCO.
| BUSCO genes (303) | ||
|---|---|---|
| Complete | 94.7% (287) | 78.2% (237) |
| Single-copy complete | 9.2% (28) | 45.9% (139) |
| Duplicated complete | 85.5% (259) | 32.3% (98) |
| Fragmentary | 3.0% (9) | 11.2% (34) |
| Missing | 2.3% (7) | 10.6% (32) |
*This work.
Fig. 3Number of CDS mapping n loci in Meloidogyne genomes. Number of CDS mapping with >95% identity on >66% on their length at n loci on the M. enterolobii (red), M. incognita (green) and M. hapla (blue) genomes.
Fig. 4Distribution of genes based on the duplication depth in the collinear blocks. The peak of the distribution is observed at a depth of three, suggesting a triploid genome.
Fig. 5Nucleotide identity computed from NUCmer alignments between homologous duplicated blocks. M. enterolobii (blue) shows one major peak at 89.9% and two smaller ones at 94.2% and 99.9% nucleotide identity, whereas M. incognita (green) shows almost two overlapping peaks at 89.5% and 91.4% of identity.
Fig. 6Transposable Elements annotation in the genome of M. enterolobii. Canonical-TE annotations distribution as a genome percentage. Annotations are grouped according to Wicker’s classification[48].
Summary of M. enterolobii Canonical-TE annotations statistics.
| order | nb. of features | sum of features length (bp) | median features length (bp) | median identity with consensus (%) | |
|---|---|---|---|---|---|
| Retro-transposons | CLASS_1_LIKE | 86 | 624,868 | 7,197 | 98.65 |
| LTR | 647 | 3,329,701 | 4,012 | 97.55 | |
| DIRS | 58 | 494,483 | 8,323.5 | 97.9 | |
| LINE | 343 | 1,379,884 | 3,798 | 97.1 | |
| SINE | 36 | 36,906 | 1,076 | 99.5 | |
| LARD | 108 | 233,797 | 1,795.5 | 93.83 | |
| TRIM | 469 | 794,644 | 690 | 97.9 | |
| DNA-transposons | CLASS_2_LIKE | 46 | 184,474 | 1,193.5 | 99 |
| TIR | 3,974 | 4,415,242 | 918 | 96.95 | |
| HELITRON | 179 | 150,764 | 8,328 | 98.9 | |
| MAVERICK | 546 | 5,253,812 | 8,954 | 97.3 | |
| MITE | 4,452 | 264,544 | 590 | 96.4 | |
| Measurement(s) | genome • sequence_assembly • sequence feature annotation |
| Technology Type(s) | DNA sequencing assay • sequence assembly process • sequence annotation |
| Sample Characteristic - Organism | Meloidogyne enterolobii |