| Literature DB >> 29979772 |
Alessandra Dal Molin1, Andrea Minio1, Francesca Griggio1, Massimo Delledonne1, Alessandro Infantino2, Maria Aragona2.
Abstract
The first draft genome sequencing of the non-model fungal pathogen Pyrenochaeta lycopersici showed an expansion of gene families associated with heterokaryon incompatibility and lacking of mating-type genes, providing insights into the genetic basis of this "imperfect" fungus which lost the ability to produce the sexual stage. However, due to the Illumina short-read technology, the draft genome was too fragmented to allow a comprehensive characterization of the genome, especially of the repetitive sequence fraction. In this work, the sequencing of another P. lycopersici isolate using long-read Single Molecule Real-Time sequencing technology was performed with the aim of obtaining a gapless genome. Indeed, a gapless genome assembly of 62.7 Mb was obtained, with a fraction of repetitive sequences representing 30% of the total bases. The gene content of the two P. lycopersici isolates was very similar, and the large difference in genome size (about 8 Mb) might be attributable to the high fraction of repetitive sequences detected for the new sequenced isolate. The role of repetitive elements, including transposable elements, in modulating virulence effectors is well established in fungal plant pathogens. Moreover, transposable elements are of fundamental importance in creating and re-modelling genes, especially in imperfect fungi. Their abundance in P. lycopersici, together with the large expansion of heterokaryon incompatibility genes in both sequenced isolates, suggest the presence of possible mechanisms alternative to gene re-assorting mediated by sexual recombination. A quite large fraction (~9%) of repetitive elements in P. lycopersici, has no homology with known classes, strengthening this hypothesis. The availability of a gapless genome of P. lycopersici allowed the in-depth analysis of its genome content, by annotating functional genes and TEs. This goal will be an important resource for shedding light on the evolution of the reproductive and pathogenic behaviour of this soilborne pathogen and the onset of a possible speciation within this species.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29979772 PMCID: PMC6034849 DOI: 10.1371/journal.pone.0200217
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics of P. lycopersici ER1518 genome assembly.
| Assembly length (bp) | 62,731,747 |
| Number of unitigs | 188 |
| GC content (%) | 46 |
| Average sequence length (Kbp) | 333.7 |
| Minimum sequence length (Kbp) | 10.4 |
| Maximum sequence length (Kbp) | 2,540 |
| N50 (Kbp) | 1,076.4 |
| N90 (Kbp) | 214.4 |
Statistics of P. lycopersici ER1518 gene prediction.
| 14,186 | 14,058 | 5,032 | 6,470 | 16,141 | 17,696 | 9,820 | 12,372 | 12,469 | 11,799 | 12,169 | |
| 289.91 | -75.93 | -301.12 | 4,923.65 | 11.99 | 425.59 | 81.43 | -78.89 | 93.81 | -268.50 | -200.26 | |
| 832 | 801 | 693 | 5,363.50 | 681.50 | 1061 | 1,269 | 728 | 638 | 681 | 758 | |
| 1,473.55 | 1,434.01 | 1,536.64 | 1,587.31 | 1,092.85 | 1,345.05 | 1,529.48 | 1,316.24 | 1,255.48 | 1,387.96 | 1,423.31 | |
| 2.78 | 2.63 | 3.40 | 2.77 | 2.44 | 2.70 | 2.74 | 2.67 | 2.82 | 2.50 | 2.67 | |
| 530.81 | 544.71 | 451.92 | 573.77 | 447.03 | 497.44 | 557.36 | 493.83 | 444.72 | 554.27 | 533.59 | |
| 315 | 335 | 240 | 337 | 257 | 287 | 266 | 294 | 213 | 311 | 305 | |
| 95.02 | 100.30 | 88.79 | 67.27 | 85.72 | 100.48 | 134.89 | 89.13 | 102.05 | 88.62 | 113.40 | |
| 4,399 | 8,348 | 1,286 | 1,468 | 2,564 | 2,369 | 1,273 | 1,107 | 5,774 | 5,049 | 2,107 | |
| 3,542 | 3,989 | 639 | 1,332 | 5,111 | 4,392 | 1,832 | 2,263 | 3,016 | 3,176 | 2,726 | |
| 25 | 28 | 12 | 21 | 32 | 25 | 19 | 29 | 24 | 27 | 22 |
Comparison of gene structural features of Pyrenochaeta lycopersici ER1518 (PLY ER1518) with other 10 ascomycetes. Abbreviations: Pyrenochaeta lycopersici ER1211 (PLY ER1211), Aspergillus nidulans (AN), Blumeria graminis (BG), Colletotrichum higgisinianum (CH), Fusarium oxysporum (FO), Neurospora crassa (NC), Leptosphaeria maculans (LM), Phaeospheria nodorum (PN, Pyrenophora teres (PTT) and Pyrenophora tritici-repentis (PTR).
Fig 1Heatmap of OrthoMCL orthologous groups for the most interesting Pfam protein and CAZymes domains identified in P. lycopersici ER1518 (PLY ER1518) and ten other fungal pathogens.
The heatmap represents the type and the number of domains (rows) for each fungus (columns). The Z-score indicates that the values have been centred and scaled by rows (domains), so that negative z-scores are more likely coloured in red and high z-scores in white. Abbreviations: PLY ER1211, P. lycopersici ER1211; AN, Aspergillus nidulans; BG, Blumeria graminis; CH, Colletotrichum higgisinianum; FO, Fusarium oxysporum; LM, Leptosphaeria maculans; NC, Neurospora crassa; PTT, Pyrenophora teres; PTR, Pyrenophora tritici-repentis; PN, Phaeospheria nodorum. CMB, Carbohydrate-Binding Modules; CE, carbohydrate esterases; GH, Glycoside Hydrolases; GT, Glycosyl-Transferases; PL, Polysaccharide Lyases; HET, HETerokaryon Incompatibility-related domains, NB-ARC, Nucleotide-Binding Adaptor shared by APAF-1, R proteins, and CED-4 domain; NACHT, Neuronal Apoptosis inhibitor; ANK, ankyrin; TPR, tetratricopeptide; ABC, ATP-Binding Cassette transporters; MFS, Major Facilitator domains.
Statistics of orthologous analysis performed with OrthoMCL.
| Total number of orthologous groups with ten Ascomycetes | 16,307 |
| Number of groups with | 6,510 |
| Number of | 172 |
| Number of groups in common with | 3,535 |
| Number of | 6,828 |
Fig 2Phylogenetic tree of RPB2 protein of P. lycopersici ER1518 and other ten ascomycetes obtained with PhyML 3.0 and drawn with MEGA7.
The tree is drawn to scale, with bootstrap values on branches and branch lengths measured in the number of substitutions per site.
Fig 3Circular representation of genomic features.
Circular representation of the assembled sequences (length > 10Kb) of P. lycopersici ER1518 genome reporting the distribution of the following features: A) Repetitive elements count (blue); B) Gene density (green); C) Sequence identity percentage (red) of P. lycopersici ER1211 genomic sequences based on pairwise alignment between genome assemblies performed with MUMmer.
Statistics of repeat annotation and masking of P. lycopersici.
TE classes have been reported according to Wicker classification [10].
| Class | Superfamily | Genome Coverage (bp) | Average length (bp) | Number of copies (complete) |
|---|---|---|---|---|
| 1,761,880 | 5,999.7 | 122 (35) | ||
| 5,850,079 | 7,465.3 | 275 (58) | ||
| 160,301 | 8,039.3 | 55 (4) | ||
| 235,349 | 11,579.0 | 4 (0) | ||
| 690,466 | 8,733.4 | 28 (10) | ||
| 480 | 480.0 | 1 (0) | ||
| 1,498,181 | 6,046.0 | 59 (0) | ||
| 392,991 | 1,453.2 | 33 (0) | ||
| 3,366,772 | 3,664.3 | 135 (47) | ||
| 2,316,380 | 4,024.8 | 92 (33) | ||
| 606,580 | 3,193.3 | 34 (4) | ||
| 914,995 | 9,827.4 | 23 (2) | ||
| 79,701 | 4,401.0 | 1 (1) | ||
| 64,754 | 14,614.5 | 2 (0) | ||
| 89,568 | 1,073.0 | 19 (0) | ||
| 1,148,396 | 2,770.4 | 89 (0) | ||
| 19,176,873 | - | 972 (194) |
Fig 4Repeat content comparative analysis among P. lycopersici ER1518 and other five ascomycetes.
Histogram of percentages of different TE categories respect to the total annotated TEs reported for each species.