| Literature DB >> 30847481 |
Yann Dussert1, Isabelle D Mazet1, Carole Couture1, Jérôme Gouzy2, Marie-Christine Piron3, Claire Kuchly4, Olivier Bouchez4, Claude Rispe5, Pere Mestre3, François Delmotte1.
Abstract
Downy mildews are obligate biotrophic oomycete pathogens that cause devastating plant diseases on economically important crops. Plasmopara viticola is the causal agent of grapevine downy mildew, a major disease in vineyards worldwide. We sequenced the genome of Pl. viticola with PacBio long reads and obtained a new 92.94 Mb assembly with high contiguity (359 scaffolds for a N50 of 706.5 kb) due to a better resolution of repeat regions. This assembly presented a high level of gene completeness, recovering 1,592 genes encoding secreted proteins involved in plant-pathogen interactions. Plasmopara viticola had a two-speed genome architecture, with secreted protein-encoding genes preferentially located in gene-sparse, repeat-rich regions and evolving rapidly, as indicated by pairwise dN/dS values. We also used short reads to assemble the genome of Plasmopara muralis, a closely related species infecting grape ivy (Parthenocissus tricuspidata). The lineage-specific proteins identified by comparative genomics analysis included a large proportion of RxLR cytoplasmic effectors and, more generally, genes with high dN/dS values. We identified 270 candidate genes under positive selection, including several genes encoding transporters and components of the RNA machinery potentially involved in host specialization. Finally, the Pl. viticola genome assembly generated here will allow the development of robust population genomics approaches for investigating the mechanisms involved in adaptation to biotic and abiotic selective pressures in this species.Entities:
Keywords: zzm321990 Plasmopara viticolazzm321990 ; PacBio; long reads; oomycete; plant–pathogen interactions
Mesh:
Year: 2019 PMID: 30847481 PMCID: PMC6660063 DOI: 10.1093/gbe/evz048
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Assembly and Annotation Statistics for the Genomes of Plasmopara viticola and Plasmopara muralis
| Assembly size (Mb) | 92.94 | 59.54 |
| Number of scaffolds | 358 | 1,437 |
| N50 (size in kb/count) | 706.5/38 | 184.5/101 |
| Max. scaffold size (Mb) | 2.85 | 1.03 |
| % GC | 44.85 | 44.29 |
| % of repeat elements | 37.34 | 10.43 |
| BUSCO genome [S/D/F] (%) | 95.7 [93.6/1.7/0.4] | 95.8 [94.9/0.0/0.9] |
| Number of genes | 15,960 | 12,194 |
| BUSCO proteome [S/D/F] (%) | 97.4 [94.0/3.0/0.4] | 93.6 [93.2/0.0/0.4] |
| Median coding sequence length [range] (bp) | 957 [105–40,497] | 999 [99–41,409] |
| Median exon length [range] (bp) | 237 [1–14,776] | 207 [1–14,941] |
| Median exon number [range] (bp) | 2 [1–33] | 2 [1–35] |
| Median intron length [range] (bp) | 67 [4–3,288] | 66 [4–3,991] |
| Number of single exon genes | 7,454 | 4,602 |
Note.—BUSCO analyses have been carried out with the Alveolata–Stramenopiles data set.
S, single copy; D, duplicated; F, fragmented genes.
Fig. 1.—Genomic architecture of Plasmopara viticola. Gene density (number of genes per window), repeat coverage (proportion of the window covered by repeated sequences), GC content and the percentage of heterozygous sites are represented for genomic windows of 100 kb for the 20 largest scaffolds of the assembly (total length: 32.1 Mb). Tick marks on scaffolds represent 100 kb. The percentage of heterozygous sites is shown on a log scale.
Fig. 2.—Comparative analysis of 21 oomycete proteomes. (A) UpSet plot of the orthology analysis (3,030 ortholog groups out of 18,115 are represented). Bars represent the number of ortholog groups common to different species denoted by black dots. (B) Phylogenetic relationships and number of genes common to or specific to oomycete species. The maximum likelihood species tree (left) was built with 144 single-copy orthologs. All nodes had a bootstrap support of 100% unless otherwise indicated. The species tree with branch lengths is available in supplementary figure 4, Supplementary Material online. The bars on the right represent the numbers of genes in different categories. For both figures, biotrophic species names are shown in bold green typeface.
Putative Pathogenicity-Related Proteins Found in Plasmopara viticola and Plasmopara muralis
| Secreted proteins | 1,592 | 720 |
| Aspartic peptidases | 37 (2) | 19 (2) |
| Cutinases | 2 (1) | 2 (2) |
| Cysteine proteases | 54 (11) | 40 (8) |
| Cysteine-rich proteins | 22 (16) | 21 (14) |
| Elicitins | 19 (14) | 20 (14) |
| Glycosyl hydrolases | 195 (118) | 115 (53) |
| NPP1-like proteins | 10 (10) | 7 (5) |
| Pectate lyases | 25 (19) | 5 (3) |
| Pectin esterases | 21 (15) | 14 (5) |
| Protease inhibitors | 20 (13) | 15 (8) |
| Serine proteases | 161 (79) | 97 (27) |
| RxLRs | 540 | 143 |
| with WY motif | 317 | 66 |
| WY motif, no RxLR domain | 26 | 12 |
Note.—Number for proteins with a secretion peptide are in brackets except for secreted proteins and RxLRs, which all have a secretion peptide.
Fig. 3.—Local gene density and organization in Plasmopara viticola. Local gene density when considering (A) all genes, (B) core genes found in ortholog groups common to all oomycetes, (C) genes encoding secreted proteins and (D) genes found only in Pl. viticola or in both Pl. viticola and Pl. muralis. The number of genes for 2D (5′ and 3′) intergenic distance bins is represented as a heatmap. (E) Mean distance between genes and their closest repeat element, represented as a heatmap for 2D intergenic distance bins. (F) Orientation and intergenic distance of adjacent gene pairs. The gene count for different intergenic distances has been plotted for adjacent gene pairs with a head-to-head (HH), tail-to-head (TH) and tail-to-tail (TT) orientation. Dashed vertical lines indicate the cutoffs used for the gene orientation analysis. Distances are shown on a log scale for all figures.
Fig. 4.—Evolution rates for ortholog pairs. (A) Pairwise dN/dS values for Plasmopara viticola and Plasmopara muralis orthologs, for different categories of genes. Core genes are genes found in ortholog groups common to all oomycetes and specific genes are found in ortholog groups common only to Pl. viticola and Pl. muralis. Average dN/dS values are represented by a black dot. N: number of ortholog pairs. (B) Local gene density in Pl. viticola for all genes (heatmap) and for genes with a dN/dS above 1 (black dots).
Function of Genes with a Signal of Positive Selection in the Branch-Site Tests
| Both Branches | |||
|---|---|---|---|
| Secreted proteins | 3 | 9 | 2 |
| Transmembrane transporters | 9 | 5 | 0 |
| RNA modifications and processing | 9 | 11 | 2 |
| Other functions | 101 | 101 | 14 |
| Unknown function | 9 | 31 | 0 |
Note.—The precise function of secreted proteins and transmembrane transporters are detailed in the main text. The functions of all proteins are available in supplementary tables 13 and 14, Supplementary Material online.
Including splicing factors, mRNA capping proteins, proteins with Lsm domain, RNA methyltransferases, chromodomain proteins, DEAD/DEAH box RNA helicases, RNA polymerase subunits and proteins involved in nonsense mediated mRNA decay.
Proteins with no mapped GO term and not belonging to any other categories in this table.