| Literature DB >> 26438312 |
Rahul Sharma1,2,3,4, Xiaojuan Xia5,6,7, Liliana M Cano8,9, Edouard Evangelisti10, Eric Kemen11, Howard Judelson12, Stan Oome13, Christine Sambles14, D Johan van den Hoogen15, Miloslav Kitner16, Joël Klein17, Harold J G Meijer18, Otmar Spring19, Joe Win20, Reinhard Zipper21, Helge B Bode22, Francine Govers23, Sophien Kamoun24, Sebastian Schornack25, David J Studholme26, Guido Van den Ackerveken27, Marco Thines28,29,30,31,32.
Abstract
BACKGROUND: Downy mildews are the most speciose group of oomycetes and affect crops of great economic importance. So far, there is only a single deeply-sequenced downy mildew genome available, from Hyaloperonospora arabidopsidis. Further genomic resources for downy mildews are required to study their evolution, including pathogenicity effector proteins, such as RxLR effectors. Plasmopara halstedii is a devastating pathogen of sunflower and a potential pathosystem model to study downy mildews, as several Avr-genes and R-genes have been predicted and unlike Arabidopsis downy mildew, large quantities of almost contamination-free material can be obtained easily.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26438312 PMCID: PMC4594904 DOI: 10.1186/s12864-015-1904-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Genome assembly quality assessment in terms of length of the shortest scaffold in each N-class and the number of scaffolds. The quality of the genome assembly was assessed by first sorting all 3143 nuclear scaffolds length-wise from the largest to the smallest scaffold. Then N-classes were defined, where N represents the percentage of genome covered by considering the assembled genome size. The length given for each N-class represents the length of the smallest scaffold present in that particular N-class. The number of scaffolds represents the number of scaffolds present in the respective N-class. The sharp rise after N98 represents the unresolved small contigs, the majority of which are repeat elements
Genetic features of oomycete genomes
|
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|---|
| Assembled genome size (Mb) | 75.32 | 32.76 | 78.89 | 64.02 | 228.54 | 66.65 | 82.6 | 44.91 | 53.09 |
| N50 scaffold size (Mb) | 1.54 | 0.06 | 0.33 | 0.7 | 1.58 | 0.3 | 7.6 | 0.83 | 0.28 |
| N50 count | 16 | 130 | 70 | 29 | 38 | 63 | 4 | 19 | 46 |
| Longest scaffold size (Mb) | 3.42 | 0.58 | 1.23 | 2.71 | 6.92 | 1.24 | 13.39 | 1.82 | 1.61 |
| Number of scaffolds | 3,162 | 3,827 | 3,044 | 917 | 4,921 | 2,576 | 83 | 975 | 1,442 |
| Genes | 15,469 | 13,804 | 14,321 | 19,805 | 17,787 | 16,066 | 26,584 | 15,322 | 20,088 |
| CDS | 40,334 | 43,014 | 28,165 | 42,673 | 49,146 | 40,639 | 63,242 | 39,949 | 79,762 |
| Gaps (N %) | 11.32 | 0 | 10.22 | 12.47 | 16.81 | 18.35 | 3.96 | 4.72 | 9.33 |
| Repeat elements (%) | 39 % | 22 % | 43 % | 19 % | 74 % | 28 % | 39 % | 7 % | 40 % |
| Secretomea | 631 (631) | 262 (672) | 649 (1054) | 1141 (1176) | 1501 (1588) | 1339 (1523) | 1978 (1867) | 926 (843) | 1256 (1255) |
| Genome | |||||||||
| AT % | 54.70 | 55.65 | 52.78 | 49.57 | 49.03 | 46.14 | 45.39 | 47.69 | 41.54 |
| GC % | 45.29 | 44.34 | 47.21 | 50.42 | 50.96 | 53.85 | 54.60 | 52.30 | |
| Coding sequences | |||||||||
| AT % | 54.02 | 54.29 | 46.87 | 46.62 | 45.99 | 41.95 | 41.55 | 43.4 | 37.67 |
| GC % | 45.96 | 45.71 | 53.11 | 53.37 | 54.02 | 58.05 | 58.45 | 56.6 | 62.33 |
| CEGMA | |||||||||
| Complete KOG mapping (%) | 97.18 % | 93.40 % | 95.39 % | 98.00 % | 96.76 % | 96.44 % | 98.02 % | 97.13 % | 97.20 % |
| Partial KOG mapping (%) | 98.41 % | 95.94 % | 98.10 % | 98.83 % | 98.03 % | 98.45 % | 99.24 % | 97.58 % | 98.79 % |
aNumbers in bracket represent the published secretome size
Fig. 2Genome completeness and continuity assessments in terms of core housekeeping genes. Genome completeness in terms of core eukaryotic genes was assessed using the CEGMA pipeline. The CEGMA pipeline has categorized 458 core genes into 4 groups on the basis of their conservation, from the least conserved group 1 to the most conserved group 4. a Genome completeness in terms of complete mapping. b Genome completeness estimations in terms of partial mapping
Candidate pathogenicity related genes in oomycetes genomes
|
|
|
|
|
| |
|---|---|---|---|---|---|
| ATP-binding cassette (ABC) transportera | 32 | 35 (53) |
| 112 (173) | 26 (41) |
| Phospholipasea | 17 | 23 (13) |
| 25 (20) | 19 (13) |
| Lipasea | 24 | 30 (10) |
| 36 (31) | 23 (12) |
| Cysteine proteasea | 54 | 51 (7) | 64 (33) |
| 48 (16) |
| Serine proteasea | 62 | 73 (34) | 106 (60) |
| 52 (−) |
| Aspartic proteasea | 15 | 14 (9) | 19 (12) |
| 16 (10) |
| Cutinaseb | 2 | 2 (2) |
| 0 (0) | 3 (2) |
| NPP1-like (necrosis-inducing proteins)b | 19 | 21 |
| 7 (7) | 0 (0) |
| Pectate lyasesb | 3 | 8 (8) |
| 15 (15) | 0 (1) |
| Cytochrome P450sb | 14 | 18 (16) | 30 (19) |
| 3 (3) |
| Pectin esteraseb | 5 | 4 (4) |
| 0 (0) | 0 (0) |
| Elictins likeb | 16 | 16 (1) | 45 (40) |
| 9 (3) |
| RxLR effector family candidatesc | 274 | 134 |
| 0 | 49 |
| Crincklers (CRN family candidates)c | 77 | 20 |
| 26 | 3 |
aGenerated using PANTHER; bFrom InterproScan; cGenerated manually; Numbers in bracket represent the published number of predicted genes. Numbers in bold represent the highest number of genes
Fig. 3Phylogenetic relationship of deeply sequenced oomycetes. The phylogenetic analysis was done by considering the core orthologous genes predicted by the CEGMA pipeline. Multiple sequence alignments were performed using Mafft and phylogenetic relationships were inferred using the Maximum Likelihood algorithm as implemented in RAxML. Number on branches correspond to support values from 1000 bootstrap replicates
Fig. 4Number of ortholog groups within oomycete genomes. The number of ortholog groups among the genomes of Hy. arabidopsidis, Ph. capsici, Ph. infestans, Ph. sojae, and Pl. halstedii. a Number of ortholog groups found within the five genomes considering all protein-coding genes. b Number of ortholog groups within the five genomes considering all PSEP-encoding genes. Numbers in brackets represent the total number of genes tested in the analyses. Asterisks denote 1:1 orthologs among the five genomes
Fig. 5Heat maps illustrating gene density of the Pl. halstedii genome. Gene density as estimated by calculating the 5′ and 3′ flanking distances of (a) all protein encoding genes, (b) core genes (c) non-secreted protein encoding genes (d) secreted protein encoding genes, (e) candidate RxLR-like protein encoding genes, (f) CRN-like protein encoding genes. Grey shading highlights the area with both 5′ and 3′ distances below 3 kb
Fig. 6Features of promoters. a A + T content of coding regions and 50-nt intervals within promoters from Pl. halstedii, Ph. infestans, and Hy. arabidopsidis. b Distribution of motifs in different Straminipila. Searches for the INR + FPR supra-motif, INR, FPR, and DPEP were performed in five oomycetes (Ph. infestans, Pl. halstedii, Hy. arabidopsidis, Py. ultimum, Sa. parasitica) and the diatom Thalassiosira pseudonana. Bars show the percentage of promoters within each species that contain the motifs within 200 nt of the start codon, corrected for false discovery. The figure on the left is a neighbor-joining tree based on ribosomal RNA and internal transcribed spacer (ITS) sequences. c Positional bias of INR + FPR supra-motif and CCAAT within Pl. halstedii promoters. The right of the panel compares the content of the two motifs in Ph. infestans and Pl. halstedii
Summary of protease inhibitor effectors from seven pathogenic oomycete species
| Description | No. of protease inhibitors effectors | No. of Kazal-like inhibitor effectors | Highest No. of Kazal-like domains | No. of cystatin-like inhibitor effectors | Highest No. of cystatin-like domains |
|---|---|---|---|---|---|
|
| 41 | 33 | 7 | 8 | 1 |
|
| 21 | 15 | 5 | 6 | 1 |
|
| 5 | 1 | 4 | 4 | 1 |
|
| 10 | 8 | 2 | 2 | 2 |
|
| 23 | 19 | 5 | 4 | 1 |
|
| 14 | 8 | 5 | 6 | 3 |
|
| 2 | 1 | 5 | 1 | 2 |
a-d,fPathogenic oomycete species with available whole genome sequences
eGenome sequence and effector annotation is described in this study
gOomycete species where there are only expressed sequence tag (EST) data [102]. This genome may contain more protease inhibitors that were not detected in the transcriptome analysis
Fig. 7Features of RxLR-dEER-like effectors and frequency of the RxLR and RxLR-dEER-like proteins in the genome of Pl. halstedii: a Sequence features of the RxLR-dEER-like proteins were calculated from predicted putative RxLR-like proteins. Numbers in brackets represent the minimum and maximum values of distances and number in italics represents the corresponding mean value. Multiple sequence alignments were performed by using Mafft and sequence logos were generated using jalview. b Bar plot representing the number of RxLR-like and RxLR-dEER-like proteins in the predicted secretome of Pl. halstedii
28 Orthologs of putative RxLR-like secreted proteins in four Phytophthora spp.
| Ortholog count |
|
|
|
| Annotation representative gene |
|
|---|---|---|---|---|---|---|
| 1 | Pca_16635 | PITG_07736T0; PITG_19803T0; PITG_13535T0; PITG_13537T0; PITG_13536T0; PITG_13534T0; PITG_07766T0 | Prm_76660; Prm_78801; Prm_78158; Prm_81834; Prm_76663; Prm_76672 | Pso_287018 | PITG_07736T0 | Secreted RxLR effector peptide protein, putative |
| 2 | Pca_102742 | PITG_14880T0; PITG_14884T0; PITG_13847T0 | Prm_74367; Prm_79110; Prm_86912; Prm_79108; Prm_79107; Prm_74387; Prm_79119; Prm_85872 | Pso_285707; Pso_285703 | PITG_14880T0 | RXLR effector family protein, putative |
| 3 | Pca_13936; Pca_13937; Pca_13953 | PITG_06305T0; PITG_06290T0 | Prm_83582; Prm_77765; Prm_85589; Prm_77763; Prm_77786; Prm_74178 | Pso_286631; Pso_354880 | PITG_06305T0 | Secreted RxLR effector peptide protein, putative |
| 4 | Pca_14162; Pca_39353 | PITG_05841T0; PITG_05846T0; PITG_06308T0; PITG_11952T0; PITG_15679T0 | Prm_73724; Prm_86166; Prm_73707 | Pso_285308; Pso_286675 | PITG_05841T0 | Secreted RxLR effector peptide protein, putative |
| 5 | Pca_10713 | PITG_07566T0; PITG_07569T0 | Prm_81825; Prm_81822; Prm_81823; Prm_78748 | Pso_336774; Pso_286958 | PITG_07566T0 | Secreted RxLR effector peptide protein, putative |
| 6 | Pca_5670; Pca_133116; Pca_107349 | PITG_17063T0; PITG_18404T0 | Prm_81907; Prm_81908 | Pso_284378 | PITG_17063T0 | Secreted RxLR effector peptide protein, putative |
| 7 | Pca_121504; Pca_19144; Pca_536383 | PITG_15556T0 | Prm_82880 | Pso_288650; Pso_288648; Pso_288647 | PITG_15556T0 | Secreted RxLR effector peptide protein, putative |
| 8 | Pca_572048 | PITG_13093T0 | Prm_86463 | Pso_356035; Pso_288906; Pso_358111; Pso_292791; Pso_288815 | PITG_13093T0 | Secreted RxLR effector peptide protein, putative |
| 9 | Pca_14853; Pca_15117; Pca_19651 | PITG_12276T0; PITG_11839T0 | Prm_76339 | Pso_288968 | PITG_12276T0 | Secreted RxLR effector peptide protein, putative |
| 10 | Pca_538116; Pca_97196 | PITG_07556T0; PITG_07558T0 | Prm_77948; Prm_77945 | Pso_353461 | PITG_07556T0 | Secreted RxLR effector peptide protein, putative |
| 11 | Pca_548556 | PITG_12952T0; PITG_10654T0; PITG_02900T0 | Prm_80526 | Pso_284479 | PITG_12952T0 | Secreted RxLR effector peptide protein, putative |
| 12 | Pca_20942 | PITG_18986T0 | Prm_76324 | Pso_286791; Pso_286793; Pso_286162 | PITG_18986T0 | Secreted RxLR effector peptide protein, putative |
| 13 | Pca_119793 | PITG_15032T0 | Prm_78009 | Pso_286223; Pso_286248; Pso_286221 | PITG_15032T0 | Secreted RxLR effector peptide protein, putative |
| 14 | Pca_118417; Pca_124413 | PITG_06087T0 | Prm_81609 | Pso_286934 | PITG_06087T0 | Secreted RxLR effector peptide protein, putative |
| 15 | Pca_124376 | PITG_06099T0; PITG_06094T0 | Prm_81610 | Pso_286931 | PITG_06099T0 | Secreted RxLR effector peptide protein, putative |
| 16 | Pca_116645 | PITG_18405T0; PITG_10640T0 | Prm_81911 | Pso_284377 | PITG_18405T0 | Secreted RxLR effector peptide protein, putative |
| 17 | Pca_101904 | PITG_15226T0; PITG_15225T0 | Prm_83274 | Pso_285899 | PITG_15226T0 | Secreted RxLR effector peptide protein, putative |
| 18 | Pca_4454 | PITG_10116T0 | Prm_74395; Prm_74378 | Pso_288795 | PITG_10116T0 | Secreted RxLR effector peptide protein, putative |
| 19 | Pca_549194 | PITG_18397T0; PITG_18117T0 | Prm_81902 | Pso_476203 | PITG_18397T0 | Putative uncharacterized protein |
| 20 | Pca_101012 | PITG_04099T0 | Prm_85073 | Pso_286058 | PITG_04099T0 | Secreted RxLR effector peptide protein, putative |
| 21 | Pca_101423 | PITG_09585T0 | Prm_75817 | Pso_361266 | PITG_09585T0 | Secreted RxLR effector peptide protein, putative |
| 22 | Pca_129643 | PITG_15287T0 | Prm_78400 | Pso_286050 | PITG_15287T0 | PexRD1 secreted RxLR effector peptide, putative |
| 23 | Pca_19601 | PITG_11947T0 | Prm_78163 | Pso_246483 | PITG_11947T0 | Secreted RxLR effector peptide protein, putative |
| 24 | Pca_508923 | PITG_04668T0 | Prm_84933 | Pso_533029 | PITG_04668T0 | Polysaccharide lyase, putative |
| 25 | Pca_536039 | PITG_09824T0 | Prm_77012 | Pso_329838 | PITG_09824T0 | Metalloprotease family M12A, putative |
| 26 | Pca_546134 | PITG_13256T0 | Prm_83882 | Pso_354514 | PITG_13256T0 | Putative uncharacterized protein |
| 27 | Pca_558196 | PITG_13007T0 | Prm_76705 | Pso_520326 | PITG_13007T0 | Putative uncharacterized protein |
| 28 | Pca_129113 | PITG_15142T0 | Prm_85377 | Pso_286249 | PITG_15142T0 | Secreted RxLR effector peptide protein, putative |
Fig. 8Orthologs of RxLR-dEER-like proteins within downy mildew pathogen genomes and Phytophthora spp. genomes: High confidence RxLR-dEER-like proteins from the secretome of downy mildew and Phytophthora spp. genomes were predicted and orthology analyses were performed with OrthoMCL to predict orthologs of RxLR-dEER-like proteins. Pha, Hpa, Pca, Pin, Pso, and Prm refer to Pl. halstedii, Hy. arabidopsidis, Ph. capsici, Ph. infestans, Ph. sojae, and Ph. ramorum, respectively. a Venn diagram showing the number of orthologs among the four Phytophthora spp. genomes. b Table summarising the number of orthologs shared by downy mildews and Phytophthora spp. genomes. c Sequence alignments of the three candidate orthologs of putative RxLR-dEER proteins among the six genomes. Multiple sequence alignments were performed using Mafft and alignment graphics were generated using Jalview. Cleavage sites predicted by SignalP are highlighted by red circles, RxLR/dEER-like motifs are highlighted by red boxes