| Literature DB >> 25400655 |
Marco A Cristancho1, David Octavio Botero-Rozo2, William Giraldo1, Javier Tabima2, Diego Mauricio Riaño-Pachón3, Carolina Escobar1, Yomara Rozo1, Luis F Rivera1, Andrés Durán1, Silvia Restrepo3, Tamar Eilam4, Yehoshua Anikster4, Alvaro L Gaitán1.
Abstract
Coffee leaf rust caused by the fungus Hemileia vastatrix is the most damaging disease to coffee worldwide. The pathogen has recently appeared in multiple outbreaks in coffee producing countries resulting in significant yield losses and increases in costs related to its control. New races/isolates are constantly emerging as evidenced by the presence of the fungus in plants that were previously resistant. Genomic studies are opening new avenues for the study of the evolution of pathogens, the detailed description of plant-pathogen interactions and the development of molecular techniques for the identification of individual isolates. For this purpose we sequenced 8 different H. vastatrix isolates using NGS technologies and gathered partial genome assemblies due to the large repetitive content in the coffee rust hybrid genome; 74.4% of the assembled contigs harbor repetitive sequences. A hybrid assembly of 333 Mb was built based on the 8 isolates; this assembly was used for subsequent analyses. Analysis of the conserved gene space showed that the hybrid H. vastatrix genome, though highly fragmented, had a satisfactory level of completion with 91.94% of core protein-coding orthologous genes present. RNA-Seq from urediniospores was used to guide the de novo annotation of the H. vastatrix gene complement. In total, 14,445 genes organized in 3921 families were uncovered; a considerable proportion of the predicted proteins (73.8%) were homologous to other Pucciniales species genomes. Several gene families related to the fungal lifestyle were identified, particularly 483 predicted secreted proteins that represent candidate effector genes and will provide interesting hints to decipher virulence in the coffee rust fungus. The genome sequence of Hva will serve as a template to understand the molecular mechanisms used by this fungus to attack the coffee plant, to study the diversity of this species and for the development of molecular markers to distinguish races/isolates.Entities:
Keywords: RNA-seq; coffee; coffee rust; genetic variants; genome; plant pathogens diversity
Year: 2014 PMID: 25400655 PMCID: PMC4215621 DOI: 10.3389/fpls.2014.00594
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Nuclear DNA content of Hva urediniospore samples measured by FCM.
| La Alcancía, Antioquia | – | 1.63 | |
| El Cedral, Pereira, Risaralda | – | 1.55 | |
| Santa María, Antioquia | 1.29 | – | |
| Chinchiná, Caldas | 1.17 | – | |
| Chinchiná, Caldas | – | 1.76 | |
| Chinchiná, Caldas | – | 1.62 | |
| Chinchiná, Caldas | 1.21 | – | |
| Chinchiná, Caldas | 1.18 | – | |
| Chinchiná, Caldas | – | 1.70 | |
| Chinchiná, Caldas | – | 1.70 | |
| Mean | 1.21 | 1.66 | |
| Standard deviation | 0.05 | 0.08 | |
| CV% | 5.87 | 6.41 | |
Hva urediniospore samples were stained with Propidium Iodide for Flow Cytometry fluorescence measures following the protocol described by Eilam et al. (.
Statistics gathered from the genome assemblies of Hva individual isolates and the hybrid assembly.
| HvHybrid | 412,417,464 | 359,076,496 | 396,264 | 333,258,024 | 91.94% | |
| HvCat 454 | 5,860,446 | |||||
| HvCat Illumina | 48,396,016 | 43,704,716 | 254,645 | 122,820,521 | 57.26% | |
| Hv387 | 58,593,986 | 50,782,526 | 211,495 | 150,707,107 | 44.35% | |
| Hv494 | H89: | 55,326,774 | 47,738,686 | 211,728 | 138,293,025 | 35.08% |
| HvDQ952 | F2 – | 43,875,056 | 38,844,164 | 197,927 | 121,119,448 | 31.85% |
| HvH_179 | H3101: ( | 49,025,718 | 42,033,780 | 203,770 | 131,574,289 | 31.05% |
| HvH_569 | H3041: ( | 51,960,392 | 45,000,606 | 202,168 | 133,358,100 | 37.50% |
| HvH_701 | H2094: ( | 60,634,018 | 53,080,264 | 215,628 | 158,292,515 | 56.86% |
| HvMar_1 | 44,605,504 | 39,408,690 | 203,360 | 125,814,765 | 22.18% |
Clean reads were obtained after quality trimming and removal of duplicates.
Completeness of the genome was calculated running the software CEGMA with a set of 248 ultra-conserved Core Eukaryotic Genes (CEGs) (Parra et al., .
The hybrid assembly was generated by the combination of all short reads from the eight isolates.
Eight and a half plates were sequenced with 454 technology.
Summary of the Hva genome hybrid assembly.
| N° Contigs assembled | 396,264 | |
| N° Scaffolds assembled | 302,466 | |
| Total residues assembled | 333,481,311 | |
| Length | Max | 85,126 |
| Average | 841.56 | |
| N50 | 1,59 | |
| Reads | Total | 336,649,188 |
| Unassembled | 197,88,611 | |
| Assembled | 316,860,577 | |
| Multihit | 37,520,793 | |
| Potential pairs | ||
| Paired | 78,105,740 | |
| Not Paired | 255,469,308 | |
The 454 and Illumina clean reads were assembled and the same reads were mapped against this set of assembled contigs using the software CLC Assembly Cell v4.0.1.
Figure 1Plot representing the reads coverage per contig in the HvHybrid assembly. Low read coverage was identified in most contigs.
Figure 2Association between number of reads and size of contigs in the HvHybrid Assembly. Over-coverage was identified in short contigs.
Figure 3Association between contig size and coverage. Contigs were clustered by color by running an R script: 0–5 low coverage, 5–45 good coverage, 45–100 high coverage and more than 100 over-coverage.
Figure 4Analysis of sequence contamination in the HvHybrid assembly. The HvHybrid assembled contigs were searched for contaminants with the program Megan4 (Huson et al., 2001). Each node is labeled by a taxon and the number of reads assigned to the taxon. The size of a node (circle size) is scaled logarithmically to represent the number of assigned reads. A low bacterial and plant sequence contamination was identified in the assembly.
Figure 5Homology of Hva mitochondrial sequences to the mitochondrial genome of PGT. Clone_161 and clone_171 sequences from HvHybryd assembly were compared to the mitochondrial genome of P. graminis f.sp tritici (Supercontig_2.393; 79,2Kb) by running BLAST sequence similarities (e = 1e-5). Homologous regions are illustrated with Circoletto (Darzentas, 2010).
Homology of Hva transcript datasets to Pucciniales predicted proteins.
| Hva (gU) | 4267 | 91.0% (3884) | 93.1% (3973) | 93.9% (4007) |
| 3627 | 59.2% (2147) | 63.1% (2290) | 63.8% (2315) | |
| 4465 | 50.0% (2232) | 49.1% (2202) | 49.6% (2229) | |
| 11,630 | 42.2% (18,703) | 40.0% (22,240) | 36.6% (23,705) | |
| 15,979 | 43.8% (19,411) | 41.5% (23,129) | 37.8% (24,480) | |
| 22,815 | 43.5% (19,257) | 40.4% (22,544) | 36.9% (23,874) | |
| 16,694 | 39.2% (17,385) | 40.8% (22,765) | 37.8% (21,302) | |
I. Homology sequence analysis between Hva transcriptomes datasets (this study) and Hva germinating urediniospores, appresoria, and haustoria transcript sequences (gU, Ap, H, Talhinhas et al., .
Hva samples described in Fernandez et al. (.
Figure 6Venn diagram showing the consensus set of Hva secreted predicted proteins. PProwler (0.9 probability cut-off) and SignalP were used to predict secreted proteins. A set of 14,445 putative proteins was used for classification into secreted and non-secreted proteins. The results were compared with Hva secreted proteins predicted by Fernandez et al. (2012). Hva secreted proteins predictions: (I) PProwler (this study). (II) SignalP (this study). (III) Secreted proteins predicted by Fernandez et al. (2012).