Literature DB >> 31560373

New High-Quality Draft Genome of the Brown Rot Fungal Pathogen Monilinia fructicola.

Rita Milvia De Miccolis Angelini1, Gianfranco Romanazzi2, Stefania Pollastro1, Caterina Rotolo1, Francesco Faretra1, Lucia Landi2.   

Abstract

Brown rot is a worldwide fungal disease of stone and pome fruit that is caused by several Monilinia species. Among these, Monilinia fructicola can cause severe preharvest and postharvest losses, especially for stone fruit. Here, we present a high-quality draft genome assembly of M. fructicola Mfrc123 strain obtained using both Illumina and PacBio sequencing technologies. The genome assembly comprised 20 scaffolds, including 29 telomere sequences at both ends of 10 scaffolds, and at a single end of 9 scaffolds. The total length was 44.05 Mb, with a scaffold N50 of 2,592 kb. Annotation of the M. fructicola assembly identified a total of 12,118 genes and 13,749 proteins that were functionally annotated. This newly generated reference genome is expected to significantly contribute to comparative analysis of genome biology and evolution within Monilinia species.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  brown rot; de novo assembly; genome annotation; next-generation sequencing; stone fruit; third-generation sequencing

Mesh:

Substances:

Year:  2019        PMID: 31560373      PMCID: PMC6795239          DOI: 10.1093/gbe/evz207

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Monilinia fructicola (phylum Ascomycota, family Sclerotiniaceae) is one of most important causal agents of brown rot, which is one of the main diseases of pome and stone fruit. Brown rot can cause severe yield losses during both field production and postharvest processing (Mari et al. 2012; Karaca et al. 2014; Oliveira Lino et al. 2016; Abate, Pastore, et al. 2018). Several species of the Monilinia genus are responsible for this disease, including Monilinia laxa and Monilinia fructigena (Villarino et al. 2013; Rungjindamai et al. 2014), although M. fructicola is considered the most destructive of these pathogens on stone fruit. Indeed, M. fructicola is a quarantine pathogen in the European Union and is included in List A2 of the European and Mediterranean Plant Protection Organisation (https://www.eppo.int/ACTIVITIES/plant_quarantine/A2_list; last accessed October 1, 2019). Moniliniafructicola was introduced into Europe in 2001 (Lichou et al. 2002), and then rapidly spread and became prevalent over the former indigenous species M. laxa and M. fructigena (CABI 2018; Abate, Pastore, et al. 2018). This fungus overwinters on mummified fruit or in infected plant tissues. In spring, conidia are produced under moist conditions, with wind dispersal, and infection of blossoms, causing blossom blight. However, the most severe yield losses occur in the postharvest phase, during fruit storage and transport (Feliziani et al. 2013; Martini and Mari 2014). The development of high-throughput sequencing technologies is quickly improving biological studies on fungal pathogens (Shendure et al. 2017). Whole genome knowledge provides a complete overview in terms of their potential virulence mechanisms, with the aim being to design improved disease-control approaches (Möller and Stukenbrock 2017). The availability of complete genomic resources can be very useful in the study of evolutionary dynamics and genetic adaptions to highly diverse environments (Mitchell-Olds et al. 2007). The availability of complete genomic data of M. fructicola gives the opportunity to the scientific community to perform studies aimed at exploring the population biology of the pathogen in more detail and its interactions with host plants (Dowling et al. 2019). This should be of help for an understanding of the reasons underlying the fitness of the pathogen and its rapid spread to new geographical areas. For these reasons, we have generated a high-quality draft genome and annotated protein-coding genes of M. fructicola that represents a great improvement over the draft genomes already available in public repositories, which are highly fragmented with no gene annotation (Rivera et al. 2018). A hybrid and hierarchical de novo assembly strategy was used to obtain the genome of the M. fructicola Mfrc123 strain through a combination of Illumina short reads next-generation sequencing, and Pacific Biosciences (PacBio) long reads third-generation sequencing, as previously reported for de novo assembly of the M. fructigena genome (Landi et al. 2018).

Materials and Methods

Sample Collection, Library Construction, and Sequencing

The monoconidial Mfrc123 strain (CBS 144850) of M. fructicola (NCBI:txid38448; supplementary fig. S1, Supplementary Material online) was isolated from cherry (Prunus avium) in 2014 during monitoring of Monilinia populations in southern Italy (Abate, Pastore, et al. 2018). The strain was grown in potato dextrose broth (infusion from 200 g peeled and sliced potatoes kept for 1 h at 60 °C, with 20 g dextrose, adjusted to pH 6.5, per liter) for 48 h at 25 ± 1 °C, in darkness and under shaking (150 rpm). The strain was characterized at both the phenotypic and molecular levels (Abate, De Miccolis Angelini, et al. 2018; Abate, Pastore, et al. 2018; De Miccolis Angelini et al. 2018). Genomic DNA was extracted using Gentra Puregene tissue kits (Qiagen, Milan, Italy), according to the manufacturer instructions. The genomic DNA quality and quantity were determined using a Nanodrop 2000 spectrophotometer (Thermo Fisher Scientific Inc., Wilmington, DE) and a Qubit 2.0 fluorimeter (Life Technologies Ltd., Paisley, UK). DNA integrity was analyzed (Bioanalyser 2100; Agilent Technologies, Santa Clara, CA). Genome sequencing for short 2× 100-bp paired-end reads (HiSeq 2500 platform; Illumina Sequencing Technology) and long 20-kb reads (RSII platform; PacBio Sequencing Technology) were both performed by an external service (Macrogen Inc., Next-Generation Sequencing Service, Geumcheon-gu, Seoul, South Korea). The Illumina short reads were analyzed for quality using FastX-tools and trimmed with the Trimmomatic (version 0.36) software (Bolger et al. 2014) with the following parameters: 1) LEADING and TRAILING = 3, which removes bases from the two ends of the reads if they are below a threshold quality of 3, 2) SLIDING WINDOW = 4:2, which cuts the reads when the average quality within the window composed of four bases falls below a threshold of 2, and 3) MINLEN = 50, which removes the reads shorter than 50 bp. Adaptor-free PacBio subreads were used for the genome assembly.

De Novo Assembly of the Genome

The genome of M. fructicola Mfrc123 strain was assembled by a hybrid and hierarchical assembly strategy, using all the reads from both the Illumina and PacBio sequencing to produce scaffolds through the DBG2OLC pipeline available from https://github.com/yechengxi/DBG2OLC; last accessed October 1, 2019 (Ye et al. 2016). SparseAssembler (Ye et al. 2012; https://github.com/yechengxi/SparseAssembler; last accessed October 1, 2019) was used to preassemble the cleaned Illumina short reads into contigs, using the following parameters: LD, 0; NodeCovTh, 1; EdgeCovTh, 0; k, 61; g, 15; and PathCovTh, 100, GS 50000000. The overlap and layout were performed with the DBG2OLC module (Ye et al. 2016; https://github.com/yechengxi/DBG2OLC; last accessed October 1, 2019) on the output contigs from SparseAssembler and the PacBio long subreads, to construct the assembly backbone from the best overlaps of reads using the following parameters: AdaptiveTh, 0.001; KmerCovTh, 2; and MiniOverlap, 20. In the final step, all of the related reads were aligned to each backbone using BLASR (Chaisson and Tesler 2012; https://github.com/PacificBiosciences/blasr; last accessed October 1, 2019), and the most likely sequence was calculated as the polished assembly using the consensus module Sparc (Ye and Ma 2016; https://github.com/yechengxi/Sparc; last accessed October 1, 2019), both with the default settings. The quality assessment of the draft assembly was performed by mapping RNA-Seq reads from the same M. fructicola Mfrc123 strain (De Miccolis Angelini et al. 2018) using the CLC Genomics Workbench v. 7.0.3 software (CLC Bio, Aarhus, Denmark), with the default parameters. Unmapped reads were then assembled using the CLC de novo assembler, and the contigs obtained were submitted to BlastN searches against the GenBank nucleotide collection (downloaded, January 10, 2018) with local BLAST+ v. 2.3.0 (Camacho et al. 2009), setting the E-value cutoff at 10−3. Ribosomal DNA sequences were identified and BlastN aligned to the PacBio subreads. The subread -s1p0/78570/021991RQ = 0.891 was selected for the best homology and used as reference for the Illumina reads alignment using the SeqMan NGen software (Lasergene v. 15.0.1; DNASTAR Inc., Madison, WI) to improve the sequence accuracy. Here, we obtained a 28,000-bp scaffold made up of three rDNA repeat unit sequences. Benchmarking Universal Single-Copy Orthologs (BUSCO v. 3.0.2) (Simão et al. 2015; http://gitlab.com/ezlab/busco; last accessed July 30, 2019) was used with the default parameters to evaluate the completeness of the de novo genome assembly based on the 290 BUSCO groups in the fungi_odb9 lineage data set (http://busco.ezlab.org/; last accessed July 30, 2019) and Botrytiscinerea as the reference species for gene finding with Augustus v. 3.3.2 (http://bioinf.uni-greifswald.de/augustus/; last accessed July 30, 2019).

Repeats Annotation

Simple repeats and low complexity elements were searched using Repeat Masking (http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker; last accessed July 30, 2019) against the Repbase-derived RepeatMasker library of repetitive elements, with the default parameters. Telomere sequences were identified by searching both ends of the scaffolds for high copy number repeats using BlastN searches and manual inspection.

Protein-Coding Gene Prediction and Functional Annotation

Gene prediction was performed with Augustus implemented in the Blast2GO PRO package (v.5.2.5) using B. cinerea as the model species and the RNA-Seq reads as a guide. Blast2GO PRO was used to obtain gene functional classification based on Gene Ontology (GO) terms in the “biological process,” “molecular function,” and “cellular component” categories.

Results and Discussion

Library Construction and Sequencing

A total of 11,014 Mb of Illumina reads (109,054,254 total reads with 90.16% of bases with quality score >30) were generated with an estimated 250× average depth of sequencing coverage (http://identifiers.org/ncbi/insdc.sra:SRR8599895). A total of 1,057-Mb PacBio long reads (polymerase read N50 of 19,107, quality of 0.858; average subread length of 9.8 kb and subread N50 length of 14.34 kb) were generated with an estimated 24× average depth of sequencing coverage (http://identifiers.org/ncbi/insdc.sra:SRR8608008). A complete draft genome without gaps of 44.05 Mb was obtained with a scaffold N50 of 2,592.8 kb (table 1). It was assembled in 16 large scaffolds (from 4.24 to 1.89 Mb; table 2), three very small scaffolds (from 0.36 to 0.26 Mb), and one scaffold made up of three repeat unit sequences of ribosomal DNA. Both telomeric sequences were detected in half of the large scaffolds, with only one telomere in the remaining half (table 2). The number of large scaffolds reconstructed in the M. fructicola Mfrc123 strain corresponded to the 16 large chromosomes of the closely related species B.cinerea and Sclerotinia sclerotiorum (Amselem et al. 2011; Derbyshire et al. 2017). This is in accordance with the short phylogenetic distances existing among these species within the Sclerotiniaceae family (Amselem et al. 2011).
Table 1

Comparison among the De Novo Genome Assembly Results of Monilinia fructicola Strains Mfrc123 (This Study), BR-32 (GCA_002909715.1), and LMK125 (GCA_002162545.1)

Feature Assembly a
Mfrc123BR-32LMK125
Genome size in scaffolds (bp)44,047,90042,820,47844,684,193
Total ungapped length (bp)44,047,90042,800,19343,575,363
GC content (%)40.7941.7240.12
Number of scaffolds202,349846
Maximum scaffold length (bp)4,238,823414,5173,180,511
Minimum scaffold length (bp)28,364426200
Mean scaffold size (bp)2,202,39518,22952,818
Scaffold N502,592,82366,420992,265
Scaffold N901,968,92710,383304,023
Scaffold L50718815
Number of genes12,118NANA
Number of CDS13,749NANA
Proteins with no BLAST hits1,679NANA
Proteins with BLAST hits2,078NANA
Proteins with mapping655NANA
Proteins with GO annotation8,456NANA
Total proteins with significant hits (%)11,397 (82.9)NANA

NA, not available.

Table 2

Statistics for Scaffolds Obtained from the Monilinia fructicola Mfrc123 Strain Genome Assembly

NameLength (bp)GC (%)Number of Telomeric Repeats
Number of Repeated Elements (%)
Number of CDSNumber of Genes
Start (3′CCCTAA5′)End (5′TTAGGG3′)Simple RepeatsLow Complexity Regions
Scaffold_0014,238,82342.36NAa153,213 (4.10)285 (0.42)1,4671,259
Scaffold_0023,728,27242.0623142,773 (4.02)249 (0.42)1,2751,109
Scaffold_0033,609,85541.5819232,836 (4.06)260 (0.40)1,2251,077
Scaffold_0043,332,83941.5018192,461 (3.79)210 (0.35)1,151999
Scaffold_0052,656,65342.11NA172,063 (4.25)197 (0.41)866765
Scaffold_0062,602,48140.60NA201,823 (3.90)163 (0.35)765682
Scaffold_0072,592,82341.3017181,742 (3.68)150 (0.32)809723
Scaffold_0082,576,76240.99NA242,047 (4.46)185 (0.38)782686
Scaffold_0092,530,79641.3418NA2,160 (4.48)224 (0.49)734666
Scaffold_0102,442,59740.8319242,192 (4.73)193 (0.45)774675
Scaffold_0112,335,00941.74NA181,754 (3.94)162 (0.41)753650
Scaffold_0122,285,15340.662191,783 (4.30)174 (0.45)699623
Scaffold_0132,201,68441.88NA131,651 (3.87)0.38 (156)631566
Scaffold_0142,106,46040.8715NA1,794 (4.50)167 (0.44)662591
Scaffold_0151,968,92739.7118181,560 (4.43)141 (0.45)583522
Scaffold_0161,888,07339.2117141,461 (4.29)136 (0.45)553506
Scaffold_017355,62115.891615167 (3.45)34 (0.54)32
Scaffold_018311,59315.56NA19133 (3.18)19 (0.33)66
Scaffold_019255,11519.02NA19135 (4.21)23 (0.46)1111
Scaffold_020 (rDNA)28,36442.77NANA3 (0.45)3 (0.66)NANA
Total44,047,90040.793,131 (0.41)33,751 (4.13)13,74912,118

NA, not available.

Comparison among the De Novo Genome Assembly Results of Monilinia fructicola Strains Mfrc123 (This Study), BR-32 (GCA_002909715.1), and LMK125 (GCA_002162545.1) NA, not available. Statistics for Scaffolds Obtained from the Monilinia fructicola Mfrc123 Strain Genome Assembly NA, not available. Data obtained from genome completeness analysis by BUSCO showed that 88% of the predicted genes on the assembled genomic sequences were complete and single copy, and only a few were fragmented (10%) or missing (2%) BUSCO orthologs. RNA-Seq reads from the same M. fructicola Mfrc123 strain (De Miccolis Angelini et al. 2018) were mapped on the draft assembled genome, and about 82% of them mapped in pairs, whereas 10% of them mapped in broken pairs on the final genome draft version. The genome showed 33,751 elements (4.13% of the genome size) that were identified as single repeats, whereas 3,131 elements (0.41% of the genome size) were recognized as low complexity regions (table 2). Our assembly captured a total of 29 stretches of telomeric sequences (5′-TTAGGG-3′) at both ends of 10 scaffolds and at a single end of 9 scaffolds, with repeat numbers ranging from 9 to 23 (table 2), suggesting a good integrity of the assembled genome. A total of 12,118 genes coding for 13,749 predicted proteins were identified and functionally annotated (tables 1 and 2). In detail, 11,397 proteins (82.9%) had significant BlastP hits, and 8,456 proteins (61.5%) had GO annotation (table 1). According to generic terms distribution at level 3 in Blast2GO, 14 biological processes were identified. The most represented terms were “organic substance metabolic process” (16%), “cellular metabolic process” (15%), “primary metabolic process” (15%), and “nitrogen compound metabolic process” (13%). Among the 11 GO terms identified for molecular functions, the majority were constituted by the following GO terms: “organic cyclic compound binding” (15%), “heterocyclic compound binding” (15%), “ion binding” (14%), “hydrolase activity” (11%), and “transferase activity” (9%). For “cellular component,” a total of eight GO terms were identified, and among these the most represented were “intracellular” (20%), “intracellular part” (20%), “intracellular organelle” (16%), “membrane-bound organelle” (14%), and “intrinsic component of membrane” (12%) (supplementary fig. S2, Supplementary Material online).

Conclusion

The hybrid and hierarchical assembly strategy, using both Illumina and PacBio sequencing technologies, was successful to produce a new genomic draft of M. fructicola Mfrc123 strain that greatly improved those of other strains of the same fungus available in GenBank (BR-32 strain, GCA_002909715.1; and LMK125 strain, GCA_002162545.1) in terms of contiguity (scaffold N50 values), gap-free sequences, and read mappability, also providing a structural and functional annotation of the genome. The genome assembly has size and number of genes comparable to other annotated genomes of fungi belonging to the Sclerotiniaceae family available in public databases, such as B.cinerea strains B05.10 (GCA_000143535.4; van Kan et al. 2017), T4 (GCA_000292645.1; Staats and van Kan 2012) and BcDW1 (GCA_000349525.1; Blanco-Ulate et al. 2013), M. fructigena strains Mfrg269 (GCA_003260565.1; Landi et al. 2018), Sclerotinia borealis strain F-4128 (GCA_000503235.1; Mardanov et al. 2014), and S. sclerotiorum strain 1980 UF-70 (GCA_001857865.1; Derbyshire et al. 2017). Moreover, the number of large scaffolds reconstructed in the M. fructicola Mfrc123 strain corresponds to the 16 large chromosomes of the closely related species B. cinerea and S. sclerotiorum (Amselem et al. 2011; Derbyshire et al. 2017). The new genomic resource will allow to gain deeper knowledge on the genome biology of M. fructicola and provides valuable information for further evolutionary studies within the Monilinia genus and the Sclerotiniaceae family, which contain phytopathogenic fungi of great economical relevance worldwide.

Data Availability

The final genome assembly (GBK) including the annotated genes, mRNAs, and CDSs, RepeatMasker output (GFF) with the repetitive elements coordinates and masked genomic sequences (FASTA), as well as protein sequences (FASTA) and their functional annotations (XLSX) are uploaded into Figshare (https://doi.org/10.6084/m9.figshare.9423230).

Supplementary Material

Supplementary data are available at Genome Biology and Evolution online. Click here for additional data file.
  24 in total

Review 1.  Evolution and genome architecture in fungal plant pathogens.

Authors:  Mareike Möller; Eva H Stukenbrock
Journal:  Nat Rev Microbiol       Date:  2017-08-07       Impact factor: 60.633

Review 2.  Brown Rot Strikes Prunus Fruit: An Ancient Fight Almost Always Lost.

Authors:  Leandro Oliveira Lino; Igor Pacheco; Vincent Mercier; Franco Faoro; Daniele Bassi; Isabelle Bornard; Bénédicte Quilot-Turion
Journal:  J Agric Food Chem       Date:  2016-05-10       Impact factor: 5.279

3.  Characterization of Monilinia spp. Populations on Stone Fruit in South Italy.

Authors:  D Abate; C Pastore; D Gerin; R M De Miccolis Angelini; C Rotolo; S Pollastro; F Faretra
Journal:  Plant Dis       Date:  2018-07-02       Impact factor: 4.438

4.  Draft Genome Resources for the Phytopathogenic Fungi Monilinia fructicola, M. fructigena, M. polystroma, and M. laxa, the Causal Agents of Brown Rot.

Authors:  Yazmín Rivera; Kurt Zeller; Subodh Srivastava; Jeremy Sutherland; Marco Galvez; Mark Nakhla; Anna Poniatowska; Guido Schnabel; George Sundin; Z Gloria Abad
Journal:  Phytopathology       Date:  2018-08-09       Impact factor: 4.025

5.  Exploiting sparseness in de novo genome assembly.

Authors:  Chengxi Ye; Zhanshan Sam Ma; Charles H Cannon; Mihai Pop; Douglas W Yu
Journal:  BMC Bioinformatics       Date:  2012-04-19       Impact factor: 3.169

6.  Draft Genome Sequence of Sclerotinia borealis, a Psychrophilic Plant Pathogenic Fungus.

Authors:  Andrey V Mardanov; Alexey V Beletsky; Vitaly V Kadnikov; Alexander N Ignatov; Nikolai V Ravin
Journal:  Genome Announc       Date:  2014-01-23

7.  Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads.

Authors:  Chengxi Ye; Zhanshan Sam Ma
Journal:  PeerJ       Date:  2016-06-08       Impact factor: 2.984

8.  The complete genome sequence of the phytopathogenic fungus Sclerotinia sclerotiorum reveals insights into the genome architecture of broad host range pathogens.

Authors:  Mark Derbyshire; Matthew Denton-Giles; Dwayne Hegedus; Shirin Seifbarghy; Jeffrey Rollins; Jan van Kan; Michael F Seidl; Luigi Faino; Malick Mbengue; Olivier Navaud; Sylvain Raffaele; Kim Hammond-Kosack; Stephanie Heard; Richard Oliver
Journal:  Genome Biol Evol       Date:  2017-02-15       Impact factor: 3.416

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  Genome sequence of the brown rot fungal pathogen Monilinia fructigena.

Authors:  Lucia Landi; Rita M De Miccolis Angelini; Stefania Pollastro; Domenico Abate; Francesco Faretra; Gianfranco Romanazzi
Journal:  BMC Res Notes       Date:  2018-10-23
View more
  3 in total

Review 1.  Recent Advances in Molecular Diagnostics of Fungal Plant Pathogens: A Mini Review.

Authors:  Ganeshamoorthy Hariharan; Kandeeparoopan Prasannath
Journal:  Front Cell Infect Microbiol       Date:  2021-01-11       Impact factor: 5.293

2.  Deciphering the Monilinia fructicola Genome to Discover Effector Genes Possibly Involved in Virulence.

Authors:  Laura Vilanova; Claudio A Valero-Jiménez; Jan A L van Kan
Journal:  Genes (Basel)       Date:  2021-04-14       Impact factor: 4.096

3.  Tracking of Diversity and Evolution in the Brown Rot Fungi Monilinia fructicola, Monilinia fructigena, and Monilinia laxa.

Authors:  Rita Milvia De Miccolis Angelini; Lucia Landi; Celeste Raguseo; Stefania Pollastro; Francesco Faretra; Gianfranco Romanazzi
Journal:  Front Microbiol       Date:  2022-03-09       Impact factor: 5.640

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.