Draft genome assembly of Colletotrichum musae, the pathogen of banana fruit.

Wilson José da Silva Junior, Raul Maia Falcão, Lucas Christian de Sousa-Paula, Nicolau Sbaraini, Willie Anderson Dos Santos Vieira, Waléria Guerreiro Lima, Sérgio de Sá Leitão Paiva Junior, Charley Christian Staats, Augusto Schrank, Ana Maria Benko-Iseppon, Valdir de Queiroz Balbino, Marcos Paz Saraiva Câmara

Abstract

Colletotrichum musae is an important cosmopolitan pathogenic fungus that causes anthracnose in banana fruit. The entire genome of C. musae isolate GM20 (CMM 4420), originally isolated from infected banana fruit from Alagoas State, Brazil, was sequenced and annotated. The pathogen's genomic DNA was sequenced on the Illumina HiSeq platform. The C. musae GM20 genome has 50,635,197 bp with a G + C content of 53.74%, and its present assembly comprises 2763 scaffolds harboring 13,451 putative genes with an average length of 1626 bp. Gene prediction and annotation were performed with the Funannotate pipeline, using a gene identification model trained on BUSCO.

Year:  2018        PMID: 29387740      PMCID: PMC5790810          DOI: 10.1016/j.dib.2018.01.002

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Value of the Data

Colletotrichum musae is the causal agent of anthracnose in banana fruits, the main post-harvest disease worldwide.
This is the first genome sequence of Colletotrichum musae obtained by next-generation sequencing that is available in a public database.
The genome data published herein will facilitate studies of the biology, pathogenicity, evolution and host-pathogen interaction of Colletotrichum musae through comparative genomic studies of Colletotrichum spp. and related species.

Data

Fungal infection of plants is the most frequent cause of extensive losses in agriculture. The fact that many endophytic fungi can cause infection adds further complexity to the study of fungal plant pathogens. Banana (Musa sp.) is one of the world's important food crops and a staple food for more than 400 million people [1]. Over 100 million tons are produced worldwide on some 5 million hectares, and the cultivated area is expected to increase in the future [2]. However, banana fruits are highly susceptible to pathogens, and anthracnose caused by fungi of the genus Colletotrichum is among the most frequent diseases. Colletotrichum comprises over 100 species that are able to infect and damage diverse crops around the world [3]. Due to their ubiquity, substantial destructive capacity and scientific importance as model pathosystems, Colletotrichum spp. are among the top 10 most important plant pathogens according to the international community of plant pathology researchers [4]. Colletotrichum musae (Berk. and M.A. Curtis), the causative agent of anthracnose, is a major post-harvest pathogen of banana fruits and causes severe global crop losses [5]. The disease develops from a latent fungal infection during pre-harvest, originating from spores present on immature fruits in the field. Symptoms, such as brown to black patches on the peel and sunken lesions, appear as the fruits ripen. Furthermore, under high humidity, the formation of salmon-colored acervuli can be observed [6]. The infection thus reduces fruit viability during maturation, transport and storage [7], leading to commercial depreciation and a shorter shelf life. To limit post-harvest losses, chemical fungicides are usually adopted, but alternative methods (e.g., radiation treatment, hot water treatment, refrigeration, induced resistance and biological control agents) have also been applied [8].
However, the use of chemical fungicides has been limited by potentially harmful effects on human health and the environment. Moreover, fungal pathogens are known to quickly develop resistance to chemical pesticides [9]. Furthermore, the absence of available genomic sequences from C. musae is one of the main limitations to better characterization of fungal virulence determinants and to the development of improved management strategies. Here we report, for the first time, the whole genome sequence of C. musae strain GM20 (CMM 4420), isolated from infected banana fruit in Alagoas, a state in northeastern Brazil. In recent years, several phytopathogenic fungal genomes have been published, boosting the discovery of virulence determinants in these species. We expect that our analysis will encourage further studies of C. musae biology, which should provide better details about host-pathogen interaction and lead to new management measures.

Experimental design, materials, and methods

DNA extraction and genome sequencing

The GM20 isolate of C. musae was cultured, and DNA was extracted as previously described [10]. The whole-genome shotgun sequence of C. musae GM20 was generated using the Illumina HiSeq 2500 platform (Illumina, San Diego, CA) at the Center for Functional Genomics, Universidade de São Paulo (Piracicaba, Brazil). The libraries were prepared with the Illumina Nextera XT DNA Library Prep Kit (Illumina, San Diego, CA), and sequencing was performed on a HiSeq Flow Cell v4 with the HiSeq SBS Kit v4 (Illumina, San Diego, CA), yielding 2 × 100 bp paired-end reads.

De novo assembly and gene annotation

The shotgun sequencing produced 13,273,851 paired reads. Initially, FastQC [11] was applied to analyse read quality, and adapters were trimmed using FASTX-Toolkit 0.0.13 (http://hannonlab.cshl.edu/fastx_toolkit). Three assemblers were tested: ABySS 2.0.2 [12], SPAdes 1.10 [13] and Velvet 1.1 [14], with SPAdes showing the best results (12,435 contigs >500 bp). Redundans [15] was subsequently run to assemble the contigs into scaffolds. Assembly statistics were generated by QUAST 3.9 (Table 1) [16]. Gene prediction and annotation were carried out with the Funannotate pipeline [17], which used BUSCO 2.0 [18] [parameters: Sordariomycetes database (Verticillium longisporum selected as the closely related species)] to generate the training files for two gene predictors: GeneMark-ES [19] and AUGUSTUS [20]. Moreover, BUSCO 2.0 was employed to evaluate genome completeness, based on the conservation of benchmarking universal single-copy orthologs (BUSCOs).
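
Two of the assembly statistics reported in this section, the scaffold N50 and the G+C content, follow simple definitions that can be illustrated with a minimal Python sketch. This is not QUAST's implementation; the function names and the toy scaffolds are hypothetical, and excluding ambiguous bases (N) from the G+C denominator is an assumption of this sketch.

```python
def n50(lengths):
    """Return the N50: the length L such that sequences of length >= L
    together cover at least half of the total assembly length."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("no sequence lengths given")


def gc_percent(sequences):
    """Return the G+C content (%) counted over A, C, G and T only.

    Assumption of this sketch: ambiguous bases such as N are ignored.
    """
    gc = at = 0
    for seq in sequences:
        s = seq.upper()
        gc += s.count("G") + s.count("C")
        at += s.count("A") + s.count("T")
    return 100.0 * gc / (gc + at)


# Hypothetical toy scaffolds, not data from the C. musae assembly.
toy_scaffolds = ["ATGCGC" * 10, "GGCCAT" * 5, "ATAT" * 5, "GCN" * 4]
toy_n50 = n50([len(s) for s in toy_scaffolds])
toy_gc = gc_percent(toy_scaffolds)
print(toy_n50, round(toy_gc, 2))
```

On a real assembly the same functions would be applied to the scaffold sequences read from the FASTA file.
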
Table 1

Genome assembly statistics for Colletotrichum musae GM20.

Assembly size: 50.7 Mb
Sequencing coverage: 100×
Sequencing technology: Illumina HiSeq 2500
Number of scaffolds: 2763
Scaffold N50 length: 32,818 bp
Number of contigs: 10,618
Number of predicted genes: 13,451
Overall GC content: 53.74%
Public access to genome: NWMS01000000

The final assembly of the C. musae GM20 genome was determined to be 50,635,197 bp with a G+C content of 53.74% in 2763 scaffolds (maximum 208,119 bp; N50 32,818 bp), and 13,451 genes were predicted. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession number NWMS00000000. The version described in this paper is version NWMS01000000. BUSCO analysis showed a high degree of completeness, with a BUSCO score of 96.3%: of the 1315 BUSCO groups searched, 1263 were complete single-copy BUSCOs, four were complete duplicated BUSCOs, 23 were fragmented BUSCOs, and 25 were missing.
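
The reported BUSCO score can be checked against the category counts with a short Python sketch. Reading the 1263 complete BUSCOs as single-copy is an assumption here; it is suggested by the fact that the four categories then sum to the 1315 groups searched. The variable names are illustrative.

```python
# Counts as reported in the text above.
complete_single = 1263      # assumed to be single-copy complete BUSCOs
complete_duplicated = 4
fragmented = 23
missing = 25
total_groups = 1315

# Under the single-copy reading, the categories account for every group.
assert complete_single + complete_duplicated + fragmented + missing == total_groups

# Completeness counts single-copy and duplicated complete BUSCOs together.
complete = complete_single + complete_duplicated
completeness = 100.0 * complete / total_groups

print(
    f"C:{completeness:.1f}% [S:{complete_single}, D:{complete_duplicated}], "
    f"F:{fragmented}, M:{missing}, n:{total_groups}"
)
```

With these counts, 1267 of 1315 groups are complete, which rounds to the 96.3% given in the text.
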
Specifications Table

Subject area: Biology
More specific subject area: Microbiology, Agriculture, Genomics
Type of data: Genome sequence data
How data was acquired: Illumina HiSeq 2500 next-generation sequencing platform
Data format: Assembled genome sequence
Experimental factors: Genomic DNA was extracted from mycelium grown in culture medium.
Experimental features: The genome of Colletotrichum musae strain GM20 was sequenced and assembled.
Data source location: Colletotrichum musae strain GM20 was isolated from banana lesions in Maceió, Alagoas, Brazil.
Data accessibility: The Colletotrichum musae GM20 genome is available in DDBJ/ENA/GenBank under the accession number NWMS01000000 (https://www.ncbi.nlm.nih.gov/nuccore/NWMS00000000).

References

1.  GeneMarkS: a self-training method for prediction of gene starts in microbial genomes. Implications for finding sequence motifs in regulatory regions.

Authors:  J Besemer; A Lomsadze; M Borodovsky
Journal:  Nucleic Acids Res       Date:  2001-06-15

2.  AUGUSTUS: a web server for gene finding in eukaryotes.

Authors:  Mario Stanke; Rasmus Steinkamp; Stephan Waack; Burkhard Morgenstern
Journal:  Nucleic Acids Res       Date:  2004-07-01

3.  ABySS: a parallel assembler for short read sequence data.

Authors:  Jared T Simpson; Kim Wong; Shaun D Jackman; Jacqueline E Schein; Steven J M Jones; Inanç Birol
Journal:  Genome Res       Date:  2009-02-27

4.  Computational Prediction of Effector Proteins in Fungi: Opportunities and Challenges.

Authors:  Humira Sonah; Rupesh K Deshmukh; Richard R Bélanger
Journal:  Front Plant Sci       Date:  2016-02-12

5.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.

Authors:  Felipe A Simão; Robert M Waterhouse; Panagiotis Ioannidis; Evgenia V Kriventseva; Evgeny M Zdobnov
Journal:  Bioinformatics       Date:  2015-06-09

6.  QUAST: quality assessment tool for genome assemblies.

Authors:  Alexey Gurevich; Vladislav Saveliev; Nikolay Vyahhi; Glenn Tesler
Journal:  Bioinformatics       Date:  2013-02-19

7.  Phenalenone-type phytoalexins mediate resistance of banana plants (Musa spp.) to the burrowing nematode Radopholus similis.

Authors:  Dirk Hölscher; Suganthagunthalam Dhakshinamoorthy; Theodore Alexandrov; Michael Becker; Tom Bretschneider; Andreas Buerkert; Anna C Crecelius; Dirk De Waele; Annemie Elsen; David G Heckel; Heike Heklau; Christian Hertweck; Marco Kai; Katrin Knop; Christoph Krafft; Ravi K Maddula; Christian Matthäus; Jürgen Popp; Bernd Schneider; Ulrich S Schubert; Richard A Sikora; Aleš Svatoš; Rony L Swennen
Journal:  Proc Natl Acad Sci U S A       Date:  2013-12-09

8.  The Top 10 fungal pathogens in molecular plant pathology.

Authors:  Ralph Dean; Jan A L Van Kan; Zacharias A Pretorius; Kim E Hammond-Kosack; Antonio Di Pietro; Pietro D Spanu; Jason J Rudd; Marty Dickman; Regine Kahmann; Jeff Ellis; Gary D Foster
Journal:  Mol Plant Pathol       Date:  2012-05

9.  Colletotrichum - current status and future directions.

Authors:  P F Cannon; U Damm; P R Johnston; B S Weir
Journal:  Stud Mycol       Date:  2012-09-15

10.  Redundans: an assembly pipeline for highly heterozygous genomes.

Authors:  Leszek P Pryszcz; Toni Gabaldón
Journal:  Nucleic Acids Res       Date:  2016-04-29
