Literature DB >> 31406907

Data on the genome analysis of the wild edible mushroom, Russula griseocarnosa.

Fei Yu1, Junfeng Liang1.   

Abstract

In the present article, we report data on the whole genome sequence of a wild edible and medicinal ectomycorrhizal fungus Russula griseocarnosa. The R. griseocarnosa genome consists of 64.81 Mb with a GC-pair content of 49.41%. The genome assembly consists of 471 scaffolds and 16128 coding protein genes. The coding protein genes was annotated in different databases (GO, KEGG and CAZys), respectively. The whole genome sequence and functional annotation provide important information for ectomycorrhizal fungus, which can be used as a basis for cultivation and breeding of R. griseocarnosa. The Whole Genome project of Russula griseocarnosa has been deposited at DDBJ/ENA/GenBank under the accession RMVF00000000. The version described is RMVF01000000. To further interpretation of the data provided in this article, please refer to the research article 'Whole genome sequencing and genome annotation of the wild edible mushroom, Russula griseocarnosa' [1].

Entities:  

Keywords:  Ectomycorrhizal fungus; Genome annotation; Russula griseocarnosa; Whole genome

Year:  2019        PMID: 31406907      PMCID: PMC6685690          DOI: 10.1016/j.dib.2019.104295

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table The first genome under the genus Russula to be reported. The data provide valuable information of the potential function and gene expression mechanisms about ectomycorrhizal fungus Russula griseocarnosa. The CAZymes of Russula griseocarnosa confirms the adaptation to symbiosis, and reveals the strategy for host interaction.

Data

Russula griseocarnosa (Fig. 1) is a wild edible and medicinal ectomycorrhizal fungus that is native to southern China. The resulting draft genome of R. griseocarnosa present the 64.81 Mb in size with a G+C content of 49.41%. The genome sequence was assembly with 471 scaffolds and 16128 coding protein genes [1]. The data illustrated in Fig. 2 show the Gene Ontology (GO) distribution of the protein coding genes and Fig. 3 gives a complete overview of the KEGG pathway. According comparative analysis, The GO annotations of Russula griseocarnosa genes were similar with Agaricus bisporus [2] in “Localization”, “Biological regulation”, and “Regulation of biological process”, and fewer numbers than that of Laccaria bicolor [1], [3]. Compared with KEGG metabolic annotations, the most genes of Russula griseocarnosa pathways was not significantly in Laccaria bicolor and Agaricus bisporus, but R. griseocarnosa had less genes in "Tryptophan metabolism" and "Starch and sucrose metabolism" pathways [1].
Fig. 1

Fruiting bodies of Russula griseocarnosa.

Fig. 2

The Gene Ontology (GO) function annotation of Russula griseocarnosa.

Fig. 3

The KEGG function annotation of Russula griseocarnosa.

Fruiting bodies of Russula griseocarnosa. The Gene Ontology (GO) function annotation of Russula griseocarnosa. The KEGG function annotation of Russula griseocarnosa. The CAZymes coding genes of R. griseocarnosa encode enzymes involved in the degradation of plant cell wall polysaccharides, non-plant polysaccharides (for example, animal and bacterial polysaccharides) and fungal cell wall (Fig. 4). The CAZymes coding genes of R. griseocarnosa was similar to the symbiotic fungal species Scleroderma citrinum [4] in non-plant polysaccharides degradation and fungal cell wall degradation, and higer number of plant cell wall polysaccharides degradation. The plant cell wall polysaccharides degradation associated with cellulose degrading enzymes (GH6, GH7, GH44 and GH45), hemicellulose-degrading enzymes (GH10, GH11 and GH115) and pectin-degrading enzymes (GH43, GH51, GH78, GH93, PL1, PL3, and PL4) were absent in Russula griseocarnosa, Laccaria bicolor, and Scleroderma citrinum genomes [1].
Fig. 4

Comparison of CAZys associated with cell wall degradation.

Comparison of CAZys associated with cell wall degradation.

Experimental design, materials and methods

Fungal material

Fruiting bodies of R. griseocarnosa were collected from Linjing Town, Teng County, Guangxi Province, China in 2017. The fruiting body samples was frozen in liquid nitrogen and stored at −80 °C freezer until DNA extract.

DNA extraction and sequencing

Genomic DNA was extracted using the Omega Fungal DNA Kit D3390-02. Quality of DNA was determined using TBS-380 fluorometer (Turner BioSystems Inc., Sunnyvale, CA). The concentration of at least 20 mg/L (OD260/280 = 1.8–2.0). R. griseocarnosa genome was sequenced using Illumina HiSeq X-ten sequencing and PacBio RS sequencing at Shanghai Majorbio Bio-pharm Biotechnology Co., Ltd, China. Paired-end libraries with 300 bp inserts were constructed in Illumina HiSeq X-ten sequencing. 8-10k insert shotgun libraries were generated in Pacific Biosciences RS sequencing.

Genome assembly and annotation

The genome sequence was assembled as follows: (1) PacBio long reads were corrected and assembled by Canu (v1.7) [5]; (2) Illumina reads corrected and used for scaffolding by SOAPdenovo (v2.04). Fill the gaps using GapCloser (v1.12) package; and (3) PacBio reads were modified based on Illumina reads. The final assembly produced a circular genome sequence without gaps. Protein coding sequences were predicted using the automated pipeline MAKER2 (v2.31.9) [6]. It combining data for mRNAs, proteins, the ab initio predictions of SNAP [7] and GeneMark-ES (v2.3a) [8]. The predicted protein coding sequences was annotated in Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database using Blastp (v2.3.0). The Carbohydrate-active enzymes (CAZymes) were performed using blastp (cut off e-value≤1e−5) at http://www.cazy.org/ [9].

Specifications Table

Subject areaBiology
More specific subject areaMicrobiology, Genomics
Type of dataTable, figures
How data was acquiredPacBio RS and Illumina Hiseq X-Ten sequencing
Data formatAnnotated and comparative analyzed
Experimental factorsThe fruiting body samples were obtained and quickly frozen in liquid nitrogen before stored in a −80 °C freezer. Total DNA of fruiting body was extracted immediately.
Experimental featuresDNA Sequencing was performed by using PacBio RS and Illumina Hiseq X-Ten, genome assembly, annotation and analysis were carried out.
Data source locationThe fruiting bodies of Russula griseocarnosa were collected from Linjing Town, Teng County, Guangxi Province, China (2 Jun. 2017) (23.15 N, 110.66 E)
Data accessibilityThe whole genome sequence of Russula griseocarnosa has been deposited at DDBJ/ENA/GenBank under the accession RMVF00000000. The version described is RMVF01000000. The BioSample, BioProject and SRA accession number are SAMN09602224, PRJNA479704 and SRP153002, respectively.
Related research articleF. Yu, J. Song, J.F. Liang, S.K. Wang, J.K. Lu, Whole genome sequencing and genome annotation of the wild edible mushroom, Russula griseocarnosa. Genomics. (2019) in press [1]https://doi:10.1016/j.ygeno.2019.04.012.
Value of the data

The first genome under the genus Russula to be reported.

The data provide valuable information of the potential function and gene expression mechanisms about ectomycorrhizal fungus Russula griseocarnosa.

The CAZymes of Russula griseocarnosa confirms the adaptation to symbiosis, and reveals the strategy for host interaction.

  9 in total

1.  Convergent losses of decay mechanisms and rapid turnover of symbiosis genes in mycorrhizal mutualists.

Authors:  Annegret Kohler; Alan Kuo; Laszlo G Nagy; Emmanuelle Morin; Kerrie W Barry; Francois Buscot; Björn Canbäck; Cindy Choi; Nicolas Cichocki; Alicia Clum; Jan Colpaert; Alex Copeland; Mauricio D Costa; Jeanne Doré; Dimitrios Floudas; Gilles Gay; Mariangela Girlanda; Bernard Henrissat; Sylvie Herrmann; Jaqueline Hess; Nils Högberg; Tomas Johansson; Hassine-Radhouane Khouja; Kurt LaButti; Urs Lahrmann; Anthony Levasseur; Erika A Lindquist; Anna Lipzen; Roland Marmeisse; Elena Martino; Claude Murat; Chew Y Ngan; Uwe Nehls; Jonathan M Plett; Anne Pringle; Robin A Ohm; Silvia Perotto; Martina Peter; Robert Riley; Francois Rineau; Joske Ruytinx; Asaf Salamov; Firoz Shah; Hui Sun; Mika Tarkka; Andrew Tritt; Claire Veneault-Fourrey; Alga Zuccaro; Anders Tunlid; Igor V Grigoriev; David S Hibbett; Francis Martin
Journal:  Nat Genet       Date:  2015-02-23       Impact factor: 38.330

2.  Eukaryotic gene prediction using GeneMark.hmm-E and GeneMark-ES.

Authors:  Mark Borodovsky; Alex Lomsadze
Journal:  Curr Protoc Bioinformatics       Date:  2011-09

3.  Whole genome sequencing and genome annotation of the wild edible mushroom, Russula griseocarnosa.

Authors:  Fei Yu; Jie Song; Junfeng Liang; Shengkun Wang; Junkun Lu
Journal:  Genomics       Date:  2019-04-18       Impact factor: 5.736

4.  A genetic linkage map for the ectomycorrhizal fungus Laccaria bicolor and its alignment to the whole-genome sequence assemblies.

Authors:  J Labbé; X Zhang; T Yin; J Schmutz; J Grimwood; F Martin; G A Tuskan; F Le Tacon
Journal:  New Phytol       Date:  2008-09-08       Impact factor: 10.151

5.  Genome sequence of the button mushroom Agaricus bisporus reveals mechanisms governing adaptation to a humic-rich ecological niche.

Authors:  Emmanuelle Morin; Annegret Kohler; Adam R Baker; Marie Foulongne-Oriol; Vincent Lombard; Laszlo G Nagy; Robin A Ohm; Aleksandrina Patyshakuliyeva; Annick Brun; Andrea L Aerts; Andrew M Bailey; Christophe Billette; Pedro M Coutinho; Greg Deakin; Harshavardhan Doddapaneni; Dimitrios Floudas; Jane Grimwood; Kristiina Hildén; Ursula Kües; Kurt M Labutti; Alla Lapidus; Erika A Lindquist; Susan M Lucas; Claude Murat; Robert W Riley; Asaf A Salamov; Jeremy Schmutz; Venkataramanan Subramanian; Han A B Wösten; Jianping Xu; Daniel C Eastwood; Gary D Foster; Anton S M Sonnenberg; Dan Cullen; Ronald P de Vries; Taina Lundell; David S Hibbett; Bernard Henrissat; Kerry S Burton; Richard W Kerrigan; Michael P Challen; Igor V Grigoriev; Francis Martin
Journal:  Proc Natl Acad Sci U S A       Date:  2012-10-08       Impact factor: 11.205

6.  MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects.

Authors:  Carson Holt; Mark Yandell
Journal:  BMC Bioinformatics       Date:  2011-12-22       Impact factor: 3.307

7.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

8.  The Carbohydrate-Active EnZymes database (CAZy): an expert resource for Glycogenomics.

Authors:  Brandi L Cantarel; Pedro M Coutinho; Corinne Rancurel; Thomas Bernard; Vincent Lombard; Bernard Henrissat
Journal:  Nucleic Acids Res       Date:  2008-10-05       Impact factor: 16.971

9.  Gene finding in novel genomes.

Authors:  Ian Korf
Journal:  BMC Bioinformatics       Date:  2004-05-14       Impact factor: 3.169

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.