| Literature DB >> 31167502 |
Xiaohuan Sun1,2, Cene Gostinčar3,4, Chao Fang5,6, Janja Zajc7,8, Yong Hou9,10, Zewei Song11,12, Nina Gunde-Cimerman13.
Abstract
One of the most commonly encountered species in the small basidiomycetous sub-phylum Wallemiomycotina is Wallemia mellicola, a xerotolerant fungus with a widespread distribution. To investigate the population characteristics of the species, whole genomes of twenty-five strains were sequenced. Apart from identification of four strains of clonal origin, the distances between the genomes failed to reflect either the isolation habitat of the strains or their geographical origin. Strains from different parts of the world appeared to represent a relatively homogenous and widespread population. The lack of concordance between individual gene phylogenies and the decay of linkage disequilibrium indicated that W. mellicola is at least occasionally recombining. Two versions of a putative mating-type locus have been found in all sequenced genomes, each present in approximately half of the strains. W. mellicola thus appears to be capable of (sexual) recombination and shows no signs of allopatric speciation or specialization to specific habitats.Entities:
Keywords: allergenic fungus; basidiomycete; halotolerance; population genomics; recombination; xerotolerance
Mesh:
Year: 2019 PMID: 31167502 PMCID: PMC6628117 DOI: 10.3390/genes10060427
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Strains sequenced in this study.
| Culture Collection Strain Number * | Number in This Study | Isolation Habitat | Sampling Site Location |
|---|---|---|---|
| EXF-277 | 1 | hypersaline saltern water | Spain |
| EXF-757 | 2 | hypersaline saltern water | Dominican Republic |
| EXF-1274 | 3 | peanuts | Indonesia |
| EXF-1277 | 4 | Indonesia | |
| EXF-1279 | 5 | straw hat | Philippines |
| EXF-5677 | 6 | air | Slovenia |
| EXF-5829 | 7 | chocolate | Slovenia |
| EXF-6156 | 8 | moldy white bread | United Kingdom |
| EXF-6157 | 9 | soil | Canada |
| EXF-6158 | 10 | maple syrup | Canada |
| EXF-8738 | 11 | house dust | Uruguay |
| EXF-8740 | 12 | house dust | Micronesia |
| EXF-8747 | 13 | house dust | Indonesia |
| EXF-8749 | 14 | house dust | Thailand |
| EXF-8757 | 15 | house dust | Mexico |
| EXF-10633 | 16 | dry common fig | Slovenia |
| EXF-483 | 17 | hypersaline saltern water | Spain |
| EXF-1262 | 18 | Unknown | Italy |
| EXF-1443 | 19 | Unknown | Denmark |
| EXF-5828 | 20 | chocolate | Slovenia |
| EXF-5830 | 21 | chocolate | Slovenia |
| EXF-5922 | 22 | chocolate | Slovenia |
| EXF-6152 (MUCL 45613) | 23 | forest plant | Cuba |
| EXF-6151 (MUCL 45615) | 24 | forest plant | Cuba |
| EXF-8741 | 25 | house dust | Micronesia |
* EXF strain numbers (Ex Culture Collection of the Department of Biology, Biotechnical Faculty, University of Ljubljana, Slovenia); other culture collection numbers are in parentheses (CBS—CBS-KNAW culture collection, Netherlands; UAMH—UAMH Centre For Global Microfungal Biodiversity, Canada; IBT—IBT culture collection, DTU, Denmark; MUCL—BCCM/MUCL Agro-food & Environmental Fungal Collection, Belgium).
Statistics for the sequenced Wallemia mellicola genomes.
| Statistic * | Minimum ** | Mean ** | Maximum ** | Standard Deviation** |
|---|---|---|---|---|
| Coverage | 194 | 318 | 558 | 92 |
| Genome assembly size (Mbp) | 9.68 | 9.75 | 9.95 | 0.05 |
| Number of contigs | 202 | 239 | 422 | 43 |
| Contig N50 | 115375 | 144560 | 170540 | 13888 |
| GC content (%) | 39.91% | 39.95% | 39.97% | 0.01% |
| CDS total length (Mbp) | 6.44 | 6.50 | 6.53 | 0.02 |
| CDS total length (% of genome) | 66.45% | 66.64% | 67.08% | 0.32% |
| Gene models (n) | 4317 | 4475 | 4509 | 37 |
| CDS average length (bp) | 1438 | 1453 | 1512 | 13 |
| Exons per gene (average) | 3.98 | 4.02 | 4.17 | 0.04 |
| Intron average length (bp) | 63 | 64 | 66 | 0.57 |
| Complete BUSCOs | 87.40% | 88.18% | 89.80% | 0.50% |
| Complete and single-copy BUSCOs | 85.90% | 87.91% | 88.60% | 0.56% |
| Complete and duplicated BUSCOs | 0.10% | 0.27% | 3.90% | 0.76% |
| Fragmented BUSCOs | 5.50% | 5.94% | 6.50% | 0.31% |
| Missing BUSCOs | 4.60% | 5.89% | 6.50% | 0.38% |
| SNP density | 0.41% | 0.52% | 0.60% | 0.04% |
* Complete data for each genome is available in Supplementary Table S1; ** Calculated from 25 here sequenced W. mellicola genomes; CDS, Coding Sequence; BUSCOs, Benchmarking Universal Single-Copy Orthologues; SNP, Single Nucleotide Polymorphism.
Figure 1Clustering of the Wallemia mellicola genomes. Principal component analysis of single nucleotide polymorphisms (SNP) data estimated by comparing 25 sequenced genomes to the reference genome. The genomes are represented by circles, the color of which corresponds to the habitat (left) or sampling location (right) of the sequenced strains. The first two axes explain 23.7% (x axis) and 8.80% (y axis) of variation.
Figure 2Phylogeny of W. mellicola strains. (A) Overlay of 40 core Benchmarking Universal Single-Copy Orthologue (BUSCO) gene trees estimated by PhyML 3.1 using the Hasegawa-Kishino-Yano 85 nucleotide substitution model and estimating the alpha parameter of the gamma distribution of the substitution rate categories and the proportion of invariable sites. (B) Majority rule consensus tree of 40 core gene trees described above. (C) Phylogenetic network reconstructed with the Neighbor-Net algorithm based on the dissimilarity distance matrix calculated from the SNP data.
Figure 3Linkage disequilibrium (LD) decay in Wallemia mellicola estimated by calculation of the squared correlation coefficient (r2) between pairs of biallelic loci. r2 is plotted against the physical distance of the loci in the genome. Horizontal lines mark the maximum observed value and half of the maximum observed value. Vertical lines mark the interval of the physical distance delimited by the first point of the curve under half of the maximum r2 value (left vertical line), the last point above half of the maximum r2 value (right vertical line) and the point where the fitted curve intersects with half of the maximum r2 value (middle vertical line).
Figure 4Putative mating-type loci in different strains of Wallemia mellicola. (A) Annotated mating-type loci and their flanking regions with putative gene functions assigned according to Padamsee et al. [8]. The chromosomal inversion is marked with a dashed green rectangle. Genome numbers are marked on the left. The blue or red outline of the rectangles representing the genes shows the gene orientation: left to right (blue) or right to left (red). (B) Alignment of the contigs containing the putative mating-type loci from the genomes 1 (x axis) and 5 (y axis) at different magnifications.