| Literature DB >> 28983066 |
Téo Fournier1, Jean-Sébastien Gounot1, Kelle Freel1, Corinne Cruaud2, Arnaud Lemainque2, Jean-Marc Aury2, Patrick Wincker2, Joseph Schacherer3, Anne Friedrich3.
Abstract
Genetic variation in natural populations represents the raw material for phenotypic diversity. Species-wide characterization of genetic variants is crucial to have a deeper insight into the genotype-phenotype relationship. With the advent of new sequencing strategies and more recently the release of long-read sequencing platforms, it is now possible to explore the genetic diversity of any nonmodel organisms, representing a fundamental resource for biological research. In the frame of population genomic surveys, a first step is to obtain the complete sequence and high-quality assembly of a reference genome. Here, we sequenced and assembled a reference genome of the nonconventional Dekkera bruxellensis yeast. While this species is a major cause of wine spoilage, it paradoxically contributes to the specific flavor profile of some Belgium beers. In addition, an extreme karyotype variability is observed across natural isolates, highlighting that D. bruxellensis genome is very dynamic. The whole genome of the D. bruxellensis UMY321 isolate was sequenced using a combination of Nanopore long-read and Illumina short-read sequencing data. We generated the most complete and contiguous de novo assembly of D. bruxellensis to date and obtained a first glimpse into the genomic variability within this species by comparing the sequences of several isolates. This genome sequence is therefore of high value for population genomic surveys and represents a reference to study genome dynamic in this yeast species.Entities:
Keywords: Dekkera bruxellensis; MinION sequencing; genetic diversity; genome report; population genomics; reference genome; yeast
Mesh:
Year: 2017 PMID: 28983066 PMCID: PMC5633375 DOI: 10.1534/g3.117.300128
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Description of the D. bruxellensis isolates used in this study
| Strain | Ploidy | Ecological Origin | Geographical Origin | Reference |
|---|---|---|---|---|
| AWRI1499 | 3n | Wine | Australia | |
| AWRI1608 | 3n | Wine | Australia | |
| AWRI1613 | 2n | Wine | Australia | |
| CBS11270 | 2n | Industrial ethanol | Sweden | |
| CBS2499 | 2n | Wine | France | |
| ST05_12_22 | 2n | Lambic beer | Belgium | |
| UMY315 | 2n | Must | Italy | This study |
| UMY321 | 2n | Red wine | Italy | This study |
| 133 | 2n | Merlot wine | South Africa | This study |
Figure 1Metrics related to the constructed assemblies, per assembler and dataset.
Metrics associated with the D. bruxellensis publicly available assemblies
| Strain | No. of Scaffolds | Assembly Size (Mb) | Maximum Scaffold Size | N50 | N90 | No. of Undetermined Residues |
|---|---|---|---|---|---|---|
| AWRI1499 ( | 324 | 12.7 | 170,307 | 65,420 | 22,583 | 57 |
| CBS11270 ( | 15 | 17.3 | 4,993,495 | 3,706,654 | 944,992 | 2,497,785 |
| CBS2499 ( | 84 | 13.4 | 2,877,306 | 1,792,735 | 190,560 | 586,105 |
| ST05_12_22 ( | 85 | 13.1 | 1,439,423 | 732,210 | 177,142 | 218,317 |
| UMY321 (this study) | 8 | 13 | 3,829,289 | 1,917,156 | 1,329,398 | 2708 |
Figure 2Comparison of the CBS11270 and UMY321 assemblies. The alignments and the plot were generated with the MUMmer software suite. Red lines: sequences aligning in the same direction. Blue lines: sequences aligning in the opposite direction.
Figure 3Mapping of the Illumina reads vs. the UMY321 reference assembly. (A) Illumina reads coverage along the reference genome. (B) Frequency of the reference allele at heterozygous sites along the genome. (Each color corresponds to a scaffold).
Proportion of D. bruxellensis unmapped Illumina reads on the available assemblies
| Assemblies | ||||||
|---|---|---|---|---|---|---|
| UMY321 | CBS11270 | CBS2499 | ST05.12/22 | AWRI1499 | ||
| Illumina paired-end reads | UMY321 | 1.11 | 9.95 | 4.13 | 2.12 | 5.29 |
| CBS11270 | 4.74 | 12.43 | 5.88 | 3.48 | 9.49 | |
| CBS2499 | 1.68 | 9.4 | 4.92 | 2.45 | 5.68 | |
| ST05.12/22 | 1.9 | 10.83 | 7 | 3.78 | 11.97 | |
| UMY315 | 0.66 | 10.00 | 4.04 | 2.02 | 5.35 | |
| 133 | 0.82 | 8.82 | 3.11 | 1.57 | 4.44 | |
| AWRI1608 | 14.87 | 22.65 | 17.42 | 15.39 | 19.91 | |
| AWRI1613 | 9.69 | 16.89 | 11.04 | 8.98 | 13.38 | |
Figure 4Illumina unmapped reads per assembly. Boxplot of the percentage of unmapped Illumina reads, according to the assembly used for the mapping.