| Literature DB >> 31042716 |
Ievgeniia A Tiukova1,2, Mats E Pettersson3, Marc P Hoeppner3,4,5, Remi-Andre Olsen6, Max Käller7,8, Jens Nielsen1, Jacques Dainat3,4, Henrik Lantz3,4, Jonas Söderberg9, Volkmar Passoth2.
Abstract
Here, we present the genome of the industrial ethanol production strain Brettanomyces bruxellensis CBS 11270. The nuclear genome was found to be diploid, containing four chromosomes with sizes of ranging from 2.2 to 4.0 Mbp. A 75 Kbp mitochondrial genome was also identified. Comparing the homologous chromosomes, we detected that 0.32% of nucleotides were polymorphic, i.e. formed single nucleotide polymorphisms (SNPs), 40.6% of them were found in coding regions (i.e. 0.13% of all nucleotides formed SNPs and were in coding regions). In addition, 8,538 indels were found. The total number of protein coding genes was 4897, of them, 4,284 were annotated on chromosomes; and the mitochondrial genome contained 18 protein coding genes. Additionally, 595 genes, which were annotated, were on contigs not associated with chromosomes. A number of genes was duplicated, most of them as tandem repeats, including a six-gene cluster located on chromosome 3. There were also examples of interchromosomal gene duplications, including a duplication of a six-gene cluster, which was found on both chromosomes 1 and 4. Gene copy number analysis suggested loss of heterozygosity for 372 genes. This may reflect adaptation to relatively harsh but constant conditions of continuous fermentation. Analysis of gene topology showed that most of these losses occurred in clusters of more than one gene, the largest cluster comprising 33 genes. Comparative analysis against the wine isolate CBS 2499 revealed 88,534 SNPs and 8,133 indels. Moreover, when the scaffolds of the CBS 2499 genome assembly were aligned against the chromosomes of CBS 11270, many of them aligned completely, some have chunks aligned to different chromosomes, and some were in fact rearranged. Our findings indicate a highly dynamic genome within the species B. bruxellensis and a tendency towards reduction of gene number in long-term continuous cultivation.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31042716 PMCID: PMC6493715 DOI: 10.1371/journal.pone.0215077
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Feature response curves (FRC) computed for all features (A) and low coverage features (B) (adapted from [ FRC are shown for HGAP, allpaths, dekkera_V1 (final assembly presented in [32]) and dekkera_V2 (assembly presented in this work). The decreased amount of features when removing the duplicated fragment of chromosome 1 in dekkera_V1 assembly is mostly attributed to the loss of regions below normal read coverage (B). Such regions are often indicative of incorrect repeat expansions made by the assembly program [32].
Annotation details of the B. bruxellensis CBS 11270 nuclear genome.
| Annotation feature | Counts |
|---|---|
| Genes | 4879 |
| mRNAS | 4881 |
| Exons | 6342 |
| Introns | 1461 |
| Mean introns per mRNA | 0.30 |
| Mean intron length | 217 bp |
| Mean CDS length | 1358 bp |
| Mean mRNA length | 1423 bp |
Fig 2Location of certain genes (A) and duplicated genes (B) on the chromosomes of CBS 11270.
Statistics of variant analysis in heterozygous sites in genome of CBS 11270 and between genomes of CBS 11270 and CBS 2499.
| Variant type | Variant counts in heterozygous sites in genome of CBS 11270 | Variant counts between genomes of CBS 11270 and CBS 2499 |
|---|---|---|
| SNP counts | 49890 | 88534 |
| Indel counts | 8538 | 8133 |
| Total variant counts | 58230 | 96421 |
Counts of different types of nucleotide transversions and transitions in heterozygous sites in the genome of CBS 11270 and between the genomes of CBS 11270 and CBS 2499.
| SNP type | Counts in the genome of CBS 11270 | Counts in the genome of CBS 2499 |
|---|---|---|
| A/C | 3459 | 5934 |
| A/G | 17729 | 32159 |
| T/G | 3547 | 5845 |
| T/A | 4351 | 7602 |
| C/T | 17919 | 31975 |
| C/G | 2890 | 5036 |
Fig 3Number and location of variants on chromosomes of B. bruxellensis CBS 11270.
Y-axes represent the number of variants per 10,000 bp. Black bars show occurrence of variants. Red color denotes chromosome margins.
Survey of the genes with the highest numbers of SNPs in CBS 11270 and CBS 2499.
| Gene ID and position in CBS 11270 | Name in CBS 2499 | Variants in CBS 11270 | Variants in CBS 2499 |
|---|---|---|---|
| General negative regulator of transcription subunit 1; BRETBRUG00000001221; | jgi|Dekbr2|64613|fgenesh1_pm.2_#_424 | 75 | 102 |
| AP-1 accessory protein | jgi|Dekbr2|5806|gm1.2215_g | 64 | 108 |
| DNA repair protein | jgi|Dekbr2|23614|fgenesh1_kg.1_#_362_#_Locus3870v1rpkm14.01 | 75 | 84 |
| Protein | jgi|Dekbr2|172416|CE84544_34760 | 63 | 93 |
| General negative regulator of transcription subunit 2; BRETBRUG00000002734; | jgi|Dekbr2|64613|fgenesh1_pm.2_#_424 | 11 | 102 |
Fig 4Distribution of indels of different size in heterozygous sites in the genome of CBS 11270 (A) and between genomes of CBS 11270 and CBS 2499 (B).
Duplicated genes revealed by CNV analysis and not identified in the genome assembly.
| Gene name | Gene ID and location |
|---|---|
| F0249 protein VIBHAR | BRETBRUG00000000385 |
| chr1 1018607 1018993 | |
| Probable NADPH dehydrogenase | BRETBRUG00000002123 |
| chr2 2039150 2040358 | |
| Chromatin modification-related protein | BRETBRUG00000002201 |
| chr2 2253321 2256215 | |
| Copia protein | BRETBRUG00000002416 |
| chr2 2815514 2816992 | |
| Copia protein | BRETBRUG00000002417 |
| chr2 2817465 2818151 | |
| Repressor of filamentous growth | BRETBRUG00000003666 |
| chr3 3312856 3314625 |
List of duplicated genes resolved by genome assembly.
| Gene name | Gene ID and location | |||
|---|---|---|---|---|
| A | tRNA (guanine(10)-N2)-methyltransferase | BRETBRUG00000001038 | BRETBRUG00000003060 | |
| chr1 2703203 2705172 | chr3 1805200 1807169 | |||
| B | Methylthioribulose-1-phosphate dehydratase | BRETBRUG00000003998 | BRETBRUG00000004000 | |
| chr4 1127888 1128565 | chr4 1129334 1130011 | |||
| C | Hexokinase-1 | BRETBRUG00000001423 | BRETBRUG00000004091 | |
| chr1 3856343 3857854 | chr4 1381931 1383442 | |||
| Flocculation protein | BRETBRUG00000001424 | BRETBRUG00000004090 | ||
| chr1 3864230 3868254 | chr4 1371498 1375532 | |||
| Temperature shock-inducible protein 1 | BRETBRUG00000001425 | BRETBRUG00000004089 | ||
| chr1 3870223 3870528 | chr4 1369224 1369529 | |||
| Putative transcriptional regulatory protein | BRETBRUG00000001426 | BRETBRUG00000004088 | ||
| chr1 3874728 3876680 | chr4 1363072 1365024 | |||
| Uncharacterized transcriptional regulatory protein | BRETBRUG00000001427 | BRETBRUG00000004087 | ||
| chr1 3877690 3879264 | chr4 1360488 1362062 | |||
| Allantoin permease | BRETBRUG00000001428 | BRETBRUG00000004086 | ||
| chr1 3880591 3882306 | chr4 1357446 1359161 | |||
| D | Indoleamine 2 2C3-dioxygenase | BRETBRUG00000003493 | BRETBRUG00000003500 | BRETBRUG00000003492 |
| chr3 2804460 2805389 | chr3 2823355 2824602 | chr3 2801423 2802706 | ||
| Histone acetyltransferase | BRETBRUG00000003494 | BRETBRUG00000003501 | ||
| chr3 2807345 2808712 | chr3 2826194 2827561 | |||
| Serine/threonine-protein phosphatase 2A activator | BRETBRUG00000003495 | BRETBRUG00000003502 | ||
| chr3 2808829 2809905 | chr3 2827678 2828226 | |||
| GABA-specific permease | BRETBRUG00000003496 | BRETBRUG00000003503 | ||
| chr3 2810290 2811957 | chr3 2829138 2830805 | |||
| V-type proton ATPase subunit a 2C vacuolar isoform | BRETBRUG00000003497 | BRETBRUG00000003504 | ||
| chr3 2813515 2816040 | chr3 2832363 2834888 | |||
| Protein | BRETBRUG00000003498 | BRETBRUG00000003505 | ||
| chr3 2816967 2818574 | chr3 2835815 2837422 | |||
Letter in first column indicates guide-mark in Fig 2B.
Characterization of repeats in flanking regions of duplicated genes.
| Gene id and location | Gene name | Downstream repeat | Upstream repeat |
|---|---|---|---|
| BRETBRUG00000003060 | tRNA (guanine(10)-N2)-methyltransferase | (AAGATAG)n Simple_repeat 1807186 1807233 | (T)n Simple_repeat 1805044 1805068 |
| chr3 1805200 1807169 | (CTC)n Simple_repeat 1807517 1807558 | ||
| (AGTAA)n Simple_repeat 1808962 1809011 | |||
| BRETBRUG00000003500 | Indoleamine 2 2C3-dioxygenase | A-rich Low_complexity 2826505 2826555 | |
| chr3 2823355 2824602 | |||
| BRETBRUG00000003494 | Histone acetyltransferase | A-rich Low_complexity 2807656 2807706 | |
| chr3 2807345 2808712 | |||
| BRETBRUG00000003495 | Serine/threonine-protein phosphatase 2A activator | (T)n Simple_repeat 2810049 2810080 | A-rich Low_complexity 2807656 2807706 |
| chr3 2808829 2809905 | |||
| BRETBRUG00000003496 | GABA-specific permease | AmnL2-1 LINE/L2 2812187 2812238 | (T)n Simple_repeat 2810049 2810080 |
| chr3 2810290 2811957 | |||
| BRETBRUG00000003497 | V-type proton ATPase subunit a 2C vacuolar isoform | AmnL2-1 LINE/L2 2812187 2812238 | |
| chr3 2813515 2816040 | |||
| BRETBRUG00000003501 | Histone acetyltransferase | (T)n Simple_repeat 2828899 2828928 | |
| chr3 2826194 2827561 | |||
| BRETBRUG00000003502 | Serine/threonine-protein phosphatase 2A activator | (T)n Simple_repeat 2828899 2828928 | A-rich Low_complexity 2826505 2826555 |
| chr3 2827678 2828226 | |||
| BRETBRUG00000003503 | GABA-specific permease | AmnL2-1 LINE/L2 2831034 2831085 | (T)n Simple_repeat 2828899 2828928 |
| chr3 2829138 2830805 | |||
| BRETBRUG00000003504 | V-type proton ATPase subunit a%2C vacuolar isoform | AmnL2-1 LINE/L2 2831034 2831085 | |
| chr3 2832363 2834888 |
Genes in the B. bruxellensis CBS 2499 genome absent in the CBS 11270 genome.
| Gene name | Best blast hit |
|---|---|
| jgi|Dekbr2|26744|fgenesh1_kg.19_#_10_#_Locus2611v2rpkm45.30 | Na+/H+ antiporter involved in sodium and potassium efflux through the plasma membrane [ |
| jgi|Dekbr2|8850|gm1.5259_g | maltase [ |
| jgi|Dekbr2|8855|gm1.5264_g | putative transmembrane sensor transporter [ |
| jgi|Dekbr2|145681|CE57809_24 | s-formylglutathione hydrolase [ |