Literature DB >> 35704633

High sorbic acid resistance of Penicillium roqueforti is mediated by the SORBUS gene cluster.

Maarten Punt1,2, Sjoerd J Seekles1,3, Jisca L van Dam3, Connor de Adelhart Toorop2, Raithel R Martina2, Jos Houbraken1,4, Arthur F J Ram1,3, Han A B Wösten1,2, Robin A Ohm1,2.   

Abstract

Penicillium roqueforti is a major food-spoilage fungus known for its high resistance to the food preservative sorbic acid. Here, we demonstrate that the minimum inhibitory concentration of undissociated sorbic acid (MICu) ranges between 4.2 and 21.2 mM when 34 P. roqueforti strains were grown on malt extract broth. A genome-wide association study revealed that the six most resistant strains contained the 180 kbp gene cluster SORBUS, which was absent in the other 28 strains. In addition, a SNP analysis revealed five genes outside the SORBUS cluster that may be linked to sorbic acid resistance. A partial SORBUS knock-out (>100 of 180 kbp) in a resistant strain reduced sorbic acid resistance to similar levels as observed in the sensitive strains. Whole genome transcriptome analysis revealed a small set of genes present in both resistant and sensitive P. roqueforti strains that were differentially expressed in the presence of the weak acid. These genes could explain why P. roqueforti is more resistant to sorbic acid when compared to other fungi, even in the absence of the SORBUS cluster. Together, the MICu of 21.2 mM makes P. roqueforti among the most sorbic acid-resistant fungi, if not the most resistant fungus, which is mediated by the SORBUS gene cluster.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35704633      PMCID: PMC9200314          DOI: 10.1371/journal.pgen.1010086

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   6.020


Introduction

Fungi are responsible for 5–10% of all food spoilage [1] resulting in the production of off-flavors, discoloration and acidification [1,2]. Some species also produce mycotoxins, such as Penicillin Roquefort Toxin and roquefortine C in the case of Penicillium roqueforti [3]. Toxins partially contribute to the high incidence of food-borne diseases affecting up to 30% of the people in industrialized countries every year [4]. Food spoilage by filamentous fungi is believed to be mainly caused by spores that are spread through the air, water or other vectors like insects [5]. Preservation techniques such as pasteurization, fermentation, cooling or the addition of preservatives are used to reduce spoilage [6]. Some of the most applied preservatives are weak organic acids such as benzoic, propionic and sorbic acid. Paecilomyces variotii, Penicillium paneum, Penicillium carneum and P. roqueforti are among the few filamentous fungi capable of spoiling products containing weak acids, and are therefore called preservative-resistant moulds [7]. Weak-acid preservatives inhibit microbial growth, but their mode-of-action is not completely understood. According to the classical ‘weak-acid preservative theory’ the antimicrobial activity of weak acids is derived from their undissociated form that can pass the plasma membrane. These weak acids dissociate in the cytosol due to its neutral pH, and inhibit growth through acidification of the cytoplasm [8]. The inhibitory activity of sorbic acid at pH 6.5 and the correlation of sorbic acid resistance with ethanol tolerance in Saccharomyces cerevisiae suggest that this weak acid can also act as a membrane-active compound [8]. In Aspergillus niger, sorbic acid resistance is mediated by the phenylacrylic acid decarboxylase gene padA, and the putative 4-hydroxybenzoate decarboxylase gene, known as the cinnamic acid decarboxylase (cdcA) gene [9,10]. These genes encode proteins that catalyze the conversion of sorbic acid into 1,3-pentadiene. Genes padA and cdcA are regulated by the sorbic acid decarboxylase regulator (SdrA), which is a Zn2Cys6-finger transcription factor [10]. These three genes are present on the same genetic locus in A. niger with sdrA being flanked by cdcA and padA. Orthologs of cdcA and padA are also clustered in S. cerevisiae, but this is not the case for the sdrA homologue [10,11]. Inactivation of cdcA, padA or sdrA results in reduced growth of A. niger on sorbic acid and cinnamic acid [9]. The fact that growth is not abolished and the fact that padA transcript levels are less affected than that of cdcA on sorbic acid upon deletion of sdrA suggests that an additional regulator is involved [9]. Indeed, the weak-acid regulator WarA is also involved in sorbic acid resistance as well as other weak acids such as propionic and benzoic acid [12]. Weak-acid resistance in fungi is subject to intra- and inter-strain heterogeneity. Intra-strain heterogeneity has been observed in Zygosaccharomyces bailii with the existence of a small sub-population of cells that are more resistant (MIC = 7.6 mM) to sorbic acid than the sensitive population (MIC = 3 mM) [13], while inter-strain resistance has been observed in P. roqueforti with the existence of sorbic acid-resistant and sorbic acid-sensitive strains [14]. Genome analysis of P. roqueforti strains have yielded clues about adaptive divergence in this species. Genomic islands with high identity are present in distant Penicillium species, while they are absent in closely related species, supporting the hypothesis of recent horizontal gene transfer events [15,16]. For instance, the presence of two large genomic regions, Wallaby and CheesyTer correlates with faster growth and functions relevant in a cheese matrix, respectively [16,17]. Noteworthy, the CheesyTer and Wallaby regions are only present in cheese isolates other than Roquefort, indicating that the Roquefort isolate population is potentially more closely related to the ‘ancestral’ P. roqueforti populations [18]. In this study, sorbic acid resistance of 34 P. roqueforti strains was assessed. A genome-wide association analysis revealed the presence of the 180 kbp gene cluster SORBUS in the six most resistant strains. A partial SORBUS knockout in such a strain showed a reduced sorbic acid resistance similar to that of the other 28 sensitive P. roqueforti strains.

Results

Weak-acid sensitivity screening

Weak-acid sensitivity of 34 P. roqueforti wild-type strains was assessed on MEA plates supplemented with 5 mM propionic, sorbic or benzoic acid, which corresponds to 4.42, 4.25 and 3.07 mM undissociated acid, respectively. Three strains had been isolated from blue-veined cheeses such as Roquefort and the other 31 strains had been isolated from non-cheese environments (mostly related to spoiled food) (S1 Table). The colony surface area was determined after five days of growth (Fig 1). The inhibitory effect of propionic acid at the tested concentration was limited for most strains, since 26 out of 34 strains grew to > 80% of the colony surface area reached under control conditions. Strains DTO012A1 and DTO012A8 even showed an increased colony size (up to 120%) when compared to the control. The inhibitory effects of sorbic and benzoic acid were more pronounced. MEA supplemented with potassium sorbate reduced colony area for all 34 P. roqueforti strains. The surface area under sorbic acid stress ranged from 0 to 80% of the surface area reached under control conditions. Strains DTO006G1, DTO006G7, DTO013E5 and DTO013F2 showed the highest sorbic acid resistance, followed by DTO046C5. Benzoic acid was the most inhibitory compound, resulting in a maximum colony surface area between 0 and 20% of the control. The most benzoic acid-resistant strains were DTO013F2 and DTO013E5. As these strains were also among the most sorbic acid-resistant strains, similar resistance mechanisms may be involved to cope with benzoic and sorbic acid stress.
Fig 1

Weak-acid screening shows variabe resistances to multiple acids.

Average colony size (cm2) of 34 P. roqueforti strains after five days of growth on MEA (pH 4.0) (grey) or MEA (pH 4.0) supplemented with 5 mM propionic acid (orange), sorbic acid (blue) or benzoic acid (green). Strains DTO006G1, DTO006G7, DTO013E5 and DTO013F2 (indicated in bold) are relatively resistant to sorbic acid. Error bars indicate standard deviation of biologically independent replicates.

Weak-acid screening shows variabe resistances to multiple acids.

Average colony size (cm2) of 34 P. roqueforti strains after five days of growth on MEA (pH 4.0) (grey) or MEA (pH 4.0) supplemented with 5 mM propionic acid (orange), sorbic acid (blue) or benzoic acid (green). Strains DTO006G1, DTO006G7, DTO013E5 and DTO013F2 (indicated in bold) are relatively resistant to sorbic acid. Error bars indicate standard deviation of biologically independent replicates. Sorbic acid resistance was further analyzed by determining the MICu values of sorbic acid for the 34 P. roqueforti strains. The four strains with the highest sorbic acid resistance on MEA (Fig 1), were also among the strains (DTO006G1, DTO006G7, DTO012A1, DTO012A8, DTO013E5 and DTO013F2) that showed the highest MICu (Fig 2). Although strains DTO012A1 and DTO012A8 did not exhibit increased colony growth on agar (Fig 1), they were among the resistant strains. This might be explained by the limited growth period (5 days) or their overall relative low growth rate compared to the other resistant strains (See controls Fig 1; DTO006G1, DTO006G7, DTO013E5 and DTO013F2). DTO013E5 and DTO013F2 were the most resistant and even showed growth at the highest tested undissociated sorbate concentration of 21.2 mM, indicating a MICu > 21.2 mM. The other strains showed a distinctly lower MICu, ranging between 4.2 mM and 9.95 mM. Only strain DTO012A9 showed an intermediate resistance to sorbic acid with an average MICu of 13.72 mM undissociated sorbic acid.
Fig 2

Sorbic acid resistance screening reveals 6 resistant P. roqueforti strains.

Average undissociated MICu values of sorbic acid of 34 P. roqueforti strains (mM ± standard deviation). Each bar graph represents biological triplicates. Error bars indicate standard deviation and letters indicate significant difference in MICu (p < 0.05).

Sorbic acid resistance screening reveals 6 resistant P. roqueforti strains.

Average undissociated MICu values of sorbic acid of 34 P. roqueforti strains (mM ± standard deviation). Each bar graph represents biological triplicates. Error bars indicate standard deviation and letters indicate significant difference in MICu (p < 0.05).

Genome statistics and phylogeny

The genomes of the 34 P. roqueforti strains were sequenced. Scaffold count varied between 45 and 1358, assembly length between 26.53 and 31.74 Mb and GC content between 46.85 and 48.44% (Table 1). The number of predicted genes varied between 9633 and 10644, the number of genes with PFAM domains between 73.11 and 75.38%, and the number of secondary metabolism gene clusters between 32 and 36. All strains had a BUSCO completeness of >99%, indicating high quality assembly and gene predictions, except for DTO012A8 with a completeness of 94.83%. This and the high scaffold count of the DTO012A8 assembly (1358 scaffolds) indicates that its genome assembly is not complete. The strain was kept in the downstream analysis as most other metrics did not differ much compared to the other strains (Table 1). Fig 3 presents a phylogenetic tree of the 34 P. roqueforti strains based on 6923 single-copy orthologous genes. Three of the six sorbic acid-resistant strains (DTO012A2, DTO013F2, DTO013E5 and DTO012A8) are similar with DTO012A8 being closely related to those strains. The two other resistant strains are also similar and more distantly related.
Table 1

Genome assembly and annotation statistics.

The number of scaffolds, assembly length, GC content and genes of the 34 sequenced P. roqueforti strains are listed. In addition, the number of genes with a PFAM domain, the number of secondary metabolism gene clusters and the BUSCO completeness are listed. The type column indicates if the strain is sorbic acid-resistant (R) or sorbic acid-sensitive (S).

StrainAssemblyAnnotationAnnotation qualityType
ScaffoldsTotal assembly length (Mbp)Assembly GC content (%)GenesGenes with PFAM (total, %)Secondary metabolism gene clusterBUSCO2 completeness (%)R/S
DTO002I613429.4847.75101357539 (74.39%)3399.66%S
DTO003C335330.2847.82103627612 (73.46%)3699.66%S
DTO003H138230.2947.82103497607 (73.5%)3699.66%S
DTO006G110927.9748.1899757468 (74.87%)3399.66%R
DTO006G710027.9748.1899827471 (74.84%)3399.66%R
DTO012A115829.3748.11102807557 (73.51%)33100%R
DTO012A27327.0748.2697867362 (75.23%)3399.66%S
DTO012A69327.2948.3298177376 (75.13%)3399.66%S
DTO012A76227.3348.0797487335 (75.25%)3499.66%S
DTO012A8135829.647.88102537483 (72.98%)3294.83%R
DTO012A958129.8447.89103417613 (73.62%)3499.31%S
DTO013E516529.2748.11102527537 (73.52%)33100%R
DTO013F265731.7447.49106447739 (72.71%)33100%R
DTO013F530429.5648.04103137591 (73.61%)3599.66%S
DTO027I617729.348.12103257560 (73.22%)3299.66%S
DTO032C614128.4548.13100087441 (74.35%)33100%S
DTO039G34527.2248.1497647352 (75.3%)3299.66%S
DTO046C514328.848.15100987488 (74.15%)3399.66%S
DTO070G26026.7548.3997247324 (75.32%)33100%S
DTO081F94727.2448.1397607349 (75.3%)3299.66%S
DTO101D614128.9648.07101097497 (74.16%)3399.31%S
DTO102I918628.5347.9299787439 (74.55%)3299.66%S
DTO126G217428.0648.0299397431 (74.77%)3399.31%S
DTO127F75926.5448.4396397315 (75.89%)3599.66%S
DTO127F95426.5348.4496337311 (75.9%)3699.66%S
DTO130C153730.4847.79103757612 (73.37%)3699.66%S
DTO163C32393048.16104857653 (72.99%)3399.66%S
DTO163F528430.946.85100107441 (74.34%)33100%S
DTO163G410428.4548.12100067441 (74.37%)33100%S
DTO265D56626.7648.4497517350 (75.38%)33100%S
DTO369A115628.2947.9799457437 (74.78%)3299.66%S
DTO375B16227.248.298047378 (75.25%)3399.31%S
DTO377G210128.0748.1499627444 (74.72%)3399.31%S
DTO377G38426.9748.2497627335 (75.14%)3399.66%S
Average217.3228.5548.0510038.65
Min4526.5346.859633
Max135831.7448.4410644
Fig 3

Phylogeny of the P. roqueforti strains.

Phylogenetic tree of the 34 P. roqueforti strains used in this study. Sorbic acid resistance (MICu) is indicated in blue-yellow shading. The tree is based on 6923 single-copy orthologous genes and was constructed using RAxML. P. rubens [44] was used as outgroup. Bootstrap values <100 are indicated. Strains containing the SORBUS cluster are highlighted in blue.

Phylogeny of the P. roqueforti strains.

Phylogenetic tree of the 34 P. roqueforti strains used in this study. Sorbic acid resistance (MICu) is indicated in blue-yellow shading. The tree is based on 6923 single-copy orthologous genes and was constructed using RAxML. P. rubens [44] was used as outgroup. Bootstrap values <100 are indicated. Strains containing the SORBUS cluster are highlighted in blue.

Genome assembly and annotation statistics.

The number of scaffolds, assembly length, GC content and genes of the 34 sequenced P. roqueforti strains are listed. In addition, the number of genes with a PFAM domain, the number of secondary metabolism gene clusters and the BUSCO completeness are listed. The type column indicates if the strain is sorbic acid-resistant (R) or sorbic acid-sensitive (S).

Sorbic acid resistance correlates with a genomic cluster containing genes regulating sorbic acid decarboxylation

Two complementary genome-wide association study (GWAS) approaches were taken to identify the genetic elements (e.g., genomic regions, genes, SNPs, etc) associated with sorbic acid resistance. The first approach is based on the presence of large genomic regions that are only found in the sorbic acid resistant strains, and the second approach is based on SNPs that are significantly correlated with the level of sorbic acid resistance. In the first GWAS approach, the 34 P. roqueforti strains were divided into two groups based on their sorbic acid resistance, a group of six resistant (R-type) strains (‘a’, Fig 2) and a group of 28 sensitive (S-type) strains (‘b-d’, Fig 2). The assemblies of all strains were aligned to the assembly of DTO006G7 using MUMmer, and subsequently genomic regions (and genes encoded on those regions) were identified that were unique to the R-type strains. With this method 57 genes were identified, of which 51 were present on scaffold 43 of DTO006G7 (Table 2), and the six genes outside of scaffold 43 are g6241, g8103, g9940, g9945 and g9946. In addition to the 51 unique genes in this scaffold, it contains 19 genes which are also found completely or in part in some of the S-type strains. The genomic alignment shows the genes on scaffold 43, which is present in the R-type strains (Fig 4). The first 80 kbp of scaffold 43 (protein IDs g12000-g12029) mainly contains hypothetical proteins without predicted function, while the remaining region between 80–180 kbp (g12030–g12069) contains multiple regions homologous to genes previously reported as related to weak-acid resistance in A. niger. Predicted genes orthologous (based on bidirectional best BLAST hit) to padA, cdcA and sdrA of A. niger were found alongside each other (g12064-g12066) with respective identities of 87%, 83% and 53%. Moreover, two padA paralogs with high BLAST similarities to padA of A. niger (63% and 58%) were identified on the R-type-specific cluster and named padB (g12032) and padC (g12057). Similarly, two paralogs of cdcA, named cdcB (g12056) and cdcC (g12040), were identified on the same gene cluster as well, with identities of 72% and 71%, respectively, when compared to cdcA of A. niger. An additional cdcA paralog, cdcD (g2591), is not located on this cluster and is also present in S-type strains. In contrast, no homologs of A. niger sdrA and padA were found outside of the cluster. In addition, a transcription factor (g2820 in DTO006G7) orthologous (based on a bidirectional best BLAST hit) to warA of A. niger was identified outside of the cluster in the genomes of all strains. These results indicate that the R-type strains contain a gene cluster similar to the sorbic acid resistance gene cluster described in A. niger, but considerably expanded [9,10]. For further reference, we name this cluster (i.e., scaffold 43 of strain DTO006G7) SORBUS after the tree Sorbus aucuparia, as sorbic acid has been first isolated from its berries by August Hoffman [19,20]. While SORBUS as a whole is only present in the R-type strains (DTO006G1, DTO006G7, DTO012A2, DTO012A8, DTO013F2, DTO013E5), some S-type strains share up to 5 kbp parts of the sequence, especially in the first 80 kbp of the cluster (Fig 4). The conservation of SORBUS in the R-type strains was visualized with Clinker, demonstrating that the genes of the SORBUS cluster are well conserved and highly syntenic in the R-type strains (S1 Fig). It should be noted that the R-type strains were all isolated from non-cheese environments. Based on alignments of the sequencing reads to DTO006G7 we confirmed that out of 35 previously sequenced P. roqueforti strains [18], none of the 17 cheese strains contained the SORBUS cluster, while two out of the 18 non-cheese strains contained the SORBUS cluster (S1 Table).
Table 2

Genes located on the SORBUS cluster in strain DTO006G7.

Fold change (log2FC) of the sorbic acid samples compared to the control is given. Underlined genes are significantly differentially expressed (adjusted p-value < 0.05) and the mean expression (FPKM) of three biological replicates is given per condition (control and sorbic acid). Rows highlighted in grey indicate genes that are not unique for the R-type strains.

GeneIdNameFunctional annotation(PFAM or description)Expression (FPKM)log2FCp-value (Adj.)
ControlSorbic acid
g12000FAM167480.690.66
g12001 hypothetical protein 16 86 2.18 0.00
g12002NEMP22-0.270.89
g12003BTB/POZ domain16230.480.35
g12004hypothetical protein5940-0.560.16
g12005hypothetical protein001.050.59
g12006Reverse transcriptase00--
g12007DUF372300--
g12008hypothetical protein00--
g12009Hly-III related protein00--
g12010Cyclin like F-box110.720.32
g12011Cyclin like F-box17240.470.25
g12012 Reverse transcriptase 3 7 1.02 0.03
g12013 Endonuclease/Exonuclease/phosphatase family 1 4 1.40 0.00
g12014Probable transposable element130.830.51
g12015Aldo/keto reductase family463365-0.340.36
g12016Reverse transcriptase2620-0.350.57
g12017Telomere-associated recq-like helicase13150.230.57
g12018Cyclin like F-box2925-0.230.41
g12019hypothetical protein00-1.040.83
g12020hypothetical protein110.220.96
g12021Centrosomin N-terminal motif 1002.510.24
g12022hypothetical protein371.020.09
g12023Cyctochrome C mitochondrial import factor10-0.840.73
g12024hypothetical protein8100.210.86
g12025hypothetical protein560.250.60
g12026Winged helix-turn helix00--
g12027hypothetical protein1510-0.570.23
g12028Pronucleotidyl transferase00-0.140.95
g12029Pronucleotidyl transferase10-0.830.64
g12030hypothetical protein11150.420.58
g12031hypothetical protein00--
g12032 padB Flavoprotein450288-0.620.15
g12033PHF5-like protein330.110.91
g12034Pyridoxamine 5’-phosphate oxidase370287-0.360.48
g12035Mitochondrial carrier protein11120.060.96
g12036Mitochondrial carrier protein12170.420.65
g12037 GTP cyclohydrolase II 10 2 -1.86 0.01
g12038Potassium channel tetramerisation domain26300.180.82
g12039hypothetical protein1710-0.580.58
g12040 cdcC UbiD 134 65 -1.02 0.01
g12041Pyridoxamine 5’-phosphate oxidase21-0.540.90
g12042hypothetical protein9110.270.76
g12043 GTP cyclohydrolase II 5 1 -1.63 0.03
g12044 GPR1/FUN34/yaaH family 47 9 -2.05 0.00
g12045Mitochondrial carrier protein11-0.610.73
g12046hypothetical protein1212-0.030.97
g12047Helix-turn-helix domain/endonuclease880.030.95
g12048Flavonol reductase/cinnamoyl-CoA reductase53560.070.93
g12049Flavonol reductase/cinnamoyl-CoA reductase98-0.260.63
g12050Tannase427352-0.270.55
g12051hypothetical protein011.76
g12052Transposase-like protein670.130.84
g12053Reverse transcriptase000.310.89
g12054Zinc finger transcription factor2613-0.980.07
g12055Transposase-like protein001.64
g12056 cdcB UbiD211142-0.510.65
g12057 padC Flavoprotein7875-0.070.90
g12058Pyridoxamine 5’-phosphate oxidase like3485040.500.28
g12059GTP cyclohydrolase II2524-0.040.96
g12060GPR1/FUN34/yaaH family33-0.390.67
g12061Flavin reductase like domain20281532-0.400.42
g12062hypothetical protein2722950.080.86
g120633, 4-dihydroxy-2-butanone 4-phosphate synthase6534-0.850.25
g12064 padA Flavoprotein9245-0.970.08
g12065 cdcA UbiD 65 22 -1.53 0.00
g12066 sdrA Hypothetical transcription factor21-0.760.71
g12067hypothetical protein00
g12068NACHT domain480.850.09
g12069 Histone H3 3 6 0.95 0.00
Fig 4

Genome comparison reveals unique gene cluster in R-type strains.

Genomic alignment of 33 P. roqueforti strains to the SORBUS cluster (scaffold 43 of DTO006G7). Predicted genes are indicated with arrows, repetitive DNA is indicated in red, the sequence read coverage is indicated in the green tracks, and blue bars indicate (partial) overlap with DTO006G7. The complete SORBUS cluster is only present in the R-type strains (DTO006G1, DTO006G7, DTO012A2, DTO012A8, DTO013F2 and DTO013E5).

Genome comparison reveals unique gene cluster in R-type strains.

Genomic alignment of 33 P. roqueforti strains to the SORBUS cluster (scaffold 43 of DTO006G7). Predicted genes are indicated with arrows, repetitive DNA is indicated in red, the sequence read coverage is indicated in the green tracks, and blue bars indicate (partial) overlap with DTO006G7. The complete SORBUS cluster is only present in the R-type strains (DTO006G1, DTO006G7, DTO012A2, DTO012A8, DTO013F2 and DTO013E5).

Genes located on the SORBUS cluster in strain DTO006G7.

Fold change (log2FC) of the sorbic acid samples compared to the control is given. Underlined genes are significantly differentially expressed (adjusted p-value < 0.05) and the mean expression (FPKM) of three biological replicates is given per condition (control and sorbic acid). Rows highlighted in grey indicate genes that are not unique for the R-type strains. In the second GWAS approach, PLINK [21] was used to identify which SNPs correlate with sorbic acid resistance. This method allowed for the quantitative use of log10(MICu) values as input for analysis, as opposed to the first approach described above. The correlation between the presence of SNPs and the sorbic acid resistance is visualized in a Manhattan plot (Fig 5). SNPs located on genes with a -log10(P) > 5 and either a high or moderate impact (SNPeff) were selected (Table 3). This resulted in 338 SNPs in 41 genes. Out of these SNPs, 29 had a ‘high’ impact according to SNPeff and were located in 17 genes. Only six out of these 17 genes with high impact variants (g7017, g8100, g8106, g9942, g9943, g9976) were not located in the SORBUS cluster, the other 11 genes were either among the non-unique genes present on SORBUS, or genes of which less than 90% of the sequence was found in S-type strains. Functional annotation revealed that protein g8100 contains an ankyrin-repeat domain, while protein g9943 is homologous to a zinc finger C3H1-type domain-containing protein. In contrast, the four remaining proteins had no functional annotations. In all cases, these six high impact variant-containing genes showed high similarity (> 99%) to genes in other Penicillium species. SNPs in two additional genes encoding a putative transmembrane transporter (g216) and cation transporter (g296) also correlated with increased resistance to sorbic acid.
Fig 5

Manhattan plot shows SNPs associated with sorbic acid resistance.

Scaffolds are listed on the x-axis, while the y-axis display the significance of the association (−log10(p-value)). Yellow, orange and red dots indicate ‘low’, ‘moderate’ or ‘high’ impact SNPs as determined by SNPeff, respectively. The GeneIDs associated with the SNPs with a −log10(p-value) > 7.5 are indicated. The SORBUS cluster is located between the dashed lines.

Table 3

Genes (DTO006G7) containing SNPs associated with sorbic acid resistance.

Only SNPs with a −log10(p-value) > 5 and a moderate or high impact as determined by SNPeff are listed. Grey shading indicates overlap with sequence repeats.

GeneIDSNP impactEffectPFAM annotation
ModerateHigh
g10310hypothetical protein
g21610ABC transporter transmembrane region
g23510STAG domain
g29650Cation transporter/ATPase, N-terminus
g31210Probable molybdopterin binding domain
g31310DDHD domain
g31410hypothetical protein
g31510FAD binding domain
g701511Stop lost & splice region varianthypothetical protein
g810001Stop gainedAnkyrin repeats (3 copies)
g810130DDE superfamily endonuclease
g810470hypothetical protein
g8105100hypothetical protein
g810631Stop gainedhypothetical protein
g8107260Ankyrin repeats (3 copies)
g1200220NEMP
g1200591Frameshift varianthypothetical protein
g12006422Frameshift variant & Stop gainedReverse transcriptase
g1200780Protein of unknown function (DUF3723)
g12013483Frameshift variantsEndonuclease/Exonuclease/phosphatase family
g1201481Stop lost & splice region variantProbable transposable element
g1201620Reverse transcriptase
g12019110hypothetical protein
g1202591Stop gainedhypothetical protein
g12028181Stop lost & splice region variantPronucleotidyl transferase
g12029266Stop gained, Stop lost & splice variantPronucleotidyl transferase
g12030162Frameshift variant & Stop losthypothetical protein
g1204610hypothetical protein
g1205241Stop gainedTransposase-like protein
g1205310Reverse transcriptase
g1205440hypothetical protein
g1205562Frameshift variant & Stop lostTransposase-like protein
g1206971Frameshift variantHistone H3
g974970hypothetical protein
g994212Stop gainedhypothetical protein
g994322Stop gainedAAA domain
g994480hypothetical protein
g994540ATPase family associated with various cellular activities (AAA)
g996110hypothetical protein
g996620Endonuclease-reverse transcriptase
g997601Start losthypothetical protein
TOTAL 30929

Manhattan plot shows SNPs associated with sorbic acid resistance.

Scaffolds are listed on the x-axis, while the y-axis display the significance of the association (−log10(p-value)). Yellow, orange and red dots indicate ‘low’, ‘moderate’ or ‘high’ impact SNPs as determined by SNPeff, respectively. The GeneIDs associated with the SNPs with a −log10(p-value) > 7.5 are indicated. The SORBUS cluster is located between the dashed lines.

Genes (DTO006G7) containing SNPs associated with sorbic acid resistance.

Only SNPs with a −log10(p-value) > 5 and a moderate or high impact as determined by SNPeff are listed. Grey shading indicates overlap with sequence repeats. To investigate the evolutionary origin of the SORBUS cluster, the presence of five PFAM domains (from g12060, g12061 and g12063-g12065) that are present on the SORBUS cluster was analysed in 32 Aspergilli and Penicillia as well as the 34 P. roqueforti strains (Table 4). These PFAM domains encode a putative GPR1/FUN34/yaaH family (g12060), a flavin reductase like domain (g12061), 3, 4-dihydroxy-2-butanone 4-phosphate synthase (g12063), a flavoprotein (g12064, padA) and a UbiD domain (g12065, cdcA). These domains are selected as they are clustered and because of their predicted role. The first domain has been associated with acetic acid sensitivity in S. cerevisiae [22], while the latter four domains are part of the gene cluster described in A. niger [9]. The SORBUS genes g12061, g12064 and g12065 are more similar to genes of several Aspergillus species, as determined by a gene tree approach(Table 4), whereas the PFAM domains from g12060 and g12063 did not cluster with any of the species included. In addition, the PFAMs present in the core genome (present both in S- and R-type strains) showed higher similarities to those of P. digitatum and P. oxalicum.
Table 4

Number of genes containing PFAM domains corresponding to g12060, g12061 and g12063-g12065 (PF01184, PF01613, PF00926, PF02441, PF01977) based on phylogenetic trees constructed with their respective PFAM domains.

Top six strains are R-type P. roqueforti strains containing the SORBUS cluster. C (CORE) indicates if the domains aligned closely to the PFAMs not unique for the SORBUS cluster or did not align to P. roqueforti, S (SORBUS) indicates the number of PFAM domains which aligned closely to PFAMs originated from SORBUS.

PFAM familyg12060 PF01184g12061 PF01613g12063 PF00926g12064 PF02441 (padA)g12065 PF01977 (cdcA)
Gpr1_Fun34 YaaHFlavin_Reduct3,4-dihydroxy-2-butanone 4-phosphate synthaseFlavoproteinUbiD domain
StrainCSCSCSCSCS
DTO013F27141115323
DTO013E57131115323
DTO012A87131115323
DTO012A17131115323
DTO006G77141115323
DTO006G17141115323
DTO377G37040105020
DTO377G27040105020
DTO375B17030105020
DTO369A17040105020
DTO265D57030105020
DTO163G47030105020
DTO163F57030105020
DTO163C37040105020
DTO130C17030105020
DTO127F97030105020
DTO127F77030105020
DTO126G27040105020
DTO102I97040105020
DTO101D67030105020
DTO081F97040105020
DTO070G27030105020
DTO046C57030105020
DTO039G37040105020
DTO032C67030105020
DTO027I67040105020
DTO013F57040105020
DTO012A97030105020
DTO012A77030105020
DTO012A67030105020
DTO012A27030105020
DTO003H17030105020
DTO003C37030105020
DTO002I67030105020
Penicillium roqueforti FM1648030105020
Penicillium oxalicum 4020102000
Penicillium digitatum 1020104010
Paecilomyces variotii 8060104120
Paecilomyces variotii DTO217A27070105130
Paecilomyces niveus 9080105120
Aspergillus wentii 4050104120
Aspergillus violaceofuscus 4051104020
Aspergillus vadensis 3050104121
Aspergillus uvarum 4051103020
Aspergillus tubingensis 3060104021
Aspergillus sclerotiicarbonarius 3051104120
Aspergillus sclerotioniger 3041103010
Aspergillus piperis 3050104120
Aspergillus aureofulgens 3050104120
Aspergillus niger N4023070105130
Aspergillus niger ATCC 10153070105130
Aspergillus neoniger 4050104121
Aspergillus niger (lacticoffeatus) 3050104120
Aspergillus japonicus 4051104030
Aspergillus indologenus 4051104020
Aspergillus ibericus 3051104110
Aspergillus heteromorphus 3050103120
Aspergillus fumigatus 3030102000
Aspergillus flavus 5040104020
Aspergillus fijiensis 4051104140
Aspergillus eucalypticola 3050104121
Aspergillus ellipticus 3050106270
Aspergillus costaricaensis 4050104021
Aspergillus brunneovolaceus 4061104140
Aspergillus aculeatinus 4051104030
Arthroderma benhamiae 2010012000

Number of genes containing PFAM domains corresponding to g12060, g12061 and g12063-g12065 (PF01184, PF01613, PF00926, PF02441, PF01977) based on phylogenetic trees constructed with their respective PFAM domains.

Top six strains are R-type P. roqueforti strains containing the SORBUS cluster. C (CORE) indicates if the domains aligned closely to the PFAMs not unique for the SORBUS cluster or did not align to P. roqueforti, S (SORBUS) indicates the number of PFAM domains which aligned closely to PFAMs originated from SORBUS. Furthermore, two genes with homology to transposase-like proteins (g12052 and g12055) and several reverse-transcriptase domains and other transposon-related domains were identified in encoded proteins on the SORBUS cluster (Table 2). An analysis of the repetitive content of the genomic DNA revealed that the SORBUS cluster was more repetitive than the average of the genome (6.4% and 3.8%, respectively). In particular, long interspersed nuclear elements (LINEs) of the Tad1 retrotransposon family were enriched on SORBUS, as well as unknown/unclassified repetitive elements (S2 Table).

Genome-wide expression profiles of a sorbic acid-sensitive and sorbic acid-resistant strain

A genome-wide expression profile was performed on a sorbic acid-sensitive strain (DTO377G3) and a sorbic acid-resistant strain (DTO006G7) grown on MEB in the presence or absence of 3 mM sorbic acid. The sequence reads were aligned to the assemblies of these two P. roqueforti strains. Gene expression values were calculated and differentially expressed genes were identified (S3 Table). The expression profiles of the biological replicates were similar, demonstrated by their clustering in the PCA plot (Fig 6). Combined, PC1 and PC2 explain 90% of the variation observed. The samples treated with sorbic acid separate from the control samples on Y-axis while the differences between the strains are separated by on the X-axis.
Fig 6

Principle component analysis demonstrates clustering of sorbic acid-exposed strains.

Each dot represents a biological replicate. PC1 and PC2 together describe 90% of the variation. The sample grown on sorbic acid separate on the Y-axis and the two strains are separated on the X-axis.

Principle component analysis demonstrates clustering of sorbic acid-exposed strains.

Each dot represents a biological replicate. PC1 and PC2 together describe 90% of the variation. The sample grown on sorbic acid separate on the Y-axis and the two strains are separated on the X-axis. Genes that are either up- or down-regulated in both the R-type and S-type strain when exposed to sorbic acid might be involved in a general response to sorbic acid stress in P. roqueforti. Venn diagrams were constructed revealing that 33 genes were significantly up-regulated in both the S-type and R-type strain (Fig 7A). An enrichment analysis revealed that the functional annotation terms ‘secretion signal’ and ‘small secreted protein’ are over-represented in these genes. Among the 21 shared down-regulated genes (Fig 7B) the NmrA-like family and NAD(P)H-binding domains were over-represented. S3 Table lists all genes present in both shared pools. The expression of the SORBUS genes (g12000-g12069) was analysed. This revealed that nine out of the 70 genes were significantly differentially expressed (Table 2), including two out of the three genes homologous to cdcA that were lower expressed (log2FC = -1.5) in the presence of sorbic acid. On the SORBUS cluster g12061 (with a flavin-reductase like domain) was the highest expressed gene, reaching 2000 FPKM in the control condition.
Fig 7

Venn diagram of differentially expressed genes in the presence of sorbic acid.

Up- (A) or down-regulated (B) in R-type DTO006G7 and S-type DTO377G3 when exposed to sorbic acid compared to the control in the absence of this weak acid. Genes were considered up- or down-regulated when the log2FC > 2, p-value < 0.05 and FPKM >10.

Venn diagram of differentially expressed genes in the presence of sorbic acid.

Up- (A) or down-regulated (B) in R-type DTO006G7 and S-type DTO377G3 when exposed to sorbic acid compared to the control in the absence of this weak acid. Genes were considered up- or down-regulated when the log2FC > 2, p-value < 0.05 and FPKM >10.

Partial knock-out confirms role of SORBUS in sorbic acid resistance

Strains P. roqueforti DTO013F2ΔkusA SC1 and SC2 were obtained lacking part of the SORBUS cluster (DTO013F2 was chosen since a ΔkusA mutant was only available for that strain). Nanopore sequencing was used to investigate how much of the SORBUS cluster was removed in these two transformants (Fig 8). This revealed that the deletions were larger than the 93 kbp region. A 105 and a 131 kbp part of the cluster was removed in DTO013F2 ΔkusA SC1 and SC2, respectively. Detailed analysis of the nanopore reads revealed that the smaller fragments within the SORBUS cluster present in SC1 and SC2 strains consisted of highly dissimilar sequences, indicating these are parts of repetitive DNA (Fig 8). Both SC strains had a reduced resistance to sorbic acid when compared to DTO013F2 and DTO013F2 ΔkusA with a MICu similar to the S-type strain DTO377G3 (Fig 9). This shows that part of the SORBUS cluster is involved in sorbic acid stress mitigation.
Fig 8

Schematic overview of SORBUS sequences of the knockouts strains.

Strains DTO006G7, DTO013F2 and knock-out strains DTO013F2 ΔkusA, SC1 and SC2 are depicted. Predicted genes are indicated with arrows, repetitive DNA is indicated in red, the sequence read coverage is indicated in the green tracks, and blue bars indicate (partial) overlap with DTO006G7. The orange bar indicates the targeted knock-out region (target), flanked by the 5’ and 3’ ends (in blue). Dotted lines and scissors indicate the loci targeted by the sgRNAs.

Fig 9

Sorbic acid resistance of five P. roqueforti strains and A. niger N402.

The average MIC (mM ± standard deviation) is given for the fungal strains. Each bar graph represents average value of biological independent triplicates. Error bars indicate standard deviation and letters indicate significant difference in MICu (p < 0.05).

Schematic overview of SORBUS sequences of the knockouts strains.

Strains DTO006G7, DTO013F2 and knock-out strains DTO013F2 ΔkusA, SC1 and SC2 are depicted. Predicted genes are indicated with arrows, repetitive DNA is indicated in red, the sequence read coverage is indicated in the green tracks, and blue bars indicate (partial) overlap with DTO006G7. The orange bar indicates the targeted knock-out region (target), flanked by the 5’ and 3’ ends (in blue). Dotted lines and scissors indicate the loci targeted by the sgRNAs.

Sorbic acid resistance of five P. roqueforti strains and A. niger N402.

The average MIC (mM ± standard deviation) is given for the fungal strains. Each bar graph represents average value of biological independent triplicates. Error bars indicate standard deviation and letters indicate significant difference in MICu (p < 0.05).

Discussion

P. roqueforti is often encountered as a spoilage organism of food and feed. This can partly be attributed to its ability to grow at refrigeration temperatures [23], low O2 levels [24] and/or its resistance to preservatives, such as sorbic acid [25]. The inhibitory effect of propionic, benzoic and sorbic acid on P. roqueforti growth was assessed and benzoic acid was found to have the strongest inhibitory effect on 34 P. roqueforti strains. Previous studies report that P. roqueforti is resistant to benzoic acid. Growth was observed up to levels of 3000 ppm sodium benzoate [26,27], while we observed growth at 610 ppm (5 mM). Propionic acid hardly affected the growth of P. roqueforti, which is not surprising since it has already been reported that this fungus germinates on potato dextrose agar containing 0.5 M propionic acid (pH 5.6) with an estimated MIC of 0.79 M [23]. The sorbic acid resistance of P. roqueforti has also been assessed previously [14,25-30], reporting MIC values ranging from 0–40 mM sorbic acid (i.e. a MICu = 0–20 mM). This is similar to the range (MICu = 4.2–21.3 mM) found in this study. In fact, a resistant and sensitive group (R- and S-types) consisting of six and 28 strains, respectively, were found in this study. S-type and R-type strains showed a resistance up to 11.9 mM and 21.2 mM undissociated sorbic acid, respectively. The R-type but not the S-type strains were found to contain a gene cluster (SORBUS) containing 70 genes, of which 51 genes are unique for the R-type strains. Even though the R-type strain DTO006G7 was used with the least fragmented Illumina assembly, the limits of the SORBUS cluster could not be established based on these results alone. The Oxford Nanopore of the DTO013F2 ΔkusA assembly revealed that SORBUS is part of a 2.3 Mb contig. Genes homologous to sorbic acid degradation-associated genes in A. niger (sdrA, cdcA, padA, warA) were identified in the P. roqueforti genome [9,12]. A total of 1, 4, 3 and 1 homologs were found, respectively, and only warA and one cdcA homolog were not located on the SORBUS cluster. Two genes with putative transmembrane transport (g216) or cation transporter (g296) function were identified in the PLINK analysis. The encoded proteins might also be involved in sorbic acid stress mediation alongside the SORBUS cluster, because in addition to decarboxylation, sorbic acid stress could be mediated by an efflux pump or through removal of protons from the plasma membrane by H+-ATPase [12,31]. As mentioned, only six out of the 34 strains assessed in this study were found to contain the SORBUS cluster. This might be explained by the ecology of P. roqueforti. P. roqueforti is found in forest soil and wood, but is also associated with lactic acid bacteria (e.g. in silage). Frisvad & Samson [32] already speculated that these micro-organisms may have very well co-evolved, because all the metabolics produced by lactic acid bacteria (e.g. lactic and acetic acid, CO2) are tolerated by P. roqueforti [33]. The elevated levels of weak acids might act as selection pressure to maintain SORBUS in P. roqueforti strains which grow in this niche environment. In contrast, this selection pressure is not present in cheese which might explain that none of P. roqueforti strains in the ‘cheese’ population contain the SORBUS cluster [18]. It should be noted that the S-type sequence fragments that align with SORBUS mostly consists of proteins annotated as transposase-like proteins or reverse transcriptase, which might explain why these fragments are found in the S-type strains and it suggests that SORBUS has been obtained from a different species by horizontal gene transfer. This is supported by the phylogenetic analysis on PFAM domains of five SORBUS genes, as the results show that three out of the five SORBUS specific genes are more closely aligned to Aspergillus species than Penicillium species. The gene cluster SORBUS was also present in two of 35 previously sequenced P. roqueforti strains [18] and their sorbic acid resistance could be determined to validate the role of SORBUS in sorbic acid resistance. The transcriptome analysis of the R-type P. roqueforti strain DTO006G7 revealed that two of the three homologs of A. niger cdcA that are located on SORBUS (cdcA and cdcC) are significantly down-regulated during growth in the presence of sorbic acid. This is in contrast with the results previously described [9], where the authors found a > 500-fold change for cdcA when A. niger was cultivated on sorbic acid. This difference might be caused by the difference in medium type, because that study [9] used sorbic acid as the sole carbon source, whereas in our experiments sorbic acid was used as a stressor in a nutrient-rich medium. This indicates that the sorbic acid content in the medium does not increase gene expression of the loci leading to increased resistance, suggesting that either these genes are constitutively expressed or expression is induced based on a different compound present in MEB. Two partial SORBUS knockout strains in the DTO013F2 ΔkusA R-type strain showed reduced sorbic acid resistance to a level similar to that of the S-type strains. In contrast to the DTO013F2 ΔkusA strain, the SORBUS knockout strains were not repaired through homology directed repair using the donor DNA nor non-homologous end-joining. This might be due to the size of the fragment. Possibly, an alternative DNA repair mechanism such as microhomology-mediated end joining as described in A. fumigatus is employed in the DTO013F2 ΔkusA strain [34]. Despite the absence of a full SORBUS cluster in the S-type strains and the deletion strain, their MIC is still relatively high when compared to other fungal species such as A. niger (as shown in Fig 9) or A. fumigatus [12]. This suggests that along with the genes on the SORBUS cluster, other proteins are involved in sorbic acid stress mitigation. Our transcriptomics analysis revealed 21 down-regulated and 33 up-regulated genes that were similarly expressed both in a S-type and a R-type strain when exposed to sorbic acid. One of these up-regulated genes is the cation transporter (g1689, S4 Table) which could have H+ ATPase activity in the plasma membrane to counteract the acidification caused by the undissociated sorbic acid in the cytosol [31]. In conclusion, the results presented in this study demonstrate that weak-acid resistance varies between P. roqueforti strains and the SORBUS cluster contributes to a high sorbic acid resistance. Yet, even in the absence of this cluster the resistance is still relatively high, implying that other mechanisms are also involved in resistance to this weak acid.

Methods

Strain and cultivation conditions

All fungal strains (S1 Table) were provided by the Westerdijk Fungal Biodiversity Institute. Conidia were harvested with a cotton swab after seven days of growth at 25°C on malt extract agar (MEA, Oxoid, Hampshire, UK) and suspended in 10 mL ice-cold ACES buffer (10 mM N(2-acetamido)-2-aminoethanesulfonic acid, 0.02% Tween 80, pH 6.8). The conidia suspension was passed through a syringe containing sterilized glass wool and washed twice with ACES buffer after centrifugation at 4°C for 5 min at 2,500 g. The spore suspension was set to 2·108 spores mL-1 using a Bürker-Türk haemocytometer (VWR, Amsterdam, The Netherlands) and kept on ice until further use.

Weak acid growth assay

Conidial suspension (5 μL) was inoculated in the centre of MEA plates containing 5 mM potassium sorbate, benzoic acid, or sodium propionate (all from Sigma). Medium was set at pH 4.0 using HCl, which corresponds to undissociated concentrations of 4.26, 3.07 and 4.42 mM of these acids, respectively. The absence of preservative was used as a control. Cultures were photographed after 5 days and colony surface area was measured using a manual threshold in ImageJ.

Sorbic acid resistance

Conidial suspensions of the P. roqueforti strains were diluted to 107 spores mL-1 and mixed in a 1:99 ratio with MEB pH 4.0 with and without 25 mM potassium sorbate. 300 μL of the resulting mixture was added in a well of a 96 wells plate (Greiner Bio-One, Cellstar 650180, www.gbo.com). Serial dilutions were made by mixing 225 μL MEB with potassium sorbate and 75 μL MEB without potassium sorbate, resulting in wells with potassium sorbate concentrations of 25, 18.75, 14.06, 10.55, 7.91, 5.93, 4.45 and 0 mM. This corresponded to undissociated sorbic acid concentrations of 21.22, 15.92, 11.94, 8.95, 6.72, 5.04, 3.78 and 0 mM, respectively. The undissociated sorbic acid concentrations were determined using the Henderson-Hasselbach equation. The 96-wells plates were sealed with parafilm and incubated for 28 days at 25°C using biologically independent replicates. After 28 days, growth was assessed and the undissociated minimal inhibitory concentration (MICu) was determined for each strain. The MICu was defined as the lowest undissociated concentration in which no hyphal growth was observed. An one-way ANOVA followed by a Tukey’s HSD test was used to test for significant differences in MICu (P < 0.05).

DNA extraction, genome sequencing, assembly and annotation

DNA extraction was performed as described [35] and Illumina NextSeq500 2x150 bp paired-end technology was used for sequencing (Utrecht Sequencing Facility, useq.nl). The sequence reads have been deposited in the Short Read Archive (SRA) under PRJNA787443, and the accession numbers of the individual strains are listed in S1 Table. The reads were trimmed on both ends when quality was lower than 15 using bbduk from the BBMap tool suite (BBmap version 37.88; https://sourceforge.net/projects/bbmap/). The trimmed reads were assembled with SPAdes v3.11.1 applying kmer lengths of 21, 33, 55, 77, 99 and 127 and the–careful setting was used to reduce the number of indels and mismatches [36]. Genes were predicted with Augustus version 3.0.3 [37] using the parameter set that was previously generated for P. roqueforti [35]. Functional annotation of the predicted genes was performed as described [38]. The assemblies, gene predictions and functional annotations can be accessed interactively at https://fungalgenomics.science.uu.nl. Moreover, the assemblies and gene predictions have been deposited to GenBank under BioProject ID PRJNA745079, and the accession numbers of the individual strain are listed in S1 Table. Repetitive sequences in the assembly were identified de novo with RepeatModeler version 2.0.3 [39], which uses RepeatScout version 1.0.6 [40], RECON version 1.08 and Tandem Repeats Finder version 4.09 [41]. RepeatMasker version 4.1.2 (http://www.repeatmasker.org) was subsequently used to annotate the identified repetitive content, using the default Dfam database provided with RepeatMasker.

Genomic phylogeny and analysis

Single-copy orthologous groups were identified and aligned using OrthoFinder v2.5.2 [42]. A maximum likelihood (ML) inference was performed using RAxML [43] under the PROTGAMMAAUTO model. The number of bootstraps used was 200 (Average WRF = 0.43%) and Penicillium rubens Wisconsin 54–1255 [44] was used to root the tree. The phylogenetic tree was visualized using iTOL v5 [45].

Genome-wide association study

Two complementary approaches were taken to identify genomic regions associated with sorbic acid resistance. The first approach is based on the presence of genomic regions that are only found in the sorbic acid resistant strains, and the second approach is based on SNPs (single nucleotide polymorphisms) that are significantly associated with sorbic acid resistance. In the first approach, P. roqueforti strains were grouped into resistant (R-type) and sensitive (S-type) groups. DTO006G7 was selected from the R-group as reference for the analysis, because the assembly of this strain was the least fragmented in this group. Next, the program nucmer from the MUMmer suite (http://mummer.sourceforge.net/, version 4.0) was used to perform whole-genome alignment. Each genome was aligned to the reference and regions and genes unique for the R-type isolates were identified with the BEDtools package. The genome alignment was visualized with pyGenomeTracks [46]. A gene was considered absent when 90% or more of its sequence was not found in the genome of a strain. In the second approach, the best practices recommended by GATK (Genome Analysis Toolkit) were used to obtain single nucleotide polymorphisms (SNPs) for each strain. In short, the sequence reads were aligned to the reference genome (DTO006G7) using Bowtie2 (version 2.2.9) and PCR duplicates were removed with Picard tools (MarkDuplicates; version 2.9.2). For variant calling, the HaplotypeCaller (GATK, version 3.7) was used with the following parameters: -stand_call_conf 30 (only retaining variants with a QUAL score > 30), -ploidy 1 and -ERC. The single-sample variant files (GVCFs) were joined into a GenomicsDB before joint genotyping. The variants were annotated using SNPeff (version 4.3) based on their predicted biological effect, such as the introduction of an early stop-codon or a synonymous annotation. The number of SNPs and their putative impact on gene function (‘low’, ‘moderate’ or ‘high’, as defined by SNPeff) was listed for each gene per genome. The SNPs were then correlated to sorbic acid resistance using PLINK v1.9 [21] with parameters ‘–maf 0.5 –allow-extra-chr’. The resulting association files were analysed and visualized using R.

Analysis of SORBUS cluster

A synteny analysis of the whole-genome alignments to the SORBUS cluster (scaffold_043 in strain DTO006G7) was performed for the R-type strains. From the MUMmer suite, nucmer was used to identify matching sequences larger than 1.5 kbp (minimal match length was set to 1.5 kbp). A Genbank file was created for each matching sequence with the SORBUS cluster and these sequences were aligned and visualized in a schematic gene overview of the SORBUS clusters using Clinker (v.0.0.23) [47]. Gene orthology was determined by reciprocal BlastP analysis (i.e., a bidirectional best blast hit) or a gene tree approach, as indicated. NCBI BLAST+ version 2.12.0 was used for the reciprocal BlastP approach. For the gene tree approach, the protein domains indicated in the text were first identified by PFAM domain version 32 [48]. Next, the amino acid sequences corresponding to these domains were aligned with MAFFT version 7.310 using auto settings [49] and a gene tree was constructed using FastTree version 2.1 with default settings [50] This gene tree was visualized with DendroScope 3 [51] and the orthologs of the protein of interest were manually identified based on coclustering in the gene tree.

RNA extraction and sequencing

A genome-wide transcriptome analysis was performed on the sorbic acid sensitive P. roqueforti strain DTO377G3 and the sorbic acid-resistant P. roqueforti strain DTO006G7. Erlenmeyer flasks containing 50 mL MEB (pH 4.0) were inoculated with 100 μl ACES containing 107 conidia and incubated for 48 h at 25°C and 200 rpm. Mycelium was harvested using a sterilized miracloth filter and equally divided in Erlenmeyer flasks with 50 mL MEB (pH 4.0) or 50 mL MEB containing 3 mM potassium sorbate (pH 4.0). Growth was continued for another four hours, after which the mycelium was harvested using a sterilized Miracloth filter and frozen in liquid nitrogen. Total RNA was isolated with the RNeasy Plant Mini Kit (Qiagen) and purified by on-column DNase digestion according to the manufacturer’s protocol. RNA was sequenced with Illumina NextSeq2000 2x50 bp paired-end technology (Utrecht Sequencing Facility; useq.nl). The transfer experiment and subsequent RNA-sequencing was performed in biological triplicates. The transcript lengths, counts per gene and read mapping were determined using Salmon v1.5.2 with—validateMappings [52]. The transcript abundance of reads was quantified using custom constructed indices for DTO006G7 and DTO377G3. DESeq2 [53] was used for pairwise comparisons of the samples and the identification of differentially expressed genes. Genes with low read counts (<10) were excluded from the analysis and a gene was considered differentially expressed when the adjusted p-value was < 0.05. In addition, genes were considered up- or down-regulated when they had a log2 fold change of > 2 or < -2, respectively. A Fisher Exact test as implemented in PyRanges [54] was employed to identify over- and under-representation of functional annotation terms in sets of genes. To correct for multiple testing the False Discovery Rate method was used, with a P-value < 0.05 as cut off. The sequence reads are available in the Short Read Archive under BioProject PRJNA796729.

Plasmid construction and generation knockout strain

A kusA deficient DTO013F2 strain was constructed using the protocol and plasmid pPT22.4 as described [55]. Deletion was confirmed through diagnostic PCR and nanopore sequencing (S2 Fig). CRISPR/Cas9 technology was employed to remove a 93 kbp region (from 84 to 177 kbp) of the SORBUS cluster in DTO013F2 ΔkusA. Two single-guide RNAs (sgRNAs) were designed to perform a simultaneous double restriction in the 93 kbp SORBUS region. Transformation procedures were performed as described, with some modifications [55]. In short, plasmid pFC332 [56] was used as a vector to express the sgRNA, cas9 and a hygromycin selection marker (for primer sequences used in this study see S5 Table). The 5’ and 3’ flanking regions of sgRNA were amplified using plasmids pTLL108.1 and pTLL109.2 as template [57]. The amplified products were fused and introduced into pFC332 using Gibson assembly (NEBuilder HiFi DNA Assembly Master Mix, New England Biolabs, MA, USA). The vectors containing the sgRNA were then transformed into competent Escherichia coli TOP10 cells for multiplication overnight. Plasmids were recovered using Quick Plasmid Miniprep Kit (ThermoFisher, Waltham, MA, USA) and digested using SacII to verify the presence of the sgRNA. In addition, the correct integration of the sgRNA was confirmed with sequencing. To construct donor DNA, two 1 kbp homologous regions located at scaffold 43 at nucleotide position 83573 to 84629 and 177760 to 178854 were amplified and fused using a unique 23 nucleotide sequence GGAGTGGTACCAATATAAGCCGG with a PAM site for further genetic engineering. Transformation was performed as described with adjustments [58]. In short, P. roqueforti conidia were incubated 48 h at 25°C in 100 mL potato dextrose broth at 200 rpm. The mycelium was washed in SMC and incubated for 4 h at 37°C in lysing enzymes from Trichoderma harzianum (Sigma) dissolved in SMC. Protoplasts were resuspended in 1 mL STC and kept on ice after centrifuging for 5 min at 3000 g. To 100 μL of this suspension, 2 μg donor DNA, 2 μg of each pFC332 vector containing sgRNA (pTF and pSdrA) and 1.025 mL of freshly made PEG solution was added (see S6 Table for the vectors used in this study). After 5 min, 2 mL STC was added and the protoplasts were mixed with 20 mL liquid MMS containing 0.3% agar and 200 μg hygromycin mL-1 (InvivoGen, San Diego, CA, USA). The mixture was poured on MMS containing 0.6% agar and 200 μg hygromycin mL-1. Transformants were grown for 7–14 days at 25°C and then single streaked on MM containing 100 μg mL-1 hygromycin until sporulating colonies appeared. Next, the plasmid was removed by a single streak on MM plates without antibiotic. Finally, transformants were single streaked on MM, MM containing 100 μg hygromycin mL-1 and MEA plates to confirm that transformants lost the plasmids. To verify the transformants, genomic DNA from DTO013F2 ΔkusA and two DTO013F2 ΔkusA ΔSORBUS strains was sequenced with Oxford Nanopore MinION technology (FLO-MIN106) at the Utrecht Sequencing Facility (useq.nl). The reads were assembled using Canu v2.2 using the option for raw nanopore data and guided by a genome size of 28 Mbp [59]. In addition, the nanopore reads were aligned to the DTO006G7 assembly using Minimap2 [60]. Only reads aligning once and that had a mapping quality > 60 were selected using samtools. The sequencing data generated with the MinION technology is deposited in the SRA archive under the following accession numbers: SRR17178875, SRR17178876 and SRR17178877.

Complete data set of colony growth measurements presented in Fig 1, all individual MIC values of graph presented in Fig 2, and all individual MIC values of graph presented in Fig 9.

(XLSX) Click here for additional data file.

Schematic clinker-style comparison of all genes present on the SORBUS cluster of the R-type strains (DTO006G1, DTO006G7, DTO012A1, DTO012A8, DTO013E5, DTO013F2).

Gene identity (%) is indicated in greyscale and (partial) scaffolds corresponding to scaffold 43 of DTO006G7 are listed in order below the strain names. (TIFF) Click here for additional data file.

Schematic overview of the kusA gene in DTO013F2.

Genes are indicated with blue arrows. The nanopore sequencing read coverage of DTO013F2 ΔkusA is indicated in green. The orange bar indicates the targeted knock-out region, flanked by the 5’ and 3’ ends (in lightblue). Dotted lines and scissors indicate the loci targeted by the sgRNA. (TIFF) Click here for additional data file.

Penicillium spp. used in this study.

(XLSX) Click here for additional data file.

Repetitive content annotations in the genome assembly of DTO006G7.

The first tab provides summary statistics of repeat coverage of the full assembly as well as the SORBUS cluster. The second tab provides the individual annotations of repetitive content. (XLSX) Click here for additional data file.

FPKM list as determined by DESeq2 of 9993 genes of P. roqueforti strain DTO377G3 and DTO006G7.

Strain DTO006G7 was used as reference, and the corresponding protein sequences are provided for each gene. (XLSX) Click here for additional data file.

Significantly differentially expressed genes between P. roqueforti strain DTO377G3 and DTO006G7.

(XLSX) Click here for additional data file.

Primers used for plasmid and donor DNA construction and diagnostic PCR.

(XLSX) Click here for additional data file.

Plasmids used in this study.

(XLSX) Click here for additional data file.
  54 in total

1.  De novo identification of repeat families in large genomes.

Authors:  Alkes L Price; Neil C Jones; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

Review 2.  Fungal spores: Highly variable and stress-resistant vehicles for distribution and spoilage.

Authors:  Jan Dijksterhuis
Journal:  Food Microbiol       Date:  2018-11-21       Impact factor: 5.516

3.  Clinker & clustermap.js: Automatic generation of gene cluster comparison figures.

Authors:  Cameron L M Gilchrist; Yit-Heng Chooi
Journal:  Bioinformatics       Date:  2021-01-18       Impact factor: 6.937

4.  Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation.

Authors:  Ivica Letunic; Peer Bork
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

5.  A CRISPR-Cas9 System for Genetic Engineering of Filamentous Fungi.

Authors:  Christina S Nødvig; Jakob B Nielsen; Martin E Kogle; Uffe H Mortensen
Journal:  PLoS One       Date:  2015-07-15       Impact factor: 3.240

6.  Emerging microbial concerns in food safety and new control measures.

Authors:  Moreno Bondi; Patrizia Messi; Prakash M Halami; Chrissanthy Papadopoulou; Simona de Niederhausern
Journal:  Biomed Res Int       Date:  2014-07-06       Impact factor: 3.411

7.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation.

Authors:  Sergey Koren; Brian P Walenz; Konstantin Berlin; Jason R Miller; Nicholas H Bergman; Adam M Phillippy
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

8.  Efficient marker free CRISPR/Cas9 genome editing for functional analysis of gene families in filamentous fungi.

Authors:  Tim M van Leeuwe; Mark Arentshorst; Tim Ernst; Ebru Alazi; Peter J Punt; Arthur F J Ram
Journal:  Fungal Biol Biotechnol       Date:  2019-09-21

9.  Ant-infecting Ophiocordyceps genomes reveal a high diversity of potential behavioral manipulation genes and a possible major role for enterotoxins.

Authors:  Charissa de Bekker; Robin A Ohm; Harry C Evans; Andreas Brachmann; David P Hughes
Journal:  Sci Rep       Date:  2017-10-02       Impact factor: 4.379

10.  Independent domestication events in the blue-cheese fungus Penicillium roqueforti.

Authors:  Emilie Dumas; Alice Feurtey; Ricardo C Rodríguez de la Vega; Stéphanie Le Prieur; Alodie Snirc; Monika Coton; Anne Thierry; Emmanuel Coton; Mélanie Le Piver; Daniel Roueyre; Jeanne Ropars; Antoine Branca; Tatiana Giraud
Journal:  Mol Ecol       Date:  2020-02-03       Impact factor: 6.185

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.