Literature DB >> 31755931

Evolution of Aspergillus oryzae before and after domestication inferred by large-scale comparative genomic analysis.

Naoki Watarai1, Nozomi Yamamoto1, Kazunori Sawada2, Takuji Yamada1.   

Abstract

Aspergillus oryzae is an industrially useful species, of which various strains have been identified; however, their genetic relationships remain unclear. A. oryzae was previously thought to be asexual and unable to undergo crossbreeding. However, recent studies revealed the sexual reproduction of Aspergillus flavus, a species closely related to A. oryzae. To investigate potential sexual reproduction in A. oryzae and evolutionary history among A. oryzae and A. flavus strains, we assembled 82 draft genomes of A. oryzae strains used practically. The phylogenetic tree of concatenated genes confirmed that A. oryzae was monophyletic and nested in one of the clades of A. flavus but formed several clades with different genomic structures. Our results suggest that A. oryzae strains have undergone multiple inter-genomic recombination events between A. oryzae ancestors, although sexual recombination among domesticated species did not appear to have occurred during the domestication process, at least in the past few decades. Through inter- and intra-cladal comparative analysis, we found that evolutionary pressure induced by the domestication of A. oryzae appears to selectively cause non-synonymous and gap mutations in genes involved in fermentation characteristics, as well as intra-genomic rearrangements, with the conservation of industrially useful catalytic enzyme-encoding genes.
© The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Entities:  

Keywords:  zzm321990 Aspergillus flavuszzm321990 ; zzm321990 Aspergillus oryzaezzm321990 ; comparative genomics; domestication

Mesh:

Year:  2019        PMID: 31755931      PMCID: PMC6993814          DOI: 10.1093/dnares/dsz024

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


Introduction

Aspergillus oryzae is an industrially important species mainly used in the manufacture of fermented foods in East Asia because of its strong amylase and protease activities. Particularly, in Japan, tane-koji (seed rice malt) manufacturers use various strains that are sold to companies producing fermented foods. These strains show diversity in colour and fermentation function and are managed for different applications such as in sake, miso (soybean paste), and shoyu (soy sauce). However, the relationship between the diversity of A. oryzae species and genetic factors remains unclear. In 2005, the whole genome of A. oryzae RIB40, a wild-type strain, was sequenced. Comparative genomic analysis of the whole genomes of Aspergillus nidulans and Aspergillus fumigatus revealed that the A. oryzae genome was 7–9 Mb larger., However, genes in newly acquired regions are only minimally expressed under normal conditions, and most of their functions remain unknown, particularly for genes not directly involved in fermentation. Aspergillus flavus and A. oryzae are genetically very closely related, with their genomes showing 99.5% similarity in coding regions, and numerous comparative analyses have been performed between these species. Aspergillus flavus is an important species linked to food safety, as some strains produce fungal toxins, particularly aflatoxin,, and it has historically been distinguished from A. oryzae based on morphological differences and toxicity., In addition, some A. oryzae strains contain all or parts of the aflatoxin biosynthetic gene cluster, although they are non-aflatoxigenic. Some researchers suggested that A. oryzae can be detoxified and differentiated from A. flavus by domestication.,, Based on the phylogenetic analysis of 11 genes, comparative analysis of the aflatoxin gene cluster,, and single-nucleotide polymorphism (SNP) analysis of the whole genome,A. oryzae was shown to form a monophyletic clade derived from one clade of A. flavus. Aspergillus oryzae and A. flavus have long been considered asexual species with no sexual reproduction cycle. However, recent studies of A. flavus revealed that sexual reproduction occurs in laboratory and field environments., Genome analysis also showed that the two species contain a nearly complete gene set necessary for sexual reproduction., All strains of A. oryzae and A. flavus possess one mating type (MAT type) locus in the genome, at which either MAT1-1 or MAT1-2 is encoded., However, complete sexual reproduction has not been confirmed in A. oryzae. Breeding currently carried out by tane-koji manufacturers utilizes a single strain with mutations or recombination, but crossbreeding has not been successful. Genome analysis suggested that recombination occurred between the ancestors of A. oryzae based on the linkage disequilibrium between MAT types and the phylogeny of a single gene. In this study, to uncover genomic diversity and evolutionary relationships among A. oryzae isolates, we acquired 82 industrial strains from five independent Japanese tane-koji manufacturers in different locations and conducted whole-genome sequencing to determine their draft genomes. For the classification of these strains, we performed orthologue clustering of predicted genes from each genome, phylogenetic tree inference of the chromosomal genome, and chromosome recombination analysis. Through these analyses, we hypothesized that A. oryzae strains have undergone multiple inter-genomic recombination events between A. oryzae ancestors, and that evolutionary pressure by A. oryzae domestication is extremely limited to intra-genomic mutations and rearrangements. Moreover, we identified genes that are mutated/duplicated/deleted within clades, which might reflect the fact that Japanese tane-koji manufacturers have passaged their strains to prevent changes in industrially useful traits in parallel with breeding.

Materials and methods

A full description of the methods, including software versions and parameters, is available in Supplementary Data (‘supplementary_methods.pdf’).

2.1. Sample collection and DNA preparation

For genomic sequencing, 82 A. oryzae and three Aspergillus sojae (as an out group) industrially used strains were collected from five independent tane-koji manufacturers in Japan (Supplementary Table S1). Tane-koji manufacturers have their own isolates and have not shared them for several decades. Whole genomic DNA was extracted using ‘Extraction method5’. Yatalase was used for some samples (Supplementary Table S1).

2.2. Genome sequencing and assembly

For genome assembly, fragmented genome libraries were prepared based on 350 bp (for run no. 1) and 550 bp (for run no. 2–5) on average and sequenced on an Illumina HiSeq2500 system using 150 bp (for run no. 1) and 250 bp (for run no. 2–5) paired-end runs. Quality filtering and assembly of the paired-end reads were performed with Platanus. The scaffolds aligned to bacterial genomes or the mitochondrial genome of RIB40 were removed. The reference primer sequences for the MAT type were mapped to the genome sequences with bowtie2.

2.3. Gene prediction and orthologous clustering

Next, 152 genomic scaffolds (85 from our samples and 67 from NCBI GenBank) of the newly sequenced or NCBI GenBank Aspergillus strains were used (Supplementary Table S2). Gene coding regions were predicted using two methods, namely, GeneMark-ES and AUGUSTUS for ab initio prediction, and GMAP for reference-based prediction, and these were combined with EVidenceModeler. The predicted protein sets were evaluated with BUSCO. Orthologue clustering was performed with OrthoFinder. Orthogroups (OGs) were annotated with GhostKOALA for protein function and InterProScan for protein motifs/domains.

2.4. Comparative genomics

Alignments of degapped gene sequences (DGSs) of single-copy OGs (SCGs), which were common to 152 protein sets, were generated with MAFFT and tandemly concatenated. Maximum likelihood-based phylogenetic inference was performed with RAxML. Similarly, a concatenated gene tree of 19 SCGs in the aflatoxin biosynthetic cluster was generated. To test the neutrality of the mutation, revised coding sequences were generated from the alignments by clade/species and synonymous, non-synonymous, and gap mutations were counted. Chromosomal duplications and deletions were inferred by direct read mapping to the RIB40 genome with bowtie2. Depths were calculated with samtools.

Results and discussion

3.1. Genome sequencing and assembly

The number of scaffolds (>1,000 bp) and total lengths were 36–256 and 35.9–38.66 Mb for A. oryzae and 45–71 and 40.0–40.1 Mb for A. sojae, respectively (Supplementary Table S2). The two draft genome sizes were not significantly different from those reported previously., All samples were sequenced at depths of ≥100, but the depths of some samples from runs no. 3, no. 4, and no. 5 showed relatively low average coverage of the final scaffolds because the genomic DNA of Actinobacteria (Corynebacterium) used to produce yatalase was contaminated.

3.2. Gene prediction and evaluation

Gene prediction for our sample showed that the number of predicted genes of A. oryzae and A. sojae were 11, 196–11, 716 and 13, 309–13, 317, respectively (Supplementary Table S2, column S). To confirm the accuracy of assembly and gene prediction, the predicted gene set and reference gene set were analyzed with BUSCO. In the complete genome of RIB40 (GCF_000184455.2), the score of the gene set was 98.6% in our prediction pipeline.

3.3. Orthologue clustering

By orthologue clustering, 15,614 OGs were created from 1,882,788 proteins, while 1,395 proteins became unclustered. Of these, 3,951 were common SCGs. We succeeded in the annotation of the KEGG KO and protein domains/motifs at 28.6% (4,465/15,614) and 88.0% (13,752/15,614) (Supplementary Table S3), and assigned locus IDs starting with ‘AO’ as defined previously to 13,006 OGs clustered with the A. oryzae RIB40 reference proteins (s01-m08-r29 or s01-m09-r06).

3.4. Comparative genomics

3.4.1. Phylogenetic tree with concatenated genes

The length of the sequence of the concatenated DGSs of 4,361 SCGs was 5,677,852 columns (bp), corresponding to approximately 15% of the total genome length. We confirmed that A. oryzae was monophyletic and nested in a clade of A. flavus (Fig. 1, original full figure: Supplementary Fig. S1). This is consistent with the results of previous studies., In contrast, some putative A. flavus strains (WRRL1519, NRRL35739, IFM54693, IFM57535, IFM59975, IFM60655, and 2017 Washington T4) were nested in the A. oryzae clade. However, WRRL1519, NRRL35739, and the IFM strains were confirmed as non-aflatoxigenic with A. oryzae-type aflatoxin gene clusters (also confirmed in this study). Thus, considering the location in the phylogenetic tree and toxigenicity, these strains were reclassified as A. oryzae.
Figure 1

Phylogenetic tree inferred by concatenated DGSs, focusing on Aspergillus oryzae and Aspergillus flavus. A–H: clade names for Japanese industrial strains defined in this study. AflaGuard, AF36, BP2-1, K, and SRCM are temporary clade names.

Phylogenetic tree inferred by concatenated DGSs, focusing on Aspergillus oryzae and Aspergillus flavus. A–H: clade names for Japanese industrial strains defined in this study. AflaGuard, AF36, BP2-1, K, and SRCM are temporary clade names. Industrial strains of A. oryzae formed several clades, showing an intra-cladal DGS dissimilarity within 0.01%. For further analysis, we defined clades A–H to which Japanese industrial strains of A. oryzae belong. Although koji manufacturers do not share their strains, interestingly, many strains clustered in the same clade. Particularly, clade E included several strains provided by four different tane-koji manufacturers, one strain collected in Thailand (BCC7051), and one strain as a clinical isolate from Japan. In contrast, some Chinese and Korean industrial strains, namely, AS3.951, 100.8/3.042 (China), BP2-1, and the two SRCM strains (Korea), formed a group with a relatively large distance from the Japanese industrial strains. These strains were very closely related to each other but should be classified into different clades because of their different MAT types. Industrial A. oryzae strains were not clustered based on their industrial uses. However, all strains in clades D, E, and H (and a single strain TK-29) were those used in shoyu production. Strains used for other purposes such as in miso, sake, sake/miso, mirin, or other products were mixed in the same clade, which is consistent with classification by appearance and enzymatic activity.

3.4.2. MAT type

Of the 152 genomic sequences, MAT regions were uniquely detected as either MAT1-1 or MAT1-2 from 146 sequences but not from six downloaded sequences (Fig. 1). We examined the presence/absence of genes in each strain and identified OG0012491/AO090020000089 as MAT1-1 and OG0012281 + OG0012282 as the MAT1-2 gene (these two were contiguous, and thus possibly merged into one gene). The identified region of MAT1-1 was consistent with that reported previously. The genes unique to each MAT type were only these MAT genes. All strains included in the same clade showed the same MAT type. Linkage disequilibrium was observed between the topology of the phylogenetic tree of concatenated genes and MAT type (e.g. A&B and G&H). This strongly suggests that clade divergence was caused not by mutation but rather by the recombination of different strains.

3.4.3. Putative genetic recombination

In addition to MAT type, as an example of linkage disequilibrium, we also found that the phylogenetic tree of the concatenated gene showed a different topology from those of individual genes. For example, on comparing the phylogenetic tree of ytk6/OG0002894/AO090023000584 to that of mdlB/OG0004275/AO090701000644, the combinations of clades with the same sequence were different (Supplementary Fig. S2-1A and B). In addition, the phylogenetic trees of adjacent genes showed a similar topology. For each strain, we identified the closest clade along the chromosomal positions (Supplementary Methods 2.4.3). As a result, the genomes of all clades of A. oryzae and A. flavus exhibited high mosaic structures; regardless of the clades, the closest clade differed depending on the chromosomal position. For example, TK-22 (as a representative strain of clade A) shared 46% of the exact same gene sequences with clade B, but the homologous genes were distributed in a mosaic manner (Fig. 2, unsmoothed figure: Supplementary Fig. S3-1). From the distribution of similarity scores, we can see that some non-syntenic regions against clade B could be syntenic regions of clade F (Supplementary Fig. S3-5). This means that clades closer on the concatenated gene tree had larger proportions of homologous regions, and the pattern of syntenic regions depended on the clade to be compared. We also detected locally exclusively homologous regions among distant clades (Supplementary Fig. S2-2). This suggests that clade divergence was caused by multiple recombination events in multiple strains, at least as many times as the number of clades, and not by passage mutation in only one ancestral strain.
Figure 2

Visualized chromosomal mixture of TK-22 of clade A. I–VIII represent chromosome numbers. Sequence depths are shown in the inner circle with black bars. Chromosomal mixture is shown in the outer circle with coloured bars, where heights represent similarity scores (percentage of identical gene sequences) and colours represent the closest clade.

Visualized chromosomal mixture of TK-22 of clade A. I–VIII represent chromosome numbers. Sequence depths are shown in the inner circle with black bars. Chromosomal mixture is shown in the outer circle with coloured bars, where heights represent similarity scores (percentage of identical gene sequences) and colours represent the closest clade. In the A. oryzae clade, most genome regions were represented as a mixture with other A. oryzae clades; particularly, some were highly homologous to the A. flavus clade and vice versa. Thus, we cannot completely exclude the possibility of recombination between two species after speciation. However, because the exclusively homologous regions between the two species were very small, the strain used in this study might have been insufficient. Because our samples were biased toward practically used strains, information about wild-type strains of A. oryzae is lacking. Thus, the genomic mosaicism shown in this study does not directly represent the frequency of recombination. By analyzing more strains, the mosaic structure might be simplified and recombination processes in the clades could be clarified. In addition, because our samples were human-managed strains, it might be possible to track whether hybridization occurred among them. However, none of the clades appeared to be expressed as a mixture of two other clades. In contrast, there were some A. flavus strains for which genomes were represented by a mixture of two or three other strains (Supplementary Fig. S3-2). Interestingly, the genome of A. oryzae TK-27, a strain maintained by a tane-koji manufacturer for more than six decades and the use of which started in the 2000s, had an unusual structure; 85% or more of the genome was homologous to clade G, to which the MAT types were also identical, while some regions were closer to those of the other clades (Supplementary Fig. S3-3). A Chinese strain, AS3.951, and the strains in clade K showed a similar pattern, although they had different MAT types (Supplementary Fig. S3-4). However, they were not a clear mixture of any strain pair. Based on these results, we consider that the domestication process influenced the evolution of A. oryzae mostly through rearrangements within a single genome, and rarely via sexual recombination.

3.5. Influence of domestication

3.5.1. Evolutionary hypothesis

Our results highlighted the fact that the ancestor of A. oryzae underwent multiple complex recombination events. In contrast, considering that no simple recombination mixtures between strains from the tane-koji manufacturer were observed and many strains belonged to the same clade with wild-type RIB40, clinical isolates, or the strain from Thailand, humans might have chosen strains from nature as suitable for brewing and maintained them without crossbreeding. Therefore, the influence of domestication on the evolution of A. oryzae likely appeared only after clade divergence, suggesting that the domestication process does not contribute to genetic recombination. In previous reports, vegetative compatibility group (VCG) divergence in A. flavus was estimated to have occurred 50,000–189,000 years ago. VCG is a self-identification system, and there are at least 13 VCGs in A. flavus. Considering that A. oryzae is monophyletic and nested in one of the clades of A. flavus based on the phylogenetic tree, and that A. oryzae is one type of VCG, the speciation of A. oryzae and A. flavus might have occurred contemporaneously. Domestication and industrial utilization of koji (rice malt) began in China over 3,000–2,000 years ago, and stocking and selling of these products began in Japan 700–500 years ago. Therefore, domestication likely began influencing the evolution of A. oryzae in the last 3,000 or 700 years after clade divergence (Fig. 3).
Figure 3

Hypothesis for the influence of domestication on the evolution of Aspergillus oryzae.

Hypothesis for the influence of domestication on the evolution of Aspergillus oryzae. In a previous study, mutational pressure was estimated by comparing the SNP frequencies of A. flavus and A. oryzae or by comparative genomics of RIB40 and RIB326. According to such studies, mutations tend to accumulate in non-synteny blocks (NSBs) and sub-telomeric regions. However, these estimates are considered to reflect the influence of selection pressure in nature during the period from species or clade divergence to domestication. Therefore, we focused on recently accumulated mutations or gene duplications/deletions by comparing them within each clade.

3.5.2. Intra-cladal gene variants

We calculated the numbers of inter- and intra-cladal mutations in coding sequences for 14,711 SCGs present in 101 strains with BUSCO scores of 96 or more (Supplementary Method 2.4.5). There were a few intra-cladal mutations; in total, 265 synonymous mutations, 528 non-synonymous mutations, and 93 gap mutations were found in 201, 453, and 70 genes, respectively (Supplementary Table S4, 1–3), of which synonymous mutations were not prevalent enough for neutrality tests within each gene (Supplementary Table S4-4). Remarkably, however, the odds of the total number of intra-cladal non-synonymous/synonymous mutations were significantly higher than those of inter-cladal mutations in A. oryzae and much higher than those of intra-species mutations in A. flavus, with a similar tendency for gap/synonymous mutations (Table 1). This tendency was almost the same between synteny blocks (SB) and NSBs with A. nidulans/A. fumigatus (Supplementary Table S4-5).
Table 1

Total number of mutations in coding sequences of Aspergillus oryzae and Aspergillus flavus

Synonymous mutationsNon-synonymous mutationsGap mutations
A. oryzae Intra-cladeCounts26552893
Odds2.9*20.8*
Inter-cladeCounts76,54366,0011,795
Odds1.3*1.4*
A. flavus Counts79,06853,9221,336
Odds(1.0)(1.0)

P < 1.0E-10.

Total number of mutations in coding sequences of Aspergillus oryzae and Aspergillus flavus P < 1.0E-10. These results suggest that the influence of domestication, reflected by intra-cladal mutations and somewhat by inter-cladal mutations, causes the accumulation of non-neutral mutations changing gene functionality, such as loss of function. Even though there were no genes with sufficient number of inter-cladal mutations for the calculation of odds or P values (Supplementary Table S4-4), we found 1–4 intra-cladal non-synonymous or gap mutations among all the named genes (Table 2). For example, both intra-cladal non-synonymous and gap mutations were found in two annotated genes, specifically wA/OG0003175/AO090102000545 and laeA/OG0001970/AO0900030000489. A. oryzae wA is an industrially important polyketide synthase gene, an orthologue of A. nidulans wA, which is required for the synthesis of a green pigment. The down-regulation of the A. oryzae wA gene leads to the production of white conidia. Non-synonymous or gap mutations in wA were only found in some white mutants of clade A, C, and G used for miso production wherein the control of product color is important. laeA is a global regulator of secondary metabolism in Aspergillus, and it is also involved in the production of kojic acid.
Table 2

Annotated genes with intra-cladal non-synonymous/gap mutations

RIB40_s01-m09-r06SymbolAnnotated features
Genes with intra-cladal non-synonymous mutations
AO090003001208 amyR Regulatory protein that indirectly affects the production of hemicellulolytic and cellulolytic enzymes, likely through carbon catabolite repression-mediated control
AO090003000489 laeA Methyltransferase; global transcriptional regulator of secondary metabolic gene clusters; required for kojic acid gene regulation and biosynthesis
AO090009000638 steA Orthologue of Ste12p with a predicted role in the regulation of transcription
AO090001000237 veA Orthologue of Aspergillus nidulans VeA, a global gene regulator involved in light-sensitive control of differentiation and secondary metabolism; positively regulates penicillin production in Aspergillus oryzae
AO090003000491 rpbA Predicted RNA polymerase II largest subunit; has 25 repeats in its C-terminal domain
AO090026000360 gprD Family A G-protein coupled receptor (GPCR)-like
AO090001000439 schA Ser/Thr protein kinase related to the PKA catalytic subunit
AO090001000512 cyaA Adenylate cyclase
AO090003001305 aglB Putative alpha-galactosidase; expression altered by manR disruption but not by ManR overexpression
AO090003001507 tglA Triacylglycerol lipase with a role in the degradation of triglycerides
AO090001000445 CYP505A3 Cytochrome P450 monooxygenase
AO090005000070 CYP620H3 Cytochrome P450 monooxygenase; involved in 7-hydroxycoumarin production
AO090012000465 CYP620H9 Cytochrome P450 monooxygenase; involved in 7-hydroxycoumarin production
AO090038000488 csyC Putative type III polyketide synthase
AO090011000926 dffA l-Ornithine N5-oxygenase; enzyme required for the biosynthesis of an iron-chelating compound, deferriferrichrysin; siderophore biosynthesis
AO090001000009 wykN Non-ribosomal peptide synthase (NRPS) involved in the synthesis of a dipeptidyl peptidase IV 2 inhibitor
AO090102000632 can1 Has domain(s) with predicted role in amino acid transport, transmembrane transport, integral component of membranes, membrane localization
AO090103000127 bglF Secretory aryl beta-glucosidase
AO090003001144 vti1 T-SNARE
AO090701000589 chsB Chitin synthase; required for normal hyphal growth and conidiation
AO090026000337 sec31 Vesicle coat complex COPII, subunit; expression increased in MAT1-2 strain compared with that in MAT1-1 strain
AO090102000545 wA Hydroquinone: oxygen oxidoreductase; orthologues have a role in asexual spore wall assembly, melanin biosynthesis, pathogenesis, and pigment metabolism
Genes with intra-cladal gap mutation
AO090102000545 wA (Above)
AO090003000489 laeA (Above)
AO090009000612 amdA Sequence-specific DNA-binding transcription factor
Annotated genes with intra-cladal non-synonymous/gap mutations Interestingly, a few mutations in catabolic enzymes were considered industrially useful; moreover, no intra-cladal non-synonymous or gap mutations were found in known proteolytic enzyme-encoding genes. In contrast, there were several non-synonymous/gap mutations in genes involved in expression, signaling, secondary metabolites, secretion/transporters, and cell traits (Table 2). Therefore, the domestication process might have contributed to altering the traits of strains while maintaining the activity of industrially useful catalytic enzyme-encoding genes by applying selective pressure to those peripheral genes rather than enzyme-encoding genes. This might reflect the fact that Japanese tane-koji manufacturers have continued to passage their strains to prevent changes in their traits in parallel with breeding.

3.5.3. Intra-cladal gene duplication

We estimated gene duplication/deletion by direct read mapping and calculation of the normalized depth, equal to the copy number of genes (Supplementary Table S5). As a result, we found 221 OGs (117 SCGs) with different estimated copy numbers in at least one clade and 179 OGs (20 SCGs) in more than two clades. We performed statistical analysis on KEGG BRITE annotation but found no significant feature (number of total detection > 1, P = 0.05, Fisher’s exact test), suggesting that selection pressure for gene duplication is not explained by gene function. The estimated copy numbers of OG0000041/PF03221/Tc5 transposase and OG0000321/PF14529/endonuclease-reverse transcriptase, which are transposon-derived genes, had changed more than twice as compared with that in RIB40; in particular, the clade B and G strains showed 15–20 times the estimated copy number. Furthermore, rRNA genes exhibited a wide range in estimated copy number (e.g. it varied 13–48-fold among eight strains of clade C without bacterial contamination from yatalase). However, the selective pressure on rRNA gene duplication is hard to estimate, because a previous report showed strain-dependent copy number variation in rRNA genes in A. fumigatus by quantitative PCR, suggesting that the duplication of rRNA genes is likely to occur in the natural environment. The estimated copy number of the tRNA gene also tended to vary, and intra-cladal change was observed at 102/276 tRNA gene loci. As a general trend, changes in the sequence depth were more frequent in the sub-telomeric region (Supplementary Fig. S4-1, Chr. I, Fig. S4-2). Change in depth was also observed in the sub-centromeric regions, which represents changes in the number of non-coding repeats in the unassembled region. In contrast, in some strains, a 60–70-kbp region containing tRNA genes (e.g. Supplementary Fig. S4-1, Chr. III/V) was duplicated, which might have drastically altered transcription or translation. Three copies of α-amylase (amyA/amyB/amyC/OG0011956) have been detected in RIB40. We found both intra-cladal and inter-cladal variation in the duplication number of OG0011956 (α-amylase) as follows: three to four copies in clade A, two to three copies in clade D/E, two copies in clade B, and four copies in clade G. Similarly, we found one copy in TK-24/TK-29 and four copies in TK-27. Moreover, the A. sojae strains TK-83, TK-84, and TK-85 had one copy, which is consistent with previous studies.,

3.5.4. Aflatoxin biosynthetic gene cluster

The types of aflatoxin biosynthetic gene clusters in A. oryzae, including those in all of our samples (confirmed as non-aflatoxigenic) and strains estimated to be A. oryzae in this study, were classified into three groups as defined in a previous study. Moreover, we found that Kusumoto Group 3 was nested in Group 2, while Group 1 was located far from the other two (Supplementary Fig. S5). Aflatoxin cluster sequences were not clustered by their toxigenicity, suggesting that detoxification in A. oryzae/A. flavus had occurred in parallel. This is consistent with a previous report inferring that the selective pressure against toxins was lost in nature. Because the non-toxicity of A. oryzae is phylogenetically guaranteed, tane-koji manufacturers might have distinguished A. oryzae from A. flavus based on their growth ability on rice and incidentally selected aflatoxicity.

3.6. Concluding remarks

We acquired 82 industrial strains from five Japanese tane-koji manufacturers and conducted whole-genome sequencing to determine their draft genomes. Through phylogenetic tree-based inferences of the chromosomal genome and chromosome recombination analysis, we showed that A. oryzae strains have undergone multiple inter-genomic recombination events between A. oryzae ancestors. However, sexual recombination among domesticated species did not appear to have occurred during the domestication process, at least in the past few decades; therefore, we hypothesized that evolutionary pressure introduced by the domestication of A. oryzae is extremely limited to intra-genomic mutation and rearrangements. Through intra- and inter-cladal comparative analysis, we showed that the evolutionary pressure of domestication selectively caused non-synonymous and gap mutations and intra-genomic recombination. Our results suggest that the domestication process might have contributed to altering strain traits while maintaining the activity of industrially useful catalytic enzyme genes by applying selective pressure to peripheral genes involved in fermentation rather than the enzyme-encoding genes themselves. Our study provides suggestions on the relationship between the evolution and domestication of A. oryzae, and importantly, the whole genomic data and phylogenetic tree will help to develop breeding methods based on sexual reproduction using industrial strains. Click here for additional data file.
  44 in total

1.  The phylogenetics of mycotoxin and sclerotium production in Aspergillus flavus and Aspergillus oryzae.

Authors:  D M Geiser; J W Dorner; B W Horn; J W Taylor
Journal:  Fungal Genet Biol       Date:  2000-12       Impact factor: 3.495

2.  LaeA, a regulator of secondary metabolism in Aspergillus spp.

Authors:  Jin Woo Bok; Nancy P Keller
Journal:  Eukaryot Cell       Date:  2004-04

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

4.  Cryptic speciation and recombination in the aflatoxin-producing fungus Aspergillus flavus.

Authors:  D M Geiser; J I Pitt; J W Taylor
Journal:  Proc Natl Acad Sci U S A       Date:  1998-01-06       Impact factor: 11.205

5.  Aspergillus oryzae laeA regulates kojic acid synthesis genes.

Authors:  Ken Oda; Akiko Kobayashi; Shinichi Ohashi; Motoaki Sano
Journal:  Biosci Biotechnol Biochem       Date:  2011-09-07       Impact factor: 2.043

6.  Genome Characterization of Oleaginous Aspergillus oryzae BCC7051: A Potential Fungal-Based Platform for Lipid Production.

Authors:  Chinae Thammarongtham; Intawat Nookaew; Tayvich Vorapreeda; Tanawut Srisuk; Miriam L Land; Sukanya Jeennor; Kobkul Laoteng
Journal:  Curr Microbiol       Date:  2017-09-01       Impact factor: 2.188

7.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

8.  Sexual reproduction in Aspergillus flavus.

Authors:  Bruce W Horn; Geromy G Moore; Ignazio Carbone
Journal:  Mycologia       Date:  2009 May-Jun       Impact factor: 2.696

9.  Genome sequence and comparative analyses of atoxigenic Aspergillus flavus WRRL 1519.

Authors:  Guohua Yin; Sui Sheng T Hua; Kayla K Pennerman; Jiujiang Yu; Lijing Bu; Richard T Sayre; Joan W Bennett
Journal:  Mycologia       Date:  2018-07-03       Impact factor: 2.696

Review 10.  Genomics of Aspergillus oryzae.

Authors:  Tetsuo Kobayashi; Keietsu Abe; Kiyoshi Asai; Katsuya Gomi; Praveen Rao Juvvadi; Masashi Kato; Katsuhiko Kitamoto; Michio Takeuchi; Masayuki Machida
Journal:  Biosci Biotechnol Biochem       Date:  2007-03-07       Impact factor: 2.043

View more
  3 in total

Review 1.  Aspergillus Hydrophobins: Physicochemical Properties, Biochemical Properties, and Functions in Solid Polymer Degradation.

Authors:  Takumi Tanaka; Yuki Terauchi; Akira Yoshimi; Keietsu Abe
Journal:  Microorganisms       Date:  2022-07-25

2.  Identifying candidate Aspergillus pathogenicity factors by annotation frequency.

Authors:  Kayla K Pennerman; Guohua Yin; Anthony E Glenn; Joan W Bennett
Journal:  BMC Microbiol       Date:  2020-11-11       Impact factor: 3.605

3.  Comparison of Two Aspergillus oryzae Genomes From Different Clades Reveals Independent Evolution of Alpha-Amylase Duplication, Variation in Secondary Metabolism Genes, and Differences in Primary Metabolism.

Authors:  Katherine Chacón-Vargas; Colin O McCarthy; Dasol Choi; Long Wang; Jae-Hyuk Yu; John G Gibbons
Journal:  Front Microbiol       Date:  2021-07-13       Impact factor: 5.640

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.