| Literature DB >> 34568928 |
Takeshi Kashiwa1, Tomohiro Suzuki2.
Abstract
Plant diseases caused by the Cercospora genus of ascomycete fungi are a major concern for commercial agricultural practices. Several Cercospora species can affect soybeans, such as Cercospora kikuchii which causes soybean leaf blight. Speciation in Cercospora on soybean has not been adequately studied. Some cryptic groups of Cercospora also cause diseases on soybean. Moreover, it has been known C. kikuchii population genetic structure is different between countries. Consequently, further genomic information could help to elucidate the covert differentiation of Cercospora diseases in soybean. Here, we report for the first time, a chromosome-level genome assembly for C. kikuchii. The genome assembly of 9 contigs was 34.44 Mb and the N50 was 4.19 Mb. Based on ab initio gene prediction, several candidates for pathogenicity-related genes, including 242 genes for putative effectors, 55 secondary metabolite gene clusters, and 399 carbohydrate-active enzyme genes were identified. The genome sequence and the features described in this study provide a solid foundation for comparative and evolutionary genomic analysis for Cercospora species that cause soybean diseases worldwide.Entities:
Keywords: zzm321990 Cercospora kikuchiizzm321990 ; Cercospora leaf blight; purple seed stain; soybean
Mesh:
Year: 2021 PMID: 34568928 PMCID: PMC8496228 DOI: 10.1093/g3journal/jkab277
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Summary of the genome assembly
| Features | Value |
|---|---|
| Isolate name | MAFF 305040 |
| Culture collection | NARO Genebank, Japan |
| Origin of isolate | Kagoshima, Japan |
| Assembly size (bp) | 34,440,063 |
| Number of contigs | 9 |
| Maximum contig length (bp) | 5,855,908 |
| N50 of contigs | 4,190,438 |
| GC content (%) | 53.0 |
| BUSCO completeness (%) | 99.4 |
| Predicted genes | 13,001 |
| Genome accession | BOLY00000000 |
Data obtained from NARO Genebank (https://www.gene.affrc.go.jp, accessed on August 31, 2020).
DDBJ/EMBL/GenBank accession number for the assembled contigs.
Candidates for pathogenicity-related genes and gene clusters
| Features | Number |
|---|---|
| Effector candidates | 242 |
| SM gene clusters | 55 |
| Polyketide | 15 |
| Nonribosomal peptide | 22 |
| Terpene | 6 |
| Beta-lactone | 1 |
| Siderophore | 2 |
| Fungal-RiPP | 1 |
| Others | 8 |
| CAZymes | 399 |
|
GH | 220 |
| GT | 86 |
| CBM | 10 |
| AA | 66 |
| CE | 19 |
| PL | 7 |
Genes encoding effector candidates predicted by EffectorP.
Numbers of SM gene clusters for each metabolite type predicted by antiSMASH. Fungal-RiPP, fungal ribosomally synthesized and post-translationally modified peptide.
Genes encoding carbohydrate-activate enzymes (CAZymes) predicted by dbCAN. Some genes were annotated with more than one category. GH, glycoside hydrolase; GT, glycosyltransferase; CBM, carbohydrate-binding module; AA, auxiliary activity; CE, carbohydrate esterase; and, PL, polysaccharide lyase
Figure 1Circos plot of MAFF 305040 genome. Tracks indicates: (A) Contigs of MAFF 305040, minor ticks indicate 0.1 Mb. Numbers for tracks indicate identifier for contigs (1, contig_00001; 2, contig_00002; 3, contig_00003; 4, contig_00004; 5, contig_00005; 6, contig_00006; 7, contig_00007; 8, contig_00008; and 9, contig_00009); (B) Positions of the genes encoding effector candidates (blue); (C) Positions of the SM gene clusters. Colors indicate predicted SM type of the cluster: orange, polyketide; green, nonribosomal peptide; red, siderophore; purple, terpene and beta-lactone; and dark blue, ribosomally synthesized and post-translationally modified peptide. Other clusters are colored blue. (D) Positions of the genes encoding CAZymes (red).
Figure 2Number of genes in the C. kikuchii MAFF 305040 genome and C. sojina Race 15 genome for each CAZyme category. GH, glycoside hydrolase; GT, glycosyltransferase; CBM, carbohydrate-binding module; AA, auxiliary activity; CE, carbohydrate esterase; PL, polysaccharide lyase.
Number of genes related to GH families identified from C. kikuchii MAFF 305040 and C. sojina Race 15
|
| MAFF 305040 | Race 15 |
|
|---|---|---|---|
| GH64 | 4 | 4 | CW (β-1,3-glucan) |
| GH1 | 3 | 3 | CW (β-glycans) |
| GH2 | 6 | 6 | CW (β-glycans) |
| GH3* | 17 | 16 | CW (β-glycans) |
| GH5* | 14 | 13 | CW (β-glycans) |
| GH32 | 3 | 3 | ESR (sucrose/inulin) |
| GH37 | 2 | 2 | ESR (trehalose) |
| GH65 | 1 | 1 | ESR (trehalose) |
| GH15* | 2 | 1 | ESR (α-glucans) |
| GH30 | 2 | 2 | FCW |
| GH85 | 1 | 1 | FCW |
| GH18* | 6 | 5 | FCW (chitin) |
| GH20 | 1 | 1 | FCW (chitin) |
| GH76 | 9 | 9 | FCW (chitin) |
| GH17 | 4 | 5 | FCW (β-1,3-glucan) |
| GH55 | 5 | 5 | FCW (β-1,3-glucan) |
| GH71 | 2 | 2 | FCW (β-1,3-glucan) |
| GH72 | 6 | 6 | FCW (β-1,3-glucan) |
| GH81 | 1 | 1 | FCW (β-1,3-glucan) |
| GH16* | 11 | 9 | FCW (β-glycans) |
| GH13 | 15 | 15 | FCW + ESR (α-glucans) |
| GH7 | 1 | 1 | PCW (cellulose) |
| GH12 | 1 | 1 | PCW (cellulose) |
| GH10* | 4 | 3 | PCW (hemicellulose) |
| GH11 | 3 | 3 | PCW (hemicellulose) |
| GH27 | 2 | 3 | PCW (hemicellulose) |
| GH29* | 3 | 2 | PCW (hemicellulose) |
| GH35* | 2 | 1 | PCW (hemicellulose) |
| GH36* | 2 | 1 | PCW (hemicellulose) |
| GH39 | 0 | 1 | PCW (hemicellulose) |
| GH51* | 3 | 2 | PCW (hemicellulose) |
| GH53 | 1 | 1 | PCW (hemicellulose) |
| GH54 | 1 | 1 | PCW (hemicellulose) |
| GH62 | 1 | 1 | PCW (hemicellulose) |
| GH67 | 1 | 1 | PCW (hemicellulose) |
| GH93* | 1 | 0 | PCW (hemicellulose) |
| GH115 | 1 | 2 | PCW (hemicellulose) |
| GH43* | 12 | 10 | PCW (pectin + hemicellulose) |
| GH28 | 5 | 5 | PCW (pectin) |
| GH78* | 3 | 2 | PCW (pectin) |
| GH88* | 2 | 1 | PCW (pectin) |
| GH105 | 3 | 3 | PCW (pectin) |
| GH38 | 1 | 1 | PG (N-/O-glycans) |
| GH47 | 9 | 9 | PG (N-/O-glycans) |
| GH63 | 1 | 1 | PG (N-glycans) |
| GH125 | 3 | 3 | PG (N-glycans) |
| GH31* | 9 | 8 | PG + ESR + PCW (hemicellulose) |
| GH33 | 1 | 1 | NA |
| GH42 | 1 | 1 | NA |
| GH79 | 3 | 4 | NA |
| GH92 | 7 | 7 | NA |
| GH95* | 2 | 0 | NA |
| GH97* | 1 | 0 | NA |
| GH106 | 1 | 1 | NA |
| GH114 | 1 | 1 | NA |
| GH127 | 1 | 1 | NA |
| GH128* | 2 | 1 | NA |
| GH130 | 1 | 1 | NA |
| GH131 | 1 | 1 | NA |
| GH132 | 1 | 1 | NA |
| GH135 | 1 | 2 | NA |
| GH139* | 1 | 0 | NA |
| GH141* | 1 | 0 | NA |
| GH142 | 1 | 1 | NA |
| GH152 | 1 | 1 | NA |
| GH154* | 2 | 1 | NA |
Asterisk indicates that the number of genes identified from the C. kikuchii MAFF 305040 was greater than the number identified from C. sojina Race 15.
Information on the substrate was obtained from Zhao . CW, cell wall; ESR, energy storage and recovery; FCW, fungal cell wall; PCW, plant cell wall; PG, protein glycosylation; NA, not assigned.