| Literature DB >> 31873088 |
Naoki Yamamoto1,2, Yanran Wang1,2, Runmao Lin1,2,3, Yueyang Liang1,4, Yao Liu1,2, Jun Zhu1,4, Lingxia Wang1,4, Shiquan Wang1,4, Huainian Liu1,4, Qiming Deng1,4, Shuangcheng Li1,4, Ping Li1,4, Aiping Zheng5,6.
Abstract
Rhizoctonia solani is a fungal species complex that causes necrotrophic crop diseases. It comprises several anastomosis groups, some of which include intra-subgroups, such as AG-1 IA and AG-1 IB, exhibiting varying pathogenicity. Owing to its heterozygous and multinucleate features, genomic analyses of R. solani are still challenging, and understanding of its genetic diversity and genic components is limited. In this study, in order to elucidate the molecular basis of this phytopathogen complex, an integrated transcriptome analysis was undertaken for three subgroups of AG-1, i.e. AG-1 IA, AG-1 IB, and AG-1 IC. Sequence variations suggested substantial evolutionary distances within AG-1. Transcript simple sequence repeats showed comparable characteristics among AG-1, but contained polymorphic sites. Intra-subgroup polymorphisms suggested varying genic heterozygosity within AG-1, suggesting their independent evolutionary trajectory. Sequences of pathogenic factors, phytotoxin biosynthesis pathway enzymes, secreted lignocellulosic enzymes, secreted reactive oxygen species detoxification enzymes, apoplastic/cytoplasmic effector candidates, were conserved among those subgroups. dN/dS ratios of a secretome subset suggested core secreted proteins in AG-1 and distinct evolution of Cys-rich small secreted proteins after differentiation of AG-1 subgroups. Identification of likely pathogenic factors including allergen protein homologues, oxidative phosphorylation and ethylene biosynthesis pathways, and diversification of polysaccharide monooxygenases provides molecular insight into key genomic components that play a role in R. solani pathogenesis.Entities:
Mesh:
Year: 2019 PMID: 31873088 PMCID: PMC6928066 DOI: 10.1038/s41598-019-55734-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Data statistics of the transcriptome sequencing in AG 1 subgroups.
| IA-TCs | IB-TCs | IC-TCs | IA-STCs | |
|---|---|---|---|---|
| Number of paired-end reads | 6,367,931 | 6,038,800 | 5,104,151 | 35,555,860 |
| Number of contigs | 22,845 | 21,905 | 19,719 | 43,553 |
| Total nucleotides in contigs (bp) | 19,657,497 | 20,550,584 | 16,395,799 | 63,798,492 |
| N50 length of contigs (bp) | 1,336 | 1,397 | 1,199 | 2,424 |
| Average length of contigs (bp) | 861 | 938 | 832 | 1,465 |
| GC content (%) | 50.8 | 51.4 | 51.3 | 50.3 |
| Minimum length of contigs (bp) | 201 | 201 | 201 | 201 |
| Maximum length of contigs (bp) | 12,071 | 12,107 | 9,686 | 13,306 |
| BUSCO (fungi) | 273/290 | 283/290 | 282/290 | 289/290 |
| BUSCO (basidiomycetes) | 1,121/1,335 | 1,215/1,335 | 1,205/1,335 | 1,320/1,335 |
*Number of present genes/all conserved common genes tested by BUSCO.
Figure 1Distribution of SSRs on the mycelial transcriptome contigs. The SSRs with 2–10 bases of units were shown. Extend: SSRs extending between non-coding and coding sequences, Non-coding: SSRs in non-coding sequence, Coding: SSRs in coding sequence.
Figure 2Distribution of PHI-base accessions of which sequences showed homologies with predicted proteins in the three AG-1 strains. Numerals represent numbers of PHI-base accessions. (A) IA-TCs, (B) IB-TCs, (C) IC-TCs, (D) IA-STCs, and (E) the AG-1 IA reference genome.
Figure 3Candidate genes on the biosynthetic pathway for PAA production. Identifiers of transcriptome contigs of IA-TCs, IB-TCs, and IC-TCs were shown in boxes. Expression patterns of candidate genes in IA-STCs (RPKM in log10-scale) for this pathway were represented in heatmaps (numerals indicate hours after inoculation onto rice). IA-STC_DN15809_c0_g1_i2 could be annotated as shikimate kinase or EPSP synthase based on the result of the pfam domain search.
Figure 4Relationship of protein sequence similarity and dN/dS ratios of 123 putative orthologous secreted proteins among AG-1. This protein set was based on the search by cd-hit-2d, a subtool of CD-HIT[64]. The horizontal axis and the vertical axis represent protein sequence identify and dN/dS, respectively. White and gray circles indicate pair-wise comparisons of non-small (no short than 300 amino acids) secreted proteins between AG-1 IA and AG-1 IB and AG-1 IA and AG-1 IC, respectively. White and gray triangles indicate pair-wise comparisons of small secreted proteins between AG-1 IA and AG-1 IB and AG-1 IA and AG-1 IC, respectively.
Figure 5Graphical representation of differentially regulated genes by inoculation of the AG-1 IA strain. The heat map represents expression patterns of genes in log10-scale. The expression patterns were grouped into five clusters using the complete linkage of Pearson correlation coefficient. Black boxes in front of the contig identifiers indicate probable secreted protein gene transcripts. Short functional annotations of the contigs were shown in the right of the contig identifiers. Dot boxes in yellow indicate up-regulated clusters.
Figure 6Structure and gene expression levels of AA9 in the up-regulated cluster of 48 hours after inoculation. (A) graphical representation of AA9 proteins and RPKM. Asterisks with the contig identifier indicate that part of coding sequences were recovered from the AG-1 IA draft genome sequence. (B) Partial sequence alignment of AA9 proteins. The N-terminus and protein domain for glycosyl hydrolase family 61 were shown. Asterisks indicate completely identical amino acid residues among the AA9 proteins.