| Literature DB >> 33811222 |
Alexandre Perochon1, Harriet R Benbow1, Katarzyna Ślęczka-Brady1, Keshav B Malla1, Fiona M Doohan2.
Abstract
There is increasing evidence that some functionally related, co-expressed genes cluster within eukaryotic genomes. We present a novel pipeline that delineates such eukaryotic gene clusters. Using this tool for bread wheat, we uncovered 44 clusters of genes that are responsive to the fungal pathogen Fusarium graminearum. As expected, these Fusarium-responsive gene clusters (FRGCs) included metabolic gene clusters, many of which are associated with disease resistance, but hitherto not described for wheat. However, the majority of the FRGCs are non-metabolic, many of which contain clusters of paralogues, including those implicated in plant disease responses, such as glutathione transferases, MAP kinases, and germin-like proteins. 20 of the FRGCs encode nonhomologous, non-metabolic genes (including defence-related genes). One of these clusters includes the characterised Fusarium resistance orphan gene, TaFROG. Eight of the FRGCs map within 6 FHB resistance loci. One small QTL on chromosome 7D (4.7 Mb) encodes eight Fusarium-responsive genes, five of which are within a FRGC. This study provides a new tool to identify genomic regions enriched in genes responsive to specific traits of interest and applied herein it highlighted gene families, genetic loci and biological pathways of importance in the response of wheat to disease.Entities:
Year: 2021 PMID: 33811222 PMCID: PMC8018971 DOI: 10.1038/s41598-021-86362-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Genome mining of stress-responsive gene clusters. Differential expression analysis was used to identify stress-responsive genes from transcriptomics data. Using chromosomal coordinates of genes from a reference genome, the average number of differentially expressed genes (DEGs) within a sliding window of a user-defined size (hereby called gene density), and the length of a string of consecutive DEGs was calculated. Shuffled data sets were also analysed to determine what values of gene density and consecutiveness can be considered non-random. Clusters of genes that breach this threshold were designated as significantly non-random gene clusters, and their expression profiles are used for co-expression analysis.
Figure 2Chromosomal distribution of Fusarium-responsive gene clusters (FRGCs). (a) Fusarium-responsive gene density along each wheat chromosome. A sliding window of 10 genes was used to screen each chromosome and calculate the Fusarium-responsive gene density. A significance threshold (dashed line) was calculated by 10,000 random permutations of the data and enriched Fusarium-responsive gene loci (those that passed the threshold) were identified and are represented in red vermillion. (b) Number of FRGCs and (c) total number of high confidence genes are shown in bar diagrams with their distribution in different chromosomal zone such as distal telomeric (short arm (R1) and long arm (R3)), interstitial regions (short arm (R2a) and long arm (R2b)) and centromere regions (C), according to[17].
Fusarium-responsive gene cluster categories and contents.
| Gene cluster category | Chromosome | Proteins within the cluster | Cluster ID |
|---|---|---|---|
| Nonhomologous, non-metabolic | 1A | Embryogenesis-associated protein EMB8, subtilisin-like protease, kinase family protein, transmembrane protein, putative DUF1218 proteins | 2 |
| 1B, 1D | Receptor-like kinases, glucan endo-1,3-beta-glucosidase, cysteine-rich receptor kinase | 3, 6 | |
| 2B | Receptor-like kinases, glucan endo-1,3-beta-glucosidase, receptor kinase, cysteine-rich receptor kinase | 12 | |
| 2D | Nucleotide-binding leucine-rich repeats, receptor kinase, plastid transcriptionally active 5 | 16 | |
| 2D | B3 domain-containing protein, RING/U-box superfamily proteins, 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein | 17 | |
| 2D | Blue copper proteins, FAD-binding Berberine family protein-putative, Ethylene-responsive transcription factor-putative | 19 | |
| 3D | Lectin receptor kinases, FMN-dependent NADPH-azoreductases, pyruvate decarboxylase, transcription factor, sugar transporter-putative | 20 | |
| 3D | Glutathione S-transferases, DNAse I-like superfamily protein, SAUR-like auxin-responsive family proteins, nucleotide-binding leucine-rich repeat | 21 | |
| 4A, 7D | Lectin receptor kinases, cytochrome P450, ABC transporter (Leaf rust resistance gene 34 (LR34)), sugar transporter-putative | 23, 40 | |
| 4A | Carbonic anhydrases, Coatomer beta subunit, putative, Fatty acyl-CoA reductase, Peroxidase | 24 | |
| 4D | Fusarium resistance orphan protein (TaFROG), RZ53, lectin receptor kinase, alcohol dehydrogenases, SWAP (Suppressor-of-White-APricot)/surp domain-containing protein, carboxypeptidase | 26 | |
| 5A, 5B | MYB transcription factor, histone mono-ubiquitination 2, receptor kinase, alcohol dehydrogenases, putative, TSL-kinase interacting protein 1 | 28, 31 | |
| 5D | Kinases putative, cysteine-rich receptor-kinase-like protein, wall-associated receptor kinase-like proteins | 35 | |
| 6D | Receptor-like kinases, TMV response-related proteins | 39 | |
| 7D | Protein EXORDIUM-like 1, glycosyltransferase, RNA-binding protein | 41 | |
| 7D | J-type co-chaperone jac1, mitochondrial, processing peptidase, TMV response-related proteins | 42 | |
| 7D | Magnesium transporter NIPA, eukaryotic aspartyl protease proteins, 1-amino-cyclopropane-1-carboxylate oxidase 1, DUF668 family protein, chaperone protein dnaJs | 43 | |
| Nonhomologous, metabolic | 2A, 2B, 2D, Un | Cytochrome P450s, kaurene synthases, copalyl diphosphate synthases, UDP-dependent glycosyltransferases, vacuolar sorting receptor family protein, wall-associated receptor kinase-like protein, receptor-like protein kinase | 8, 13, 15, 44 |
| 2B | Cytochrome P450s, wall-associated receptor kinase | 11 | |
| 5A, 5D | Cytochrome P450s, hydroxysteroid dehydrogenases, terpene cyclase/mutases family member, receptor-like protein kinase, always EARLY-like protein, obtusifoliol 14-alpha demethylase | 27, 34 | |
| 5D | O-methyltransferases, cytochrome P450, chalcone synthases, chalcone-flavonone isomerase family, leucine rich repeat family protein | 36 | |
| Paralogous, metabolic | 1D | 7 | |
| 2A | 9 | ||
| 2A | 10 | ||
| 5B | 32 | ||
| Paralogous, non-metabolic | 1A, 1D | 1, 4 | |
| 1D | 5 | ||
| 2B, 2D | 14, 18 | ||
| 3D | 22 | ||
| 4D | 25 | ||
| 5A | 29 | ||
| 5A, 5B | 30, 33 | ||
| 6A, 6B | 37, 38 |
Encoded proteins by Fusarium-responsive gene within each cluster are listed. Paralogous proteins that represented ≥ 60% of the total protein content of the cluster are highlighted in bold.
Figure 3Content, arrangement, and co-expression profiles of representative FRGCs. Schematic representations of Fusarium-responsive gene clusters are represented as: Fusarium-responsive (coloured arrows), non-responsive genes (grey arrows) and two neighbouring genes (black arrows). Different colours indicate distinct classes of proteins. Co-expression matrices are shown below each cluster. Circle characteristics represent the direction (colour, blue = positive, red = negative) and the strength (size) of the correlation. Four examples of FRGCs are illustrated and correspond to nonhomologous metabolic ((a) benzoxazinoid-like and (b) momilactone-like cluster), paralogous non-metabolic ((c) GSTs cluster), and nonhomologous non-metabolic ((d) TaFROG cluster) categories. FRGC ID and chromosome number are indicated between brackets. (a) The benzoxazinoid-like cluster contain genes coding for a wall-Associated kinase (WAK) and two class of CYP450s genes with four benzoxazinoid-like genes coding for CYP71C, and two CYP79A genes. (b) The momilactone-like cluster is similar to the characterised rice momilactone gene cluster[37], with CYP450s (CYP) mono‐oxygenases (CYP99A2/A3), two terpene synthase kaurene synthase (KS) and copalyl diphosphate synthase (CPS) genes, but without the momilactone A synthase gene absent in the wheat cluster. (c) The GSTs cluster contains an RNA binding protein (RBP) and 8 glutathione-S-transferases (GST) genes. (d) The TaFROG cluster contains genes coding for a lectin receptor kinase (LecRK), the Fusarium resistant orphan protein (TaFROG), a poaceae-specific protein (RZ53), a SWAP/surp domain-containing protein (SWAP), two alcohol dehydrogenase (ADH) and a carboxypeptidase (CDP). Cluster representation is not to scale.
Figure 4Expression profiles of Fusarium-responsive genes within the TaFROG cluster. Transcript levels in wheat (cv. CM-82036) heads after treatment with Fusarium graminearum was measured by qRT-PCR using the formula 2−(Ct target gene—Ct average housekeeping genes). Spikelets were harvested at various hours post-inoculation (hpi) as indicated. Results represent the mean of three independent trials (each include two technical replicates per treatment from a pooled of 4 biological sample) and error bars represent ± S.E.M. Asterisks show significant differences (*P < 0.05, **P < 0.01).
Figure 5Chromosomal distribution of FRGCs, FHB QTL and associated markers. Fusarium-responsive gene clusters and 216 FHB QTL molecular markers were mapped based on their genome positions on the 21 wheat chromosomes. Six QTL regions (Chromosome 1A, 2B, 2D, 4A, 4D and 7D), in which the distance between flanking markers is less than 150 Mb and encompass FRGCs, are indicated with a black vertical bar. FRGCs are represented with a blue circle, and markers with a square, either red if the marker is physically > 5 Mb from a cluster, or green if the marker is physically < 5 Mb from a cluster. Centromeric region are represented as the restricted central region of the chromosomes.
FHB QTL that are smaller than 150 Mb and encompass FRGC.
| QTL | Chromosomes | Sources | Flanking markers | Distance between markers (Mb) | Cluster(s) within QTL | References | |
|---|---|---|---|---|---|---|---|
| QTL-1A | 1A | Pelikan | Xgwm164 | Xbarc28 | 118 | 1 | Häberle et al.[ |
| QTL-2B | 2B | Goldfield | Xbarc200 | Xgwm210-2B | 44.2 | 11 | Gilsinger et al.[ |
| QTL-2D | 2D | Frontana | wPt-732882 | wPt-667765 | 67.4 | 18, 19 | Szabó-Hevér et al.[ |
| QTL-4A | 4A | 86ISMN_2137 | Xbarc246 | Xwmc262 | 44.4 | 23, 24 | McCartney et al.[ |
| QTL-4D | 4D | 86ISMN_2137 | Xwmc720 | wPt-0431 | 52.8 | 26 | McCartney et al.[ |
| QTL-7D | 7D | Haiyanzhong | Xcfd46 | Xwmc702 | 4.7 | 41 | Li et al.[ |