| Literature DB >> 35310628 |
Francesco Pancaldi1, Dennis Vlegels1, Hugo Rijken1, Eibertus N van Loo1, Luisa M Trindade1.
Abstract
Translational genomics can enable a quicker improvement of orphan crops toward novel agricultural applications, including the advancement of orphan biomass species for cultivation on marginal lands. In this sense, cell wall quality is a preeminent breeding target. However, tools to efficiently project genetic data on target traits across large sets of species are currently missing. This study aimed at closing this gap by developing a strategy to project a set of cell wall QTLs across a large group of plants by using genome synteny. This strategy is suited for large-scale analyses and detected 362 syntenic cell wall QTLs (SQTLs) across 74 angiosperms, including several (orphan) biomass species. SQTLs analyses revealed that they span large portions of the initial cell wall QTLs and are extensively conserved across diverse species. Moreover, numerous QTLs cell wall genes were conserved through SQTLs, including genes displaying allelic variation associated with cell wall composition. Functional analyses showed that highly conserved genes of SQTLs include important cell wall transcription factors and genes involved in the remodeling of cell wall polymers. For some of these gene families, SQTLs indicated the presence of differentially conserved genomic contexts for different gene members, highlighting their utility as a tool to pinpoint gene targets that maximize the likelihood of functional gene conservation. Overall, the results of this study can facilitate "universal" approaches for breeding (orphan) biomass crops, while the strategy for QTLs translation can be applied to other sets of traits and species, helping to unlock the potential of orphan species.Entities:
Keywords: QTLs; biomass quality; breeding tools; cell wall; crop improvement; orphan crops; syntenic QTLs; synteny
Year: 2022 PMID: 35310628 PMCID: PMC8928447 DOI: 10.3389/fpls.2022.855093
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Lists of plant families used for SQTLs detection for each of the plant families for which initial QTLs were retrieved.
| Initial QTLs families | Plant families selected for SQTL detection |
| Brassicaceae | Brassicaceae, Malvaceae, Cleomaceae, Anacardaceae, Actinidiaceae, Myrtaceae |
| Fabaceae | Fabaceae, Salicaceae, Moraceae, Rhamnaceae, Linaceae, Cannabaceae, Euphorbiaceae, Rosaceae, Vitaceae, Cucurbitaceae, Crassulaceae, Nelumbonaceae, Ranunculaceae, Myrtaceae, Papaveraceae, Lythraceae, Anacardaceae, Rutaceae, Malvaceae, Brassicaceae, Cleomaceae, Amaranthaceae, Actinidiaceae, Convolvulaceae, Solanaceae, Rubiaceae, Oleaceae, Pedaliaceae, Phrymaceae, Asteraceae, Apiaceae |
| Myrtaceae | Lythraceae, Anacardaceae, Rutaceae, Malvaceae, Cucurbitaceae, Rosaceae, Rhamnaceae, Fabaceae, Actinidiaceae, Salicaceae, Rubiaceae, Vitaceae, Oleaceae, Pedaliaceae, Phrymaceae, Apiaceae |
| Salicaceae | Salicaceae, Linaceae, Fabaceae, Euphorbiaceae, Moraceae, Rhamnaceae, Vitaceae, Cannabaceae, Rosaceae, Crassulaceae, Cucurbitaceae, Nelumbonaceae, Ranunculaceae, Papaveraceae, Myrtaceae, Lythraceae, Anacardaceae, Rutaceae, Malvaceae, Brassicaceae, Cleomaceae, Amaranthaceae, Lauraceae, Theaceae, Amborellaceae, Actinidiaceae, Convolvulaceae, Solanaceae, Rubiaceae, Oleaceae, Pedaliaceae, Phrymaceae, Asteraceae, Apiaceae |
| Poaceae | Arecaceae, Araceae, Bromeliaceae, Asparagaceae, Orchidaceae, Musaceae, Poaceae |
The selection of plant families was based on the synteny level of the different initial QTLs against the 151 species of the project, as described in paragraph “Filtering the Syntenic Quantitative Trait Loci Network”.
FIGURE 1The level of synteny of different relevant classes of genes. (A) Boxplots of the number of syntenic connections of cell wall genes and of non-cell wall genes from the 151 genomes of the study. (B) Boxplots of the number of syntenic connections of cell wall genes and of non-cell wall genes from the 594 cell wall QTLs of the study spanning at least one gene. (C) Boxplots of the number of syntenic connections of cell wall QTL genes and of genes not included in cell wall QTLs from the 151 genomes of the study. **Significant at alpha = 0.01.
FIGURE 2The pipeline followed for detecting syntenic cell wall QTLs (SQTLs). (A) The starting data sets of 151 angiosperm genomes, of 610 cell wall QTLs from 8 diverse plant species, and of ∼250000 candidate cell wall genes identified across all the genomes of the study. (B) A synteny network of QTL genes across the species of the study was created, where each node (gray circles) represents a gene and black edges represent syntenic connections between genes. (C) Syntenic communities grouping the members of specific gene families that are syntenic across (groups of) plants were detected and annotated for the initial cell wall QTLs harbored by (some of) their members. Colored circles represent genes, and circles with the same color are genes belonging to the same syntenic community. Colored stars represent initial cell wall QTLs, with different colors indicating different initial cell wall QTLs. (D) Syntenic communities were clustered into SQTLs based on similarity of the cell wall QTLs harbored by each community. A SQTLs network was therefore obtained, where each node represent a syntenic community (gray circles), black edges between communities indicate a similarity >0.6 between two syntenic communities in terms of cell wall QTLs harbored by them. (E) Representation of the genomic meaning of SQTLs. Each double strip with circles represents the genome of a species, and colored circles represent genes of different types (different colors). A SQTL represents a genomic region syntenic between multiple species (same genes in the same order) and that in at least one species spans at least (a part of) one initial cell wall QTL (black rectangle).
FIGURE 3(A) Example of a SQTL (SQTL_196). The orange nodes represent all the colinear syntenic communities that group syntenic genes of which at least some are retained within initial cell wall QTLs. (B) Two of the syntenic communities that constitute the SQTL_196. Green rectangles represent community nodes. Pink stars indicate maize (ND_) and miscanthus (GQ_) nodes included within initial cell wall QTLs. Red nodes indicate a gene from maize (ND_NP_001106047.1) and a gene from rice (HN_XP_015650053.1) that correspond to the maize Brown Midrib 3 locus and the rice CAldOMT1 locus, respectively. Both these loci encode for a COMT gene involved in lignin synthesis and are known to display allelic variation associated to phenotypic variation in cell wall quality in across maize and rice populations (see Supplementary Table 9).
FIGURE 4(A) The level of fragmentation of the initial cell wall QTLs from the different groups of Table 1 over the SQTLs in which they are involved. Fragmentation is expressed as number of different SQTLs over which different parts of initial QTL resulted involved. (B) The variability of SQTL length (expressed as number of SQTL gene per species involved in a SQTL) of the SQTLs detected for the plant groups of Table 1.
The extensiveness of SQTLs across the groups of species used for their detection.
| SQTL group | Total species in SQTL group | Total number of SQTLs | Average species spanned by SQTLs | Percentage over total species |
| Brassicaceae | 29 | 9 | 6 | 21% |
| Fabaceae | 111 | 16 | 31 | 28% |
| Myrtaceae | 59 | 6 | 21 | 35% |
| Salicaceae | 112 | 50 | 25 | 22% |
| Poaceae | 27 | 281 | 17 | 62% |
The 22 SQTLs selected for candidate gene inspection based on a high co-localization of initial QTLs and the inclusion of diverse plant species.
| SQTL | SQTL group | Number of QTLs co-localizing in SQTL | Total species in SQTL | Species included in SQTL |
| SQTL_217 | Fabaceae | 6 | 2 | Eucalyptus grandis; Glycine max |
| SQTL_23 | Myrtaceae | 9 | 2 | Eucalyptus grandis; Populus trichocarpa |
| SQTL_121 | Myrtaceae | 7 | 3 | Arabidopsis thaliana; Eucalyptus grandis; Populus trichocarpa |
| SQTL_169 | Myrtaceae | 7 | 4 | Eucalyptus grandis; Glycine max; Populus trichocarpa; Populus trichocarpa |
| SQTL_187 | Salicaceae | 14 | 3 | Arabidopsis thaliana; Eucalyptus grandis; Populus trichocarpa |
| SQTL_246 | Salicaceae | 11 | 3 | Arabidopsis thaliana; Eucalyptus grandis; Populus trichocarpa |
| SQTL_50 | Salicaceae | 8 | 3 | Arabidopsis thaliana; Eucalyptus grandis; Populus trichocarpa |
| SQTL_174 | Salicaceae | 8 | 3 | Eucalyptus grandis; Populus trichocarpa; Populus trichocarpa |
| SQTL_14 | Salicaceae | 8 | 2 | Arabidopsis thaliana; Populus trichocarpa |
| SQTL_39 | Salicaceae | 7 | 3 | Arabidopsis thaliana; Eucalyptus grandis; Populus trichocarpa |
| SQTL_53 | Salicaceae | 6 | 4 | Arabidopsis thaliana; Eucalyptus grandis; Populus trichocarpa; Populus trichocarpa |
| SQTL_2 | Poaceae | 45 | 4 | Miscanthus sinensis; Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_160 | Poaceae | 25 | 4 | Miscanthus sinensis; Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_188 | Poaceae | 24 | 4 | Miscanthus sinensis; Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_245 | Poaceae | 23 | 4 | Miscanthus sinensis; Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_20 | Poaceae | 20 | 2 | Sorghum bicolor; Zea mays |
| SQTL_91 | Poaceae | 20 | 4 | Miscanthus sinensis; Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_47 | Poaceae | 20 | 4 | Miscanthus sinensis; Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_56 | Poaceae | 19 | 3 | Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_69 | Poaceae | 18 | 2 | Miscanthus sinensis; Zea mays |
| SQTL_73 | Poaceae | 16 | 4 | Miscanthus sinensis; Oryza sativa; Sorghum bicolor; Zea mays |
| SQTL_60 | Poaceae | 16 | 3 | Miscanthus sinensis; Sorghum bicolor; Zea mays |
FIGURE 5The proportions of cell wall genes participating to different cell wall processes observed among a set of 22 SQTLs showing an exceptionally high co-localization of initial cell wall QTLs from diverse plant species and highly conserved across different angiosperms (A), among the genes included in all the initial 594 cell wall QTLs spanning at least one gene (B), and among all the genes from the 151 angiosperm genomes of the study (C).
FIGURE 6The structure of the syntenic communities of some of the highly conserved cell wall gene families included within 22 SQTLs showing an exceptionally high co-localization of initial cell wall QTLs from diverse plant species and a high extensiveness across angiosperms. Each independent network represents a gene community conserved across a set of species. Network nodes indicate the gene members of syntenic communities from different species (see legend). Network nodes with black bold edges indicate genes contained within initial cell wall QTLs. (A) Vascular-related NAC domain (VND); (B) WRKY12; (C) NAC Secondary cell wall Thickening factor (NST); (D) caffeoyl shikimate esterases (CSE); (E) affeoyl CoA O-methyltransferases (CCoAOMT); (F) mediator complex subunits (MED); (G) Reduced Wall Acetylation; (H) BAHD Acyltransferase.
FIGURE 7The structure of the syntenic communities of some of the monocot and dicot cell wall genes known to display allelic/mutational variation associated with variation in cell wall quality phenotypes and included in SQTLs. Each independent network represents a gene community conserved across a set of species. Network nodes indicate the gene members of syntenic communities from different species (see legend). Network nodes with black bold edges indicate genes contained within initial cell wall QTLs. Triangular nodes indicate that the gene was shown to display allelic/mutational variation associated with variation in cell wall quality phenotypes. Circular nodes are positional orthologs of those genes. (A) Zea mays Brown Midrib 1 (ZmBM1); (B) Zea mays Brown Midrib 3 (ZmBM3); (C) Zea mays Brown Midrib 4 (ZmBM4); (D) Oryza sativa Brittle culm-like 8 (OsBCL8); (E) Oryza sativa Brittle culm-like 9 (OsBCL9); (F) Arabidopsis thaliana Cobra-like 4 (AtCOBL4); (G) Arabidopsis thaliana Irregular xylem 14 (AtIRX14).
The cell wall related gene classes from the initial cell wall QTLs whose conservation through SQTLs resulted highest and lowest.
| Gene function | Broad cell wall process | Conservation level in SQTLs | Copy number in cell wall QTLs | Copy number in SQTLs (for species with available QTLs) | Copy number decrement SQTLs/QTLs (%) |
|
| Cellulose synthesis | Highly conserved | 1 | 1 | 0 |
|
| Transcription factor | Highly conserved | 4 | 4 | 0 |
|
| Transcription factor | Highly conserved | 2 | 2 | 0 |
|
| Transcription factor | Highly conserved | 4 | 4 | 0 |
|
| Transcription factor | Highly conserved | 5 | 5 | 0 |
|
| Transcription factor | Highly conserved | 5 | 5 | 0 |
|
| Transcription factor | Highly conserved | 4 | 4 | 0 |
|
| Cellulose synthesis | Highly conserved | 10 | 9 | 10 |
|
| Hemicellulose metabolism | Highly conserved | 10 | 9 | 10 |
|
| Hemicellulose metabolism | Highly conserved | 10 | 9 | 10 |
|
| Hemicellulose metabolism | Highly conserved | 19 | 17 | 11 |
|
| Transcription factor | Highly conserved | 8 | 7 | 13 |
|
| Transcription factor | Highly conserved | 8 | 7 | 13 |
|
| Transcription factor | Highly conserved | 8 | 7 | 13 |
|
| Transcription factor | Highly conserved | 8 | 7 | 13 |
|
| Pectin metabolism | Highly conserved | 14 | 12 | 14 |
|
| Pectin metabolism | Highly conserved | 14 | 12 | 14 |
|
| Transcription factor | Highly conserved | 7 | 6 | 14 |
|
| Hemicellulose metabolism | Highly conserved | 12 | 10 | 17 |
|
| Transcription factor | Highly conserved | 6 | 5 | 17 |
|
| Transcription factor | Highly conserved | 6 | 5 | 17 |
|
| Cellulose synthesis | Highly conserved | 5 | 4 | 20 |
|
| Lignin metabolism | Highly conserved | 5 | 4 | 20 |
|
| Transcription factor | Highly conserved | 39 | 31 | 21 |
|
| Transcription factor | Highly conserved | 34 | 27 | 21 |
|
| Hemicellulose metabolism | Highly conserved | 54 | 42 | 22 |
|
| Pectin metabolism | Highly conserved | 54 | 42 | 22 |
|
| Hemicellulose metabolism | Highly conserved | 44 | 34 | 23 |
|
| Pectin metabolism | Poorly conserved | 72 | 21 | 71 |
|
| Lignin metabolism | Poorly conserved | 14 | 4 | 71 |
|
| Hemicellulose metabolism | Poorly conserved | 4 | 1 | 75 |
|
| Transcription factor | Poorly conserved | 4 | 1 | 75 |
|
| Transcription factor | Poorly conserved | 4 | 1 | 75 |
|
| Lignin metabolism | Poorly conserved | 52 | 11 | 79 |
|
| Lignin metabolism | Poorly conserved | 5 | 1 | 80 |
|
| Lignin metabolism | Poorly conserved | 44 | 8 | 82 |
|
| Callose synthesis | Poorly conserved | 12 | 2 | 83 |
|
| Cellulose synthesis UDP Glu supply | Poorly conserved | 6 | 1 | 83 |
|
| Other cell wall protein | Poorly conserved | 6 | 1 | 83 |
|
| Other cell wall protein | Poorly conserved | 12 | 2 | 83 |
|
| Lignin metabolism | Poorly conserved | 19 | 3 | 84 |
|
| Lignin metabolism | Poorly conserved | 20 | 3 | 85 |
|
| Other cell wall protein | Poorly conserved | 2 | 0 | 100 |
|
| Other cell wall protein | Poorly conserved | 1 | 0 | 100 |
|
| Other cell wall protein | Poorly conserved | 1 | 0 | 100 |
|
| Hemicellulose metabolism | Poorly conserved | 1 | 0 | 100 |
|
| Hemicellulose metabolism | Poorly conserved | 4 | 0 | 100 |
|
| Transcription factor | Poorly conserved | 2 | 0 | 100 |
|
| Transcription factor | Poorly conserved | 3 | 0 | 100 |
FIGURE 8The structure of the syntenic communities of some of the gene families included in initial cell wall QTLs and overall highly conserved through SQTLs. Each independent network represents a gene community conserved across a set of species. Network nodes indicate the gene members of syntenic communities from different species (see legend). Network nodes with black bold edges indicate genes contained within initial cell wall QTLs. (A) Xylem NAC domain (XND); (B) BEL1-like Homeodomain 9 (BLH9); (C) Knotted1-like homeobox gene 3 (KNAT3); (D) PARVUS; (E) Ferulate 5-hydroxylase (F5H); (F) STELLO (STL).