| Literature DB >> 34992619 |
Felipe Roberto Francisco1, Alexandre Hild Aono1, Carla Cristina da Silva1, Paulo S Gonçalves2, Erivaldo J Scaloppi Junior2, Vincent Le Guen3,4, Roberto Fritsche-Neto5, Livia Moura Souza1,6, Anete Pereira de Souza1,7.
Abstract
Hevea brasiliensis (rubber tree) is a large tree species of the Euphorbiaceae family with inestimable economic importance. Rubber tree breeding programs currently aim to improve growth and production, and the use of early genotype selection technologies can accelerate such processes, mainly with the incorporation of genomic tools, such as marker-assisted selection (MAS). However, few quantitative trait loci (QTLs) have been used successfully in MAS for complex characteristics. Recent research shows the efficiency of genome-wide association studies (GWAS) for locating QTL regions in different populations. In this way, the integration of GWAS, RNA-sequencing (RNA-Seq) methodologies, coexpression networks and enzyme networks can provide a better understanding of the molecular relationships involved in the definition of the phenotypes of interest, supplying research support for the development of appropriate genomic based strategies for breeding. In this context, this work presents the potential of using combined multiomics to decipher the mechanisms of genotype and phenotype associations involved in the growth of rubber trees. Using GWAS from a genotyping-by-sequencing (GBS) Hevea population, we were able to identify molecular markers in QTL regions with a main effect on rubber tree plant growth under constant water stress. The underlying genes were evaluated and incorporated into a gene coexpression network modelled with an assembled RNA-Seq-based transcriptome of the species, where novel gene relationships were estimated and evaluated through in silico methodologies, including an estimated enzymatic network. From all these analyses, we were able to estimate not only the main genes involved in defining the phenotype but also the interactions between a core of genes related to rubber tree growth at the transcriptional and translational levels. This work was the first to integrate multiomics analysis into the in-depth investigation of rubber tree plant growth, producing useful data for future genetic studies in the species and enhancing the efficiency of the species improvement programs.Entities:
Keywords: GBS; GWAS; Hevea brasiliensis; QTL; RNA-Seq; WGCNA; linkage disequilibrium; metabolic networks
Year: 2021 PMID: 34992619 PMCID: PMC8724537 DOI: 10.3389/fpls.2021.768589
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Workflow summarizing the main analyses performed.
Figure 2(A) Quantile-quantile plot for the broad genomic association model (GWAS), with the inclusion of the first main component (PC1 and PC2) as a covariate. (B) Manhattan plot for the GWAS. The x axis shows the chromosomes containing the discovered markers in their respective positions. The y axis shows the log (value of p) of the association. The green line represents the threshold obtained based on the data, and the red line represents the Bonferroni-corrected threshold of 0.05.
SNPs identified through the GWAS model.
| SNP | Chrom | Position |
| MAF | Effect | Va | PVE | Gene |
|---|---|---|---|---|---|---|---|---|
| SNP6421 | chrom02 | 14,565,718 | 1.06E + 08 | 0.10 | −1.00 | 0.18 | 0.05 | SBT4.6 |
| SNP30209 | chrom05 | 75,998,329 | 8.38E + 08 | 0.17 | 0.54 | 0.08 | 0.02 | GEK1 |
| SNP43760 | chrom08 | 26,946,649 | 9.61E + 06 | 0.45 | 0.84 | 0.35 | 0.09 | - |
| SNP92152 | chrom15 | 50,878,458 | 2.71E + 08 | 0.29 | 0.43 | 0.08 | 0.02 | IQM2 |
.
Figure 3Physical position of snpsGWAS in red, snpsLD in black and QTLs discovered by Conson et al. (2018). The QTLs for plant height (PH) are in blue and those for stem diameter (SD) are in green.
Figure 4Treemap representing the biological processes for the GO terms of the annotated SNPs.
Figure 5Coexpression network containing the SNP gene modules discovered by GWAS. Yellow shows the genes annotated for the snpsGWAS, blue shows the genes annotated for the snpsLD and gray shows the genes identified in the modules. The highlighted genes with a red border represent the 10 hubs with the most connectivity, while the size of the nodes shows the number of connected genes.
Figure 6Treemap representing the biological processes for the GO terms of the annotated functional modules.
Figure 7(A) Enzyme network. The yellow nodes represent the enzymes discovered in the coexpression modules, and the rectangular nodes indicate the enzymes with the highest centrality values. (B) Communities. The blue nodes are represented by communities containing enzymes discovered in the coexpression modules.
| ABA | Hormone abscisic acid |
| BLUP | Best linear unbiased predictor |
| CINV2 | Alkaline/neutral invertase CINV2 |
| CPM | Counts per million |
| DEGs | Differentially expressed genes |
| EC | Enzyme commission |
| EST | Expressed sequence tag |
| FarmCPU | Fixed and random model Circulating Probability Unification |
| FBX5 | Arabidillo 1 protein |
| GBS | Genotyping-by-sequencing |
| GO | Gene Ontology |
| GOLS2 | Protein galactinol synthase 2 |
| GSO1 | LRR receptor-like serine/threonine-protein kinase |
| GWAS | Genome-wide association studies |
|
| Broad heritability |
| HGN1 | Protein glucan endo-1,3-beta-glucosidase |
| INV | Invertase enzyme |
| JUB1 | Transcription factor jumgbrunnen 14 |
| KEGG | Kyoto encyclopedia of genes and genomes |
| kNNI | k-nearest neighbor imputation |
| LD | Linkage disequilibrium |
| LRT | Likelihood ratio test |
| MAF | Minimum allele frequency |
| MAS | Marker-assisted selection |
| MIOX | Substrate of the myo-inositol oxygenase |
| ML4 | MEI2-like 4 |
| NGS | Next-generation sequencing |
| PCA | Principal component analysis |
| PE | Paired-end |
| PUB23 | Protein E3 ubiquitin-protein ligase PUB23 |
| Q-Q plot | Quantile-quantile graph |
| QTLS | Quantitative trait loci |
|
| Squared Pearson correlation |
| RAPTOR1 | Regulatory-associated protein of TOR 1 |
| RE1 RNA-seq | RNA-sequencing |
| RE1 | Retrovirus-related Pol polyprotein from transposon |
| SD | Stem diameter |
| SNPs | Single nucleotide polymorphism |
| snpsGWAS | SNPs discovered by GWAS |
| snpsLD | GWAS LD-associated markers |
| SSRs | Microsatellite markers |
| SUS | Sucrose synthase enzyme |
| TE | Transposons elements |
| Thr | Theonine |
| thrC | Threonine synthase |
| TOM | Topological overlap measure |
| UPGMA | Unweighted Pair-Group Method using Arithmetic Avarages |
| USP | UDP-sugar pyrophosphorylase |
| WGCNA | Weighted gene correlation analysis |