| Literature DB >> 30423067 |
Alex Warwick Vesztrocy1,2, Christophe Dessimoz1,2,3,4,5, Henning Redestig6.
Abstract
Motivation: A key goal in plant biotechnology applications is the identification of genes associated to particular phenotypic traits (for example: yield, fruit size, root length). Quantitative Trait Loci (QTL) studies identify genomic regions associated with a trait of interest. However, to infer potential causal genes in these regions, each of which can contain hundreds of genes, these data are usually intersected with prior functional knowledge of the genes. This process is however laborious, particularly if the experiment is performed in a non-model species, and the statistical significance of the inferred candidates is typically unknown.Entities:
Mesh:
Year: 2018 PMID: 30423067 PMCID: PMC6129274 DOI: 10.1093/bioinformatics/bty615
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Conceptual overview of QTLSearch—to identify the most likely causal genes, by identifying the intersection of genes associated with a given trait based on an evolutionary analysis and QTL analyses
Fig. 2.Overview of the HOGPROP algorithm, for propagating through hierarchical orthologous groups. This visualizes the propagation of a single gene-function association
The six metabolites and their mapped GO and ChEBI terms used to find the distribution of finding at least one spurious candidate in A. thaliana
| Metabolite | GO term | ChEBI term |
|---|---|---|
Fig. 3.Probability of finding at least one spurious candidate in A. thaliana for six metabolites, as a function of QTL length (left y-axis). In the background, histogram of the distribution of QTL lengths reported by Lisec (right y-axis)
Statistics of the number of QTL that could be mapped to GO and/or ChEBI terms from the two datasets in A. thaliana (Lisec ) and O. sativa subsp. indica (Gong )
| Dataset | Metabolites | QTL | Metabolites | QTL |
|---|---|---|---|---|
| 50 | 141 | 35 | 107 | |
| 302 | 1, 260 | 121 | 638 | |
Fig. 4.Proportion of QTL with at least one candidate from Lisec et al. (left) and Gong et al. (right) for each method
Fig. 5.Overlap with the candidate genes reported by Lisec et al. (left) and Gong et al. (right), for QTLSearch (at 1% and 5% significance levels) and the naïve BLAST method
Table of significantly associated genes for a QTL in the Lisec et al. dataset, associated with Galactose
| QTLSearch | |||||
|---|---|---|---|---|---|
| OMA ID | Increase | Direct Annotation | Found by BLAST | Author Candidate | |
| 0.996764 | 0.003126 | ChEBI | ✓ | ✗ | |
| 0.375134 | 0.003916 | ✗ | ✗ | ✗ | |
Fig. 6.Visualization of the propagation of the annotation of ARATH09154 to CHEBI :28260 (Galactose), which leads to an increase in the score for ARATH16587. (Left) before propagation; (Middle) after up-propagation; (Right) After both up-propagation and down-propagation. Note: this hierarchical orthologous group extends above the level of the Rosids
Significantly associated genes for a QTL in the Gong et al. dataset, associated with Chrysoeriol c-hexoside
| QTLSearch | |||||
|---|---|---|---|---|---|
| OMA ID | Increase | Direct Annotation | Found by BLAST | Author Candidate | |
| 1.980263 | 0.000021 | ✗ | ✗ | ✗ | |
| 1.494041 | 0.000048 | UniProt-GOA | ✗ | ✓ | |
| 0.638598 | 0.000260 | ✗ | ✓ | ✗ | |
| 0.638598 | 0.000260 | ✗ | ✓ | ✓ | |
| 0.541781 | 0.000418 | ✗ | ✓ | ✗ | |