| Literature DB >> 29519937 |
Palle Duun Rohde1,2,3, Solveig Østergaard4, Torsten Nygaard Kristensen5,6, Peter Sørensen4, Volker Loeschcke5, Trudy F C Mackay7,8,9, Pernille Sarup4.
Abstract
Understanding the genetic underpinnings of complex traits requires knowledge of the genetic variants that contribute to phenotypic variability. Reliable statistical approaches are needed to obtain such knowledge. In genome-wide association studies, variants are tested for association with trait variability to pinpoint loci that contribute to the quantitative trait. Because stringent genome-wide significance thresholds are applied to control the false positive rate, many true causal variants can remain undetected. To ameliorate this problem, many alternative approaches have been developed, such as genomic feature models (GFM). The GFM approach tests for association of set of genomic markers, and predicts genomic values from genomic data utilizing prior biological knowledge. We investigated to what degree the findings from GFM have biological relevance. We used the Drosophila Genetic Reference Panel to investigate locomotor activity, and applied genomic feature prediction models to identify gene ontology (GO) categories predictive of this phenotype. Next, we applied the covariance association test to partition the genomic variance of the predictive GO terms to the genes within these terms. We then functionally assessed whether the identified candidate genes affected locomotor activity by reducing gene expression using RNA interference. In five of the seven candidate genes tested, reduced gene expression altered the phenotype. The ranking of genes within the predictive GO term was highly correlated with the magnitude of the phenotypic consequence of gene knockdown. This study provides evidence for five new candidate genes for locomotor activity, and provides support for the reliability of the GFM approach.Entities:
Keywords: DGRP; Drosophila melanogaster; genomic prediction; locomotor activity; set test
Mesh:
Year: 2018 PMID: 29519937 PMCID: PMC5940157 DOI: 10.1534/g3.118.200082
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Outline of the study workflow starting with quantifying levels of activity for the 204 DGRP lines. A genomic feature prediction model was used to identify marker sets, defined by gene ontology (GO) terms that increased the predictive ability compared to a model where all markers were used simultaneously. The genes within a predictive GO term are likely to contribute unequally to the predictive performance, thus, the genes were ranked according to their contribution (quantified using CVAT). Top genes were functionally validated by suppressing gene expression and assessing the effect of such reduction on locomotor activity.
Figure 2Rank-ordered mean for locomotor activity (green) and the corresponding mean values adjusted for experimental fixed effects (brown). Shaded area depicts the standard error of the mean.
The top five GO terms with highest predictive ability (PA). For each GO term the following information is listed: Number of genes (No. genes) and SNPs (No. SNPs) within the GO term, the mean PA with standard errors (SE), the raw (p) and adjusted p-values (by false discovery rate (FDR)) for increased predictive performance compared to the GBLUP model, and the proportion of genomic variance explained by the GO term ()
| 1. GO:0022857 | 59 | 2563 | 0.35 ± 0.026 | 0.53 | ||
| 2. GO:0006730 | 17 | 749 | 0.27 ± 0.029 | 0.28 | ||
| 3. GO:0006810 | 80 | 6893 | 0.25 ± 0.028 | 0.44 | ||
| 4. GO:0055114 | 368 | 22029 | 0.25 ± 0.029 | 1.00 | ||
| 5. GO:0030866 | 21 | 2161 | 0.25 ± 0.027 | 0.30 |
1: transmembrane transporter activity; 2: one-carbon metabolic process; 3: transport; 4: oxidation-reduction process; 5: cortical actin cytoskeleton organization.
Figure 3Results from the GFBLUP models. Each point corresponds to one GO term that is plotted within the space of genomic variance explained () and predictive ability (PA). The size of each point relates to the number of SNPs within the GO term, and the color indicates the p-value of increased predictive ability compared to the GBLUP model. The mean predictive ability ± standard error (SE) of the GBLUP model is indicated with green vertical and horizontal lines, respectively.
Figure 4Effects on distance moved by gene expression knockdown expressed as absolute deviations from control line (red: increased activity; blue: decreased activity) ranked according to estimated effect size. Asterisks (∗) indicate statistical significance (i.e., p < 0.05) between knockdown strain and control strain. Gray points (belongs to the right y-axis) show the estimated relative effect size within the predictive GO term, here illustrated as −log10(p) from CVAT.