| Literature DB >> 27980669 |
Katherine L Thompson1, David W Fardo2.
Abstract
A central goal in the biomedical and biological sciences is to link variation in quantitative traits to locations along the genome (single nucleotide polymorphisms). Sequencing technology has rapidly advanced in recent decades, along with the statistical methodology to analyze genetic data. Two classes of association mapping methods exist: those that account for the evolutionary relatedness among individuals, and those that ignore the evolutionary relationships among individuals. While the former methods more fully use implicit information in the data, the latter methods are more flexible in the types of data they can handle. This study presents a comparison of the 2 types of association mapping methods when they are applied to simulated data.Entities:
Year: 2016 PMID: 27980669 PMCID: PMC5133494 DOI: 10.1186/s12919-016-0063-4
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Fig. 1Example of the evolutionary history within a particular SNP represented by a phylogeny. In the phylogenetic tree, time moves from past (left) to present (right) across the tree. The tips of the tree represent observations from the present time. Suppose the SNP represented by this tree is associated with a trait. Then, a large covariance is expected among trait values from 2 observations (e.g., the blue diamonds) sharing a large portion of their evolutionary history (shown by the branches in blue). In contrast, the 2 observations denoted by black circles share a smaller portion of their evolutionary history, so that little covariance in the corresponding trait values is expected
Comparing power and type I error across methods using simulated data
| Gene |
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Type I error | LSS | 0.010 | 0.045 | 0.020 | 0.015 | 0.020 |
|
| 0.075 | 0.100 | 0.085 | 0.100 | 0.100 | |
| Power | LSS | 1.000 | 0.855 | 0.025 | 0.030 | 0.100 |
|
| 1.000 | 0.995 | 0.140 | 0.110 | 0.035 | |
This table shows the power of detection for each of the 5 considered genes when considering 200 simulated Q1 (null) and 200 SBP phenotypes. The type I error of LSS appeared to be well controlled below 0.05 (row 1), whereas the t-statistic shows slightly inflated type I error rates (row 2). Both LSS and the t-test performed well when analyzing TNN and LEPR (rows 3 and 4). Both methods showed smaller power in the analysis of FLT3, TCIRG1, and GSN (rows 3 and 4). Neither method showed uniquely better performance across the 5 genes studied
Fig. 2Example plot of each test statistic across the genes analyzed. Each upper plot in panels a to e shows the absolute value of the t-statistic against the location of each SNP along the chromosome. Each lower plot shows the average LSS value plotted against the base pair location of each SNP. Each red dot shows the true absolute value of the effect size at that respective locus. Both the t-test and LSS show similar precision in localizing the true absolute effects