| Literature DB >> 24905468 |
Gulnara R Svishcheva1, Nadezhda M Belonogova1, Tatiana I Axenovich2.
Abstract
The kernel machine-based regression is an efficient approach to region-based association analysis aimed at identification of rare genetic variants. However, this method is computationally complex. The running time of kernel-based association analysis becomes especially long for samples with genetic (sub) structures, thus increasing the need to develop new and effective methods, algorithms, and software packages. We have developed a new R-package called fast family-based sequence kernel association test (FFBSKAT) for analysis of quantitative traits in samples of related individuals. This software implements a score-based variance component test to assess the association of a given set of single nucleotide polymorphisms with a continuous phenotype. We compared the performance of our software with that of two existing software for family-based sequence kernel association testing, namely, ASKAT and famSKAT, using the Genetic Analysis Workshop 17 family sample. Results demonstrate that FFBSKAT is several times faster than other available programs. In addition, the calculations of the three-compared software were similarly accurate. With respect to the available analysis modes, we combined the advantages of both ASKAT and famSKAT and added new options to empower FFBSKAT users. The FFBSKAT package is fast, user-friendly, and provides an easy-to-use method to perform whole-exome kernel machine-based regression association analysis of quantitative traits in samples of related individuals. The FFBSKAT package, along with its manual, is available for free download at http://mga.bionet.nsc.ru/soft/FFBSKAT/.Entities:
Mesh:
Year: 2014 PMID: 24905468 PMCID: PMC4048315 DOI: 10.1371/journal.pone.0099407
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Dependence of the running times of the second step of mini-exome analysis of quantitative trait Q1 on sample size for different methods (using one processor at 3.07 GHz).
Points show the estimated running times (RT), lines correspond to the linear regression equations: RT ASKAT = 9×10−6 n 3–.753; RT famSKAT = 6.7×10–5 n 2–2.8, and RT FFBSKAT = 1.7×10–5 n 2–3.7, where n is the sample size.
Figure 2Comparison of the P values (shown as minus base 10 logarithm) computed with famSKAT, ASKAT, and FFBSKAT given a sample of 500 individuals, for two causal genes, FLT1 and VEGFA.
200 realizations of Q1 quantitative trait in GAW17 data were analyzed. The line indicates one-to-one correspondence.