| Literature DB >> 28419606 |
Maggie Haitian Wang1,2, Billy Chang1, Rui Sun1,2, Inchi Hu3, Xiaoxuan Xia1, William Ka Kei Wu4, Ka Chun Chong1,2, Benny Chung-Ying Zee1,2.
Abstract
Genetic data consists of a wide range of marker types, including common, low-frequency, and rare variants. Multiple genetic markers and their interactions play central roles in the heritability of complex disease. In this study, we propose an algorithm that uses a stratified variable selection design by genetic architectures and interaction effects, achieved by a dataset-adaptive W-test. The polygenic sets in all strata were integrated to form a classification rule. The algorithm was applied to the Critical Assessment of Genome Interpretation 4 bipolar challenge sequencing data. The prediction accuracy was 60% using genetic markers on an independent test set. We found that epistasis among common genetic variants contributed most substantially to prediction precision. However, the sample size was not large enough to draw conclusions for the lack of predictability of low-frequency variants and their epistasis.Entities:
Keywords: W-test; bipolar; classification of complex disorder; disease prediction; epistasis; interaction effect; mutation; polygenic risk stratification
Mesh:
Year: 2017 PMID: 28419606 PMCID: PMC5561515 DOI: 10.1002/humu.23229
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878