Literature DB >> 30307540

Joint analysis of individual-level and summary-level GWAS data by leveraging pleiotropy.

Mingwei Dai1,2, Xiang Wan3, Hao Peng4, Yao Wang1, Yue Liu5, Jin Liu6, Zongben Xu1, Can Yang2.   

Abstract

MOTIVATION: A large number of recent genome-wide association studies (GWASs) for complex phenotypes confirm the early conjecture for polygenicity, suggesting the presence of large number of variants with only tiny or moderate effects. However, due to the limited sample size of a single GWAS, many associated genetic variants are too weak to achieve the genome-wide significance. These undiscovered variants further limit the prediction capability of GWAS. Restricted access to the individual-level data and the increasing availability of the published GWAS results motivate the development of methods integrating both the individual-level and summary-level data. How to build the connection between the individual-level and summary-level data determines the efficiency of using the existing abundant summary-level resources with limited individual-level data, and this issue inspires more efforts in the existing area.
RESULTS: In this study, we propose a novel statistical approach, LEP, which provides a novel way of modeling the connection between the individual-level data and summary-level data. LEP integrates both types of data by LEveraging Pleiotropy to increase the statistical power of risk variants identification and the accuracy of risk prediction. The algorithm for parameter estimation is developed to handle genome-wide-scale data. Through comprehensive simulation studies, we demonstrated the advantages of LEP over the existing methods. We further applied LEP to perform integrative analysis of Crohn's disease from WTCCC and summary statistics from GWAS of some other diseases, such as Type 1 diabetes, Ulcerative colitis and Primary biliary cirrhosis. LEP was able to significantly increase the statistical power of identifying risk variants and improve the risk prediction accuracy from 63.39% (±0.58%) to 68.33% (±0.32%) using about 195 000 variants.
AVAILABILITY AND IMPLEMENTATION: The LEP software is available at https://github.com/daviddaigithub/LEP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 30307540     DOI: 10.1093/bioinformatics/bty870

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  2 in total

1.  Coupled mixed model for joint genetic analysis of complex disorders with two independently collected data sets.

Authors:  Haohan Wang; Fen Pei; Michael M Vanyukov; Ivet Bahar; Wei Wu; Eric P Xing
Journal:  BMC Bioinformatics       Date:  2021-02-05       Impact factor: 3.169

2.  LEP: A Statistical Method Integrating Individual-Level and Summary-Level Data of the Same Trait From Different Populations.

Authors:  Mingwei Dai; Jin Liu; Can Yang
Journal:  Biomed Inform Insights       Date:  2019-10-17
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.