Literature DB >> 26661113

Exploiting Linkage Disequilibrium for Ultrahigh-Dimensional Genome-Wide Data with an Integrated Statistical Approach.

Michelle Carlsen1, Guifang Fu2, Shaun Bushman3, Christopher Corcoran1.   

Abstract

Genome-wide data with millions of single-nucleotide polymorphisms (SNPs) can be highly correlated due to linkage disequilibrium (LD). The ultrahigh dimensionality of big data brings unprecedented challenges to statistical modeling such as noise accumulation, the curse of dimensionality, computational burden, spurious correlations, and a processing and storing bottleneck. The traditional statistical approaches lose their power due to [Formula: see text] (n is the number of observations and p is the number of SNPs) and the complex correlation structure among SNPs. In this article, we propose an integrated distance correlation ridge regression (DCRR) approach to accommodate the ultrahigh dimensionality, joint polygenic effects of multiple loci, and the complex LD structures. Initially, a distance correlation (DC) screening approach is used to extensively remove noise, after which LD structure is addressed using a ridge penalized multiple logistic regression (LRR) model. The false discovery rate, true positive discovery rate, and computational cost were simultaneously assessed through a large number of simulations. A binary trait of Arabidopsis thaliana, the hypersensitive response to the bacterial elicitor AvrRpm1, was analyzed in 84 inbred lines (28 susceptibilities and 56 resistances) with 216,130 SNPs. Compared to previous SNP discovery methods implemented on the same data set, the DCRR approach successfully detected the causative SNP while dramatically reducing spurious associations and computational time.
Copyright © 2016 by the Genetics Society of America.

Entities:  

Keywords:  GWAS; GenPred; case–control; feature screening; genomic selection; large-scale modeling; linkage disequilibrium; shared data resource

Mesh:

Year:  2015        PMID: 26661113      PMCID: PMC4788225          DOI: 10.1534/genetics.115.179507

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


  70 in total

Review 1.  Association study designs for complex diseases.

Authors:  L R Cardon; J I Bell
Journal:  Nat Rev Genet       Date:  2001-02       Impact factor: 53.242

Review 2.  Linkage disequilibrium and the search for complex disease genes.

Authors:  L B Jorde
Journal:  Genome Res       Date:  2000-10       Impact factor: 9.043

3.  Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21.

Authors:  N Patil; A J Berno; D A Hinds; W A Barrett; J M Doshi; C R Hacker; C R Kautzer; D H Lee; C Marjoribanks; D P McDonough; B T Nguyen; M C Norris; J B Sheehan; N Shen; D Stern; R P Stokowski; D J Thomas; M O Trulson; K R Vyas; K A Frazer; S P Fodor; D R Cox
Journal:  Science       Date:  2001-11-23       Impact factor: 47.728

4.  High-resolution haplotype structure in the human genome.

Authors:  M J Daly; J D Rioux; S F Schaffner; T J Hudson; E S Lander
Journal:  Nat Genet       Date:  2001-10       Impact factor: 38.330

5.  Linkage disequilibrium in the human genome.

Authors:  D E Reich; M Cargill; S Bolk; J Ireland; P C Sabeti; D J Richter; T Lavery; R Kouyoumjian; S F Farhadian; R Ward; E S Lander
Journal:  Nature       Date:  2001-05-10       Impact factor: 49.962

6.  The structure of haplotype blocks in the human genome.

Authors:  Stacey B Gabriel; Stephen F Schaffner; Huy Nguyen; Jamie M Moore; Jessica Roy; Brendan Blumenstiel; John Higgins; Matthew DeFelice; Amy Lochner; Maura Faggart; Shau Neen Liu-Cordero; Charles Rotimi; Adebowale Adeyemo; Richard Cooper; Ryk Ward; Eric S Lander; Mark J Daly; David Altshuler
Journal:  Science       Date:  2002-05-23       Impact factor: 47.728

7.  A first-generation linkage disequilibrium map of human chromosome 22.

Authors:  Elisabeth Dawson; Gonçalo R Abecasis; Suzannah Bumpstead; Yuan Chen; Sarah Hunt; David M Beare; Jagjit Pabial; Thomas Dibling; Emma Tinsley; Susan Kirby; David Carter; Marianna Papaspyridonos; Simon Livingstone; Rocky Ganske; Elin Lõhmussaar; Jana Zernant; Neeme Tõnisson; Maido Remm; Reedik Mägi; Tarmo Puurand; Jaak Vilo; Ants Kurg; Kate Rice; Panos Deloukas; Richard Mott; Andres Metspalu; David R Bentley; Lon R Cardon; Ian Dunham
Journal:  Nature       Date:  2002-07-10       Impact factor: 49.962

8.  Sample sizes required to detect linkage disequilibrium between two or three loci.

Authors:  A H Brown
Journal:  Theor Popul Biol       Date:  1975-10       Impact factor: 1.570

Review 9.  Linkage disequilibrium in humans: models and data.

Authors:  J K Pritchard; M Przeworski
Journal:  Am J Hum Genet       Date:  2001-06-14       Impact factor: 11.025

10.  Conditional linkage disequilibrium analysis of a complex disease superlocus, IDDM1 in the HLA region, reveals the presence of independent modifying gene effects influencing the type 1 diabetes risk encoded by the major HLA-DQB1, -DRB1 disease loci.

Authors:  P Zavattari; R Lampis; C Motzo; M Loddo; A Mulargia; M Whalen; M Maioli; E Angius; J A Todd; F Cucca
Journal:  Hum Mol Genet       Date:  2001-04-01       Impact factor: 6.150

View more
  3 in total

1.  Improved Prediction of Bacterial Genotype-Phenotype Associations Using Interpretable Pangenome-Spanning Regressions.

Authors:  John A Lees; T Tien Mai; Marco Galardini; Nicole E Wheeler; Samuel T Horsfield; Julian Parkhill; Jukka Corander
Journal:  mBio       Date:  2020-07-07       Impact factor: 7.867

2.  Detecting PCOS susceptibility loci from genome-wide association studies via iterative trend correlation based feature screening.

Authors:  Xiaotian Dai; Guifang Fu; Randall Reese
Journal:  BMC Bioinformatics       Date:  2020-05-04       Impact factor: 3.169

Review 3.  Statistical Learning Methods Applicable to Genome-Wide Association Studies on Unbalanced Case-Control Disease Data.

Authors:  Xiaotian Dai; Guifang Fu; Shaofei Zhao; Yifei Zeng
Journal:  Genes (Basel)       Date:  2021-05-13       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.