Literature DB >> 33476339

RAFFI: Accurate and fast familial relationship inference in large scale biobank studies using RaPID.

Ardalan Naseri1, Junjie Shi2, Xihong Lin3,4,5, Shaojie Zhang6, Degui Zhi1,7.   

Abstract

Inference of relationships from whole-genome genetic data of a cohort is a crucial prerequisite for genome-wide association studies. Typically, relationships are inferred by computing the kinship coefficients (ϕ) and the genome-wide probability of zero IBD sharing (π0) among all pairs of individuals. Current leading methods are based on pairwise comparisons, which may not scale up to very large cohorts (e.g., sample size >1 million). Here, we propose an efficient relationship inference method, RAFFI. RAFFI leverages the efficient RaPID method to call IBD segments first, then estimate the ϕ and π0 from detected IBD segments. This inference is achieved by a data-driven approach that adjusts the estimation based on phasing quality and genotyping quality. Using simulations, we showed that RAFFI is robust against phasing/genotyping errors, admix events, and varying marker densities, and achieves higher accuracy compared to KING, the current leading method, especially for more distant relatives. When applied to the phased UK Biobank data with ~500K individuals, RAFFI is approximately 18 times faster than KING. We expect RAFFI will offer fast and accurate relatedness inference for even larger cohorts.

Entities:  

Year:  2021        PMID: 33476339      PMCID: PMC7853505          DOI: 10.1371/journal.pgen.1009315

Source DB:  PubMed          Journal:  PLoS Genet        ISSN: 1553-7390            Impact factor:   5.917


  28 in total

1.  Estimating kinship in admixed populations.

Authors:  Timothy Thornton; Hua Tang; Thomas J Hoffmann; Heather M Ochs-Balcom; Bette J Caan; Neil Risch
Journal:  Am J Hum Genet       Date:  2012-06-28       Impact factor: 11.025

Review 2.  Identity by descent: variation in meiosis, across genomes, and in populations.

Authors:  Elizabeth A Thompson
Journal:  Genetics       Date:  2013-06       Impact factor: 4.562

3.  Maximum-likelihood estimation of recent shared ancestry (ERSA).

Authors:  Chad D Huff; David J Witherspoon; Tatum S Simonson; Jinchuan Xing; W Scott Watkins; Yuhua Zhang; Therese M Tuohy; Deborah W Neklason; Randall W Burt; Stephen L Guthery; Scott R Woodward; Lynn B Jorde
Journal:  Genome Res       Date:  2011-02-08       Impact factor: 9.043

4.  SeqArray-a storage-efficient high-performance data format for WGS variant calls.

Authors:  Xiuwen Zheng; Stephanie M Gogarten; Michael Lawrence; Adrienne Stilp; Matthew P Conomos; Bruce S Weir; Cathy Laurie; David Levine
Journal:  Bioinformatics       Date:  2017-08-01       Impact factor: 6.937

Review 5.  Relatedness in the post-genomic era: is it still useful?

Authors:  Doug Speed; David J Balding
Journal:  Nat Rev Genet       Date:  2014-11-18       Impact factor: 53.242

6.  The "All of Us" Research Program.

Authors:  Joshua C Denny; Joni L Rutter; David B Goldstein; Anthony Philippakis; Jordan W Smoller; Gwynne Jenkins; Eric Dishman
Journal:  N Engl J Med       Date:  2019-08-15       Impact factor: 176.079

7.  RaPID: ultra-fast, powerful, and accurate detection of segments identical by descent (IBD) in biobank-scale cohorts.

Authors:  Ardalan Naseri; Xiaoming Liu; Kecong Tang; Shaojie Zhang; Degui Zhi
Journal:  Genome Biol       Date:  2019-07-25       Impact factor: 13.583

8.  A mixed-model approach for genome-wide association studies of correlated traits in structured populations.

Authors:  Arthur Korte; Bjarni J Vilhjálmsson; Vincent Segura; Alexander Platt; Quan Long; Magnus Nordborg
Journal:  Nat Genet       Date:  2012-08-19       Impact factor: 38.330

9.  Reference-based phasing using the Haplotype Reference Consortium panel.

Authors:  Po-Ru Loh; Petr Danecek; Pier Francesco Palamara; Christian Fuchsberger; Yakir A Reshef; Hilary K Finucane; Sebastian Schoenherr; Lukas Forer; Shane McCarthy; Goncalo R Abecasis; Richard Durbin; Alkes L Price
Journal:  Nat Genet       Date:  2016-10-03       Impact factor: 38.330

10.  Accurate, scalable and integrative haplotype estimation.

Authors:  Olivier Delaneau; Jean-François Zagury; Matthew R Robinson; Jonathan L Marchini; Emmanouil T Dermitzakis
Journal:  Nat Commun       Date:  2019-11-28       Impact factor: 14.919

View more
  1 in total

1.  Evaluating the utility of identity-by-descent segment numbers for relatedness inference via information theory and classification.

Authors:  Jesse Smith; Ying Qiao; Amy L Williams
Journal:  G3 (Bethesda)       Date:  2022-05-30       Impact factor: 3.542

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.