Literature DB >> 25640419

Population genetics identifies challenges in analyzing rare variants.

Henry Richard Johnston1, Yijuan Hu, David J Cutler.   

Abstract

Geneticists have, for years, understood the nature of genome-wide association studies using common genomic variants. Recently, however, focus has shifted to the analysis of rare variants. This presents potential problems for researchers, as rare variants do not always behave in the same way common variants do, sometimes rendering decades of solid intuition moot. In this paper, we present examples of the differences between common and rare variants. We show why one must be significantly more careful about the origin of rare variants, and how failing to do so can lead to highly inflated type I error. We then explain how to best avoid such concerns with careful understanding and study design. Additionally, we demonstrate that a seemingly low error rate in next-generation sequencing can dramatically impact the false-positive rate for rare variants. This is due to the fact that rare variants are, by definition, seen infrequently, making it hard to distinguish between errors and real variants. Compounding this problem is the fact that the proportion of errors is likely to get worse, not better, with increasing sample size. One cannot simply scale their way up in order to solve this problem. Understanding these potential pitfalls is a key step in successfully identifying true associations between rare variants and diseases.
© 2015 Wiley Periodicals, Inc.

Entities:  

Keywords:  error rates; rare variants; whole genome sequencing

Mesh:

Year:  2015        PMID: 25640419      PMCID: PMC4366269          DOI: 10.1002/gepi.21881

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  8 in total

1.  On the number of segregating sites in genetical models without recombination.

Authors:  G A Watterson
Journal:  Theor Popul Biol       Date:  1975-04       Impact factor: 1.570

Review 2.  Five years of GWAS discovery.

Authors:  Peter M Visscher; Matthew A Brown; Mark I McCarthy; Jian Yang
Journal:  Am J Hum Genet       Date:  2012-01-13       Impact factor: 11.025

3.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

4.  Rare-variant association testing for sequencing data with the sequence kernel association test.

Authors:  Michael C Wu; Seunggeun Lee; Tianxi Cai; Yun Li; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2011-07-07       Impact factor: 11.025

5.  Microdeletions of 3q29 confer high risk for schizophrenia.

Authors:  Jennifer Gladys Mulle; Anne F Dodd; John A McGrath; Paula S Wolyniec; Adele A Mitchell; Amol C Shetty; Nara L Sobreira; David Valle; M Katharine Rudd; Glen Satten; David J Cutler; Ann E Pulver; Stephen T Warren
Journal:  Am J Hum Genet       Date:  2010-08-13       Impact factor: 11.025

6.  Recent explosive human population growth has resulted in an excess of rare genetic variants.

Authors:  Alon Keinan; Andrew G Clark
Journal:  Science       Date:  2012-05-11       Impact factor: 47.728

7.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

8.  Low concordance of multiple variant-calling pipelines: practical implications for exome and genome sequencing.

Authors:  Jason O'Rawe; Tao Jiang; Guangqing Sun; Yiyang Wu; Wei Wang; Jingchu Hu; Paul Bodily; Lifeng Tian; Hakon Hakonarson; W Evan Johnson; Zhi Wei; Kai Wang; Gholson J Lyon
Journal:  Genome Med       Date:  2013-03-27       Impact factor: 11.117

  8 in total
  9 in total

1.  Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data.

Authors:  Jaleal S Sanjak; Anthony D Long; Kevin R Thornton
Journal:  G3 (Bethesda)       Date:  2016-04-07       Impact factor: 3.154

2.  Testing Rare-Variant Association without Calling Genotypes Allows for Systematic Differences in Sequencing between Cases and Controls.

Authors:  Yi-Juan Hu; Peizhou Liao; H Richard Johnston; Andrew S Allen; Glen A Satten
Journal:  PLoS Genet       Date:  2016-05-06       Impact factor: 5.917

3.  Identifying tagging SNPs for African specific genetic variation from the African Diaspora Genome.

Authors:  Henry Richard Johnston; Yi-Juan Hu; Jingjing Gao; Timothy D O'Connor; Gonçalo R Abecasis; Genevieve L Wojcik; Christopher R Gignoux; Pierre-Antoine Gourraud; Antoine Lizee; Mark Hansen; Rob Genuario; Dave Bullis; Cindy Lawley; Eimear E Kenny; Carlos Bustamante; Terri H Beaty; Rasika A Mathias; Kathleen C Barnes; Zhaohui S Qin
Journal:  Sci Rep       Date:  2017-04-21       Impact factor: 4.379

4.  NLRP3 expression in mesencephalic neurons and characterization of a rare NLRP3 polymorphism associated with decreased risk of Parkinson's disease.

Authors:  Katharine M von Herrmann; Lucas A Salas; Eileen M Martinez; Alison L Young; Joseph M Howard; Mary S Feldman; Brock C Christensen; Owen M Wilkins; Stephen L Lee; William F Hickey; Matthew C Havrda
Journal:  NPJ Parkinsons Dis       Date:  2018-08-15

5.  P2X2 receptor subunit interfaces are missense variant hotspots, where mutations tend to increase apparent ATP affinity.

Authors:  Federica Gasparri; Debayan Sarkar; Sarune Bielickaite; Mette Homann Poulsen; Alexander Sebastian Hauser; Stephan Alexander Pless
Journal:  Br J Pharmacol       Date:  2022-03-29       Impact factor: 9.473

6.  PEMapper and PECaller provide a simplified approach to whole-genome sequencing.

Authors:  H Richard Johnston; Pankaj Chopra; Thomas S Wingo; Viren Patel; Michael P Epstein; Jennifer G Mulle; Stephen T Warren; Michael E Zwick; David J Cutler
Journal:  Proc Natl Acad Sci U S A       Date:  2017-02-21       Impact factor: 11.205

7.  One Size Doesn't Fit All - RefEditor: Building Personalized Diploid Reference Genome to Improve Read Mapping and Genotype Calling in Next Generation Sequencing Studies.

Authors:  Shuai Yuan; H Richard Johnston; Guosheng Zhang; Yun Li; Yi-Juan Hu; Zhaohui S Qin
Journal:  PLoS Comput Biol       Date:  2015-08-12       Impact factor: 4.475

8.  Allele balance bias identifies systematic genotyping errors and false disease associations.

Authors:  Francesc Muyas; Mattia Bosio; Anna Puig; Hana Susak; Laura Domènech; Georgia Escaramis; Luis Zapata; German Demidov; Xavier Estivill; Raquel Rabionet; Stephan Ossowski
Journal:  Hum Mutat       Date:  2018-11-23       Impact factor: 4.878

9.  CRL4-Cereblon complex in Thalidomide Embryopathy: a translational investigation.

Authors:  Thayne Woycinck Kowalski; Julia do Amaral Gomes; Gabriela Barreto Caldas Garcia; Lucas Rosa Fraga; Vanessa Rodrigues Paixao-Cortes; Mariana Recamonde-Mendoza; Maria Teresa Vieira Sanseverino; Lavinia Schuler-Faccini; Fernanda Sales Luiz Vianna
Journal:  Sci Rep       Date:  2020-01-21       Impact factor: 4.379

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.