Literature DB >> 34267211

Effective variant filtering and expected candidate variant yield in studies of rare human disease.

Brent S Pedersen1, Joe M Brown2, Harriet Dashnow2, Amelia D Wallace2, Matt Velinder2, Martin Tristani-Firouzi3, Joshua D Schiffman4, Tatiana Tvrdik5, Rong Mao6,7, D Hunter Best3,5,7, Pinar Bayrak-Toydemir6,7, Aaron R Quinlan2,8,9.   

Abstract

In studies of families with rare disease, it is common to screen for de novo mutations, as well as recessive or dominant variants that explain the phenotype. However, the filtering strategies and software used to prioritize high-confidence variants vary from study to study. In an effort to establish recommendations for rare disease research, we explore effective guidelines for variant (SNP and INDEL) filtering and report the expected number of candidates for de novo dominant, recessive, and autosomal dominant modes of inheritance. We derived these guidelines using two large family-based cohorts that underwent whole-genome sequencing, as well as two family cohorts with whole-exome sequencing. The filters are applied to common attributes, including genotype-quality, sequencing depth, allele balance, and population allele frequency. The resulting guidelines yield ~10 candidate SNP and INDEL variants per exome, and 18 per genome for recessive and de novo dominant modes of inheritance, with substantially more candidates for autosomal dominant inheritance. For family-based, whole-genome sequencing studies, this number includes an average of three de novo, ten compound heterozygous, one autosomal recessive, four X-linked variants, and roughly 100 candidate variants following autosomal dominant inheritance. The slivar software we developed to establish and rapidly apply these filters to VCF files is available at https://github.com/brentp/slivar under an MIT license, and includes documentation and recommendations for best practices for rare disease analysis.
© 2021. The Author(s).

Entities:  

Year:  2021        PMID: 34267211      PMCID: PMC8282602          DOI: 10.1038/s41525-021-00227-3

Source DB:  PubMed          Journal:  NPJ Genom Med        ISSN: 2056-7944            Impact factor:   8.617


  25 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.

Authors:  Pablo Cingolani; Adrian Platts; Le Lily Wang; Melissa Coon; Tung Nguyen; Luan Wang; Susan J Land; Xiangyi Lu; Douglas M Ruden
Journal:  Fly (Austin)       Date:  2012 Apr-Jun       Impact factor: 2.160

3.  Parental influence on human germline de novo mutations in 1,548 trios from Iceland.

Authors:  Hákon Jónsson; Patrick Sulem; Birte Kehr; Snaedis Kristmundsdottir; Florian Zink; Eirikur Hjartarson; Marteinn T Hardarson; Kristjan E Hjorleifsson; Hannes P Eggertsson; Sigurjon Axel Gudjonsson; Lucas D Ward; Gudny A Arnadottir; Einar A Helgason; Hannes Helgason; Arnaldur Gylfason; Adalbjorg Jonasdottir; Aslaug Jonasdottir; Thorunn Rafnar; Mike Frigge; Simon N Stacey; Olafur Th Magnusson; Unnur Thorsteinsdottir; Gisli Masson; Augustine Kong; Bjarni V Halldorsson; Agnar Helgason; Daniel F Gudbjartsson; Kari Stefansson
Journal:  Nature       Date:  2017-09-20       Impact factor: 49.962

4.  Nextflow enables reproducible computational workflows.

Authors:  Paolo Di Tommaso; Maria Chatzou; Evan W Floden; Pablo Prieto Barja; Emilio Palumbo; Cedric Notredame
Journal:  Nat Biotechnol       Date:  2017-04-11       Impact factor: 54.908

Review 5.  Toward better understanding of artifacts in variant calling from high-coverage samples.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2014-06-27       Impact factor: 6.937

6.  GEMINI: integrative exploration of genetic variation and genome annotations.

Authors:  Umadevi Paila; Brad A Chapman; Rory Kirchner; Aaron R Quinlan
Journal:  PLoS Comput Biol       Date:  2013-07-18       Impact factor: 4.475

7.  Analysis of protein-coding genetic variation in 60,706 humans.

Authors:  Monkol Lek; Konrad J Karczewski; Eric V Minikel; Kaitlin E Samocha; Eric Banks; Timothy Fennell; Anne H O'Donnell-Luria; James S Ware; Andrew J Hill; Beryl B Cummings; Taru Tukiainen; Daniel P Birnbaum; Jack A Kosmicki; Laramie E Duncan; Karol Estrada; Fengmei Zhao; James Zou; Emma Pierce-Hoffman; Joanne Berghout; David N Cooper; Nicole Deflaux; Mark DePristo; Ron Do; Jason Flannick; Menachem Fromer; Laura Gauthier; Jackie Goldstein; Namrata Gupta; Daniel Howrigan; Adam Kiezun; Mitja I Kurki; Ami Levy Moonshine; Pradeep Natarajan; Lorena Orozco; Gina M Peloso; Ryan Poplin; Manuel A Rivas; Valentin Ruano-Rubio; Samuel A Rose; Douglas M Ruderfer; Khalid Shakir; Peter D Stenson; Christine Stevens; Brett P Thomas; Grace Tiao; Maria T Tusie-Luna; Ben Weisburd; Hong-Hee Won; Dongmei Yu; David M Altshuler; Diego Ardissino; Michael Boehnke; John Danesh; Stacey Donnelly; Roberto Elosua; Jose C Florez; Stacey B Gabriel; Gad Getz; Stephen J Glatt; Christina M Hultman; Sekar Kathiresan; Markku Laakso; Steven McCarroll; Mark I McCarthy; Dermot McGovern; Ruth McPherson; Benjamin M Neale; Aarno Palotie; Shaun M Purcell; Danish Saleheen; Jeremiah M Scharf; Pamela Sklar; Patrick F Sullivan; Jaakko Tuomilehto; Ming T Tsuang; Hugh C Watkins; James G Wilson; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2016-08-18       Impact factor: 49.962

8.  Performance assessment of variant calling pipelines using human whole exome sequencing and simulated data.

Authors:  Manojkumar Kumaran; Umadevi Subramanian; Bharanidharan Devarajan
Journal:  BMC Bioinformatics       Date:  2019-06-17       Impact factor: 3.169

9.  An open resource for accurately benchmarking small variant and reference calls.

Authors:  Justin M Zook; Jennifer McDaniel; Nathan D Olson; Justin Wagner; Hemang Parikh; Haynes Heaton; Sean A Irvine; Len Trigg; Rebecca Truty; Cory Y McLean; Francisco M De La Vega; Chunlin Xiao; Stephen Sherry; Marc Salit
Journal:  Nat Biotechnol       Date:  2019-04-01       Impact factor: 54.908

10.  Large, three-generation human families reveal post-zygotic mosaicism and variability in germline mutation accumulation.

Authors:  Thomas A Sasani; Brent S Pedersen; Ziyue Gao; Lisa Baird; Molly Przeworski; Lynn B Jorde; Aaron R Quinlan
Journal:  Elife       Date:  2019-09-24       Impact factor: 8.140

View more
  11 in total

1.  Novel Disease-Associated Missense Single-Nucleotide Polymorphisms Variants Predication by Algorithms Tools and Molecular Dynamics Simulation of Human TCIRG1 Gene Causing Congenital Neutropenia and Osteopetrosis.

Authors:  Khyber Shinwari; Hafiz Muzzammel Rehman; Guojun Liu; Mikhail A Bolkov; Irina A Tuzankina; Valery A Chereshnev
Journal:  Front Mol Biosci       Date:  2022-04-28

2.  A spectrum of free software tools for processing the VCF variant call format: vcflib, bio-vcf, cyvcf2, hts-nim and slivar.

Authors:  Erik Garrison; Zev N Kronenberg; Eric T Dawson; Brent S Pedersen; Pjotr Prins
Journal:  PLoS Comput Biol       Date:  2022-05-31       Impact factor: 4.779

3.  A complete pedigree-based graph workflow for rare candidate variant analysis.

Authors:  Charles Markello; Charles Huang; Alex Rodriguez; Andrew Carroll; Pi-Chuan Chang; Jordan Eizenga; Thomas Markello; David Haussler; Benedict Paten
Journal:  Genome Res       Date:  2022-04-28       Impact factor: 9.438

4.  Case Review: Whole-Exome Sequencing Analyses Identify Carriers of a Known Likely Pathogenic Intronic BRCA1 Variant in Ovarian Cancer Cases Clinically Negative for Pathogenic BRCA1 and BRCA2 Variants.

Authors:  Wejdan M Alenezi; Caitlin T Fierheller; Timothée Revil; Corinne Serruya; Anne-Marie Mes-Masson; William D Foulkes; Diane Provencher; Zaki El Haffaf; Jiannis Ragoussis; Patricia N Tonin
Journal:  Genes (Basel)       Date:  2022-04-15       Impact factor: 4.141

5.  Studying mutation rate evolution in primates-the effects of computational pipelines and parameter choices.

Authors:  Susanne P Pfeifer
Journal:  Gigascience       Date:  2021-10-21       Impact factor: 6.524

6.  Structural variants in the Chinese population and their impact on phenotypes, diseases and population adaptation.

Authors:  Zhikun Wu; Zehang Jiang; Tong Li; Chuanbo Xie; Liansheng Zhao; Jiaqi Yang; Shuai Ouyang; Yizhi Liu; Tao Li; Zhi Xie
Journal:  Nat Commun       Date:  2021-11-11       Impact factor: 14.919

7.  A comprehensive WGS-based pipeline for the identification of new candidate genes in inherited retinal dystrophies.

Authors:  María González-Del Pozo; Elena Fernández-Suárez; Nereida Bravo-Gil; Cristina Méndez-Vidal; Marta Martín-Sánchez; Enrique Rodríguez-de la Rúa; Manuel Ramos-Jiménez; María José Morillo-Sánchez; Salud Borrego; Guillermo Antiñolo
Journal:  NPJ Genom Med       Date:  2022-03-04       Impact factor: 8.617

8.  Heterogeneous clinical features in Cockayne syndrome patients and siblings carrying the same CSA mutations.

Authors:  Asma Chikhaoui; Ichraf Kraoua; Nadège Calmels; Sami Bouchoucha; Cathy Obringer; Khouloud Zayoud; Benjamin Montagne; Ridha M'rad; Sonia Abdelhak; Vincent Laugel; Miria Ricchetti; Ilhem Turki; Houda Yacoub-Youssef
Journal:  Orphanet J Rare Dis       Date:  2022-03-05       Impact factor: 4.123

9.  Comprehensive variant calling from whole-genome sequencing identifies a complex inversion that disrupts ZFPM2 in familial congenital diaphragmatic hernia.

Authors:  Thomas J Nicholas; Najla Al-Sweel; Andrew Farrell; Rong Mao; Pinar Bayrak-Toydemir; Christine E Miller; Dawn Bentley; Rachel Palmquist; Barry Moore; Edgar J Hernandez; Michael J Cormier; Eric Fredrickson; Katherine Noble; Shawn Rynearson; Carson Holt; Mary Anne Karren; Joshua L Bonkowsky; Martin Tristani-Firouzi; Mark Yandell; Gabor Marth; Aaron R Quinlan; Luca Brunelli; Reha M Toydemir; Brian J Shayota; John C Carey; Steven E Boyden; Sabrina Malone Jenkins
Journal:  Mol Genet Genomic Med       Date:  2022-02-04       Impact factor: 2.183

10.  Clin.iobio: A Collaborative Diagnostic Workflow to Enable Team-Based Precision Genomics.

Authors:  Alistair Ward; Matt Velinder; Tonya Di Sera; Aditya Ekawade; Sabrina Malone Jenkins; Barry Moore; Rong Mao; Pinar Bayrak-Toydemir; Gabor Marth
Journal:  J Pers Med       Date:  2022-01-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.