Literature DB >> 30591557

Blacklisting variants common in private cohorts but not in public databases optimizes human exome analysis.

Patrick Maffucci1,2,3, Benedetta Bigio1,4,5, Franck Rapaport1, Aurélie Cobat4,5, Alessandro Borghesi6, Marie Lopez7,8,9, Etienne Patin7,8,9, Alexandre Bolze10, Lei Shang1, Matthieu Bendavid1, Eric M Scott11, Peter D Stenson12, Charlotte Cunningham-Rundles2,3, David N Cooper12, Joseph G Gleeson11,13, Jacques Fellay6, Lluis Quintana-Murci7,8,9, Jean-Laurent Casanova14,4,5,13,15, Laurent Abel1,4,5, Bertrand Boisson1,4,5, Yuval Itan14,16,17.   

Abstract

Computational analyses of human patient exomes aim to filter out as many nonpathogenic genetic variants (NPVs) as possible, without removing the true disease-causing mutations. This involves comparing the patient's exome with public databases to remove reported variants inconsistent with disease prevalence, mode of inheritance, or clinical penetrance. However, variants frequent in a given exome cohort, but absent or rare in public databases, have also been reported and treated as NPVs, without rigorous exploration. We report the generation of a blacklist of variants frequent within an in-house cohort of 3,104 exomes. This blacklist did not remove known pathogenic mutations from the exomes of 129 patients and decreased the number of NPVs remaining in the 3,104 individual exomes by a median of 62%. We validated this approach by testing three other independent cohorts of 400, 902, and 3,869 exomes. The blacklist generated from any given cohort removed a substantial proportion of NPVs (11-65%). We analyzed the blacklisted variants computationally and experimentally. Most of the blacklisted variants corresponded to false signals generated by incomplete reference genome assembly, location in low-complexity regions, bioinformatic misprocessing, or limitations inherent to cohort-specific private alleles (e.g., due to sequencing kits, and genetic ancestries). Finally, we provide our precalculated blacklists, together with ReFiNE, a program for generating customized blacklists from any medium-sized or large in-house cohort of exome (or other next-generation sequencing) data via a user-friendly public web server. This work demonstrates the power of extracting variant blacklists from private databases as a specific in-house but broadly applicable tool for optimizing exome analysis.

Entities:  

Keywords:  WES analysis; WES annotation; blacklist; exome; variant

Mesh:

Year:  2018        PMID: 30591557      PMCID: PMC6338851          DOI: 10.1073/pnas.1808403116

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  46 in total

1.  Whole-genome sequencing is more powerful than whole-exome sequencing for detecting exome variants.

Authors:  Aziz Belkadi; Alexandre Bolze; Yuval Itan; Aurélie Cobat; Quentin B Vincent; Alexander Antipenko; Lei Shang; Bertrand Boisson; Jean-Laurent Casanova; Laurent Abel
Journal:  Proc Natl Acad Sci U S A       Date:  2015-03-31       Impact factor: 11.205

2.  Detecting false-positive signals in exome sequencing.

Authors:  Karin V Fuentes Fajardo; David Adams; Christopher E Mason; Murat Sincan; Cynthia Tifft; Camilo Toro; Cornelius F Boerkoel; William Gahl; Thomas Markello
Journal:  Hum Mutat       Date:  2012-03-05       Impact factor: 4.878

3.  Human Gene Mutation Database (HGMD): 2003 update.

Authors:  Peter D Stenson; Edward V Ball; Matthew Mort; Andrew D Phillips; Jacqueline A Shiel; Nick S T Thomas; Shaun Abeysinghe; Michael Krawczak; David N Cooper
Journal:  Hum Mutat       Date:  2003-06       Impact factor: 4.878

4.  The human gene connectome as a map of short cuts for morbid allele discovery.

Authors:  Yuval Itan; Shen-Ying Zhang; Guillaume Vogt; Avinash Abhyankar; Melina Herman; Patrick Nitschke; Dror Fried; Lluis Quintana-Murci; Laurent Abel; Jean-Laurent Casanova
Journal:  Proc Natl Acad Sci U S A       Date:  2013-03-18       Impact factor: 11.205

Review 5.  Sequencing studies in human genetics: design and interpretation.

Authors:  David B Goldstein; Andrew Allen; Jonathan Keebler; Elliott H Margulies; Steven Petrou; Slavé Petrovski; Shamil Sunyaev
Journal:  Nat Rev Genet       Date:  2013-06-11       Impact factor: 53.242

6.  A framework for variation discovery and genotyping using next-generation DNA sequencing data.

Authors:  Mark A DePristo; Eric Banks; Ryan Poplin; Kiran V Garimella; Jared R Maguire; Christopher Hartl; Anthony A Philippakis; Guillermo del Angel; Manuel A Rivas; Matt Hanna; Aaron McKenna; Tim J Fennell; Andrew M Kernytsky; Andrey Y Sivachenko; Kristian Cibulskis; Stacey B Gabriel; David Altshuler; Mark J Daly
Journal:  Nat Genet       Date:  2011-04-10       Impact factor: 38.330

7.  Analysis of protein-coding genetic variation in 60,706 humans.

Authors:  Monkol Lek; Konrad J Karczewski; Eric V Minikel; Kaitlin E Samocha; Eric Banks; Timothy Fennell; Anne H O'Donnell-Luria; James S Ware; Andrew J Hill; Beryl B Cummings; Taru Tukiainen; Daniel P Birnbaum; Jack A Kosmicki; Laramie E Duncan; Karol Estrada; Fengmei Zhao; James Zou; Emma Pierce-Hoffman; Joanne Berghout; David N Cooper; Nicole Deflaux; Mark DePristo; Ron Do; Jason Flannick; Menachem Fromer; Laura Gauthier; Jackie Goldstein; Namrata Gupta; Daniel Howrigan; Adam Kiezun; Mitja I Kurki; Ami Levy Moonshine; Pradeep Natarajan; Lorena Orozco; Gina M Peloso; Ryan Poplin; Manuel A Rivas; Valentin Ruano-Rubio; Samuel A Rose; Douglas M Ruderfer; Khalid Shakir; Peter D Stenson; Christine Stevens; Brett P Thomas; Grace Tiao; Maria T Tusie-Luna; Ben Weisburd; Hong-Hee Won; Dongmei Yu; David M Altshuler; Diego Ardissino; Michael Boehnke; John Danesh; Stacey Donnelly; Roberto Elosua; Jose C Florez; Stacey B Gabriel; Gad Getz; Stephen J Glatt; Christina M Hultman; Sekar Kathiresan; Markku Laakso; Steven McCarroll; Mark I McCarthy; Dermot McGovern; Ruth McPherson; Benjamin M Neale; Aarno Palotie; Shaun M Purcell; Danish Saleheen; Jeremiah M Scharf; Pamela Sklar; Patrick F Sullivan; Jaakko Tuomilehto; Ming T Tsuang; Hugh C Watkins; James G Wilson; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2016-08-18       Impact factor: 49.962

8.  Evaluating Variant Calling Tools for Non-Matched Next-Generation Sequencing Data.

Authors:  Sarah Sandmann; Aniek O de Graaf; Mohsen Karimi; Bert A van der Reijden; Eva Hellström-Lindberg; Joop H Jansen; Martin Dugas
Journal:  Sci Rep       Date:  2017-02-24       Impact factor: 4.379

9.  Systematic comparison of variant calling pipelines using gold standard personal exome variants.

Authors:  Sohyun Hwang; Eiru Kim; Insuk Lee; Edward M Marcotte
Journal:  Sci Rep       Date:  2015-12-07       Impact factor: 4.379

10.  Using high-resolution variant frequencies to empower clinical genome interpretation.

Authors:  Nicola Whiffin; Eric Minikel; Roddy Walsh; Anne H O'Donnell-Luria; Konrad Karczewski; Alexander Y Ing; Paul J R Barton; Birgit Funke; Stuart A Cook; Daniel MacArthur; James S Ware
Journal:  Genet Med       Date:  2017-05-18       Impact factor: 8.822

View more
  21 in total

1.  Reply: CYLD variants in frontotemporal dementia associated with severe memory impairment in a Portuguese cohort.

Authors:  Lisa J Oyston; Zac Chatterton; Marianne Hallupp; Neil Rajan; John B Kwok; Carol Dobson-Stone
Journal:  Brain       Date:  2020-08-01       Impact factor: 13.501

2.  Children's rare disease cohorts: an integrative research and clinical genomics initiative.

Authors:  Shira Rockowitz; Nicholas LeCompte; Mary Carmack; Andrew Quitadamo; Lily Wang; Meredith Park; Devon Knight; Emma Sexton; Lacey Smith; Beth Sheidley; Michael Field; Ingrid A Holm; Catherine A Brownstein; Pankaj B Agrawal; Susan Kornetsky; Annapurna Poduri; Scott B Snapper; Alan H Beggs; Timothy W Yu; David A Williams; Piotr Sliz
Journal:  NPJ Genom Med       Date:  2020-07-06       Impact factor: 8.617

3.  Homozygous NLRP1 gain-of-function mutation in siblings with a syndromic form of recurrent respiratory papillomatosis.

Authors:  Scott B Drutman; Filomeen Haerynck; Franklin L Zhong; David Hum; Nicholas J Hernandez; Serkan Belkaya; Franck Rapaport; Sarah Jill de Jong; David Creytens; Simon J Tavernier; Katrien Bonte; Sofie De Schepper; Jutte van der Werff Ten Bosch; Lazaro Lorenzo-Diaz; Andy Wullaert; Xavier Bossuyt; Gérard Orth; Vincent R Bonagura; Vivien Béziat; Laurent Abel; Emmanuelle Jouanguy; Bruno Reversade; Jean-Laurent Casanova
Journal:  Proc Natl Acad Sci U S A       Date:  2019-09-04       Impact factor: 11.205

4.  Multifocal organoids reveal clonal associations between synchronous intestinal tumors with pervasive heterogeneous drug responses.

Authors:  Nahyun Jeong; Soon-Chan Kim; Ji Won Park; Seul Gi Park; Ki-Hoan Nam; Ja Oh Lee; Young-Kyoung Shin; Jeong Mo Bae; Seung-Yong Jeong; Min Jung Kim; Ja-Lok Ku
Journal:  NPJ Genom Med       Date:  2022-07-19       Impact factor: 6.083

5.  Investigating the characteristics of genes and variants associated with self-reported hearing difficulty in older adults in the UK Biobank.

Authors:  Morag A Lewis; Bradley A Schulte; Judy R Dubno; Karen P Steel
Journal:  BMC Biol       Date:  2022-06-27       Impact factor: 7.364

6.  Inherited human c-Rel deficiency disrupts myeloid and lymphoid immunity to multiple infectious agents.

Authors:  Romain Lévy; David Langlais; Vivien Béziat; Franck Rapaport; Geetha Rao; Tomi Lazarov; Mathieu Bourgey; Yu J Zhou; Coralie Briand; Kunihiko Moriya; Fatima Ailal; Danielle T Avery; Janet Markle; Ai Ing Lim; Masato Ogishi; Rui Yang; Simon Pelham; Mehdi Emam; Mélanie Migaud; Caroline Deswarte; Tanwir Habib; Luis R Saraiva; Eman A Moussa; Andrea Guennoun; Bertrand Boisson; Serkan Belkaya; Ruben Martinez-Barricarte; Jérémie Rosain; Aziz Belkadi; Sylvain Breton; Kathryn Payne; Ibtihal Benhsaien; Alessandro Plebani; Vassilios Lougaris; James P Di Santo; Bénédicte Neven; Laurent Abel; Cindy S Ma; Ahmed Aziz Bousfiha; Nico Marr; Jacinta Bustamante; Kang Liu; Philippe Gros; Frédéric Geissmann; Stuart G Tangye; Jean-Laurent Casanova; Anne Puel
Journal:  J Clin Invest       Date:  2021-09-01       Impact factor: 19.456

7.  Current genetic landscape in common variable immune deficiency.

Authors:  Hassan Abolhassani; Lennart Hammarström; Charlotte Cunningham-Rundles
Journal:  Blood       Date:  2020-02-27       Impact factor: 22.113

8.  A computational approach for detecting physiological homogeneity in the midst of genetic heterogeneity.

Authors:  Peng Zhang; Aurélie Cobat; Yoon-Seung Lee; Yiming Wu; Cigdem Sevim Bayrak; Clémentine Boccon-Gibod; Daniela Matuozzo; Lazaro Lorenzo; Aayushee Jain; Soraya Boucherit; Louis Vallée; Burkhard Stüve; Stéphane Chabrier; Jean-Laurent Casanova; Laurent Abel; Shen-Ying Zhang; Yuval Itan
Journal:  Am J Hum Genet       Date:  2021-05-19       Impact factor: 11.025

9.  Inherited PD-1 deficiency underlies tuberculosis and autoimmunity in a child.

Authors:  Silvia Vilarinho; Richard P Lifton; Bertrand Boisson; Laurent Abel; Dusan Bogunovic; Nico Marr; Luigi D Notarangelo; Stuart G Tangye; Tasuku Honjo; Philippe Gros; Stéphanie Boisson-Dupuis; Jean-Laurent Casanova; Masato Ogishi; Rui Yang; Caner Aytekin; David Langlais; Mathieu Bourgey; Taushif Khan; Fatima Al Ali; Mahbuba Rahman; Ottavia M Delmonte; Maya Chrabieh; Peng Zhang; Conor Gruber; Simon J Pelham; András N Spaan; Jérémie Rosain; Wei-Te Lei; Scott Drutman; Matthew D Hellmann; Margaret K Callahan; Matthew Adamow; Phillip Wong; Jedd D Wolchok; Geetha Rao; Cindy S Ma; Yuka Nakajima; Tomonori Yaguchi; Kenji Chamoto; Samuel C Williams; Jean-Francois Emile; Flore Rozenberg; Michael S Glickman; Franck Rapaport; Gaspard Kerner; Garrett Allington; Ilhan Tezcan; Deniz Cagdas; Ferda O Hosnut; Figen Dogu; Aydan Ikinciogullari; V Koneti Rao; Leena Kainulainen; Vivien Béziat; Jacinta Bustamante
Journal:  Nat Med       Date:  2021-06-28       Impact factor: 53.440

10.  Detection of homozygous and hemizygous complete or partial exon deletions by whole-exome sequencing.

Authors:  Benedetta Bigio; Yoann Seeleuthner; Gaspard Kerner; Mélanie Migaud; Jérémie Rosain; Bertrand Boisson; Carla Nasca; Anne Puel; Jacinta Bustamante; Jean-Laurent Casanova; Laurent Abel; Aurelie Cobat
Journal:  NAR Genom Bioinform       Date:  2021-05-22
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.