Literature DB >> 34129815

Exome variant discrepancies due to reference-genome differences.

He Li1, Moez Dawood2, Michael M Khayat1, Jesse R Farek1, Shalini N Jhangiani1, Ziad M Khan1, Tadahiro Mitani3, Zeynep Coban-Akdemir4, James R Lupski5, Eric Venner1, Jennifer E Posey3, Aniko Sabo1, Richard A Gibbs6.   

Abstract

Despite release of the GRCh38 human reference genome more than seven years ago, GRCh37 remains more widely used by most research and clinical laboratories. To date, no study has quantified the impact of utilizing different reference assemblies for the identification of variants associated with rare and common diseases from large-scale exome-sequencing data. By calling variants on both the GRCh37 and GRCh38 references, we identified single-nucleotide variants (SNVs) and insertion-deletions (indels) in 1,572 exomes from participants with Mendelian diseases and their family members. We found that a total of 1.5% of SNVs and 2.0% of indels were discordant when different references were used. Notably, 76.6% of the discordant variants were clustered within discrete discordant reference patches (DISCREPs) comprising only 0.9% of loci targeted by exome sequencing. These DISCREPs were enriched for genomic elements including segmental duplications, fix patch sequences, and loci known to contain alternate haplotypes. We identified 206 genes significantly enriched for discordant variants, most of which were in DISCREPs and caused by multi-mapped reads on the reference assembly that lacked the variant call. Among these 206 genes, eight are implicated in known Mendelian diseases and 53 are associated with common phenotypes from genome-wide association studies. In addition, variant interpretations could also be influenced by the reference after lifting-over variant loci to another assembly. Overall, we identified genes and genomic loci affected by reference assembly choice, including genes associated with Mendelian disorders and complex human diseases that require careful evaluation in both research and clinical applications.
Copyright © 2021 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  GRCh37; GRCh38; Human Genome Reference; clinical genome sequencing; exome sequencing; hg19

Mesh:

Year:  2021        PMID: 34129815      PMCID: PMC8322936          DOI: 10.1016/j.ajhg.2021.05.011

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  50 in total

1.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

2.  Lessons Learned from Large-Scale, First-Tier Clinical Exome Sequencing in a Highly Consanguineous Population.

Authors:  Dorota Monies; Mohammed Abouelhoda; Mirna Assoum; Nabil Moghrabi; Rafiullah Rafiullah; Naif Almontashiri; Mohammed Alowain; Hamad Alzaidan; Moeen Alsayed; Shazia Subhani; Edward Cupler; Maha Faden; Amal Alhashem; Alya Qari; Aziza Chedrawi; Hisham Aldhalaan; Wesam Kurdi; Sameena Khan; Zuhair Rahbeeni; Maha Alotaibi; Ewa Goljan; Hadeel Elbardisy; Mohamed ElKalioby; Zeeshan Shah; Hibah Alruwaili; Amal Jaafar; Ranad Albar; Asma Akilan; Hamsa Tayeb; Asma Tahir; Mohammed Fawzy; Mohammed Nasr; Shaza Makki; Abdullah Alfaifi; Hanna Akleh; Suad Yamani; Dalal Bubshait; Mohammed Mahnashi; Talal Basha; Afaf Alsagheir; Musad Abu Khaled; Khalid Alsaleem; Maisoon Almugbel; Manal Badawi; Fahad Bashiri; Saeed Bohlega; Raashida Sulaiman; Ehab Tous; Syed Ahmed; Talal Algoufi; Hamoud Al-Mousa; Emadia Alaki; Susan Alhumaidi; Hadeel Alghamdi; Malak Alghamdi; Ahmed Sahly; Shapar Nahrir; Ali Al-Ahmari; Hisham Alkuraya; Ali Almehaidib; Mohammed Abanemai; Fahad Alsohaibaini; Bandar Alsaud; Rand Arnaout; Ghada M H Abdel-Salam; Hasan Aldhekri; Suzan AlKhater; Khalid Alqadi; Essam Alsabban; Turki Alshareef; Khalid Awartani; Hanaa Banjar; Nada Alsahan; Ibraheem Abosoudah; Abdullah Alashwal; Wajeeh Aldekhail; Sami Alhajjar; Sulaiman Al-Mayouf; Abdulaziz Alsemari; Walaa Alshuaibi; Saeed Altala; Abdulhadi Altalhi; Salah Baz; Muddathir Hamad; Tariq Abalkhail; Badi Alenazi; Alya Alkaff; Fahad Almohareb; Fuad Al Mutairi; Mona Alsaleh; Abdullah Alsonbul; Somaya Alzelaye; Shakir Bahzad; Abdulaziz Bin Manee; Ola Jarrad; Neama Meriki; Bassem Albeirouti; Amal Alqasmi; Mohammed AlBalwi; Nawal Makhseed; Saeed Hassan; Isam Salih; Mustafa A Salih; Marwan Shaheen; Saadeh Sermin; Shamsad Shahrukh; Shahrukh Hashmi; Ayman Shawli; Ameen Tajuddin; Abdullah Tamim; Ahmed Alnahari; Ibrahim Ghemlas; Maged Hussein; Sami Wali; Hatem Murad; Brian F Meyer; Fowzan S Alkuraya
Journal:  Am J Hum Genet       Date:  2019-05-23       Impact factor: 11.025

Review 3.  Long-read human genome sequencing and its applications.

Authors:  Glennis A Logsdon; Mitchell R Vollger; Evan E Eichler
Journal:  Nat Rev Genet       Date:  2020-06-05       Impact factor: 53.242

4.  Benchmark study comparing liftover tools for genome conversion of epigenome sequencing data.

Authors:  Phuc-Loi Luu; Phuc-Thinh Ong; Thanh-Phuoc Dinh; Susan J Clark
Journal:  NAR Genom Bioinform       Date:  2020-08-06

5.  The NIH Roadmap Epigenomics Mapping Consortium.

Authors:  Bradley E Bernstein; John A Stamatoyannopoulos; Joseph F Costello; Bing Ren; Aleksandar Milosavljevic; Alexander Meissner; Manolis Kellis; Marco A Marra; Arthur L Beaudet; Joseph R Ecker; Peggy J Farnham; Martin Hirst; Eric S Lander; Tarjei S Mikkelsen; James A Thomson
Journal:  Nat Biotechnol       Date:  2010-10       Impact factor: 54.908

6.  A map of human genome variation from population-scale sequencing.

Authors:  Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin; Richard A Gibbs; Matt E Hurles; Gil A McVean
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

7.  Clinical whole-exome sequencing for the diagnosis of mendelian disorders.

Authors:  Yaping Yang; Donna M Muzny; Jeffrey G Reid; Matthew N Bainbridge; Alecia Willis; Patricia A Ward; Alicia Braxton; Joke Beuten; Fan Xia; Zhiyv Niu; Matthew Hardison; Richard Person; Mir Reza Bekheirnia; Magalie S Leduc; Amelia Kirby; Peter Pham; Jennifer Scull; Min Wang; Yan Ding; Sharon E Plon; James R Lupski; Arthur L Beaudet; Richard A Gibbs; Christine M Eng
Journal:  N Engl J Med       Date:  2013-10-02       Impact factor: 91.245

8.  ClinVar: improving access to variant interpretations and supporting evidence.

Authors:  Melissa J Landrum; Jennifer M Lee; Mark Benson; Garth R Brown; Chen Chao; Shanmuga Chitipiralla; Baoshan Gu; Jennifer Hart; Douglas Hoffman; Wonhee Jang; Karen Karapetyan; Kenneth Katz; Chunlei Liu; Zenith Maddipatla; Adriana Malheiro; Kurt McDaniel; Michael Ovetsky; George Riley; George Zhou; J Bradley Holmes; Brandi L Kattman; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

9.  Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects.

Authors:  Allison A Regier; Yossi Farjoun; David E Larson; Olga Krasheninina; Hyun Min Kang; Daniel P Howrigan; Bo-Juen Chen; Manisha Kher; Eric Banks; Darren C Ames; Adam C English; Heng Li; Jinchuan Xing; Yeting Zhang; Tara Matise; Goncalo R Abecasis; Will Salerno; Michael C Zody; Benjamin M Neale; Ira M Hall
Journal:  Nat Commun       Date:  2018-10-02       Impact factor: 14.919

10.  Systematic comparison of germline variant calling pipelines cross multiple next-generation sequencers.

Authors:  Jiayun Chen; Xingsong Li; Hongbin Zhong; Yuhuan Meng; Hongli Du
Journal:  Sci Rep       Date:  2019-06-27       Impact factor: 4.379

View more
  8 in total

Review 1.  Advancing genomic technologies and clinical awareness accelerates discovery of disease-associated tandem repeat sequences.

Authors:  Terence Gall-Duncan; Nozomu Sato; Ryan K C Yuen; Christopher E Pearson
Journal:  Genome Res       Date:  2021-12-29       Impact factor: 9.438

2.  PanCancer analysis of somatic mutations in repetitive regions reveals recurrent mutations in snRNA U2.

Authors:  Pablo Bousquets-Muñoz; Ander Díaz-Navarro; Ferran Nadeu; Ana Sánchez-Pitiot; Sara López-Tamargo; Shimin Shuai; Milagros Balbín; Jose M C Tubio; Sílvia Beà; Jose I Martin-Subero; Ana Gutiérrez-Fernández; Lincoln D Stein; Elías Campo; Xose S Puente
Journal:  NPJ Genom Med       Date:  2022-03-14       Impact factor: 6.083

Review 3.  Methods to Improve Molecular Diagnosis in Genomic Cold Cases in Pediatric Neurology.

Authors:  Magda K Kadlubowska; Isabelle Schrauwen
Journal:  Genes (Basel)       Date:  2022-02-11       Impact factor: 4.096

4.  A joint NCBI and EMBL-EBI transcript set for clinical genomics and research.

Authors:  Joannella Morales; Shashikant Pujar; Jane E Loveland; Alex Astashyn; Ruth Bennett; Andrew Berry; Eric Cox; Claire Davidson; Olga Ermolaeva; Catherine M Farrell; Reham Fatima; Laurent Gil; Tamara Goldfarb; Jose M Gonzalez; Diana Haddad; Matthew Hardy; Toby Hunt; John Jackson; Vinita S Joardar; Michael Kay; Vamsi K Kodali; Kelly M McGarvey; Aoife McMahon; Jonathan M Mudge; Daniel N Murphy; Michael R Murphy; Bhanu Rajput; Sanjida H Rangwala; Lillian D Riddick; Françoise Thibaud-Nissen; Glen Threadgold; Anjana R Vatsan; Craig Wallin; David Webb; Paul Flicek; Ewan Birney; Kim D Pruitt; Adam Frankish; Fiona Cunningham; Terence D Murphy
Journal:  Nature       Date:  2022-04-06       Impact factor: 49.962

5.  Comprehensive short and long read sequencing analysis for the Gaucher and Parkinson's disease-associated GBA gene.

Authors:  Marco Toffoli; Xiao Chen; Michael A Eberle; Christos Proukakis; Fritz J Sedlazeck; Chiao-Yin Lee; Stephen Mullin; Abigail Higgins; Sofia Koletsi; Monica Emili Garcia-Segura; Esther Sammler; Sonja W Scholz; Anthony H V Schapira
Journal:  Commun Biol       Date:  2022-07-06

6.  Gene-Based Variant Analysis of Whole-Exome Sequencing in Relation to Eosinophil Count.

Authors:  Julia Höglund; Fatemeh Hadizadeh; Weronica E Ek; Torgny Karlsson; Åsa Johansson
Journal:  Front Immunol       Date:  2022-07-22       Impact factor: 8.786

Review 7.  DECIPHER: Supporting the interpretation and sharing of rare disease phenotype-linked variant data to advance diagnosis and research.

Authors:  Julia Foreman; Simon Brent; Daniel Perrett; Andrew P Bevan; Sarah E Hunt; Fiona Cunningham; Matthew E Hurles; Helen V Firth
Journal:  Hum Mutat       Date:  2022-02-21       Impact factor: 4.700

8.  Quality control of large genome datasets.

Authors:  Max Robinson; Arpita Joshi; Ansh Vidyarthi; Mary Maccoun; Sanjay Rangavajjhala; Gustavo Glusman
Journal:  HGG Adv       Date:  2022-06-07
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.