Literature DB >> 35230682

Somatic and Germline Variant Calling from Next-Generation Sequencing Data.

Ti-Cheng Chang1, Ke Xu2, Zhongshan Cheng2, Gang Wu2.   

Abstract

Re-sequencing of the human genome by next-generation sequencing (NGS) has been widely applied to discover pathogenic genetic variants and/or causative genes accounting for various types of diseases including cancers. The advances in NGS have allowed the sequencing of the entire genome of patients and identification of disease-associated variants in a reasonable timeframe and cost. The core of the variant identification relies on accurate variant calling and annotation. Numerous algorithms have been developed to elucidate the repertoire of somatic and germline variants. Each algorithm has its own distinct strengths, weaknesses, and limitations due to the difference in the statistical modeling approach adopted and read information utilized. Accurate variant calling remains challenging due to the presence of sequencing artifacts and read misalignments. All of these can lead to the discordance of the variant calling results and even misinterpretation of the discovery. For somatic variant detection, multiple factors including chromosomal abnormalities, tumor heterogeneity, tumor-normal cross contaminations, unbalanced tumor/normal sample coverage, and variants with low allele frequencies add even more layers of complexity to accurate variant identification. Given the discordances and difficulties, ensemble approaches have emerged by harmonizing information from different algorithms to improve variant calling performance. In this chapter, we first introduce the general scheme of variant calling algorithms and potential challenges at distinct stages. We next review the existing workflows of variant calling and annotation, and finally explore the strategies deployed by different callers as well as their strengths and caveats. Overall, NGS-based variant identification with careful consideration allows reliable detection of pathogenic variant and candidate variant selection for precision medicine.
© 2022. Springer Nature Switzerland AG.

Entities:  

Keywords:  Contamination; Ensemble variant calling; Germline variant; Low-frequency variants; Machine learning; Next-generation sequencing; Single-cell sequencing; Somatic variant; Third-generation sequencing; Tumor-only variant calling; Variant annotation; Variant calling; Variant prioritization

Mesh:

Year:  2022        PMID: 35230682     DOI: 10.1007/978-3-030-91836-1_3

Source DB:  PubMed          Journal:  Adv Exp Med Biol        ISSN: 0065-2598            Impact factor:   2.622


  89 in total

1.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data.

Authors:  Aaron McKenna; Matthew Hanna; Eric Banks; Andrey Sivachenko; Kristian Cibulskis; Andrew Kernytsky; Kiran Garimella; David Altshuler; Stacey Gabriel; Mark Daly; Mark A DePristo
Journal:  Genome Res       Date:  2010-07-19       Impact factor: 9.043

2.  A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2011-09-08       Impact factor: 6.937

Review 3.  Toward better understanding of artifacts in variant calling from high-coverage samples.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2014-06-27       Impact factor: 6.937

Review 4.  Genotype and SNP calling from next-generation sequencing data.

Authors:  Rasmus Nielsen; Joshua S Paul; Anders Albrechtsen; Yun S Song
Journal:  Nat Rev Genet       Date:  2011-06       Impact factor: 53.242

5.  A universal SNP and small-indel variant caller using deep neural networks.

Authors:  Ryan Poplin; Pi-Chuan Chang; David Alexander; Scott Schwartz; Thomas Colthurst; Alexander Ku; Dan Newburger; Jojo Dijamco; Nam Nguyen; Pegah T Afshar; Sam S Gross; Lizzie Dorfman; Cory Y McLean; Mark A DePristo
Journal:  Nat Biotechnol       Date:  2018-09-24       Impact factor: 54.908

6.  A comprehensive custom panel evaluation for routine hereditary cancer testing: improving the yield of germline mutation detection.

Authors:  Carolina Velázquez; Enrique Lastra; Francisco Avila Cobos; Luis Abella; Virginia de la Cruz; Blanca Ascensión Hernando; Lara Hernández; Noemí Martínez; Mar Infante; Mercedes Durán
Journal:  J Transl Med       Date:  2020-06-10       Impact factor: 5.531

7.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

8.  A practice guideline from the American College of Medical Genetics and Genomics and the National Society of Genetic Counselors: referral indications for cancer predisposition assessment.

Authors:  Heather Hampel; Robin L Bennett; Adam Buchanan; Rachel Pearlman; Georgia L Wiesner
Journal:  Genet Med       Date:  2014-11-13       Impact factor: 8.822

9.  SpeedSeq: ultra-fast personal genome analysis and interpretation.

Authors:  Colby Chiang; Ryan M Layer; Gregory G Faust; Michael R Lindberg; David B Rose; Erik P Garrison; Gabor T Marth; Aaron R Quinlan; Ira M Hall
Journal:  Nat Methods       Date:  2015-08-10       Impact factor: 28.547

10.  Systematic comparison of variant calling pipelines using gold standard personal exome variants.

Authors:  Sohyun Hwang; Eiru Kim; Insuk Lee; Edward M Marcotte
Journal:  Sci Rep       Date:  2015-12-07       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.