Literature DB >> 30247488

A universal SNP and small-indel variant caller using deep neural networks.

Ryan Poplin1,2, Pi-Chuan Chang2, David Alexander2, Scott Schwartz2, Thomas Colthurst2, Alexander Ku2, Dan Newburger1, Jojo Dijamco1, Nam Nguyen1, Pegah T Afshar1, Sam S Gross1, Lizzie Dorfman1,2, Cory Y McLean1,2, Mark A DePristo1,2.   

Abstract

Despite rapid advances in sequencing technologies, accurately calling genetic variants present in an individual genome from billions of short, errorful sequence reads remains challenging. Here we show that a deep convolutional neural network can call genetic variation in aligned next-generation sequencing read data by learning statistical relationships between images of read pileups around putative variant and true genotype calls. The approach, called DeepVariant, outperforms existing state-of-the-art tools. The learned model generalizes across genome builds and mammalian species, allowing nonhuman sequencing projects to benefit from the wealth of human ground-truth data. We further show that DeepVariant can learn to call variants in a variety of sequencing technologies and experimental designs, including deep whole genomes from 10X Genomics and Ion Ampliseq exomes, highlighting the benefits of using more automated and generalizable techniques for variant calling.

Entities:  

Mesh:

Year:  2018        PMID: 30247488     DOI: 10.1038/nbt.4235

Source DB:  PubMed          Journal:  Nat Biotechnol        ISSN: 1087-0156            Impact factor:   54.908


  26 in total

1.  Mastering the game of Go with deep neural networks and tree search.

Authors:  David Silver; Aja Huang; Chris J Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou; Veda Panneershelvam; Marc Lanctot; Sander Dieleman; Dominik Grewe; John Nham; Nal Kalchbrenner; Ilya Sutskever; Timothy Lillicrap; Madeleine Leach; Koray Kavukcuoglu; Thore Graepel; Demis Hassabis
Journal:  Nature       Date:  2016-01-28       Impact factor: 49.962

2.  16GT: a fast and sensitive variant caller using a 16-genotype probabilistic model.

Authors:  Ruibang Luo; Michael C Schatz; Steven L Salzberg
Journal:  Gigascience       Date:  2017-07-01       Impact factor: 6.524

3.  Mapping short DNA sequencing reads and calling variants using mapping quality scores.

Authors:  Heng Li; Jue Ruan; Richard Durbin
Journal:  Genome Res       Date:  2008-08-19       Impact factor: 9.043

Review 4.  The role of replicates for error mitigation in next-generation sequencing.

Authors:  Kimberly Robasky; Nathan E Lewis; George M Church
Journal:  Nat Rev Genet       Date:  2013-12-10       Impact factor: 53.242

5.  Feature-based classifiers for somatic mutation detection in tumour-normal paired sequencing data.

Authors:  Jiarui Ding; Ali Bashashati; Andrew Roth; Arusha Oloumi; Kane Tse; Thomas Zeng; Gholamreza Haffari; Martin Hirst; Marco A Marra; Anne Condon; Samuel Aparicio; Sohrab P Shah
Journal:  Bioinformatics       Date:  2011-11-13       Impact factor: 6.937

6.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

7.  A reference data set of 5.4 million phased human variants validated by genetic inheritance from sequencing a three-generation 17-member pedigree.

Authors:  Michael A Eberle; Epameinondas Fritzilas; Peter Krusche; Morten Källberg; Benjamin L Moore; Mitchell A Bekritsky; Zamin Iqbal; Han-Yu Chuang; Sean J Humphray; Aaron L Halpern; Semyon Kruglyak; Elliott H Margulies; Gil McVean; David R Bentley
Journal:  Genome Res       Date:  2016-11-30       Impact factor: 9.043

8.  Shining a light on dark sequencing: characterising errors in Ion Torrent PGM data.

Authors:  Lauren M Bragg; Glenn Stone; Margaret K Butler; Philip Hugenholtz; Gene W Tyson
Journal:  PLoS Comput Biol       Date:  2013-04-11       Impact factor: 4.475

9.  Evaluation and optimisation of indel detection workflows for ion torrent sequencing of the BRCA1 and BRCA2 genes.

Authors:  Zhen Xuan Yeo; Joshua Chee Leong Wong; Steven G Rozen; Ann Siew Gek Lee
Journal:  BMC Genomics       Date:  2014-06-24       Impact factor: 3.969

10.  Medical implications of technical accuracy in genome sequencing.

Authors:  Rachel L Goldfeder; James R Priest; Justin M Zook; Megan E Grove; Daryl Waggott; Matthew T Wheeler; Marc Salit; Euan A Ashley
Journal:  Genome Med       Date:  2016-03-02       Impact factor: 11.117

View more
  189 in total

1.  Deep Learning Applied on Next Generation Sequencing Data Analysis.

Authors:  Artem Danilevsky; Noam Shomron
Journal:  Methods Mol Biol       Date:  2021

2.  novoCaller: a Bayesian network approach for de novo variant calling from pedigree and population sequence data.

Authors:  Anwoy Kumar Mohanty; Dana Vuzman; Laurent Francioli; Christopher Cassa; Agnes Toth-Petroczy; Shamil Sunyaev
Journal:  Bioinformatics       Date:  2019-04-01       Impact factor: 6.937

3.  MetaPheno: A critical evaluation of deep learning and machine learning in metagenome-based disease prediction.

Authors:  Nathan LaPierre; Chelsea J-T Ju; Guangyu Zhou; Wei Wang
Journal:  Methods       Date:  2019-03-16       Impact factor: 3.608

4.  Semi-supervised learning for somatic variant calling and peptide identification in personalized cancer immunotherapy.

Authors:  Elham Sherafat; Jordan Force; Ion I Măndoiu
Journal:  BMC Bioinformatics       Date:  2020-12-30       Impact factor: 3.169

5.  Sensitive alignment using paralogous sequence variants improves long-read mapping and variant calling in segmental duplications.

Authors:  Timofey Prodanov; Vikas Bansal
Journal:  Nucleic Acids Res       Date:  2020-11-04       Impact factor: 16.971

6.  Comprehensive Detection of Single Amino Acid Variants and Evaluation of Their Deleterious Potential in a PANC-1 Cell Line.

Authors:  Zhijing Tan; Jianhui Zhu; Paul M Stemmer; Liangliang Sun; Zhichang Yang; Kendall Schultz; Matthew J Gaffrey; Anthony J Cesnik; Xinpei Yi; Xiaohu Hao; Michael R Shortreed; Tujin Shi; David M Lubman
Journal:  J Proteome Res       Date:  2020-02-27       Impact factor: 4.466

7.  Interpretable detection of novel human viruses from genome sequencing data.

Authors:  Jakub M Bartoszewicz; Anja Seidel; Bernhard Y Renard
Journal:  NAR Genom Bioinform       Date:  2021-02-01

8.  Accurate, scalable cohort variant calls using DeepVariant and GLnexus.

Authors:  Taedong Yun; Helen Li; Pi-Chuan Chang; Michael F Lin; Andrew Carroll; Cory Y McLean
Journal:  Bioinformatics       Date:  2021-01-05       Impact factor: 6.937

9.  Detection of Pathogenic Variants With Germline Genetic Testing Using Deep Learning vs Standard Methods in Patients With Prostate Cancer and Melanoma.

Authors:  Saud H AlDubayan; Jake R Conway; Sabrina Y Camp; Leora Witkowski; Eric Kofman; Brendan Reardon; Seunghun Han; Nicholas Moore; Haitham Elmarakeby; Keyan Salari; Hani Choudhry; Abdullah M Al-Rubaish; Abdulsalam A Al-Sulaiman; Amein K Al-Ali; Amaro Taylor-Weiner; Eliezer M Van Allen
Journal:  JAMA       Date:  2020-11-17       Impact factor: 56.272

Review 10.  Next Generation Sequencing and Bioinformatics Analysis of Family Genetic Inheritance.

Authors:  Aquillah M Kanzi; James Emmanuel San; Benjamin Chimukangara; Eduan Wilkinson; Maryam Fish; Veron Ramsuran; Tulio de Oliveira
Journal:  Front Genet       Date:  2020-10-23       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.