Literature DB >> 33429424

DeepCNV: a deep learning approach for authenticating copy number variations.

Joseph T Glessner1,2, Xiurui Hou3, Cheng Zhong3, Jie Zhang4, Munir Khan1,2, Fabian Brand5, Peter Krawitz5, Patrick M A Sleiman2, Hakon Hakonarson2, Zhi Wei3.   

Abstract

Copy number variations (CNVs) are an important class of variations contributing to the pathogenesis of many disease phenotypes. Detecting CNVs from genomic data remains difficult, and the most currently applied methods suffer from an unacceptably high false positive rate. A common practice is to have human experts manually review original CNV calls for filtering false positives before further downstream analysis or experimental validation. Here, we propose DeepCNV, a deep learning-based tool, intended to replace human experts when validating CNV calls, focusing on the calls made by one of the most accurate CNV callers, PennCNV. The sophistication of the deep neural network algorithm is enriched with over 10 000 expert-scored samples that are split into training and testing sets. Variant confidence, especially for CNVs, is a main roadblock impeding the progress of linking CNVs with the disease. We show that DeepCNV adds to the confidence of the CNV calls with an optimal area under the receiver operating characteristic curve of 0.909, exceeding other machine learning methods. The superiority of DeepCNV was also benchmarked and confirmed using an experimental wet-lab validation dataset. We conclude that the improvement obtained by DeepCNV results in significantly fewer false positive results and failures to replicate the CNV association results.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  copy number variation; deep learning

Mesh:

Year:  2021        PMID: 33429424      PMCID: PMC8681111          DOI: 10.1093/bib/bbaa381

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  33 in total

1.  Genome-wide discovery of pre-miRNAs: comparison of recent approaches based on machine learning.

Authors:  Leandro A Bugnon; Cristian Yones; Diego H Milone; Georgina Stegmayer
Journal:  Brief Bioinform       Date:  2021-05-20       Impact factor: 11.622

2.  tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors:  T M Lowe; S R Eddy
Journal:  Nucleic Acids Res       Date:  1997-03-01       Impact factor: 16.971

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  Mastering the game of Go without human knowledge.

Authors:  David Silver; Julian Schrittwieser; Karen Simonyan; Ioannis Antonoglou; Aja Huang; Arthur Guez; Thomas Hubert; Lucas Baker; Matthew Lai; Adrian Bolton; Yutian Chen; Timothy Lillicrap; Fan Hui; Laurent Sifre; George van den Driessche; Thore Graepel; Demis Hassabis
Journal:  Nature       Date:  2017-10-18       Impact factor: 49.962

Review 5.  Consensus statement: chromosomal microarray is a first-tier clinical diagnostic test for individuals with developmental disabilities or congenital anomalies.

Authors:  David T Miller; Margaret P Adam; Swaroop Aradhya; Leslie G Biesecker; Arthur R Brothman; Nigel P Carter; Deanna M Church; John A Crolla; Evan E Eichler; Charles J Epstein; W Andrew Faucett; Lars Feuk; Jan M Friedman; Ada Hamosh; Laird Jackson; Erin B Kaminsky; Klaas Kok; Ian D Krantz; Robert M Kuhn; Charles Lee; James M Ostell; Carla Rosenberg; Stephen W Scherer; Nancy B Spinner; Dimitri J Stavropoulos; James H Tepperberg; Erik C Thorland; Joris R Vermeesch; Darrel J Waggoner; Michael S Watson; Christa Lese Martin; David H Ledbetter
Journal:  Am J Hum Genet       Date:  2010-05-14       Impact factor: 11.025

6.  A comprehensive analysis of common copy-number variations in the human genome.

Authors:  Kendy K Wong; Ronald J deLeeuw; Nirpjit S Dosanjh; Lindsey R Kimm; Ze Cheng; Douglas E Horsman; Calum MacAulay; Raymond T Ng; Carolyn J Brown; Evan E Eichler; Wan L Lam
Journal:  Am J Hum Genet       Date:  2006-12-05       Impact factor: 11.025

Review 7.  Challenges and standards in integrating surveys of structural variation.

Authors:  Stephen W Scherer; Charles Lee; Ewan Birney; David M Altshuler; Evan E Eichler; Nigel P Carter; Matthew E Hurles; Lars Feuk
Journal:  Nat Genet       Date:  2007-07       Impact factor: 38.330

8.  Deep learning for drug response prediction in cancer.

Authors:  Delora Baptista; Pedro G Ferreira; Miguel Rocha
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

9.  Mapping and sequencing of structural variation from eight human genomes.

Authors:  Jeffrey M Kidd; Gregory M Cooper; William F Donahue; Hillary S Hayden; Nick Sampas; Tina Graves; Nancy Hansen; Brian Teague; Can Alkan; Francesca Antonacci; Eric Haugen; Troy Zerr; N Alice Yamada; Peter Tsang; Tera L Newman; Eray Tüzün; Ze Cheng; Heather M Ebling; Nadeem Tusneem; Robert David; Will Gillett; Karen A Phelps; Molly Weaver; David Saranga; Adrianne Brand; Wei Tao; Erik Gustafson; Kevin McKernan; Lin Chen; Maika Malig; Joshua D Smith; Joshua M Korn; Steven A McCarroll; David A Altshuler; Daniel A Peiffer; Michael Dorschner; John Stamatoyannopoulos; David Schwartz; Deborah A Nickerson; James C Mullikin; Richard K Wilson; Laurakay Bruhn; Maynard V Olson; Rajinder Kaul; Douglas R Smith; Evan E Eichler
Journal:  Nature       Date:  2008-05-01       Impact factor: 49.962

10.  QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data.

Authors:  Stefano Colella; Christopher Yau; Jennifer M Taylor; Ghazala Mirza; Helen Butler; Penny Clouston; Anne S Bassett; Anneke Seller; Christopher C Holmes; Jiannis Ragoussis
Journal:  Nucleic Acids Res       Date:  2007-03-06       Impact factor: 16.971

View more
  1 in total

1.  Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure.

Authors:  Laura Balagué-Dobón; Alejandro Cáceres; Juan R González
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.