| Literature DB >> 34046589 |
Benedetta Bigio1, Yoann Seeleuthner2, Gaspard Kerner2, Mélanie Migaud2, Jérémie Rosain2, Bertrand Boisson1, Carla Nasca3, Anne Puel1, Jacinta Bustamante1, Jean-Laurent Casanova1, Laurent Abel1, Aurelie Cobat2.
Abstract
The detection of copy number variations (CNVs) in whole-exome sequencing (WES) data is important, as CNVs may underlie a number of human genetic disorders. The recently developed HMZDelFinder algorithm can detect rare homozygous and hemizygous (HMZ) deletions in WES data more effectively than other widely used tools. Here, we present HMZDelFinder_opt, an approach that outperforms HMZDelFinder for the detection of HMZ deletions, including partial exon deletions in particular, in WES data from laboratory patient collections that were generated over time in different experimental conditions. We show that using an optimized reference control set of WES data, based on a PCA-derived Euclidean distance for coverage, strongly improves the detection of HMZ complete exon deletions both in real patients carrying validated disease-causing deletions and in simulated data. Furthermore, we develop a sliding window approach enabling HMZDelFinder_opt to identify HMZ partial deletions of exons that are undiscovered by HMZDelFinder. HMZDelFinder_opt is a timely and powerful approach for detecting HMZ deletions, particularly partial exon deletions, in WES data from inherently heterogeneous laboratory patient collections.Entities:
Year: 2021 PMID: 34046589 PMCID: PMC8140739 DOI: 10.1093/nargab/lqab037
Source DB: PubMed Journal: NAR Genom Bioinform ISSN: 2631-9268