| Literature DB >> 35294448 |
Thomas W Laver1, Elisa De Franco1, Matthew B Johnson1, Kashyap A Patel1, Sian Ellard1, Michael N Weedon1, Sarah E Flanagan1, Matthew N Wakeling1.
Abstract
Identifying copy number variants (CNVs) can provide diagnoses to patients and provide important biological insights into human health and disease. Current exome and targeted sequencing approaches cannot detect clinically and biologically-relevant CNVs outside their target area. We present SavvyCNV, a tool which uses off-target read data from exome and targeted sequencing data to call germline CNVs genome-wide. Up to 70% of sequencing reads from exome and targeted sequencing fall outside the targeted regions. We have developed a new tool, SavvyCNV, to exploit this 'free data' to call CNVs across the genome. We benchmarked SavvyCNV against five state-of-the-art CNV callers using truth sets generated from genome sequencing data and Multiplex Ligation-dependent Probe Amplification assays. SavvyCNV called CNVs with high precision and recall, outperforming the five other tools at calling CNVs genome-wide, using off-target or on-target reads from targeted panel and exome sequencing. We then applied SavvyCNV to clinical samples sequenced using a targeted panel and were able to call previously undetected clinically-relevant CNVs, highlighting the utility of this tool within the diagnostic setting. SavvyCNV outperforms existing tools for calling CNVs from off-target reads. It can call CNVs genome-wide from targeted panel and exome data, increasing the utility and diagnostic yield of these tests. SavvyCNV is freely available at https://github.com/rdemolgen/SavvySuite.Entities:
Mesh:
Year: 2022 PMID: 35294448 PMCID: PMC8959187 DOI: 10.1371/journal.pcbi.1009940
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Benchmarking off-target CNV calling from targeted panel data.
The data points on the plot are generated by a parameter sweep for each tool and show the precision and recall that can be achieved with each tool. The f statistic is the harmonic mean of precision and recall (see Materials and Methods for details).
Fig 2Benchmarking on-target CNV calling from targeted panel data.
The data points on the plot are generated by a parameter sweep for each tool and show the precision and recall that can be achieved with each tool. The f statistic is the harmonic mean of precision and recall (see Materials and Methods for details).
Fig 3Benchmarking off-target CNV calling from exome data.
The data points on the plot are generated by a parameter sweep for each tool and show the precision and recall that can be achieved with each tool. The f statistic is the harmonic mean of precision and recall (see Materials and Methods for details).
Clinically-relevant CNVs detected.
| Row | CNV detected (GRCh37) | CNV size | Clinical confirmation | Reason for referral | Clinical implications |
|---|---|---|---|---|---|
| 1 | Chr8:6,800,000–11,800,000 deletion | 5Mb | Deletion includes | Diabetes | Genetic diagnosis of monogenic diabetes [ |
| 2 | Chr8:8,000,000–10,400,000 duplication | 2.4Mb and 1.4Mb | Deletion includes | Diabetes | Genetic diagnosis of monogenic diabetes [21). |
| 3 | Chr18:19,400,000–21,800,000 deletion | 2.4Mb | Deletion includes | Diabetes | Genetic diagnosis of monogenic diabetes [ |
| 4 | Chr21:14,400,000–48,200,000 duplication | Chromosome | Patient known to have Down Syndrome at referral. | Hyperinsulinism | Confirms the diagnosis of Down syndrome. |
| 5 | Chr22:18,800,000–21,600,000 deletion | 1.8Mb | Confirmed by array CGH. | Diabetes | Provided the diagnosis of DiGeorge syndrome. |
| 6 | ChrX:0–155,400,000 duplication | Chromosome | Patient known to have XXX syndrome at referral. | Diabetes | Confirms the diagnosis of XXX syndrome. |