| Literature DB >> 25887570 |
Wai Yi Leung1, Tobias Marschall2,3,4, Yogesh Paudel5, Laurent Falquet6, Hailiang Mei7, Alexander Schönhuth8, Tiffanie Yael Maoz Moss9.
Abstract
BACKGROUND: Many tools exist to predict structural variants (SVs), utilizing a variety of algorithms. However, they have largely been developed and tested on human germline or somatic (e.g. cancer) variation. It seems appropriate to exploit this wealth of technology available for humans also for other species. Objectives of this work included: a) Creating an automated, standardized pipeline for SV prediction. b) Identifying the best tool(s) for SV prediction through benchmarking. c) Providing a statistically sound method for merging SV calls.Entities:
Mesh:
Year: 2015 PMID: 25887570 PMCID: PMC4520269 DOI: 10.1186/s12864-015-1376-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Overview of test datasets
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Illumina 1.9 FastQ Paired End | Human Genome hg19 | SimSeq Illumina Profile | 100 | 500 | 15 | 30x |
| Illumina 1.9 FastQ Paired End | Human Genome hg19 | SimSeq Illumina Profile | 100 | 500 | 50 | 30x |
Arabisopsis (Tair9) and Human (hg19) datasets were simulated using a SimSeq Illumina 1.9 Paired End profile with 100 bp reads and an insert size of 500. Two standard deviations of insert size were created for each dataset, more ideal (15) and less ideal (50). All datasets were simulated to 30x coverage.
Counts of Validated SVs used to benchmark SV tool performance
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Human chr. 21 | insertion | 136 | 37 | 30 | 19 | 10 |
|
|
|
|
|
|
|
|
| Human chr. 21 |
| 118 | 33 | 19 | 19 | 4 |
|
|
|
|
|
|
|
|
Figure 1SV-AUTOPILOT pipeline. Illumina pair-end NGS data in the form of a fastq file is submitted to the pipeline for SV analysis along with a genomic reference sequence. A quality report is provided by Fastqc, and Sickle is used for trimming low quality reads. Modularity allows for a choice of read aligner and SV tools. Samtools flagstat is run to evaluate the quality of the mapping. Each tool’s output is converted to a VCF format, unless already provided by the program, for downstream use by the researcher. For those wanting to benchmark tool performance, the performance metrics for the tools can be compared in the PDF report provided. Finally, when using multiple tools as part of a pipeline leading to SV validation, the option to merge SV calls according to the statistical method provided here is available to enrich the call set with true calls by merging results and reducing false-positive calls.
Figure 2Radar plot interpretation. Each corner of the pentagon represents a size class of SV. Performance is measured on a scale of 0–1.0, with 1.0 as the most accurate calls. Each tool is associated with a color as indicated in the associated figure legend. Tool performance across all size classes is easily assessed by evaluating the total area of the radar plot covered by a given tool.
Typical computational performance by SV tools used for a single run
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| GASV | n | 1058 | 594 | 0:02.08 | 0:01.20 | PE | IDVT |
| Delly | n | 578 | 1236 | 0:15.02 | 0:03.18 | PE & SR | DVTP |
| Breakdancer | n | 21.9 | 7 | 0:02.41 | 0:27.7 | PE | IDVT |
| Pindel | y | 3500.5 | 5779 | 3:02.46 | 1:16.0 | SR | IDVP |
| Clever | y | 238.7 | 1598 | 0:15.47 | 0:14.04 | PE | ID |
| SVdetect | n | 172.3 | 3223 | 0:07.56 | 0:07.31 | PE | IDVTP |
| Prism | n | 1024.9 | 6817 | 0:28.15 | 0:05.59 | PE & SR | IDVP |
Log files document computation performance for each tool used in this benchmarking study. Documentation from a single run shows memory (mem) usage and CPU time need to run each tool on Arabidopsis (Tair) and on the Human dataset used in the benchmarking. Additional columns refer to the type of algorithm used (PE: Paired-end; SR: Split-read) and the SVs that the tool is reported to be able to predict (I: Insertion; D: Deletion; V: Inversion; T: Translocation; P: Duplication). Raw log files are included in the supplementary data.
Best performing SV tool for each size class of insertion and deletion using normal and less-ideal datasets of Arabidopsis and Human Chromsome 21
|
| |||||||
|
|
|
|
|
|
|
| |
|
| |||||||
| Human chr. 21 | 15 | Clever | Pindel | Clever | Clever | n/a | |
| 88.3/77.8 | 85.7/68.8 | 88.5/45.8 | 100.0/50 | ||||
| Tair v.9 | 15 | Pindel | Clever/ Pindel | Clever | Clever | n/a | |
| 95.0/93.2 | 94.3/47.4 | 68.1/23 | 66.7/55.6 | ||||
| Human chr. 21 | 50 | Pindel | Clever/Pindel | Clever | n/a | n/a | |
| 87.9/75.2 | 83.3/55.6 | 66.7/17.9 | |||||
| Tair v.9 | 50 | Pindel | Clever/Pindel | Clever | Clever | n/a | |
| 94.9/92.7 | 94.6/46.2 | 73.9/8.3 | 60.0/20 | ||||
|
| |||||||
| Human chr. 21 | 15 | Clever/Pindel | Pindel | Clever/Pindel | Pindel | Breakdancer/Clever | |
| 92.8/89 | 92.3/76.9 | 84.6/42.9 | 100/100 | 100/33.3 | |||
| Tair v.9 | 15 | Pindel | Delly | Pindel | Pindel | Clever | |
| 94.6/94.2 | 100/100 | 89.2/90.2 | 87.9/88.7 | 68.3/58.1 | |||
| Human chr. 21 | 50 | Clever/Pindel | Pindel | Clever/Pindel | Pindel | Breakdancer/Clever | |
| 90.9/88.2 | 100/90 | 59.1/42.9 | 81.8/81.8 | 100/50 | |||
| Tair v.9 | 50 | Pindel | Delly/Pindel | Pindel | Pindel | Clever/Pindel | |
| 94.5/94.3 | 100/90.6 | 86.5/86.3 | 89.4/89.4 | 69.6/61 | |||
|
| |||||||
|
|
|
|
|
|
|
| |
|
| |||||||
| Human chr. 21 | 15 | Clever | Clever | Clever | Clever | n/a | |
| 72.8 | 83.3 | 60 | 5.3 | n/a | |||
| Tair v.9 | 15 | Clever | Clever | Clever | Clever | n/a | |
| 56.4 | 81.2 | 92.7 | 15.9 | n/a | |||
| Human chr. 21 | 50 | Pindel | Clever | Clever | Clever | n/a | |
| 61.8 | 48.6 | 76.7 | 10.5 | n/a | |||
| Tair v.9 | 50 | Pindel | Pindel | Clever | Breakdancer | n/a | |
| 34.6 | 43 | 95.1 | 51.2 | n/a | |||
|
| |||||||
| Human chr. 21 | 15 | Prism | Clever | Clever | Delly | Breakdancer | |
| 88.1 | 90.9 | 52.6 | 73.7 | 50 | |||
| Tair v.9 | 15 | Clever | Clever | Clever | Delly | SVDetect | |
| 65 | 82.1 | 89.8 | 93 | 97 | |||
| Human chr. 21 | 50 | Prism | Prism | Clever | Breakdancer | Breakdancer | |
| 85.6 | 72.7 | 63.2 | 73.7 | 50 | |||
| Tair v.9 | 50 | Pindel | Clever | Clever | Delly | SVDetect | |
| 94.5 | 52.9 | 91.9 | 94.9 | 96.2 | |||
For this work, a standard deviation of 15 is considered normal while a standard deviation of 50 is considered less-ideal. Recall and Precision were two measures used to evaluate the ability of a tool to accurately predict SVs. Here the winner for each length class is provided along with the tools winning value for that category. (P = Precision; R = Recall; Std. Dev = Standard Deviation of the Insert size; n/a = no call was made by any tools tested). The Additional file 1 contains all tool performance statistics in the PDF reports. In Table 4a both ‘relaxed’ and ‘strict’ criteria (REL/STR) (see Methods) are provided for the precision measurements which indicates how accurate the tools are at making their calls. In Table 4b the scores of tool recall demonstrate how much of the SVs the tools are able to discover.
Figure 3Data quality affects the performance of SV tools in human and Arabidopsis data sets. Some tools are more affected by changes in data quality than others. The standard deviation of the insert size of paired end reads was used as a measure of data quality. The Recall and Precision of Deletion calls are measured for Human and Arabidopsis datasets at the less optimal (sd = 50) and more optimal standard deviation (sd = 15).
Figure 4Example of merged call set compared to individual call sets. Integrated Genome Browser view of merged predictions. Calls made by each tool are shown in individual tracks, and the merged call set provided by SV-AUTOPILOT is shown in the bottom track. In this example a larger call by Breakdancer has been recentered by the merging algorithm. The red lines on the bottom indicate the position of the reference variants.