| Literature DB >> 35448948 |
Aurélie Canaguier1, Romane Guilbaud1, Erwan Denis2, Ghislaine Magdelenat2, Caroline Belser3, Benjamin Istace3, Corinne Cruaud2, Patrick Wincker3, Marie-Christine Le Paslier1, Patricia Faivre-Rampant4, Valérie Barbe3.
Abstract
BACKGROUND: Structural Variations (SVs) are genomic rearrangements derived from duplication, deletion, insertion, inversion, and translocation events. In the past, SVs detection was limited to cytological approaches, then to Next-Generation Sequencing (NGS) short reads and partitioned assemblies. Nowadays, technologies such as DNA long read sequencing and optical mapping have revolutionized the understanding of SVs in genomes, due to the enhancement of the power of SVs detection. This study aims to investigate performance of two techniques, 1) long-read sequencing obtained with the MinION device (Oxford Nanopore Technologies) and 2) optical mapping obtained with Saphyr device (Bionano Genomics) to detect and characterize SVs in the genomes of the two ecotypes of Arabidopsis thaliana, Columbia-0 (Col-0) and Landsberg erecta 1 (Ler-1).Entities:
Keywords: Arabidopsis thaliana; Bionano Genomics optical mapping; High molecular weight DNA; Oxford Nanopore technologies; Structural variations
Mesh:
Year: 2022 PMID: 35448948 PMCID: PMC9026655 DOI: 10.1186/s12864-022-08499-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Fig. 1Circos visualization of Evry.Ler-1 SVs landscape. All comparisons were performed against the Col-0 TAIR10.1 reference genome per 100kb bins. From external to internal layer (Circle1 to Circle7): Circle1: Col-0 TAIR10.1 chromosomes (ticks every 100 kb): black and light grey rectangles represent centromeric and NOR regions respectively; Circle2: Average mapping coverage for Evry.Col-0 ONT reads (grey line) and Evry.Ler-1 ONT reads (orange line) with dark orange if coverage > 46X; Circle3: DLE-1 label density as purple line (dark purple if density > 18 label per 100 kb); Circle4: Genes density as green line (dark green if density > 23), NLR Genes [58] indicated as green rectangles; Circle5: TEs density as blue line (dark blue if density > 58); Circle6: ONT SVs occurrences as orange outward bars (dark orange bars represent ONT- specific SVs); Circle7: Bionano SVs occurrences as purple inward bars (dark purple bars represent Bionano-specific SVs)
Characteristics of Evry.Ler-1 ONT and Bionano SVs, obtained after alignment against Col-0 TAIR10.1 reference genome
| Technology | ONT | Bionano | |||||||
|---|---|---|---|---|---|---|---|---|---|
| SV type | INS | DEL | INV | TOTAL | INS | DEL | INV | TRA | TOTAL |
| SV > 1 kb (%) | 591 (49.9) | 581 (49.1) | 12 (1.0) | 1184 | 289 (48.9) | 295 (49.9) | 5 (0.8) | 2 (0.4) | 591 |
| Cumulated Size | 3.4 | 4.0 | 0.3 | 7.7 | 2.9 | 2.3 | 1.6 | 0.4 | 7.2 |
| Median Size | 3358 | 3446 | 17,012 | 3455 | 4383 | 4296 | 166,007 | 189,454 | 4383 |
| Average Size | 5724 | 6801 | 26,865 | 6467 | 10,021 | 7885 | 310,470 | 189,454 | 12,104 |
Cumulated sizes are in Mb, Median and Average sizes in bp
Fig. 2Number of Evry.Ler-1 structural variations detected by ONT and Bionano against the Col-0 TAIR10.1 reference genome and overlaps in locations between the two technologies. The bars and circles colored in orange and purple correspond respectively to the ONT and Bionano technologies. A Barplot of SV number for insertions (INS), deletions (DEL), inversions (INV), translocations (TRA) and all SVs (TOTAL) obtained using ONT and Bionano technologies. B Venn diagramm of common and specific locations detected by ONT and Bionano technologies
Fig. 3Focus of large structural variations (MU) located in complex locations. For each location, optical maps are colored in green for the Col-0 TAIR10.1 reference maps (ChrM for mitochondrial chromosome map) and light blue for Evry.Ler-1 maps. Consistent DLE-1 enzyme label between reference and Evry.Ler-1 maps are represented by dark blue bars with grey links between the genome maps. Inconsistent DLE-1 enzyme label are shown by yellow bars on the two genome maps. The purple bars represent the insertion events on the Evry.Ler-1 maps / read / assembly, and the black, yellow and pink bars correspond to deletions, inversions and translocations respectively. Araport11 annotation of the Col-0 TAIR10.1 reference (Genes and TE) and IGV view of the Evry.Ler-1 trimmed ONT reads and SMARTdenovo contigs minimap alignments are also presented. A Bionano Chr2 Evry.Ler-1 translocations against Col-0 TAIR10.1 reference genome (MU_153). B Bionano Chr4 Evry.Ler-1 extra-range size inversion against Col-0 TAIR10.1 reference genome (MU_097). C Bionano Chr4 Evry.Ler-1 large deletion against Col-0 TAIR10.1 reference genome (MU_102). D Bionano Chr5 Evry.Ler-1 inversion against Col-0 TAIR10.1 reference genome (MU_138)
Characteristics of Evry.Ler-1 ONT and Evry.Ler-1 optical map SVs identified in common and specific Col-0 TAIR10.1 reference locations
| Technology | ONT | Bionano | ||
|---|---|---|---|---|
| Common | Specific | Common | Specific | |
| Locations | 563 | 236 | 563 | 28 |
| SVs > 1 kb (%) | 948 (80.1) | 236 (19.9) | 563 (95.3) | 28 (4.7) |
| Min Size | 1003 | 1003 | 1034 | 1017 |
| Max Size | 87,533 | 347,239 | 1,143,224 | 166,007 |
| Cumulated Size | 5.9 | 1.8 | 6.9 | 0.3 |
| Median Size | 3759 | 2656 | 4456 | 1374 |
| Average Size | 6221 | 7453 | 12,171 | 11,104 |
Cumulated sizes are in Mb, and all other sizes in bp
Fig. 4Size distribution and median comparison of ONT and Bionano SV. All p-values were obtained with a two-sided oriented Wilcoxon rank-sum test. Hypothesis H0 was “median of ONT SV size is less than Bionano one”. ONT and Bionano boxplots are colored in orange and purple respectively. Medians are represented by red dots. A Boxplot of ONT (n=1184) and Bionano (n=591) SV>1kb. B Boxplot of ONT (n=1169) and Bionano (n=573) 1kb
Characteristics of the svID MU identified in ONT and Bionano SVs
| Technology | ONT | Bionano |
|---|---|---|
| Location | 153 | 153 |
| Number | 538 | 153 |
| Min Size | 1010 | 1253 |
| Max Size | 87,533 | 1,143,224 |
| Cumulated Size | 3.9 | 4.4 |
| Median Size | 4236 | 9523 |
| Average Size | 7406 | 28,734 |
svID MU corresponds to locations where Multiple ONT SVs overlap a Unique Bionano SV location. Cumulated sizes are in Mb, and all other sizes in bp
Evry.Ler-1 ONT SVs (> 1 kb) overlapping Col-0 TAIR10.1 reference genes and TEs annotation features
| Features | noTE | TE | TOTAL (%) | |||
|---|---|---|---|---|---|---|
| noGENE | GENE | noGENE | GENE | |||
| Common SV | UU | 79 | 59 | 93 | 179 | 410 (34.6) |
| MU | 59 | 38 | 150 | 291 | 538 (45.5) | |
| Specific SV | UN | 28 | 28 | 71 | 109 | 236 (19.9) |
| SV number (%) | 166 (14.0) | 125 (10.6) | 314 (26.5) | 579 (48.9) | 1184 | |
| TOTAL (%) | 291 (24.6) | 893 (75.4) | 1184 | |||
Summary of comparisons of ONT assembly and Bionano Optical mapping (DLS labeling) for SV detection
| Technology | ONT | Bionano |
|---|---|---|
| √√ | √√ | |
| Deletion | √√ | √ |
| Insertion | √√ | √ |
| Inversion | √ | √√ |
| Translocation | NA | √ |
| Deletion | √ | √√ |
| Insertion | √ | √√ |
| Inversion | – | √√ |
| Translocation | NA | √√ |
| Genic | √√ | √√ |
| Complex | √ | √√ |
| √√ | NA | |
| √ | √√ | |
| √ | √√ | |
√√ Robust detection or user friendly. √ Detection depending on variables such as coverage, contig length and label density or need of bioinformatic skills. NA: Not available; − Undetectable