| Literature DB >> 35209840 |
Simone Maestri1, Giorgio Gambino2, Giulia Lopatriello1, Andrea Minio3, Irene Perrone2, Emanuela Cosentino1, Barbara Giovannone1, Luca Marcolungo1, Massimiliano Alfano1, Stephane Rombauts4,5, Dario Cantu3, Marzia Rossato1, Massimo Delledonne6, Luciano Calderón7.
Abstract
BACKGROUND: 'Nebbiolo' is a grapevine cultivar typical of north-western Italy, appreciated for producing high-quality red wines. Grapevine cultivars are characterized by possessing highly heterozygous genomes, including a great incidence of genomic rearrangements larger than 50 bp, so called structural variations (SVs). Even though abundant, SVs are an under-explored source of genetic variation mainly due to methodological limitations at their detection.Entities:
Mesh:
Year: 2022 PMID: 35209840 PMCID: PMC8867635 DOI: 10.1186/s12864-022-08389-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Contiguity and completeness statistics for ‘Nebbiolo’ CVT 71 genome assemblies
| Total assembly length (Mbp) | 560.26 | 533.95 |
| Assembly N50 (Mbp) | 5.37 | 1.18 |
| Total scaffolds length (Mbp) | 487.09 | 225.01 |
| Number of scaffolds | 109 | 107 |
| Number of gaps | 384 | 247 |
| Gaps size (Mbp) | 20.51 | 8.68 |
| Contigs in scaffolds | 493 | 354 |
| Remaining contigs | 121 | 1,880 |
| Remaining contigs total length (Mbp) | 73.17 | 308.94 |
| BUSCO statistics | C:94.8%, F:2.0%, M:3.2% | C:77.9%, F:1.9%, M:20.2% |
| Num. genes | 35,038 | 32,865 |
| Perc. repetitive content | 52.9% | 56.5% |
For BUSCO statistics, ‘C’ refers to gene completeness, ‘F’ to fragmented genes, and ‘M’ to missing genes. ‘Nebbiolo’ primary assembly and alternative haplotypes refer to the genome assemblies obtained after haplotypic separation
Fig. 1A comparison among ‘Nebbiolo’, ‘Cabernet Sauvignon’, ‘Chardonnay’ and ‘Zinfandel’ gene families. a) Venn diagram showing the number of shared and cultivar-specific gene families. b) Functional enrichment analysis of the biological processes associated to ‘Nebbiolo’ private genes. For each significantly enriched GO term, the number of ‘Nebbiolo’ associated private genes is represented, along with the percentage of genes associated with a GO that are private to ‘Nebbiolo’. The size and color of the circles represent the number of ‘Nebbiolo’ private genes and the adjusted p-value, respectively. Only GO terms with adjusted p-value < 0.01 are shown
Raw genomic data obtained for ‘Nebbiolo’ clones CVT 71 and CVT 185
| ‘Nebbiolo’ | Sequencing | Data type | N50 reads/ | Number of | Number of |
|---|---|---|---|---|---|
| CVT 71 | PacBio SMRT | Long-reads | 27,197 | 3,286,690 | 51 |
| CVT 71 | Bionano Genomics | Single-molecule maps | 241,361 | 654,883 | 157 |
| CVT 71 | 10 × Genomics | Linked-reads | 2 × 150 | 273,179,528 | 82 |
| CVT 185 | 10 × Genomics | Linked-reads | 2 × 150 | 261,527,174 | 78 |
Structural Variant (SV) types identified comparing ‘Nebbiolo’ CVT 71 haplotypes
| SV calling approach | DELs | INSs/DUPs | INVs | TOT | Cumulative size of SVs (Mbp) |
|---|---|---|---|---|---|
| PacBio SMRT | 13,138 | 8,016 | 87 | 21,241 | 79.6 |
| 10 × Genomics | 10,056 | 18 | 9 | 10,083 | 54.1 |
| Genome | 9,845 | 9,352 | 87 | 20,010 | 52.0 |
Fig. 2Structural Variants (SVs) between ‘Nebbiolo’ CVT 71 haplotypes identified by three different methodological approaches. a) Bar plot showing SVs size distribution. For each size, a coloured bar represents the number of SVs identified by each of the three approaches. The sizes of SVs are represented in log-scale. b) Venn diagram showing SVs intersection identified by each approach. Intersection is based on the genomic coordinates at which SVs were called. Abbreviations and colour code for both images: 10xG = 10 × Genomics linked-reads (red); SMRT = PacBio SMRT long-reads (blue); Genome = Genome to genome alignment (green)
Structural Variant (SV) types identified with 10 × Genomics data for ‘Nebbiolo’ clones CVT 71 and CVT 185
| SV calling approach | Clone | DELs | INSs/DUPs | INVs | TOT | Cumulative size |
|---|---|---|---|---|---|---|
| 10 × Genomics | CVT185 | 10,215 | 11 | 8 | 10,234 | 51.8 |
| 10 × Genomics | CVT71 | 10,056 | 18 | 9 | 10,083 | 54.1 |
Structural Variant (SV) types identified comparing ‘Nebbiolo’ to ‘Cabernet Sauvignon’
| SV calling approach | DELs | INSs/DUPs | INVs | TOT | Cumulative size of SVs (Mbp) |
|---|---|---|---|---|---|
| SMRT | 17,622 | 9,632 | 65 | 27,319 | 136.1 |
| 10xG | 18,975 | 24 | 17 | 19,016 | 141.8 |
| Genome | 12,509 | 10,823 | 1,664 | 24,996 | 76.4 |
Fig. 3Structural Variants (SVs) between ‘Nebbiolo’ and ‘Cabernet Sauvignon’ identified with three different methodological approaches. a) Bar plot showing the size distribution of the called SVs. For each size, a coloured bar represents the number of SVs identified by each approach. The sizes of SVs are represented in log-scale. b) Venn diagram showing SVs intersections, based on the genomic coordinates at which SVs were called. Abbreviations and colour code for both images: 10xG = 10 × Genomics (red); SMRT = PacBio SMRT (blue); Genome = Genome to genome alignment (green)