| Literature DB >> 32785184 |
Urban Kunej1, Aida Dervishi2, Valérie Laucou3, Jernej Jakše1, Nataša Štajner1.
Abstract
The main challenge associated with genotyping based on conventional length polymorphisms is the cross-laboratory standardization of allele sizes. This step requires the inclusion of standards and manual sizing to avoid false results. Capillary electrophoresis (CE) approaches limit the information to the length polymorphism and do not allow the determination of a complete marker sequence. As an alternative, high-throughput sequencing (HTS) offers complete information regarding marker sequences and their flanking regions. In this work, we investigated the suitability of a semi-quantitative sequencing approach for microsatellite genotyping using Illumina paired-end technology. Twelve microsatellite loci that are well established for grapevine CE typing were analysed on 96 grapevine samples from six different countries. We redesigned primers to the length of the amplicon for short sequencing (~100 bp). The primer pair was flanked with a 10 bp overhang for the introduction of barcodes on both sides of the amplicon to enable high multiplexing. The highest data peaks were determined as simple sequence repeat (SSR) alleles and compared with the CE dataset based on 12 reference samples. The comparison showed that HTS SSR genotyping can successfully replace the CE system in further experiments. We believe that, with next-generation sequencing, genotyping can be improved in terms of its speed, accuracy, and price.Entities:
Keywords: SSR markers; Vitis vinifera L.; genotyping; high-throughput sequencing; microsatellites
Mesh:
Substances:
Year: 2020 PMID: 32785184 PMCID: PMC7464945 DOI: 10.3390/genes11080917
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
The 96 cultivars analysed in this study, sorted by barcodes and assigned to their country of origin.
| Barcode | Cultivar | Country of Origin | Barcode | Cultivar | Country of Origin |
|---|---|---|---|---|---|
| F1-R1 | Kratošija | North Macedonia | F7-R1 | GnetKras | Slovenia |
| F1-R2 | Neznana Bela | Slovenia | F7-R2 | Kratošija I | Montenegro |
| F1-R3 | Rebula | Slovenia | F7-R3 | Muštoš Feher | Serbia |
| F1-R4 | Sremska Zelenika | Serbia | F7-R4 | Rebula Portalis | Slovenia |
| F1-R5 | Zimsko Belo | Serbia | F7-R5 | Smederevka | Bosnia and Herzegovina |
| F1-R6 | Manastirsko Belo | North Macedonia | F7-R6 | Vranac | Bosnia and Herzegovina |
| F1-R7 | Dobrogostina | Bosnia and Herzegovina | F7-R7 | Belovina | North Macedonia |
| F1-R8 | Godominka | Serbia | F7-R8 | Gnjet | Slovenia |
| F2-R1 | Bagrina | Serbia | F8-R1 | Kreaca | Serbia |
| F2-R2 | DrenakCrni | Serbia | F8-R2 | Refosco | Slovenia |
| F2-R3 | Kadarka Bela | Serbia | F8-R3 | Stanušina | North Macedonia |
| F2-R4 | Krkošija Šupljica | Serbia | F8-R4 | Žametovka | Bosnia and Herzegovina |
| F2-R5 | Prokupac | Bosnia and Herzegovina | F8-R5 | Ohridsko Belo | North Macedonia |
| F2-R6 | Ružica | Serbia | F8-R6 | Refošk | Slovenia |
| F2-R7 | Bela Zgodnja | Slovenia | F8-R7 | DolgiGrozdi | Slovenia |
| F2-R8 |
| Slovenia | F8-R8 | Gročanka | Serbia |
| F3-R1 | Drenak | Bosnia and Herzegovina | F9-R1 | PlovdinaCrna | Serbia |
| F3-R2 | Kadarka | Serbia | F9-R2 | Rezaklija | Bosnia and Herzegovina |
| F3-R3 | Kujundžuša | Bosnia and Herzegovina | F9-R3 | Stari Rizling VI | Montenegro |
| F3-R4 | Prokupac | Serbia | F9-R4 | Žlozder | Bosnia and Herzegovina |
| F3-R5 | Ružica V | Montenegro | F9-R5 | Debinë e Zezë | Albania |
| F3-R6 | TamjanikaCrna | Serbia | F9-R6 | Kallmet | Albania |
| F3-R7 |
| Slovenia | F9-R7 | Potek e Zezë | Albania |
| F3-R8 | Beli Medenac | Serbia | F9-R8 | Shesh i Zi | Albania |
| F4-R1 | Ružica VI | Montenegro | F10-R1 | Stambolleshë | Albania |
| F4-R2 | TrbljanBeli | Serbia | F10-R2 | Sheshi Bardhë | Albania |
| F4-R3 |
| Slovenia | F10-R3 | Kosinjot | Albania |
| F4-R4 | Bena | Bosnia and Herzegovina | F10-R4 | Vlosh | Albania |
| F4-R5 | Elezovka | Bosnia and Herzegovina | F10-R5 | Tajgë e Zezë | Albania |
| F4-R6 | Kavčina | Serbia | F10-R6 | Meresnik | Albania |
| F4-R7 | Marburger | Slovenia | F10-R7 | Korith i Bardhë | Albania |
| F4-R8 | Prošip | Bosnia and Herzegovina | F10-R8 | Tajgë e Bardhë | Albania |
| F5-R1 | Sipa | Slovenia | F11-R1 | Pulëz | Albania |
| F5-R2 | Trnjak | Bosnia and Herzegovina | F11-R2 | Razaki e Kuqe | Albania |
| F5-R3 |
| Slovenia | F11-R3 | Serinë e Bardhë | Albania |
| F5-R4 | BlatinaI | Bosnia and Herzegovina | F11-R4 | Debinë e Bardhë | Albania |
| F5-R5 | Furmint | Serbia | F11-R5 | Furmint | France |
| F5-R6 | Menigovka | Bosnia and Herzegovina | F11-R6 |
| France |
| F5-R7 | Radovača VII | Montenegro | F11-R7 |
| France |
| F5-R8 | Sipon | Slovenia | F11-R8 | Mourvedre | France |
| F6-R1 |
| Slovenia | F12-R1 | Ugni B/Trebbianotoscano | France |
| F6-R2 | CrnValandovskiDrenok | North Macedonia | F12-R2 | Muscat a petit grains | France |
| F6-R3 | Gavran | Serbia | F12-R3 | Muscat d’Alexandrie | France |
| F6-R4 | Končanka | North Macedonia | F12-R4 |
| France |
| F6-R5 | Muskat Ruža | Serbia | F12-R5 |
| France |
| F6-R6 | Slankamenka Crvena | Serbia | F12-R6 | Cabernet franc | France |
| F6-R7 |
| Slovenia | F12-R7 |
| France |
| F6-R8 | Čauš Bel | North Macedonia | F12-R8 | Richter110 | France |
* Bolded cultivars are, in general, considered as references.
The reference sequence, microsatellite core repeat, and reference length of each locus.
| Locus | Reference Sequence | Microsatellite Core Repeat | Reference Length |
|---|---|---|---|
| VMC1b11-NGS | GACCTAAGTTTCTGAGGCTTTGAAAATTACCTTCCGGGTTTCTAGAGAGGGAGAGAGAGAGAGAGAGAGAGAGAGAGGAAGGTTCGGCAACACAAAATGAGAGGCA | (GA)n | 106 |
| VrZAG79-NGS | TTAGCCGAAGCCATCTCTGTTCTCAAGCAGAATGGAAGTGAGAGAGAGAGAGAGAGARGAGAGAGAGAGAGATAAAGGTGGTGAGGTGCTTGTGTTTCTTGA | (CT)n | 102 |
| VVIb01-NGS | CCTGTGAAACCACCACTATCCTCAGAGAAGCTCTCTCTCTCTCTCTCTCTCTTCACACTCACATCACTCGTTTACCTTGTGCAACCA | (CT)n | 87 |
| VVIn73-NGS | AGGCTTCAAAGCCCTCTCATCTTAATTCGTGTGTGTGTGTGTGTGTGTTGGGGCCTTTGGGGCTCCACTGACACCCACAAGGGTGT | (CA)n | 86 |
| VVIp31-NGS | TTGGGAAACCACAGAAGTGACAATTTATAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGGCATATCCATTAGAATGATCACATTCCAGGAACAACCCATT | (GA)n | 101 |
| VVIq52-NGS | CAGGAAAGTGTTCAATGGTTACAAAACAGGAGAGAGAGAGAGAGAGAGTGTGTCACTGGTTCTGTCATCTACCATCCTT | (CT)n | 79 |
| VVIv37-NGS | ACCAGTATTAAGAACGCAGTCACTGCCCACAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGATGGGGTGAGTGGGAAGTTAAGAGTAGGG | (TC)n(GT)n | 101 |
| VVMD24-NGS | AGAAGACTTGTCTCTCTCAATCAAATTGTGGTCCTCCTCTCTCTCTCTCTCTCTCTCTCTCTACTACTGCATATCATTGATAGTCCTTGTCTCAATTTCTTTGCG | (CT)n | 105 |
| VVMD25-NGS | TGAAAAGTGTAGTGACCCTTTGACTAGGCCTCCCTTCTCTCTCTCTCTCTCTCTCATGTTTATGTTATTTATTGTTTTTTTCCTTGAAACCACAAGACAAGCCTCCA | (CT)n | 107 |
| VVMD27-NGS | CCTCTCTCTCCGGCGGTATTCTCAATCTCCCTCCTCCTTCCGCCCAAGTTGAGGTCTCTCTCTCTCTCTCTCTCTCTCTATTTATATACTTACGGATGTATTCAGATCTGGT | (CT)n | 112 |
| VVMD32-NGS | TGAAACGTCTCGCCATTACCCCTCCCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCAAGCCAGGCGTCAAAACATGAACTGTTTGTC | (CT)n | 109 |
| VVMD7-NGS | CCTCAAGCAGCGTATCCATAGCGAGTGGAGGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGTGAGCGCCAAAGAGAGAGGGAGGAGGG | (CT)n | 88 |
Figure 1Workflow to amplify short sequence repeats in high-throughput sequencing (HTS) analysis. Amplifying begins with locus-specific amplification (step 1) using locus-specific forward (F) and reverse (R) primers extended with universal tails (Table 2); tail 1 (for F primer) is AATTAACCCT, tail 2 (for R primer) is CAGTCGGGCG. In step 2, the loci are pooled by sample and re-amplified to integrate the barcoding primers (BC-F, BC-R) listed in Supplementary Material, Figure S1.
Table of the simple sequence repeat (SSR) locus-specific primers with universal tail (letters in bold), linkage group, and reference.
| SSR Name | Linkage | Locus Specific Forward Primer | Locus Specific Reverse Primer | Reference |
|---|---|---|---|---|
| VMC1b11-NGS | 8 | BV681754 | ||
| VrZAG79-NGS | 5 | [ | ||
| VVIb01-NGS | 2 | [ | ||
| VVIn73-NGS | 17 | [ | ||
| VVIp31-NGS | 19 | [ | ||
| VVIq52-NGS | 9 | [ | ||
| VVIv37-NGS | 10 | [ | ||
| VVMD24-NGS | 14 | [ | ||
| VVMD25-NGS | 11 | [ | ||
| VVMD27-NGS | 5 | [ | ||
| VVMD32-NGS | 4 | [ | ||
| VVMD7-NGS | 7 | [ |
Sequencing statistics for 96 grapevine cultivars over 12 loci.
| Locus | Reference Allele Length | Mapped Reads 1 | Amount of Data [bp] after Mapping 2 | Average Coverage after Mapping | No. of Sequences after Filtering 3 | Amount of Data (bp) after Demultiplexing | Average Coverage after Filtering |
|---|---|---|---|---|---|---|---|
| VMC1b11 | 106 | 3,649,804 | 551,120,404 | 2,888,215 | 2,927,098 | 305,691,052 | 2,883,878 |
| VrZAG79 | 102 | 1,192,832 | 180,117,632 | 1,148,842 | 1,694,202 | 156,444,090 | 1,533,766 |
| VVIb01 | 87 | 2,713,933 | 409,803,883 | 2,712,447 | 2,444,143 | 223,916,517 | 2,573,753 |
| VVIn73 | 86 | 1,085,003 | 163,835,453 | 1,084,374 | 957,723 | 81,434,445 | 946,912 |
| VVIp31 | 101 | 1,963,722 | 296,522,022 | 1,819,017 | 1,447,397 | 149,040,267 | 1,475,646 |
| VVIq52 | 79 | 792,078 | 119,603,778 | 791,626 | 704,921 | 56,007,493 | 708,956 |
| VVIv37 | 101 | 1,119,484 | 169,042,084 | 1,022,452 | 1,264,019 | 115,785,796 | 1,146,394 |
| VVMD24 | 105 | 1,813,906 | 273,899,806 | 1,810,404 | 1,588,685 | 162,085,558 | 1,543,672 |
| VVMD25 | 107 | 2,348,776 | 354,665,176 | 2,192,870 | 2,000,291 | 221,586,577 | 2,070,903 |
| VVMD27 | 112 | 2,911,241 | 439,597,391 | 2,895,057 | 2,468,380 | 277,195,183 | 2,474,957 |
| VVMD32 | 109 | 1,556,828 | 235,081,028 | 1,163,432 | 1,037,861 | 92,983,922 | 853,064 |
| VVMD7 | 88 | 949.509 | 143,375,859 | 937,902 | 899,408 | 75,335,819 | 856,089 |
| Total | 22,097,116 | 3,336,664,516 | 19,434,128 | 1.917,506,719 |
1 Raw reads mapped to reference alleles using CLC genomics Workbench/Server. 2 Total number reported with unaligned part included. 3 Full length sequences starting and ending with amplification primer.
Figure 2Tree construction based on simple-matching dissimilarity coefficient and the weighted neighbour-joining clustering method using alleles (A) obtained by capillary electrophoresis (CE) analysis (B) and by HTS (Infoseq) analysis. The numbers on the branches indicate the percentage of bootstrap analysis (1000).
Figure 3Example of the low sequence coverage for long alleles in the cultivar Furmint at locus VVMD27, allele 125 bp, and in cultivar Mourverde at locus VVMD25, allele 131 bp.
Figure 4Example of the triallelic profile of cultivar Mourverde at locus VrZag79.
Figure 5Example of the intense amplification of stutter bands at locus VVMD7 for two cultivars, Furmint and Mourverde.