| Literature DB >> 34207415 |
Matthew Ordidge1, Suzanne Litthauer2, Edward Venison1, Marine Blouin-Delmas3, Felicidad Fernandez-Fernandez2, Monika Höfer4, Christina Kägi5, Markus Kellerhals6, Annalisa Marchese7, Stephanie Mariette8, Hilde Nybom9, Daniela Giovannini10.
Abstract
The objective of our study was the alignment of microsatellite or simple sequence repeat (SSR) marker data across germplasm collections of cherry within Europe. Through the European Cooperative program for Plant Genetic Resources ECPGR, a number of European germplasm collections had previously been analysed using standard sets of SSR loci. However, until now these datasets remained unaligned. We used a combination of standard reference genotypes and ad-hoc selections to compile a central dataset representing as many alleles as possible from national datasets produced in France, Great Britain, Germany, Italy, Sweden and Switzerland. Through the comparison of alleles called in data from replicated samples we were able to create a series of alignment factors, supported across 448 different allele calls, that allowed us to align a dataset of 2241 SSR profiles from six countries. The proportion of allele comparisons that were either in agreement with the alignment factor or confounded by null alleles ranged from 67% to 100% and this was further improved by the inclusion of a series of allele-specific adjustments. The aligned dataset allowed us to identify groups of previously unknown matching accessions and to identify and resolve a number of errors in the prior datasets. The combined and aligned dataset represents a significant step forward in the co-ordinated management of field collections of cherry in Europe.Entities:
Keywords: Prunus avium; SSR; genetic resource; germplasm; microsatellite; sweet cherry
Year: 2021 PMID: 34207415 PMCID: PMC8235247 DOI: 10.3390/plants10061243
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Summary of allelic comparisons from alignment of two phases of GB national data.
| Locus | Allele Calls Compared | Alignment Factor (bp) | Allele Calls in Agreement with Alignment Factor (%) | Null Alleles (%) | Range of Error from Alignment Factor (bp) |
|---|---|---|---|---|---|
| CPPCT022 | 75 | 0 | 81% | 13% | 1 |
| CPPCT006 | 84 | 0 | 99% | 1% | n/a 2 |
| EMPaS02 | 78 | −3 or −4 1 | 90% | 8% | −1 |
| BPPCT037 | 75 | 1 | 77% | 3% | −1 |
| EMPaS06 | 72 | 0 | 89% | 4% | −1 to 1 |
| EMPa004 | 83 | 0 | 90% | 10% | n/a |
| EMPa017 | 68 | 0 | 82% | 9% | 1 |
| EMPa018 | 72 | 0 | 94% | 6% | n/a |
| EMPaS12 | 74 | -8 | 95% | 3% | 1 or −7 |
| EMPaS14 | 80 | 0 | 83% | 16% | 1 |
1 An allele-range-specific alignment factor was used for EMPaS02. 2 n/a: not applicable.
Summary of allelic comparisons from alignment of national and central data.
| Country | Allele Calls 1 | EMPa002 | CPSCT038 | CPPCT022 | CPPCT006 | BPPCT034 | EMPaS02 | PAV-Rf-SSR | BPPCT037 | EMPaS06 | EMPaS12 | EMPaS14 | EMPa004 | EMPa018 | EMPa017 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| France | Compared | 41 | n/a | n/a | n/a | 57 | 55 | n/a | n/a | 49 | 51 | 50 | 53 | 50 | 44 |
| In agreement (%) | 90% | n/a | n/a | n/a | 79% | 95% | n/a | n/a | 90% | 78% | 88% | 94% | 94% | 89% | |
| Null (%) | 10% | n/a | n/a | n/a | 21% | 5% | n/a | n/a | 6% | 18% | 6% | 6% | 6% | 7% | |
|
|
|
|
|
|
|
|
|
|
| ||||||
| Germany | Compared | 72 | n/a | 78 | 82 | n/a | 73 | n/a | 83 | 79 | 77 | 76 | n/a | n/a | 57 |
| In agreement (%) | 99% | n/a | 97% | 99% | n/a | 100% | n/a | 94% | 95% | 97% | 99% | n/a | n/a | 84% | |
| Null (%) | 1% | n/a | 0% | 1% | n/a | 0% | n/a | 1% | 0% | 1% | 0% | n/a | n/a | 2% | |
|
|
|
|
|
|
|
|
|
|
| ||||||
| Great Britain | Compared | 65 | 64 | 77 | 81 | 78 | 83 | 61 | 80 | 81 | 80 | 78 | 79 | 76 | 67 |
| In agreement (%) | 80% | 61% | 86% | 91% | 74% | 85% | 46% | 85% | 96% | 97% | 95% | 96% | 97% | 72% | |
| Null (%) | 18% | 36% | 4% | 3% | 24% | 5% | 52% | 3% | 0% | 1% | 1% | 4% | 3% | 6% | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| Italy | Compared | 39 | n/a | 44 | 50 | n/a | 53 | n/a | 54 | 55 | 60 | n/a | n/a | n/a | 48 |
| In agreement (%) | 97% | n/a | 84% | 78% | n/a | 87% | n/a | 69% | 78% | 82% | n/a | n/a | n/a | 77% | |
| Null (%) | 0% | n/a | 5% | 12% | n/a | 6% | n/a | 11% | 5% | 8% | n/a | n/a | n/a | 2% | |
|
|
|
|
|
|
|
|
|
| |||||||
| Sweden | Compared | 41 | n/a | 34 | 39 | n/a | n/a | n/a | 43 | n/a | 39 | n/a | n/a | n/a | n/a |
| In agreement (%) | 78% | n/a | 91% | 97% | n/a | n/a | n/a | 65% | n/a | 62% | n/a | n/a | n/a | n/a | |
| Null (%) | 12% | n/a | 9% | 3% | n/a | n/a | n/a | 16% | n/a | 8% | n/a | n/a | n/a | n/a | |
|
|
|
|
|
|
| ||||||||||
| Switzerland | Compared | n/a | n/a | 15 | 12 | n/a | 15 | n/a | 14 | 14 | 14 | 13 | n/a | n/a | n/a |
| In agreement (%) | n/a | n/a | 67% | 83% | n/a | 100% | n/a | 93% | 86% | 93% | 69% | n/a | n/a | n/a | |
| Null (%) | n/a | n/a | 0% | 0% | n/a | 0% | n/a | 7% | 0% | 0% | 0% | n/a | n/a | n/a | |
|
|
|
|
|
|
|
|
|
1 Allele calls—’Compared’ indicates the total number of calls compared across all samples replicated between the national and central data (including calls marked in one or other dataset as null). n/a: not applicable (data not available).
Summary statistics from allele frequency analysis of unique diploid samples after alignment.
| Locus | Allele no. | N | HO | HE | PIC | NE-I | NE-SI |
|---|---|---|---|---|---|---|---|
| EMPa002 | 15 | 844 | 0.47 | 0.46 | 0.37 | 0.38 | 0.62 |
| CPSCT038 | 6 | 209 | 0.52 | 0.54 | 0.49 | 0.26 | 0.54 |
| CPPCT022 | 17 | 1111 | 0.66 | 0.68 | 0.62 | 0.16 | 0.45 |
| CPPCT006 | 22 | 1122 | 0.73 | 0.75 | 0.70 | 0.11 | 0.40 |
| BPPCT034 | 19 | 376 | 0.76 | 0.74 | 0.70 | 0.10 | 0.41 |
| EMPaS02 | 17 | 1252 | 0.76 | 0.80 | 0.77 | 0.07 | 0.37 |
| PAV-Rf-SSR | 7 | 120 | 0.74 | 0.73 | 0.70 | 0.11 | 0.41 |
| BPPCT037 | 22 | 1125 | 0.80 | 0.80 | 0.78 | 0.07 | 0.37 |
| EMPaS06 | 16 | 1248 | 0.83 | 0.84 | 0.82 | 0.05 | 0.34 |
| EMPaS12 | 17 | 1275 | 0.78 | 0.78 | 0.74 | 0.09 | 0.38 |
| EMPaS14 | 13 | 1107 | 0.63 | 0.58 | 0.50 | 0.25 | 0.52 |
| EMPa004 | 10 | 423 | 0.78 | 0.71 | 0.66 | 0.14 | 0.43 |
| EMPa018 | 11 | 425 | 0.60 | 0.65 | 0.61 | 0.16 | 0.47 |
| EMPa017 | 11 | 855 | 0.35 | 0.37 | 0.35 | 0.42 | 0.67 |
| Mean | 14.5 | 821 | 0.67 | 0.67 | 0.63 | 0.17 | 0.46 |
N: Number of individuals typed; HO: Observed homozygosity; HE: Expected heterozygosity; PIC: Polymorphic information content; NE-I: Average non-exclusion probability for identity of two unrelated individuals; NE-SI: Average non-exclusion probability for identity of two siblings.
Total allele number from the whole dataset (including polyploid samples) after alignment.
| Locus | Allele No. |
|---|---|
| EMPa002 | 30 |
| CPSCT038 | 12 |
| CPPCT022 | 29 |
| CPPCT006 | 27 |
| BPPCT034 | 32 |
| EMPaS02 | 23 |
| PAV-Rf-SSR | 7 |
| BPPCT037 | 30 |
| EMPaS06 | 22 |
| EMPaS12 | 28 |
| EMPaS14 | 27 |
| EMPa004 | 15 |
| EMPa018 | 13 |
| EMPa017 | 17 |
Figure 1Summarised process for the alignment of datasets from national collections. Generic stages in the alignment process are listed in arrows with detail specific to our datasets and process expanded in the accompanying bullets.