| Literature DB >> 35287164 |
Alexander T Sentinella1, Angela T Moles1, Jason G Bragg1,2, Maurizio Rossetto2, William B Sherwin1.
Abstract
Accurately detecting sudden changes, or steps, in genetic diversity across landscapes is important for locating barriers to gene flow, identifying selectively important loci, and defining management units. However, there are many metrics that researchers could use to detect steps and little information on which might be the most robust. Our study aimed to determine the best measure/s for genetic step detection along linear gradients using biallelic single nucleotide polymorphism (SNP) data. We tested the ability to differentiate between linear and step-like gradients in genetic diversity, using a range of diversity measures derived from the q-profile, including allelic richness, Shannon Information, GST, and Jost-D, as well as Bray-Curtis dissimilarity. To determine the properties of each measure, we repeated simulations of different intensities of step and allele proportion ranges, with varying genome sample size, number of loci, and number of localities. We found that alpha diversity (within-locality) based measures were ineffective at detecting steps. Further, allelic richness-based beta (between-locality) measures (e.g., Jaccard and Sørensen dissimilarity) were not reliable for detecting steps, but instead detected departures from fixation. The beta diversity measures best able to detect steps were: Shannon Information based measures, GST based measures, a Jost-D related measure, and Bray-Curtis dissimilarity. No one measure was best overall, with a trade-off between those measures with high step detection sensitivity (GST and Bray-Curtis) and those that minimised false positives (a variant of Shannon Information). Therefore, when detecting steps, we recommend understanding the differences between measures and using a combination of approaches.Entities:
Year: 2022 PMID: 35287164 PMCID: PMC8920294 DOI: 10.1371/journal.pone.0265110
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of variables and symbols used in text.
| Variable | Definition |
|---|---|
| Variants of beta diversity measures based on whether values are averaged across loci before or after calculation of measure. | |
|
| Bray-Curtis. The average absolute value of differences in allele proportions across loci. |
|
| Distance along landscape, where |
|
0
| “ |
|
1
| “ |
|
2
| “ |
|
| Average |
|
| |
|
| |
|
| |
|
| |
|
| |
| One of the three Eqs ( | |
|
0
| “ |
|
1
| “ |
|
2
| “ |
|
| Average |
|
| |
|
0
| Jaccard dissimilarity, |
|
0
| Jaccard dissimilarity, |
|
0
| Sorenson dissimilarity, |
|
0
| Sorenson dissimilarity, |
|
1
| Mutual Information, |
|
1
| Mutual Information, |
|
1
| Shannon Differentiation, |
|
1
| Shannon Differentiation, |
|
2
| |
|
2
| |
|
2
| |
|
2
| |
|
| Number of localities sampled along landscape including |
|
| Total number of loci |
|
| Number of individual genomes sampled from each locality used to calculate measured allele proportion |
| The minor allele proportion at a biallelic locus (alternate allele is: 1 – | |
| Allele proportion at | |
|
| “Order” of the |
|
| The number of shared alleles between localities 1 and 2 |
|
| Relative Bray-Curtis. Diversity measure– |
| Number of alleles at a locus in locality 1, locality 2, and in the metapopulation containing both localities (for biallelic loci, this can only be 1 or 2). | |
|
| Intensity of step (0 –linear, 1 –gentle step, 5 –moderate step, 50 –steep step). Used as input to simulate gradient. |
Fig 1The experimental design used for this study.
Step detection sensitivity of all diversity measures for standard conditions across five allele proportion treatments (p = 0–1, p = 0.1–0.9, p = 0–0.5, p = 0–0.2, p = 0.3–0.5), each with four step treatments (linear– 0, gentle step– 1, moderate step– 5, steep step– 50).
Both true positives (where a step was detected at the correct location, d = 0.5) and false positives (where a step is detected at the wrong location or any location for the linear treatments) are shown. The remaining values (out of 100) for each treatment were simulations where no step was detected (true negatives for the linear treatments, false negatives for the step treatments). Darker blue represent higher numbers of true positives out of 100 simulations, darker red represents higher numbers of false positives out of 100 simulations.
| Maximal range: | Maximal range without fixation: | Half maximal range: | Narrow range near fixation: | Narrow range far from fixation: | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Measure | Type of Step | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 |
|
| |||||||||||||||||||||
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 | 3 | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 1 |
| False Positives | 0 |
|
| 0 | 0 | 0 | 1 | 1 | 0 |
|
| 0 |
|
|
| 0 | 0 | 0 | 0 | 0 | |
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 1 |
| False Positives | 0 |
|
| 0 | 0 | 0 | 1 | 1 | 0 |
|
| 0 |
|
|
| 0 | 0 | 0 | 0 | 0 | |
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 1 |
| False Positives | 0 |
|
| 0 | 0 | 0 | 1 | 1 | 0 |
|
| 0 |
|
|
| 0 | 0 | 0 | 0 | 0 | |
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 | 3 | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 1 |
| False Positives | 0 |
|
| 0 | 0 | 0 | 1 | 1 | 0 |
|
| 0 |
|
|
| 0 | 0 | 0 | 0 | 0 | |
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 | 2 | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 1 |
| False Positives | 0 | 0 |
| 0 | 0 | 0 | 1 | 1 | 0 |
|
| 0 |
|
|
| 0 | 0 | 0 | 0 | 0 | |
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 | 0 | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 0 |
| False Positives | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 0 |
|
| 0 | 0 | 0 | 0 | 0 | |
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 | 0 | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 1 |
| False Positives | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 |
|
| 0 | 7 |
|
| 0 | 0 | 0 | 0 | 0 | |
|
0
| True Positives | - | 0 | 0 |
| - | 0 | 0 | 2 | - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 | 1 |
| False Positives | 0 | 0 |
| 0 | 0 | 0 | 1 | 1 | 0 |
|
| 0 |
|
|
| 0 | 0 | 0 | 0 | 0 | |
|
| |||||||||||||||||||||
|
1
| True Positives | - | 0 |
|
| - | 6 |
|
| - | 0 |
|
| - |
|
|
| - | 1 |
|
|
| False Positives | 0 |
| 0 | 0 | 2 | 3 | 0 | 0 |
|
| 0 | 0 | 3 | 7 | 0 | 0 | 0 | 2 | 0 | 0 | |
|
1
| True Positives | - | 0 |
|
| - | 6 |
|
| - | 0 |
|
| - |
|
|
| - | 1 |
|
|
| False Positives | 0 |
| 0 | 0 | 2 | 3 | 0 | 0 |
|
| 0 | 0 | 3 | 7 | 0 | 0 | 0 | 2 | 0 | 0 | |
|
1
| True Positives | - | 0 |
|
| - | 7 |
|
| - | 0 |
|
| - |
|
|
| - | 1 |
|
|
| False Positives | 0 |
| 0 | 0 | 2 | 3 | 0 | 0 |
|
| 0 | 0 | 3 | 6 | 0 | 0 | 0 | 2 | 0 | 0 | |
|
1
| True Positives | - | 0 |
|
| - | 7 |
|
| - | 0 |
|
| - | 14 |
|
| - | 1 |
|
|
| False Positives | 0 |
| 0 | 0 | 2 | 3 | 0 | 0 |
|
| 0 | 0 | 3 | 6 | 0 | 0 | 0 | 2 | 0 | 0 | |
|
1
| True Positives | - | 0 |
|
| - | 0 |
|
| - | 0 |
|
| - | 0 | 0 |
| - | 0 | 0 |
|
| False Positives | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
1
| True Positives | - | 0 |
|
| - | 0 |
|
| - | 0 |
|
| - | 0 | 0 |
| - | 0 | 0 |
|
| False Positives | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
1
| True Positives | - | 0 |
|
| - | 0 |
|
| - | 0 |
|
| - | 0 | 0 |
| - | 0 | 0 |
|
| False Positives | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
1
| True Positives | - | 0 |
|
| - | 0 |
|
| - | 0 |
|
| - | 0 | 0 |
| - | 0 | 0 |
|
| False Positives | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
| |||||||||||||||||||||
|
2
| True Positives | - | 4 |
|
| - | 8 |
|
| - | 2 |
|
| - | 5 |
|
| - | 1 |
|
|
| False Positives | 0 |
| 0 | 0 | 3 | 3 | 0 | 0 |
|
| 0 | 0 | 1 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - | 0 |
|
| - | 0 | 0 |
| - | 1 |
|
|
| False Positives |
| 0 | 0 | 0 |
| 0 | 0 | 1 | 1 |
| 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - | 0 |
|
| - | 0 | 0 |
| - | 1 |
|
|
| False Positives |
| 0 | 0 | 0 |
| 0 | 0 | 1 | 0 |
| 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - | 0 |
|
| - | 0 | 0 |
| - | 1 |
|
|
| False Positives |
| 0 | 0 | 0 |
| 0 | 0 | 1 | 0 |
| 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | |
|
2
| True Positives | - | 0 |
|
| - | 0 |
|
| - | 0 | 0 |
| - | 0 | 0 |
| - | 0 | 0 |
|
| False Positives | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - | 0 |
|
| - | 0 | 0 |
| - | 6 |
|
|
| False Positives |
| 0 | 0 | 0 |
| 0 | 0 | 0 | 2 |
| 0 | 0 | 0 | 0 | 0 | 0 | 8 |
| 1 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - | 0 |
|
| - | 0 | 0 |
| - | 0 | 0 |
|
| False Positives | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - | 0 |
|
| - | 0 | 0 |
| - | 6 |
|
|
| False Positives |
| 0 | 0 | 0 |
| 0 | 0 | 0 | 3 |
| 0 | 0 | 0 | 0 | 0 | 0 |
|
| 2 | 0 | |
|
| |||||||||||||||||||||
| Bray-Curtis | True Positives | - |
|
|
| - |
|
|
| - | 1 |
|
| - | 0 | 1 |
| - | 2 |
|
|
| False Positives |
| 0 | 0 | 0 | 4 | 2 | 0 | 1 | 0 | 5 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | |
Fig 2Example of how true positives generally respond to the varied number of genomes (n), number of loci (L) and number of localities (K).
The dashed line indicates the values used in the standard treatments. These results were taken from an AvLast variant of a q = 1 measure under the moderate step treatment (step = 5) and the narrow range far from fixation allele proportion treatment (p = 0.3–0.5). For full results for the six chosen measures and treatments, see Supplement S3 File.
Step detection sensitivity, averaged over all optimal and suboptimal conditions, for the best six diversity measures (Mutual Information AvLast variant, Mutual Information AvFirst variant, G AvLast variant, G AvFirst variant, 2D, and Bray-Curtis) across five allele proportion treatments (p = 0–1, p = 0.1–0.9, p = 0–0.5, p = 0–0.2, p = 0.3–0.5), each with four step treatments (linear– 0, gentle step– 1, moderate step– 5, steep step– 50).
Both true positives (where a step was detected at the correct location, d = 0.5) and false positives (where a step is detected at the wrong location or any location for the linear treatments) are shown. Values are in percentages, averaged across all tested simulations (where genome sample size, number of loci and number of localities was varied. Remaining percentages (out of 100) for each treatment were simulations where no step was detected (true negatives for the linear treatments, false negatives for the step treatments). Darker green represents higher percentages of true positives, darker red represents higher percentages of false positives. Comprehensive data are shown in Supplement S3 File.
| Maximal range: | Maximal range without fixation: | Half maximal range: | Narrow range near fixation: | Narrow range far from fixation: | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Measure | Type of Step | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 | 0 | 1 | 5 | 50 |
|
1
| True Positives | - |
|
|
| - |
|
|
| - |
|
|
| - |
|
|
| - |
|
|
|
| False Positives | 3.8 |
| 0 | 0 | 1.5 | 3.5 | 0 | 0.2 |
|
| 1.2 | 0 |
|
| 0.2 | 0 | 0.8 | 0.7 | 0.2 | 0.2 | |
|
1
| True Positives | - |
|
|
| - |
|
|
| - |
|
|
| - | 1.2 |
|
| - | 0 |
|
|
| False Positives | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - |
|
|
| - |
|
|
| - |
|
|
|
| False Positives | 2.6 |
| 0 | 0 | 1.9 | 2.5 | 0 | 0.2 |
|
| 0.1 | 0 | 2 | 0.9 | 0.2 | 0 | 0.8 | 0.7 | 0.2 | 0.2 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - | 0 |
|
| - | 0 | 0 |
| - | 0 |
|
|
| False Positives | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
|
2
| True Positives | - |
|
|
| - |
|
|
| - |
|
|
| - | 0 | 1.1 |
| - | 0.4 |
|
|
| False Positives |
| 0 | 0 | 0 |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Bray-Curtis | True Positives | - |
|
|
| - |
|
|
| - |
|
|
| - | 1.9 |
|
| - |
|
|
|
| False Positives |
| 0.2 | 0 | 0 |
| 0.3 | 0 | 0.3 |
|
| 0.2 | 0 | 0 | 0.4 | 0.9 | 0.1 | 1 | 0.8 | 0.2 | 0.3 | |
Summary of properties of each of the six best candidate measures—Mutual Information (AvLast and AvFirst variants), GST (AvLast and AvFirst variants), 2Dβ.A (AvLast variant) and Bray-Curtis.
Note that we detail the properties of Mutual Information here, but the properties are the same for each of the other q = 1 measures. ‘True Positive Detections’ was calculated as the percentage of true positives across all step simulations. ‘False Positive Detections’ was calculated as the percentage of false positives across all simulations. ‘True Negative Detections’ was calculated as the percentage of true negatives across all linear simulations. Properties are shaded based on their usefulness as a step detection measure: most desirable properties (blue), undesirable properties (red).
| Diversity Measure | True Positive Detection | False Positive Detection | True Negative Detection | Effect of Allele Proportion Position | Rate of true positives with narrow steps |
|---|---|---|---|---|---|
|
1
| 65.0% | 5.3% | 94.9% | Largely unaffected | High |
|
1
| 51.9% | 0.0% | 100% | Largely unaffected | Moderate |
|
2
| 65.1% | 1.8% | 96.5% | Largely unaffected | High |
|
2
| 43.6% | 0% | 100% | Largely unaffected | Low |
|
2
| 57.2% | 0.8% | 97.0% | Strong effect | Low |
| Bray-Curtis | 65.8% | 1.6% | 95.8% | Affected. | High |