| Literature DB >> 29166845 |
Wolfgang Arthofer1, Carina Heussler1, Patrick Krapf1, Birgit C Schlick-Steiner1, Florian M Steiner1.
Abstract
Small, isolated populations are constantly threatened by loss of genetic diversity due to drift. Such situations are found, for instance, in laboratory culturing. In guarding against diversity loss, monitoring of potential changes in population structure is paramount; this monitoring is most often achieved using microsatellite markers, which can be costly in terms of time and money when many loci are scored in large numbers of individuals. Here, we present a case study reducing the number of microsatellites to the minimum necessary to correctly detect the population structure of two Drosophila nigrosparsa populations. The number of loci was gradually reduced from 11 to 1, using the Allelic Richness (AR) and Private Allelic Richness (PAR) as criteria for locus removal. The effect of each reduction step was evaluated by the number of genetic clusters detectable from the data and by the allocation of individuals to the clusters; in the latter, excluding ambiguous individuals was tested to reduce the rate of incorrect assignments. We demonstrate that more than 95% of the individuals can still be correctly assigned when using eight loci and that the major population structure is still visible when using two highly polymorphic loci. The differences between sorting the loci by AR and PAR were negligible. The method presented here will most efficiently reduce genotyping costs when small sets of loci ("core sets") for long-time use in large-scale population screenings are compiled.Entities:
Keywords: Drosophila nigrosparsa; genetic drift; genetic monitoring; loss of genetic variation; microsatellite markers; population genetics; population structure
Mesh:
Year: 2017 PMID: 29166845 PMCID: PMC5927656 DOI: 10.1080/19336934.2017.1396400
Source DB: PubMed Journal: Fly (Austin) ISSN: 1933-6934 Impact factor: 2.160
Figure 1.Allelic and Private Allelic Richness of populations Kaserstattalm (K) and Pfitscherjoch (P) in Generation 0 and Generation 5. Pairwise comparisons by two-sided t-tests are indicated by squared brackets. Significant differences (α = 0.05) are indicated by an asterisk.
Results from Analyses of Molecular Variance and F-Statistics from various combinations of populations. df … degrees of freedom; reg … regions; pop … populations; ind … individuals; FST … fixation index of subpopulation compared with total population; FIT … inbreeding coefficient of individuals relative to total population; FIS … inbreeding coefficient of individuals relative to subpopulation; K0, K5 … populations from Kaserstattalm at Generation 0 and Generation 5; P0, P5 … populations from Pfitscherjoch at Generation 0 and Generation 5. All … all hierarchy levels (populations and generations).
| AMOVA | Fixation indices | |||||
|---|---|---|---|---|---|---|
| Sample | Source of variation | df | % Variation | Index | Value | |
| All | Among reg | 1 | 1.37 | FST | 0.02 | 0.01 |
| Among pop | 2 | 1.07 | FIS | 0.15 | 0.01 | |
| Among ind | 554 | 14.82 | FIT | 0.17 | 0.01 | |
| Within ind | 558 | 82.74 | ||||
| Total | 1115 | 100.00 | ||||
| K0 & K5 | Among pop | 1 | 0.87 | FST | 0.01 | 0.01 |
| Among ind | 277 | 18.60 | FIS | 0.19 | 0.01 | |
| Within ind | 279 | 80.53 | FIT | 0.19 | 0.01 | |
| Total | 557 | 100.00 | ||||
| P0 & P5 | Among pop | 1 | 1.27 | FST | 0.01 | 0.01 |
| Among ind | 277 | 11.38 | FIS | 0.12 | 0.01 | |
| Within ind | 279 | 87.35 | FIT | 0.13 | 0.01 | |
| Total | 557 | 100.00 | ||||
| K0 & P0 | Among pop | 1 | 0.01 | FST | 0.00 | 0.48 |
| Among ind | 60 | 11.19 | FIS | 0.11 | 0.01 | |
| Within ind | 62 | 88.80 | FIT | 0.11 | 0.01 | |
| Total | 123 | 100.00 | ||||
| K5 & P5 | Among pop | 1 | 2.71 | FST | 0.03 | 0.01 |
| Among ind | 494 | 14.56 | FIS | 0.15 | 0.01 | |
| Within ind | 496 | 82.73 | FIT | 0.17 | 0.01 | |
| Total | 991 | 100.00 | ||||
Figure 2.STRUCTURE results of the populations Kaserstattalm (K) and Pfitscherjoch (P) in Generation 0 and Generation 5. K = 2 was the most probable number of clusters. Loss of different alleles due to genetic drift during five generations laboratory rearing lead to increasing population differentiation.
Sequence of locus removal for microsatellite data on population Kaserstattalm in Generation 5. Loci were sorted from lowest to highest Allelic Richness (AR) and Private Allelic Richness (PAR) and sequentially removed. The number of alleles remaining and the value suggested for the best K using the method of Evanno et al. (2005) are given. In AR, locus DN16 (AR = 15.96) remained as last marker, in PAR, locus DN41 (PAR = 1.38).
| Removal by AR value | Removal by PAR value | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| n loci | removed locus | AR of removed locus | n alleles in dataset | best K | n loci | removed locus | PAR of removed locus | n alleles in dataset | best K |
| 11 | none | — | 193 | 2 | 11 | none | — | 193 | 2 |
| 10 | DN40 | 7.73 | 174 | 2 | 10 | DN40 | 0.19 | 174 | 2 |
| 9 | DN45 | 9.13 | 159 | 2 | 9 | DN31 | 0.27 | 158 | 2 |
| 8 | DN35 | 9.83 | 135 | 2 | 8 | DN45 | 0.32 | 143 | 2 |
| 7 | DN49 | 10.24 | 121 | 2 | 7 | DN48 | 0.33 | 122 | 2 |
| 6 | DN48 | 10.64 | 100 | 2 | 6 | DN36 | 0.59 | 110 | 2 |
| 5 | DN31 | 11.19 | 84 | 2 | 5 | DN39 | 0.59 | 94 | 2 |
| 4 | DN41 | 12.12 | 69 | 2 | 4 | DN35 | 0.74 | 70 | 2 |
| 3 | DN39 | 12.90 | 53 | 2 | 3 | DN16 | 0.75 | 47 | 2 |
| 2 | DN36 | 12.94 | 41 | 2 | 2 | DN49 | 0.92 | 33 | 2 |
| 1 | DN37 | 13.54 | 23 | 7 | 1 | DN37 | 1.25 | 15 | 4 |
Figure 3.STRUCTURE results after sequential removal of loci, sorted according to Allelic (AR) and Private Allelic Richness (PAR). Plots in which the estimate for the best K deviated from 2 are printed in grey. While the separation quality gradually deteriorated, the major population structure was still visible when only the two loci with highest AR or PAR are used.
Figure 4.Based on the STRUCTURE results, individuals genotyped at one to ten loci were assigned to two clusters. Individuals with intermediate probabilities to belong to one cluster were excluded using a variable threshold x; x = 0 results in no exclusions, x = 0.5 excludes all individuals. The cluster assignment with 11 loci was used as a benchmark. Each individual could be either correctly assigned (i. e., in the same cluster as with 11 loci; upper row of plots), incorrectly assigned (i.e., in the other cluster than with 11 loci; middle row), or excluded from assignment based on the threshold value (lower row). High thresholds minimize incorrect assignments at the cost of many excluded individuals.