| Literature DB >> 32431749 |
Ramiro Morales-Hojas1, Asier Gonzalez-Uriarte2, Fernando Alvira Iraizoz1, Todd Jenkins1, Lynda Alderson1, Tracey Kruger1, Mike J Hall1, Alex Greenslade1, Chris R Shortall1, James R Bell1.
Abstract
Genetic diversity is the determinant for pest species' success and vector competence. Understanding the ecological and evolutionary processes that determine the genetic diversity is fundamental to help identify the spatial scale at which pest populations are best managed. In the present study, we present the first comprehensive analysis of the genetic diversity and evolution of Rhopalosiphum padi, a major pest of cereals and a main vector of the barley yellow dwarf virus (BYDV), in England. We have used a genotyping-by-sequencing approach to study whether (a) there is any underlying population genetic structure at a national and regional scale in this pest that can disperse long distances; (b) the populations evolve as a response to environmental change and selective pressures; and (c) the populations comprise anholocyclic lineages. Individual R. padi were collected using the Rothamsted Insect Survey's suction-trap network at several sites across England between 2004 and 2016 as part of the RIS long-term nationwide surveillance. Results identified two genetic clusters in England that mostly corresponded to a North-South division, although gene flow is ongoing between the two subpopulations. These genetic clusters do not correspond to different life cycle types, and cyclical parthenogenesis is predominant in England. Results also show that there is dispersal with gene flow across England, although there is a reduction between the northern and southern sites with the south-western population being the most genetically differentiated. There is no evidence for isolation by distance and other factors such as primary host distribution, uncommon in the south and absent in the south-west, could influence the dispersal patterns. Finally, results also show no evidence for the evolution of the R. padi population, and it is demographically stable despite the ongoing environmental change. These results are discussed in view of their relevance to pest management and the transmission of BYDV.Entities:
Keywords: DNA extraction; aphids; archive insect samples; barley yellow dwarf virus; genomics; temporal data analysis
Year: 2020 PMID: 32431749 PMCID: PMC7232763 DOI: 10.1111/eva.12917
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 5.183
Figure 1Map of Great Britain showing the location of the suction traps from which the samples were used
Summary of number of samples used for DNA extraction, kits used and the mean amount of DNA obtained per aphid
| Sites | 2004 | 2007 | 2010 | 2013 | 2016 | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| B&T | MK | B&T | MK | B&T | MK | B&T | MK | B&T | MK | |
| Starcross | ‐ | 10 | ‐ | 10 | 20 | 10 | 20 | 10 | 26 | ‐ |
| Wye | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | 15 | ‐ |
| Preston | ‐ | ‐ | ‐ | ‐ | 10 | ‐ | 14 | ‐ | 16 | ‐ |
| Newcastle | ‐ | 10 | ‐ | 10 | 10 | 10 | 10 | 10 | 15 | 5 |
| Hereford | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | 25 |
| Writtle | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | 23 |
| York | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | 27 |
| Mean DNA yield (ng) | ‐ | 182.50 | ‐ | 280.87 | 57.86 | 149.78 | 34.19 | 160.56 | 325.38 | 433.04 |
| Standard deviation | ‐ | 125.07 | ‐ | 337.54 | 81.86 | 181.35 | 50.55 | 261.73 | 303.93 | 243.52 |
Abbreviations: B&T, Qiagen's Blood & Tissue Kit; MK, Qiagen's DNA Micro Kit.
Number of samples genotyped using GBS per population and year; the number of successfully sequenced samples is shown in brackets
| Starcross | Newcastle | Preston | Wye | Writtle | York | |
|---|---|---|---|---|---|---|
| 2016 | 15 (15) | 15 (15) | 15 (15) | 15 (15) | 10 (10) | 10 (10) |
| 2013 | 19 (13) | 19 (11) | 10 (2) | ‐ | ‐ | ‐ |
| 2010 | 20 (16) | 16 (14) | 10 (5) | ‐ | ‐ | ‐ |
| 2007 | 10 (10) | 9 (7) | ‐ | ‐ | ‐ | ‐ |
| 2004 | 9 (9) | 8 (8) | ‐ | ‐ | ‐ | ‐ |
Genome assembly statistics of the Illumina assembly of Thorpe et al. (2018) and the short‐ and long‐read hybrid assembly obtained with MaSuRCA
| Illumina genome | MaSuRCA genome | |
|---|---|---|
| Assembly size (Mb) | 319 | 321 |
| Scaffolds | 15,616 | 2,172 |
| Scaffold N50 (bp) | 116,185 | 652,723 |
| Longest scaffold (bp) | 616,405 | 4,088,110 |
|
| 54,488 | 0 |
| GC (%) | 27.8 | 27.8 |
| BUSCO (complete, duplicated, fragmented, missing) | 82%, 8.1%, 7.8%, 9.4% |
Insecta: 93.9%, 3.6%, 1.4%, 4.7% Arthropoda: 96.8%, 3.1%, 0.7%, 2.5% |
| Genes | 26, 286 | 26,535 |
| BlastP hit | 20,368 (77%) | 20,481 (77%) |
Sequential steps done in the filtering schemes (FS) that provided the best data set to be used in subsequent population analyses. The order of rows indicates the sequential filters applied to the data
| Filter | All samples | Samples Newcastle | Samples Starcross |
|---|---|---|---|
| FS7 | FS2 | FS2 | |
| Missing data |
max‐missing > 50% remove‐indels |
remove‐indels max‐missing > 50% |
remove‐indels max‐missing > 25% |
| Low‐confidence SNP call |
mac > 3 minQ > 20 minDP > 3 |
min‐meanDP > 5 mac > 3 minQ > 20 |
min‐meanDP > 5 mac > 3 minQ > 20 |
| Missing data |
imiss < 60% max‐missing > 90% |
imiss < 95% max‐missing > 80% |
imiss < 95% max‐missing > 70% |
| Low‐confidence SNP call | minDP > 5 | ||
| Missing data | max‐missing > 95% | ||
| additional filters | Thin 2000 | Thin 2000 | Thin 2000 |
| SNPs | 4,802 | 3,186 | 907 |
| Individuals | 86 | 36 | 31 |
minDP—includes only genotypes greater or equal to the value; min‐meanDP—retains loci with a minimum mean depth of the given value; minQ—includes sites with quality above the value; mac—includes sites with minor allele count greater or equal to the value; max‐missing—retains loci that have been successfully genotyped in the given proportion of individuals; imiss—retains individuals with a proportion of missing data smaller than the value; thin—removes loci that are closer than the given number of bp.
Figure 2Distribution of the missing data per individual (a) and locus (b) in all samples after applying the filter scheme 7 (FS7). Vertical dashed lines in red correspond to the mean missing data
Genetic diversity estimates for the six sampling locations; all the samples; north and south populations; and the genetic clusters (GCs) identified by Structure analyses
|
|
| π | θw | Ho | He |
|
|
| |
|---|---|---|---|---|---|---|---|---|---|
| Starcross | 24 | 24 | 0.20 | 0.20 | 0.20 | 0.34 |
| 0.004 | 1.1 |
| Wye | 30 | 30 | 0.25 | 0.20 | 0.26 | 0.30 |
| 0.796 | 0.831 |
| Writtle | 20 | 20 | 0.21 | 0.18 | 0.08 | 0.34 |
| 0.738 | 1.610 |
| Newcastle | 54 | 54 | 0.19 | 0.20 | 0.14 | 0.25 |
| −0.062 | −1.237 |
| Preston | 24 | 24 | 0.19 | 0.21 | 0.16 | 0.29 |
| −0.266 | 1.076 |
| York | 20 | 20 | 0.18 | 0.15 | 0.11 | 0.33 |
| 0.656 | 1.409 |
| All | 172 | 172 | 0.21 | 0.17 | 0.13 | 0.24 |
| 0.80 |
|
| South | 74 | 74 | 0.22 | 0.18 | 0.17 | 0.27 |
| 0.82 | −2.32 |
| North | 98 | 98 | 0.19 | 0.19 | 0.12 | 0.23 |
| 0.15 | −4.85 |
| GC North | 120 | 120 | 0.19 | 0.17 | 0.12 | 0.23 |
| 0.49 |
|
| GC South | 52 | 52 | 0.21 | 0.17 | 0.23 | 0.31 |
| 0.81 | −0.97 |
N—number of gene copies (2 × number of individuals), H—number of haplotypes, π—nucleotide diversity (average proportion of pairwise differences over all loci), θw—nucleotide diversity (average of segregating sites across all loci), Ho—mean observed heterozygosity over all loci, He—mean expected heterozygosity (gene diversity) over all loci and FIS—population‐specific inbreeding coefficient (significant values at 5% are shown in italics), and D corresponds to Tajima's D and F to Fu's F tests (F significance is set to be ≤ 2% level as recommended in the original paper).
Figure 3Bar plot resulting from Structure analysis when K = 2 and sorted by population. The bars represent individuals, and the colour of the bars represents the probability of membership to a certain population
Hierarchical AMOVA results for genetic structure of Rhopalosiphum padi. (A) Two geographic clusters comprising individuals from the north (Newcastle, Preston and York) and the south (Starcross, Wye and Writtle); (B) two genetic clusters as determined by the Structure analyses
| Source of variation |
| Sum of squares | Variance components | % variation | Fixation indices |
|
|---|---|---|---|---|---|---|
| (A) | ||||||
| Among north and south | 1 | 2,704.904 | 19.832 Va | 3.74 |
| .1 |
| Among locations within north and south | 4 | 3,802.365 | 17.077 Vb | 3.22 |
| 0 |
| Within locations | 166 | 82,101.888 | 494.041 Vb | 93.05 |
| 0 |
| (B) | ||||||
| Among genetic clusters | 1 | 8,037.878 | 99.332 Va | 17.20 |
| .001 |
| Among locations within clusters | 9 | 6,788.132 | 20.589 Vb | 3.56 |
| 0 |
| Within locations | 161 | 73,692.147 | 457.715 Vc | 79.24 |
| 0 |
Genetic differentiation between the sampled populations. Population pairwise F ST values are shown below the diagonal. Significant F ST values are shown in italics
| Newcastle | Preston | York | Starcross | Wye | Writtle | |
|---|---|---|---|---|---|---|
| Newcastle | ‐ | |||||
| Preston | 0.010 | ‐ | ||||
| York |
|
| ‐ | |||
| Starcross |
|
|
| ‐ | ||
| Wye |
|
|
| 0.005 | ‐ | |
| Writtle |
|
|
|
|
| ‐ |
Figure 4Midpoint‐rooted ML phylogenetic tree of haplotypes from the FS7 data set. Branch line weight relates to bootstrap support, and wider line corresponds to a bootstrap > 90%. Clades representing the genetic clusters identified with Structure are highlighted with different colours (blue—northern cluster, red—southern cluster). Haplotypes from individuals collected in the southern sites are in red, and those from the north are in blue
Pairwise F ST values between samples from Newcastle (A) and Starcross (B) collected in different years. Significant values are shown in italics. N shows the number of individuals in each year in the FS2 data set
|
| 2016 | 2013 | 2010 | 2007 | 2004 | |
|---|---|---|---|---|---|---|
| (A) | ||||||
| 2016 | 15 | ‐ | ||||
| 2013 | 2 | −0.481 | ‐ | |||
| 2010 | 8 | −0.037 | −0.284 | ‐ | ||
| 2007 | 5 | 0.008 | −0.228 | 0.069 | ‐ | |
| 2004 | 6 | −0.201 | 0.084 | 0.0005 | 0.005 | ‐ |
| (B) | ||||||
| 2016 | 14 | ‐ | ||||
| 2013 | 5 | −0.095 | ‐ | |||
| 2010 | 7 | −1.301 | −2.098 | ‐ | ||
| 2004 | 5 | −0.727 | −0.457 | −0.602 | ‐ | |
Genetic differentiation (F ST) between samples collected in spring, summer and autumn in the north (A) and south (B) locations. Values in italics are significant. N shows the number of samples
|
| Spring | Summer | Autumn | |
|---|---|---|---|---|
| (A) | ||||
| Spring | 9 | ‐ | ||
| Summer | 18 |
| ‐ | |
| Autumn | 12 |
|
| ‐ |
| (B) | ||||
| Spring | 9 | ‐ | ||
| Summer | 8 | −0.005 | ‐ | |
| Autumn | 10 |
|
| ‐ |
Figure 5Plot of the genetic diversity calculated for the different regions and populations at different times of the season
Number and proportion of flying females collected in the Rothamsted suction trap between 16 September and 14 November in 2004, 2007, 2010, 2013 and 2016 that were virginoparae and gynoparae
| Year | 2004 | 2007 | 2010 | 2013 | 2016 | All |
|---|---|---|---|---|---|---|
| Virginoparae | 35 | 96 | 10 | 13 | 29 | 183 |
| Gynoparae | 312 | 82 | 398 | 125 | 20 | 937 |
| Total | 347 | 178 | 408 | 138 | 49 | 1,120 |
| % virginoparae | 10.1 | 53.9 | 2.4 | 9.4 | 59.2 | 16.3 |