| Literature DB >> 35205227 |
Chao Chen1,2, Melanie Parejo3,4, Jamal Momeni5, Jorge Langa3, Rasmus O Nielsen5, Wei Shi1,2, Rikke Vingborg5, Per Kryger6, Maria Bouga7, Andone Estonba3, Marina Meixner8.
Abstract
BACKGROUND: Whole-genome sequencing has become routine for population genetic studies. Sequencing of individuals provides maximal data but is rather expensive and fewer samples can be studied. In contrast, sequencing a pool of samples (pool-seq) can provide sufficient data, while presenting less of an economic challenge. Few studies have compared the two approaches to infer population genetic structure and diversity in real datasets. Here, we apply individual sequencing (ind-seq) and pool-seq to the study of Western honey bees (Apis mellifera).Entities:
Keywords: Apis mellifera; diversity; pool-sequencing; population structure; whole-genome sequencing
Mesh:
Year: 2022 PMID: 35205227 PMCID: PMC8872436 DOI: 10.3390/genes13020182
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Sampling locations in Europe and adjacent regions plotted in R using ggplot2. M-lineage: ibe_esp_eus = A. m. iberiensis from Spain; mel_irl = A. m. mellifera from Ireland; mel_rus = A. m. mellifera from Russia; C-lineage: car_aut_hun = A. m. carnica from Austria and Hungary; rod_bgr = A. m. rodopica from Bulgaria; carp_rou_mda = A. m. carpatica from Romania and Moldova; mac_mkd_grc = A. m. macedonica from North Macedonia a Northern Greece; cec_grc = A. m. cecropia from Greece; ada_grc = A. m. adami from Crete, Greece; O-lineage: cyp_cyp = A. m. cypria from Cyprus; ana_tur = A. m. anatoliaca from Turkey; rem_arm = A. m. remipes from Armenia; cau_tur_geo = A. m. caucasia from North-East Turkey and Georgia; and A-lineage: rut_mlt = A. m. ruttneri from Malta. Locations of mel_irl samples are exemplary, as exact coordinates are unavailable.
Sample sizes and origin of the 14 populations used in this study.
| Lineage | Population | Subspecies | Country | Pool Sequencing Samples (N) | Individual Sequencing Samples (N) | Origin of Samples/References |
|---|---|---|---|---|---|---|
| M | ibe_esp_eus |
| Spain | 100 | 10 | Miguel et al., 2007 [ |
| mel_irl |
| Ireland | 100 | 10 | Hassett et al., 2018 [ | |
| mel_rus |
| Russia (Ural) | 100 | 10 | This study, Momeni et al., 2021 [ | |
| C | car_aut_hun |
| Austria & Hungary | 100 | 10 | This study, Momeni et al., 2021 [ |
| rod_bgr |
| Bulgaria | 95 | 10 | This study, Momeni et al., 2021 [ | |
| carp_rou_mda |
| Romania & Moldova | 90 | 10 | This study, Momeni et al., 2021 [ | |
| mac_mkd_grc |
| North Macedonia & N-Greece | 86 | 10 | This study, Uzunov et al., 2014 [ | |
| cec_grc |
| Greece | 93 | 10 | This study, Momeni et al., 2021 [ | |
| ada_grc |
| Greece (Crete) | 88 | 9 | This study, Momeni et al., 2021 [ | |
| O | cyp_cyp |
| Cyprus | 100 | 10 | This study, Momeni et al., 2021 [ |
| ana_tur |
| Turkey | 100 | 10 | This study, Francis et al., 2014 [ | |
| rem_arm |
| Armenia | 90 | 10 | This study, Momeni et al., 2021 [ | |
| cau_tur_geo |
| NE-Turkey & Georgia | 105 | 10 | This study, Momeni et al., 2021 [ | |
| A | rut_mlt |
| Malta | 100 | 10 | This study, Momeni et al., 2021 [ |
| TOTAL | 1347 | 139 | ||||
Figure 2Allele frequency correlation between pool- and ind-seq in each population.
Figure 3Genetic variation between and within populations. (A) Heatmap based on whole-genome pairwise average FST between each population pair calculated from pool-seq data. (B) Correlation of pairwise FST estimates between pool-seq and ind-seq data. (C) Expected heterozygosity for each population as estimated by pool and ind-seq. (D) Correlation between heterozygosities as estimated by pool-seq and ind-seq data. M-lineage: ibe_esp_eus = A. m. iberiensis from Spain; mel_irl = A. m. mellifera from Ireland; mel_rus = A. m. mellifera from Russia; C-lineage: car_aut_hun = A. m. carnica from Austria and Hungary; rod_bgr = A. m. rodopica from Bulgaria; carp_rou_mda = A. m. carpatica from Romania and Moldova; mac_mkd_grc = A. m. macedonica from North Macedonia a Northern Greece; cec_grc = A. m. cecropia from Greece; ada_grc = A. m. adami from Crete, Greece; O-lineage: cyp_cyp = A. m. cypria from Cyprus; ana_tur = A. m. anatoliaca from Turkey; rem_arm = A. m. remipes from Armenia; cau_tur_geo = A. m. caucasia from North-East Turkey and Georgia; and A-lineage: rut_mlt = A. m. ruttneri from Malta.
Figure 4Principal component analysis (PCA) of pool-seq data (left two panes) and ind-seq data (right panes). Upper panels show the first and second principal components explaining most of the variance, while the lower panels display the first and third components, which only account for 6.4% and 3.8% of the total variation, respectively. M-lineage: ibe_esp_eus = A. m. iberiensis from Spain; mel_irl = A. m. mellifera from Ireland; mel_rus = A. m. mellifera from Russia; C-lineage: car_aut_hun = A. m. carnica from Austria and Hungary; rod_bgr = A. m. rodopica from Bulgaria; carp_rou_mda = A. m. carpatica from Romania and Moldova; mac_mkd_grc = A. m. macedonica from North Macedonia a Northern Greece; cec_grc = A. m. cecropia from Greece; ada_grc = A. m. adami from Crete, Greece; O-lineage: cyp_cyp = A. m. cypria from Cyprus; ana_tur = A. m. anatoliaca from Turkey; rem_arm = A. m. remipes from Armenia; cau_tur_geo = A. m. caucasia from North-East Turkey and Georgia; and A-lineage: rut_mlt = A. m. ruttneri from Malta.
Figure 5Model-based ancestry as calculated with NGSAdmix for best (K = 3) and second-best (K = 6) number of K ancestral populations. Each individual is represented by a vertical bar and colored according to the proportion of the genome that was derived from one of K clusters. Samples are ordered according to evolutionary lineage and sampling population. M-lineage: ibe_esp_eus = A. m. iberiensis from Spain; mel_irl = A. m. mellifera from Ireland; mel_rus = A. m. mellifera from Russia; C-lineage: car_aut_hun = A. m. carnica from Austria and Hungary; rod_bgr = A. m. rodopica from Bulgaria; carp_rou_mda = A. m. carpatica from Romania and Moldova; mac_mkd_grc = A. m. macedonica from North Macedonia a Northern Greece; cec_grc = A. m. cecropia from Greece; ada_grc = A. m. adami from Crete, Greece; O-lineage: cyp_cyp = A. m. cypria from Cyprus; ana_tur = A. m. anatoliaca from Turkey; rem_arm = A. m. remipes from Armenia; cau_tur_geo = A. m. caucasia from North-East Turkey and Georgia; and A-lineage: rut_mlt = A. m. ruttneri from Malta.
Figure 6Cost ratio ind-seq/pool-seq. The x-axis represents the genome size or part of the genome sequenced (Gb) in case of reduced representation techniques. The y-axis is the ratio of the total cost of ind-seq/pool-seq (assuming 1400× total coverage for ind-seq and 700× total coverage for pool-seq).