| Literature DB >> 26159619 |
Zachary L Fuller1, Elina L Niño2, Harland M Patch3, Oscar C Bedoya-Reina4, Tracey Baumgarten5, Elliud Muli6, Fiona Mumoki7, Aakrosh Ratan8, John McGraw9, Maryann Frazier10, Daniel Masiga11, Stephen Schuster12, Christina M Grozinger13, Webb Miller14.
Abstract
BACKGROUND: With the development of inexpensive, high-throughput sequencing technologies, it has become feasible to examine questions related to population genetics and molecular evolution of non-model species in their ecological contexts on a genome-wide scale. Here, we employed a newly developed suite of integrated, web-based programs to examine population dynamics and signatures of selection across the genome using several well-established tests, including F ST, pN/pS, and McDonald-Kreitman. We applied these techniques to study populations of honey bees (Apis mellifera) in East Africa. In Kenya, there are several described A. mellifera subspecies, which are thought to be localized to distinct ecological regions.Entities:
Mesh:
Year: 2015 PMID: 26159619 PMCID: PMC4496815 DOI: 10.1186/s12864-015-1712-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Specimen location information
| Site | Geographic region | Site Name | Coordinates | Elevation | Individual ID | Köppen-Geiger classificationa | Parasites & Pathogensb |
|---|---|---|---|---|---|---|---|
| 1S | savannah | Nairobi, Kasarani | −1°13.3631, 36°53.7867 | 1602 m | 1.4.15 | Cfb-Aw | VD |
| 2S | savannah | Kitui, SEKU | −1°18.3005, 37°45.9075 | 1150 m | 2.2.15 | Aw | DWV |
| 3S | savannah | Malewa | −0°31.5024, 36°24.1969 | 1973 m | 4.2.15 | Csb | VD, DWV |
| 4S | savannah | Mt. Elgon, Chepkui | 0°49.543, 34°42.1740 | 1869 m | 21.3.15 | Am | VD, BQCV, DWV |
| 1C | coast | Gete Ruins | −3°18.3899, 40°1.0793 | 36 m | 12.2.15 | Aw | VD, BQCV, DWV, N. apis |
| 2C | coast | Oceanside | −3°20.2514, 39°59.1495 | 15 m | 13.4.15 | Aw | VD, DWV, ABPV, N. apis |
| 3C | coast | Tanzania Border | −4°31.7394, 39°9.2171 | 62 m | 15.4.25 | Aw | VD |
| 1D | desert | Mandera Town 1 | 3°56.2050, 41°52.0900 | 212 m | 16.1.5 | BWh | No Pathogens |
| 2D | desert | Mandera Town 2 | 3°56.1970, 41°52.0820 | 221 m | 17.1.5 | BWh | No Pathogens |
| 3D | desert | Mandera West | 3°53.3790, 40°16.0440 | 894 m | 18.1.5 | BSh | No Pathogens |
| 1 M | mountain | Mt. Elgon, Moorland | 0°57.2460, 34°36.2860 | 2956 m | 22.2.5 | Am | BQCV |
aAm-Tropical monsoon climate, Aw- Tropical savannah climate, Bsh- Subtropical steppe climate, Bwh- Subtropical desert climate, Cfb- Marine coastal climate, Csb- Mediterranean climate
bABPV- Acute bee paralysis virus, BQCV- Black queen cell virus, DWV- Deformed wing virus, N.apis- Nosema apis, VD-Varroa destructor
The site number, geographic region, site name, coordinates, elevation, and identification numbers for individual specimens used in this study with Köppen-Geiger classification (Peel et al. 2007) [106] and previous identified pathogens at each site (Muli, et al. 2014) [12]. The apiary, colony and individual identification (ID) numbers were assigned based on our initial survey scheme (Muli, et al. 2014) [12]
Fig. 1Geographic, phylogenetic, and principal component relationship of Kenyan honeybees. a) Geographic locations of sampled apiaries. Each individual is designated according to the ecological region in which it was sampled, with S = savannah, C = coast, D = desert, and M = mountains. Colors indicate Köppen-Geiger climate regions, see (Peel, et al. 2007) [106]. b) Neighbor-joining tree inferred from full mitochondrial sequences. The evolutionary distances were computed using the Maximum Composite Likelihood method and are in the units of the number of base substitutions per site. The tree is drawn to scale. c) Neighbor-joining tree inferred from full nuclear sequences. Numbers indicate the branch lengths
Descriptions of statistical genomic analyses used
| Test Number | Common name | Brief description | Region examined | Protein | Gene |
|---|---|---|---|---|---|
| 1 | pN/pS | nonsynonymous vs synonymous differences within Kenyan bees | Protein-coding | Venom allergen Api m 6 | GB45615 |
| 2 | — | nonsynonymous differences between Kenyan and European | Protein-coding | FMRFamide receptor | GB51916 |
| 3 | McDonald-Kreitman test | nonsynonymous differences between Kenyan bees and | Protein-coding | NADH dehydrogenase (ubiquinone) 1 beta subcomplex | GB51330 |
| 4 | ROH | runs of homozygosity | Genome-wide | FoxO | GB48301 |
| 5 |
| differences between two populations of Kenyan bees | Genome-wide | Neuroligin-3 | GB42603 |
| 6 | Tajima’s D | allele frequency spectrum | Genome-wide | RpA70, ZFYVE26 | GB44421 GB44416 |
Fig. 2Example of a workflow using Galaxy commands. The workflow depicts the commands and tools used to carry out the test for genes located in a “run of homozygosity” (ROH; see the section detailing Test 4). The workflows and command histories for each test are available at https://usegalaxy.org/r/kenyanbee
Fig. 3Predicted FMRFamide receptor structures in European a and Kenyan b bees. Five residues in the extracellular domains (gray) were found to differ at positions 33, 121, 219, 295, and 308 between these two populations. The configuration of the extracellular domains is altered despite the fact that the mutations are not drastic. The transmembrane regions (cyan) follow the homologues positions of D. meloganogaster studied by Merte and Nichols (2002) [107]. Red labels represent the position and amino acid of the structure altering mutation
Fig. 4Evidence for a selective sweep near Forkhead Box Protein O (Foxo) in Desert population. The average heterozygosity is shown for all individuals sequenced in the Savannah and Desert populations across a 1.5 Mb region on chromosome 7. The shaded area indicates the location of Foxo. A marked reduction in heterozygosity for the region surrounding Foxo is observed in the Desert population
Fig. 5F ST analysis of the chromosomal region containing the Neuroligin-3 gene. a) Plot of F ST (Reich-Patterson formulation) between Desert and Plains bees at 4,507 SNPs in a 200-kb interval on Chromosome 9 (Group9.10) that spans the first two exons of the Neuroligin-3 gene (GB42603; blue rectangles; transcribed right-to-left). Orange rectangles indicate the two exons of annotated gene GB42882, putatively transcribed on the opposite strand, though we find the evidence for existence of a functional gene unconvincing. Genome-wide, 10 % of the F ST values are larger than 0.20 — indicated with red. Light blue bars represent non-significant F values. b) The interval 3,143,154-3,143,747 has a statistically significant concentration of SNPs with F ST in the highest 10 %, meeting our criterion for putative positive selection by the F ST criterion
Fig. 6Selected chromosomal regions with signatures of positive selection according to Tajima’s D analysis. The top panel shows a section of chromosome 5. Tajima’s D was calculated in 5 kb windows and scores are represented by vertical bars. Any bar colored red represents a window where Tajima’s D is significantly negative. Conversely, bars colored light blue represent windows where Tajima’s D is not significantly different from values expected under a site evolving neutrally. The bottom panel is a magnification of the area containing a ~20 kb stretch of significant Tajima’s D scores. Green blocks represent locations of genes contained within the interval