Literature DB >> 35741769

Population Structure and Genetic Diversity of Chinese Honeybee (Apis Cerana Cerana) in Central China.

Fang Fang1,2, Xiasang Chen3, Jie Lv1,2, Xinyan Shi1,2, Xiaojuan Feng1,2, Zhen Wang1,2, Xiang Li1,2.   

Abstract

Central China has a rich terrain with a temperate monsoon climate and varied natural environments for the Chinese honeybee (Apis cerana cerana). However, little comprehensive research on population genetic diversity has been done in this area. A population survey of the structure and genetic diversity of Apis cerana cerana in this area is deeply needed for understanding adaptation to variable environments and providing more references for the protection of honeybee biodiversity. In this study, we present a dataset of 72 populations of Chinese honeybees collected from nine sites by whole genome sequencing in Central China. We obtained 2,790,214,878 clean reads with an average covering a depth of 22×. A total of 27,361,052 single nucleotide polymorphisms (SNPs) were obtained by mapping to the reference genome with an average mapping rate of 93.03%. Genetic evolution analysis was presented via the population structure and genetic diversity based on the datasets of SNPs. It showed that Apis cerana cerana in plains exhibited higher genetic diversity than in mountain areas. The mantel test between Apis cerana cerana groups revealed that some physical obstacles, especially the overurbanization of the plains, contributed to the differentiation. This study is conducive to elucidating the evolution of Apis cerana in different environments and provides a theoretical basis for investigating and protecting the Chinese honeybee.

Entities:  

Keywords:  Apis cerana cerana; environments; genetic diversity; population structure

Mesh:

Year:  2022        PMID: 35741769      PMCID: PMC9222672          DOI: 10.3390/genes13061007

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.141


1. Introduction

Adaptation to a changing environment is a fundamental problem in evolutionary biology. Understanding the impact of the environment on biological genetic diversity can not only clarify the course of evolution but also provide information on how to protect the ecological environment [1]. Therefore, taking knowledge of genetic resources in a specific natural area will provide references for the strategy design of biodiversity conservation and balancing ecology and economy. Population structure and genetic diversity are the two main factors leading to ecological and evolutionary dynamics flourishing in the natural environment [2]. Experiment evidence showed that characterization of structural variation has a significant influence on the evolutionary processes of Tibetans’ adaptation to the Qinghai-Tibet Plateau [3]. Except for humans, genetic population structure was also proven to have a regulatory effect on the adaptive evolution of fishes, like sticklebacks [4,5], or other reproducing individuals, including cells, viruses, multicellular organisms and so on [6]. In addition, it has been demonstrated that genetic diversity maintenance is crucial for all species, not just a few domestic species like cultivated plants or domesticated animals [7]. Genetic diversity of the wild animals, insects, and any other special economic animals or crops are also worthy of attention for ensuring all aspects of biodiversity conservation. Some parameters such as expected heterozygous number (He), observed heterozygous number (Ho), nucleic acid diversity (Pi), and polymorphism information content (PIC) can be used to indicate the level of an animal’s genetic diversity efficiency and subdivision of population [8]. Chinese honeybee (Apis cerana cerana) is a subspecies and special honeybee resource of Apis cerana that evolved over time for long-term adaption to the natural ecological environment in China [9,10]. It mainly nests in distinct habitats of complex topography, divergent climate, and varied flora, resulting in high resistance to changes in the external environment [11,12]. As one of the local advantage species and pollinators with a wide distribution in China [13], Apis cerana cerana is a vital component of natural ecosystems that pollinates numerous flowering plants and agricultural crops, providing ecological and financial support to residents [14,15]. Abundant scientific evidence shows that the functional genes of honeybees have excellent adaptability in coping with external environmental changes, ecological diversity and economic profits [16,17,18]. Previous studies on Apis cerana cerana have classified it into nine ecological types based on morphometric measurement indicators [19], including Hainan, the Yunnan-Guizhou Plateau, Tibet, Aba, Changbai Mountain, southern Yunnan, North China, South China, and Central China [20]. As more investigators began to study the genetic diversity of Apis cerana cerana, our understanding of Apis cerana cerana continued to improve until 2015, when the reference genome of Apis cerana was published [21]. Whole genome sequencing of a single population can provide a detailed characterization of genetic variation and may clarify the evolutionary development of the population [22]. A comprehensive genomic study of Apis cerana cerana in most parts of China indicated the simple hierarchical division. It proposed that physical barriers (such as islands and mountains) were the main obstacles leading to gene exchange, thereby leading to population differentiation [15]. Other studies have reported that Apis cerana cerana, even in specific regions, can be highly variable [23,24,25], suggesting that populations from limited geographic areas still have large genetic differences. Hubei Province is located in Central China, in the middle reaches of the Yangtze River, with a rich terrain and belongs to the temperate monsoon climate. The geographical environment of the area is hilly and mountainous, with elevations ranging from 200 m to 3100 m, and there are more than 150 kinds of nectar source plants, creating excellent regional natural conditions for Apis cerana cerana [26,27]. Therefore, surveying the population structure and genetic diversity of Apis cerana cerana in this area is deeply needed to understand its adaptation to variable environments and provide more references for protecting honeybee biodiversity in Central China.

2. Materials and Methods

2.1. Sample Collection

As experimental materials for this study, 72 worker bees (Apis cerana cerana) were obtained from nine different geographic mountain areas in Central China (Table 1 and Figure 1). Sampled honeybees were divided into nine groups of 4 to 10 colonies by geographic names (Table 1). All honeybee samples were stored in a −80 °C refrigerator for further DNA extraction and sequencing.
Table 1

Sample information of the Apis cerana cerana from nine Central China regions.

Sampling PositionColony NumberRearing ModeLongitude(E)Latitude(N)Altitude(m)
WF9barrel breeding110°37′41.61″30°12′47.39″1833
BD10crate breeding110°19′20.35″30°41′48.58″1208
LZ9barrel breeding108°48′49.95″30°19′29.23″1192
SQ9barrel breeding108°56′31.13″30°4′1.62″798
SNJ8barrel breeding110°21′55.43″31°26′1.29″ 1464
YS4wall hole breeding115°58′23.89″30°59′36.18″ 399
SY9crate breeding111°6′15.09″ 32°57′30.09″ 1292
TS5barrel breeding114°19′34.19″29°34′43.23″ 110
SZ9barrel breeding113°24′24.08″31°26′20.08″ 108
Figure 1

Specific geographic locations of the sampled honeybees in China, where the yellow area represents the sampling site of the representative area of Central China (Hubei Province). Sampling positions in Hubei Province were highlighted in a colorful magnified area map (right). Base layer map information comes from Resource and Environment Science and Data Center (page: https://www.resdc.cn/data.aspx?DATAID=200 (accessed on 1 January 2015)).

2.2. DNA Extraction

For each bee colony, one worker bee was randomly selected, and total genomic DNA was extracted from its thorax tissue using the Universal Genome DNA Kit (Cat.No.CW0553, ComWin Biotech, Beijing, China) according to the manufacturer’s protocol. After extraction, DNA was treated with RNase A (Cat.No.CW0601S, ComWin Biotech, Beijing, China). The quality and total amount of DNA were measured using Nanodrop 2000 (Thermo Scientific, Wilmington, DE, USA).

2.3. Whole Genome Sequencing

The whole genome sequencing of qualified sample genomic DNA was performed on the Illumina Hi-Seq 4000 platform provided by Biomarker Technologies Corporation, Beijing. The DNA was primarily segmented by mechanical interruption (ultrasonic purified). Then the fragment size was selected by agarose gel electrophoresis after the end repairing, dA-tailing and adapter ligation. Eventually, PCR amplification was performed to obtain a 350 bp insert size paired-end library. The established library was first purified and qualified, and finally, the qualified library was sequenced.

2.4. Quality Control of Sequencing Reads

The quality of raw reads was validated using FastQC (version 1.11.4) [28]. Adapter sequences, primers, poly-A tails, and low-quality reads were removed with cutadapt [29]. Low-quality data filtering included removing reads with adapter, reads containing more than 10% of N, and reads containing more than 50% bases with a quality value of less than 10 to obtain clean reads for subsequent bioinformatic analysis.

2.5. Mapping and Variation Detection

Clean reads were mapped to the reference Apis cerana genome (Apis_cerana.ACSNU2.0) with BWA (version 0.7.17) [30]. We statistically processed the information on sequencing depth and genome coverage of each sample for variation detection. Using the Haplotype Caller of GATK (version 4.2.0) [31] to detect the variation of SNP, the annotation of SNP was realized by SnpEff (version 5.1) [32].

2.6. Genetic Diversity Analysis

In practical scientific research, when it comes to average correlations between individuals in large genomic datasets, population structure results in systematic patterns visualized using dimension reduction techniques like principal component analysis (PCA) [33], or bayesian clustering methods like STRUCTURE [34,35,36]. PCA, phylogenetic tree, structure and genetic diversity analysis were carried out using vcftools [37] to convert vcf into plink format. PCA analysis based on the allele frequency distribution among 72 populations was performed to assess the significance of each principal component. PC1 and PC2 were plotted using the ggplot2 package [38] in R 3.6.1 software. To establish the evolutionary relationship among honeybees sampled from nine sites, a neighbor-joining phylogenetic tree was constructed to represent the genetic distances among the population. To accurately identify the ancestral components of the 72 individuals from nine sampling sites, population structure was constructed by a Bayesian clustering program using admixture software [39] to estimate the ancestral composition of each individual with genome-wide unlinked sites. The value of K = 2 to 5 (two–five ancestors) was selected using the admixture model and visualized using R 3.6.1.

2.7. Correlations between Environmental Variables and Genetic Diversity

The Mantel test was performed using the Fst matrix and distance matrix with the ade4 software package [40] in R3.6.1. To describe the possible changes in the correlation between the genetic distance and geographical distance, we used PASSaGE (version 2) [41], for Manchester correlation analysis according to the distance. The rank divided the distance matrix into submatrices. The populations within the geographical boundary distance described by each submatrix corresponded to populations with different genetic distances. The average Fst of the set “distance level” was calculated separately to generate a polyline “interference graph” that was combined with the Manchester correlation map for a more intuitive view.

3. Results

3.1. Genome Sequencing

A total of 72 individual Apis cerana cerana from nine geographic units (Hubei Province), including WF, SNJ, YS, LZ, SQ, SY, TS, SZ, and BD (for each, n = 4–9), were sequenced. Raw reads of the sequencing data were first filtered to remove the adapter and low-quality sequences, filtered reads with more than a 10% N content, or more than 50% low-quality bases (quality value < 10). After quality control, 2,790,214,878 clean reads in total were generated (Table 2). The average percentage of the sequencing data that yielded a Phred quality score of 30 (Q score > 30) is 91.81% (Table 2). It indicates the probability that a given base is incorrectly called by the sequencer. Subsequently, 150 bp insert size of paired-end reads of clean reads were mapped to the reference genome of Apis cerana (Apis_cerana.ACSNU2.0), resulting in 2,7361,052 SNP numbers totally with 22× average coverage depth and an average mapping rate of 93.03% (Table 2).
Table 2

The summary of whole genome resequencing and read mapping results.

Sampling LocationSample IDClean ReadsQ30 (%)Mapped (%)Ave-DepthSNP-NumberAccession Number
SYSY145,364,08291.4893.3725378,329SAMN23443044
SY235,194,09093.2092.8520381,604SAMN23443045
SY339,560,53892.2994.0123383,098SAMN23443046
SY443,425,58891.0092.9925379,006SAMN23443047
SY539,156,84890.5791.8622377,414SAMN23443048
SY637,591,65490.8091.7621374,457SAMN23443049
SY737,202,59891.6393.5222375,448SAMN23443050
SY835,763,89691.7393.5821377,506SAMN23443051
SY940,626,32090.9392.6423376,364SAMN23443052
TSTS149,050,42092.3893.3729383,607SAMN23443053
TS242,171,75493.0393.0824384,279SAMN23443054
TS343,492,76292.8694.2825376,482SAMN23443055
TS444,082,93290.8791.9124384,675SAMN23443056
TS541,638,82492.9492.8524381,053SAMN23443057
BDBD135,060,54892.6491.9220378,334SAMN23443058
BD240,674,00492.4592.0423383,033SAMN23443059
BD343,630,33892.7392.3025383,190SAMN23443060
BD434,968,12091.5492.1419375,924SAMN23443061
BD539,744,32485.9990.6222376,272SAMN23443062
BD636,715,40489.9691.5720376,435SAMN23443063
BD735,171,31292.8692.6120382,123SAMN23443064
BD836,710,57492.0093.5220377,951SAMN23443065
BD935,435,81691.2092.7320376,754SAMN23443066
BD1035,898,37691.1593.7020382,417SAMN23443067
SZSZ134,454,83487.2890.8019375,325SAMN23443068
SZ237,622,31691.0193.1622383,291SAMN23443069
SZ341,510,88092.4892.7924381,838SAMN23443070
SZ434,251,78292.2692.5820379,841SAMN23443071
SZ533,690,25089.1993.620374,548SAMN23443072
SZ639,078,40690.9293.0223375,130SAMN23443073
SZ743,829,00291.2592.3124378,748SAMN23443074
SZ837,989,12691.5892.8421377,272SAMN23443075
SZ933,996,27090.7792.9220376,068SAMN23443076
SQSQ137,304,92692.5792.2121381,208SAMN23443077
SQ238,685,75893.0393.6922381,085SAMN23443078
SQ334,725,07692.2594.0821380,394SAMN23443079
SQ434,891,04292.7793.4120380,657SAMN23443080
SQ533,717,04892.9393.0819380,598SAMN23443081
SQ634,039,49293.1594.1119381,039SAMN23443082
SQ734,107,92690.9091.7719376,245SAMN23443083
SQ833,680,29291.4592.7418372,788SAMN23443084
SQ936,811,50691.6093.4821379,179SAMN23443085
LZLZ145,157,87490.9293.4226381,808SAMN23443086
LZ238,943,45891.0394.0023375,544SAMN23443087
LZ334,314,03291.6590.4219377,757SAMN23443088
LZ440,019,91092.6893.6022384,578SAMN23443089
LZ537,750,66490.8693.8421380,767SAMN23443090
LZ634,197,77893.3193.7519379,718SAMN23443091
LZ735,764,69690.7294.6721383,465SAMN23443092
LZ843,571,55492.8194.3626385383SAMN23443093
LZ941,338,22090.6992.7224383,021SAMN23443094
WFWF134,728,33290.2393.4021384,993SAMN23443095
WF235,853,81891.4992.2921377,900SAMN23443096
WF34536872692.9894.1427384,826SAMN23443097
WF439,606,16492.6092.8623383,232SAMN23443098
WF537,016,13692.7394.0922382,531SAMN23443099
WF63759140692.6293.6122384,982SAMN23443100
WF735,029,44493.1093.9621384,174SAMN23443101
WF847,641,08292.6893.1128385,814SAMN23443102
WF937,455,55092.9693.8322382,697SAMN23443103
YSYS134,459,11092.3193.2720379,546SAMN23443104
YS235,456,23292.7694.2121381,627SAMN23443105
YS333,847,56491.7692.5722375,780SAMN23443106
YS458,232,87093.0392.1932386,355SAMN23443107
SNJSNJ139,745,49692.8393.6724381,098SAMN23443108
SNJ257,018,61492.9393.6334384,589SAMN23443109
SNJ333,734,50293.2593.2919379,347SAMN23443110
SNJ440,561,77691.0591.8523377,471SAMN23443111
SNJ542,923,91493.4394.1225382,094SAMN23443112
SNJ635,246,25691.8594.3021375,640SAMN23443113
SNJ737,636,43691.4492.8022374,102SAMN23443114
SNJ837,286,21091.7392.6421379,204SAMN23443115
Average-38,752,98491.8193.0322
Total722,790,214,878 --27,361,052

Note: Description of the column titles contained in the dataset are as follows: Sampling location: Where the samples were collected from Central China. Sample ID: Numbers of the sample population. Clean reads: Number of reads after filtering. Mapped: The percentage of clean reads mapped to the reference genome. Ave-depth: Average coverage depth of sequencing data. Q30: The probability of a given base being called incorrectly is 99.9%. Or we can also think of it as one base out of 1000 is wrong. SNP-number: Number of SNPs.

3.2. Population Structure Analysis

Cluster analysis showed that when the K value was 2, Apis cerana cerana in SZ and YS regions were divided into one class. The other regions formed a large group with clear boundaries between regions. When the K value was 3, SZ and YS still formed one group, while the large cluster was split. SNJ formed a cluster. Other regions formed their unique pedigree composition with SNJ, SZ and YS. When the K value was 4, SNJ and WF formed a cluster, TS and YS formed a cluster and SZ formed a cluster, respectively. The other regions (SY, SQ, LZ, BD) formed their unique pedigree composition with these three lineages. The BD and the other three regions showed different degrees of pedigree, and part of the BD area showed a special lineage with clear differences in the area. The result of the PCA scatter plot of principal factor 1 (variation 2.51%) and principal factor 2 (variation 2.18%) further supported the patterns (Figure 2B). When the K value was 5, TS, YS and SZ still formed two clusters, respectively, while SNJ was split into two areas, and the differentiation of SY, SQ, LZ, BD and WF was further emphasized, split into three areas, showing a highly mixed state.
Figure 2

Analysis of the population structure from nine geographic locations. (A): Population structure of 72 individuals from different locations. (B): Principal component analysis of 72 individuals from different locations. (C): Neighbor-joining tree of 72 individuals.

These results were further supported by the phylogenetic tree of the SNP data (Figure 2C). SNJ did not split as a whole and was clustered with SY. TS, YS, and SZ were close. LZ and SQ were clustered into one class. SNJ, SY, BD, TS, YS, SZ, and WF were clustered in a large cluster. These results are consistent with the results of STRUCTURE at K values ranging from 3 to 5. After a comprehensive comparison, we considered that K = 4 was relatively reasonable, and all regions were divided into 5 genetic components; blue region (SNJ, WF), gray region (BD), brown region (SZ), yellow region (YS, TS), and a mixed area of blue, gray, brown and yellow (SY, SQ, LZ). According to the environmental characteristics of the bee sampling sites (Table 1), we divided the bees collected from the measurement areas into high-altitude type (SNJ, WF), middle-altitude type (BD, SY, SQ, LZ), and low-altitude type (SZ, YS, TS), consistent with the results of STRUCTURE.

3.3. Genetic Diversity Analysis

Genetic diversity analysis was performed by calculating MAF, He, Ho and PIC (Table 3). The fixation index (Fst) between populations was calculated to quantify the genetic differentiation (Table 4). The Fst ranged from 0.00706 (LZ, SQ) to 0.05165 (SNJ, YS), with an overall mean of 0.02470, indicating low genetic differentiation. Compared with Apis cerana cerana in North China (average Fst = 0.132) [13], Apis cerana cerana in Central China was less differentiated at the level of genetic differentiation. The Fst difference between the different regions of the honeybee was small, showing low species diversity at the ecotype level, while the values of Fst were consistent with the overall heterogeneity of the population structure. Apis cerana cerana in the central low-altitude regions (SZ, TS, YS) had the highest Fst distribution (average Fst = 0.029965). The average values of Fst in the central high-altitude region and middle-latitude region are 0.02456 and 0.01101, respectively. The Fst 0.051650 between SNJ (high-altitude region) and YS (low-altitude) was moderately differentiated.
Table 3

Genetic diversity of the Apis cerana cerana from nine Central China regions.

BDLZSNJSQSYSZTSWFYS
MAF 0.19740.19960.21100.19830.19870.20430.23390.20070.2543
He0.28280.28550.29880.28400.28460.29090.32570.28700.3477
Ho0.30110.30490.32300.30030.30250.30940.36800.30850.4126
PIC0.23340.23540.24520.23430.23490.23930.26540.23650.2812
Pi0.29620.30070.31750.29910.29980.30640.36140.30230.3720

Note: He: expected heterozygous number; Ho: observed heterozygous number; Pi: nucleic acid diversity; PIC: polymorphism information content.

Table 4

Fst value of the Apis cerana cerana from nine Central China regions.

Sampling PositionSYTSBDSZSQLZWFYSSNJ
SY00.0240020.0124180.0184600.0090560.0110010.0126410.0378600.018846
TS0.02400200.0275040.0271320.0217960.0233190.0252010.0252010.036935
BD0.0124180.02750400.0271890.0120090.0144910.0153110.0433290.023939
SZ0.0184600.0271320.02718900.0235360.0256410.0259870.0375620.034366
SQ0.0090560.0217960.0120090.02353600.0070600.0112140.0388030.021860
LZ0.0110010.0233190.0144910.0256410.00706000.0137820.0401890.023538
WF0.0126410.0252010.0153110.0259870.0112140.01378200.0416810.02456
YS0.0378620.0252010.0433290.03756210.0388030.0401890.04168100.051650
SNJ0.0188460.0369350.0239390.0343660.0218600.0235380.0245600.0516500
The region of Central China (SNJ, WF, BD, SY, SQ, LZ, SZ, YS, TS) showed variable differentiation states, with most being in a low differentiation state, though SNJ and YS were in a moderately differentiated state. Although the genetic differentiation of Apis cerana cerana in low-altitude regions (SZ, TS, YS) was higher than in other regions, the Fst average value (0.024696) is lower than 0.05, indicating that the genetic composition of Apis cerana cerana in these regions was similar, and the degree of differentiation was low. The PIC, Ne, Pi, Ho, and He of each SNP locus in the population were separately calculated (Table 3). PIC measured the degree of variation of the population DNA (the overall PIC average was 0.2451). Among these regions, the BD region has the lowest PIC at 0.2334, whereas the average PIC of the YS region was the highest at 0.2812. The populations in lower altitude regions had higher genetic polymorphisms (SZ, TS, YS) with an average PIC of 0.2620, which is consistent with the distribution of Pi. Nucleotide diversity is an indicator of diversity within or between populations. The average Pi values of TS and YS were 0.3614 and 0.3720, respectively, higher than the average (the average Pi = 0.3176). Furthermore, the low-altitude region (TS, SZ, YS) surpassed the mean polymorphic information (Pi = 0.3477 > 0.3176), and there was high differentiation (mean Fst = 0.02456) and high genetic variation (high polymorphism, PIC = 0.2620 > 0.2451). Apis cerana cerana in the lower altitude regions have more biodiversity and heterozygosity than in other areas. Nevertheless, there were differences among heterozygosity (He), variance (PIC), and diversity (Pi) in YS, SZ, and TS, among which YS had higher heterozygosity, variation, and diversity, and while YS and TS belonged to one cluster, SZ belonged to another and had lower heterozygosity, variation, and diversity. A comparison of the STRUCTURE map with a K value of 4 (Figure 2A) revealed that the low-altitude area was divided into another cluster. In addition, we calculated all sampled groups with an average Ho of 0.3368, which was higher than the He of 0.2865. The low-altitude region (YS) has the highest Ho, He, and Pic (0.4126, 0.3477 and 0.2812, respectively), indicating that the population heterozygosity was high, which could be affected by the combined effects of selection, mutation, and genetic drift.

3.4. Mantel Test between Apis Cerana Cerana Groups

The correlation between a single or a group of environmental variable factors (such as latitude, longitude, altitude, geographical distance, etc.) was tested. The significant correlation between matrices was analyzed by p-value < 0.01 in this study. Mantel test was performed on genetic distance (Fst), environmental factors (Table 5), and geographical distance (Table 6). The results showed that among the seven environmental factors, the genetic distance between populations in nine different geographical regions was positively correlated with the geographical distance, longitude and altitude (p = 0.0038 < 0.01, p = 0.0015 < 0.01, p = 0.0197 < 0.05), but had no significant correlation with other environmental variables (Table 7). The correlation degree between physical distance and genetic distance was r = 0.54243. The correlation degree between altitude and genetic distance was r = 0.32431. These results indicate that geographical distance and location had a significant correlation with the genetic differentiation of Apis cerana cerana populations in Central China.
Table 5

Information of environmental factors of nine Central China regions.

Sampling PositionAltitude (m)Annual Minimum Temperature (°C)Annual Maximum Temperature (°C)Annual Preipitation (mm)Longitude (E)Longitude (N)
SY2921021.01769.6111.10432.958
TS1101423.011500114.32629.579
BD120814231000110.32230.697
SZ1081218967.5113.40731.439
SQ79811181450108.94230.067
LZ119211181450108.81430.325
WF183311211400110.62830.213
YS39913231403115.97330.993
SNJ14649191170.2110.36531.441
Table 6

Geographic distance between nine Central China regions (unit: km).

Sampling PositionSYTSBDSZSQLZWFYSSNJ
SY0484.269 261.475 274.807 380.713 363.952 307.734 497.882 181.129
TS484.269 0405.175 224.318 523.178 538.449 364.013 198.809 427.997
BD261.475 405.175 0305.630 149.914 150.551 61.141 521.081 82.178
SZ274.807 224.318 305.630 0453.694 456.169 298.541 233.457 284.364
SQ380.713 523.178 149.914 453.694 031.114 163.273 660.899 207.017
LZ363.952 538.449 150.551 456.169 31.114 0175.022 669.051 196.382
WF307.734 364.013 61.141 298.541 163.273 175.022 0497.988 136.839
YS497.882 198.809 521.081 233.457 660.899 669.051 497.988 0513.726
SNJ181.129 427.997 82.178 284.364 207.017 196.382 136.839 513.726 0
Table 7

p-value calculated by Mantel test of Fst matrix of the Chinese honeybee populations and environmental factors.

Altitude (m)Annual Minimum Temperature (°C)Annual Maximum Temperature (°C)Annual Precipitation (mm)Longitude (E)Longitude (N)
p-value0.019780.27300.13530.73590.00150.07572

4. Discussion

The evolution of species and the generation of diversity are mostly due to environmental changes. Central China has a rich terrain and belongs to the temperate monsoon climate. The geographical environment of this area is plainly and mountainous, with elevations ranging from 200 m to 3100 m, creating a rich source of nectar source plants and excellent regional natural conditions for Chinese honeybee [26,27]. To understand the relationships between genetic differentiation and the environment, this study collected Apis cerana cerana from nine regions in Central China and analyzed the population structure and genetic diversity. To obtain as much information as possible about local Apis cerana cerana, we included as many collection points as possible in one collection area. The samples were mainly collected from traditional beehives, including crate hives, barrel beehives, and wall hole hives. The domestication process of Apis cerana cerana is rather slow, and domestic bees and wild bees often undergo mutual transformation [42]. Therefore, in this study, we believe that different beehive types have a negligible effect on bee society. The traditional method used 40 morphological indicators to classify Apis cerana [13]. These morphological indicators could be classified into several categories, such as color, angle of the right forewing, and body length, to obtain more comprehensive classification results [43]. As the reference genome of Apis cerana was published [21], more investigators began to study the genetic diversity of Apis cerana [13,44,45]. To analyze the classification, population structure analysis was performed to calculate the degree of mixed blood among individuals if gene exchange has occurred in subpopulations (Figure 2A). When the K value was 4, the STRUCTURE map revealed that SNJ and WF formed a cluster, TS and YS formed a cluster and SZ formed a cluster, respectively. The other region (SY, SQ, LZ, BD) formed its unique pedigree composition with these three lineages. These results were further supported by the PCA analysis and phylogenetic tree (Figure 2B,C), which inferred the differentiation relationships of populations in a low-altitude area separated from populations in high- and low-altitude areas. Genetic variation is critical for the survival of bees in changing climates. For bees, high genetic diversity at the population level can increase the adaptability of colonies, as bee colonies with higher diversity promote stability and homeostasis [46,47]. To obtain the genetic diversity, we calculated the values of PIC, He, Ho, Pi, and the MAF (Table 3), which can reflect the degree of genetic variation within populations. The Fst value can reveal the genetic differentiation distance between populations (Table 4), which is divided into three levels of genetic differentiation: moderate differentiation (0.05 < Fst < 0.15), larger differentiation (0.15 < Fst < 0.25) and high differentiation (Fst > 0.25). A previous study calculated Apis cerana cerana at the subspecies level between different ecological-type populations, for which the average Fst differentiation was 0.162, resulting in high differentiation [15]. In this study, we observed an overall mean Fst value of 0.02470, indicating low genetic differentiation. Compared with Apis cerana cerana in a specific region (North China) (average Fst = 0.132) [13], Apis cerana cerana in Central China was more lowly differentiated at the level of genetic differentiation. It may be because the geographical distance between populations was limited, and there are no strong physical barriers in this area. There was, however, a special observation that the average Fst between YS and SNJ was 0.05165 (0.05 < Fst < 0.15), reaching the medium differentiation level. We also noticed that the genetic differentiation of Apis cerana cerana in low-altitude regions (SZ, TS, YS) was higher than that in other regions. Overall, we speculate that long geographical distances enlarge the genetic differentiation, and the plains are more conducive to gene flow. Previous studies have suggested that honeybees migrate rapidly to areas without strong physical barriers, which can promote gene flow and reduce genetic differentiation [48]. The rapid spread of the African honeybees in European countries can better illustrate the potential for the rapid migration of honeybees [49]. To understand the effect of environmental factors on genetic distance, we calculated the Fst and environmental variable factor of Apis cerana cerana in Central China. The correlation between the genetic distance and environmental factors calculated by the Fst is one of the most common methods for assessing the spatial communication of the population structure [50]. Our results showed that geographical distance, altitude, and longitude significantly correlated with the genetic differentiation of Apis cerana cerana populations in Central China. We inferred that the mixed terrain of plain and mountain could cause the differentiation. Previous studies have suggested that islands and mountains can form physical barriers that, in turn, cause group isolation [15]. In China, various human activities, such as the loss of agricultural land and urbanization, led to the loss of the habitat of Apis cerana cerana, especially in the vast plains of China. However, complex terrains are more resistant to human activities, which could be the reason for the medium differentiation of Apis cerana cerana between SNJ and YS. Therefore, we think that the isolation of Apis cerana cerana by human urban agglomerations is as important as the physical isolation caused by mountains. In summary, as a wide-range species, Apis cerana shows good potential concerning climate change. Still, stresses such as pesticides, pollutants, pathogens, parasites, and limited flower resources caused by human activities threaten the survival of Apis cerana in China [51,52]. This study provides a theoretical basis for investigating and protecting Chinese honeybees by comparing the population structure and genetic diversity of Apis cerana cerana from nine geographical regions in Central China.

5. Conclusions

This study used genomic data to analyze a wide-range Apis cerana cerana species, and provided insights into the differentiation and diversity of the species in Central China. Apis cerana cerana in plains exhibited high genetic diversity, and physical barriers (altitude) and distance were the important factors for the differentiation. Human activities, such as urbanization, also greatly impact differentiation. The results of this study contribute to elucidating the evolution of Apis cerana in different environments and promote our understanding of how to protect honeybees from current and future challenges.
  37 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.

Authors:  Pablo Cingolani; Adrian Platts; Le Lily Wang; Melissa Coon; Tung Nguyen; Luan Wang; Susan J Land; Xiangyi Lu; Douglas M Ruden
Journal:  Fly (Austin)       Date:  2012 Apr-Jun       Impact factor: 2.160

Review 3.  Why Bees Are So Vulnerable to Environmental Stressors.

Authors:  Simon Klein; Amélie Cabirol; Jean-Marc Devaud; Andrew B Barron; Mathieu Lihoreau
Journal:  Trends Ecol Evol       Date:  2017-01-19       Impact factor: 17.712

4.  Genetic population structure constrains local adaptation in sticklebacks.

Authors:  Petri Kemppainen; Zitong Li; Pasi Rastas; Ari Löytynoja; Bohao Fang; Jing Yang; Baocheng Guo; Takahito Shikano; Juha Merilä
Journal:  Mol Ecol       Date:  2021-02-10       Impact factor: 6.185

5.  Genetic diversity of the honeybee Apis cerana in Yunnan, China, based on mitochondrial DNA.

Authors:  L Yin; T Ji
Journal:  Genet Mol Res       Date:  2013-06-20

6.  Honey bee nest thermoregulation: diversity promotes stability.

Authors:  Julia C Jones; Mary R Myerscough; Sonia Graham; Benjamin P Oldroyd
Journal:  Science       Date:  2004-06-24       Impact factor: 47.728

Review 7.  Mantel test in population genetics.

Authors:  José Alexandre F Diniz-Filho; Thannya N Soares; Jacqueline S Lima; Ricardo Dobrovolski; Victor Lemes Landeiro; Mariana Pires de Campos Telles; Thiago F Rangel; Luis Mauricio Bini
Journal:  Genet Mol Biol       Date:  2013-11-08       Impact factor: 1.771

Review 8.  Evolutionary dynamics in structured populations.

Authors:  Martin A Nowak; Corina E Tarnita; Tibor Antal
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-01-12       Impact factor: 6.237

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

10.  An overview of STRUCTURE: applications, parameter settings, and supporting software.

Authors:  Liliana Porras-Hurtado; Yarimar Ruiz; Carla Santos; Christopher Phillips; Angel Carracedo; Maria V Lareu
Journal:  Front Genet       Date:  2013-05-29       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.