| Literature DB >> 34599331 |
Dan Lin1, Lan Lan2, Tingting Zheng1, Peng Shi2, Jinshan Xu2, Jun Li1,3.
Abstract
The Himalayan giant honeybee, Apis laboriosa, is the largest individual honeybee with major ecological and economic importance in high-latitude environments. However, our understanding of its environmental adaptations is circumscribed by the paucity of genomic data for this species. Here, we provide a draft genome of wild A. laboriosa, along with a comparison to its closely related species, Apis dorsata. The draft genome of A. laboriosa based on the de novo assembly is 226.1 Mbp in length with a scaffold N50 size of 3.34 Mbp, a GC content of 32.2%, a repeat content of 6.86%, and a gene family number of 8,404. Comparative genomics analysis revealed that the genes in A. laboriosa genome have undergone stronger positive selection (2.5 times more genes) and more recent duplication/loss events (6.1 times more events) than those in the A. dorsata genome. Our study implies the potential molecular mechanisms underlying the high-altitude adaptation of A. laboriosa and will catalyze future comparative studies to understand the environmental adaptation of modern honeybees.Entities:
Keywords: zzm321990 Apis laboriosazzm321990 ; comparative genomics; evolution; honeybee; whole genome sequencing
Mesh:
Year: 2021 PMID: 34599331 PMCID: PMC8536543 DOI: 10.1093/gbe/evab227
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.The summary for genome assembly, GC, genes, gene families, repeats, homologous groups, nucleotide divergency and phylogeny relationship of Apis laboriosa and Apis dorsata genome. (A) Summary statistics of the genome assembly, GC%, completeness and gene prediction of two species genomes; (B) Bar charts describing the repeat elements of two genomes. ** indicates a statistical significance (Chi-squared test, P < 0.05) between two genomes; (C) Venn diagram describing the number of gene families and genes in two genomes; (D) Bar charts and pie charts describing homologous groups of two genomes in different categories. ** indicates a statistical significance (Chi-squared test, P < 0.05) between two genomes; (E) Summary statistics (mean ± standard deviation/SD) of genetic distances between single-copy genes from Apis laborisa genome and Apis dorsata genome. The distances were calculated using R package ‘ape’; (F) Phylogenetic tree of the nine Hymenopterans species. The tree was reconstructed using maximum-likelihood method under the GTR model based on the concatenated alignments of the single-copy genes and the dating time with 95% Confidence Interval was assessed using Markov chain Monte Carlo analysis under the GTR model based on the concatenated alignments of fourfold degenerate sites from single-copy genes. The bootstrap values of all nodes were 100%. Apis laboriosa was indicated by pink label, and Apis dorsata was denoted by blue label. Branch length represented time of divergence in million years.
Fig. 2.Gene family evolution based on nucleotide substitutions model and recent duplication/loss model. (A) The quality control procedure to achieve more accurate estimation of positive selected genes; (B) The frequency distribution of dS in 6,718 single-copy gene families, derived from the results of branch model in PAML. The sub-table showed the mean and standard deviation of dS, the number and percent of single-copied genes with dS equal or larger than specific thresholds (0.05, 0.1, 0.2 and 1); (C) Total number of positively selected genes in 6,718 single-copy gene families, derived from the results of branch-site model in BUSTED (Chi-squared test, P < 0.05); (D) Enriched functional categories and the frequency of positively selected genes derived from the results of branch-site model in BUSTED (Chi-squared test, adjusted P < 0.1); (E) The overall summary (number and percent) of the DL events in two species. ** indicates that there is statistical significance (Chi-squared test, P < 0.05) between two genomes; (F) Frequency of the positive selection on the duplicated genes in two species; (G) Representative examples of positive selection on the duplicated genes in Apis laboriosa. Yellow boxes represented putatively functional genes. Double slashes indicated that genes were retrieved from different scaffolds. Red boxes represented nonsynonymous mutations on the duplicated genes in Apis laboriosa. The homolog sequences from the other seven Hymenopterans species had the same nucleotide composition as Apis dorsata. Note that each example just represents one gene family; (H) Heatmap of the total number of DL events in each functional category in two species. Each functional category is statistically significantly different (Chi-squared test, adjusted P < 0.1) between two species in at least one of the DL events. All the terms were enriched in recently duplicated genes, and only 2 terms were enriched in recent losses in Apis laboriosa (GTP binding: 0 vs 13, adjusted P = 0.013; GTPase activity: 0 vs 13, adjusted P = 0.013).