| Literature DB >> 23636946 |
Daniel F Simola1, Lothar Wissler, Greg Donahue, Robert M Waterhouse, Martin Helmkampf, Julien Roux, Sanne Nygaard, Karl M Glastad, Darren E Hagen, Lumi Viljakainen, Justin T Reese, Brendan G Hunt, Dan Graur, Eran Elhaik, Evgenia V Kriventseva, Jiayu Wen, Brian J Parker, Elizabeth Cash, Eyal Privman, Christopher P Childers, Monica C Muñoz-Torres, Jacobus J Boomsma, Erich Bornberg-Bauer, Cameron R Currie, Christine G Elsik, Garret Suen, Michael A D Goodisman, Laurent Keller, Jürgen Liebig, Alan Rawls, Danny Reinberg, Chris D Smith, Chris R Smith, Neil Tsutsui, Yannick Wurm, Evgeny M Zdobnov, Shelley L Berger, Jürgen Gadau.
Abstract
Genomes of eusocial insects code for dramatic examples of phenotypic plasticity and social organization. We compared the genomes of seven ants, the honeybee, and various solitary insects to examine whether eusocial lineages share distinct features of genomic organization. Each ant lineage contains ∼4000 novel genes, but only 64 of these genes are conserved among all seven ants. Many gene families have been expanded in ants, notably those involved in chemical communication (e.g., desaturases and odorant receptors). Alignment of the ant genomes revealed reduced purifying selection compared with Drosophila without significantly reduced synteny. Correspondingly, ant genomes exhibit dramatic divergence of noncoding regulatory elements; however, extant conserved regions are enriched for novel noncoding RNAs and transcription factor-binding sites. Comparison of orthologous gene promoters between eusocial and solitary species revealed significant regulatory evolution in both cis (e.g., Creb) and trans (e.g., fork head) for nearly 2000 genes, many of which exhibit phenotypic plasticity. Our results emphasize that genomic changes can occur remarkably fast in ants, because two recently diverged leaf-cutter ant species exhibit faster accumulation of species-specific genes and greater divergence in regulatory elements compared with other ants or Drosophila. Thus, while the "socio-genomes" of ants and the honeybee are broadly characterized by a pervasive pattern of divergence in gene composition and regulation, they preserve lineage-specific regulatory features linked to eusociality. We propose that changes in gene regulation played a key role in the origins of insect eusociality, whereas changes in gene composition were more relevant for lineage-specific eusocial adaptations.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23636946 PMCID: PMC3730098 DOI: 10.1101/gr.155408.113
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Overview of protein-coding gene composition and genome size in Hymenoptera. (A) Gene and genome content in seven ant species and honeybee (red), with representative solitary insects (blue) as outgroups. Orthology delineation among protein-coding genes from 12 insects identified orthologs present in all (Universal, n = 12) or almost all (Broad, 10 ≤ n ≤ 11) species, conserved as single-copy genes or with paralogs (with duplications). Differential gene losses leave orthologs shared among fewer species across the phylogeny (Patchy, n < 10). Remaining ant genes exhibit orthology with honeybee ([AMELL] Apis mellifera) and/or jewel wasp ([NVITR] Nasonia vitripennis) (Hymenoptera), among ants (Formicidae), or lack orthology (Undetectable). Total estimated genome sizes vary among Hymenoptera, largely due to repetitive regions (orange bars); however, hymenopterans share a nonrepetitive core of ∼200 Mb (green bars). A maximum-likelihood species tree computed from the concatenated alignment of all universal single-copy orthologs confirms the established ant phylogeny (Moreau et al. 2006). Rates of molecular evolution are comparable to the other hymenopterans, flour beetle ([TCAST] Tribolium castaneum; genome size ∼200 Mb), and body louse ([PHUMA] Pediculus humanus; genome size ∼108 Mb), but are much slower than the dipteran representative ([DMELA] Drosophila melanogaster; genome size 175 Mb). The ant species are (HSALT) Harpegnathos saltator; (LHUMI) Linepithema humile; (CFLOR) Camponotus floridanus; (PBARB) Pogonomyrmex barbatus; (SINVI) Solenopsis invicta; (AECHI) Acromyrmex echinatior; and (ACEPH) Atta cephalotes. (B) Occurrence (blue) and emergence rate (red) of taxonomically restricted genes (TRGs) in different taxonomic clades of Hymenoptera (colors) and Diptera (gray). The youngest clades of both Hymenoptera and Diptera exhibit the highest rates of TRG accumulation. Age is measured as the time between the most distant members of each group and hence does not reflect a clade's absolute age. (C) Rate of change of TRGs versus divergence time, for eight species groupings. Pearson's correlation coefficient is shown. P-value was computed using a two-tailed t-test.
Figure 2.Analysis of 64 pan-ant taxonomically restricted genes (TRGs). (A) RNA expression support for 64 TRGs that are orthologous among all seven ant species but not found in other genomes. RNA expression levels, estimated as log2(FPKM + 1), are shown for various developmental stages and adult castes of C. floridanus and H. saltator. (B) Expression correlation between adult worker castes in C. floridanus (major vs. minor; green) and H. saltator (gamergate vs. worker; blue) for the 64 novel ant TRGs; Pearson's correlation coefficients are shown. (Inset) Histogram of differences in gene expression levels between castes (major − minor in green, gamergate − worker in blue) per gene. (C) Length distribution (in amino acids) of the 64 novel ant TRGs. (Inset) Distribution of the number (left) and percentage (right) of conserved alignment positions (see Supplemental Text 1).
Figure 3.DNA methylation profiles in ant genomes. (A) Normalized CpG content (CpG O/E) of different genomic elements, including exons, introns, and promoter regions (1.5 kb upstream of coding sequence start sites) for protein-coding genes, nongenic conserved elements, and genome-wide background (1-kb fragments). Exons show the strongest evidence of CpG depletion in ants, indicating that they are the most highly methylated regions of the genome in all taxa (confirmed by Bisulfite-seq; below). Introns also show slight depletion of CpGs in ants, suggesting some intron methylation. (B) Scatterplot of log10(mCG/CG) methylation levels estimated by Bisulfite-seq versus CpG O/E for coding sequences in S. invicta reveals a bimodal distribution of gene body methylation. (C) Average methylation levels (mCG/CG) for protein-coding genes in S. invicta males, grouped according to the number of taxa in Hymenoptera with orthologs for each gene; indicating that conserved genes tend to be highly methylated. Error bars indicate 95% confidence intervals for the mean.
Figure 4.Evolution of transcription factor binding sites (TFBS) in insects. (A) Heatmap showing number of promoter TFBSs per gene for 59 TFs in 28 insect species (n = 4189 genes associated with 2-kb promoter CEs in ants). Species (rows), ordered by phylogenetic grouping, are denoted as solitary (blue) or eusocial (red). TFs were clustered hierarchically using average linkage by computing Euclidean distance between TFBS profiles over all queried genes. (Right) Boxplots show distributions of Euclidean distance values for pairs of species, computed using genome-wide TFBS abundance profiles over genes and TFs (see Supplemental Fig. 30). Each boxplot reflects a group of paired comparisons. P-values estimated by two-tailed Mann–Whitney U-test. (**) P < 10−5; (***) P < 10−10. (B) Genes and TFs exhibiting significant TFBS evolution between solitary and eusocial groups. Three thousand two hundred and thirty-one of 4189 genes had sufficient data for significance testing. (*)TF with significant promoter TFBS evolution (two-tail Mann–Whitney U-test; FDR < 0.25). Top two rows indicate numbers of genes showing significant gain or loss of binding sites for the specified TF. Bottom row indicates proportion of significant genes showing more TFBSs in eusocial compared with solitary insects. More than 93% of tested genes are single-copy in the ant genomes. Bottom panels show the mean and standard error of the standard deviation in RNA expression levels (y-axis) for 96 genes with greatest significance in multiple TFs (top 5%), grouped by TF. Expression levels estimated by log2(FPKM + 1). (FPKM) Fragments per kilobase per million reads. (*) Significantly increased caste variation in RNA expression (compared with all ant orthologs, Background, P < 0.05). (C) TFBS abundance profiles for significant genes, shown for three TFs. Species order (x-axis) as in A. (D) TFBS abundance profiles for two neuronal genes with significance in multiple TFs. Cell colors are row-normalized. Periods (.) Missing data. P-values were computed by a Mann–Whitney U-test. (E) mRNA expression level estimates for the genes in D, shown for different worker castes in H. saltator (reproductive/nonreproductive) and C. floridanus (major/minor). Error bars indicate standard error over three biological replicates. (**) FDR < 0.01; (*) FDR < 0.25.
Figure 5.Transcription-factor-mediated signaling pathways controlling salivary gland development. (A) Sex combs reduced (SCR) in combination with extradenticle (EXD) and homothorax (HTH) direct the specification of cells to the salivary gland fate in PS2 of the Drosophila embryo; these TFs are essential for the downstream regulation of genes required for gland cell differentiation and morphogenesis. Boundaries of salivary gland development are restricted along the anterior/posterior axis by abdominal B (Abd-B) and teashirt (TSH), along the dorsal axis by decapentaplegic (DPP) signaling, and along underlying mesoderm by twist (TWI) and snail (SNA). Epidermal growth factor (EGF) signaling determines the decision to differentiate into duct or secretory cells. (B) Regulation of programmed cell death of embryonic salivary gland by 20-hydroxyecdysone (20E). Cell death is inhibited by fork head (FKH) expression in embryonic salivary gland cells. A pulse of 20E at the late larval stage triggers broad (BR-C)–mediated FKH inhibition. A second pulse of 20E at the prepupal stage leads to BR-C and ecdysone-induced protein 74EF (EIP74EF) directed transcription of apoptotic genes, including wrinkled (HID). TFs associated with TFBS evolution in more than 100 target genes (cis-regulatory evolution) (see Fig. 4) (red). The promoter of the fork head locus (yellow), which encodes a TF involved in the regulation of secretory cells, shows significant changes in the gain or loss of promoter TFBSs in ants (trans-regulatory evolution), as detailed in C. (Right) mRNA expression level estimates for fork head, shown for different worker castes in H. saltator (reproductive/non-reproductive) and C. floridanus (major/minor). Error bars indicate standard error over three biological replicates.