| Literature DB >> 32359152 |
Rosa Fernández1,2, Marina Marcet-Houben1,3,4, Fabrice Legeai5,6, Gautier Richard5,7, Stéphanie Robin5,8, Valentin Wucher1, Cinta Pegueroles1,3,4, Toni Gabaldón1,3,9,10,4, Denis Tagu5.
Abstract
Ecology of insects is as wide as their diversity, which reflects their high capacity of adaptation in most of the environments of our planet. Aphids, with over 4,000 species, have developed a series of adaptations including a high phenotypic plasticity and the ability to feed on the phloem sap of plants, which is enriched in sugars derived from photosynthesis. Recent analyses of aphid genomes have indicated a high level of shared ancestral gene duplications that might represent a basis for genetic innovation and broad adaptations. In addition, there are a large number of recent, species-specific gene duplications whose role in adaptation remains poorly understood. Here, we tested whether duplicates specific to the pea aphid Acyrthosiphon pisum are related to genomic innovation by combining comparative genomics, transcriptomics, and chromatin accessibility analyses. Consistent with large levels of neofunctionalization, we found that most of the recent pairs of gene duplicates evolved asymmetrically, showing divergent patterns of positive selection and gene expression. Genes under selection involved a plethora of biological functions, suggesting that neofunctionalization and tissue specificity, among other evolutionary mechanisms, have orchestrated the evolution of recent paralogs in the pea aphid and may have facilitated host-symbiont cooperation. Our comprehensive phylogenomics analysis allowed us to tackle the history of duplicated genes to pave the road toward understanding the role of gene duplication in ecological adaptation.Entities:
Keywords: FAIRE-Seq; gene duplicates; insect; neofunctionalization; phylogenomics; positive selection
Mesh:
Year: 2020 PMID: 32359152 PMCID: PMC7475028 DOI: 10.1093/molbev/msaa110
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
. 1.(A) Chronogram of Sternorrhyncha interrelationships. Systematic classifications (superfamily, family, subfamily, and tribe) are shown in each node/branch. Images selected from PhyloPic. Divergence times taken from TimeTree (Kumar et al. 2017). Dotted branches represent lineages for which divergence times were not available. (B) Example of individual gene tree showing a duplication in Acyrthosiphon pisum, as the genes selected from the present study (see Materials and Methods). Pre- and post-duplication branches as defined for the positive selection analysis are highlighted in red.
. 2.dS values for the selected duplications after filtering genes with dS > 2 and dS < 0.01 (A) and zoom by limiting x-axis to 0.5 (B). Cinara cedri was removed from this plot for visualization. dS for Acyrthosiphon pisum was calculated before (A. pisum Pre-Dup) and after (A. pisum Post-Dup) the duplication took place. See figure 1 for “Pre-Dup” and “Post-Dup” explanation.
. 3.Evolutionary rate (dN/dS) for the selected duplications after filtering duplicates with dS > 2 and dS < 0.01 in any of the genes (201 strict duplicates remained). Acyrthosiphon pisum genes are colored, whereas single-copy orthologs are shown in gray scale. (A) A. pisum preduplication (Pre-Dup) and postduplication (Post-Dup) branches are shown. (B) A. pisum postduplication branches were classified as Fast (F) or Slow (S) according to dS. (C) P values comparing Fast (F) or Slow (S) copies from duplicates binned according to their percentage of identity (increasing by 10 the percentage of identity between bins). Background was colored according to the P value: dark gray for P value < 0.01, light gray for P value > 0.01 and <0.05, and white for P value > 0.05. (D) A. pisum postduplication branches were classified as having (PS = 1) or not (PS = 0) signals of positive selection. (E) A. pisum postduplication branches were classified as having (BS + MNM = 1) or not (BS + MNM = 0) signals of positive selection after BS + MNM model. P values for all plots were estimated using wilcox.test function from R.
. 4.Acyrthosiphon pisum genes from the selected duplications classified as Fast (F) or Slow (S) according to dS, after filtering duplications with dS > 2 and dS < 0.01 (201 strict duplicates remained, see Materials and Methods for details). (A) Percentage of genes under positive selection (PS = 1 in ochre) or with no signal of positive selection (PS = 0 in gray); (B) percentage of genes positively selected after BS + MNM model (BS + MNM = 1 in ochre) or not (BS + MNM = 0 in gray); (C) cDNA length (in aa); and (D) dN/dS after classifying duplicates according to their relative location (i.e., tandem and nontandem duplicates). P values were estimated using wilcox.test function from R.
. 5.Scatterplot showing median gene expression between pairs of duplicates for the “strict duplications” subset. Colors correspond to the P values of a Pearson correlation (see Materials and Methods for details). The significance of the test should be interpreted with caution for those genes that are lowly expressed (i.e., those located in the lower left corner). For visualization purposes, we discarded five gene pairs that were outliers in the subset “not DEG according to binning.”
Duplications in Which the Copy under Positive Selection (PS = 1) Is Expressed in at Least One Tissue in Which the Nonselected Copy (PS = 0) Has No Expression (highlighted in dark gray).
| Duplication Name | Gene Code | Putative Function | PS | BS-M | AM | AP | AO | E0 | E1A | E1K | E2A | E2K | E3A | E3K | H | HP | HR2 | HR4 | LP | G | SG | B |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| duplication_159 | LOC103311559 | Uncharacterized | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 | 0 | 0 | 0 | NA | 0 |
| duplication_159 | LOC100166252 | PiggyBac transposable element-derived protein 4-like | 1 | 0 | NA | NA | NA | NA | NA | NA | NA | NA | 1 | 1 | NA | NA | 0 | 0 | 0 | 1 | NA | NA |
| duplication_213 | LOC100570324 | Uncharacterized | 0 | 0 | 1 | 0 | NA | NA | 1 | 1 | NA | 1 | NA | NA | 0 | 1 | NA | 0 | 0 | NA | NA | NA |
| duplication_213 | LOC100574933 | PAX-interacting protein 1-like | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | NA | 1 | 1 | 1 | NA | 1 | 1 |
| duplication_288 | LOC100572588 | Uncharacterized | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 1 |
| duplication_288 | LOC103308356 | Uncharacterized | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | 1 | 1 | 1 |
| duplication_322 | LOC107883251 | Uncharacterized | 0 | 0 | 0 | NA | NA | 1 | NA | 1 | 1 | 1 | 1 | 1 | 1 | NA | 1 | 1 | NA | 1 | 1 | 1 |
| duplication_322 | LOC107883068 | Uncharacterized | 1 | 1 | 1 | 1 | NA | 1 | 1 | 1 | 1 | 1 | 1 | NA | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| duplication_340 | LOC103310866 | Uncharacterized RING finger protein C32D5.10-like | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 |
| duplication_340 | LOC100571229 | E3 ubiquitin-protein ligase Topors-like | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | NA | NA | 0 | 1 | NA | NA | 1 |
| duplication_397 | LOC100162340 | Dynein heavy chain 1, axonemal | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 | 0 | 0 | 0 | NA | 0 | NA | 0 | 0 |
| duplication_397 | LOC107882216 | Dynein heavy chain 1, axonemal-like | 1 | 0 | 0 | NA | 1 | 0 | 0 | 0 | NA | 0 | NA | 0 | NA | 1 | 0 | NA | 0 | NA | NA | NA |
| duplication_4 | LOC100568916 | Uncharacterized | 0 | 0 | 1 | NA | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | NA | 0 | 0 | NA | NA | NA | NA |
| duplication_4 | LOC100160128 | 26S proteasome non-ATPase regulatory subunit 12-like | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | 1 | 1 | 1 | 1 | NA | NA | 1 |
| duplication_460 | LOC107882168 | Zinc finger protein 134-like | 0 | 0 | 0 | 0 | NA | 0 | NA | NA | 0 | 0 | NA | NA | 0 | 0 | 0 | 0 | 0 | NA | NA | 0 |
| duplication_460 | LOC103310004 | Zinc finger protein 134-like | 1 | 0 | 1 | 0 | 1 | NA | NA | 0 | 0 | NA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | NA | 0 |
| duplication_489 | LOC107882711 | Uncharacterized | 0 | 0 | NA | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | 1 | 1 | 1 | 0 | 1 | NA | 1 |
| duplication_489 | LOC100162929 | Uncharacterized | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| duplication_576 | LOC100569858 | Uncharacterized | 0 | 0 | 1 | 0 | 0 | NA | NA | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 | 0 | 0 | NA | NA | 0 |
| duplication_576 | LOC100574180 | Uncharacterized | 1 | 0 | 1 | 0 | 0 | NA | NA | 0 | NA | NA | 0 | NA | 0 | NA | 1 | NA | 0 | 1 | 1 | 0 |
| duplication_633 | LOC103309550 | Zinc finger MYM-type protein 1-like | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 0 | 0 | 0 | 0 | NA | NA | 0 |
| duplication_633 | LOC103307955 | Zinc finger MYM-type protein 1-like | 1 | 0 | 0 | NA | NA | 0 | NA | 0 | 0 | NA | 0 | 0 | NA | NA | NA | 0 | 0 | NA | NA | 1 |
| duplication_663 | LOC107884962 | Uncharacterized | 0 | 0 | 0 | 0 | 0 | 1 | NA | 1 | NA | NA | NA | 1 | NA | 0 | 0 | 0 | 0 | 1 | 1 | NA |
| duplication_663 | LOC100570263 | Uncharacterized | 1 | 0 | NA | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | NA | 1 | NA | 0 | 1 | NA | 1 | 1 |
| duplication_693 | LOC100165085 | Uncharacterized | 0 | 0 | 1 | 0 | 0 | NA | 0 | NA | 0 | NA | 0 | 0 | NA | 0 | NA | NA | NA | NA | NA | 1 |
| duplication_693 | LOC100165046 | Dynein heavy chain 7, axonemal-like | 1 | 1 | NA | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| duplication_795 | LOC107882727 | Sialin-like | 0 | 0 | 0 | NA | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 1 | 1 | 1 | 1 | NA | NA | NA |
| duplication_795 | LOC100164217 | Putative inorganic phosphate cotransporter | 1 | 0 | 1 | 1 | 1 | NA | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 |
| duplication_840 | LOC100571978 | Uncharacterized | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NA | 0 |
| duplication_840 | LOC107884413 | Uncharacterized | 1 | 0 | 0 | 0 | 0 | 0 | NA | NA | 0 | NA | NA | 0 | NA | NA | 1 | 1 | 0 | NA | NA | 0 |
Note.—NA, no information on gene expression available (see Materials and Methods for further details about each condition type); PS, positive selection; BS-M, branch-site test model accounting for MNM; AM, adult males; AP, adult females parthenogenetic; AO, adult female oviparae; E0, embryos stage 17; E1A, embryos stage 18 sex; E1K, embryos stage 18 asex; E2A, embryos stage 19 sex; E2K, embryos stage 19 asex; E3A, embryos stage 20 sex; E3K, embryos stage 20 asex; H, Head_Mix; HP, head adult female parthenogenetic; HR2, head larvae 2; HR4, head larvae 4; LP, legs adult female parthenogenetic; G, gut; SG, salivary glands; B, bacteriocyte.