| Literature DB >> 28082607 |
Eric Fontanillas1, Oxana V Galzitskaya2, Odile Lecompte3, Mikhail Y Lobanov2, Arnaud Tanguy1, Jean Mary1, Peter R Girguis4, Stéphane Hourdez1, Didier Jollivet1.
Abstract
Temperature, perhaps more than any other environmental factor, is likely to influence the evolution of all organisms. It is also a very interesting factor to understand how genomes are shaped by selection over evolutionary timescales, as it potentially affects the whole genome. Among thermophilic prokaryotes, temperature affects both codon usage and protein composition to increase the stability of the transcriptional/translational machinery, and the resulting proteins need to be functional at high temperatures. Among eukaryotes less is known about genome evolution, and the tube-dwelling worms of the family Alvinellidae represent an excellent opportunity to test hypotheses about the emergence of thermophily in ectothermic metazoans. The Alvinellidae are a group of worms that experience varying thermal regimes, presumably having evolved into these niches over evolutionary times. Here we analyzed 423 putative orthologous loci derived from 6 alvinellid species including the thermophilic Alvinella pompejana and Paralvinella sulfincola. This comparative approach allowed us to assess amino acid composition, codon usage, divergence, direction of residue changes and the strength of selection along the alvinellid phylogeny, and to design a new eukaryotic thermophilic criterion based on significant differences in the residue composition of proteins. Contrary to expectations, the alvinellid ancestor of all present-day species seems to have been thermophilic, a trait subsequently maintained by purifying selection in lineages that still inhabit higher temperature environments. In contrast, lineages currently living in colder habitats likely evolved under selective relaxation, with some degree of positive selection for low-temperature adaptation at the protein level.Entities:
Keywords: RNAseq; hydrothermal vents; protein composition; selection; thermal adaptation
Mesh:
Substances:
Year: 2017 PMID: 28082607 PMCID: PMC5381640 DOI: 10.1093/gbe/evw298
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Nucleotidic and Amino Acid Compositions (Percentages Based on Direct Counts) of Alvinellid Coding Sequences and Translated Proteins Obtained from the Six Transcriptome Assemblies
| Species |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| ||||||
| Total GC% | 44.42 | 44.58 | 43.75 | 44.85 | 44.26 | 44.17 |
| GC12% | 44.90 | 44.80 | 44.90 | 45.00 | 44.70 | 44.60 |
| GC3% | 43.55 | 44.05 | 41.51 | 44.64 | 43.47 | 43.32 |
| Purine Load | 101.82 | 102.52 | 105.20 | 103.98 | 104.23 | 103.90 |
| rDNA-18S | 52.22 | 52.20 | 51.92 | 51.76 | 51.71 | 51.10 |
|
|
|
|
|
| 26.85 | 26.70 |
| D (Asp) | 6.08 | 6.11 | 5.97 | 6.06 |
|
|
| E (Glu) |
|
|
|
| 6.69 | 6.71 |
| H (His) | 2.46 | 2.34 | 2.22 | 2.32 | 2.37 | 2.25 |
| K (Lys) | 6.62 | 6.85 | 6.45 | 6.51 | 6.55 | 6.43 |
| R (Arg) |
|
|
|
| 4.62 | 4.44 |
|
| 29.99 | 29.84 | 30.18 | 29.55 |
|
|
| Q (Gln) | 4.08 | 4.09 | 4.19 | 4.04 | 4.24 | 4.09 |
| G (Gly) | 3.28 | 3.28 | 3.10 | 2.97 |
|
|
| N (Asn) | 5.08 | 5.07 | 5.31 | 5.28 | 5.29 | 5.17 |
| P (Pro) |
|
|
|
| 2.24 | 2.12 |
| S (Ser) | 8.08 | 8.13 | 8.27 | 8.05 |
|
|
| T (Thr) | 6.92 | 6.65 | 6.83 | 6.65 | 6.84 | 6.70 |
|
|
|
|
|
| 36.21 | 36.39 |
| A (Ala) |
|
|
|
| 6.99 | 7.15 |
| C (Cys) | 1.42 | 1.35 | 1.36 | 1.29 | 1.37 | 1.17 |
| I (Ile) | 8.40 | 8.56 | 8.00 | 8.38 | 8.38 | 8.85 |
| L (Leu) | 6.98 | 6.90 | 6.96 | 6.69 | 6.63 | 6.68 |
| M (Met) | 3.42 | 3.50 | 3.20 | 3.22 |
|
|
| V (Val) | 8.88 | 8.75 | 9.23 | 8.93 | 9.25 | 8.56 |
|
|
|
|
|
| 5.76 | 5.81 |
| F (Phe) | 3.12 | 3.08 | 3.25 | 3.29 | 3.19 | 3.32 |
| W (Trp) |
|
|
|
| 0.27 | 0.29 |
| Y (Tyr) |
|
|
|
| 2.30 | 2.20 |
Note.—All frequencies of each species were compared against either “cold” species (§) or “hot” species (#) according to an a priori ecological sign-test. Bold results discriminated hot from cold lineages.
Level of significance associated with the a priori ecological sign-test,
P < 0.05,
P < 0.01,
P < 0.001.
. 1.—Main prokaryotic thermophilic amino acid indices (IVYWREL, EK/QH, and CvP_bias criteria) in “hot” and “cold” alvinellid species using translated sequences of 423 orthologous transcripts. (a) Overall CvP_bias between species, (b) overall EK/QH vs. IVYWREL biplot between species.
. 2.—Factorial correspondence analysis of the amino acid frequencies between protein orthologous sequences of six alvinellid species.
. 3.—Unrooted phylogenomic ML tree of alvinellids with patristic distances (in bold). All nodes were fully supported by 100% bootstrap values.
Gain and Loss of Residue Classes in the Present-day Species Obtained from the Derived Substitutions Accumulated Within Each Terminal Branch of the Phylogenomic ML Tree
| Species | Polar | Aliphatic | Aromatic | Aa + | Aa − | Cys | Pro | Gly |
|---|---|---|---|---|---|---|---|---|
|
| +11 | −4 | +1 | −4 | −3 | +1 | −5 | +3 |
|
| −4 | +13 | +9 | −1 | −1 | −5 | −4 | −7 |
|
| +12 | +4 | 0 | −1 | −5 | +1 | −4 | −7 |
|
| +4 | +10 | +9 | +8 | −21 | +3 | −4 | −9 |
|
| +73 | −14 | −6 | −18 | −13 | +2 | −16 | −8 |
|
| +65 | +36 | +4 | −45 | −33 | −9 | −27 | +9 |
Note.—Aa+: positively charged amino acids, Aa−: negatively charged amino acids.
. 4.—Factorial correspondence analysis (FCA) of orientated amino acid substitution frequencies between orthologous protein sequences of alvinellid species. The dashed envelope represents the cloud of remaining substitution points.
. 5.—Derived amino acid replacement rates and median intergenes dN/dS ratios observed on the terminal branches of the tree. Each rate was weighted with the length (t) of the corresponding terminal branch obtained after tree ultrametrization. Absolute timing inferred under a clock hypothesis on the mt Cox1 gene (see supplementary fig. S1, Supplementary Material online) and its evolutionary rate estimated from Chevaldonné et al. (2002) produced exactly the same results. The null hypothesis of constant rate of derived replacements was rejected using a χ 2 test (χ 2 = 1326, dfl = 5). (a) Derived mutation rates, (b) dN/dS ratios in terminal branches, (c) P-values represent the level of dN/dS significance following an ecological a priori Binomial test (i.e. values of a given species significantly greater or lower than either “cold” (§) or “hot” (#) species).
. 6.—Biplot of dN/dS versus dS medians obtained over branches of the phylogenomic ML tree of the six alvinellid species with the free-ratio branch-model M1 implemented in PaML 4.0 (Yang 2007). Branch values are significantly (P < 0.01: Pearson’s r test) linearly regressed without using the branch leading to Alvinella spp. (outside of the confidence envelope at 95% for the whole dataset).
. 7.—Ancestral reconstruction of amino acid changes using the alvinellid criteria PAYLE vs. DGMS along the phylogenomic tree of alvinellid worms following the Bayesian reconstruction of ancestral sequences implemented in aaML of PaML (Yang 2007).