| Literature DB >> 31638910 |
Xin Yao1,2, Yun-Hong Tan3,4,5, Jun-Bo Yang6, Yan Wang7, Richard T Corlett8,9, Jean-François Manen10.
Abstract
BACKGROUND: The genus Ilex (Aquifoliaceae) has a near-cosmopolitan distribution in mesic habitats from tropical to temperate lowlands and in alpine forests. It has a high rate of hybridization and plastid capture, and comprises four geographically structured plastid groups. A previous study showed that the plastid rbcL gene, coding for the large subunit of Rubisco, has a particularly high rate of non-synonymous substitutions in Ilex, when compared with other plant lineages. This suggests a strong positive selection on rbcL, involved in yet unknown adaptations. We therefore investigated positive selection on rbcL in 240 Ilex sequences from across the global range.Entities:
Keywords: Environmental adaptation; Molecular adaptation; PALM; Positive selection; Rubisco
Year: 2019 PMID: 31638910 PMCID: PMC6805373 DOI: 10.1186/s12862-019-1521-1
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Fig. 1Variable codons (91 to 436) found among 116 Ilex sequences (Worldwide dataset) mapped on the accession I. canariensis_90, as reference. Uncolored codons are synonymous substitutions. Yellow codons are non-synonymous substitutions. # means positively selected codons (Bayesian posterior probability > 0.95 using the M8 model of Codeml, PAML (Table 1). + means positively selected residues found among the 20 most often positively selected residues of [29] (Table 1). § means residues involved in coevolution (Table 4)
Fig. 3Mapping of positively selected sites in the 20-species tree. Only SLR positively selected sites are included, as well as Ilex positively selected sites recorded in Table 6. Asian/NAm means Asian/North American
List of positively selected sites of rbcL of Ilex with posterior probabilities (PAML, Codeml, model M8) > 0.95 in the 5 subsets of the Worldwide dataset. Groups 1, 2, 3 and 4 represent the 4 phylogenetic plastid groups of [36] (Asian/North-American Group 1, Asian/North/American Group 2, American Group 3, and Eurasian Group 4. The 20-species sample represents a selection of the 116 specimens of [36], reflecting the 4 groups and a wide geographic distribution (see Fig. 3). In red, selected Ilex residues belonging to the list of the 20 most often positively selected sites detected in the 151 lineages of Kapralov and Filatov [29]
(*) indicates residues having ≥95% SLR support for positive selection
(a) Number of sequences
Pairs of coevolving residues detected by CAPS (Coevolution Analysis using Protein Sequences) in 116 rbcL sequences of the Ilex Worldwide dataset. Parameters used in CAPS are: alpha threshold 0.001, number of simulated alignments 1000, bootstrap value 0.95, await convergence and time correction on
| Codon 1 | Codon 2 | Correlation | Bootstrap | Distance in Å |
|---|---|---|---|---|
| D 19 | G 196 | 0.51 | 0.99 | 68.6 |
| D 30 | G 196 | 0.50 | 0.99 | 57.2 |
| A 91 | N 95 | 0.99 | 0.97 | 8.2 |
| Y 97 | M 309 | 0.70 | 1.00 | 12.8 |
| F 117 | T 279 | 0.67 | 0.98 | 13.2 |
| C 221 | M 251 | 0.51 | 1.00 | 20.9 |
| M 309 | A 328 | 0.70 | 1.00 | 19.7 |
| A 317 | M 320 | 0.62 | 0.98 | 6.7 |
| A 328 | D 340 | 0.78 | 1.00 | 16.9 |
| A 328 | V 354 | 0.70 | 1.00 | 23.9 |
| D 340 | V 354 | 0.70 | 1.00 | 21.0 |
| F 353 | V 354 | 0.73 | 1.00 | 5.5 |
List of positively selected residues of rbcL with posterior probabilities (PAML, Codeml, Model M8) > 0.95 in the Chinese dataset. This dataset represents group 1, 2, and 4 (The American Group 3 is not present in China). In red, selected Ilex residues belonging to the list of 20 most often positively selected sites detected in the 151 lineages of Kapralov and Filatov [29]. (*) indicates that the residues have ≥95% SLR support for positive selection. (a) indicates that residues are not positively selected in the Worldwide dataset or that residues (461 and 470) were not available in the truncated Worldwide dataset. (b) Number of residues
Fig. 2Positively selected residues of Ilex represented on the surface of the Rubisco large subunit. a Front and back of folded large subunit of the modeled Ilex canariensis 3-D structure. Visible positively selected residues from the Worldwide dataset are in red, while additional positively selected residues from the Chinese dataset are in blue. b Extrapolation on the Rubisco Spinacia oleracea 3-D structure (1RCX). Presented are the 4 large subunits (E,B,V and R) and 3 small subunits (f, s and w) that are in contact with one large subunit (here L, in orange). Positively selected residues are represented with the same colors as in A. Most of these residues (represented here on the L subunit) are exposed on the equatorial surface the L8S8 Rubisco hexadecamer. Sites 142, 429 and 461 are involved in intra-dimer, dimer-dimer and L/S interactions (see Table 3). All other sites (15, 19, 21, 23, 28, 30, 84, 86, 91, 95, 97, 340, 353, 354 and 470) represent positively selected residues potentially involved in interactions with chaperones
Positively selected residues of the large subunit of Ilex Rubisco that are in contact with another L or S subunit, extrapolated from the Spinacia oleracea Rubisco 3D-structure (see Fig. 2). Thirteen positively selected residues of the Rubisco large subunit of Ilex are at the interface (≤ 6 Å) of intra-dimer (ID), of dimer-dimer (DD), and of large/small subunits (SSU). Two of them (219 and 461) share a contact with another large and with a small subunit. In brackets, the total number of residues of the Rubisco large subunit that are in contact with another large subunit or the small subunit in ID, DD and SSU interactions
| Residues | Molecular interactions | ||
|---|---|---|---|
| ID (96) | DD (39) | SSU (76) | |
| 117 | X | ||
| 142 | X | ||
| 219 | X | X | |
| 225 | X | ||
| 226 | X | ||
| 247 | X | ||
| 248 | X | ||
| 249 | X | ||
| 251 | X | ||
| 279 | X | ||
| 309 | X | ||
| 429 | X | ||
| 461 | X | X | |
Comparison of positive selection in rbcL of different lineages detected by PAML (M8 model) and by SLR (probability ≥0.95)
| Lineages | N. seqa | dN/dSb | %pc | PALMd | SLRd |
|---|---|---|---|---|---|
| 20 | 20.5 | 8.2 | 26 | 7 | |
| eurosids I-26 (Fagaceae)e | 21 | 62.0 (61.0) | 0.4 (0.4) | 2 (2) | 1 |
| euasterids I-9 (Lamiaceae)e | 19 | 27.9 (26.7) | 6.3 (6.0) | 3 (2) | 0 |
| Coniferales 6 ( | 35 | 13.1 (24.8) | 1.6 (1.1) | 6 (6) | 2 |
| commelinids-7 (Cyperaceae)e | 27 | 1.1 (1.2) | 12.7 (12.9) | 0 (0) | 0 |
| Commelinids-15 (Poaceae)e | 28 | 2.4 (2.4) | 10.0 (9.9) | 15 (15) | 0 |
| eurosids I-24 (Fagales)e | 20 | 1.7 (1.6) | 7.4 (8.5) | 0 (0) | 0 |
| euasterids I-8 (Lamiaceae)e | 20 | 2.7 (2.8) | 10.7 (10.3) | 2 (2) | 0 |
| Gnetales-1 ( | 23 | 3.7 (3.5) | 8.5 (8.6) | 3 (3) | 0 |
|
| 16 | 19.2 | 1.8 | 5 | 1 |
|
| 24 | 6.2 | 5.4 | 7 | 1 |
|
| 14 | 5.1 | 3.9 | 6 | 0 |
|
| 18 | 11.6 | 1.2 | 1 | 1 |
|
| 30 | 18.4 | 0.6 | 2 | 1 |
|
| 28 | 2.5 | 9.7 | 1 | 0 |
|
| 27 | 26.1 | 0.6 | 0 | 0 |
anumber of sequences
bmean dN/dS, “eleventh class” of the M8 model (PAML)
cproportion of positively selected residues, “eleventh class” of the M8 model (PAML)
dnumber of positively selected sites
ePAML re-calculations from data of Kapralov and Filatov [29], with their published results between brackets
Positively selected residues detected in Ilex (in both the Worldwide and Chinese datasets) also described in other lineages and associated with biological and environment traits
| Residues |
| Literatureb | Biological and environmental traits | Authors |
|---|---|---|---|---|
| 86* | Q to E,G | H,D to E,G | Rubisco kinetics | [ |
| Q to E,G | H to Y | Wet to dry environment | [ | |
| 95* | N to S | N,T to D,S | Leaf traits and climate characteristics | [ |
| N to S | N.A. | Rubisco kinetics | [ | |
| 142* | P to T | P to I,T,V | Specificity O2/CO2 | [ |
| P to T | P to T | C3 to C4 plants | [ | |
| P to T | P to A,T | C4 to C4 plants | [ | |
| P to T | N.A. | C3 to C4 plants | [ | |
| 219* | L to V | N.A. | Leaf traits and climate characteristics | [ |
| 225* | L to I | I to L | [ | |
| 230 | A to T | A to T | Wet to dry environment | [ |
| 281* | A to S | S to A | [ | |
| A to S | A to S | Rubisco kinetics | [ | |
| 309* | M to I | M to I | C3 to C4 plants | [ |
| M to I | M to I | Specificity O2/CO2 | [ | |
| M to I | M to I | C3 to C4 plants | [ | |
| M to I | M to I | C4 to C4 plants | [ | |
| M to I | N.A. | C3 to C4 plants | [ | |
| 320 | M to L | M to L | Rubisco kinetics | [ |
| 328* | A to S,G | A to S | Rubisco kinetics | [ |
| A to S,G | A to S | C3 to C4 plants | [ | |
| A to S,G | A to S | C4 to C4 plants | [ | |
| A to S,G | N.A. | C3 to C4 plants | [ | |
| A to S,G | N.A. | Leaf traits and climate characteristics | [ |
*Ilex positively selected residues belonging to the 20-most often selected residues of Kapralov and Filatov [29]
aAmino acid substitution in Ilex
bAmino acid substitution in other lineages recorded in the literature