| Literature DB >> 19383167 |
Mariana Mondragón-Palomino1, Luisa Hiese, Andrea Härter, Marcus A Koch, Günter Theissen.
Abstract
BACKGROUND: Positive selection is recognized as the prevalence of nonsynonymous over synonymous substitutions in a gene. Models of the functional evolution of duplicated genes consider neofunctionalization as key to the retention of paralogues. For instance, duplicate transcription factors are specifically retained in plant and animal genomes and both positive selection and transcriptional divergence appear to have played a role in their diversification. However, the relative impact of these two factors has not been systematically evaluated. Class B MADS-box genes, comprising DEF-like and GLO-like genes, encode developmental transcription factors essential for establishment of perianth and male organ identity in the flowers of angiosperms. Here, we contrast the role of positive selection and the known divergence in expression patterns of genes encoding class B-like MADS-box transcription factors from monocots, with emphasis on the family Orchidaceae and the order Poales. Although in the monocots these two groups are highly diverse and have a strongly canalized floral morphology, there is no information on the role of positive selection in the evolution of their distinctive flower morphologies. Published research shows that in Poales, class B-like genes are expressed in stamens and in lodicules, the perianth organs whose identity might also be specified by class B-like genes, like the identity of the inner tepals of their lily-like relatives. In orchids, however, the number and pattern of expression of class B-like genes have greatly diverged.Entities:
Mesh:
Year: 2009 PMID: 19383167 PMCID: PMC2680841 DOI: 10.1186/1471-2148-9-81
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Phylogeny of monocot . The figure shows the Bayesian inference phylogeny of those DEF-like sequences for which at least 80% of the complete coding sequence is available and the new ones reported here. An appropriate model of nucleotide substitution was selected by Modeltest for the nucleotide sequences encoding the M-, I- and K- domains, and used to infer this phylogeny where the C-terminal domain was excluded. The bars indicate the different clades of DEF-like genes from the Orchidaceae, Liliales, Commelinales, Poales and the outgroup. The numbers on every node indicate the Bayesian posterior probabilities (PP). Black arrows emphasize nodes that are discussed in the text. The inset shows a diagram of the monocot relationships based on phylogenetic analyses by [65,66].
Figure 2Phylogeny of monocot . The figure shows the Bayesian inference phylogeny with all GLO-like sequences where the complete coding sequence is available and the new ones reported here. The bars indicate the single clade formed by sequences from the Orchidaceae and of Clades 1 and 2 formed by the rest of the Asparagales, Poales Zingiberales, Liliales, Arecales, Alismatales and the outgroup. The number on every node corresponds to the Bayesian posterior probabilites (PP). Black arrows emphasize nodes that are discussed in the text.
Models of codon substitution employed on the present analysis.
| Sites | M7 | Distribution of ω along sites follows a β distribution, no selection | 7 | 2 | Alignments of | |||
| M8 | β distribution where ω > 1. Implements BEBb. | 8 | 4 | |||||
| Branches | M0 | one ω value for all branches | 0 | 1 | ω | Alignments of | ||
| M2 | n different ω values on n specified branches | 0 | User-specified | Different ω values corresponding to each of the user-specified groups of branches | ||||
| Clades and sites | M1a | nearly neutral evolution | 1 | 2 | Clades 1 and 2: 0 < ω0 < 1, ω1 = 1 | Alignment of orchid | ||
| MC | In site classes 2 and 3 selective pressure varies in different parts of the phylogeny. Implements BEBb. | 2 | 5 | Clade 1: 0 < ω0 < 1, ω1 = 1, ω2 | Alignment of orchid | |||
| Clade 2: 0 < ω0 < 1, ω1 = 1, ω3 | Alignment of Poales | |||||||
| Proportions: | ||||||||
| M3 | Assumes several site classes with independently estimated ω. | 3 | 5 | ω0, ω1, ω2 | Same datasets analyzed with M1a vs. MC. | |||
| Proportions: | ||||||||
| MD | In site class 2, selective pressure is different on each clade. | 3 | 6 | Clade 1: ω0, ω1, ω2 Clade 1 | ||||
| Clade 2: ω0, ω1, ω2 Clade 2 | ||||||||
| Proportions: | ||||||||
| Branches and sites | MA1 | Neutral or purifying selection on individual codons along specific clades. | 4 | 3 | Site class Background Foreground | Alignments of | ||
| 0 | 0 < ω0 < 1 | 0 < ω0 < 1 | ||||||
| 1 | ω1 = 1 | ω1 = 1 | ||||||
| 2a | 0 < ω0 < 1 | ω2 = 1 | ||||||
| 2b | ω1 = 1 | ω2 = 1 | ||||||
| Proportions: p0, p1 d | ||||||||
| MA | Tests for positive selection on individual codons along specific clades. Implements BEBb. Only foreground clades experience positive selection. | 4 | 4 | Site class Background Foreground | ||||
| 0 | 0 < ω0 < 1 | 0 < ω0 < 1 | ||||||
| 1 | ω1 = 1 | ω1 = 1 | ||||||
| 2a | 0 < ω0 < 1 | ω2>1 | ||||||
| 2b | ω1 = 1 | ω2>1 | ||||||
| Proportions: p0, p1 d | ||||||||
a. Refers to the kind of data where the process of codon substitution is analyzed
b. Bayes Empirical Bayes calculation indicates the posterior probability a site belongs to a given class. This information in valuable to identify sites under positive selection.
c. The values of p0 and p1 are also used to calculate p2 proportions of sites with ω2 and ω3
d. The values of p0 and p1 are used to calculate the proportions of site classes 2a and 2b
e. As previously explained, each dataset includes an un-rooted phylogeny reconstructed with Mr. Bayes using the most appropriate model identified by Modeltest.
Figure 3Test of hypotheses about the variation of ω along the phylogeny of . The table summarizes the hypotheses tested and their maximum likelihood estimates of ω, which depending on the hypothesis, are free to vary in specific clades (indicated on the phylogeny), while being uniform in the rest of the phylogeny. Column "L" indicates the likelihood value that corresponds to each hypothesis, and P the probability of each model rejecting its corresponding null hypothesis in the series of LRTs indicated at the bottom of the table. The phylogeny employed in these tests is based on the alignment of the complete coding sequences of DEF-like genes. However, to facilitate the inference of the parameters for each model, the outgroup and sequences from clades not directly relevant to the analyses were removed from the alignment and the phylogeny was re-estimated with an ad hoc model as described in Methods. The branches labeled with italics were tested for positive selection with branch-site models A1 and A.
Figure 4Test of hypotheses about the variation of ω along the phylogeny of . This figure is organized like Figure 3, but the phylogeny employed in these tests is based on the alignment of the complete coding sequences of GLO-like genes. However, to facilitate the inference of the parameters for each model, the outgroup and sequences from clades not directly relevant to the analyses were removed from the alignment and the phylogeny was re-estimated with an ad hoc model as described in Methods. The branches labeled with italics were tested for positive selection with branch-site models A1 and A.
Parameter estimates and LRT of M3 vs. MD and M1a vs. MC in DEF-like genes from Orchidaceae-specific clades 1 and 2.
| M3 discrete | ω0 = 0.04544, f0 = 0.63768 | -3516.356560 |
| M3 discrete | ω0 = 0.04544, f0 = 0.63768 | -3516.356560 |
| Model D | ω0 = 0.04102, f0 = 0.60628 | -3508.313165 |
| Model 1a | ω0= 0.09066, f0 = 0.75319 | -3540.435701 |
LRTs of variable ω's among sites
M3(k = 2) vs M3 (k = 3) 2δ = 3.617532 df = 2 P = 0.1639
LRTs of variable ω's among sites and branches
M3 (k = 2) vs MD (k = 2) 2δ = 16.08679 df = 2 P = 0.0003
M3 (k = 3) vs MD (k = 3) 2δ = 16.309046 df = 2 P = 0.0003
M3 (k = 2) vs MD (k = 3) 2δ = 19.926578 df = 4 P = 0.0005
M1a vs MC 2δ = 67.681514 df = 2 P < 0.0001
1. Bonferroni correction for the 5 tests applied to each pair of clades: 0.008333
Parameter estimates and LRT of M3 vs MD and M1a vs MC in DEF-like genes from Orchidaceae-specific clades 3 and 4.
| M3 discrete | ω0 = 0.02182, f0 = 0.76519 | -3729.728309 |
| M3 discrete | ω0 = 0.02182, f0 = 0.09116 | -3729.728309 |
| Model D | ω0 = 0.02175, f0= 0.76410 | -3729.649245 |
| Model D | ω0 = 0.23556, f0 = 0.23301 | -3729.138799 |
| Model 1a | ω0 = 0.05538, f0 = 0.95421 | -3764.994659 |
LRTs of variable ω's among sites
M3(k = 2) vs M3 (k = 3) 2δ = 0 df = 2 P = 1
LRTs of variable ω's among sites and branches
M3 (k = 2) vs MD (k = 2) 2δ = 0.158128 df = 2 P = 0.9240
M3 (k = 3) vs MD (k = 3) 2δ = 1.17902 df = 2 P = 0.5546
M3 (k = 2) vs MD (k = 3) 2δ = 1.17902 df = 4 P = 0.8815
M1a vs MC 2δ = 72.514684 df = 2 P < 0.0001
1. Bonferroni correction for the 6 tests applied to each pair of clades: 0.008333
Parameter estimates and LRT of M3 vs. MD and M1a vs. MC in GLO-like genes from Poales-specific clades 1 and 2.
| M3 discrete | ω0 = 0.01674, f0 = 0.58033 | -3440.487607 |
| M3 discrete | ω0 = 0.01674, f0 = 0.58033 | -3440.487645 |
| Model D | ω0 = 0.22754, f0 = 0.41968 | -3440.475354 |
| Model D | ω0 = 48.10696, f0 = 0.00000 | -3439.561525 |
| Model 1a | ω0 = 0.08103, f0 = 0.96439 | -3484.524831 |
LRTs of variable ω's among sites
M3(k =) vs M3 (k =) 2δ = -7.6E-05 df = 2 P = 1
LRTs of variable ω's among sites and branches
M3 (k = 2) vs MD (k = 2) 2δ = 0.024506 df = 2 P = 0.9878
M3 (k = 3) vs MD (k = 3) 2δ = 1.85224 df = 2 P = 0.3961
M3 (k = 2) vs MD (k = 3) 2δ = 1.852164 df = 4 P = 0.7629
M1a vs MC 2δ = 89.746018 df = 2 P < 0.0001
1. Bonferroni correction for the 6 tests applied to each pair of clades: 0.008333
Parameter estimates and LRT of MA1 vs. MA in DEF-like genes from Poales.
| Poales | A | ω0 = 0.08303, f0 = 0.89319 | 45 L 0.612 | -9975.021057 |
| A1 | ω0 = 0.08196, f0 = 0.78512 | -9979.237754 | ||
| LRT | 2δ = 8.433394 df = 1 P = 0.0037 | |||
| Poales | A | ω0 = 0.08281, f0 = 0.86235 | 14 S 0.813 | -9983.412368 |
| A1 | ω0 = 0.08278, f0 = 0.76740 | -9984.514801 | ||
| LRT | 2δ = 2.204866 df = 1 P = 0.1376 | |||
| Poales | A | ω0 = 0.08393, f0 = 0.95687 | 33 T 0.517 | -9987.927062 |
| A1 | ω0 = 0.08401, f0 = 0.95276 | -9988.396587 | ||
| LRT | 2δ = 0.93905 df = 1 P = 0.3325 | |||
The branches tested are those labeled with cursive fonts in Figure 3.
Figure 5Sites under positive selection in Poales, mapped on the alignment of monocot . The sites with a posterior probability of ≥ 0.90 listed in Table 5 are marked with an asterisk on the alignment. Although the sequences under positive selection correspond to the Poales, the rest of the sequences in the alignment are included for a better comparison.
Figure 6Trends in the molecular evolution of . In this schematic phylogeny the major morphological and evolutionary features of the two lineages of class B genes from Orchidaceae and Poales are summarized: (a) Three gene duplications resulted in four DEF-like genes in the Orchidaceae. The points where these events happened are labeled 1 to 3 without assuming any specific order for duplication events 2 and 3. This includes the reasonable possibility that duplications 2 and 3 occurred simultaneously (e.g. by partial or complete genome duplication). We propose that duplication of DEF-like genes is associated to the emergence of new perianth organ identities. Specifically, from an ancestor with an actinomorphic perianth evolved after the first round of DEF-like gene duplication an intermediate morphology where inner and outer tepals were different. Following a final round of duplication and functional divergence of clade 4 genes the lip or labellum organ identity emerged and made the perianth zygomorphic. The arrow indicates a 180° rotation of the pedicel and or ovary (resupination) that in most orchids sets the lip, the developmentally uppermost tepal, in a lowermost portion of the perianth. (b) Purifying selection prevails in DEF- and GLO-like genes (ω << 1), but is less stringent in Orchidaceae DEF-like genes from clade 2 (ω < 1); (c) Some proteins encoded by DEF-like genes from clade 2 miss part of the C-terminal domain; (d) Positive selection (ω>1) took place at the base of the Poales DEF-like genes. Most of the positions involved are in the K-domain. Positive selection was also detected, although not statistically significant, on the lineage linking DEF-like genes from Restionaceae with the rest of the Poales. The highly reduced and zygomorphic perianth or grasses evolved from a common ancestor with the "typical" monocot perianth morphology. Here represented by black half-moons are the inner perianth organs of Streptochaeta angustifolia (Anomochlooideae) and Oryza sativa (Pooideae) to illustrate the morphological transition that was preceded by the process of positive selection here documented. In this diagram the palea is in gray and the outer perianth whorl is absent.