| Literature DB >> 19543388 |
Joel R McNeal1, Jennifer V Kuehl, Jeffrey L Boore, Jim Leebens-Mack, Claude W dePamphilis.
Abstract
Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19543388 PMCID: PMC2694360 DOI: 10.1371/journal.pone.0005982
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Intron distribution in relevant taxa.
| Taxon | Subgenus | Group IIA | Group IIB | ||||||||
|
|
|
|
| 3′ |
|
|
|
|
| ||
|
| + | + | + | + | + | + | + | + | + | + | |
|
| + | + | + | − | + | + | + | + | + | + | |
|
|
| × | + | +/× | − | + | + | + | + | + | + |
|
|
| × | + | +/× | − | + | + | + | + | + | + |
|
|
| × | + | +/× | − | + | + | + | + | + | + |
|
|
| × | + | +/× | − | + | + | + | + | + | + |
|
|
| × | + | × | − | + | + | + | + | × | + |
|
|
| × | + | × | − | + | + | + | × | × | + |
|
|
| × | − | × | − | + | × | × | + | × | − |
|
|
| × | − | × | − | − | × | × | + | × | − |
|
|
| × | − | × | − | − | × | × | + | × | − |
|
|
| × | − | × | − | − | × | × | + | × | − |
|
|
| × | − | × | − | − | × | × | + | × | − |
|
|
| × | − | × | − | − | × | × | + | × | − |
|
|
| × | − | × | − | − | × | × | + | × | − |
|
| × | × | × | + | + | × | × | + | × | × | |
trnV introns in Cuscuta subg. Monogyna have deletions that may render them pseudogenes.
Intron presence or absence is shown for Nicotiana tabacum (Solanaceae), Ipomoea purpurea and Cuscuta spp. (Convolvulaceae), and Epifagus virginiana (Orobanchaceae). Nicotiana, Ipomoea, and Cuscuta are classified in the order Solanales, while Epifagus virginiana is in the closely related order Lamiales. Subgeneric taxonomic classifications are listed for Cuscuta spp. Nicotiana intron distribution is typical of most angiosperms. “+” indicates intron present, “−” indicates precise intron loss from an intact gene, and “X” indicates loss of gene (and intron) from the plastid genome. Intron data for Nicotiana, Ipomoea, Cuscuta exaltata, Cuscuta reflexa, Cuscuta gronovii, Cuscuta obtusiflora, and Epifagus were gleaned from complete genome sequences available on genbank; all other data are based on PCR and PCR sequencing assays.
Figure 1Phylogenies of Convolvulaceae (a) and Orobanchaceae (b) inferred from full and partial nucleotide sequences of matK.
Branches shown in red denote significant increases in selection within domain X of matK. Trees were produced using Maximum Likelihood with bootstrap values (100 replications) shown at the nodes (bootstrap values of 100 are denoted by asterisks). Taxonomic delimitations of Cuscuta subgenera and Convolvulaceae are boxed and labeled. Group IIA intron losses are mapped on branches where they are inferred to have occurred. The branch joining Ipomoea with Cuscuta species, denoted with a dotted line, was collapsed for analysis due to low support in other studies.
Shifting patterns of selection on matK.
| Model (parameters) | Omega ω0, ω1, ω2 | -ln (Likelihood) | Model Comparison | LRT statistic (df, p) |
|
| ||||
| 1. M0 (29) | 0.21 | 1440.306 | --- | --- |
|
|
|
|
|
|
|
|
|
|
|
|
| 4. M1a, Nearly Neutral (30) | 0.115, 1.0 | 1425.3 | --- | --- |
| 5. M2a, Pos. Sel. (32) | 0.115, 1.0, 1.0 | 1425.3 | M1a vs M2a | 0 (2,1) |
| 6. Pos. Sel BranchXSites null | 0.094, 1.0, 1.0 | 1421.596 | --- | --- |
| 7. |
|
|
|
|
| 8. | Null | 1.753 (0∶1, >0.5) | ||
|
| ||||
| 9. M0 (29) | 0.392 | 4875.979 | --- | --- |
| 10. M3 (31) | 0.202, 0.854 | 4839.185 | M0 vs M3 | 73.588 (2,<<0.001) |
| 11. Discrete BranchXSites (33) | 0.207, 0.875, 0 | 4838.633 | M3 vs Discrete bXs | 1.103 (2, 0.576) |
| 12. M1a, Nearly Neutral (30) | 0.233, 1.0 | 4839.67 | --- | --- |
| 13. M2a, Pos. Sel. (32) | 0.238, 1.0, 2.66 | 4839.206 | M1a vs M2a | 0.927 (2, 0.629) |
| 14. Pos. Sel BranchXSites null | 0.233, 1.0, 1.0 | 4839.67 | --- | --- |
| 15. Pos. Sel BranchXSites (32) | 0.233, 1.0, 1.0 | 4839.67 | M1a vs Pos. Sel. bXs | 0 (2, 1) |
| 16. | Null | 0 (0∶1, 1) | ||
|
| ||||
| 17. M0 (16) | 0.2823 | 990.106 | --- | --- |
| 18. |
|
|
|
|
| 19. |
|
|
|
|
| 20. M1a, Nearly Neutral (17) | 0.204, 1.0 | 985.919 | --- | --- |
| 21. M2a, Pos. Sel. (19) | 0.17218, 1.0, 1.0 | 985.919 | M1a vs M2a | 0 |
| 22. Pos. Sel BranchXSites null | 0.1, 1.0, 1.0 | 983.387 | --- | --- |
| 23. |
|
|
|
|
| 24. | Null | 1.097 (0∶1, >0.05) | ||
|
| ||||
| 25. M0 (16) | 0.47 | 4228.341 | --- | --- |
| 26. |
|
|
|
|
| 27. |
|
|
|
|
| 28. M1a, Nearly Neutral (17) | 0.255, 1.0 | 4204.865 | --- | --- |
| 29. M2a, Pos. Sel. (19) | 0.27, 1.0, 4.773 | 4202.478 | M1a vs M2a | 4.774 (2, 0.092) |
| 30. Pos. Sel BranchXSites null | 0.237, 1.0, 1.0 | 4203.087 | --- | --- |
| 31. |
|
|
|
|
| 32. |
|
| ||
Likelihood Ratio Tests indicate shifts in pattern of selection on Cuscuta nitida (Convolvulaceae) and Epifagus virginiana (Orobanchaceae) matK genes following loss of all but one (3′rps12 in Cuscuta nitida) or two (3′rps12 and rpl2 in Epifagus virginiana) group IIA introns from the plastid genome. Models and parameters are described in text. Models with significantly improved likelihoods relative to null hypothesis are shown in boldface. Clade and branch descriptions refer to relationships depicted in Figure 1.
Models M0, M1a, M2a, and M3 are described in text with branchXsites (bXs) models for M3 and M1a.
Null model 2 for positive selection is BranchXSites model constraining foreground ω2 to neutrality (ω2 = 1.0).
Results of Branch-sites analyses.
| Site Class | 0 | 1 | 2a | 2b | Pos. Selec. Sites |
|
| |||||
| Proportion of sites | 0.56562 | 0.22014 | 0.15421 | 0.06002 | |
| Background ω | 0.06745 | 0.62299 | 0.06745 | 0.62299 | |
|
| 0.06745 | 0.62299 | 4.58222 | 4.58222 | |
|
| |||||
| Proportion of sites | 0.62212 | 0.17038 | 0.16288 | 0.04461 | 16 E (0.981) |
| Background ω | 0.0938 | 1 | 0.0938 | 1 | |
|
| 0.0938 | 1 | 4.37149 | 4.37149 | |
|
| |||||
| Proportion of site | 0.33139 | 0.37839 | 0.1355 | 0.15472 | |
| Background ω | 0 | 0.45392 | 0 | 0.45392 | |
|
| 0 | 0.45392 | 4.07717 | 4.07717 | |
|
| |||||
| Proportion of site | 0.58225 | 0.16591 | 0.19599 | 0.05585 | None with pp>0.95 |
| Background ω | 0.11534 | 1 | 0.11534 | 1 | |
|
| 0.11534 | 1 | 4.5397 | 4.5397 | |
|
| |||||
| Proportion of site | 0.75647 | 0.18138 | 0.05013 | 0.01202 | |
| Background ω | 0.31107 | 1.38574 | 0.31107 | 1.38574 | |
|
| 0.31107 | 1.38574 | 14.88588 | 14.88588 | |
|
| |||||
| Proportion | 0.65688 | 0.27813 | 0.04566 | 0.01933 | 130 P (0.968), 177 F (0.951) |
| Background ω | 0.26096 | 1 | 0.26096 | 1 | |
|
| 0.26096 | 1 | 14.42669 | 14.42669 | |
Foreground (Cuscuta nitida or Epifagus virginiana lineages) and background omega (ω = dN/dS) parameters for Branch-sites models with significant LRT results (Table 2).
Background and foreground values of ω0, ω1 and ω2 listed in the table.
Sites implicated as evolving under positive selection in Positive Selection bXs model (Bayes empirical Bayes posterior probabilities >0.95 [25] in tests of positive selection on foreground branches (Table 2) listed as position in alignment and derived amino acid residue.
Voucher information and matK GenBank accession numbers.
| Species | Voucher # | Genbank accession |
|
| * | NC009963 |
|
| # | EU330285 |
|
| # | EU330283 |
|
| (PAC) JRM03.0808 | EU330284 |
|
| (PAC) JRM03.1101 | EU330282 |
|
| (PAC) JRM03.1210a | EU330281 |
|
| * | EU330280 |
|
| (PAC) JRM03.1103 |
|
|
| * |
|
|
| * |
|
|
| (PAC) JRM03.1001 |
|
|
| (PAC) JRM03.0207 |
|
|
| (PAC) JRM03.1203 | NC009808 |
|
| (MO) 00883399 | EU330286 |
|
| # | EU330287 |
|
| (MO) 3854462 | EU330288 |
|
| N/A |
|
|
| N/A |
|
|
| N/A |
|
|
| N/A |
|
|
| N/A |
|
|
| N/A |
|
|
| N/A |
|
|
| N/A |
|
Sequences generated outside of our group are shown in bold. Specimens that lacked enough material for herbarium voucher are denoted by an asterisk (*); photographs of the dissected flowers used for identification are available upon request. Plant material or DNA from other labs where no voucher information was provided were verified by sequence identity to existing vouchered sequences on GenBank and are marked with a #.
Primer sequences designed for this study.
| atpF-F |
|
| atpF-R |
|
| ycf3-F |
|
| ycf3-R |
|
| rrn16-endF |
|
| rrn23-R1 |
|
| clpP-1F |
|
| clpP-C562R |
|
| trnKConv-endF |
|
| matKConv-54F |
|
| matKConv-783F |
|
| matKConv-801F |
|
| matKConv-882R |
|
| matKConv-1339R |
|
| matKConv-1423R |
|
| matKConv-1450F |
|
| trnKsubgM-F1 |
|
| matKsubgM-2R |
|
| matKsubgM-3F |
|
| matKsubgM-ex3R |
|
| matKsubgM-ex4F |
|
| matKsubgM-ex5R |
|
| matKsubgM-ex6R |
|
| matKsubgC-R1 |
|
| matKsubgC-R2 |
|
matK primers designed using sequence from subgenus Monogyna are designated by the suffix subgM, ones designed using subgenus Cuscuta sequences by subgC, and ones designed with Convolvulaceae sequences by Conv.