| Literature DB >> 35456513 |
Monique Romeiro-Brito1, Milena Cardoso Telhe1, Danilo Trabuco Amaral1,2, Fernando Faria Franco1, Evandro Marsola Moraes1.
Abstract
The molecular phylogenies of Cactaceae have enabled us to better understand their systematics, biogeography, and diversification ages. However, most of the phylogenetic relationships within Cactaceae major groups remain unclear, largely due to the lack of an appropriate set of molecular markers to resolve its contentious relationships. Here, we explored the genome and transcriptome assemblies available for Cactaceae and identified putative orthologous regions shared among lineages of the subfamily Cactoideae. Then we developed a probe set, named Cactaceae591, targeting both coding and noncoding nuclear regions for representatives from the subfamilies Pereskioideae, Opuntioideae, and Cactoideae. We also sampled inter- and intraspecific variation to evaluate the potential of this panel to be used in phylogeographic studies. We retrieved on average of 547 orthologous regions per sample. Targeting noncoding nuclear regions showed to be crucial to resolving inter- and intraspecific relationships. Cactaceae591 covers 13 orthologous genes shared with the Angiosperms353 kit and two plastid regions largely used in Cactaceae studies, enabling the phylogenies generated by our panel to be integrated with angiosperm and Cactaceae phylogenies, using these sequences. We highlighted the importance of using coalescent-based species tree approaches on the Cactaceae591 dataset to infer accurate phylogenetic trees in the presence of extensive incomplete lineage sorting in this family.Entities:
Keywords: cacti; coalescent inference; phylogenomics; rapid radiation; target capture sequencing
Mesh:
Year: 2022 PMID: 35456513 PMCID: PMC9032687 DOI: 10.3390/genes13040707
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Summary statistics of nucleotide variation calculated for coding, noncoding, and anonymous regions in the Cactaceae and Cereus datasets. Note: bp, base pairs; S, variable sites; PIS (%), parsimony informative sites (proportion of PIS).
| Dataset | Alignment Length (pb) | % Missing Data | S | PIS (%) |
|---|---|---|---|---|
| Cactaceae | ||||
| Exons | 258,375 | 12.83 | 100,715 | 54,156 (0.211) |
| Introns | 309,041 | 23.28 | 149,244 | 78,597 (0.254) |
| anonymous regions | 154,174 | 64.45 | 44,159 | 22,416 (0.145) |
| Cactaceae-coding | 258,375 | 12.83 | 100,715 | 54,156 (0.211) |
| Cactaceae-all | 721,59 | 33.52 | 294,118 | 155,169 (0.215) |
| Exons | 303,688 | 12.67 | 24,729 | 10,278 (0.034) |
| Introns | 424,012 | 5.49 | 42,732 | 18,454 (0.044) |
| anonymous regions | 136,613 | 25.02 | 11,036 | 5148 (0.038) |
| Cereus-noncoding | 560,025 | 15.26 | 53,768 | 23,603 (0.042) |
| Cereus-all | 864,313 | 14.39 | 78,497 | 33,880 (0.039) |
Figure 1Comparison of coalescent-based species trees of Cactaceae estimated with Cactaceae-coding (A) and Cactaceae-all (B) sub-datasets. Cactaceae-coding and Cactaceae-all sub-datasets included 290 and 512 nuclear regions, respectively. The branches with high support (LPP > 0.95) are colored with thick dark lines, the branches with moderate support (LPP 0.95–0.70) are colored with thick gray lines, and the branches with low support (LPP < 0.70) are shown as thin light lines. Voucher number of each accession is provided in parenthesis after species names. Species from the subtribes Rebutinae, Trichocereinae, and Cereinae are colored red, green, and blue, respectively (sensu Nyffeler and Eggli [48]). LPP: local posterior probability.
Figure 2Comparison of phylogenies of Cereus clade A inferred with Cereus-noncoding (A) and Cereus-all (B) sub-datasets, using coalescent-based (upper) and ML concatenated (down) approaches. The branches with high support (LPP > 0.95 and BS > 95) are highlighted with thick branch lines, the branches with moderate support (LPP 0.95–0.70 and BS 95–70) are colored with thick gray lines, and the branches with low support (LPP < 0.70 and BS < 70) are shown as thin light lines. LPP, local posterior probability; BS, ultrafast bootstrap. Voucher number of each accession is provided in parenthesis after species names.
Node support in the phylogenetic trees inferred from the Cactaceae and Cereus datasets, using different sub-datasets (coding, noncoding, and all sequences) and phylogenetic approaches. Branch support values of coalescent (LPP) and concatenated (BS) trees were classified as high (>0.95, >95), moderate (0.95–0.70, 95–70), or low (<0.70, <70), respectively.
| Dataset | Total | High Support | Moderate Support | Low Support |
|---|---|---|---|---|
| Cactaceae-coding | ||||
| concatenated | 57 | 51 | 5 | 1 |
| coalescent-based | 57 | 48 | 7 | 2 |
| Cactaceae-all | ||||
| concatenated | 57 | 52 | 3 | 2 |
| coalescent-based | 57 | 53 | 0 | 4 |
| Cereus-noncoding | ||||
| concatenated | 26 | 19 | 4 | 3 |
| coalescent-based | 26 | 19 | 4 | 3 |
| Cereus-all | ||||
| concatenated | 26 | 19 | 5 | 2 |
| coalescent-based | 26 | 21 | 3 | 2 |