| Literature DB >> 35205394 |
Serena Acha1, Lucas C Majure1.
Abstract
Relationships within the major clades of Cactaceae are relatively well known based on DNA sequence data mostly from the chloroplast genome. Nevertheless, some nodes along the backbone of the phylogeny, and especially generic and species-level relationships, remain poorly resolved and are in need of more informative genetic markers. In this study, we propose a new approach to solve the relationships within Cactaceae, applying a targeted sequence capture pipeline. We designed a custom probe set for Cactaceae using MarkerMiner and complemented it with the Angiosperms353 probe set. We then tested both probe sets against 36 different transcriptomes using Hybpiper preferentially retaining phylogenetically informative loci and reconstructed the relationships using RAxML-NG and Astral. Finally, we tested each probe set through sequencing 96 accessions, representing 88 species across Cactaceae. Our preliminary analyses recovered a well-supported phylogeny across Cactaceae with a near identical topology among major clade relationships as that recovered with plastome data. As expected, however, we found incongruences in relationships when comparing our nuclear probe set results to plastome datasets, especially at the generic level. Our results reveal great potential for the combination of Cactaceae-specific and Angiosperm353 probe set application to improve phylogenetic resolution for Cactaceae and for other studies.Entities:
Keywords: Hyb-seq; Opuntia; cacti; neotropics; target enrichment
Mesh:
Year: 2022 PMID: 35205394 PMCID: PMC8871817 DOI: 10.3390/genes13020350
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Transcriptomes used for phylogenetic reconstruction in this study.
| Species | Clade | NCBI SRA # |
|---|---|---|
|
| Outgroup | SRR6435355 (Bioproject: PRJNA428216) |
|
| Cactoideae | SRR7905834 |
|
| Cactoideae | SRR7905836 |
|
| Cactoideae | SRR5036296 |
|
| Cactoideae | SRR7905838 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905839 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR1698109 (Bioproject: PRJNA269655) |
|
| Cactoideae | SRR7905840 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905831 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905830 (Bioproject: PRJNA493215) |
|
| Opuntioideae | SRR7905852 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905853 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR11603181 |
|
| SRR1698112 | |
|
| SRR7905854 (Bioproject: PRJNA493215) | |
|
|
| SRR7905849 (Bioproject: PRJNA493215) |
|
| Opuntioideae | SRR7905848 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905855 (Bioproject: PRJNA493215) |
|
| Opuntioideae | SRR7905850 (Bioproject: PRJNA493215) |
|
| Opuntioideae | SRR1698108 |
|
| Opuntioideae | SRR3567682 |
|
| Opuntioideae | SRR3478181 |
|
| Cactoideae | SRR7905847 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905861 (Bioproject: PRJNA493215) |
|
| SRR1698106 | |
| Outgroup | SRR10247085 | |
| Outgroup | SRR10247116 | |
|
| Opuntioideae | SRR7905860 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905851 (Bioproject: PRJNA493215) |
|
| Opuntioideae | SRR7905862 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905856 (Bioproject: PRJNA493215) |
|
| Cactoideae | SRR7905865(Bioproject: PRJNA493215) |
|
| Opuntioideae | SRR7905864 (Bioproject: PRJNA493215) |
|
| Outgroup | SRR6435354 (Bioproject: PRJNA428216) |
|
| Opuntioideae | SRR7905863 (Bioproject: PRJNA493215) |
|
| Opuntioideae | SRR7905866 (Bioproject: PRJNA493215) |
Figure 1Flowchart of the pipeline with general steps and tools. For more details see the Methods Section.
Figure 2Heatmap plot showing recovery efficiency. Columns represent the targeted loci grouped based on probe sets and rows represent the 36 transcriptomes grouped based on the main groups in Cactaceae (OG: outgroup). (a) Length proportion of the target sequences recovered. (b) Size of the target gene sequences recovered.
Comparison of loci recovered per probe set and main group in Cactaceae. Numbers in white cells represent the number of loci shared between main groups in Angiosperms353 (above the diagonal line) and Cactaceae120 (below the diagonal line). The gray diagonal cells correspond to the number of loci recovered per probe set in each of the main groups.
| Angiosperms353 | ||||||
|---|---|---|---|---|---|---|
| Cactoideae |
| Opuntioideae | Outgroup | |||
| Cactaceae120 | Cactoideae | 303 | 272 | 252 | 288 | 288 |
| 109 | 273 | 246 | 267 | 269 | ||
|
| 92 | 91 | 254 | 249 | 253 | |
| Opuntioideae | 119 | 109 | 92 | 295 | 281 | |
| Outgroup | 117 | 108 | 92 | 116 | 292 | |
Figure 3RAxML best-scoring ML tree of the concatenated database of the two probe sets. All nodes have ≥95% support value bootstrap unless noted. OG: Outgroups; Teph.: Tephrocacteae; C. Cact.: Core Cactoideae.
Figure 4ASTRAL species tree with pie charts on the nodes representing gene tree topology proportions: blue represents concordance with the shown topology, green is the top alternative bipartition; red is all the other alternative bipartitions; and grey is no support for conflicting bipartition. Branch values represent concordance (top) and conflicting (bottom) gene trees. Asterisks mark all nodes with a posterior probability of ≤90 and scale bars represent coalescent units. (a) C120 and A353 probe sets combined. OG: Outgroups; Teph.: Tephrocacteae; C. Cact.: Core Cactoideae. (b) C120 probe set. (c) A353 probe set.