| Literature DB >> 35456376 |
Delil A Chincoya1,2, Sofía Solórzano1.
Abstract
In angiosperms, huge advances in massive DNA sequencing technologies have impacted phylogenetic studies. Probe sets have been developed with the purpose of recovering hundreds of orthologous loci of targeted DNA sequences (TDS) across different plant lineages. We tested in silico the effectiveness of two universal probe sets in the whole available genomes of Caryophyllids, emphasizing phylogenetic issues in cacti species. A total of 870 TDS (517 TDS from Angiosperm v.1 and 353 from Angiosperms353) were individually tested in nine cacti species and Amaranthus hypochondriacus (external group) with ≥17 Gbp of available DNA data. The effectiveness was measured by the total number of orthologous loci recovered and their length, the percentage of loci discarded by paralogy, and the proportion of informative sites (PIS) in the alignments. The results showed that, on average, Angiosperms353 was better than Angiosperm v.1 for cacti species, since the former obtained an average of 275.6 loci that represent 123,687 bp, 2.48% of paralogous loci, and 4.32% of PIS in alignments, whereas the latter recovered 148.4 loci (37,683 bp), 10.38% of paralogous loci, and 3.49% of PIS. We recommend the use of predesigned universal probe sets for Caryophyllids, since these recover a high number of orthologous loci that resolve phylogenetic relationships.Entities:
Keywords: cactaceae; caryophyllids; phylogenomics; target enrichment; universal probe sets
Mesh:
Year: 2022 PMID: 35456376 PMCID: PMC9032312 DOI: 10.3390/genes13040570
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Heatmap per species is presented by raw data. To the left of the species names are the results obtained with Angiosperm v.1, and to the right, the results obtained with Angiosperms353. The gaps (white color) are caused by the absence of vertical lines, which indicates that the targeted DNA sequences did not find those loci in the genome tested.
Comparison of the four parameters used to estimate the effectiveness between the two probe sets in silico tested. The results outside parentheses correspond to results obtained with Angiosperm v.1, and those inside parentheses with Angiosperms353. Total number of loci recovered with each probe set; number of loci that found a proportion ≥50% of the length of the targeted DNA sequence; the total length (bp) of sequences identified as exons and the quotient calculated; total length of exons from Angiosperms353/length of exons from Angiosperm v.1; and the absolute number of paralogous loci identified per species.
| Species | Number of Loci Recovered | Number of Loci with >50% Sequence Length | Total Length of Exons (bp): Quotient | Number of Paralogous Loci |
|---|---|---|---|---|
|
| 264 (316) | 256 (167) | 68,205 (65,658):0.96 | 24 (8) |
|
| 129 (270) | 123 (165) | 32,133 (124,773):3.88 | 12 (8) |
|
| 132 (269) | 125 (145) | 30,822 (116,277):3.77 | 13 (1) |
|
| 137 (287) | 135 (194) | 35,127 (140,259):3.99 | 18 (6) |
|
| 177 (319) | 168 (245) | 43,875 (174,285):3.97 | 36 (21) |
|
| 155 (292) | 149 (204) | 40,611 (150,720):3.71 | 18 (8) |
|
| 123 (253) | 121 (158) | 31,683 (122,844):3.88 | 6 (4) |
|
| 68 (173) | 67 (69) | 18,885 (65,658):3.48 | 2 (2) |
|
| 154 (287) | 149 (170) | 37,866 (133,071):3.51 | 16 (6) |
|
| 145 (290) | 143 (204) | 37,656 (143,322):3.81 | 16 (8) |
Figure 2Total number of alignments obtained with Angiosperms353 and Angiosperm v.1 per species.
Figure 3Phylogenetic trees obtained with ML for the nine studied cacti species; the prince’s feather (A. hypochondriacus) was the outgroup. The tree constructed with data from Angiosperm v.1 was based on 71 loci (20,082 bp), and that constructed with data obtained from Angiosperms353 was based on 262 loci (186,973 bp). The numbers to the side of the branches correspond to UFBoot (above slash) and gCF (below slash) values. The species grouped in the subfamily Cactoideae are indicated by the letter C.