| Literature DB >> 32708269 |
Delil A Chincoya1, Alejandro Sanchez-Flores2, Karel Estrada2, Clara E Díaz-Velásquez3, Antonio González-Rodríguez4, Felipe Vaca-Paniagua3,5, Patricia Dávila6, Salvador Arias7, Sofía Solórzano1.
Abstract
In plants, partial DNA sequences of chloroplasts have been widely used in evolutionary studies. However, the Cactaceae family (1500-1800 species) lacks molecular markers that allow a phylogenetic resolution between species and genera. In order to identify sequences with high variation levels, we compared previously reported complete chloroplast genomes of seven species of Mammillaria. We identified repeated sequences (RSs) and two types of DNA variation: short sequence repeats (SSRs) and divergent homologous loci. The species with the highest number of RSs was M. solisioides (256), whereas M. pectinifera contained the highest amount of SSRs (84). In contrast, M. zephyranthoides contained the lowest number (35) of both RSs and SSRs. In addition, five of the SSRs were found in the seven species, but only three of them showed variation. A total of 180 homologous loci were identified among the seven species. Out of these, 20 loci showed a molecular variation of 5% to 31%, and 12 had a length within the range of 150 to 1000 bp. We conclude that the high levels of variation at the reported loci represent valuable knowledge that may help to resolve phylogenetic relationships and that may potentially be convenient as molecular markers for population genetics and phylogeographic studies.Entities:
Keywords: Mammillaria; SSRs; locus; molecular variation; non-coding regions; repeated sequences
Year: 2020 PMID: 32708269 PMCID: PMC7397273 DOI: 10.3390/genes11070830
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Number of repeated sequences (RSs) by type (complementary, palindromic, reverse and forward) identified in the seven species of Mammillaria.
Figure 2Number of repeated sequences identified in seven Mammillaria species. The color of the bars indicates the number of RSs by length in bp.
Type and number of Short Sequence Repeats (SSRs) in the complete chloroplast genomes of the seven analyzed species.
| Type of SSRs |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 1. Homopolymer | |||||||
| A/T | 56 | 60 | 57 | 69 | 65 | 60 | 31 |
| C/G | 1 | 1 | 1 | 2 | 2 | 1 | 0 |
| 2. Dinucleotide | |||||||
| AT/AT | 5 | 6 | 7 | 9 | 9 | 6 | 3 |
| 3. Trinucleotide | |||||||
| AAG/CTT | 2 | 1 | 0 | 3 | 2 | 1 | 0 |
| AGG/CCT | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| AAT/ATT | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| 4. Tetranucleotide | |||||||
| AAAT/ATTT | 0 | 2 | 0 | 0 | 0 | 2 | 0 |
| 5. Pentanucleotide | |||||||
| AATAT/ATATT | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| 6. Hexanucleotide | |||||||
| ACGAGG/CCTCGT | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| Total number of SSRs | |||||||
| 65 | 70 | 65 | 84 | 80 | 70 | 35 | |
Figure 3Total number of Short Sequence Repeats (SSRs) identified in the cpDNA of seven species of Mammillaria. The color of the bars corresponds to Inverted Repeats (red), Large Single Copy (yellow) and Small Single Copy (green).
Figure 4Distribution of the percentage of molecular variation across 180 homologous loci. Average percentage variation of the (A) coding and (B) non-coding homologous loci in the seven chloroplast genomes of Mammillaria. The 20 loci with the highest molecular variation are indicated with asterisks (✴).
Distribution of the 180 homologous loci identified in the seven Mammillaria species. The number of loci in each structural region of the chloroplast genome is shown.
| Species | Number of Loci | ||
|---|---|---|---|
| LSC Region | SSC Region | IRs Region | |
| Arrangement 1 ( | 144 | 132 | 142 |
| Arrangement 2 ( | 34 | 34 | 17 |
| Arrangement 3 ( | 2 | 14 | 21 |
Percentage of variation and length of the 20 most variable chloroplast loci recorded in the seven Mammillaria species. The type of locus (gene, pseudogene or IGS) and the structural region of the genome where it is located are shown. SSC/IRs indicates that the locus is in the SSC region in all the species, except M. zephyranthoides, which has it in IRs. * indicates the 14 loci that were used to obtain the phylogenetic relationships of the seven studied species. The last column represents the average length (bp) in the seven species.
| Locus Name | Locus Type | Location | Variation Percentage (%) | Average Length (bp) |
|---|---|---|---|---|
| 1. | IGS | LSC | 31.37 | 2176 |
| 2. | IGS | SSC | 18.56 | 721 |
| 3. | IGS | LSC | 11.52 | 103 |
| 4. | IGS | LSC | 10.64 | 325 |
| 5. | IGS | SSC | 10.63 | 455 |
| 6. | IGS | LSC | 10.34 | 89 |
| 7. | Gene | LSC | 10.26 | 102 |
| 8. | IGS | LSC | 9.82 | 399 |
| 9. | Pseudogene | LSC | 8.99 | 1574 |
| 10. | IGS | SSC/IRs | 8.74 | 360 |
| 11. | IGS | LSC | 8.68 | 619 |
| 12. | Gene | LSC | 7.61 | 616 |
| 13. | IGS | SSC/IRs | 7.34 | 354 |
| 14. | IGS | SSC/IRs | 7.02 | 109 |
| 15. | IGS | SSC | 6.90 | 146 |
| 16. | IGS | LSC | 6.21 | 227 |
| 17. | IGS | LSC | 5.87 | 67 |
| 18. | IGS | LSC | 5.82 | 753 |
| 19. | IGS | SSC | 5.34 | 256 |
| 20. | IGS | LSC | 5.14 | 277 |
Figure 5Phylogenetic ML tree for the seven studied Mammillaria species based on 14 loci of high molecular variation (marked with an asterisk in Table 3), with Portulaca oleracea and Carnegiea gigantea used as outgroups. The branch length is shown above the branches, whereas bootstrap values are displayed below the branches.