| Literature DB >> 29788279 |
James E Tarver1, Richard S Taylor1, Mark N Puttick1,2, Graeme T Lloyd3, Walker Pett4, Bastian Fromm5, Bettina E Schirrmeister1, Davide Pisani1, Kevin J Peterson6, Philip C J Donoghue1.
Abstract
microRNAs are conserved noncoding regulatory factors implicated in diverse physiological and developmental processes in multicellular organisms, as causal macroevolutionary agents and for phylogeny inference. However, the conservation and phylogenetic utility of microRNAs has been questioned on evidence of pervasive loss. Here, we show that apparent widespread losses are, largely, an artefact of poorly sampled and annotated microRNAomes. Using a curated data set of animal microRNAomes, we reject the view that miRNA families are never lost, but they are rarely lost (92% are never lost). A small number of families account for a majority of losses (1.7% of families account for >45% losses), and losses are associated with lineages exhibiting phenotypic simplification. Phylogenetic analyses based on the presence/absence of microRNA families among animal lineages, and based on microRNA sequences among Osteichthyes, demonstrate the power of these small data sets in phylogenetic inference. Perceptions of widespread evolutionary loss of microRNA families are due to the uncritical use of public archives corrupted by spurious microRNA annotations, and failure to discriminate false absences that occur because of incomplete microRNAome annotation.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29788279 PMCID: PMC6007596 DOI: 10.1093/gbe/evy096
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Cumulative numbers of gains and losses of miRNA families in the lineages leading to the two most completely sampled taxa (Homo and Drosophila) over the past 720 Myr based on our curated data set.
Sum Total of miRNA Family Gains and Losses Along the Path from the Root of the Phylogenetic Tree to Each Tip Based on our Curated Data Set
| Node | Lineage | Gains | Losses | Gain:Loss Ratio | Percentage Loss |
|---|---|---|---|---|---|
| 37 | 8 | 0 | N/A | N/A | |
| 38 | 28 | 0 | N/A | N/A | |
| 40 | 68 | 0 | N/A | N/A | |
| 39 | 61 | 2 | 30.50 | 3.28 | |
| 47 | 77 | 4 | 19.25 | 5.19 | |
| 46 | 57 | 3 | 19.00 | 5.26 | |
| 54 | 56 | 3 | 18.67 | 5.36 | |
| 52 | 54 | 3 | 18.00 | 5.56 | |
| 56 | 99 | 6 | 16.50 | 6.06 | |
| 60 | 130 | 8 | 16.25 | 6.15 | |
| 63 | 154 | 10 | 15.40 | 6.49 | |
| 48 | 88 | 6 | 14.67 | 6.82 | |
| 61 | 128 | 9 | 14.22 | 7.03 | |
| 70 | 301 | 22 | 13.68 | 7.31 | |
| 51 | 133 | 10 | 13.30 | 7.52 | |
| 53 | 46 | 4 | 11.50 | 8.70 | |
| 62 | 146 | 13 | 11.23 | 8.90 | |
| 71 | 235 | 21 | 11.19 | 8.94 | |
| 59 | 124 | 12 | 10.33 | 9.68 | |
| 67 | 219 | 22 | 9.95 | 10.05 | |
| 50 | 99 | 10 | 9.90 | 10.10 | |
| 44 | 105 | 11 | 9.55 | 10.48 | |
| 58 | 113 | 12 | 9.42 | 10.62 | |
| 57 | 115 | 13 | 8.85 | 11.30 | |
| 49 | 88 | 10 | 8.80 | 11.36 | |
| 66 | 198 | 24 | 8.25 | 12.12 | |
| 69 | 259 | 33 | 7.85 | 12.74 | |
| 65 | 143 | 19 | 7.53 | 13.29 | |
| 41 | 58 | 9 | 6.44 | 15.52 | |
| 64 | 129 | 21 | 6.14 | 16.28 | |
| 68 | 221 | 36 | 6.14 | 16.29 | |
| 43 | 140 | 25 | 5.60 | 17.86 | |
| 42 | 64 | 14 | 4.57 | 21.88 | |
| 45 | 56 | 13 | 4.31 | 23.21 | |
| 55 | 40 | 14 | 2.86 | 35.00 |
Note.—The gain to loss ratio, and percentage loss, for each individual lineage are also shown. Thus, the three gains and two losses observed on the lineage leading to chordates are recorded in all descendent taxa.
. 2.—The observed number of losses of miRNA families inferred from our phylogenetic analysis of the curated data set.
. 3.—The gains and losses of miRNA families for all taxa with well-annotated miRNAomes, (A) per branch within the phylogenetic tree, (B) per unit time across the phylogenetic tree. miRNA gains showed an increased rate toward the Recent, compatible with some theories which posit an early transient phase in which miRNA gain and loss is rapid (Peterson et al. 2009), making early losses difficult to detect.
. 4.—Phylogenetic tree derived from the Dollo analysis of the curated presence/absence of microRNA families considered in our analysis. (A) Shows results whereby all miRNAs were included and (B) where singletons (synapomorphic) miRNAs were excluded. Node values are clade posterior probabilities.
. 5.—Phylogenetic tree derived from the Dollo analysis of presence/absence of microRNA families in our uncurated miRBase data set. (A) Shows results whereby all miRNAs were included and (B) where singletons (synapomorphic) miRNAs were excluded. Node values are clade posterior probabilities.
. 6.—Phylogenetic tree derived from phylogenetic analysis of the concatenated analysis of pri-miRNA sequences. Node values are clade posterior probabilities.