| Literature DB >> 22336713 |
Antonio Marco1, Katarzyna Hooks, Sam Griffiths-Jones.
Abstract
MicroRNAs are essential post-transcriptional regulators. Many animal microRNAs are clustered in the genome, and it has been shown that clustered microRNAs may be transcribed as a single transcript. Polycistronic microRNAs are often members of the same family, suggesting a role of tandem duplication in the emergence of clusters. The mir-2 microRNA family is the largest in Drosophila melanogaster, with 8 members that are mostly clustered in the genome. Previous studies suggest that the copy number and genomic distribution of mir-2 family members has been subject to significant change during evolution. The effects of such changes on their function are still unknown. Here we study the evolution of function in the mir-2 family. Our analyses show that, in spite of the change in number and organization among invertebrates, most mir-2 loci produce very similar mature microRNA products. Multiple mature miR-2 sequences are predicted to target genes involved in neural development in Drosophila. These targeting properties are conserved in the distant species Caenorhabditis elegans. Duplication followed by functional diversification is frequent during protein-coding gene evolution. However, our results suggest that the production of microRNA clusters by gene duplication rarely involves functional changes. This pattern of functional redundancy among clustered paralogous microRNAs reflects birth-and-death evolutionary dynamics. However, we identified a small number of mir-2 sequences in Drosophila that may have undergone functional shifts associated with genomic rearrangements. Therefore, redundancy in microRNA families may facilitate the acquisition of novel functional features.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22336713 PMCID: PMC3384581 DOI: 10.4161/rna.19160
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652

Figure 1. Sequence conservation in the mir-2 family. (A) The alignment of mir-2 precursor sequences in representative genomes, shadowed by sequence conservation (visualized using Ralee), where darker tones reflect higher conservation. Structure of the consensus sequence is shown below the alignment in dot-bracket annotation. The open white box over the alignment indicates the canonical mature product, with the seed sequence highlighted (black). (B) Consensus structure of the mir-2 precursor in invertebrates, colored with VARNA according to sequence conservation. The canonical and non-canonical mature products produced by some mir-2 precursors are also indicated.

Figure 2. Copy distribution of mir-2 sequences. Phylogenetic tree of invertebrate species and genomic organization of mir-2 sequences. Divergence times were extracted from ref. 47. Black arrows depict mir-2 family members, and white arrow mir-71 sequences. Arrows linked by the same straight line indicate microRNAs linked in the genome by less than 10 kb.
Table 1. Top 20 enriched GO terms in the developmental process category
| Species | Enriched GO term | # genes | q-value |
|---|---|---|---|
| Drosophila | multicellular organismal development | 121 | 0.0000 |
| | nervous system development | 67 | 0.0000 |
| | central nervous system development | 25 | 0.0000 |
| | sensory organ development | 37 | 0.0000 |
| | anatomical structure morphogenesis | 91 | 0.0000 |
| | organ morphogenesis | 48 | 0.0000 |
| | neurogenesis | 54 | 0.0000 |
| | cell differentiation | 84 | 0.0000 |
| | neuron differentiation | 46 | 0.0000 |
| | developmental process | 129 | 0.0000 |
| | cell fate commitment | 31 | 0.0000 |
| | organ development | 79 | 0.0000 |
| | generation of neurons | 53 | 0.0000 |
| | system development | 106 | 0.0000 |
| | anatomical structure development | 123 | 0.0000 |
| | cellular developmental process | 85 | 0.0000 |
| | brain development | 16 | 0.0011 |
| | eye development | 30 | 0.0022 |
| | neuron development | 38 | 0.0023 |
| | regionalization | 37 | 0.0023 |
| cellular component morphogenesis | 38 | 0.0047 | |
| | anatomical structure morphogenesis | 112 | 0.0062 |
| | neurogenesis | 22 | 0.0164 |
| | generation of neurons | 22 | 0.0164 |
| | neuron development | 20 | 0.0165 |
| | cell morphogenesis | 22 | 0.0167 |
| | neuron differentiation | 21 | 0.0167 |
| | muscle structure development | 23 | 0.0187 |
| | muscle organ development | 6 | 0.0191 |
| | nervous system development | 22 | 0.0204 |
| | neuron projection morphogenesis | 18 | 0.0209 |
| | organ morphogenesis | 10 | 0.0226 |
| | axonal fasciculation | 11 | 0.0232 |
| | neuron projection development | 18 | 0.0233 |
| | anatomical structure formation involved in morphogenesis | 25 | 0.0238 |
| | cell projection morphogenesis | 19 | 0.0245 |
| | syncytium formation by plasma membrane fusion | 3 | 0.0334 |
| | syncytium formation | 3 | 0.0334 |
| | cell part morphogenesis | 19 | 0.0347 |
| neuron recognition | 11 | 0.0356 |
1Number of genes with predicted canonical seed targets (see Methods) annotated to a GO term; 2q-value is the p-value corrected for a false discovery rate of 0.05 (ref. 46)