| Literature DB >> 25644246 |
Abstract
Gene duplication provides large numbers of new genes that can lead to the evolution of new functions. Duplicated genes can diverge by changes in sequences, expression patterns, and functions. MicroRNAs play an important role in the regulation of gene expression in many eukaryotes. After duplication, two paralogs may diverge in their microRNA binding sites, which might impact their expression and function. Little is known about conservation and divergence of microRNA binding sites in duplicated genes in plants. We analyzed microRNA binding sites in duplicated genes in Arabidopsis thaliana and Brassica rapa. We found that duplicates are more often targeted by microRNAs than singletons. The vast majority of duplicated genes in A. thaliana with microRNA binding sites show divergence in those sites between paralogs. Analysis of microRNA binding sites in genes derived from the ancient whole-genome triplication in B. rapa also revealed extensive divergence. Paralog pairs with divergent microRNA binding sites show more divergence in expression patterns compared with paralog pairs with the same microRNA binding sites in Arabidopsis. Close to half of the cases of binding site divergence are caused by microRNAs that are specific to the Arabidopsis genus, indicating evolutionarily recent gain of binding sites after target gene duplication. We also show rapid evolution of microRNA binding sites in a jacalin gene family. Our analyses reveal a dynamic process of changes in microRNA binding sites after gene duplication in Arabidopsis and highlight the role of microRNA regulation in the divergence and contrasting evolutionary fates of duplicated genes.Entities:
Keywords: gene duplication; gene regulation; microRNA binding sites; microRNAs; tandem duplicates; whole-genome duplication
Mesh:
Substances:
Year: 2015 PMID: 25644246 PMCID: PMC5322543 DOI: 10.1093/gbe/evv023
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FDuplicated genes are more likely to be targeted by miRNAs than singletons. The proportions of duplicates and singletons among all miRNA targets based on binding site prediction data set (A) and experimental data set (B) are indicated. The proportions of all duplicates and singletons in the genome are shown in (C). Lighter and darker portions of the pie charts represent singletons and duplicates, respectively.
Conservation and Divergence of miRNA Binding Site Patterns in Duplicated Genes in Arabidopsis thaliana
| WGD | TD | Others | Total | |
|---|---|---|---|---|
| miRNA binding site prediction data set | ||||
| Same | 21 | 8 | 22 | 51 |
| Divergent | 211 | 65 | 231 | 507 |
| Total | 232 | 73 | 253 | 558 |
| Experimental data set | ||||
| Same | 12 | 1 | 7 | 20 |
| Divergent | 14 | 9 | 20 | 43 |
| Total | 26 | 10 | 27 | 63 |
Note.—The numbers of paralog pairs showing the same or divergent miRNA binding site patterns based on the miRNA binding site prediction data set and the experimental data set are indicated. Each category (same, divergent, and total) of miRNA binding site pattern is divided into three classes corresponding to the three types of duplicated genes, from left to right, whole-genome duplicates (WGD), tandem duplicates (TD), and other types of duplicates (others).
Conservation and Divergence of miRNA Binding Site Patterns in Whole-Genome Duplicates and Triplicates in Brassica rapa
| Duplicates | Triplicates | Total | ||||
|---|---|---|---|---|---|---|
| No. of miRNA targets | 1 | 2 | 1 | 2 | 3 | |
| Same | — | 17 | — | — | 1 | 18 |
| Divergent | 34 | 1 | 0 | 14 | 3 | 52 |
| Total | 34 | 18 | 0 | 14 | 4 | 70 |
Note.—Numbers are indicated of paralog pairs and triplicates showing the same or divergent miRNA binding site patterns based on the miRNA binding site prediction data set for Brassica rapa. Genes generated through WGT are divided into duplicates and triplicates based on how many genes are retained. “No. of targets” indicates how many genes are miRNA targets (1 or 2 for duplicates and 1, 2, or 3 for triplicates).
FExpression correlation analysis between paralog pairs with the same and divergent miRNA regulation patterns. All paralog pairs with at least one gene targeted by an miRNA are classified into two categories based on whether they show divergent miRNA regulation patterns for both miRNA binding site prediction data set (A) and experimental data set (B). The Pearson correlation coefficient between two paralogous genes is calculated based on the microarray data with 63 different organ types and developmental stages (see Materials and Methods).
FPhylogenetic analysis reveals dynamic evolution of miRNA regulation in jacalin family genes in Arabidopsis thaliana. Maximum-likelihood analysis was performed using RAxML. WAG+G+F+I was chosen as the most suitable substitution model based on the result of ProtTest before the phylogenetic reconstruction. Gene symbols with the color of green, blue, and red indicate targeting by miR842, miR846, and both miRNAs, respectively. Numbers next to the nodes correspond to bootstrap values obtained from 1,000 bootstrap replicates. Only the nodes with bootstrap values greater than or equal to 50 are shown in the tree.