| Literature DB >> 24009676 |
Jun Wang1, Nicholas C Marowsky, Chuanzhu Fan.
Abstract
Gene duplication is an important mechanism for the origination of functional novelties in organisms. We performed a comparative genome analysis to systematically estimate recent lineage specific gene duplication events in Arabidopsis thaliana and further investigate whether and how these new duplicate genes (NDGs) play a functional role in the evolution and adaption of A. thaliana. We accomplished this using syntenic relationship among four closely related species, A. thaliana, A. lyrata, Capsella rubella and Brassica rapa. We identified 100 NDGs, showing clear origination patterns, whose parental genes are located in syntenic regions and/or have clear orthologs in at least one of three outgroup species. All 100 NDGs were transcribed and under functional constraints, while 24% of the NDGs have differential expression patterns compared to their parental genes. We explored the underlying evolutionary forces of these paralogous pairs through conducting neutrality tests with sequence divergence and polymorphism data. Evolution of about 15% of NDGs appeared to be driven by natural selection. Moreover, we found that 3 NDGs not only altered their expression patterns when compared with parental genes, but also evolved under positive selection. We investigated the underlying mechanisms driving the differential expression of NDGs and their parents, and found a number of NDGs had different cis-elements and methylation patterns from their parental genes. Overall, we demonstrated that NDGs acquired divergent cis-elements and methylation patterns and may experience sub-functionalization or neo-functionalization influencing the evolution and adaption of A. thaliana.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24009676 PMCID: PMC3756979 DOI: 10.1371/journal.pone.0072362
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The phylogeny and divergent time among four species.
Lineage specific new gene ‘B’ is identified using comparative genomics and syntenic relationship among four genomes.
The proportions of NDGs and parental genes with different Ka/Ks.
| Ka and Ks = 0 | Ka/Ks >1 | Ka/Ks <1 | |
| number of paralogs | 44 | 18 | 75(50 |
| percentage | 32.12% | 13.14% | 54.74%(36.50% |
Ka/Ks significantly less than 1 by LRT test.
The average values of π and θ for all the sites, synonymous and non-synonymous sites of NDGs and parental genes.
| Average value | πa | πs | πn | θa | θs | θn |
| NDG | 0.0054 | 0.0069 | 0.0039 | 0.0082 | 0.0101 | 0.0063 |
| Parental gene | 0.0049 | 0.0071 | 0.0029 | 0.0071 | 0.0094 | 0.0047 |
πa and θa for all sites; πs and θs for synonymous sites; πn and θn for non-synonymous sites.
The number of NDGs showing selective signatures under population genetic tests.
| Tajima’s D | Fu and Li’s F | Fu and Li’s D | MK test | # of gene |
| + | + | + | + | 0 |
| + | + | + | − | 1 |
| + | + | − | + | 1 |
| + | − | + | + | 0 |
| − | + | + | + | 1 |
| + | + | − | − | 1 |
| + | − | + | − | 0 |
| + | − | − | + | 0 |
| − | + | + | − | 1 |
| − | + | − | + | 0 |
| − | − | + | + | 0 |
| + | − | − | − | 0 |
| − | + | − | − | 1 |
| − | − | + | − | 0 |
| − | − | − | + | 4 |
“+” yes; “−” no.
The 24 paralog pairs having differential expression pattern between NDGs and parental genes.
| NDG | Parental gene | Ka | NDG enriched tissue | Parental gene enriched tissue | Data source |
|
|
| 0 | Leave | Non specific | MPSS |
|
|
| 0.1411 | Silique | Inflorescence | MPSS |
|
|
| 0.1369 | Non specific | Root | MPSS |
|
|
| 0.0549 | Non specific | Root | MPSS |
|
|
| 0.0019 | Root | Inflorescence | MPSS |
|
|
| 0.0029 | Inflorescence | Seedlings | MPSS |
|
|
| 0.0482 | Inflorescence | Silique | MPSS |
|
|
| 0.0015 | Callus | Callus and root | MPSS |
|
|
| 0.0363 | Non specific | Callus | MPSS |
|
|
| 0.0572 | Root-specific | Non specific | Tiling array |
|
|
| 0.079 | Root-specific | Non specific | Tiling array |
|
|
| 0.0978 | Non specific | Expanding-leave specific/seedling | Tiling array/RNA-seq |
|
|
| 0.0792 | Non specific | Seedling specific | Tiling array |
|
|
| 0.0138 | Root-specific | Non specific | Tiling array |
|
|
| 0.1353 | Senescing-leave specific | Non specific | Tiling array |
|
|
| 0.0582 | Non specific | Bud | Unigene |
|
|
| 0.0098 | Cell culture | Root | Unigene |
|
|
| 0.1104 | Root | Non specific | Unigene |
|
|
| 0.0337 | Non specific | Flower | Unigene |
|
|
| 0 | Non specific | Silique | Unigene |
|
|
| 0 | Flower/Callus | Non specific | Unigene/MPSS |
|
|
| 0.0244 | Vegetative/Inflorescence | Non specific/callus | Unigene/MPSS |
|
|
| 0.0513 | Non-specific/seedling | Root/callus | Unigene/MPSS |
|
|
| 0.2633 | Seedling | Non specific | RNA-seq |