| Literature DB >> 20966100 |
Annette M Evangelisti1, Gavin C Conant.
Abstract
By comparing the patterns of evolution in the coding and upstream noncoding regions of yeast ribosomal protein (RP) genes duplicated in a genome duplication, we find that although nonsynonymous sites in the coding sequences show strong evidence for the fixation of recent gene conversion events, similar patterns are less evident among the synonymous positions and noncoding regulatory elements. This result suggests a potential explanation for the somewhat puzzling fact that duplicated RP genes are not functionally redundant despite their very high protein sequence identity. An analysis of the patterns of regulatory network evolution after genome duplication also indicates that the duplicated proteins have diverged considerably in expression despite their similar protein sequences.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20966100 PMCID: PMC2988524 DOI: 10.1093/gbe/evq067
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FIllustration of the pattern of genome evolution after WGD in five yeast species in a region surrounding a pair of duplicated RP genes (RPL26A and RPL26B). The upper five tracks and the lower five tracks are inferred to be two orthologous groups. Lines connect genes that are adjacent on their respective contigs or chromosomes. Duplicate genes surviving from WGD are colored blue, green genes are cases where one member of the duplicate pair has been lost post-WGD. The orthology assignments between the paired Saccharomyces cerevisiae and S. bayanus genes on the upper and lower tracks are all inferred with greater than 99.99% confidence.
FAnalysis of duplicated Saccharomyces cerevisiae genes and a S. bayanus ortholog. (A) The format of our triplet-based sequence analysis. Because the models used are time reversible, only a single, three taxa tree is required. Independent estimates of K are made for each branch. (B) The expected pattern of branch lengths for the tree in A if the genes follow the known species tree. Note that we expect Ka2 to be large as it represents both the divergence of the gene Scer2 as well as the shared divergence of Sbay and Scer1 post-WGD. (C) The expected gene tree if Scer1 and Scer2 have undergone recent gene conversion events. Here, we expect KaB to be the largest of the three K values, under the same reasoning as in B.
Prevalence of Gene Conversion in Coding Regions and Noncoding Regions
| Coding Regions ( | Upstream Regions | ||||
| Gene Class | Gene Conversion | WGD | Gene Conversion | WGD | |
| RP | 12 | 3 | 1 | 14 | 0.0002 |
| MP | 1 | 38 | 5 | 34 | 0.2 |
Cases where the two Saccharomyces cerevisiae paralogs share higher sequence identity to each other than either does to its respective ortholog (see text).
Cases where at least one S. cerevisiae paralog shows higher sequence identity to its ortholog than to the other S. cerevisiae paralog.
P value for the test of equal proportions of gene conversion events in the coding and upstream regions (Fisher’s exact test).
RP gene duplicates.
MP gene duplicates.
FGene expressions networks of duplicated RPs have diverged since WGD. Pairs of duplicated RPs are arranged opposite each other. Edges connect pairs of genes with correlation in gene expression >0.8. We searched among the 2 – 1 permutations of the column arrangements to find this arrangement, which has the minimal number of edges (102) crossing between the two partitions. The minimal number of crossing edges seen in randomized networks was 107, the mean was 116. Note also the high degree of asymmetry in the number of interactions seen between duplicated RPs.