| Literature DB >> 25062920 |
Toni I Gossmann1, Anna W Santure2, Ben C Sheldon3, Jon Slate4, Kai Zeng1.
Abstract
Determining the rate of protein evolution and identifying the causes of its variation across the genome are powerful ways to understand forces that are important for genome evolution. By using a multitissue transcriptome data set from great tit (Parus major), we analyzed patterns of molecular evolution between two passerine birds, great tit and zebra finch (Taeniopygia guttata), using the chicken genome (Gallus gallus) as an outgroup. We investigated whether a special feature of avian genomes, the highly variable recombinational landscape, modulates the efficacy of natural selection through the effects of Hill-Robertson interference, which predicts that selection should be more effective in removing deleterious mutations and incorporating beneficial mutations in high-recombination regions than in low-recombination regions. In agreement with these predictions, genes located in low-recombination regions tend to have a high proportion of neutrally evolving sites and relaxed selective constraint on sites subject to purifying selection, whereas genes that show strong support for past episodes of positive selection appear disproportionally in high-recombination regions. There is also evidence that genes located in high-recombination regions tend to have higher gene expression specificity than those located in low-recombination regions. Furthermore, more compact genes (i.e., those with fewer/shorter introns or shorter proteins) evolve faster than less compact ones. In sum, our results demonstrate that transcriptome sequencing is a powerful method to answer fundamental questions about genome evolution in nonmodel organisms.Entities:
Keywords: Hill–Robertson interference (HRI); RNAseq; natural selection; protein evolution; recombination; tissue specificity in gene expression
Mesh:
Year: 2014 PMID: 25062920 PMCID: PMC4231635 DOI: 10.1093/gbe/evu157
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPhylogenetic relationship of great tit and seven bird species (Hackett et al. 2008).
Median Estimates of ds and Based on Pairwise Alignment between the Great Tit Transcriptome and Each of the Seven Different Bird Species (fig. 1)
| Genome Reference | ||
|---|---|---|
| Collared flycatcher ( | 0.104 | 0.081 |
| Zebra finch ( | 0.103 | 0.099 |
| Medium ground finch ( | 0.111 | 0.083 |
| Budgerigar ( | 0.248 | 0.075 |
| Mallard duck ( | 0.318 | 0.073 |
| Chicken ( | 0.339 | 0.074 |
| Turkey ( | 0.346 | 0.071 |
aPasserine.
bNonpasserine.
Pairwise Correlation Coefficients (Spearman’s ρ) for Variables Covary with the Rates of Protein Evolution in Passerines
| GC3 | Intron Number | Intron Length | Protein Length | Chromosome Size | Gene Density | |||
|---|---|---|---|---|---|---|---|---|
| −0.04 | 0.22 | 0.10 | NS | −0.10 | NS | −0.10 | 0.09 | |
| – | −0.17 | 0.06 | −0.07 | −0.14 | −0.05 | 0.07 | NS | |
| GC3 | – | 0.19 | −0.13 | −0.19 | −0.12 | −0.35 | 0.29 | |
| τ | – | 0.06 | 0.11 | 0.10 | NS | NS | ||
| Intron number | – | 0.66 | 0.79 | −0.10 | 0.13 | |||
| Intron length | – | 0.57 | 0.04 | −0.11 | ||||
| Protein length | – | −0.09 | 0.11 | |||||
| Chromosome size | – | −0.55 |
Note.–ds and were estimated using pairwise alignments between great tit and zebra finch. GC3, GC content at third codon position; τ, expression specificity; NS, not significant.
***P < 0.001.
Partial Correlation Analyses Based on Kendall’s τ to Investigate the Effect of Variation in Various Covariates on the Correlation between the Two Variables of Interest
| Case | Variables | Covariates | Kendall’s τ |
|---|---|---|---|
| 1 | ω, τ | GC3 | 0.047 |
| 2 | ω, τ | GC3, read depth | 0.033 |
| 3 | ω, intron length | GC3, chromosome size | −0.031 |
| 4 | ω, intron number | GC3, chromosome size | −0.098 |
| 5 | ω, protein length | GC3, chromosome size | −0.028 |
| 6 | GC3 | −0.031 | |
| 7 | ω, genomic location | GC3, gene density, τ, intron number, protein length | 0.02 |
| 8 | ω, genomic location | GC3, gene density, τ, intron number, protein length | 0.023 |
| 9 | ω, | GC3, gene density, τ, intron number, protein length | 0.075 |
| 10 | GC3, gene density, τ, intron number, protein length | 0.047 | |
| 11 | Δln | GC3, gene density, τ, intron number, protein length | −0.13 |
Note.—NS, not significant.
aOuter parts of macrochromosomes and microchromosomes versus inner parts of macrochromosomes.
bOuter parts of macrochromosomes versus microchromosomes.
cω at nonneutral sites (nearly neutral model M1a, fig. 3).
dProportion of neutral sites (nearly neutral model M1a, fig. 3).
eLog-likelihood difference (model M7 vs. M8, test for positive selection, genes with P <1.0).
***.
**.
*1P = 0.032.
*2P = 0.027
NSP > 0.05.
FBox plots of evolutionary rates for subsets of genes according to their (a) chromosomal positions and (b) recombination rate estimates. Whiskers are drawn as implemented in the R-function box plot (see Materials and Methods). The bar plots show the proportion of sites estimated to be evolving under neutrality and the median value for sites inferred to be evolving under purifying selection by the nearly neutral (model M1a) in PAML. Error bars indicate median absolute deviations (MDA). under the MWU.
FBoxplots of evolutionary rates for subsets of genes specifically expressed in certain tissues; boxes in blue and red denote significantly reduced and increased values, respectively. Whiskers were drawn as implemented in the R-function boxplot (see Materials and Methods).
Location of Genes with Evidence for Positive Selection
| FDR | Recombination Region | Positively Selected | Not Positively Selected | |
|---|---|---|---|---|
| 10% | High | 8 | 1,782 | |
| Low | 0 | 659 | 0.02 | |
| 10% | Jungle | 10 | 1,323 | |
| Desert | 2 | 1,129 | 0.03 |
Note.—Genes were classified according to their genomic locations (recombination jungles and deserts) or their estimated local recombination rate (high- and low/very low recombination, see Materials and Methods). Comparisons were conducted between genes located in different recombinational environments using G tests.
aOuter parts of macrochromosomes and microchromosomes.
bInner parts of macrochromosomes.
FBox plots of (a) GC content at 3rd positions (GC3), (b) expression specificity τ, (c) ds, and (d) dn for subsets of genes according to their chromosomal positions. Whiskers are drawn as implemented in the R-function box plot (see Materials and Methods). , , .