| Literature DB >> 25971280 |
Josephine T Daub1, Isabelle Dupanloup2, Marc Robinson-Rechavi3, Laurent Excoffier4.
Abstract
Because natural selection is likely to act on multiple genes underlying a given phenotypic trait, we study here the potential effect of ongoing and past selection on the genetic diversity of human biological pathways. We first show that genes included in gene sets are generally under stronger selective constraints than other genes and that their evolutionary response is correlated. We then introduce a new procedure to detect selection at the pathway level based on a decomposition of the classical McDonald-Kreitman test extended to multiple genes. This new test, called 2DNS, detects outlier gene sets and takes into account past demographic effects and evolutionary constraints specific to gene sets. Selective forces acting on gene sets can be easily identified by a mere visual inspection of the position of the gene sets relative to their two-dimensional null distribution. We thus find several outlier gene sets that show signals of positive, balancing, or purifying selection but also others showing an ancient relaxation of selective constraints. The principle of the 2DNS test can also be applied to other genomic contrasts. For instance, the comparison of patterns of polymorphisms private to African and non-African populations reveals that most pathways show a higher proportion of nonsynonymous mutations in non-Africans than in Africans, potentially due to different demographic histories and selective pressures.Entities:
Keywords: McDonald–Kreitman test; human evolution; pathway analysis; polygenic selection
Mesh:
Year: 2015 PMID: 25971280 PMCID: PMC4494071 DOI: 10.1093/gbe/evv083
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FResults from the 2DNS test. Here, shared SNPs (SNPs that are polymorphic both in African and non-African populations) are compared with fixed substitutions in the human branch. (A) Distribution of DN/DS and PN/PS ratios of all tested pathways. Each dot represents a pathway with color corresponding to its significance and size to its total exon length. The six pathways that scored significant (q value < 0.2; table 2) in the 2DNS test are highlighted by a black circle. Note that seemingly outlier gene sets may not reach significance due to their small size, which increases the variance of their null distribution. For example, the Hormone ligand-binding receptors pathway (depicted in gray; H), has a high DN/DS ratio, but because of its small size (ten genes, 24 kb total exon length) its null distribution is very widespread in the 2D space. (B–G) Null distributions for six significant (q < 0.2) pathways. (H) Null distribution for the Hormone ligand-binding receptors pathway as a typical example of a nonsignificant outlier. The observed positions of gene sets are indicated as orange dots highlighted by a black circle in the DN/DS–PN/PS plane, whereas the empirical null distribution is shown as gray dots (N = 400,000). Light-orange dots show the scores of the jackknifed gene sets. The contour lines mark the proportion (0.5, 0.9, and 0.99) of the null distribution that falls within these areas.
FResults of the 2DNS test on PN/PS ratios of all tested pathways. Here, non-African SNPs (SNPs that are only polymorphic in non-African populations) are compared with African SNPs (SNPs unique to African populations). (A) Distribution of DN/DS and PN/PS ratios of all tested pathways. Each dot represents a pathway with color corresponding to its significance and size to its total exon length. The four pathways that scored significant (q value < 0.2) in the 2DNS test are highlighted by a black circle. (B–E) Null distributions for the four significant (q < 0.2) pathways. The observed positions of gene sets are indicated as orange dots highlighted by a black circle in the DN/DS-PN/PS plane, whereas the empirical null distribution (N = 400,000) is shown as gray dots. Light-orange dots show the scores of the jackknifed gene sets. The contour lines mark the proportion (0.5, 0.9, and 0.99) of the null distribution that falls within these areas.
Pathways Scoring a q Value < 0.2 in the 2DNS Test Comparing Fixed Mutations in the Human Branch (DN/DS) and Polymorphisms Shared Between African and non-African Populations (PN/PS)
| Rank | Gene Set | Size (Genes) | Length (kb) | DN | DS | DN/DS | PN | PS | PN/PS | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Olfactory transduction | 291 | 425 | 628 | 375 | 1.67 | 592 | 323 | 1.83 | 9.49e-59 | 1.06e-55 |
| 2 | Olfactory Signaling Pathway | 250 | 290 | 548 | 298 | 1.84 | 532 | 274 | 1.94 | 9.99e-43 | 5.58e-40 |
| 3 | Glutathione conjugation | 22 | 78 | 19 | 14 | 1.36 | 27 | 9 | 3.00 | 5.55e-04 | 0.1513 |
| 4 | Beta-catenin phosphorylation cascade | 17 | 109 | 24 | 22 | 1.09 | 1 | 12 | 0.08 | 6.63e-04 | 0.1513 |
| 5 | Meiotic Recombination | 57 | 171 | 127 | 88 | 1.44 | 26 | 34 | 0.76 | 7.41e-04 | 0.1513 |
| 6 | mRNA Splicing | 101 | 437 | 41 | 233 | 0.18 | 9 | 75 | 0.12 | 8.13e-04 | 0.1513 |
Note.—Gene set length, total exon length of genes in set. Pathways marked with a "*" represent a union of highly similar pathways.
Counts and Ratios of Nonsynonymous (N) and Synonymous (S) Fixed Mutations and Polymorphisms in the Human Lineage
| Divergence (D) | |||
| All genic substitutions | 32,332 | 41,596 | 0.78 |
| Substitutions in genes belonging to a gene set | 13,802 | 20,811 | 0.66 |
| Substitutions in genes not belonging to a gene set | 18,530 | 20,785 | 0.89 |
| Polymorphism (P) | |||
| All genic SNPs | 63,812 | 57,672 | 1.11 |
| Private African SNPs | 27,413 | 24,953 | 1.10 |
| Private non-African SNPs | 19,965 | 13,990 | 1.43 |
| Shared SNPs | 16,434 | 18,729 | 0.88 |
| SNPs in genes belonging to a gene set | 28,876 | 28,704 | 1.01 |
| Private African SNPs | 12,309 | 12,377 | 0.99 |
| Private non-African SNPs | 9,263 | 6,911 | 1.34 |
| Shared SNPs | 7,304 | 9,416 | 0.78 |
| SNPs in genes not belonging to a gene set | 34,936 | 28,968 | 1.21 |
| Private African SNPs | 15,104 | 12,576 | 1.20 |
| Private non-African SNPs | 10,702 | 7,079 | 1.51 |
| Shared SNPs | 9,130 | 9,313 | 0.98 |
FPotential selective forces having acted on significant outlier gene sets depending on their position in the N/S plane.