| Literature DB >> 32365201 |
Xueyuan Jiang1, Raquel Assis1,2,3,4.
Abstract
Much of the enormous phenotypic variation observed across human populations is thought to have arisen from events experienced as our ancestors peopled different regions of the world. However, little is known about the genes involved in these population-specific adaptations. Here, we explore this problem by simultaneously examining population-specific genetic and expression differentiation in four human populations. In particular, we derive a branch-based estimator of population-specific differentiation in four populations, and apply this statistic to single-nucleotide polymorphism and RNA-seq data from Italian, British, Finish, and Yoruban populations. As expected, genome-wide estimates of genetic and expression differentiation each independently recapitulate the known relationships among these four human populations, highlighting the utility of our statistic for identifying putative targets of population-specific adaptations. Moreover, genes with large copy number variations display elevated levels of population-specific genetic and expression differentiation, consistent with the hypothesis that gene duplication and deletion events are key reservoirs of adaptive variation. Further, many top-scoring genes are well-known targets of adaptation in Europeans, including those involved in lactase persistence and vitamin D absorption, and a handful of novel candidates represent promising avenues for future research. Together, these analyses reveal that our statistic can aid in uncovering genes involved in population-specific genetic and expression differentiation, and that such genes often play important roles in a diversity of adaptive and disease-related phenotypes in humans.Entities:
Keywords: expression divergence; genetic divergence; human evolution; population genetics
Mesh:
Year: 2020 PMID: 32365201 PMCID: PMC7197493 DOI: 10.1093/gbe/evaa021
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
. 1.—Relationships among TSI, GBR, FIN, and YRI populations inferred from genome-wide patterns of genetic and expression differentiation. Population trees supported by the majority of gene trees constructed using (A) , (B) with , and (C) with . Numbers indicate proportions of corresponding nodes in all gene trees (see Materials and Methods for details).
. 2.—Schematic for calculating the PBS value of a gene in population W. Depicted are scenarios in which population-specific differentiation of a gene has occurred in population W of a set of (A) three populations W, X, and Y and (B) four populations W, X, Y, and Z. In each case, population-specific differentiation results in elongation of external branch W (red). To estimate the length of external branch W, we unroot the tree (top of each panel) and apply the formula shown (bottom of each panel) to pairwise genetic () or expression () distances between populations. We can use an analogous approach to estimate lengths of other external branches.
. 3.—PBS4 values of genes with CNVs. Distributions of (A) genetic PBS4 values calculated from , (B) expression PBS4 values calculated from with , and (C) expression PBS4 values calculated from with of genes without (gray) and with (blue) CNVs. * and ** (see Materials and Methods for details).
Genes with Top Five Genetic PBS4 Values in TSI, GBR, and FIN
| TSI | GBR | FIN | |
|---|---|---|---|
| 1 |
|
|
|
| 2 |
|
|
|
| 3 |
|
|
|
| 4 |
|
|
|
| 5 |
|
|
|
Genes with Top Five Expression PBS4 Values ( with and ) in TSI, GBR, and FIN
| TSI | GBR | FIN | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| 1 |
|
|
|
|
|
|
| 2 |
|
|
|
|
|
|
| 3 |
|
|
|
|
|
|
| 4 |
|
|
|
|
|
|
| 5 |
|
|
|
|
|
|