| Literature DB >> 31529038 |
Cemalettin Bekpen1, Diethard Tautz1.
Abstract
Illuminating the role of specific gene duplications within the human lineage can provide insights into human-specific adaptations. The so-called human core duplicon gene families have received particular attention in this respect, due to special features, such as expansion along single chromosomes, newly acquired protein domains and signatures of positive selection. Here, we summarize the data available for 10 such families and include some new analyses. A picture emerges that suggests broad functions for these protein families, possibly through modification of core cellular pathways. Still, more dedicated studies are required to elucidate the function of core-duplicons gene families and how they have shaped adaptations and evolution of humans.Entities:
Keywords: adaptation; copy number variation; duplication; duplicons; gene family; human
Mesh:
Year: 2019 PMID: 31529038 PMCID: PMC6920530 DOI: 10.1093/bfgp/elz016
Source DB: PubMed Journal: Brief Funct Genomics ISSN: 2041-2649 Impact factor: 4.241
Figure 1The Morpheus (NPIP) gene family as an example for human core duplicons. Figure adapted from [4, 7]. The individual members of the Morpheus core duplicons are ordered according to location on chromosome 16 (left). The mosaic structure of the complex duplication blocks is depicted on the right. The core duplicon part, which is shared by all duplication blocks, is marked by the red broken lines. Numbers next to the fragments refer to the start and end positions on the chromosome (in kb), and arrows indicate the relative orientation. Different colors refer to distinct duplication blocks, which are predicted based on reciprocal comparisons of each human subunit and its flanking sequence to outgroup mammalian genomes [4].
Signatures of positive selection in human core duplicons
| Chr. | Name | Alternative name(s) | Ancestral gene (mouse) | Signatures of positive selection | Selection test | Literature |
|---|---|---|---|---|---|---|
| 1 | Olduvai protein domain family | Neuroblastoma breakpoint gene family, NBPF | Pde4dip | Yes | PAML, | [ |
| 2 | RGPD | RANBP2-like gene family | Ranbp2 | Yes | PAML, | [ |
| 5 | SMA-GUSBP | SMA | Gusbp2 | None described | ||
| 7 | PMS2P | PMS2 gene family | Pms2 | Yes | PAML, | [ |
| 9 | SPATA31 | FAM75A | Spata31 | Yes | PAML, | [ |
| 11 | TRIM51 | SPRYD5 | no homolog | Yes | PAML, | [ |
| 15 | GOLGA8 | GOLGA, golgin | Golga2 | Yes | PAML, | [ |
| 16 | Morpheus | NPIP | no homolog | Yes | PAML, | [ |
| 17 | TBC1D3 | - | Usp6nl | None described | ||
| 17 | LRRC37 | - | Lrrc37a1 | None described |
Figure 2Overview of human core duplicon gene families. The figure depicts the segmentally duplicated copies of core duplicon gene families from human, macaque and mouse and includes other mammals. Functional properties of Olduvai, SPATA31 and TBC1D3, which have been studied in detail, and possible/predicted functions of other core duplicon gene families are included. CNV (median:variance) was extracted from a previous study [10]. * indicates CNV estimate based on number of Olduvai domains. # indicates CNV information for TRIM51 genes was extracted from a previous study [17], and & indicates CNV for SPATA31 genes was extracted from a previous study [18].
Linear regression analysis of human core duplicon gene families with brain volumes and body mass across primates
| NBPF / Olduvai | RGPD | SMA-GUSB | PMS2P | SPATA31 | TRIM51 (SPRYD5) | GOLGA | MORPHEUS | TBC1D3 | LRRC37 | |
|---|---|---|---|---|---|---|---|---|---|---|
| Brain mass (mg) | 0.90** | 0.91** | 0.90** | 0.38* | 0.72** | 0.87** | 0.68** | 0.66** | 0.82** | 0.83** |
| Brain volume (mm3) | 0.90** | 0.91** | 0.90** | 0.38* | 0.72** | 0.87** | 0.68** | 0.66** | 0.83** | 0.83** |
| Neocortex volume (mm3) | 0.92** | 0.90** | 0.92** | 0.37* | 0.72** | 0.86** | 0.68** | 0.65** | 0.81** | 0.83** |
| Cerebellum volume (mm3) | 0.83** | 0.91** | 0.81** | 0.36* | 0.72** | 0.85** | 0.64** | 0.64** | 0.81** | 0.77** |
| Body mass (g) | 0.23ns | 0.57** | 0.24ns | 0.22ns | 0.38* | 0.43* | 0.28 ns | 0.35* | 0.44* | 0.27 ns |
* P < 0.05.
** P < 0.001.
ns P > 0.05.