| Literature DB >> 28583075 |
Courtney C Babbitt1,2,3, Ralph Haygood4, William J Nielsen5, Gregory A Wray5,6,7.
Abstract
BACKGROUND: Despite evidence for adaptive changes in both gene expression and non-protein-coding, putatively regulatory regions of the genome during human evolution, the relationship between gene expression and adaptive changes in cis-regulatory regions remains unclear.Entities:
Keywords: Adaptation; Gene expression; Gene function; Gene regulation; Human evolution
Mesh:
Substances:
Year: 2017 PMID: 28583075 PMCID: PMC5460488 DOI: 10.1186/s12864-017-3831-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Boxplot of the –log10(qvalue) distributions from the GLM examining the Tissue (left, orange) and Species (right, green) effects
Fig. 2Correlation between tissue specificity and gene expression divergence between humans and chimpanzees. There is a significant correlation between higher specificity to a tissue and expression divergence (adipose tissue as an example here)
Functions of overall differentially expressed genes
| Set | Collection | # genes | r_rb | SE(r_rb) |
|---|---|---|---|---|
| chr17q11 | C1:All | 66 | 0.88 | 0.021 |
| chr12q13 | C1:All | 152 | 0.84 | 0.026 |
| NIKOLSKY BREAST CANCER 17Q11 Q21 AMPLICON | C2:CGP | 80 | 0.84 | 0.033 |
| chr17q21 | C1:All | 164 | 0.69 | 0.033 |
| chr4q21 | C1:All | 61 | 0.62 | 0.066 |
| chr1p13 | C1:All | 79 | 0.59 | 0.06 |
| GNF2 DNM1 | C4:CGN | 63 | 0.41 | 0.046 |
| RICKMAN HEAD AND NECK CANCER A | C2:CGP | 77 | 0.34 | 0.06 |
| LEIN NEURON MARKERS | C2:CGP | 50 | 0.34 | 0.052 |
| CAHOY NEURONAL | C6:All | 84 | 0.33 | 0.053 |
| GCM MAP1B | C4:CGN | 53 | 0.33 | 0.076 |
| ANASTASSIOU CANCER MESENCHYMAL TRANSITION SIGNATURE | C2:CGP | 53 | 0.33 | 0.069 |
| SABATES COLORECTAL ADENOMA UP | C2:CGP | 86 | 0.33 | 0.049 |
| GNF2 CCNA2 | C4:CGN | 52 | 0.32 | 0.065 |
| KRAS.KIDNEY UP.V1 UP | C6:All | 113 | 0.32 | 0.047 |
| VOLTAGE GATED CATION CHANNEL ACTIVITY | C5:MF | 57 | 0.31 | 0.061 |
| NAKAYAMA SOFT TISSUE TUMORS PCA2 UP | C2:CGP | 71 | 0.31 | 0.053 |
| GCM MAPK10 | C4:CGN | 71 | 0.31 | 0.059 |
| CERVERA SDHB TARGETS 1 UP | C2:CGP | 95 | 0.3 | 0.052 |
| VOLTAGE GATED CHANNEL ACTIVITY | C5:MF | 63 | 0.3 | 0.058 |
The 20 (of 5247) MSigDB gene sets most enriched with genes scoring high for overall differential expression between humans and chimpanzees are listed. Collection is the MSigDB collection containing the set (one of 16). genes is the number of genes whose expression we measured in the set (at least 50). r rb is the rank-biserial correlation between scores for differential expression and membership in the set; sets are ordered by decreasing r rb. SE(r rb) is the standard error of r rb via bootstrapping with 10,000 replicates
Fig. 3Bubble plot of GO Biological Process enrichments for gene expression differences across five tissues between human and chimpanzee. To display these traits visually, we calculated an optimal 2-dimensional arrangement using non-metric multidimensional scaling on the between-category sematic similarity scores, a measure of a priori relatedness of traits. Traits whose SimREL distance is less than 0.5 are considered similar within the GO Biological Process ontology tree (code available on request). The significantly differentially expressed categories are displayed on the same axes, but separated into two plots for clarity. In each plot, the intensity of the colors of the circles and text indicate the evidence for differential expression of each trait. The area of the circle is proportional to the log of the number of genes counted in each trait
Fig. 4Ideogram of clustering of significant differentially expressed genes in humans compared to chimpanzees across tissues (red bars highlight the chromosomal enrichment regions in Table 1)
Functional correlations between overall differential expression and noncoding adaptation
| Collection # | sets | r_r | p(r_r) | q(r_r) |
|---|---|---|---|---|
| C3:TFT (transcription factor targets) | 472 | 0.35 | < 0.0001 | < 0.0005 |
| C2:CGP (chemical and genetic perturbations) | 645 | 0.28 | < 0.0001 | < 0.0005 |
| C7:All (immunologic signatures) | 1723 | 0.14 | < 0.0001 | < 0.0005 |
| C4:CGN (cancer gene neighborhoods) | 88 | 0.5 | 0.012 | 0.045 |
| C6:All (oncogenic signatures) | 109 | 0.22 | 0.019 | 0.057 |
| C4:CM (cancer modules) | 87 | 0.29 | 0.036 | 0.09 |
| C5:CC (GO cellular components) | 37 | 0.33 | 0.14 | 0.27 |
| C3:MIR (microRNA targets) | 109 | 0.11 | 0.14 | 0.27 |
| C1:All (positional gene sets) | 9 | 0.17 | 0.34 | 0.53 |
| H:All (hallmark gene sets) | 25 | 0.089 | 0.35 | 0.53 |
| C2:CP:Reactome (Reactome gene sets) | 39 | 0.058 | 0.4 | 0.55 |
| C2:CP:KEGG (KEGG gene sets) | 11 | −0.17 | 0.7 | 0.88 |
| C2:CP:Other (other canonical pathways) | 6 | −0.49 | 0.8 | 0.93 |
| C5:BP (GO biological processes) | 123 | −0.28 | 0.95 | 0.96 |
| C5:MF (GO molecular functions) | 44 | −0.41 | 0.96 | 0.96 |
All MSigDB collections we considered are listed except for C2:CP:BioCarta, in which no gene set contained at least 50 genes analyzed in a previous study of adaptive noncoding changes. # sets is the number of gene sets we considered in the collection. r r is the rank correlation between rank-biserial correlations for enrichment with differential expression (each of which is a rank-biserial correlation as in Table 1) and rank-biserial correlations for enrichment with noncoding adaptation (each of which is likewise a rank-biserial correlation). p(r r) is the one-tailed p-value of r r via permutation test with 10,000 permutations; collections are ordered by increasing p(r r), which reflects r r, # sets, and patterns of overlap among gene sets in the collection