| Literature DB >> 31810449 |
Nasser Bashkeel1, Theodore J Perkins2, Mads Kærn3, Jonathan M Lee4.
Abstract
BACKGROUND: Phenotypic variability of human populations is partly the result of gene polymorphism and differential gene expression. As such, understanding the molecular basis for diversity requires identifying genes with both high and low population expression variance and identifying the mechanisms underlying their expression control. Key issues remain unanswered with respect to expression variability in human populations. The role of gene methylation as well as the contribution that age, sex and tissue-specific factors have on expression variability are not well understood.Entities:
Keywords: Aging; Essentiality; Expression variability; Methylation; Tissue specificity
Mesh:
Year: 2019 PMID: 31810449 PMCID: PMC6898959 DOI: 10.1186/s12864-019-6308-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Expression variability (EV) in human breast, cerebellum, and frontal cortex tissue. (a) Expected expression MAD for curve as a function of median probe expression (solid black line). (b) Kernel density estimation function of EV. The vertical black lines represent the EV classification ranges. (c) Expression variability as a function of median gene expression. Adjusted R2 values for the linear regression model shown in red were 0.0002, 0.0008, and 0.005 and the associated Kendall rank correlation coefficients were − 0.208, − 0.201, − 0.213 for breast, cerebellum, and frontal cortex tissues respectively
Correlation analysis of EV and probe expression. Adjusted R2 values were calculated using a linear regression model
| Breast | Cerebellum | Frontal Cortex | |
|---|---|---|---|
| Kendall Rank Correlation Coefficient | −0.208 | − 0.201 | −0.213 |
| Linear Regression Adjusted R2 Value | 2 × 10− 4 | 8 × 10− 4 | 5 × 10− 3 |
Fig. 2Bimodal Hyper-Variable gene expression detection. Gaussian mixture modelling method of detecting bimodal probes. The dashed lines represent the overall gene kernel density estimation function of gene expression. The two Gaussian models are shown in dark grey and light grey, and the dotted vertical lines represent the distribution means
Fig. 3Cross-Validation of EV Classifications. (a) Relative frequency of EV classification accuracy between original distribution and 50–50 split retest replicates (n = 100). (b) Number of probes in each EV probe set before and after split-retest protocol
Count summary of probes before and after 50–50 split-retest procedure. Hypervariable and Hypovariable probes that were not retained after the split-retest were relabeled as “Non-Variable”
| Probe Set | Tissue | Number of Probes Before Retesting | Number of Probes After Retesting | % of Probes After Retesting |
|---|---|---|---|---|
| Hypervariable | Breast | 3125 | 1448 | 46.34 |
| Cerebellum | 2987 | 1640 | 54.90 | |
| Frontal Cortex | 2949 | 1760 | 59.68 | |
| Hypovariable | Breast | 4371 | 957 | 21.89 |
| Cerebellum | 2619 | 837 | 31.96 | |
| Frontal Cortex | 3019 | 1254 | 41.54 | |
| Non-Variable | Breast | 34,456 | 39,547 | 114.78 |
| Cerebellum | 36,356 | 39,485 | 108.61 | |
| Frontal Cortex | 35,994 | 38,948 | 108.21 |
Fig. 4Tissue Specificity of EV. (a) Venn diagrams comparing EV classifications of probe mapped genes sets between breast, cerebellum, and frontal cortex tissues. (b) Effect of genomic position on EV. Each chromosome is divided into 100 bins (x-axis) based on the maximum gene coordinate annotation, and the average EV in each bin is measured (y-axis). Bins with an average EV greater than 0 are represented in green, while those with a negative EV are represented in red. Bins with less than three probes were assigned an average EV of zero
Top 5 common and tissue-specific REVIGO GO annotations in the Hyper-Variable and Hypo-Variable probe mapped gene sets of breast, cerebellum, and frontal cortex tissues
| Common Probe-Mapped Genes | Breast-Specific Probe-Mapped Genes | Cerebellum-Specific Probe-Mapped Genes | Frontal Cortex-Specific Probe-Mapped Genes | |
|---|---|---|---|---|
| Hyper-Variable | Regulation of bone remodeling | Epithelial cell differentiation | Regulation of nervous system development | Histamine secretion |
| Regulation of inflammatory response | Primary alcohol metabolism | Regulation of transmembrane transport | Regulation of cell morphogenesis | |
| Response to zinc ion | Positive regulation of cellular component movement | Regulation of neuron death | Trans-synaptic signaling | |
| Carboxylic acid biosynthesis | Response to corticosteroid | Negative regulation of response to external stimulus | Regulation of neurological system process | |
| Regulation of ion transport | Transmembrane receptor protein tyrosine kinase signaling pathway | Response to calcium ion | Dephosphorylation | |
| Hypo-Variable | Proteolysis involved in cellular protein catabolism | Golgi vesicle transport | DNA conformation change | ncRNA metabolism |
| Ribonucleoprotein complex assembly | Nucleoside monophosphate metabolism | Modification-dependent macromolecule catabolism | Response to interleukin-1 | |
| Regulation of cellular amino acid metabolism | Proteolysis involved in cellular protein catabolism | Response to camptothecin | Regulation of enter of bacterium into host cell | |
| Innate immune response activating cell surface receptor signaling pathway | Cellular response to nitrogen starvation | Retrograde transport, endosome to Golgi | ||
| Negative regulation of autophagy | Mitochondrial respiratory chain complex I assembly | Regulation of ubiquitin-protein transferase activity |
Pearson’s Chi-squared test for Essentiality in Hyper-Variable, Hypo-Variable, and Non-Variable probe mapped gene sets
| Tissue | Probe Set | Total Gene Count | Essential Gene Counts | Standardized Residuals | |
|---|---|---|---|---|---|
| Breast | Hyper | 1448 | 165 | 8.65 | 1.48 × 10−22 |
| Hypo | 957 | 103 | 4.94 | ||
| NV | 39,547 | 2095 | −9.87 | ||
| Cerebellum | Hyper | 1640 | 160 | 5.88 | 4.85 × 10−10 |
| Hypo | 837 | 76 | 2.69 | ||
| NV | 39,485 | 2128 | −6.42 | ||
| Frontal Cortex | Hyper | 1760 | 181 | 7.28 | 1.43 × 10−16 |
| Hypo | 1254 | 121 | 4.15 | ||
| NV | 38,948 | 2062 | −8.38 |
Top 5 common and unique REVIGO GO annotation subsets of Hyper-Variable and Hypo-Variable essential genes in breast, cerebellum, and frontal cortex tissues
| Breast-Specific Probe-Mapped Genes | Cerebellum-Specific Probe-Mapped Genes | Frontal Cortex-Specific Probe-Mapped Genes | |
|---|---|---|---|
| Hyper-Variable Essential Genes | Chordate embryonic development | Regulation of cell development | Positive regulation of cell differentiation |
| Cellular response to growth factor stimulus | Epithelial cell migration | Transmembrane receptor protein tyrosine kinase signalling pathway | |
| Mesenchymal cell apoptotic process | Positive regulation of cell proliferation | Epithelial cell migration | |
| Carboxylic acid biosynthesis | Cellular response to growth factor stimulus | Regulation of actin cytoskeleton organization | |
| Cell-substrate junction assembly | Anterograde trans-synaptic signalling | Regulation of lipase activity | |
| Hypo-Variable Essential Genes | DNA repair | DNA repair | DNA repair |
| Regulation of cellular protein localization | Protein oligomerization | Peptide transport | |
| Mitochondrial genome maintenance | Positive regulation of viral process | Regulation of type I interferon production | |
| Chordate embryonic development | Negative regulation of cell cycle | Response to UV | |
| Protein modification by small protein removal | Lysosomal transport | Phosphorylation |
Fig. 5Methylation in human cerebellum and frontal cortex tissue. (A) Kernel density estimation function of average gene methylation. Gaussian mixture models were used to classify the genes into Non-, Medium- and Highly- methylated clusters. (B) Kernel density estimation function of average gene methylation by EV classification. The dashed vertical lines represent the methylation state cluster cut-offs generated by the Gaussian mixture modelling
Pearson’s Chi-Squared Test Standardized Residuals. We tested the independence between the methylation state clusters and the EV classifications in cerebellum and frontal cortex tissues and found a significant relationship between the two variables (p = 7.57 × 10–36 and p = 1.58 × 10–59, respectively)
| Cerebellum Tissue | Frontal Cortex Tissue | |||||
|---|---|---|---|---|---|---|
| Non-Methylated | Medium Methylated | Highly Methylated | Non-Methylated | Medium Methylated | Highly Methylated | |
| Hypo-Variable | 11.98 | − 5.69 | −9.04 | 14.84 | − 7.11 | −10.79 |
| Non-Variable | −7.52 | 0.06 | 8.59 | −10.00 | −0.04 | 11.73 |
| Hyper-Variable | 0.07 | 4.21 | −3.58 | −0.23 | 6.23 | −5.47 |
Probe-Wise Multiple Linear Regression of Sex, PMI, and Age. Probes that exhibit an FDR < 0.01 are considered significant for the specific coefficient
| Sex | PMI | Age | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Up | Down | Total | Up | Down | Total | Up | Down | Total | |
| Cerebellum | 12 | 10 | 22 | 2 | 0 | 2 | 247 | 267 | 514 |
| Frontal | 8 | 15 | 23 | 7 | 9 | 16 | 373 | 354 | 727 |
Fig. 6Hierarchical clustering of Hyper-Variable genes by age in (A) cerebellum tissue, and (B) frontal cortex tissue. The vertical axis represents the age-regulated Hyper-Variable genes while the samples were clustered by age and plotted on the horizontal axis. The top heatmaps represent the positively correlated age-regulated genes while the bottom heatmaps represent the negatively correlated age-regulated genes. The age clusters decrease in age from left to right in both heatmaps and correspond to the following age ranges: , , and
Fig. 7Expression and methylation correlation. Histogram of Pearson correlation coefficient between paired gene expression and gene methylation levels in the Hyper-Variable and Hypo-Variable probe sets
Description of brain sample dataset cohorts. Clinical annotations were not available for breast tissue samples
| Clinical Annotation | Dataset | Min | Q1 | Median | Mean | Q3 | Max |
|---|---|---|---|---|---|---|---|
| Age | Expression | 1 | 24 | 46 | 47.79 | 71 | 98 |
| Methylation | 1 | 21 | 44 | 47.48 | 74 | 96 | |
| PMI | Expression | 1 | 14 | 25 | 36.14 | 61 | 94 |
| Methylation | 1 | 14 | 21 | 26.65 | 36 | 62 | |
| Dataset | Females (n) | Males (n) | Females (%) | Males (%) | |||
| Sex | Expression | 289 | 622 | 31.72% | 68.28% | ||
| Methylation | 243 | 481 | 33.56% | 66.44% | |||