| Literature DB >> 36060942 |
Abstract
Hormones act within in highly dynamic systems and much of the phenotypic response to variation in hormone levels is mediated by changes in gene expression. The increase in the number and power of large genetic association studies has led to the identification of hormone linked genetic variants. However, the biological mechanisms underpinning the majority of these loci are poorly understood. The advent of affordable, high throughput next generation sequencing and readily available transcriptomic databases has shown that many of these genetic variants also associate with variation in gene expression levels as expression Quantitative Trait Loci (eQTLs). In addition to further dissecting complex genetic variation, eQTLs have been applied as tools for causal inference. Many hormone networks are driven by transcription factors, and many of these genes can be linked to eQTLs. In this mini-review, we demonstrate how causal inference and gene networks can be used to describe the impact of hormone linked genetic variation upon the transcriptome within an endocrinology context.Entities:
Keywords: causal inference; eQTL; genetics; hormones; networks
Mesh:
Substances:
Year: 2022 PMID: 36060942 PMCID: PMC9428692 DOI: 10.3389/fendo.2022.949061
Source DB: PubMed Journal: Front Endocrinol (Lausanne) ISSN: 1664-2392 Impact factor: 6.055
Top 10 GWAS as determined by discovery sample size, obtained from GWAS catalog (7) under EFO term “hormone measurment” (EFO_0004730).
| Reported Trait | Discovery sample number | Publication date First author | |
|---|---|---|---|
| Nonatopic asthma or fasting insulin levels | 502,660 European | 2019-10-24 | Zhu Z ( |
| IGF-1 measurement | 435,516 European | 2021-07-05 | Barton AR ( |
| Total testosterone levels | 425,097 European | 2020-02-10 | Ruth KS ( |
| Testosterone levels x smoking behaviour interaction | 414,294 European | 2021-01-06 | Liang X ( |
| IGF 1 (Gene-based burden) | 409,926 European | 2021-10-18 | Backman JD ( |
| Sex hormone-binding globulin levels | 389,354 European | 2021-05-12 | Martin S ( |
| Body mass index and fasting insulin (pairwise) | 374,012 European, 16,962 African American or Afro-Caribbean, East Asian, Hispanic or Latin American, South East Asian | 2021-02-22 | Huang LO ( |
| Insulin-like growth factor 1 levels | 340,567 European, 5974 African unspecified, 7283 South Asian | 2021-01-18 | Sinnott-Armstrong N ( |
| Circulating leptin levels or HOMA-IR | 254,263 Asian unspecified, European, Hispanic or Latin American, NR, South Asian | 2020-04-01 | Wang X ( |
| Fasting insulin | 213,645 African American or Afro- Caribbean, East Asian, European, Hispanic or Latin American, South Asian | 2021-05-31 | Chen J ( |
Figure 1(A) Genetic variation (Left) influences complex traits (Right) through quantitative changes in intermediate phenotypes (Middle). Molecular interactions are shown as arrows, where the direction of the arrow indicates the direction of the flow of biological information. (B) Intermediate phenotypes can be modelled as biological networks using causal inference to uncover directed relationships between the molecular determinants that mediate the effect of genetic changes on complex traits. (C) Cis and trans gene regulation. Gene A (green) encodes a transcription factor (TF) which regulates the expression of gene B (purple). The eQTL (yellow), acts as a cis-eQTL for gene A by causing a change in the sequence of gene A’s cis-regulatory element (orange) which may either increase or decrease the binding affinity of any corresponding TFs. The same eQTL is a trans-eQTL for Gene B as by changing the expression of the TF encoded by gene A, this in turn influences the expression of gene B.
Top 10 RNA-seq based eQTL studies as determined by sample size, obtained from eQTL catalogue (47).
| Study name | Cell type or tissues | Number of samples | Number of donors |
|---|---|---|---|
| GTEx (v8) ( | 49 tissues | 15,178 | 838 |
| TwinsUK ( | adipose, LCLs1, skin, blood | 1364 | 433 |
| Schmiedel ( | 15 immune cell types | 1331 | 91 |
| Quach ( | monocytes | 969 | 200 |
| CommonMind ( | brain (DLPFC3) | 590 | 590 |
| ROSMAP ( | brain (DLPFC3) | 576 | 576 |
| GENCORD ( | LCLs1, fibroblasts, T cells | 560 | 195 |
| FUSION ( | adipose, muscle | 559 | 302 |
| BLUEPRINT ( | monocytes, neutrophils, CD4+ T cells | 554 | 197 |
| Nedelec (2016) ( | macrophages | 493 | 168 |
Figure 2(A) Instrumental Variable paradigm. The instrumental variable (Z) is causally associated with the exposure (X) which in turn is causally associated with the outcome (Y). The IV will account for any confounding (U) that affects the exposure or outcome, assuming independence of U. (B) Causal modelling of pairwise gene-gene relationships. (Left) Simple causal model where Trait A is influenced by Gene A, through Gene B (Middle) Reactive model where Gene B influences both Gene A and Trait A, therefore any association between Gene A and Trait A is a non-causal relationship. (Right) Association between Genes A and B is a result of unobserved confounding, therefore there is no causal relationship between Gene A and Trait A. (C) Reconstructing gene networks from pairwise relationships. (Left) Prospective pairwise relationships between genes with a robust eQTL (blue and orange) and other genes within a dataset. (Middle) Causal inference approaches are employed to obtain a probability matrix for the likelihood of a causal relationship between gene pairs. (Right) A filtering step is imposed e.g. a False Discovery Rate (FDR) cut-off, which will return relationships that cross this threshold to be assembled as directed networks.