| Literature DB >> 28847336 |
R M Maier1, P M Visscher1, M R Robinson2, N R Wray1.
Abstract
The availability of genome-wide genetic data on hundreds of thousands of people has led to an equally rapid growth in methodologies available to analyse these data. While the motivation for undertaking genome-wide association studies (GWAS) is identification of genetic markers associated with complex traits, once generated these data can be used for many other analyses. GWAS have demonstrated that complex traits exhibit a highly polygenic genetic architecture, often with shared genetic risk factors across traits. New methods to analyse data from GWAS are increasingly being used to address a diverse set of questions about the aetiology of complex traits and diseases, including psychiatric disorders. Here, we give an overview of some of these methods and present examples of how they have contributed to our understanding of psychiatric disorders. We consider: (i) estimation of the extent of genetic influence on traits, (ii) uncovering of shared genetic control between traits, (iii) predictions of genetic risk for individuals, (iv) uncovering of causal relationships between traits, (v) identifying causal single-nucleotide polymorphisms and genes or (vi) the detection of genetic heterogeneity. This classification helps organise the large number of recently developed methods, although some could be placed in more than one category. While some methods require GWAS data on individual people, others simply use GWAS summary statistics data, allowing novel well-powered analyses to be conducted at a low computational burden.Entities:
Keywords: Genetics; methods; polygenic; review
Mesh:
Year: 2017 PMID: 28847336 PMCID: PMC6088780 DOI: 10.1017/S0033291717002318
Source DB: PubMed Journal: Psychol Med ISSN: 0033-2917 Impact factor: 7.723
Fig. 1.Schematic representation of the basic models underlying the polygenic methods reviewed. All models assume that a phenotype (P) is influenced by genetic (G) and environmental (E) factors (with environmental defined loosely as anything not captured by G including stochastic variation and measurement error). (a) Model which considers only one phenotype and no gene expression. (b) Model which considers only one phenotype and gene expression (X). (c) Model which considers two or more phenotypes and no gene expression. Methods can be grouped into those where the focus lies on individual SNPs, genes or people (nodes highlighted), and those where the focus lies on aggregate measures affecting the relationship between genetic and environmental factors and a phenotype (edges highlighted).
Overview of selected polygenic methods
| Program/method | Data needed | URL |
|---|---|---|
| Estimation of | ||
| GCTA (GREML) | Individual-level genotype data | |
| BOLT-REML/BOLT-LMM | Individual-level genotype data | |
| LD score regression | Summary statistics + LD scores | |
| HESS (local heritability) | Summary statistics + LD reference panel | |
| Polygenic risk prediction | ||
| PLINK | Summary statistics + individual-level data | |
| PRSice | Summary statistics + individual-level data | |
| GCTA, MTG2 (GBLUP, MTGBLUP) | Individual-level genotype data | |
| BayesR | Individual-level genotype data | |
| LDpred | Summary statistics + individual-level data | |
| Causality of phenotypes | ||
| Mendelian randomisation | Individual-level genotype data or summary statistics | |
| gwas-pw | Summary statistics | |
| Causality of genes (gene prioritisation) | ||
| gwas-pw | Summary statistics | |
| SMR | Summary statistics + eQTL | |
| PrediXcan | Individual-level genotype data + eQTL | |
| metaXcan | Summary statistics + eQTL | |
| TWAS/FUSION | Summary statistics + eQTL | |
| DEPICT | Summary statistics | |
| Causality of SNPs (fine-mapping) | ||
| PICS (Fine-mapping) | Summary statistics | |
| GCTA (COJO) | Summary statistics | |
| MANTRA | Summary statistics | Available on request from the author |
| Detection of genetic heterogeneity | ||
| BUHMBOX | Summary statistics + individual-level data | |
| Subtest | Summary statistics |
Fig. 2.Genetic correlations between psychiatric disorders and traits, and almost 200 other traits. For each trait, the 10 traits with the highest absolute genetic correlations are shown. Colours indicate whether genetic correlations are positive or negative. One star indicates a genetic correlation p value <0.05. Three starts indicate a p value below the Bonferroni threshold of 2.81 × 10−6 for 17 766 tested trait pairs. Data obtained from LD Hub (Zheng et al. 2017).
Fig. 3.Causal pathways in an MR experiment. A causal effect of exposure (X) on outcome (Y) can be inferred if three core assumptions are met. These assumptions concern the genetic instrumental variable, Z, and state that: (i) Z has to be robustly associated with the exposure variable, (ii) Z cannot be related to any common causal factors of X and the outcome Y (these are labelled U), and (iii) Z may only be related to Y through X (Solovieff et al. 2013). The latter two assumptions can be summarised as the absence of pleiotropy for the instrumental variable.