| Literature DB >> 27577545 |
Tom G Richardson1, Nicholas J Timpson1, Colin Campbell2, Tom R Gaunt3.
Abstract
Current endeavours in rare variant analysis are typically underpowered when investigating association signals from individual genes. We undertook an approach to rare variant analysis which utilises biological pathway information to analyse functionally relevant genes together. Conventional filtering approaches for rare variant analysis are based on variant consequence and are therefore confined to coding regions of the genome. Therefore, we undertook a novel approach to this process by obtaining functional annotations from the Combined Annotation Dependent Depletion (CADD) tool, which allowed potentially deleterious variants from intronic regions of genes to be incorporated into analyses. This work was undertaken using whole-genome sequencing data from the UK10K project. Rare variants from the KEGG pathway for arginine and proline metabolism were collectively associated with systolic blood pressure (P=3.32x10-5) based on analyses using the optimal sequence kernel association test. Variants along this pathway also showed evidence of replication using imputed data from the Avon Longitudinal Study of Parents and Children cohort (P=0.02). Subsequent analyses found that the strength of evidence diminished when analysing genes in this pathway individually, suggesting that they would have been overlooked in a conventional gene-based analysis. Future studies that adopt similar approaches to investigate polygenic effects should yield value in better understanding the genetic architecture of complex disease.Entities:
Mesh:
Year: 2016 PMID: 27577545 PMCID: PMC5136291 DOI: 10.1038/ejhg.2016.113
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Figure 1A flowchart illustrating the pipeline for analysis in this study. CADD, combined annotation dependent depletion tool; SKAT-O, optimal sequence kernel association tool; ALSPAC, avon longitudinal study of parents and children.
Systolic blood pressure results using a minor allele frequency cutoff of 1%
| KEGG: arginine and proline metabolism | 115 | 3.32 × 10−5 | 7.42 × 10−4 | 0.04 |
| Reactome: amino acid synthesis and interconversion transamination | 26 | 9.69 × 10−4 | 0.04 | 0.06 |
| KEGG: antigen processing and presentation | 18 | 5.78 × 10−3 | 0.27 | 0.02 |
| Reactome: synthesis secretion and deacylation of ghrelin | 13 | 6.12 × 10−3 | 0.04 | 0.13 |
| KEGG: tight junction | 412 | 8.48 × 10−3 | 0.29 | 0.36 |
| KEGG: Jak Stat signalling pathway | 90 | 0.01 | 0.34 | 0.04 |
| BioCarta: cytokine pathway | 5 | 0.01 | 1.91 × 10−3 | 0.48 |
| Reactome: regulation of IFNG signalling | 20 | 0.01 | 0.38 | 0.02 |
| Reactome: assembly of the pre replicative comple × | 97 | 0.02 | 0.12 | 0.12 |
| BioCarta: RAB pathway | 9 | 0.02 | 0.15 | 0.05 |
Variants=number of variants, UK10K P-value=SKAT-O P-value for entire UK10K sample, Twins P-value, SKAT-O P-value for only TwinsUK individuals, ALSPAC P-value=SKAT-O P-value for only ALSPAC individuals.
Figure 2A Manhattan plot illustrating gene-based collapsing of rare variants (MAF≤1%) genome-wide using SKAT-O with systolic blood pressure. Red annotated points represent genes which reside along the arginine and proline pathway.
Figure 3Circos plot representing the genomic location of Genes in the Arginine and Proline pathway and how their protein products interact according to Stringdb. Stringdb suggested that the protein products of three genes along the Arginine and Proline pathway had the most interactions based on experimental evidence. The following three subnetworks are based on the interactions for each of these genes (CPS1, PRODH2 and GLUD1): CPS1, NOS1, NOS2, NOS3, ALDH9A1, ARG2, ALDH18A1, ALDH1B1, ALDH2, ALDH7A1, ALDH4A1, GOT1 and GOT2. PRODH2, NOS1, NOS2, NOS3, PYCR1, ALDH9A1, ALDH4A1, ALDH2, ALDH7A1 and ALDH1B1. GLUD1, CKM, CKMT2, CKB, ALDH1B1, ALDH7A1, ALDH2, ALDH9A1, ALDH4A1 and GOT2. Red, blue and purple links are used to differentiate the three subnetworks.
Subnetwork analysis for the KEGG arginine and proline metabolism pathway with systolic blood pressure
| P | P | P | ||||
|---|---|---|---|---|---|---|
| 57 | 1.58 × 10−4 | 38 | 4.00 × 10−3 | 40 | 0.03 | |
| 25 | 0.02 | 18 | 0.16 | 20 | 0.15 | |
| 34 | 1.30 × 10−3 | 25 | 0.02 | 26 | 0.06 | |
Variants=number of variants analysed, UK10K=all individuals within UK10K project, TwinsUK=only TwinsUK partition of the UK10K project, ALSPAC=only ALSPAC partition of the UK10K project, P-value=P-values according to SKAT-O test.