| Literature DB >> 32093692 |
Nan Jiang1, Sungyoung Lee2, Taesung Park3,4.
Abstract
BACKGROUND: Genome-wide association studies (GWAS) have been widely used to identify phenotype-related genetic variants using many statistical methods, such as logistic and linear regression. However, GWAS-identified SNPs, as identified with stringent statistical significance, explain just a small portion of the overall estimated genetic heritability. To address this 'missing heritability' issue, gene- and pathway-based analysis, and biological mechanisms, have been used for many GWAS studies. However, many of these methods often neglect the correlation between genes and between pathways.Entities:
Keywords: Common variants; Genome-wide association study; Hierarchical components; Pathway analysis
Year: 2020 PMID: 32093692 PMCID: PMC7038534 DOI: 10.1186/s12920-019-0650-0
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Demographic variables for KARE cohort (T2D)
| T2D subjects | Normal subjects | |
|---|---|---|
| Area (Ansan/Ansung) | 673/615 | 1607/2080 |
| Gender (Male/Female) | 671/617 | 1679/2008 |
| Age (Mean ± SD) | 55.92(± 8.80) | 49.88(± 8.31) |
| BMI (Mean ± SD) | 25.54(± 3.27) | 24.10(± 2.90) |
| Number of subjects | 1288 | 3687 |
SD standard deviation, BMI body mass index
Basic characteristics of study subjects
| (a) Basic characteristics of hypertensive cases and normotensive controls | ||
| HT subjects | Normal subjects | |
| Area(Ansan/Ansung) | 1204/804 | 1756/2813 |
| Gender(Male/Female) | 916/1092 | 2065/2504 |
| Age(Mean ±SD) | 56.74(± 8.42) | 49.43(± 8.09) |
| BMI(Mean ±SD) | 25.62(± 3.27) | 24.03(± 2.94) |
| Number of subjects | 2008 | 4569 |
| (b) Basic characteristics of subjects for blood pressure analysis | ||
| Subjects | ||
| Area(Ansan/Ansung) | 3591/4225 | |
| Gender(Male/Female) | 3784/4032 | |
| Age(Mean ±SD) | 51.45(± 8.74) | |
| BMI(Mean ±SD) | 24.40(± 3.07) | |
| SBP(Mean ±SD) | 115.56(± 17.22) | |
| DBP(Mean ±SD) | 74.11(± 11.24) | |
| Number of subjects | 7816 | |
SD standard deviation, BMI body mass index, SBP systolic blood pressure, DBP diastolic blood pressure
Fig. 1A schematic diagram of HisCoM-PCA y is the phenotype of subject; pathway means k pathway; Gene − PC denotes i PC of the t gene in k pathway; SNP is r SNP of the t gene in k pathway; l denotes a loading value of SNP from PCA; w denotes the weight assigned to Gene − PC; β denote the coefficient connecting the k pathway to the phenotype
Fig. 2Empirical type I errors and powers of HisCoM-PCA and other methods (a) Empirical type I errors of HisCoM-PCA, sARTP, two versions of MAGMA, and GSA-SNP2. Empirical type I error indicates the times of identifying a null pathway among 200 replicates. (b) Empirical powers of HisCoM-PCA, competitive version of MAGMA and GSA-SNP2. Empirical power indicates the times of identifying VEGF pathway among 200 replicates
Pathways identified for T2D
| Pathway | HisCoM-PCAa | HisCoM-PCAb | MAGMA | GSA-SNP2 |
|---|---|---|---|---|
| folate biosynthesis | 0.004 (0.1518) | 0.002 (0.0633) | 0.0537 (0.891) | 0.0019 (0.2936) |
| hedgehog signaling pathway | 0.006 (0.1627) | 0.016 (0.2531) | 0.3073 (0.917) | 0.028 (0.5284) |
| olfactory transduction* | 0.006 (0.1627) | 0.002 (0.0633) | 0.0482 (0.891) | 0.0036 (0.2936) |
| biosynthesis of unsaturated fatty acids | 0.01 (0.2373) | 0.012 (0.2071) | 0.089 (0.891) | 0.0373 (0.5284) |
| Alzheimer’s disease | 0.014 (0.2657) | 0.036 (0.4555) | 0.1808 (0.891) | 0.0446 (0.5284) |
| calcium signaling pathway | 0.014 (0.2657) | 0.026 (0.3796) | 0.0791 0.891) | 0.0329 (0.5284) |
| asthma | 0.016 (0.2761) | 0.008 (0.1687) | 0.0517 0.891) | 0.0737 (0.6529) |
| acute myeloid leukemia | 0.032 (0.4302) | 0.042 (0.4621) | 0.0781 (0.891) | 0.5442 (1) |
| melanogenesis | 0.034 (0.4302) | 0.012 (0.2071) | 0.0786 (0.891) | 0.0223 (0.5284) |
| long term potentiation | 0.04 (0.4466) | 0.028 (0.3796) | 0.1284 (0.891) | 0.1616 (0.8349) |
| phosphatidylinositol signaling system | 0.1119 (0.5185) | 0.006 (0.1424) | 0.7656 (0.9778) | 0.1107 (0.7355) |
| dilated cardiomyopathy | 0.03 (0.4302) | 0.0819 (0.5188) | 0.3994 (0.917) | 0.0408 (0.5284) |
| renin angiotensin system | 0.038 (0.4466) | 0.0539 (0.4621) | 0.3794 (0.917) | 0.0917 (0.6923) |
HisCoM-PCAa is HisCoM-PCA with the first PC of each gene. HisCoM-PCAb is HisCoM-PCA with the PCs whose cumulative proportion of variance is more than 30%. MAGMA is competitive version of MAGMA. The q-value is the FDR corrected p-value. Pathway with “*” was identified by 3 methods
Pathways identified for BP
| Pathway | HisCoM-PCAa | HisCoM-PCAb | MAGMA | GSA-SNP2 |
|---|---|---|---|---|
| P value (q-value) | ||||
| inositol phosphate metabolism | 0.002 (0.0633) | 0.004 (0.1265) | 0.1966 (1) | 9.008e-06 (0.0017) |
| phosphatidylinositol signaling system | 0.002(0.0633) | 0.004 (0.1265) | 0.2256 (1) | 1e-04 (0.013) |
| ubiquitin mediated proteolysis | 0.002 (0.0633) | 0.002 (0.0949) | 0.1076 (1) | 0.5963 (0.9839) |
| calcium signaling pathway | 0.008 (0.2169) | 0.006 (0.1627) | 0.0994 (1) | 0.0228 (0.4708) |
| neurotrophin signaling pathway | 0.01 (0.2373) | 0.01 (0.2373) | 0.6801 (1) | 0.3157 (0.971) |
| epithelial cell signaling in helicobacter pylori infection | 0.012 (0.2531) | 0.05 (0.4126) | 0.0826 (1) | 0.1245 (0.7719) |
| complement and coagulation cascades | 0.014 (0.2657) | 0.03 (0.3417) | 0.1428 (1) | 0.3405 (0.9839) |
| maturity onset diabetes of the young | 0.018 (0.3106) | 0.036 (0.3417) | 0.067 (1) | 0.0239 (0.4708) |
| snare interactions in vesicular transport | 0.022 (0.348) | 0.012 (0.2531) | 0.2071 (1) | 0.0653 (0.5878) |
| hypertrophic cardiomyopathy (HCM) * | 0.1259 (0.5573) | 0.03 (0.3417) | 0.0074(1) | 3e-04 (0.0157) |
HisCoM-PCAa is HisCoM-PCA with the first PC of each gene. HisCoM-PCAb is HisCoM-PCA with the PCs whose cumulative proportion of variance is more than 30%. MAGMA is competitive version of MAGMA. The q-value is the FDR corrected p-value. Pathway with “*” was identified by 3 methods