| Literature DB >> 25329069 |
Jose A Seoane1, Colin Campbell2, Ian N M Day1, Juan P Casas3, Tom R Gaunt4.
Abstract
Genome-wide association studies have identified a wealth of genetic variants involved in complex traits and multifactorial diseases. There is now considerable interest in testing variants for association with multiple phenotypes (pleiotropy) and for testing multiple variants for association with a single phenotype (gene-based association tests). Such approaches can increase statistical power by combining evidence for association over multiple phenotypes or genetic variants respectively. Canonical Correlation Analysis (CCA) measures the correlation between two sets of multidimensional variables, and thus offers the potential to combine these two approaches. To apply CCA, we must restrict the number of attributes relative to the number of samples. Hence we consider modules of genetic variation that can comprise a gene, a pathway or another biologically relevant grouping, and/or a set of phenotypes. In order to do this, we use an attribute selection strategy based on a binary genetic algorithm. Applied to a UK-based prospective cohort study of 4286 women (the British Women's Heart and Health Study), we find improved statistical power in the detection of previously reported genetic associations, and identify a number of novel pleiotropic associations between genetic variants and phenotypes. New discoveries include gene-based association of NSF with triglyceride levels and several genes (ACSM3, ERI2, IL18RAP, IL23RAP and NRG1) with left ventricular hypertrophy phenotypes. In multiple-phenotype analyses we find association of NRG1 with left ventricular hypertrophy phenotypes, fibrinogen and urea and pleiotropic relationships of F7 and F10 with Factor VII, Factor IX and cholesterol levels.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25329069 PMCID: PMC4199483 DOI: 10.1371/journal.pcbi.1003876
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Hive plot for single gene/single phenotype.
The vertical axis represents the association value (higher, more association). The left axis represents phenotypes and the right axis represents genes. An interactive hive plot is published on the project webpage (http://pleioexp.epi.bris.ac.uk/cca/1gene1phenhive.html).
Single gene/single phenotype association.
| Gene | Entrez gene# | Phenotype | P-value | Publication |
|
| 80128 | Serum Magnesium | 4.63E-010 | Not Reported |
|
| 4582 | Serum Magnesium | 1.37E-014 | 20700443 |
|
| 7361, 54577, 54575, 54600, 54579, 54657 | Bilirubin | 1.29E-112 | 19419973,21646302 |
|
| 28 | Von Willebrand Factor | 9.43E-112 | 21534939 |
|
| 28 | Factor VIII | 2.29E-077 | 23381943 |
|
| 28 | ALP | 4.35E-012 | 18940312 |
|
| 2155 | Factor VII | 1.49E-079 | 17903294 |
|
| 2153 | Ratio APC/APTT | 2.77E-062 | 23188048 |
|
| 2159 | Factor VII | 1.24E-060 | 17903294 |
|
| 6836 | Von Willebrand Factor | 5.66E-037 | 21534939 |
|
| 6836 | Factor VIII | 1.57E-024 | Not Reported |
|
| 6836 | ALP | 3.10E-009 | Not Reported |
|
| 2161 | APTT | 5.77E-035 | 23188048 |
|
| 5819 | LDL cholesterol | 2.26E-016 | 19913121 |
|
| 5819 | Total cholesterol | 1.12E-008 | 19913121 |
|
| 1071 | HDL cholesterol | 2.55E-017 | 18193044 |
|
| 116519 | Triglycerides | 2.32E-013 | 19913121 |
|
| 428256 | Triglycerides | 8.56E-012 | 19913121 |
|
| 84811 | Triglycerides | 5.40E-011 | 19913121 |
|
| 3273 | APTT | 2.34E-012 | 23188048 |
|
| 3273 | APC/APTT | 2.56E-012 | 23188048 |
|
| 3077 | Haemoglobin | 9.76E-012 | 19862010 |
|
| 2099 | Triglycerides | 1.84E-011 | 16099331 |
|
| 3827 | APTT | 1.84E-011 | 20303064 |
|
| 7036 | Haemoglobin | 1.04E-009 | 19862010 |
|
| 10560 | APC/APTT | 2.21E-009 | 23188048 |
|
| 346 | LDL cholesterol | 8.17E-009 | 23119086 |
|
| 4905 | Triglycerides | 2.18E-008 | Not Reported |
|
| 55902 | Factor VII | 2.18E-008 | 20231535 |
|
| 2056 | Haemoglobin | 2.31E-008 | 19862010 |
|
| 343263 | LDL cholesterol | 3.19E-008 | Not Reported |
|
| 6296 | Cornell product | 2.38E-008 | Not Reported |
|
| 112479 | Cornell product | 7.87E-009 | Not Reported |
|
| 8807 | Cornell product | 1.07E-008 | Not Reported |
|
| 8807 | QRS vol. product | 1.72E-010 | Not Reported |
|
| 149233 | Cornell product | 3.30E-012 | 23108651 |
|
| 3084 | Cornell product | 2.71E-014 | Not Reported |
|
| 3084 | QRS vol. product | 2.97E-011 | Not Reported |
This table shows single phenotype/single genotype association gene-centered association value. The first column represents genes, the second column phenotypes, the third column represents the CCA association value and the fourth column is the Pubmed ID if the association has been previously reported.
Single gene/multiple phenotype association.
| Gene | Entrez gene# | CCA p-value | Fisher p-value | Phenotypes |
|
| 28 | 3.47E-147 | 1.53E-194 | Creatinine (0.2771) ALP (4.35e-12) FVIII (2.288e-77) Tissue plasminogen activator (0.002849) Von Willebrand Factor (9.43e-112) |
|
| 2155 | 4.42E-114 | 7.27E-078 | Cholesterol (0.9719) HDL cholesterol (0.1073) FVII (1.486e-79) FIX (0.5965) |
|
| 2159 | 3.58E-079 | 1.61E-059 | Cholesterol (0.8131) FVII (1.24e-60) FIX (0.8744) |
|
| 6836 | 2.02E-056 | 1.98E-066 | Mean cell volume (0.01924) ALP (3.104e-09) GGT (0.7735) E vitamin_t1 (t1) (0.1303) E vitamin_t2 (0.03584) FVIII (1.573e-24) D-dimer (0.4159) Tissue plasminogen activator (0.1705) Von Willebrand Factor (5.655e-37) MMP9 (0.006049) BMI (0.1158) PR interval (0.04652) Cornell index (0.001895) |
|
| 3084 | 1.03E-028 | 4.73E-019 | Urea (0.005643) beta-carotene (0.001871) fibrinogen (0.007379) QRS duration (0.0008578) Cornell index (0.651) Cornell product (2.709e-14) |
|
| 3273 | 8.02E-024 | 1.03E-022 | Haematocrit (0.05769) Total protein (0.007418) FIX (0.3289) APTT (2.342e-12) Ratio APC/APTT (2.555e-12) |
|
| 8807 | 5.07E-021 | 1.52E-017 | White blood count (0.1395) Haematocrit (0.04132) Neutrophils (0.4115) Lymphocytes (0.831) ALT (0.0287) Glucose (0.06899) FIX (0.03512) Cornell index (0.4867) Cornell product (1.072e-08) Sokolowlyn index (0.2884) QRS voltage product (1.723e-10) |
|
| 1071 | 3.65E-020 | 5.94E-020 | HBA1C (0.01231) HDL cholesterol (2.552e-17) Glucose (0.06243) FVII (0.0178) Von Willebrand Factor (0.003123) CD40 (0.01286) |
|
| 241 | 3.49E-018 | 7.28E-011 | White blood count (0.2124) Mean cell volume (0.004036) Neutrophils (0.007984) Urate (0.02255) Phosphate (0.07024) Total protein (0.002632) C Vitamin (0.06383) Tissue plasminogen activator (0.06189) CD40 (0.02214) BMI (0.03787) Height (0.002794) QTC interval (0.08854) Cornell index (0.1151) QRS voltage sum (0.2935) QRS voltage prod (0.08101) |
|
| 149233 | 2.19E-018 | 1.09E-012 | Adiponectin (0.04484) IL18 (0.01227) QRS duration (0.1137) Cornell index (0.2268) Cornell product (3.299e-12) |
|
| 84059 | 3.37E-008 | 1.31E-006 | White blood count (0.005413) Mean platelet volume (0.1353) Lymphocytes (0.2363) Potassium (0.08101) Sodium (0.05957) Total protein (0.02314) E vitamin (0.03851) Insulin (0.05201) Adiponectin (0.1668) TNFA (0.2093) homa-score (0.2542) CVD (0.02455) |
This table shows single gene/multiple phenotype association. The first column represents the gene, the second column represents the CCA association value, the third column is the Fisher's combined association value and the fourth column the phenotypes associated. In parentheses is the single phenotype association value of each phenotype.
Figure 2Heatmap single gene/multiple phenotype.
This figure shows the 22 most important association rules as columns in a heatmap. Note that only one member of the UGT1A family is listed and rules for the same gene are summarized in the same column.
Figure 3Hive plot single gene/multiple phenotype.
This figure shows a hive plot for gene/phenotype association rules. The vertical axis represents the association rules (higher, more association). The left axis represents phenotypes and the right axis represents genes. An interactive hive plot is published on the project webpage (http://pleioexp.epi.bris.ac.uk/cca/geneNphenHive.html).
Multiple gene/single phenotype association table.
| Phenotype | CCA p-value | Fisher p-value | Genes |
| Von Willebrand Factor | 2.48E-119 | 3.12E-115 |
|
| Bilirubin total | 3.79E-118 | 7.80E-113 |
|
| Factor VII | 8.49E-087 | 3.09E-083 |
|
| Factor VIII | 7.01E-082 | 1.28E-077 |
|
This table shows multiple gene/single phenotype association values. The first column is the phenotype, the second column is the CCA association value, the third column is Fisher's combined association value and the fourth column are the genes associated. In parentheses the single gene/single phenotype association value is given for each gene.
Multiple gene/multiple phenotype association table.
| Genes | CCA p-value | Phenotypes |
|
| <2.22E-308 | Bilirubin, Factor VII, Von Willebrand Factor |
|
| 1.35E-168 | ALP, Ratio APC/APTT, Von Willebrand Factor |
|
| 6.68E-153 | Bilirubin, Ratio APC/APTT |
|
| 1.56E-123 | Cholesterol, factor VII, factor IX, diastolic blood pressure |
This table shows multiple gene/multiple phenotypes association rules. The first column are the multiple genes, the second column the CCA association value and the third column the multiple phenotypes.