| Literature DB >> 33960523 |
Hirotaka Iwaki1,2,3, Hampton L Leonard1,2,3, Mary B Makarious3, Matt Bookman4, Barry Landin5, David Vismer5, Bradford Casey6, J Raphael Gibbs3, Dena G Hernandez3, Cornelis Blauwendraat3, Daniel Vitale1,2,3, Yeajin Song1,2,3, Dinesh Kumar7, Clifton L Dalgard8,9, Mahdiar Sadeghi7,10, Xianjun Dong11, Leonie Misquitta12, Sonja W Scholz13,14, Clemens R Scherzer11, Mike A Nalls1,2,3, Shameek Biswas15, Andrew B Singleton2,3.
Abstract
BACKGROUND: Whole-genome sequencing data are available from several large studies across a variety of diseases and traits. However, massive storage and computation resources are required to use these data, and to achieve sufficient power for discoveries, harmonization of multiple cohorts is critical.Entities:
Keywords: Parkinson's disease; clinical; genetics; open science
Mesh:
Year: 2021 PMID: 33960523 PMCID: PMC8453903 DOI: 10.1002/mds.28549
Source DB: PubMed Journal: Mov Disord ISSN: 0885-3185 Impact factor: 9.698
FIG 1WGS sample flowchart. WGS, whole‐genome sequencing; QC, quality control. [Color figure can be viewed at wileyonlinelibrary.com]
Whole‐genome sequenced participants
| Overall | BioFIND | HBS | PDBP | PPMI | |
|---|---|---|---|---|---|
| Total, n | 3941 | 172 | 867 | 1469 | 1433 |
| Sex and age | |||||
| Female, n (%) | 1725 (43.8) | 71 (41.3) | 372 (42.9) | 640 (43.6) | 642 (44.8) |
| Age at baseline (years), mean (SD) | 63.5 (10.7) | 67.1 (6.9) | 66.1 (10.1) | 64.0 (10.0) | 61.1 (11.7) |
| Self‐reported race | |||||
| White, n (%) | 3726 (94.6) | 161 (93.6) | 844 (97.3) | 1397 (95.1) | 1324 (92.5) |
| Mixed ancestry, n (%) | 65 (1.6) | 2 (1.2) | 2 (0.2) | 6 (0.4) | 55 (3.8) |
| Black or African American, n (%) | 63 (1.6) | 3 (1.7) | 10 (1.2) | 32 (2.2) | 18 (1.3) |
| Asian, n (%) | 34 (0.9) | 1 (0.6) | 7 (0.8) | 16 (1.1) | 10 (0.7) |
| Study arms | |||||
| Parkinson's disease, n (%) | 2005 (51.1) | 99 (57.6) | 640 (73.8) | 858 (58.9) | 408 (28.5) |
| Healthy control, n (%) | 963 (24.5) | 73 (42.4) | 227 (26.2) | 470 (32.3) | 193 (13.5) |
| Genetic cohort PD, n (%) | 179 (4.6) | 179 (12.5) | |||
| Genetic cohort unaffected, n (%) | 222 (5.7) | 222 (15.5) | |||
| Genetic registry PD, n (%) | 125 (3.2) | 125 (8.7) | |||
| Genetic registry unaffected, n (%) | 179 (4.6) | 179 (12.5) | |||
| Prodromal, n (%) | 64 (1.6) | 64 (4.5) | |||
| SWEDD, n (%) | 62 (1.6) | 62 (4.3) | |||
| Disease control, n (%) | 127 (3.2) | 127 (8.7) | |||
| Variant metrics | |||||
| MEAN_COVERAGE, median (Q1, Q3) | 33.9 (31.2, 36.2) | 35.0 (34.1, 35.7) | 33.4 (30.7, 36.3) | 33.3 (30.8, 36.3) | 34.2 (31.8, 36.4) |
| MEDIAN_COVERAGE, median (Q1, Q3) | 34.0 (32.0, 37.0) | 35.0 (35.0, 36.0) | 34.0 (31.0, 37.0) | 34.0 (31.0, 37.0) | 35.0 (32.0, 37.0) |
| READS/K, median (Q1, Q3) | 3519 (3254, 3760) | 3629 (3552, 3690) | 3489 (3198, 3765) | 3446 (3194, 3742) | 3565 (3322, 3789) |
| AVG_DP, median (Q1, Q3) | 35.2 (32.6, 37.6) | 36.3 (35.5, 36.9) | 34.9 (32.0, 37.7) | 34.5 (31.9, 37.4) | 35.7 (33.2, 37.9) |
| FREEMIX, median (Q1, Q3) |
| 0.0 (0.0, 0.0) | 0.0 (0.0, 0.0) | 0.0 (0.0, 0.0) | 0.0 (0.0, 0.0) |
AVG_DP, average depth; PD, Parkinson's disease; SWEDD, scan without evidence of dopamine deficit.
Pathogenic/LoF variant distribution of known PD genes
| Genes | Category queried | Number of variants in the category | Among all participants, n = 3074 carriers (Homo, Multi) | Among cases in Nongenetically enriched cohorts, n = 1365 carriers (Homo, Multi) | Among controls in Nongenetically enriched cohorts n = 736 carriers (Homo, Multi) |
|---|---|---|---|---|---|
|
| Pathogenic and LoF | 4 | 4 (0, 0) | 1 (0, 0) | 3 (0, 0) |
| Pathogenic+ and LoF | 4 | 4 (0, 0) | 1 (0, 0) | 3 (0, 0) | |
|
| Pathogenic | 3 | 8 (0, 0) | 6 (0, 0) | 2 (0, 0) |
| Pathogenic+ | 6 | 300 (8, 0) | 30 (1, 0) | 5 (0, 0) | |
|
| Pathogenic | 5 | 467 (7, 0) | 30 (1, 0) | 9 (0, 0) |
| Pathogenic+ | 5 | 467 (7, 0) | 30 (1, 0) | 9 (0, 0) | |
|
| Pathogenic and LoF | 2 | 2 (0, 0) | 1 (0, 0) | 1 (0, 0) |
| Pathogenic+ and LoF | 2 | 2 (0, 0) | 1 (0, 0) | 1 (0, 0) | |
|
| Pathogenic and LoF | 5 | 9 (0, 0) | 3 (0, 0) | 3 (0, 0) |
| Pathogenic+ and LoF | 5 | 9 (0, 0) | 3 (0, 0) | 3 (0, 0) | |
|
| Pathogenic and LoF | 7 | 204 (1, 2) | 82 (1, 1) | 56 (0, 0) |
| Pathogenic+ and LoF | 10 | 214 (1, 4) | 88 (1, 1) | 58 (0, 1) | |
|
| Pathogenic | 1 | 27 (0, 0) | 0 (0, 0) | 0 (0, 0) |
| Pathogenic+ | 1 | 27 (0, 0) | 0 (0, 0) | 0 (0, 0) | |
|
| Pathogenic | 2 | 3 (0, 0) | 2 (0, 0) | 1 (0, 0) |
| Pathogenic+ | 2 | 3 (0, 0) | 2 (0, 0) | 1 (0, 0) |
Homo, homozygous; Multi, multiple variants carriers; pathogenic, “pathogenic” in ClinVar; pathogenic+, clinical significance containing “pathogenic” in ClinVar; LoF, IMPACT “HIGH” in VEP annotation.
Queried but no variants in the model categories for FBXO7 and PARK7 (DJ‐1).
Cohort characteristics and polygenic risk score for European ancestry individuals
| Nongenetically enriched cohorts | Genetically enriched cohorts | |||||
|---|---|---|---|---|---|---|
| HC | PD | Prodromal | SWEDD | Unaffected | Affected | |
| n | 905 | 1905 | 58 | 57 | 369 | 295 |
| Female, n (%) | 471 (52.0) | 675 (35.4) | 13 (22.4) | 21 (36.8) | 221 (59.9) | 153 (51.9) |
| Inferred AJ, n (%) | 73 (8.1) | 144 (7.6) | 2 (3.4) | 4 (7.0) | 263 (71.3) | 203 (68.8) |
| Age at baseline, years | 63.6 (10.8) | 64.7 (9.5) | 69.3 (5.9) | 60.9 (10.4) | 56.1 (12.7) | 65.5 (10.9) |
| Education level | ||||||
| Less than 12 years, n (%) | 13 (1.4) | 58 (3.0) | 13 (22.4) | 10 (17.5) | 17 (4.6) | 35 (11.9) |
| 12–16 years, n (%) | 659 (72.8) | 1405 (73.8) | 19 (32.8) | 32 (56.1) | 131 (35.5) | 136 (46.1) |
| Greater than 16 years, n (%) | 233 (25.7) | 440 (23.1) | 25 (43.1) | 15 (26.3) | 219 (59.3) | 123 (41.7) |
| Latest case/control status | ||||||
| Case, n (%) | 3 (0.3) | 1887 (99.1) | 10 (17.2) | 50 (87.7) | 3 (0.8) | 292 (99.0) |
| Control, n (%) | 896 (99.0) | 2 (3.4) | 1 (1.8) | 352 (95.4) | 1 (0.3) | |
| Other (including prodromal state), n (%) | 6 (0.7) | 18 (0.9) | 46 (79.3) | 6 (10.5) | 14 (3.8) | 2 (0.7) |
| Polygenic risk score | ||||||
| 90 Common risk SNPs from Nalls et al (2019) | 0.0 (1.0) | 0.6 (1.1) | 0.1 (0.9) | 0.3 (0.9) | 2.7 (1.9) | 3.2 (2.0) |
| 83 SNPs from Nalls et al (2019) ‐ excluding 7 Variants in | 0.0 (1.0) | 0.6 (1.0) | 0.1 (0.9) | 0.3 (0.9) | 0.2 (1.0) | 0.5 (1.0) |
Mean (SD) if not specified. Polygenic risk scores were standardized by the mean and the standard deviation of the scores in healthy controls in general cohorts.
P values for the score differences from the healthy controls (nongenetically enriched cohorts) or the unaffected (genetically enriched cohorts) are shown.
AJ, Ashkenazi Jewish; HC, Healthy controls or unaffected participants in genetic cohort/registry; PD, Parkinson's disease; SWEDD, scan without evidence of dopaminergic deficit.
P < 0.05. (t test).
P < 0.01.
P < 0.001.
FIG 2Density plots for polygenic risk score. HC, healthy volunteers; PD, participants with Parkinson's disease; SWEDD, scan without evidence of dopamine deficit. PRS90 is a weighted sum of the independent risk loci reported in Nalls et al (2019). PRS83 is the same but removing the 7 variants in GBA, LRRK2, and SNCA regions. [Color figure can be viewed at wileyonlinelibrary.com]