| Literature DB >> 32566740 |
Zachary D Wallen1, Mary Appah1, Marissa N Dean1, Cheryl L Sesler1, Stewart A Factor2, Eric Molho3, Cyrus P Zabetian4, David G Standaert1, Haydeh Payami1.
Abstract
In Parkinson's disease (PD), gastrointestinal features are common and often precede the motor signs. Braak and colleagues proposed that PD may start in the gut, triggered by a pathogen, and spread to the brain. Numerous studies have examined the gut microbiome in PD; all found it to be altered, but found inconsistent results on associated microorganisms. Studies to date have been small (N = 20 to 306) and are difficult to compare or combine due to varied methodology. We conducted a microbiome-wide association study (MWAS) with two large datasets for internal replication (N = 333 and 507). We used uniform methodology when possible, interrogated confounders, and applied two statistical tests for concordance, followed by correlation network analysis to infer interactions. Fifteen genera were associated with PD at a microbiome-wide significance level, in both datasets, with both methods, with or without covariate adjustment. The associations were not independent, rather they represented three clusters of co-occurring microorganisms. Cluster 1 was composed of opportunistic pathogens and all were elevated in PD. Cluster 2 was short-chain fatty acid (SCFA)-producing bacteria and all were reduced in PD. Cluster 3 was carbohydrate-metabolizing probiotics and were elevated in PD. Depletion of anti-inflammatory SCFA-producing bacteria and elevated levels of probiotics are confirmatory. Overabundance of opportunistic pathogens is an original finding and their identity provides a lead to experimentally test their role in PD.Entities:
Keywords: Genomics; Parkinson's disease
Year: 2020 PMID: 32566740 PMCID: PMC7293233 DOI: 10.1038/s41531-020-0112-6
Source DB: PubMed Journal: NPJ Parkinsons Dis ISSN: 2373-8057
Fig. 1The gut microbiome compositions of the two dataset differed significantly.
Principal component (PC) analysis was used to generate the graphs for PD cases (left, N = 522), controls (middle, N = 316), and cases and controls combined (right, N = 838), where each point represents the composition of the gut microbiome of one individual and distances indicate degree of similarity to other individuals. Percentages on the x-axis and y-axis correspond to the percent variation in gut microbiome compositions explained by PC1 and PC2. The difference between dataset 1 and dataset 2 was formally tested using PERMANOVA and was significant (P < 1E-5). Dataset 1: red (Albany, NY), purple (Seattle, WA), and green (Atlanta, GA). Dataset 2: blue (Birmingham, AL).
Effect of PD and other key variables on the global composition of gut microbiome.
| Dataset 1 | Dataset 2 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Aitchison | GUniFrac | Canberra | Aitchison | GUniFrac | Canberra | |||||||
| %var | %var | %var | %var | %var | %var | |||||||
| 0.71 | <1E − 05 | 1.38 | <1E − 05 | 0.57 | <1E − 05 | 0.56 | <1E − 05 | 0.89 | <1E − 05 | 0.38 | <1E − 05 | |
| Geography (Seattle, Atlanta, Albany) | 0.99 | 2E − 03 | 1.10 | 0.02 | 0.84 | 2E − 03 | – | – | – | – | – | – |
| PD (case vs. control) | 0.58 | 1E − 03 | 1.12 | 7E − 05 | 0.53 | 4E − 05 | 0.48 | <1E − 5 | 0.62 | 9E − 05 | 0.32 | 2E − 05 |
| Sex (male vs. female) | 0.51 | 9E − 03 | 0.52 | 0.08 | 0.49 | 2E − 04 | 0.48 | 2E − 05 | 0.49 | 2E − 03 | 0.34 | 2E − 05 |
| Age (continuous) | 0.45 | 0.04 | 0.76 | 5E − 03 | 0.43 | 0.01 | 0.45 | <1E − 5 | 0.62 | 1E − 04 | 0.34 | 3E − 05 |
| GI discomfort on day of stool collection (yes vs. no) | 0.45 | 0.04 | 0.40 | 0.26 | 0.43 | 9E − 03 | 0.24 | 0.2 | 0.22 | 0.39 | 0.23 | 0.2 |
| Fruits or vegetables daily (yes vs. no) | 0.38 | 0.3 | 0.55 | 0.05 | 0.42 | 0.02 | – | – | – | – | – | – |
| Constipation in the past 3 months (yes vs. no) | 0.34 | 0.77 | 0.38 | 0.35 | 0.37 | 0.39 | 0.26 | 0.06 | 0.38 | 0.02 | 0.24 | 0.05 |
| BMI (continuous) | 0.40 | 0.21 | 0.48 | 0.12 | 0.39 | 0.13 | 0.33 | 3E − 03 | 0.34 | 0.04 | 0.27 | 6E − 03 |
| Drinks alcohol (yes vs. no) | 0.35 | 0.66 | 0.31 | 0.64 | 0.37 | 0.35 | 0.26 | 0.07 | 0.28 | 0.15 | 0.24 | 0.1 |
| Lost >10 pounds in past year (yes vs. no) | 0.34 | 0.71 | 0.36 | 0.42 | 0.36 | 0.64 | 0.20 | 0.87 | 0.15 | 0.91 | 0.21 | 0.71 |
| Stool sample travel time (continuous) | 0.35 | 0.66 | 0.70 | 0.01 | 0.36 | 0.58 | 0.23 | 0.26 | 0.3 | 0.09 | 0.24 | 0.11 |
| PD not on levodopa vs. control | 0.93 | 0.01 | 1.12 | 0.04 | 0.78 | 0.02 | 0.48 | 0.17 | 0.54 | 0.16 | 0.47 | 0.11 |
| PD not on COMT inhibitors vs. control | 0.66 | 9E − 05 | 1.27 | <1E − 05 | 0.56 | <1E − 05 | 0.55 | <1E − 05 | 0.88 | <1E − 05 | 0.38 | <1E − 05 |
| PD not on anticholinergics vs. control | 0.73 | <1E − 05 | 1.31 | <1E − 05 | 0.58 | <1E − 05 | 0.57 | <1E − 05 | 0.92 | 2E − 05 | 0.39 | <1E − 05 |
| PD not on MAO-B inhibitors vs. control | 0.81 | <1E − 05 | 1.50 | 3E − 05 | 0.66 | <1E − 05 | 0.71 | <1E − 05 | 1.07 | <1E − 05 | 0.45 | <1E − 05 |
| PD not on dopamine agonists vs. control | 0.81 | 2E − 04 | 1.51 | 3E − 05 | 0.70 | <1E − 05 | 0.57 | 1E − 04 | 0.80 | 3E − 04 | 0.44 | 4E − 05 |
| PD not on amantadine vs. control | 0.73 | 3E − 05 | 1.37 | <1E − 05 | 0.60 | <1E − 05 | 0.48 | 3E − 05 | 0.74 | 3E − 05 | 0.37 | <1E − 05 |
| PD not on any PD drug vs. control | 1.00 | 0.07 | 0.89 | 0.22 | 0.82 | 0.06 | 0.48 | 0.58 | 0.52 | 0.37 | 0.48 | 0.79 |
Model A tested PD vs. control without any other variable in the model. Sample size for Model A was 201 cases and 132 controls in dataset 1 and 323 cases and 184 controls in dataset 2. Model B included 11 variables (including case/control) and each variable was tested while adjusting for the other 10, without priority. Model B included subset of samples that had complete data on all 11 variables: N = 160 cases and 111 controls in dataset 1 and 283 cases, and 167 controls in dataset 2. For Model C, patients were stratified by each PD medication they were taking at the time of stool collection; those not on medication (varying N for different medications, see Supplementary Table 1) were tested against controls (N = 132 in dataset 1 and 184 in dataset 2). Power was low for patients not on L-dopa (N patients <50) and patients not on any PD medication (<20) due to small sample sizes, but not for other medications (N patients not on medication = 88–179 in dataset 1 and 153–312 in dataset 2). All analyses were repeated with three different distance measures: Aitchison, Canberra, and GUniFrac (generalized UniFrac). % var was the inter-individual variation explained by each variable. P-value was calculated using 99,999 permutations, setting the highest achievable significance at P = 1E − 05.
PD-associated genera identified via MWAS.
| PD-associated genera | MWAS significant In Dataset 1 | MWAS significant In Dataset 2 | Cluster | PubMed | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phylum | Class | Order | Family | Genus | MRA | FC | ANCOM (W) | KW (FDR) | MRA | FC | ANCOM (W) | KW (FDR) | ||
| 0.001 | 4.20 | 406 | 1E − 03 | 0.001 | 2.94 | 468 | 2E − 02 | 1 | Opp path | |||||
| 0.002 | 2.56 | 400 | 6E − 03 | 0.001 | 4.39 | 463 | 2E − 02 | 1 | Opp path | |||||
| 0.001 | 1.96 | 360 | 1E − 02 | 0.002 | 2.53 | 465 | 8E − 03 | 1 | Opp path | |||||
| 0.06 | 0.63 | 411 | 1E − 03 | 0.04 | 0.66 | 535 | 3E − 03 | 2 | SCFA | |||||
| 0.04 | 0.53 | 441 | 2E − 04 | 0.02 | 0.56 | 545 | 6E − 05 | 2 | SCFA | |||||
| 0.02 | 0.68 | 410 | 2E − 03 | 0.02 | 0.79 | 533 | 4E − 02 | 2 | SCFA | |||||
| 0.02 | 0.48 | 391 | 4E − 03 | 0.01 | 0.60 | 541 | 3E − 04 | 2 | SCFA | |||||
| 0.004 | 0.56 | 388 | 2E − 02 | 0.005 | 0.69 | 521 | 3E − 02 | 2 | SCFA | |||||
| 0.004 | 0.80 | 426 | 1E − 03 | 0.005 | 0.68 | 521 | 1E − 02 | 2 | SCFA | |||||
| 0.002 | 0.66 | 382 | 7E − 03 | 0.002 | 0.68 | 505 | 6E − 02 | 2 | SCFA | |||||
| 0.001 | 0.37 | 418 | 2E − 04 | 0.001 | 0.59 | 538 | 6E − 04 | 2 | NC | |||||
| 0.001 | 0.48 | 384 | 2E − 02 | 0.001 | 0.38 | 544 | 1E − 05 | 2 | NC | |||||
| 0.0006 | 0.65 | 367 | 2E − 02 | 0.0005 | 0.64 | 525 | 1E − 02 | 2 | NC | |||||
| 0.01 | 1.83 | 410 | 1E − 03 | 0.01 | 2.72 | 553 | 6E − 07 | 3 | Probiotic | |||||
| 0.0004 | 6.61 | 407 | 2E − 04 | 0.004 | 1.57 | 458 | 1E − 02 | 3 | Probiotic | |||||
MWAS was conducted in two datasets independently, testing differential abundance of genera in PD vs. controls, using two statistical methods (ANCOM and KW). The 15 genera shown are those that achieved microbiome-wide significance for association with PD in both datasets and by both methods, with (ANCOM) and without (KW) covariate adjustment (see “Methods” for covariates). Sample size: ANCOM included subset of subjects for whom complete data were available on all covariates tested: N = 171 cases and 117 controls in dataset 1 and 306 cases and 177 controls in dataset 2. KW included all subjects: N = 201 cases and 132 controls in dataset 1, and 323 cases and 184 controls in dataset 2. Clusters were identified hypothesis-free using correlation network analysis (Fig. 3). PubMed search was conducted after analyses were completed using genus and species name as search term (Supplementary Table 6). Function (opportunistic pathogen, SCFA, probiotic) was taken strictly from PubMed and is likely oversimplified. Microbiota have been studied under a narrow lens of what is already known about them. Opportunistic pathogens are often looked for in clinical specimen with infection, SCFA bacteria are studied intensively for their anti-inflammatory and other protective effects, and probiotics are understudied but highly advertised. The full function of the microbiota are not yet fully understood. In comparing results across published studies, note that a “genus” classified by one study may not be the same as the genus by the same name in another study. Taxonomic classifications and nomenclature are not standardized across reference databases, e.g., “Prevotella”, as annotated in some databases including NCBI, is further divided by SILVA (used here) into several non-monophyletic groups that SILVA calls, Prevotella_2, Prevotella_6, Prevotella_7, Prevotella_9, and Prevotella (see Discussion).
ANCOM analysis of composition of microbiomes. FC fold change in patients (MRA in patients/MRA in controls). FDR Benjamini–Hochberg false discovery rate (multiple testing corrected P-value). KW Kruskal–Wallis. MWAS microbiome-wide association study. MRA mean relative abundance in controls. NC not uncultured (uncharacterized). Opp path opportunistic pathogen (often commensal microorganism that can become pathogenic in immune-compromised individuals). Probiotic carbohydrate-metabolizing bacteria commonly known as probiotics. SCFA short-chain fatty acid-producing bacteria. W ANCOM score indicating the number of times a genus achieved FDR 0.05 as compared with other genera (maximum W possible: 444 in dataset 1, 560 in dataset 2, threshold 0.8 was used for significance, all shown genera were above significance threshold).
Fig. 2Differential abundances of 15 PD-associated genera replicated in two datasets.
Relative abundances in PD cases (blue) and controls (orange) were plotted as log10 scale on the y-axis. Sample size was 201 cases and 132 controls in dataset 1, and 323 cases and 184 controls in dataset 2. Each dot represents a sample, plotted according to the relative abundance of the genus in the sample. The notch in each box indicates the confidence interval of the median. The bottom, middle, and top boundaries of each box represent the first, second (median), and third quartiles of the relative abundances. The whiskers (lines extending from the top and bottom of the box and ending in horizontal cap) extend to points within 1.5 times the interquartile range. The points extending above the whiskers are outliers.
Fig. 3Correlation network analysis mapped PD-associated genera to three polymicrobial clusters.
Pairwise correlations in relative abundances were calculated for all genera microbiome-wide and were used to detect clusters of co-occurring microorganisms. To display, we used an arbitrary correlation coefficient threshold at r ≥ |0.4| to connect the genera that were correlated. All correlations noted were significant at P < 3E − 4 (the limit for 3000 permutations). Here we show the result for PD cases in dataset 2, because it had larger sample size (N = 323 cases) and greater sequencing depth than dataset 1 (see Supplementary Fig. 1 for cases and controls in dataset 1 and dataset 2). a Algorithm-detected clusters shown in different colors. b The algorithm-detected clusters, as in a but shown in gray, and PD-associated genera highlighted in blue (if increased in PD) or red (if decreased in PD). c Zoomed in version of b. The 15 PD-associated genera fell in three clusters. Cluster 1 was a tightly correlated cluster of microorganisms (r approaching 0.8), which included Porphyromonas, Prevotella, and Corynebacterium_1 (all elevated in PD). Cluster 2 included the ten genera that were reduced in PD, eight of which are shown connected at r ≥ 0.4, and two are unconnected but correlated significantly (P = 3E − 4) with the others in the cluster at r = 0.25 and r = 0.35. Lactobacillus and Bifidobacterium (correlated at r = 0.33 (P < 3E − 4)) were denoted cluster 3. For unconnected genera (r < 0.4), the proximity between nodules does not imply relatedness, e.g., Oscillospira (M) falls closer to Lactobacillus (N) than to Roseburia (G) but it is correlated significantly with Roseburia (r = 0.25, P < 3E − 4) and not with Lactobacillus (r = 0.04, P = 0.44).