| Literature DB >> 33303834 |
Natalja Kurbatova1, Manik Garg2, Luke Whiley3,4, Elena Chekmeneva3, Beatriz Jiménez3, María Gómez-Romero3, Jake Pearce3, Torben Kimhofer5, Ellie D'Hondt6, Hilkka Soininen7, Iwona Kłoszewska8, Patrizia Mecocci9, Magda Tsolaki10, Bruno Vellas11, Dag Aarsland12, Alejo Nevado-Holgado13, Benjamine Liu13, Stuart Snowden12, Petroula Proitsi12, Nicholas J Ashton12,14,15,16, Abdul Hye12, Cristina Legido-Quigley12, Matthew R Lewis3, Jeremy K Nicholson3,17, Elaine Holmes4,17,18, Alvis Brazma2, Simon Lovestone13,19.
Abstract
Finding early disease markers using non-invasive and widely available methods is essential to develop a successful therapy for Alzheimer's Disease. Few studies to date have examined urine, the most readily available biofluid. Here we report the largest study to date using comprehensive metabolic phenotyping platforms (NMR spectroscopy and UHPLC-MS) to probe the urinary metabolome in-depth in people with Alzheimer's Disease and Mild Cognitive Impairment. Feature reduction was performed using metabolomic Quantitative Trait Loci, resulting in the list of metabolites associated with the genetic variants. This approach helps accuracy in identification of disease states and provides a route to a plausible mechanistic link to pathological processes. Using these mQTLs we built a Random Forests model, which not only correctly discriminates between people with Alzheimer's Disease and age-matched controls, but also between individuals with Mild Cognitive Impairment who were later diagnosed with Alzheimer's Disease and those who were not. Further annotation of top-ranking metabolic features nominated by the trained model revealed the involvement of cholesterol-derived metabolites and small-molecules that were linked to Alzheimer's pathology in previous studies.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33303834 PMCID: PMC7730184 DOI: 10.1038/s41598-020-78031-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Overview of study participants.
| CTL | sMCI | cMCI | AD | Total | |
|---|---|---|---|---|---|
| 214 | 200 | 55 | 197 | 666 | |
| Age | |||||
| Mean | |||||
| Male | 103 | 99 | 20 | 101 | 323 |
| Female | 111 | 101 | 35 | 96 | 343 |
| Mean | |||||
| Metabolic UHPLC-MS data obtained | 172 | 167 | 45 | 177 | 561 |
| Metabolic NMR data obtained | 174 | 173 | 46 | 182 | 575 |
| Metabolic data obtained (UHPLC-MS and NMR) | 132 | 140 | 36 | 162 | 470 |
| Genetic and metabolic UHPLC-MS data available | 119 | 80 | 24 | 122 | 345 |
| Genetic and metabolic NMR data available | 120 | 83 | 23 | 126 | 352 |
| APOE genotype available | 154 | 155 | 44 | 174 | 527 |
| E3E3 | 84 | 84 | 13 | 70 | 251 |
| E3E4 | 37 | 50 | 24 | 66 | 177 |
| E2E3 | 22 | 10 | 4 | 9 | 45 |
| E4E4 | 7 | 7 | 3 | 27 | 43 |
| E2E4 | 3 | 4 | 0 | 2 | 9 |
| E2E2 | 1 | 1 | 0 | 0 | 2 |
CTL normal cognition (control) participants, sMCI stable mild cognitive impairment, cMCI mild cognitive impairment converting to dementia, AD participants with Alzheimer’s disease, MMSE Mini-Mental State Examination.
Summary of samples and metabolic features available for the analysis.
| Platform | Assay | Abbreviation | # metabolic features | # samples | # metabolic QTL samples |
|---|---|---|---|---|---|
| UHPLC-MS | HILIC ESI+ | UHPOS | 6851 | 561 | 345 |
| RPC ESI− | URNEG | 16,961 | 561 | 345 | |
| RPC ESI+ | URPOS | 13,217 | 561 | 345 | |
| NMR | 1H NMR | NMR | 18,646 | 575 | 352 |
Details of used metabolomic platforms and assays are available in “Methods” section. Number of metabolic QTL samples—genetic and metabolic data availability.
Metabolic QTL mapping results.
| Assay | # SNP/metabolic feature associations | # unique SNPs | # unique metabolic features |
|---|---|---|---|
| UHPOS | 26,256 | 3004 | 256 |
| URNEG | 50,251 | 3974 | 518 |
| URPOS | 46,617 | 4479 | 535 |
| NMR | 12,518 | 876 | 233 |
Numbers of associations between metabolic features and SNPs found using q-value cut-off 0.01, resulting in a number of unique metabolic features and a number of unique SNPs for each metabolomic assay.
Figure 1Manhattan plots presenting significant QTL associations with metabolic features across the metabolite phenotyping datasets. The x-axis shows each SNP that was analysed, sorted by chromosome and position. The y-axis shows the log10 of the p-value for association with metabolic features concentration. Four sections correspond to four different metabolomic assays presented in our study: UHPOS, URNEG, URPOS and NMR.
Figure 2Performance of Random Forest models for different feature sets and three tested ways of classification. Tested feature sets: (A) metabolic features only, (B) metabolic and genomic features, (C) metabolic and genomic features together with sample covariates, and (D) metabolic features with sample covariates. Tested ways of classification: original multi-class—AD/CTL/cMCI/sMCI, binary over-sampling—AD + cMCI/CTL + sMCI, and binary under-sampling— AD/CTL. The best performing final model: set (D), binary under-sampling classification AD/CTL. The x-axis shows a number of trees used in the Random Forest run. The y-axis shows the Out-Of-Bug (OOB) prediction error.
Figure 3Receiver Operating Characteristic (ROC) curves for RF model discriminating AD vs CTL and then applied to cMCI and sMCI study groups. The model was trained with 1542 prioritised metabolic features and three covariates (age, sex, study site) identified from the AD vs CTL comparison only. The area under the ROC curve (AUROC) value for the AD vs CTL is 0.99. The AUROC value for cMCI vs sMCI classes is 0.88.
Performance of the final classification model.
| Dataset | Balanced accuracy | AUROC | Sensitivity | Specificity | Positive predictive value | Negative predictive value |
|---|---|---|---|---|---|---|
| Final model AD vs CTL | 0.9872 | 1 | 1 | 0.9744 | 0.9796 | 1 |
| Final model cMCI vs sMCI | 0.8037 | 0.8785 | 0.7778 | 0.8296 | 0.5490 | 0.9333 |
Performance of the final classification model in discriminating AD vs CTL and cMCI vs sMCI.
Annotated metabolites.
| Metabolite family | Metabolite annotation | Assay | # | PIC | F (3, 471) | p-value | CTL vs AD | CTL vs cMCI | CTL vs sMCI |
|---|---|---|---|---|---|---|---|---|---|
| Exogenous metabolites | N-Desmethyl O-desacetyl diltiazem glucuronide | UHPOS | 1 | 0.04943 | 112.8 | < 2E − 16 | 1.53 E − 11 | 1.54 E − 11 | 5.94 E − 02 |
| N-Desmethyl O-desacetyl hydroxy diltiazem glucuronide | UHPOS | 1 | 0.02553 | 80.3 | < 2 E − 16 | 1.53 E − 11 | 1.54 E − 11 | 3.32 E − 02 | |
| N-Desmethyl hydroxy diltiazem glucuronide | URPOS | 1 | 0.00741 | 40.13 | < 2 E − 16 | 1.53 E − 11 | 1.54 E − 11 | 1.27 E − 01 | |
| Paracetamol | UHPOS | 1 | 0.02314 | 59.34 | < 2 E − 16 | 1.53 E − 11 | 9.98 E − 01 | 7.30 E − 09 | |
| Paracetamol sulphate | URPOS | 1 | 0.003 | 26.66 | 6.19 E − 16 | 1.53 E − 11 | 9.26 E − 01 | 9.66 E − 01 | |
| 3-Methoxy-paracetamol sulphate | URPOS | 1 | 0.00105 | 11.97 | 1.45 E − 07 | 4.00 E − 07 | 1.26 E − 01 | 7.85 E − 01 | |
| Quinine | UHPOS | 1 | 0.00028 | 13.26 | 2.56 E − 08 | 3.66 E − 07 | 2.66 E − 03 | 1.02 E − 06 | |
| Cholesterol derived metabolites | Hydroxylated pregnenolone sulphate N-Acetylglucosamine isomer 2* | URNEG | 1 | 0.0051 | 34.75 | < 2 E − 16 | 1.53 E − 11 | 5.61 E − 02 | 9.86 E − 01 |
| Hydroxylated pregnenolone sulphate N-acetylglucosamine isomer 1* | URNEG | 1 | 0.01073 | 46.28 | < 2 E − 16 | 1.53 E − 11 | 8.84 E − 03 | 8.26 E − 02 | |
| Pregnenolone sulphate N-acetylglucosamine | URNEG | 5 | 0.00463 | 26.42 | 8.39 E − 16 | 1.54 E − 11 | 4.87 E − 03 | 8.22 E − 02 | |
| Pregnanediol sulphate N-acetylglucosamine | URNEG | 1 | 0.00025 | 13.06 | 3.34 E − 08 | 3.02 E − 07 | 9.49 E − 01 | 9.30 E − 01 | |
| Taurochenodeoxycholicc or taurodeoxycholic acid Nacetylglucosaminide* | URNEG | 1 | 0.00052 | 20.31 | 2.18 E − 12 | 3.14 E − 10 | 9.99 E − 01 | 9.03 E − 01 | |
| Nucleosides, amines, carnitines, glycines | 3-Aminoisobutyrate | NMR | 2 | 0.00015 | 2.18 | 8.99 E − 02 | 9.44 E − 01 | 6.25 E − 01 | 3.75 E − 01 |
| N,N,N-Trimethyl-L-alanyl-L-proline betaine | URPOS | 2 | 0.00299 | 34.09 | < 2 E − 16 | 1.53 E − 11 | 8.65 E − 02 | 5.57 E − 02 | |
| Butyryl or isobutyryl carnitine* | UHPOS | 1 | 0.00054 | 14.54 | 4.57 E − 09 | 1.16 E − 08 | 7.25 E − 03 | 3.53 E − 01 | |
| Trimethylamine | NMR | 1 | 0.00014 | 1.68 | 1.71 E − 01 | 2.81 E − 01 | 9.48 E − 01 | 1.67 E − 01 | |
| L-Lysine | NMR | 1 | 0.00011 | 1.33 | 2.56 E − 01 | 9.90 E − 01 | 9.07 E − 01 | 4.65 E − 01 | |
| 5-Methylcytidine | UHPOS | 2 | 0.00075 | 18.56 | 2.16 E − 11 | 2.67 E − 11 | 1.79 E − 01 | 9.89 E − 02 | |
| 2-O-Methylcytidine | UHPOS | 1 | 0.00033 | 20.54 | 1.62 E − 12 | 1.93 E − 09 | 7.12 E − 01 | 9.79 E − 01 | |
| Unknown nucleoside with adenosyl moiety | UHPOS | 1 | 0.00026 | 9.24 | 5.99 E − 06 | 2.90 E − 06 | 1.71 E − 01 | 3.18 E − 01 | |
| N-Acetylisoputreanine-gamma-lactam | URPOS | 2 | 0.00046 | 13.42 | 2.06 E − 08 | 4.55 E − 09 | 1.74 E − 02 | 5.36 E − 03 | |
| Sugars | Sucrose | NMR | 1 | 0.00027 | 0.17 | 9.20 E − 01 | 6.78 E − 04 | 5.99 E − 01 | 1.56 E − 01 |
| Galactose | NMR | 2 | 0.0001 | 0.17 | 9.20 E − 01 | 9.05 E − 01 | 9.77 E − 01 | 9.74 E − 01 |
Note (*) signifies isomers that cannot be differentiated using mass spectrometry fragmentation data. Column headers: Assay—metabolomic assay; #—a number of metabolic features in the dataset; PIC—Permutation Importance Score from Random Forest algorithm showing the importance of metabolite for classification purpose; F (3, 471)—ANOVA results (MANOVA in case of multiple metabolic features) presented as F-statistic; p-value—ANOVA (MANOVA in case of multiple metabolic features) results presented as adjusted p-value; CTL vs AD, CTL vs cMCI and CTL vs sMCI—post hoc tests results presented as adjusted p-value. In the last four columns, Scientific Notation is used due to the presence of very small numbers.
Figure 4The heatmap showing concentrations of the annotated metabolic features. Note (*) indicates metabolite conjugation with N-acetylglucosamine, note (**) indicates metabolite conjugation with N-acetylglucosaminide. Columns sharing the same metabolite names are isomers of each other.
Annotated metabolites with mQTL results, phenotypic traits and literature findings.
| Metabolic pathway | Metabolite annotation | Chr | Genes/genomic region | Phenotypic traits | Relationship to AD |
|---|---|---|---|---|---|
| N-Desmethyl | |||||
| N-Desmethyl | 1 | Regulatory feature: ENSR00000006069 | Anxiety and major depressive disorder, Obesity-related traits | ||
| N-Desmethyl hydroxy diltiazem glucuronide | 15 | MESP2 | Coronary artery aneurysm in Kawasaki disease | ||
| Paracetamol | |||||
| Paracetamol sulphate | |||||
| 3-Methoxy-paracetamol sulphate | 4 AND 17 | SORCS2 AND CNTROB | Biopolar disorder, Interleukin-10 levels | SORCS2 belongs to the Vps10 receptor family that has previously been linked to neurodegeneration and AD[ | |
| Quinine | 2 | AOX1 | Late-onset Alzheimer’s disease | The mQTL association links aldehyde oxydase 1 (AOX1) gene and quinine. AOX1 gene has a previously reported GWAS trait “Late-onset Alzheimer’s disease”[ | |
| Cholesterol metabolism (CM) | Hydroxylated pregnenolone sulphate | 7 | CHN2 | Age at onset, Alzheimer’s disease, Obesity-related traits, Psychosis | Beta-chimaerin (CHN2) gene plays a role in neural development by regulating Rac1 activity[ |
| Hydroxylated pregnenolone sulphate | |||||
| Pregnenolone sulphate | |||||
| Pregnanediol sulphate | |||||
| Tauro(cheno) deoxycholic acid | 5 | UGT3A1 | Blood metabolite levels, Primary biliary cholangitis (PBC) | Neither the gene UGT3A1 nor the PBC has a known relationship to AD, although we note that a progressive cognitive impairment different to delirium is a feature of PBC, independently of liver pathology[ | |
| CM, gut microbiota | 3-Aminoisobutyrate | 5 | AGXT2 | Metabolite levels, Asymmetrical dimethylarginine levels, Symmetrical dimethylarginine levels | |
| Gut microbiota | 11 AND 21 | Regulatory feature: ENSR00000961656 AND intergenic variant | |||
| Butyryl or isobutyryl carnitine * | 15 | intergenic variant | |||
| Trimethylamine | 10 | PYROXD2 | General cognitive ability, Obesity-related traits, Metabolite levels | ||
| 19 | SLC7A9 | Estimated glomerular filtration rate, Creatinine levels | |||
| DNA methylation | 5-Methylcytidine | 4 | CC2D2, FBXL5, FAM200B, BST1 | Parkinson’s disease, Blood protein levels, Cerebrospinal fluid biomarker levels | The FBXL5 gene is a critical component of iron metabolism[ |
| 2- | 9 | NUP188, DOLK, PHYHD1, SH3GLB2 | Body mass index | The PHYHD1 gene encodes 2-oxoglutarate oxygensase, an amyloid-beta interacting protein that has been shown to be dysregulated in both AD brain and in transgenic models with plaque pathology[ | |
| Unknown nucleoside with adenosyl moiety | 12 | Intergenic variant | The nearest gene to mQTL region is SYT1. It encodes protein synaptotagmin—a novel cerebrospinal fluid biomarker for Alzheimer’s disease[ | ||
| Polyamine metabolism | 2 | Long intergenic non-protein coding RNA LINC01914 | |||
| CM, insulin resistance | Sucrose | 8 | Intergenic variant | ||
| Galactose | 19 | FUT2 | Estimated glomerular filtration rate, Cholesterol levels |
Note (*) signifies isomers that cannot be differentiated using mass spectrometry fragmentation data. We present phenotypic traits previously associated with a genomic region of interest, and possible linkage of found genes to AD processes.
Figure 5Annotated metabolites and their linkage to AD. Red colour indicates metabolites annotated in the study. Up arrow next to the metabolite’s name indicates increased levels in AD patients samples. Down arrow shows decreased levels in AD patients samples.