Literature DB >> 35475137

Genome- and epigenome-wide studies of plasma protein biomarkers for Alzheimer's disease implicate TBCA and TREM2 in disease risk.

Robert F Hillary1, Danni A Gadd1, Daniel L McCartney1, Liu Shi2, Archie Campbell1, Rosie M Walker1,3, Craig W Ritchie4, Ian J Deary5, Kathryn L Evans1, Alejo J Nevado-Holgado2, Caroline Hayward6, David J Porteous1, Andrew M McIntosh1,7, Simon Lovestone2,8, Matthew R Robinson9, Riccardo E Marioni1.   

Abstract

Introduction: The levels of many blood proteins are associated with Alzheimer's disease (AD) or its pathological hallmarks. Elucidating the molecular factors that control circulating levels of these proteins may help to identify proteins associated with disease risk mechanisms.
Methods: Genome-wide and epigenome-wide studies (nindividuals ≤1064) were performed on plasma levels of 282 AD-associated proteins, identified by a structured literature review. Bayesian penalized regression estimated contributions of genetic and epigenetic variation toward inter-individual differences in plasma protein levels. Mendelian randomization (MR) and co-localization tested associations between proteins and disease-related phenotypes.
Results: Sixty-four independent genetic and 26 epigenetic loci were associated with 45 proteins. Novel findings included an association between plasma triggering receptor expressed on myeloid cells 2 (TREM2) levels and a polymorphism and cytosine-phosphate-guanine (CpG) site within the MS4A4A locus. Higher plasma tubulin-specific chaperone A (TBCA) and TREM2 levels were significantly associated with lower AD risk. Discussion: Our data inform the regulation of biomarker levels and their relationships with AD.
© 2022 The Authors. Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring published by Wiley Periodicals, LLC on behalf of Alzheimer's Association.

Entities:  

Year:  2022        PMID: 35475137      PMCID: PMC9019629          DOI: 10.1002/dad2.12280

Source DB:  PubMed          Journal:  Alzheimers Dement (Amst)        ISSN: 2352-8729


INTRODUCTION

Alzheimer's disease (AD) is one of the leading causes of disease burden and death globally. , Blood‐based methods for assessing disease risk are potentially more cost‐effective and less invasive than neuroimaging methods or lumbar punctures for collecting cerebrospinal fluid (CSF). Approaches that use genomics and untargeted proteomics have suggested that there are signals in blood that can supplement targeted assays, and contribute to the understanding and prediction of AD. , However, the relevance of many candidate protein markers identified by untargeted approaches to AD remains unclear. , Understanding the molecular factors that regulate the levels of AD‐associated proteins may identify proteins that bear relevance to disease risk mechanisms. Unlike genetic factors, which remain largely stable over the life‐course, differential DNA methylation (DNAm) profiles are influenced by genetic and non‐genetic factors. DNAm is characterized by the addition of methyl groups to DNA, typically in the context of cytosine‐phosphate‐guanine (CpG) nucleotide base pairings. Clusters of CpG sites termed CpG islands are located near 70% of gene promoters. CpG island methylation is typically associated with reduced gene expression. However, it is important to note that DNAm is dynamic, tissue‐specific, and cell‐specific. DNAm data may capture independent information beyond genetic factors in explaining inter‐individual variation in circulating protein levels. Several genome‐wide association studies (GWAS) have catalogued polymorphisms associated with plasma protein levels and identified proteins that correlate with risk scores for various disease states including AD. , , Zaghlool et al. (2020) performed the only large‐scale epigenome‐wide association study (EWAS) to date on plasma protein levels (>1000 proteins). Few studies have combined GWAS and EWAS data to quantify the independent and combined contributions of genetic and epigenetic factors toward differential protein biomarker levels. , , We performed a structured literature review of studies that report associations between plasma proteins and AD diagnosis or related traits such as amyloid beta (Aβ) burden and cortical atrophy. , , , , , , , , , , , We focused on studies that measured plasma protein levels using the SOMAscan affinity proteomics platform (SomaLogic Inc.), as this matches the protocol used in our study, Generation Scotland. We identified 282 proteins that were also measured in our sample (n ≤ 1064). Our first aim was to conduct GWAS and EWAS on plasma levels of 282 AD‐associated proteins. Using Bayesian penalized regression, we estimated the proportion of inter‐individual variability in plasma protein levels that can be accounted for by variation in genetic and DNAm factors. BayesR+ implicitly adjusts for probe intercorrelations and data structure, including relatedness. For our second aim, we used Mendelian randomization (MR) and co‐localization analyses to test for relationships between plasma protein levels and AD phenotypes.

METHODS

Study cohort

Analyses were performed using blood samples from participants of the STratifying Resilience and Depression Longitudinally (STRADL) cohort, comprising 1188 individuals from the larger, family‐structured Generation Scotland: the Scottish Family Health Study (GS). GS consists of 24084 individuals from across Scotland. Recruitment for GS took place between 2006 and 2011. STRADL participants partook in follow‐up data collection 4 to 13 years after baseline. ,

Search strategy

We searched MEDLINE (Ovid interface, Ovid MEDLINE in‐process and other non‐indexed citations and Ovid MEDLINE 1946 onwards), Embase (Ovid interface, 1980 onwards), Web of Science (core collection, Thomson Reuters), and medRxiv/bioRxiv to identify relevant articles indexed as of May 28, 2021. Search terms are outlined under supplementary information. Twenty‐five articles were identified and one further article was identified through a supplemental manual literature search. After removal of duplicates, 23 articles were assessed for inclusion criteria: (1) original research article, (2) proteins were measured in plasma, (3) proteins were measured using SOMAscan technology, and (4) proteins were associated with AD or related phenotypes. Twelve articles met inclusion criteria.

Protein quantification

The 5k SOMAscan v4 array was used to quantify the levels of plasma proteins in GS participants (n = 1065). This highly multiplexed platform uses chemically modified aptamers termed SOMAmers (Slow Off‐rate Modified Aptamers) that recognize epitopes on their cognate protein targets with high specificity and high affinity in the nanomolar‐to‐picomolar range. The recognition signal is measured as relative fluorescence units (RFUs) on microarrays. Plasma samples were collected in 150 μL aliquots and stored at −80°C. Samples were run in 96‐well plates and reagents were spread across three dilution factors (0.005%, 0.5%, and 20%) to create distinct sets for high, medium, and low abundance proteins, respectively. Raw microarray data were normalized through a number of quality control steps, which are detailed in the supplementary information. After quality control and the exclusion of non‐human proteins, deprecated markers and spuriomers, 4235 SOMAmers were retained for proteomic analyses. Normalized RFUs (from SomaLogic) were log‐transformed and regressed onto the following covariates: age, sex, study site (Aberdeen/Dundee), time between sample being collected and processed for proteomics (factor, 4 levels), and 20 genetic principal components (PCs) of ancestry from multidimensional scaling (to control for population structure). Relationships between covariates and SOMAmers are shown in Table S1. Residualized RFUs were transformed by rank‐based inverse normalization. We refer to these as protein levels; however, they reflect RFUs that have undergone a number of quality control, transformation and pre‐correction steps.

RESEARCH IN CONTEXT

Systematic Review: The authors performed a structured literature review of studies that reported associations between SOMAscan‐measured plasma proteins and Alzheimer's disease (AD). Twelve studies were included following a search of MEDLINE, Embase, Web of Science, and preprint servers. The goal of the study was to combine genome‐wide and epigenome‐wide association studies (GWAS and EWAS) with causal modeling methods to investigate associations between plasma proteins and AD risk. The study used data from the Scottish population‐based cohort, Generation Scotland. Interpretation: Two hundred eighty‐two proteins across the included studies were available for testing in Generation Scotland. Seven novel genetic and 19 novel cytosine‐phosphate‐guanine (CpG) sites were associated with plasma levels of 18 proteins. Higher plasma levels of tubulin‐specific chaperone A (TBCA) and triggering receptor expressed on myeloid cells 2 (TREM2) associated with lower risk of AD. Future Directions: Triangulation of evidence across other experimental and epidemiological approaches will be necessary to determine if blood proteins influence AD risk.

GWAS

Generation Scotland samples were genotyped using the Illumina Human OmniExpressExome‐8v1.0 Bead Chip and processed using the Illumina Genome Studio software v2011 (Illumina, San Diego, CA, USA). Quality control steps are outlined under supplementary information. After quality control, 561125 single nucleotide polymorphisms (SNPs) remained for 1064 individuals. In total, 1064 individuals had both genotype and proteomic data available for analyses. Bayesian penalized regression GWAS were performed using BayesR+ software in C++. BayesR+ utilizes a mixture of prior Gaussian distributions to allow for markers with effect sizes of different magnitudes. It also includes a discrete spike at zero that enables the exclusion of markers with non‐identifiable effects on the trait of interest. Guided by data from our previous studies, mixture variances for the stand‐alone GWAS were set to 0.01 and 0.1 to allow for markers that account for 1% or 10% of variation in circulating protein levels, respectively. , In the combined GWAS/EWAS analysis, genotype and DNAm data must have had the same number of prior variances (n = 3 each). Mixture variances for SNP data were set to 0.01, 0.1, and 0.2 in combined analyses. Input data were scaled to mean zero and unit variance. Gibbs sampling was used to sample over the posterior distribution conditional on input data and 10000 samples were used. The first 5000 samples of burn‐in were removed and a thinning of five samples was applied to reduce autocorrelation. SNPs that exhibited a posterior inclusion probability (PIP) ≥95% were deemed significant.

EWAS

Blood DNAm in Generation Scotland participants was quantified using the Illumina HumanMethylationEPIC BeadChip Array. Blood DNAm was assessed in two separate sets. After quality control, 793706 and 773860 CpG remained in sets 1 and 2, respectively. In total, 772619 CpG sites were shared across sets. Each set was truncated to these overlapping probes. In the stand‐alone EWAS and combined GWAS/EWAS, mixture variances were set to 0.001, 0.01, and 0.1 (n = 778). Missing DNAm data were mean imputed separately within each set as BayesR+ cannot accept missing values. Both sets were combined and adjusted for DNAm batch, set, age, and sex. Each CpG site was scaled to mean zero and unit variance. Houseman‐estimated white blood cell proportions were included as fixed‐effect covariates. CpG sites that had a PIP ≥95% were deemed significant. Sensitivity EWAS analyses were performed using linear mixed‐effects models and the lmekin function from the R coxme package (version 2.2‐16). DNAm data were pre‐corrected for age, sex, batch, and set. Houseman‐estimated white blood cell proportions were incorporated as fixed‐effect covariates and a kinship matrix was fitted to account for relatedness among individuals in STRADL.

Co‐localization analyses

Formal Bayesian tests of co‐localization were used to determine whether a shared causal variant likely underpinned two traits of interest. A 200 kilobase (kb) region (upstream and downstream, recommended default setting) surrounding the variant was extracted from our GWAS summary statistics. Expression quantitative trait loci (eQTL) data were extracted from eQTLGen summary statistics. Methylation QTL (mQTL) summary statistics were extracted from phenoscanner, GoDMC, or our own mQTL analyses. Methylation QTL analyses were performed using additive linear regression models and by regressing CpG sites (beta values) on SNPs (0, 1, 2) while adjusting for age, sex, DNAm batch, set, Houseman‐estimated white blood cell proportions, and 20 genetic PCs (n = 778). In instances where an mQTL effect was identified in more than one database, summary statistics from the study with the largest sample size were used in coloc. , , For AD‐related traits, summary statistics were extracted from the relevant GWAS. , , , Default priors were applied. Summary statistics for all SNPs (± 200 kb from the queried SNP) were used to estimate the posterior probability for five separate hypotheses: a single variant underlying both traits, separate variants for both traits, a causal variant for one trait (encompassing two hypotheses), or no causal variant for either trait. Posterior probabilities ≥95% provided strong evidence for a given hypothesis.

Mendelian randomization (MR)

Bidirectional Mendelian randomization (MR) was used to test for associations between (1) gene expression and plasma protein levels, (2) DNAm and plasma protein levels, and (3) plasma protein levels and AD risk or related biomarkers. Pruned variants (r 2 < 0.1) were used as instrumental variables (IVs) in MR analyses. Analyses were conducted using MR‐base. Two‐sample MR was applied and relationships were assessed using the Wald ratio test. Further information on IVs used are provided in supplementary information.

RESULTS

Identification of plasma proteins associated with AD

Twelve studies were identified that reported associations between SOMAscan plasma proteins and AD or related traits (Figure 1). Three hundred fifty‐nine unique proteins were identified and 22 (6.1%) were reported in more than one study (Table S2‐S4). In the Generation Scotland dataset, there were 308 SOMAmers (Slow Off‐rate Modified Aptamers) that targeted 282 of 359 proteins of interest (Table S5 and Figure S1). The 282 unique proteins were brought forward for analyses (UniProt IDs and Seq‐ids are shown in Table S6).
FIGURE 1

Structured literature review of SOMAscan plasma proteins that were associated with AD in the literature, and assessment of their molecular architectures and relationships with AD in the present study. The MEDLINE, Embase, Web of Science databases, and preprint servers were queried to identify studies that reported associations between SOMAscan‐measured plasma proteins and AD. GWAS, EWAS, and causal inference analyses were performed to identify molecular correlates of 282 AD‐associated plasma protein levels and to probe their associations with AD and related traits. AD, Alzheimer's disease; EWAS, epigenome‐wide association studies; GWAS, genome‐wide association studies. Figure created using Biorender.com

Structured literature review of SOMAscan plasma proteins that were associated with AD in the literature, and assessment of their molecular architectures and relationships with AD in the present study. The MEDLINE, Embase, Web of Science databases, and preprint servers were queried to identify studies that reported associations between SOMAscan‐measured plasma proteins and AD. GWAS, EWAS, and causal inference analyses were performed to identify molecular correlates of 282 AD‐associated plasma protein levels and to probe their associations with AD and related traits. AD, Alzheimer's disease; EWAS, epigenome‐wide association studies; GWAS, genome‐wide association studies. Figure created using Biorender.com

GWAS on AD‐associated proteins

There were 1064 individuals with genotype and proteomic data in Generation Scotland. The mean age of the sample was 59.9 years (standard deviation [SD] = 5.9) and 59.1% of the sample was female. In the BayesR+ GWAS, 64 independent variants (or protein quantitative trait loci, pQTLs) were associated with 41 SOMAmers that mapped to 39 unique protein targets (PIP≥ 95%; Table S7). The phenotypic correlation structure of these 41 SOMAmers is presented in Figure S2. The median correlation coefficient between SOMAmer levels was 0.18. Thirty‐six pQTLs represented cis associations (pQTLs within 10 megabases [Mb] of transcription start site [TSS] for a given gene) and 28 pQTLs were trans‐chromosomal effects (Figure 2). The majority of variants were located in intronic regions using annotations from the ENSEMBL variant effect predictor (46.9%, Figure S3).
FIGURE 2

GWAS on plasma protein levels previously associated with AD and disease‐related phenotypes. (A) Chromosomal locations of pQTLs identified through Bayesian penalized regression GWAS. The x‐axis shows the chromosomal location of pQTLs associated with the levels of SOMAmers that correlate with AD status or related pathways. The y‐axis represents the position of the gene encoding the target protein. Cis (red circles); trans (blue circles). (B) A circos plot for the 28 trans‐associated pQTLs from (A). Lines indicate an association between a pQTL and SOMAmer. AD, Alzheimer's disease;GWAS, genome‐wide association studies; pQTL, protein quantitative trait locus

GWAS on plasma protein levels previously associated with AD and disease‐related phenotypes. (A) Chromosomal locations of pQTLs identified through Bayesian penalized regression GWAS. The x‐axis shows the chromosomal location of pQTLs associated with the levels of SOMAmers that correlate with AD status or related pathways. The y‐axis represents the position of the gene encoding the target protein. Cis (red circles); trans (blue circles). (B) A circos plot for the 28 trans‐associated pQTLs from (A). Lines indicate an association between a pQTL and SOMAmer. AD, Alzheimer's disease;GWAS, genome‐wide association studies; pQTL, protein quantitative trait locus Fifty‐seven pQTLs were previously reported in GWAS of blood protein levels (Table S8). Variants either directly replicated known associations or showed high linkage disequilibrium (LD, r 2 > 0.75) with known pQTLs for queried proteins. Relative effect sizes reported in the literature correlated strongly with those in our study (r = 0.77, 95% confidence interval [CI] = 0.66, 0.84). We identified seven novel pQTLs associated with seven unique proteins. Three pQTLs were in cis (for GM2A, MATN3 and IL1RAP). Four pQTLs represented trans‐chromosomal effects: rs1126680 (BCHE for KLK6), rs7867739 (near ABO for ALPI), rs3820897 (COLEC11 for ALPL), and rs1530914 (MS4A4A for triggering receptor expressed on myeloid cells 2 [TREM2]). Thirty‐three pQTLs were associated with at least one trait in the GWAS Catalog at P < 5 × 10−8 (range = 1 to 96 associations; Table S9). In relation to AD traits, the trans pQTL in MS4A4A (rs1530914) for TREM2 levels is in high LD with a TREM2 variant (rs1582763, r 2 ∼ 0.9) associated with AD in apolipoprotein E (APOE) ε4 carriers and family history of AD. , In addition, the trans pQTL in APOE (rs769449) for tubulin‐specific chaperone A (TBCA) levels was associated with 15 AD‐related traits including genetic predisposition to AD and CSF biomarkers of the disease. BayesR+ was used to estimate the proportions of inter‐individual variation in plasma protein levels that were attributable to common SNPs (minor allele frequency >1%). Estimates ranged from 5.3% (PRL; 95% credible interval [CrI] = 0%, 24.4%) to 73.0% (IL1RAP; 95% CrI = 56.0%, 83.0%), with a median estimate of 13.0% across all 308 SOMAmers (Table S10).

Co‐localization of protein QTLs with expression QTLs

The 36 cis pQTLs identified in BayesR+ were annotated to 23 unique proteins. For 12 of 23 proteins, at least one pQTL was previously reported to be an expression QTL for the respective gene in blood tissue (eQTL consortium database). The R package coloc was used to test the hypothesis that a single variant associates with differences in gene expression (eQTL) and protein levels (pQTL) for each gene of interest. For two proteins (PCSK7 and F7), there was strong evidence (posterior probability [PP]) >95%) for a shared variant underlying gene expression and protein levels (Table S11). MR analyses provided evidence for reciprocal associations between changes in gene expression and circulating levels of these proteins (Table S12). Three proteins had weaker evidence for co‐localization (PP ≥75% for GM2A, LYZ, PDCD1LG2) and seven proteins had strong evidence for separate variants underlying gene expression and protein levels.

EWAS on AD‐associated proteins

There were 778 individuals with DNAm and proteomic data in the Generation Scotland sample. The mean age of the sample was 60.2 (SD = 8.8) years and 56.4% of the sample were female. Twenty‐six CpGs were associated with the levels of 20 unique proteins (PIP >95%, Table S13 and Figure S4). The median correlation coefficient between measured protein levels was 0.16. The associations consisted of 10 cis CpG sites and 16 trans CpG loci (Figure 3). The cg07839457 probe in the NLRC5 locus was associated with IL18BP and CSF1R levels, and the smoking‐associated probe cg05575921 in AHRR was associated with GHR, PIGR, and WFDC2 levels.
FIGURE 3

EWAS on plasma protein levels previously associated with AD and disease‐related phenotypes. (A) Chromosomal locations of CpGs identified through Bayesian penalized regression EWAS. The x‐axis shows the chromosomal location of CpG sites and the y‐axis represents the position of the gene encoding the target protein. Cis (red circles); trans (blue circles). (B) A circos plot for the 16 trans‐associated CpGs from (A). Lines indicate an association between a CpG site and SOMAmer. AD, Alzheimer's disease; CpG, cytosine‐phosphate‐guanin ; EWAS, epigenome‐wide association studies

EWAS on plasma protein levels previously associated with AD and disease‐related phenotypes. (A) Chromosomal locations of CpGs identified through Bayesian penalized regression EWAS. The x‐axis shows the chromosomal location of CpG sites and the y‐axis represents the position of the gene encoding the target protein. Cis (red circles); trans (blue circles). (B) A circos plot for the 16 trans‐associated CpGs from (A). Lines indicate an association between a CpG site and SOMAmer. AD, Alzheimer's disease; CpG, cytosine‐phosphate‐guanin ; EWAS, epigenome‐wide association studies We used linear mixed‐effects models that accounted for relatedness to perform sensitivity analyses for the 26 CpG associations identified in BayesR+ (Table S14). Effect sizes were highly correlated with those from BayesR+ and showed full directional concordance (r = 0.95, 95% CI = 0.90, 0.98; Figure S5). Twenty‐one associations were replicated at a genome‐wide significance threshold of P < 3.6 x 10−8,and the remaining five associations were replicated at P < 2.0 × 10−3. Furthermore, 7 of 26 CpG associations were previously reported in the literature and relative effect sizes correlated strongly with those in our study (r = 0.98, 95% CI = 0.87, 1.0). The 19 novel CpG sites were associated with levels of 14 unique proteins. In BayesR+, estimates for the proportions of variability in SOMAmer levels that could be accounted for by DNAm measured on the EPIC BeadChip array ranged from 7.1% (EEA1; 95% CrI = 0%, 27.7%) to 33.8% (MAP kinase‐activated protein kinase, MAPKAPK5; 95% CrI = 22.6%, 47.0%), with a median estimate of 10.0% (Table S15). Estimates for variance in SOMAmer levels accounted for by genetic and methylation data together, while conditioned on each other, ranged from 21.8% for ENTPD1 (95% CrI = 0.0%, 59.1%) to 93.3% for GHR (95% CrI = 80.1%, 100%) (Table S16). The mean and median estimates were 48.7% and 46.8%, respectively.

Co‐localization of protein QTLs with methylation QTLs

Fourteen proteins had both genome‐wide significant pQTL and CpG associations in our study. There were 39 possible SNP‐CpG pairs across these proteins. For each pair, we used linear regression to test if the SNP was associated with CpG methylation at P < 5 × 10−8, thereby representing an mQTL effect (Table S17). We also performed look‐up analyses of mQTL databases including GoDMC and phenoscanner. , In instances where an mQTL effect was identified in more than one database, coefficients from the study with the largest sample size were brought forward for co‐localization analyses. In addition, in instances where two or more mQTLs were associated with the same CpG site in a given locus, only the most significant mQTL was brought forward for co‐localization analyses (n = 19 mQTLs, 13 proteins; Table S18). For six proteins, we observed strong evidence in coloc that a single cis‐acting variant might underpin differential DNAm levels and protein abundances (PP >95%, Table S19). The six proteins were ANXA2, F7, MATN3, PCSK7, PLA2G2A, and SERPINA3. MR analyses provided evidence that relationships between methylation and protein levels were bidirectional (Table S20).

MR analyses between plasma proteins and AD risk

Bidirectional MR was applied to test for associations between the 41 SOMAmers with pQTL associations in BayesR+ and 20 AD‐related traits (Table S21). A Bonferroni‐corrected threshold of P < 6.10 × 10−5 (< 0.05/41 × 20) was set. Plasma levels of three proteins had a unidirectional association with AD risk: TREM2 (Table 1, Wald ratio test, beta = −0.13, SE = 0.05, P = 8.4 × 10−17), colony stimulating factor 3 (CSF3) (Wald ratio test, beta = 0.10, SE = 0.02, P = 5.9 × 10−6), and TBCA (inverse variance‐weighted method, beta = ‐0.50, SE = 0.12, P = 1.2 × 10−5). Conversely, AD risk was not associated with plasma levels of these proteins. Co‐localization analyses suggested that one variant was associated with TREM2 or TBCA levels and AD risk, and two separate variants were associated with CSF3 levels and AD risk (Table S22).
TABLE 1

MR analyses of plasma protein levels and AD‐associated traits (Bonferroni‐corrected P < 6.10 × 10−5)

ProteinTraitMethodBetaSE P Reference
Protein levels affecting AD‐associated traits
TBCALog‐transformed CSF Aβ42IVW−0.090.012.5 × 10−17 38
TREM2AD riskWald ratio−0.130.028.4 × 10−17 3
TBCACSF APOEWald ratio0.750.107.3 × 10−14 39
TBCACSF Aβ (Z‐scores)IVW−0.450.062.1 × 10−13 38
TBCALog‐transformed CSF Aβ42/Aβ40IVW−0.080.016.9 × 10−10 38
CSF3AD riskWald ratio0.100.025.9 × 10−6 3
TBCAAD riskIVW−0.500.121.2 × 10−5 3
AD‐associated traits affecting protein levels
TBCALog‐transformed CSF Aβ42Wald ratio−11.140.534.4 × 10−98 38
TBCACSF Aβ (Z‐scores)Wald ratio−2.130.105.7 × 10−98 38
TBCALog‐transformed CSF Aβ42/Aβ40Wald ratio−11.130.535.7 × 10−98 38
TBCACSF AβWald ratio12.210.633.7 × 10−84 37

Abbreviations: CSF, cerebrospinal fluid; IVW, inverse variance‐weighted method; MR, mendelian randomization; SE, standard error.

MR analyses of plasma protein levels and AD‐associated traits (Bonferroni‐corrected P < 6.10 × 10−5) Abbreviations: CSF, cerebrospinal fluid; IVW, inverse variance‐weighted method; MR, mendelian randomization; SE, standard error.

DISCUSSION

We identified seven novel protein QTLs and 19 novel CpG sites that associated with plasma levels of 18 AD‐related proteins. Using BayesR+, we provided estimates for associations between common genetic and DNAm variation and inter‐individual differences in plasma levels of 282 AD‐related proteins. We integrated our data with publicly available gene expression and methylation QTL databases and highlighted molecular mechanisms that might link pQTLs to differential levels of six proteins. We observed strong associations between plasma levels of TREM2 or TBCA and AD risk. These associations were driven by trans pQTLs in MS4A4A and APOE, respectively. We show that a trans pQTL (rs1530914) in the MS4A4A locus associates with higher plasma TREM2 levels. It is in strong LD (r 2 ∼ 0.9) with the variant rs1582763, which has been associated with higher CSF TREM2 levels and lower AD risk. , It is also in moderate LD (r 2 = 0.6) with a variant in the 3′UTR region of MS4A6A (rs610932), which was associated with plasma TREM2 levels in a sample of 35,559 Icelanders. Polymorphisms in MS4A4A were shown to alter MS4A4A expression and subsequently modulate TREM2 concentration in human macrophages. We supplement existing data by identifying a novel blood CpG correlate of plasma TREM2 levels (cg02521229) located near MS4A4A that previously associated with dementia risk in Generation Scotland participants. Our data suggest that risk mechanisms arising from MS4A4A polymorphisms and TREM2 levels can be captured in plasma assays and that these mechanisms involve differential methylation in the MS4A4A locus. We observed associations between plasma levels of three proteins (CSF3, MAPKAPK5, and TBCA) and trans pQTLs in the TOMM40‐APOE‐APOC2 locus. Furthermore, we identified two pQTLs and three CpG correlates of plasma MAPKAPK5 levels near the transmembrane protein 9 (TMEM97)locus. MAPKAPK5 correlated with cognitive decline in the Twins UK cohort; however, its relationship with neuropathology is unknown. TMEM97 acts a synaptic receptor for Aβ and mediates its cellular update via APOE‐dependent and APOE‐independent mechanisms. Given that TMEM97 polymorphisms may influence MAPKAPK5 levels, our data prioritize MAPKAPK5 for follow‐up studies as a potential downstream effector or correlate of TMEM97 in Aβ clearance. TBCA correlates with Aβ burden. TBCA levels are higher in individuals with the protective APOE ε2/ε2 genotype and lower in carriers of the risk ε4 polymorphism. These data are consistent with our GWAS and MR analyses. Future studies should examine whether TBCA dysregulation is a cause or consequence of disease risk mechanisms in carriers of APOE ε4 polymorphisms. Our study has a number of limitations. First, our review does not reflect an exhaustive list of potential AD‐associated traits. Furthermore, there is heterogeneity across studies in terms of diagnostic criteria and phenotype definitions. Second, by focusing on the SOMAscan platform alone, we do not capture all blood protein correlates of AD that are reported in the literature. Third, an insufficient number of variants were available to test for horizontal pleiotropy in MR analyses. Fourth, it is important to note that variants may alter SOMAmer reactivity with protein targets, or reflect technical artifacts such as sample handling and cross‐reactive events. Fifth, our sample consisted of Scottish individuals with a relatively homogenous genetic background thereby limiting generalizability of findings.

CONCLUSIONS

Our strategy of integrating multiple omics measures determined the degree to which molecular factors can explain inter‐individual differences in blood levels of possible biomarkers for AD, and advanced understanding of mechanisms underlying AD risk.

CONFLICT OF INTEREST

Andrew M. McIntosh has received research support from Eli Lilly, Janssen, and the Sackler Foundation. Andrew M. McIntosh has also received speaker fees from Illumina and Janssen and consulting fees. Simon Lovestone is currently an employee of Johnson & Johnson Medical Ltd and previously received grant support from multiple pharmaceutical companies and the EU through the Innovation Medicines Initiative programmes European Medical Information Framework and European Prevention of Alzheimer's Dementia (EPAD). Simon Lovestone is also co‐founder of Akrivia Health. Alejo J. Nevado‐Holgado has received funding from GlaxoSmithKline, UK, Ono Pharma, Japan, and Johnson & Johnson, UK. David J. Porteous is PI on a grant from Wellcome made to the University of Edinburgh with travel allowance. Craig W. Ritchie has been a paid consultant for several companies developing treatments for Alzheimer's disease over the last 5 years including Biogen, Eli Lilly, Merck, Roche, Janssen, AbbVie, Kyowa Kirin, Actinogen, and Eisai. Craig W. Ritchie was the UK Chief Investigator for the ENGAGE Trial and Academic Lead on the EPAD Programme, which was a public‐private partnership between the EU and several companies with an interest in developing treatments for AD (www.ep‐ad.org). Craig W. Ritchie's unit at the University of Edinburgh (Edinburgh Dementia Prevention) has received grant funding from Biogen, Janssen, AC Immune, and Actinogen; he is the unpaid chairperson of the Brain Health Clinic Consortium established in the UK by Biogen; was a member of a Data Safety Monitoring Board (DSMB) for a University College London (UCL) sponsored study (no payment); and serves as Director of Brain Health Scotland and Chair of the Scottish Dementia Research Consortium. Ian J. Deary has received royalties from Oxford University Press and Cambridge University Press. Archie Campbell was a member of the EMREC Edinburgh Medical Research Ethics Committee (no payment involved). Riccardo E. Marioni has received speaker fees from Illumina and is an advisor to the Epigenetic Clock Development Foundation. The remaining authors declare that they have no competing interests.

ETHICS STATEMENT

All components of the Generation Scotland study received ethical approval from the NHS Tayside Committee on Medical Research Ethics (REC Reference Numbers: 05/S1401/89 and 10/S1402/20). All participants provided broad and enduring written informed consent for biomedical research. Generation Scotland has also been granted Research Tissue Bank status by the East of Scotland Research Ethics Service (REC Reference Number: 20‐ES‐0021). This study was performed in accordance with the Declaration of Helsinki.

CODE AVAILABILITY

All code is available at the following Github repository. SUPPORTING INFORMATION Click here for additional data file. SUPPORTING INFORMATION Click here for additional data file. SUPPORTING INFORMATION Click here for additional data file. SUPPORTING INFORMATION Click here for additional data file.
  43 in total

1.  Epigenetic differences arise during the lifetime of monozygotic twins.

Authors:  Mario F Fraga; Esteban Ballestar; Maria F Paz; Santiago Ropero; Fernando Setien; Maria L Ballestar; Damia Heine-Suñer; Juan C Cigudosa; Miguel Urioste; Javier Benitez; Manuel Boix-Chornet; Abel Sanchez-Aguilera; Charlotte Ling; Emma Carlsson; Pernille Poulsen; Allan Vaag; Zarko Stephan; Tim D Spector; Yue-Zhong Wu; Christoph Plass; Manel Esteller
Journal:  Proc Natl Acad Sci U S A       Date:  2005-07-11       Impact factor: 11.205

2.  Genomic and phenotypic insights from an atlas of genetic effects on DNA methylation.

Authors:  Josine L Min; Gibran Hemani; Eilis Hannon; Koen F Dekkers; Juan Castillo-Fernandez; René Luijk; Elena Carnero-Montoro; Daniel J Lawson; Kimberley Burrows; Matthew Suderman; Andrew D Bretherick; Tom G Richardson; Johanna Klughammer; Valentina Iotchkova; Gemma Sharp; Ahmad Al Khleifat; Aleksey Shatunov; Alfredo Iacoangeli; Wendy L McArdle; Karen M Ho; Ashish Kumar; Cilla Söderhäll; Carolina Soriano-Tárraga; Eva Giralt-Steinhauer; Nabila Kazmi; Dan Mason; Allan F McRae; David L Corcoran; Karen Sugden; Silva Kasela; Alexia Cardona; Felix R Day; Giovanni Cugliari; Clara Viberti; Simonetta Guarrera; Michael Lerro; Richa Gupta; Sailalitha Bollepalli; Pooja Mandaviya; Yanni Zeng; Toni-Kim Clarke; Rosie M Walker; Vanessa Schmoll; Darina Czamara; Carlos Ruiz-Arenas; Faisal I Rezwan; Riccardo E Marioni; Tian Lin; Yvonne Awaloff; Marine Germain; Dylan Aïssi; Ramona Zwamborn; Kristel van Eijk; Annelot Dekker; Jenny van Dongen; Jouke-Jan Hottenga; Gonneke Willemsen; Cheng-Jian Xu; Guillermo Barturen; Francesc Català-Moll; Martin Kerick; Carol Wang; Phillip Melton; Hannah R Elliott; Jean Shin; Manon Bernard; Idil Yet; Melissa Smart; Tyler Gorrie-Stone; Chris Shaw; Ammar Al Chalabi; Susan M Ring; Göran Pershagen; Erik Melén; Jordi Jiménez-Conde; Jaume Roquer; Deborah A Lawlor; John Wright; Nicholas G Martin; Grant W Montgomery; Terrie E Moffitt; Richie Poulton; Tõnu Esko; Lili Milani; Andres Metspalu; John R B Perry; Ken K Ong; Nicholas J Wareham; Giuseppe Matullo; Carlotta Sacerdote; Salvatore Panico; Avshalom Caspi; Louise Arseneault; France Gagnon; Miina Ollikainen; Jaakko Kaprio; Janine F Felix; Fernando Rivadeneira; Henning Tiemeier; Marinus H van IJzendoorn; André G Uitterlinden; Vincent W V Jaddoe; Chris Haley; Andrew M McIntosh; Kathryn L Evans; Alison Murray; Katri Räikkönen; Jari Lahti; Ellen A Nohr; Thorkild I A Sørensen; Torben Hansen; Camilla S Morgen; Elisabeth B Binder; Susanne Lucae; Juan Ramon Gonzalez; Mariona Bustamante; Jordi Sunyer; John W Holloway; Wilfried Karmaus; Hongmei Zhang; Ian J Deary; Naomi R Wray; John M Starr; Marian Beekman; Diana van Heemst; P Eline Slagboom; Pierre-Emmanuel Morange; David-Alexandre Trégouët; Jan H Veldink; Gareth E Davies; Eco J C de Geus; Dorret I Boomsma; Judith M Vonk; Bert Brunekreef; Gerard H Koppelman; Marta E Alarcón-Riquelme; Rae-Chi Huang; Craig E Pennell; Joyce van Meurs; M Arfan Ikram; Alun D Hughes; Therese Tillin; Nish Chaturvedi; Zdenka Pausova; Tomas Paus; Timothy D Spector; Meena Kumari; Leonard C Schalkwyk; Peter M Visscher; George Davey Smith; Christoph Bock; Tom R Gaunt; Jordana T Bell; Bastiaan T Heijmans; Jonathan Mill; Caroline L Relton
Journal:  Nat Genet       Date:  2021-09-06       Impact factor: 41.307

3.  Increased Levels of Coagulation Factor XI in Plasma Are Related to Alzheimer's Disease Diagnosis.

Authors:  Edin Begic; Suncica Hadzidedic; Slobodan Obradovic; Zijo Begic; Mirsada Causevic
Journal:  J Alzheimers Dis       Date:  2020       Impact factor: 4.472

4.  Genome-Wide Association Studies for Cerebrospinal Fluid Soluble TREM2 in Alzheimer's Disease.

Authors:  Changan Liu; Jun Yu
Journal:  Front Aging Neurosci       Date:  2019-10-25       Impact factor: 5.750

5.  Cohort profile for the STratifying Resilience and Depression Longitudinally (STRADL) study: A depression-focused investigation of Generation Scotland, using detailed clinical, cognitive, and neuroimaging assessments.

Authors:  Alison D Murray; Andrew M McIntosh; Tina Habota; Anca-Larisa Sandu; Gordon D Waiter; Christopher J McNeil; J Douglas Steele; Jennifer A Macfarlane; Heather C Whalley; Ruth Valentine; Dawn Younie; Nichola Crouch; Emma L Hawkins; Yoriko Hirose; Liana Romaniuk; Keith Milburn; Gordon Buchan; Tessa Coupar; Mairi Stirling; Baljit Jagpal; Beverly MacLennan; Lucasz Priba; Mathew A Harris; Jonathan D Hafferty; Mark J Adams; Archie I Campbell; Donald J MacIntyre; Alison Pattie; Lee Murphy; Rebecca M Reynolds; Rebecca Elliot; Ian S Penton-Voak; Marcus R Munafò; Kathryn L Evans; Jonathan R Seckl; Joanna M Wardlaw; Stephen M Lawrie; Christopher S Haley; David J Porteous; Ian J Deary
Journal:  Wellcome Open Res       Date:  2021-07-16

6.  Genomic atlas of the human plasma proteome.

Authors:  Benjamin B Sun; Joseph C Maranville; James E Peters; David Stacey; James R Staley; James Blackshaw; Stephen Burgess; Tao Jiang; Ellie Paige; Praveen Surendran; Clare Oliver-Williams; Mihir A Kamat; Bram P Prins; Sheri K Wilcox; Erik S Zimmerman; An Chi; Narinder Bansal; Sarah L Spain; Angela M Wood; Nicholas W Morrell; John R Bradley; Nebojsa Janjic; David J Roberts; Willem H Ouwehand; John A Todd; Nicole Soranzo; Karsten Suhre; Dirk S Paul; Caroline S Fox; Robert M Plenge; John Danesh; Heiko Runz; Adam S Butterworth
Journal:  Nature       Date:  2018-06-06       Impact factor: 49.962

7.  Discovery and validation of plasma proteomic biomarkers relating to brain amyloid burden by SOMAscan assay.

Authors:  Liu Shi; Sarah Westwood; Alison L Baird; Laura Winchester; Valerija Dobricic; Fabian Kilpert; Shengjun Hong; Andre Franke; Abdul Hye; Nicholas J Ashton; Angharad R Morgan; Isabelle Bos; Stephanie J B Vos; Noel J Buckley; Mara Ten Kate; Philip Scheltens; Rik Vandenberghe; Silvy Gabel; Karen Meersmans; Sebastiaan Engelborghs; Ellen E De Roeck; Kristel Sleegers; Giovanni B Frisoni; Olivier Blin; Jill C Richardson; Régis Bordet; José L Molinuevo; Lorena Rami; Anders Wallin; Petronella Kettunen; Magda Tsolaki; Frans Verhey; Alberto Lleó; Daniel Alcolea; Julius Popp; Gwendoline Peyratout; Pablo Martinez-Lage; Mikel Tainta; Peter Johannsen; Charlotte E Teunissen; Yvonne Freund-Levi; Lutz Frölich; Cristina Legido-Quigley; Frederik Barkhof; Kaj Blennow; Henrik Zetterberg; Susan Baker; B Paul Morgan; Johannes Streffer; Pieter Jelle Visser; Lars Bertram; Simon Lovestone; Alejo J Nevado-Holgado
Journal:  Alzheimers Dement       Date:  2019-09-05       Impact factor: 21.566

8.  Epigenetics meets proteomics in an epigenome-wide association study with circulating blood plasma protein traits.

Authors:  Shaza B Zaghlool; Brigitte Kühnel; Mohamed A Elhadad; Sara Kader; Anna Halama; Gaurav Thareja; Rudolf Engelke; Hina Sarwath; Eman K Al-Dous; Yasmin A Mohamoud; Thomas Meitinger; Rory Wilson; Konstantin Strauch; Annette Peters; Dennis O Mook-Kanamori; Johannes Graumann; Joel A Malek; Christian Gieger; Melanie Waldenberger; Karsten Suhre
Journal:  Nat Commun       Date:  2020-01-03       Impact factor: 14.919

9.  Dickkopf-1 Overexpression in vitro Nominates Candidate Blood Biomarkers Relating to Alzheimer's Disease Pathology.

Authors:  Liu Shi; Laura M Winchester; Benjamine Y Liu; Richard Killick; Elena M Ribe; Sarah Westwood; Alison L Baird; Noel J Buckley; Shengjun Hong; Valerija Dobricic; Fabian Kilpert; Andre Franke; Steven Kiddle; Martina Sattlecker; Richard Dobson; Antonio Cuadrado; Abdul Hye; Nicholas J Ashton; Angharad R Morgan; Isabelle Bos; Stephanie J B Vos; Mara Ten Kate; Philip Scheltens; Rik Vandenberghe; Silvy Gabel; Karen Meersmans; Sebastiaan Engelborghs; Ellen E De Roeck; Kristel Sleegers; Giovanni B Frisoni; Olivier Blin; Jill C Richardson; Régis Bordet; José L Molinuevo; Lorena Rami; Anders Wallin; Petronella Kettunen; Magda Tsolaki; Frans Verhey; Alberto Lleó; Daniel Alcolea; Julius Popp; Gwendoline Peyratout; Pablo Martinez-Lage; Mikel Tainta; Peter Johannsen; Charlotte E Teunissen; Yvonne Freund-Levi; Lutz Frölich; Cristina Legido-Quigley; Frederik Barkhof; Kaj Blennow; Katrine Laura Rasmussen; Børge Grønne Nordestgaard; Ruth Frikke-Schmidt; Sune Fallgaard Nielsen; Hilkka Soininen; Bruno Vellas; Iwona Kloszewska; Patrizia Mecocci; Henrik Zetterberg; B Paul Morgan; Johannes Streffer; Pieter Jelle Visser; Lars Bertram; Alejo J Nevado-Holgado; Simon Lovestone
Journal:  J Alzheimers Dis       Date:  2020       Impact factor: 4.472

10.  Bayesian reassessment of the epigenetic architecture of complex traits.

Authors:  Daniel Trejo Banos; Daniel L McCartney; Marion Patxot; Lucas Anchieri; Thomas Battram; Colette Christiansen; Ricardo Costeira; Rosie M Walker; Stewart W Morris; Archie Campbell; Qian Zhang; David J Porteous; Allan F McRae; Naomi R Wray; Peter M Visscher; Chris S Haley; Kathryn L Evans; Ian J Deary; Andrew M McIntosh; Gibran Hemani; Jordana T Bell; Riccardo E Marioni; Matthew R Robinson
Journal:  Nat Commun       Date:  2020-06-08       Impact factor: 14.919

View more
  3 in total

1.  Integrated methylome and phenome study of the circulating proteome reveals markers pertinent to brain health.

Authors:  Danni A Gadd; Robert F Hillary; Daniel L McCartney; Liu Shi; Aleks Stolicyn; Neil A Robertson; Rosie M Walker; Robert I McGeachan; Archie Campbell; Shen Xueyi; Miruna C Barbu; Claire Green; Stewart W Morris; Mathew A Harris; Ellen V Backhouse; Joanna M Wardlaw; J Douglas Steele; Diego A Oyarzún; Graciela Muniz-Terrera; Craig Ritchie; Alejo Nevado-Holgado; Tamir Chandra; Caroline Hayward; Kathryn L Evans; David J Porteous; Simon R Cox; Heather C Whalley; Andrew M McIntosh; Riccardo E Marioni
Journal:  Nat Commun       Date:  2022-08-09       Impact factor: 17.694

Review 2.  Application of Bayesian genomic prediction methods to genome-wide association analyses.

Authors:  Anna Wolc; Jack C M Dekkers
Journal:  Genet Sel Evol       Date:  2022-05-13       Impact factor: 5.100

3.  Genome- and epigenome-wide studies of plasma protein biomarkers for Alzheimer's disease implicate TBCA and TREM2 in disease risk.

Authors:  Robert F Hillary; Danni A Gadd; Daniel L McCartney; Liu Shi; Archie Campbell; Rosie M Walker; Craig W Ritchie; Ian J Deary; Kathryn L Evans; Alejo J Nevado-Holgado; Caroline Hayward; David J Porteous; Andrew M McIntosh; Simon Lovestone; Matthew R Robinson; Riccardo E Marioni
Journal:  Alzheimers Dement (Amst)       Date:  2022-04-20
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.