Literature DB >> 35129252

Multimodal data integration via mediation analysis with high-dimensional exposures and mediators.

Abstract

Motivated by an imaging proteomics study for Alzheimer's disease (AD), in this article, we propose a mediation analysis approach with high-dimensional exposures and high-dimensional mediators to integrate data collected from multiple platforms. The proposed method combines principal component analysis with penalized least squares estimation for a set of linear structural equation models. The former reduces the dimensionality and produces uncorrelated linear combinations of the exposure variables, whereas the latter achieves simultaneous path selection and effect estimation while allowing the mediators to be correlated. Applying the method to the AD data identifies numerous interesting protein peptides, brain regions, and protein-structure-memory paths, which are in accordance with and also supplement existing findings of AD research. Additional simulations further demonstrate the effective empirical performance of the method.

Entities: Chemical

Keywords: Alzheimer's disease; mediation analysis; multimodal data integration; neuroimaging; principal component analysis

Mesh：

Year: 2022 PMID： 35129252 PMCID： PMC9057105 DOI： 10.1002/hbm.25800

Source DB: PubMed Journal: Hum Brain Mapp ISSN： 1065-9471 Impact factor: 5.399

INTRODUCTION

Alzheimer's disease (AD) is an irreversible neurodegenerative disorder and is characterized by progressive impairment of cognitive and bodily functions and ultimate death. It is currently affecting over 5.8 million American adults aged 65 years or older. Meanwhile, its prevalence continues to grow and is projected to reach 13.8 million by 2050 (Alzheimer's Association, 2020). Multimodal technologies have transformed AD research in recent years, by collecting different types of data from the same group of subjects and enabling the investigation of complex interrelated mechanisms underlying AD development. Notable examples include multimodal neuroimaging studies of the joint impact of brain structure and function on the disorders (Higgins, Kundu, & Guo, 2018; Liu et al., 2015), and imaging genetics studies of the impact of genetic variants on the brain then the disease outcome (Nathoo et al., 2019), among others. Our motivation is an imaging proteomics study, which is part of the Alzheimer's Disease Neuroimaging Initiative (ADNI) that aims to identify biomarkers for early detection and tracking of AD and to assist the development of prevention and intervention strategies. Amyloid‐ is a microscopic brain protein fragment, denotes peptides of 36–43 amino acids, and is part of a larger protein called amyloid precursor protein. Tau is a group of microtubule‐associated proteins predominantly found in brain cells and performs the function of stabilizing microtubules. Amyloid‐ is the main component of amyloid plaques, while tau is the main component of neurofibrillary tangles, both of which are commonly found in the brains of AD patients. Models of AD pathophysiology hypothesize a temporal sequence, in which accumulations of amyloid‐ plaques and neurofibrillary tangles disrupt cell‐to‐cell communications and destroy brain cells, leading to brain structural atrophy in regions such as the hippocampus, and ultimately a clinical decline in cognition (Mormino et al., 2009). However, it remains unclear how these two proteins interact with each other and with other proteins in the cerebrospinal fluid (CSF), and how those proteins together subsequently affect brain atrophy and disease progression. In our study, we aim to investigate simultaneously the interrelations of multiple protein peptides in the CSF, along with multiple brain regions of the whole brain, and their impact on memory. The problem can be formulated as a mediation analysis, where the goal is to identify and explain the mechanism, or path, that underlies an observed relationship between an exposure and an outcome variable, through the inclusion of an intermediate variable known as a mediator. It decomposes the effect of exposure on the outcome into a direct effect and an indirect effect, the latter of which indicates whether the mediator is on a path from the exposure to the outcome. In our multimodal AD study, the measurements of the amount of multiple protein peptides serve as the exposure variables, the volumetric measurements of multiple brain regions serve as the potential mediators, and a composite memory score serves as the outcome. See section 2 for more details about the study and the data. Our objective is to identify paths from proteins to brain regional atrophies that lead to memory decline. Mediation analysis was first proposed with a single exposure and a single mediator (Baron & Kenny, 1986). See VanderWeele (2016) for a review of mediation analysis and many references therein. In our setting, both the exposure variables and mediators are multivariate and potentially high‐dimensional. While there have been numerous extensions of mediation analysis to account for multiple mediators (see, e.g., Chén et al., 2017; Song et al., 2018; Zhao & Luo, 2022, among many others), there have been very few works studying multivariate exposures, or both multivariate exposures and mediators. Recently, Aung et al. (2020), Long, Irajizad, Doecke, Do, and Ha (2020), and Zhang (2021) proposed new approaches for mediation analysis of multivariate exposures and mediators. In particular, Zhang (2021) developed two regularization procedures and applied them to a mouse f2 dataset for diabetes, taking SNP genotypes as the exposures, islet gene expressions as the mediators, and insulin level as the outcome. However, they required the mediators to be independent, which hardly holds in our setting, as different brain regions are generally believed to influence each other. Aung et al. (2020) studied environmental toxicants on pregnancy outcomes, taking toxicants as the exposures, endogenous biomarkers such as inflammation and oxidative stress as the mediators, and gestational age at delivery as the outcome. A key strategy of their analysis was to reduce the exposure dimension by creating environmental risk scores for a small number of groups based on the domain knowledge. They showed that the between‐group correlation in the reduced exposures is negligible. However, such prior domain knowledge may not always be available. Long et al. (2020) proposed a general mediation framework to identify proteins that mediate the effect of metabolic gene expressions on survival for a type of kidney cancer, taking mRNA levels as the exposures, protein measures as the mediators, and survival time as the outcome. Nevertheless, they implicitly required the dimensions of the exposures and mediators cannot be too high, and thus their method is not directly applicable to our setting, where the number of exposures and mediators can both be potentially larger than the sample size. In this article, we propose a mediation analysis approach, with both high‐dimensional exposures and high‐dimensional mediators, for multimodal data analysis. The method integrates principal components analysis (PCA) with penalized least squares estimation for a set of linear structural equation models. The former reduces the dimensionality and produces uncorrelated linear combinations of the exposure variables, whereas the latter achieves path selection and effect estimation while allowing the multivariate mediators to be potentially correlated. We apply this approach to the imaging proteomics study of AD to integrate CSF proteomics, brain volumes, and a memory measure of mild cognitive impairment (MCI) subjects in ADNI. We identify several interesting protein peptides, brain regions, and protein–structure–memory paths that are in accordance with and also supplement the existing knowledge of AD. Additional simulations further demonstrate the efficacy of the method. Similar to Aung et al. (2020), Long et al. (2020), and Zhang (2021), our approach is among the first attempts to conduct mediation analysis where both the exposures and mediators are high‐dimensional. But unlike the existing solutions, we do not restrict the dimensionality or the correlation structures and do not require additional domain knowledge of the exposures or mediators. Moreover, although focusing on a multimodal neuroimaging study in this article, our proposed method is equally applicable to a wide range of multimodal data integration problems, for example, the multi‐omics data analysis (Richardson, Tseng, & Sun, 2016), and the multimodal healthcare study (Cai, Wang, Li, & Liu, 2019). As such, our proposal makes a useful addition to the general toolbox of both mediation analysis and multimodal data integration. The rest of the article is organized as follows. Section 2 introduces the motivating imaging proteomics data of AD. Section 3 presents the proposed model and estimation approach. Section 4 analyzes the AD dataset, with a detailed discussion on the identified protein peptides, brain regions, and path. Section 5 complements with additional simulation results to demonstrate the empirical performance of the method.

AD IMAGING PROTEOMICS STUDY

While Alzheimer's disease is becoming a major public health challenge as the population ages, there is no effective treatment for AD that is capable of stopping or slowing the associated cognitive and neuronal degradation. Therefore, understanding the disease pathology, identifying biological markers, and finding early diagnosis and intervention strategies are of critical importance (Alzheimer's Association, 2020). Among numerous AD‐related proteins in the CSF, amyloid‐ and tau are two major proteins that are consistently identified in the brains of AD patients, and their abnormal abundance generally indicates AD pathology (Jagust, 2018). Even though there has been evidence suggesting a pathological connection between amyloid‐ deposition, hippocampus atrophy, and memory decline (Mormino et al., 2009), it remains largely unknown how amyloid‐ and tau interact with each other, how they interact with other proteins in the CSF, and how these proteins together affect the downstream brain atrophy and cognitive outcome. In our study, we aim to delineate the regulatory relationships among multiple CSF proteins, structural atrophy of the whole brain, and cognitive behavior, and to identify important biological paths. The data used in our study are obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI, adni.loni.usc.edu). The CSF proteomics data were obtained using targeted liquid chromatography multiple reaction monitoring mass spectrometry, which is a highly specific, sensitive, and reproducible technique for quantifying targeted proteins. A list of protein fragments, or peptides, was sent to the detector. The samples then went through peak integration, outliers detection, normalization, quantification, and quality control using test/re‐test samples. This procedure results in the intensity measures of 320 peptides that are annotated from 142 proteins. The brain imaging data were obtained using anatomical magnetic resonance imaging (MRI). Each image was first preprocessed following the standard pipeline, then mapped to an atlas consisting of 145 brain regions‐of‐interest to extract the volumetric measures (Doshi et al., 2016). The atlas used in the study spans the entire brain and was actually built on multiple atlases. Individual atlases were first warped to the target image using a nonlinear registration method, followed by a spatially adaptive weighted voting strategy to fuse into a final segmentation. Moreover, the volume of each brain region was standardized by the total intracranial volume to account for variations of individual brain size. The cognitive outcome is a composite memory score, ADNI‐MEM, that involves a battery of neuropsychological tests. In our study, we focus on 135 subjects diagnosed as mild cognitive impairment (MCI) patients at recruitment. MCI is a prodromal stage of AD, with a slight but noticeable and measurable decline in cognitive abilities. A person with MCI is at an increased risk of developing AD or other dementia. Understanding the pathologic mechanism underlying MCI provides important clues of onset of the disorder as well as a useful guide for early diagnosis and intervention.

MODEL AND METHOD

We first present the proposed model, then an estimation method integrating principal components analysis and penalized estimation.

Model

Suppose there are totally subjects. Let denote the ‐dimensional vector of exposure variables, denote the ‐dimensional vector of mediators, and denote the univariate outcome variable, for subjects . In our imaging proteomics study, denotes the protein peptide measures with , denotes the brain volumetric measures with , denotes the memory score, and the sample size . The first step of our method is to perform a principal components analysis on to produce uncorrelated composite exposures. If further follows a multivariate normal distribution, then the produced composite exposures are independent. Let denote the first principal components. We then continue to model the path relations among and via the following set of linear structural equation models, where , , stack the composite exposures, mediators, and outcome across all subjects, respectively, , with , and are measurement errors. Suppose both error terms follow some zero mean normal distribution, and is independent of , is independent of and , and and are independent of each other. The parameters , , and capture the path effects. Model 1 is similar to that used in Zhao, Li, and Caffo (2021) and Zhao and Luo (2022), but none of those can handle multivariate exposure variables. Besides, we introduce some different forms of penalty functions in our parameter estimation. Figure 1 shows a schematic description of Model 1. Under this model, we define the direct effect of on as , the indirect effect of on through as , and the total indirect effect of on as , for . The total effect of satisfies that .

FIGURE 1

The schematic diagram of the proposed model with exposure variables , mediators , and the outcome variable

The schematic diagram of the proposed model with exposure variables , mediators , and the outcome variable A key characteristic of Model 1 is that it allows the multivariate mediators to be conditionally dependent given the exposures. To better illustrate this, we consider a simple example of Model 1, where , as shown in Figure 2. In this example, Figure 2a outlines the sequential influences among all the mediators, while Figure 2b is the proposed Model 1. We see that, for the first mediator, , ; for the second mediator, , ; and for the third mediator, , . As such, consolidates the effects through the th mediator , and the indirect effect can be viewed as the consolidated indirect effect through , .

FIGURE 2

A model example with exposure variable and sequentially ordered mediators

Estimation

We propose to estimate the parameters in Model 1 through the penalized ordinary least squares, where the loss function is the usual least squares loss, are three penalty functions, with the tuning parameters , respectively. We next discuss each penalty function in detail. The first penalty function is of the form, for some parameters and . It is a generalization of the pathway Lasso penalty of Zhao and Luo (2022) to exposure variables, and is to facilitate selection of individual mediators. Specifically, for a given mediator , the term is a product Lasso penalty, and encourages all the paths going through to be shrunk to zero, which in effect achieves the goal of mediator selection. The term is to make the penalty a convex function, with a proper choice of the parameter . It is straightforward to show that, when , the sum is convex. In our implementation, we fix . The last term in is the sum of usual Lasso penalty that further penalizes individual path effects , with being an additional tuning parameter. It is found that this additional penalty helps further improves the selection accuracy (Zhao & Luo, 2022). The second penalty function is of the form, It is a group Lasso penalty and is to facilitate the selection of individual exposure. Specifically, for a given exposure , the penalty encourages all the paths originating from to be shrunk to zero, which in effect achieves the goal of exposure selection. The third penalty function is of the form, This is simply the usual Lasso penalty and is to facilitate selection of direct effects between the exposures and the outcome. We next discuss how to solve the minimization problem (2). We note that (2) involves the penalties on the product terms , making it difficult to derive the analytical solutions. As such, we first introduce a new parameter, , which turns (2) to an equivalent problem of solving a sparse group lasso that has an explicit form of solution (Simon, Friedman, Hastie, & Tibshirani, 2013). That is, letting , we turn to the equivalent optimization problem, Let , , and introduce the augmented Lagrangian parameter , for , and . Then, the augmented Lagrangian form of (3) is where is the augmented Lagrangian constant that we set in our implementation, is the Hadamard product, is the inner product, and is the ‐norm. We next solve (4) by updating and iteratively. More specifically, we first fix at iteration , and update by solving for , where is the ‐norm. There is a closed‐form solution, for , where , is the soft‐thresholding function with denoting the sign of and , and denotes the element‐wise soft‐thresholding of a vector . We next fix , and update by solving where , , is a diagonal matrix with as the diagonal elements, is the th column of , and is the ‐dimensional identity matrix. The solution is We next fix , and update by solving where , , and is a diagonal matrix with as the diagonal elements. The solution is We then fix , and update by solving where and . The solution is Finally, we fix , and update by We stop the iterations until some stopping criterion is met. In our implement, we stop when the difference of two consecutive objective values is smaller than . We summarize the above optimization procedure in Algorithm 1. Input: and the tuning parameters and 1: initialization: 2: repeat 3: update given by (5), for 4: update given by (6), for 5: update given by (7) 6: update given by (8) 7: update given by (9), for 8: until the stopping criterion is met Output: We tune the parameters in (4) using the Bayesian information criterion (BIC), where are the estimates under a given set of tuning parameters and , denotes the active set, and is the cardinality. In our implementation, we adopt the tuning strategy of Zou and Hastie (2005), by tuning the ratios along with in a grid search, and choose the best set of parameters that minimizes the BIC.

AD IMAGING PROTEOMICS STUDY REVISITED

We apply the proposed method to the ADNI imaging proteomics data, taking the CSF peptide measures as the exposures, the brain volumetric measures as the mediators, and the memory score as the outcome. Moreover, we adjust the exposures, mediators, and outcome for age, gender, ApoE4, and years of education to remove potential confounding effects (Rosenbaum, 2002). We first summarize the identified paths with nonzero effects, then discuss the relevant proteins and brain regions in detail. In summary, our findings are consistent with the existing knowledge of AD. Moreover, our method also suggests a few potentially interesting protein–structure–memory paths that may deserve further examination and verification.

Paths with nonzero effects

We first apply principal components analysis to the peptide data. The top 20 principal components (PCs) account for about of total data variation. We thus focus on those top PCs and feed them as the exposure variables into the subsequent penalized path analysis. Figure 3 presents all the identified paths with a nonzero indirect path effect. Table 1 presents the estimated path effects including the estimated and of each path, and Table 2 presents the indirect, direct, and total effect of each exposure PC.

FIGURE 3

TABLE 1

Brain regions with nonzero indirect effect () in the AD imaging proteomics study

	Brain regions as mediators		Principal components of peptides as exposures							β (×10−2)
	Brain regions as mediators		PC1	PC2	PC4	PC5	PC7	PC9	PC19	β (×10−2)
R41	Left cerebellum white matter	α				−0.17
R41	Left cerebellum white matter	IE (×10−3)				−1.30				0.76
R47	Right hippocampus	α			0.11	0.13	0.13
R47	Right hippocampus	IE (×10−3)			1.12	1.60	1.52			1.17
R48	Left hippocampus	α	0.11		0.13	0.13	0.22	0.12
R48	Left hippocampus	IE (×10−3)	1.34		1.59	1.76	3.40	1.55		1.20
R49	Temporal horn of right lateral ventricle	α	−0.25	0.15	−0.26	−0.18		−0.16
R49	Temporal horn of right lateral ventricle	IE (×10−3)	2.06	−1.01	2.03	1.28		1.08		−0.66
R50	Temporal horn of left lateral ventricle	α	−0.29		−0.23	−0.25		−0.21
R50	Temporal horn of left lateral ventricle	IE (×10−3)	2.55		1.78	2.05		1.74		−0.71
R51	Right lateral ventricle	α	−0.36
R51	Right lateral ventricle	IE (×10−3)	1.06							−0.30
R52	Left lateral ventricle	α	−0.36
R52	Left lateral ventricle	IE (×10−3)	1.15							−0.27
R73	Cerebellar vermal lobules VIII‐X	α		0.14
R73	Cerebellar vermal lobules VIII‐X	IE (×10−3)		1.88						1.00
R103	Left anterior insula	α		−0.17	0.20	0.25
R103	Left anterior insula	IE (×10−3)		−1.13	1.27	1.76				0.56
R106	Right angular gyrus	α					0.15		−0.18
R106	Right angular gyrus	IE (×10−3)					1.03		−1.41	0.78
R117	Left entorhinal areas	α				0.17
R117	Left entorhinal areas	IE (×10−3)				1.15				0.76
R120	Right frontal pole	α			0.16
R120	Right frontal pole	IE (×10−3)			1.12					0.75
R121	Left frontal pole	α							−0.16
R121	Left frontal pole	IE (×10−3)							−1.12	0.77
R122	Right fusiform gyrus	α	0.16
R122	Right fusiform gyrus	IE (×10−3)	1.32							1.02
R123	Left fusiform gyrus	α	0.19
R123	Left fusiform gyrus	IE (×10−3)	1.12							0.66
R154	Right middle temporal gyrus	α			0.21			0.19
R154	Right middle temporal gyrus	IE (×10−3)			1.68			1.66		0.74
R155	Left middle temporal gyrus	α	0.13		0.14	0.18	0.14	0.15
R155	Left middle temporal gyrus	IE (×10−3)	1.01		1.09	1.63	1.05	1.35		0.79
R169	Left precuneus	α			0.15
R169	Left precuneus	IE (×10−3)			1.10					0.82
R172	Right posterior insula	α	0.13	−0.12	0.22	0.11	0.11
R172	Right posterior insula	IE (×10−3)	1.44	−1.30	2.67	1.03	1.00			1.03
R173	Left posterior insula	α		−0.17	0.24	0.15
R173	Left posterior insula	IE (×10−3)		−1.46	2.18	1.09				0.82
R182	Right precentral gyrus	α							−0.24
R182	Right precentral gyrus	IE (×10−3)							1.91	−0.51

TABLE 2

The estimated indirect effects (IE), direct effects (DE), and total effects (TE) of the top principal components

	PC1	PC2	PC4	PC5	PC6	PC7	PC9	PC11	PC14	PC15	PC16	PC19	Total
IE	0.013	−0.003	0.018	0.012		0.008	0.007					−0.001	0.054
DE	0.138			0.066	−0.035	0.168	0.065	−0.018	0.102	−0.007	0.156		0.634
TE	0.151	−0.003	0.018	0.078	−0.035	0.176	0.072	−0.018	0.102	−0.007	0.156	−0.001	0.688

Note: The PCs with zero IE and DE are not presented in the table.

The estimated paths for the AD imaging proteomics study. The red nodes denote the principal components of the peptides as exposures, the green nodes the brain regions as mediators, and the blue node the memory score as outcome. The red arrows indicate positive path effects, and the blue arrows negative path effects Brain regions with nonzero indirect effect () in the AD imaging proteomics study The estimated indirect effects (IE), direct effects (DE), and total effects (TE) of the top principal components Note: The PCs with zero IE and DE are not presented in the table.

Proteins

Among the 20 PCs, seven have nonzero indirect effects on memory. Next, we focus on PC1, PC4, and PC5 as they account for a higher proportion of total data variation and demonstrate a relatively higher indirect path effect on the outcome. To better interpret the PCs, the loading profiles are sparsified following the sparse PCA approach (Zou, Hastie, & Tibshirani, 2006). The fused lasso regularization (Tibshirani, Saunders, Rosset, Zhu, & Knight, 2005) is considered to impose local consistency and smoothness within the same protein. Table 3 lists the top proteins in PC1, PC4, and PC5, and the corresponding gene name. We also include the regulation directions found in the AD literature, where an upregulation compared to cognitive normal controls indicates a higher protein abundance in MCI/AD patients, as well as the direction of correlations with the CSF amyloid‐ and tau, the two well‐established AD protein biomarkers (Wesenhagen, Teunissen, Visser, & Tijms, 2020). We next discuss the identified proteins by their relevance in the amyloid‐ and tau pathology.

TABLE 3

Proteins with top loading magnitude in PC1, PC4, and PC5

Protein	Loading	Gene	Direction	Correlation
Protein	Loading	Gene	Direction	tau	amyloid
PC1
Neuroblastoma suppressor of tumorigenicity 1	0.283	NBL1	↑
Spondin‐1	0.160	SPON1	↑	↑	↓
VPS10 domain‐containing receptor SorCS1	0.152	SORCS1		↑	↓
ProSAAS	0.116	PCSK1N	⇕
Prostagiandin‐H2 D‐isomerase	0.110	PTGDS	↓		↓
Neuronal growth regulator 1	0.110	NEGR1	↓
Monocyte differentiation antigen CD14	0.109	CD14	↑
Cell adhesion molecule 3	0.103	CADM3	↓
PC4
Beta‐2‐microglobulin	−0.252	B2M	⇕	↓
Neuronal pentraxin‐2	0.190	NPTX2	↓		↑
Insulin‐like growth factor‐binding protein 2	−0.147	IGFBP2	⇕	↑
Neuronal pentraxin‐1	0.137	NPTX1	↓
Kallikrein‐6	−0.129	KLK6	↑	↑	↑
Apolipoprotein D	−0.121	APOD	⇕	↑
Neurexin‐2	0.117	NRXN2	⇕
Cystatin‐C	−0.116	CST3	⇕	⇕	↑
PC5
Superoxide dismutase (Cu‐Zn)	0.236	SOD1	↓	↑	↓
Neurosecretory protein VGF	0.195	VGF	↓		↓
Ectonucleotide pyrophosphatase/phosphodiesterase family member 2	−0.152	ENPP2	↑	↓
Complement C4‐A	−0.152	C4A	↑
Complement factor B	0.121	CFB	↑
Glial fibrillary acidic protein	−0.120	GFAP	↑
Mimecan	−0.105	OGN	⇕
Chromogranin‐A	0.103	CHGA	⇕	↑	↑
Alpha‐1B‐glycoprotein	0.102	A1BG	⇕

Note: For each protein, direction of protein level in MCI/AD compared to normal control and correlation with CSF tau and amyloid reported in the literature are provided. , consistently upregulated in MCI/AD or positively correlated; , consistently downregulated in MCI/AD or negatively correlated; , inconsistent reports.

Proteins with top loading magnitude in PC1, PC4, and PC5 Note: For each protein, direction of protein level in MCI/AD compared to normal control and correlation with CSF tau and amyloid reported in the literature are provided. , consistently upregulated in MCI/AD or positively correlated; , consistently downregulated in MCI/AD or negatively correlated; , inconsistent reports.

Proteins related to amyloid pathology

Among the top‐loaded proteins, SPON1, SORCS1, PTGDS, CST3, NPTX2, VGF, and CHGA have been found to be related to amyloid‐ pathology in AD. The accumulation of amyloid‐ is generally considered a hallmark of AD, which is derived from the amyloid precursor protein (APP) through sequential cleavages by beta‐site amyloid precursor protein cleaving enzyme 1 (BACE1) and ‐secretase (Vassar et al., 1999). Blocking BACE1 can potentially reduce the abundance in amyloid‐, however, this may prohibit the other functions of BACE1 in psychological activities. For SPON1, using an in vivo AD mouse model, it was found that, by injecting SPON1, the amount of amyloid‐ was significantly reduced, and subsequently, the ameliorated cognitive dysfunction and memory impairment were improved, suggesting SPON1 to be a potential AD therapy target (Park et al., 2020). Interacting with APOE, human SPON1 suppresses amyloid‐ level through the APP transgene, and has an impact on working memory performance through the activation of the triangular part of the right inferior frontal gyrus (Liu et al., 2018). For NPTX1 and NPTX2, both belong to the family of long neuronal pentraxins. Together with NPTXR, they bind AMPA type glutamate receptors and contribute to multiple forms of developmental and adult synaptic plasticity. Using an AD mouse model, reduction in NPTX2 together with amyloidosis was found to induce a synergistic reduction of inhibitory circuit function. In AD subjects, the level of NTPX2 was found to be related to hippocampal volume, as well as cognitive decline (Xiao et al., 2017). For CST3, cysteine proteases, including cathepsin B (CatB), is a recently discovered amyloid‐‐degrading enzyme. Using a mouse model, CST3 was discovered to be a key inhibitor of CatB‐induced amyloid‐ degradation in vivo. Genetic ablation of CST3 significantly reduced soluble amyloid‐ levels, and attenuated associated cognitive deficits and behavioral abnormalities, and restored synaptic plasticity in hippocampus (Sun et al., 2008). For VGF, through a mouse model, over‐expression of neuropeptides precursor VGF was found to partially rescue amyloid‐‐mediated memory impairment and neuropathology, suggesting a possible causal role of VGF in protecting against AD pathogenesis and progression (Beckmann et al., 2020). For SORCS1, through a meta‐analysis of 16 SORCS1‐single nucleotide polymorphisms (SNPs) in six independent datasets, it was found that over‐expression of SORCS1 can reduce ‐secretase activity and amyloid‐ levels, and the suppression of SORCS1 can increase ‐secretase processing of APP and the levels of amyloid‐ (Reitz et al., 2011). For PTGDS, it is one of the most abundant proteins in the CSF, which binds and transports small lipophilic molecules such as amyloid‐, and thus has been considered as the endogenous amyloid‐ chaperone (Kanekiyo et al., 2007), and is believed to play an important role in AD development. For CHGA, compared to the normal controls, the level of CHGA was significantly higher in the CSF of patients with MCI, especially with MCI progressing to AD (Duits et al., 2018). CHGA is the major soluble protein in catecholamine storage vesicles, abnormalities of which may play a central role in memory deficits in AD. Elevation of CHGA was observed in AD brains, and was believed to play a role in amyloid‐ pathology (Mattsson et al., 2013; O'Connor, Kailasam, & Thal, 1993). It has also been found that CHGA is negatively associated with hippocampal and entorhinal volume (Khan et al., 2015).

Proteins related to tau pathology

For IGFBP2, it is an abundant cerebral insulin‐like growth factor signaling protein associated with the AD biomarkers. In both AD mouse models and AD patients, IGFBP2 was observed to be associated with CSF tau levels and brain atrophy in nonhippocampal regions, suggesting that it is relevant in neurodegeneration through tau pathology (Bonham et al., 2018).

Proteins related to both amyloid and tau pathology

There was evidence showing that proteins KLK6 and SOD1 were relevant in both amyloid and tau pathology. For SOD1, using an APP‐overexpressing mouse model, SOD1 deficiency was found to accelerate amyloid‐ oligomerization, induce tau phosphorylation and lower levels of synaptophysin, and consequently memory impairment (Murakami et al., 2011). Kallikrein‐related peptidases (KLKs) represent the largest family of secreted serine proteases. Human KLK6 is the most abundant KLKs in the spinal cord, brain stem, cerebral cortex including the hippocampus and thalamus. It has been found that KLK6 cleaves APP and mediates cleavage of laminin and collagen, which has implications for APP processing and amyloid‐ mediated neurotoxicity (Angelo et al., 2006; Small, Nurcombe, Clarris, Beyreuther, & Masters, 1993). In AD patients, the level of KLK6 in CSF is significantly elevated and is associated with levels of CSF tau suggesting a potential marker of tau pathology (Goldhardt et al., 2019).

Other AD‐related proteins

NRXN2 is another protein marker that was found to be up‐regulated among MCI patients, especially with MCI progression to AD (Duits et al., 2018). APOD was found to be elevated in the prefrontal cortex associated with cognitive decline (Thomas et al., 2003). GFAP immunohistochemistry is a marker to assess the oxidative stress and glial cell activation expressed in astrocytes. Focusing on the human entorhinal cortex and hippocampus, the GFAP expression was observed in the hippocampus of AD patients (Hol et al., 2003). B2M is a component of major histocompatibility complex class 1 molecules. Increased soluble B2M has been discovered in the CSF of patients with AD, and was associated with cognitive decline (Carrette et al., 2003). Using mouse models, elevated B2M was observed in the hippocampus of aged mice. Injecting exogenous B2M locally in the hippocampus, impaired hippocampal‐dependent cognitive function and neurogenesis were observed in young mice. The findings suggest that the accumulation of B2M increases the risk of age‐related cognitive dysfunction and neurogenesis impairment (Smith et al., 2015).

Proteins related to brain structure/atrophy

NEGR1 is a member of the immunoglobulin superfamily of cell adhesion molecules, and is involved in cortical layering. Using a NEGR1‐targeted mouse model, brain morphological analysis revealed NEGR1‐related neuroanatomical abnormalities, including enlargement of ventricles and decrease in the volume of the whole brain, corpus callosum, globus pallidus, and hippocampus (Singh et al., 2019). CST3 was discovered to be related to a higher hippocampal atrophy rate (Paterson et al., 2014), and atrophy in the entorhinal cortex (Mattsson et al., 2014). APOD and NPTX2 were found to be related to medial temporal lobe atrophy (Mattsson et al., 2014; Swanson et al., 2016).

Brain regions

While Table 1 lists the brain regions with nonzero path effects induced by PC1, PC4, and PC5, Figure 4 visualizes those regions on a template brain. The identified brain regions include the hippocampus, the entorhinal cortex, cortical regions on the temporal, parietal and frontal lobes, the lateral ventricles, and the cerebellum. Brain structural atrophy occurs early in the medial temporal lobe, including the hippocampus and entorhinal cortex, then extends soon after to the rest of the cortical areas, usually following a temporal, parietal, frontal trajectory, whereas the motor areas are affected toward late stages. (Pini et al., 2016). We next discuss those identified brain regions roughly following this trajectory.

FIGURE 4

Brain regions with a nonzero mediation effect in (a) PC1, (b) PC4, and (c) PC5

The hippocampus and entorhinal cortex

The hippocampus is a major component of the human brain located in the medial temporal lobe, and is functionally involved in response inhibition, episodic memory, and spatial cognition. Hippocampal atrophy is the best established and validated biomarker across the entire disease spectrum (Jack Jr et al., 2011). The entorhinal cortex also locates in the medial temporal lobe. It connects the neocortex and the hippocampus that receives information from the neocortex and projects to the hippocampus through the perforant pathway (Insausti, Tunon, Sobreviela, Insausti, & Gonzalo, 1995). It has been consistently reported that, compared to the healthy controls, entorhinal atrophy was observed in the MCI patients, and more severe atrophy in the AD patients (Pini et al., 2016). The hippocampus and entorhinal cortex, as well as the anatomically related parahippocampal and perirhinal cortices, are parts of the medial temporal lobe memory system. Impairments of this system are responsible for the deficit in episodic memory, and are early hallmark of AD (Nadel & Hardt, 2011).

The lateral temporal, parietal, and frontal cortex

The gray matter loss in the lateral temporal cortex, dorsal parietal, parietal angular and frontal cortex occurs during the progression from incipient to mild AD. During this period, cognitive deficits have been observed in both memory and nonmemory domains, including language, visuo‐spatial and executive function (Frisoni, Prestia, Rasser, Bonetti, & Thompson, 2009). Moreover, a higher amount of tau deposition has been observed in the middle temporal cortex, fusiform gyrus, and entorhinal cortex (Schultz et al., 2018). The fusiform gyrus is critical in facial recognition. Alterations of gene expression specific to the fusiform gyrus were discovered in AD patients (Ma et al., 2020). The left middle temporal gyrus is related to the recognition of known faces and accessing word meaning while reading (Acheson & Hagoort, 2013). The precuneus, a hub of the default mode network, has been found to be related to episodic memories (Sadigh‐Eteghad, Majdi, Farhoudi, Talebi, & Mahmoudi, 2014). Atrophy in the entorhinal cortex, fusiform, middle temporal gyrus, precuneus, and precentral has been noted in AD (Parker et al., 2018). The association between atrophy in the insular cortex and memory deficits in AD has been reported too (Lin et al., 2017).

The lateral ventricles

The ventricles are one of the interests in brain atrophy research as the volumetric measurement is robust to automatic segmentation due to the sharp contrast between the CSF in the ventricles and surrounding tissue in T1‐weighted images. Thus, as a complement metric of hemispheric atrophy rates, enlargement in the lateral ventricles is an important marker of AD progression (Kruthika et al., 2019).

The cerebellum

The cerebellum is involved in cognition and emotion and communicates with cerebral cortices in a topographically organized manner. Based on existing evidence of cerebellar modulation of cognition and emotion, it was hypothesized that there exists cerebellar contribution to the cognitive and neuropsychiatric deficits in AD. However, more research is required to validate the hypothesis and to understand cerebrocerebellar interactions in AD pathology (Jacobs et al., 2018).

SIMULATION STUDY

We complement our data analysis with some additional simulation studies to further examine the empirical performance of the proposed method. We generate () from a multivariate normal distribution with mean zero and a covariance matrix whose eigenvalues exponentially decay. After applying PCA, we obtain , where is chosen such that the top PCs account for over of total data variation. We then generate and following Model 1 given . We set of the path effects to be nonzero. We consider two sets of data dimension, , and , the latter of which has a similar data dimension as in the ADNI dataset. We also consider three sample sizes, . We compare the proposed approach with an approach based on the univariate mediation analysis (Imai, Keele, & Tingley, 2010). After the PCs are obtained, univariate mediation analysis is performed for each mediator and each exposure PC and finish with a ‐value correction (Benjamini & Hochberg, 1995). Table 4 presents the estimated total indirect effects and the indirect effects of the top six PCs, and Table 5 presents the estimated number of PCs and the sensitivity and specificity of the identified nonzero path effects. Among all cases, the estimated number of PCs is 6, which agrees with the truth. From the tables, we observe that the proposed method achieves a competitive performance, and the performance improves, with a lower estimation error and a higher selection accuracy, as the sample size increases. For the univariate‐based approach (UniMed), the performance in estimating the effects does not improve as the sample size increases and the power of identifying nonzero mediation effects is much lower.

TABLE 4

The estimation bias and mean squared error (MSE) of estimating the total indirect effect and indirect effect of top PCs in the simulation study

r	p		Truth	n=100				n=500				n=1000
				UniMed		PathLasso		UniMed		PathLasso		UniMed		PathLasso
				Bias	MSE	Bias	MSE	Bias	MSE	Bias	MSE	Bias	MSE	Bias	MSE
100	100	Total	−20	29.203	8829.276	9.030	128.080	12.375	14868.680	−1.827	16.593	9.937	19518.110	−0.921	13.508
		PC1	−12	9.219	1740.035	7.172	61.226	9.754	3045.273	0.033	1.843	10.971	3015.491	0.035	2.883
		PC2	0	7.474	4086.099	−0.883	15.244	−3.086	11896.018	−1.410	9.968	−5.698	15284.011	−0.856	4.364
		PC3	−8	10.254	1249.422	3.205	20.100	6.314	1609.520	−0.168	2.531	4.989	1764.686	−0.032	2.248
		PC4	0	0.900	410.946	−0.477	11.943	−0.830	88.795	−0.272	4.242	−0.270	44.037	−0.067	1.963
		PC5	0	1.043	229.951	−0.049	5.168	−0.084	40635	−0.078	1.328	−0.077	18.711	0.002	0.611
		PC6	0	0.369	185.175	0.085	2.809	0.139	36.633	0.054	0.794	0.024	17.296	0.019	0.370
350	150	Total	8	7.246	24906.450	−6.317	80.520	12.668	50394.860	−1.164	37.593	39.456	84070.740	−1.284	19.271
		PC1	−8	17.762	9155.026	6.461	54.644	29.582	28896.549	0.723	13.823	24.086	39594.888	−0.511	3.616
		PC2	12	−7.415	5417.007	−9.528	102.378	−5.064	13073.687	−0.933	20.122	−1.226	14461.470	0.816	3.045
		PC3	4	5.364	4131.577	−3.446	27.204	−10.482	11714.378	−1.398	10.500	17.494	24773.663	−1.740	7.489
		PC4	0	−4.088	2540.851	−0.093	8.015	1.605	485.008	0.147	2.595	−0.202	230.990	−0.033	1.403
		PC5	0	3.947	2552.043	0.153	5.445	−0.355	627.776	0.060	1.615	−0.271	237.362	0.066	0.781
		PC6	0	−8.322	4271.384	0.137	6.489	−2.618	610.476	0.238	1.054	−0.424	387.267	0.117	0.750

Note: UniMed is an approach based on univariate mediation analysis. PathLasso is the proposed approach.

TABLE 5

The estimated number of PC (and the standard error, SE) in the PCA step and sensitivity and specificity of identifying paths with a nonzero path effect in the simulation study

r	p	n	# PC (SE)	UniMed		PathLasso
r	p	n	# PC (SE)	Sensitivity	Specificity	Sensitivity	Specificity
100	100	100	6.03 (0.17)	0.55	0.98	0.84	0.53
		500	5.21 (0.41)	0.78	0.97	1.00	0.89
		1000	6.26 (0.44)	0.86	0.97	1.00	0.91
350	150	100	5.99 (0.10)	0.62	0.97	0.80	0.57
		500	6.00 (0.00)	0.85	0.96	1.00	0.89
		1000	6.00 (0.00)	0.87	0.95	1.00	0.91

Note: UniMed is an approach based on univariate mediation analysis. PathLasso is the proposed approach.

The estimation bias and mean squared error (MSE) of estimating the total indirect effect and indirect effect of top PCs in the simulation study Note: UniMed is an approach based on univariate mediation analysis. PathLasso is the proposed approach. The estimated number of PC (and the standard error, SE) in the PCA step and sensitivity and specificity of identifying paths with a nonzero path effect in the simulation study Note: UniMed is an approach based on univariate mediation analysis. PathLasso is the proposed approach.

DISCUSSION

In this study, we propose a mediation framework with high‐dimensional exposures and high‐dimensional mediators. The framework integrates the PCA with marginal linear SEMs, where the PCA leads to multiple independent exposures and the marginal SEMs allow the mediators to be dependent. A regularization combining the Group Lasso and the Pathway Lasso is considered to achieve simultaneous exposure and mediator selection. Through simulation studies, the proposed approach yields competitive estimation performance and selection accuracy. The proposed framework is applied to integrate the CSF proteomics data, the brain volumetric data, and a memory measurement acquired from MCI subjects in ADNI. Several protein–imaging–memory pathways are identified, which are in accordance with existing knowledge about AD. The proposed framework is among the first attempts to conduct mediation analysis where both the exposure and mediator are high dimensional. It also fits in the context of integrating multiview data. In this study, pathology of deficits in memory among MCI patients induced by CSF protein deposition and mediated by brain atrophy is articulated. Integrating proteomics with neuroimaging data on a large scale is not commonly seen in the existing literature. One can apply the proposed approach to integrate other types of data under mechanistic and causal assumptions (Data S1, Supporting Information). For example, in an imaging‐genetics study, the genetic/genomic data are the exposures and the neuroimaging data are the mediators. Another example is to integrate multimodal neuroimaging data with the structural imaging data as the exposures and the functional imaging data as the mediators based on Hebb's law (Hebb, 2005). Another direction of application is in a longitudinal study, where imaging (or omics) data collected at two (consecutive) time points can be considered as the exposures and mediators, respectively, and a phenotyping measurement at the end of study is the outcome. The temporal ordering in the measurements intrinsically infers the causality. In order to account for the dependence between the exposures, the PCA is employed. It not only rotates the exposures into independent components, but also significantly reduces the data dimension. However, one drawback of PCA is that the loadings are not sign identifiable. Thus, both the estimated indirect and direct effect are sign sensitive. In the analysis, we keep the highest loading in each component to be positive. The current study focuses on selecting exposures and their induced mediation pathways and estimating the indirect/direct effects. Post‐selection inference is also an important question. We leave the study of drawing statistical inference to future research. In the current study, PCA is considered as an initial step to decorrelate the exposures. A next step will be merging this decomposition into the mediation optimization. Appendix S1: Supporting information Click here for additional data file.

53 in total

Review 1. Mediation Analysis: A Practitioner's Guide.

Authors: Tyler J VanderWeele
Journal: Annu Rev Public Health Date: 2015-11-30 Impact factor: 21.981

Review 2. Imaging the evolution and pathophysiology of Alzheimer disease.

Authors: William Jagust
Journal: Nat Rev Neurosci Date: 2018-11 Impact factor: 34.870

Review 3. Cerebrospinal fluid proteomics and biological heterogeneity in Alzheimer's disease: A literature review.

Authors: Kirsten E J Wesenhagen; Charlotte E Teunissen; Pieter Jelle Visser; Betty M Tijms
Journal: Crit Rev Clin Lab Sci Date: 2019-11-07 Impact factor: 6.250

4. Bayesian shrinkage estimation of high dimensional causal mediation effects in omics studies.

Authors: Yanyi Song; Xiang Zhou; Min Zhang; Wei Zhao; Yongmei Liu; Sharon L R Kardia; Ana V Diez Roux; Belinda L Needham; Jennifer A Smith; Bhramar Mukherjee
Journal: Biometrics Date: 2019-12-19 Impact factor: 2.571

5. A Review of Statistical Methods in Imaging Genetics.

Authors: Farouk S Nathoo; Linglong Kong; Hongtu Zhu
Journal: Can J Stat Date: 2019-02-25 Impact factor: 0.875

6. Integrative Bayesian analysis of brain functional networks incorporating anatomical knowledge.

Authors: Ixavier A Higgins; Suprateek Kundu; Ying Guo
Journal: Neuroimage Date: 2018-07-11 Impact factor: 6.556

Review 7. Multimodal neuroimaging computing: a review of the applications in neuropsychiatric disorders.

Authors: Sidong Liu; Weidong Cai; Siqi Liu; Fan Zhang; Michael Fulham; Dagan Feng; Sonia Pujol; Ron Kikinis
Journal: Brain Inform Date: 2015-08-29

8. Insulin-Like Growth Factor Binding Protein 2 Is Associated With Biomarkers of Alzheimer's Disease Pathology and Shows Differential Expression in Transgenic Mice.

Authors: Luke W Bonham; Ethan G Geier; Natasha Z R Steele; Dominic Holland; Bruce L Miller; Anders M Dale; Rahul S Desikan; Jennifer S Yokoyama
Journal: Front Neurosci Date: 2018-07-16 Impact factor: 4.677