Literature DB >> 29386387

Statistical tests and identifiability conditions for pooling and analyzing multisite datasets.

Hao Henry Zhou¹, Vikas Singh^2,3, Sterling C Johnson^4,5, Grace Wahba^6,7,3.

Abstract

When sample sizes are small, the ability to identify weak (but scientifically interesting) associations between a set of predictors and a response may be enhanced by pooling existing datasets. However, variations in acquisition methods and the distribution of participants or observations between datasets, especially due to the distributional shifts in some predictors, may obfuscate real effects when datasets are combined. We present a rigorous statistical treatment of this problem and identify conditions where we can correct the distributional shift. We also provide an algorithm for the situation where the correction is identifiable. We analyze various properties of the framework for testing model fit, constructing confidence intervals, and evaluating consistency characteristics. Our technical development is motivated by Alzheimer's disease (AD) studies, and we present empirical results showing that our framework enables harmonizing of protein biomarkers, even when the assays across sites differ. Our contribution may, in part, mitigate a bottleneck that researchers face in clinical research when pooling smaller sized datasets and may offer benefits when the subjects of interest are difficult to recruit or when resources prohibit large single-site studies.

Entities: Chemical Disease Gene Species

Keywords: causal model; maximum mean discrepancy; meta-analysis; multisite analysis; multisource

Mesh：

Substances：
Biomarkers

Year: 2018 PMID： 29386387 PMCID： PMC5816202 DOI： 10.1073/pnas.1719747115

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN： 0027-8424 Impact factor: 11.205

Many studies that involve human subjects are constrained by the number of samples that can be obtained when the disease population of interest is small, when the measurement of interest is difficult to obtain, or when other logistic or financial constraints are present that prohibit large-scale studies (1, 2). For example, in Alzheimer’s disease (AD) research, cerebrospinal fluid (CSF) measurements from lumbar puncture (LP) may be limited by participant willingness to undergo LP and institutional capability to routinely perform the procedure in a research setting. The assays for amyloid beta 1–42 and tau (the hallmark features of AD pathology) are known to vary widely between assay product type and within a specific type of assay from differences in batch composition (3). Similarly, the expense of imaging examinations may prohibit large-scale investigations. While the sample sizes may be sufficient to evaluate the primary hypotheses, researchers may want to investigate secondary analyses focused on identifying subtle associations between specific predictors and the response variable (3, 4). Such secondary analyses may be underpowered for the given sample sizes. One possible solution is to identify and pool several similar datasets across multiple sites (5). One hopes that the larger sample sizes of the pooled dataset will enable investigating potentially interesting scientific questions that may not otherwise be possible with smaller single-site cohorts. In practice, we find that direct pooling of already collected datasets in a post hoc manner across multiple sites can be problematic due to differences in the distributions of one or more measures (or features) (6). In fact, even when data acquisition is harmonized across sites, we may still need to deal with site-specific or method-specific effects on the measurements, such as the above noted example with CSF (7), before the analysis can proceed (8, 9). For example, as discussed above, in AD studies, CSF measurements (10) may not be easily pooled in the absence of gold standard reference materials that are common across assays (or sites) (3). Such issues also arise in combining cognitive measures or transferring analysis results or models from one potentially large-sized dataset to another. For example, cohort studies may administer different cognitive tests that assess the same underlying cognitive domain; therefore, thresholds used to categorize individuals into different disease status groups may not be easily transferred from one site to the other (5, 11). These issues are not restricted to biomedical studies, and variously manifest in machine learning and computer vision, where distinct datasets must be pooled (e.g., for training a statistical model). While the literature on addressing sample selection bias and compensating for population characteristics differences is sizable (12, 13), statistical frameworks for resolving distributional shift to facilitate pooled analysis, essential in various applications, are less developed in comparison. Deriving scientific conclusions from a unified analysis spanning multiple individual datasets is often accomplished in practice via so-called meta-analysis approaches. Such an approach carefully collects research analyses/findings separately performed on the datasets and then aggregates individual analysis results through statistical models to come up with a final estimate of the parameters (14). However, various assumptions in meta-analysis schemes may not always hold in practice, and simple violations can lead to inaccurate scientific conclusions (15, 16). Alternatively, if access to the actual data from individual studies is available, some preprocessing to harmonize the data followed by statistical analysis of the pooled data may be preferable in many cases. The preprocessing often uses methods that compensate (or correct) for distributional shift to the extent possible. For example, ideas related to domain shift in refs. 17 and 18 and other results describe sophisticated models to improve prediction accuracy by correcting domain shift. What is less developed is a formal treatment explaining how confident we are that the shift across datasets has been successfully corrected (and consequently, the analysis can safely proceed), whether the correction can be improved if we were able to acquire more samples, what mathematical assumptions are needed, and whether the residual (say, after a correction step) is due to fewer than necessary samples or other violations of the underlying assumptions. The primary goal of this paper is to offer a formal treatment of these problems and derive the theoretical basis that can guide practical deployments. In this paper, we build and extend on our preliminary results (19), and we present an in-depth theoretical study of distributional shift correction across datasets. That includes consistency properties, an identifiability condition, and a hypothesis test to check model accuracy using a discrepancy measure popular in the domain adaptation literature (17, 18). We also provide an analysis based on a subsampling procedure, showing how these ideas can be modified to deal with the practical situation where the covariates for different sites (or studies) are not exactly the same (e.g., age range of cohorts may vary)—toward facilitating rigorous analysis of pooled datasets. Briefly, we (i) give a precise condition to evaluate whether a distributional shift correction is identifiable; (ii) derive a subsampling procedure to separate distributional shift from other sources of variations, such as sample selection bias and population characteristics differences; (iii) propose an algorithm based on a nonparametric quantity: maximum mean discrepancy (MMD); and (iv) present experiments showing how these ideas can facilitate AD biomarker research (Fig. 1).

Fig. 1.

A shows the distributional shift of across ADNI and W-ADRC. B shows the distributional shift of hippocampus volume across ADNI and W-ADRC.

Problem Setting

Let us assume that we have data from two sites and , and the sitewise data correspond to different features. For presentation purposes, we will assume that the features include eight CSF protein levels, denoted as , acquired from each participant via an LP. Since the absolute values of CSF measurements vary as a function of the assay instrumentation, we are interested in correcting the distributional shift to facilitate the analysis of the pooled dataset. However, notice that there are at least two other factors that can influence the correction. and may have participants with age distributions that are not identical. It is known that age influences protein-level measurements and therefore, will affect our distributional shift correction. We denote the population characteristics that cause differences in age distributions as (also called “transportability” in ref. 13). Similarly, while site may include an almost equal split of individuals with and without disease, healthy individuals may be overrepresented in site . We denote this bias in sample selection between two datasets as , which also influences (13). Therefore, the actual distributions of observed CSF protein levels in the two datasets, and , are and , respectively. If we only have access to and but no other variables related to and , then correcting the distributional shift between and is difficult. However, the problem is identifiable when we have age and diagnosis status relevant for the variables and . In fact, we can specify the condition when the correction is identifiable. We briefly review some concepts related to graphical causal model and d-separation rules and then state the identifiability condition.

Graphical Causal Model.

A graphical causal model is represented by a directed acyclic graph (DAG), which consists of three types of entities: variables (nodes), arrows (edges), and missing arrows. DAGs are useful visual representations of a domain expert’s assumptions regarding causal relationships explaining the data generation process (20). In Fig. 2, we show an example. Arrows in the graph represent possible direct causal effects between pairs of variables. For example, the arrow from to means that exerts a direct causal influence on . The absence of an arrow represents an assumption of no direct causal effect between the two variables (20). The missing arrow from to denotes the absence of a direct causal effect of on . Fig. 2 shows an example for our data analysis task, where the DAGs depict causal relations between age, sex, CSF, diagnosis status, and other variables. Here, age, sex, and other endogenous variables influence the CSF measurements , which influence the diagnosis status . The population characteristic difference only has a direct causal effect on age, whereas the sample selection bias is only directly related to diagnosis status for each specific study or site. Note that a graphical causal model is nonparametric and makes no other assumptions about the distribution of variables, the functional form of direct effects, or the magnitude of causal effects.

Fig. 2.

A is an example of a graphical causal model. The colored nodes are an example of a d-separation rule, where and are d-separated by . B is the graphical causal model for our CSF data analysis example. Here, the population characteristics difference only has a direct causal effect on the age distribution. The sample selection bias is only directly related to diagnosis status for each specific study. Nodes denoting age and sex influence the CSF measurements denoted by , which then influence the diagnosis status . The CSF measurements and the nodes and are d-separated by diagnosis status and age. Next, we introduce a useful concept called d-separation (21) using the model in Fig. 2 as an example. If two variables and are d-separated by a set of variables , then they are conditionally independent given . A path is a sequential set of connected nodes independent of the directionality of the arrows. A “collider” on a path is a node with two arrows along the path pointing into it ( in Fig. 2). Otherwise, the node is a noncollider on the path.

Definition.

[d-separation (21)]: A path between two variables, and is said to be blocked by a set of variables if either (i) contains a noncollider that is in or (ii) contains a collider node that is outside and has no descendant in . We say that and are d-separated by if any path between them is “blocked” by . For example, in Fig. 2, and are d-separated by . After including in , all paths are blocked due to rule i, except the path . The path stays unblocked, because (i) no noncollider on that path is in and (ii) the only collider on is in . Therefore, we can include one of on the path into to “block” it.

Identifiability Condition

We can now present a condition describing when distributional shift correction across sites is identifiable, even with the concurrent influence of sample selection bias and population characteristic differences on the measurements .

Theorem 1.

The distribution shift correction is identifiable if there exists a known set of variables , such that the following three conditions are all concurrently satisfied. d-separates and (sample selection bias) and also d-separates and (population characteristic difference). The conditional probability , after appropriate transformations on , is the same across multiple participating sites ( and ). The distribution of has a nontrivial overlap across multiple sites ( ), which means that there exists an interval , such that for all sites. The proof is in . From Fig. 2 and Table 1, we can check that satisfies . Condition i is satisfied by noticing that d-separates and the nodes and . If all sites collect samples similarly, will be the same [e.g., ]. From Fig. 2, variations denoted by and only influence the marginal distributions of and age but have no effect on the causal relation/function among variables [e.g., ]. The distributional shift of can be corrected after some transformation; therefore, condition ii holds. Finally, we will see (Table 1) that the disease status and age distributions have a nontrivial overlap across the two datasets; therefore, condition iii also holds.

Table 1.

Variations of age and diagnosis status across datasets

Description	ADNI	W-ADRC
Sample size	284	125
Age range (∼55–65/∼65–75/∼75–85 yr), %	11/43/46	44/34/22
Diagnosis status (CN/AD), %	60/40	76/24

Variations of age and diagnosis status across datasets In practice, it is useful to seek a d-separating set of variables with the fewest variables, such that we can sacrifice (or leave out) the fewest samples to separate distributional shift from the other variations and . Finding a minimal d-separating set can be solved as a maximum flow problem (22). In practice, if the causal model is not too complicated, one may even find a d-separating set manually. Then, it can be transformed into the problem of “blocking” two nodes in an undirected graph with the fewest blocks (23) ().

Tests for Correcting Distributional Shift

We now describe an algorithm to correct distributional shift if it is identifiable (). We start our discussion by first assuming that the two to-be-pooled datasets, and , only include a distributional shift in the features (e.g., due to measurement or site-specific nuisance factors) and involve no other sampling biases or confounds (i.e., and ). Later, we present a subsampling framework to extend the algorithm to the case where other variations co-occur and also contribute to the shift. We calculate the distributional shift correction by identifying a parametric transformation on the sitewise samples from and . We assume that site provides samples given by a distribution and that provides samples with a distribution . Let us denote the transformation on as and the transformation on as characterized by the unknown parameters and , respectively. For example, if we choose to be an affine transformation with parameters , it maps any value to : that is, . The algorithm seeks to find a pair of transformations, such that distributions of two datasets are matched (corrected) after the transformations are applied. We use MMD as a measure of difference between the two (transformed) distributions. The MMD is expressed as a function of two distributions , aswhich is defined using a Reproducing Kernel Hilbert Space with norm and kernel . MMD can also be considered as the mean difference between two distributions after kernel embedding and has several desirable properties (for example, it is zero if and only if two distributions are identical) (24). One requirement, however, is that the kernel has to be characteristic, and specific choices may be guided by the application (24). The empirical version of MMD can be calculated with samples asRecall that our algorithm is trying to match the two distributions after applying the parametric transformations and . Therefore, we estimate parameters and using the empirical MMD by searching for a minimum value (e.g., using stochastic gradient descent):The class of transformations that we will choose for a specific application should be informed by domain knowledge, but in general, simpler transformation classes are preferable. We now show that the estimators and are consistent.

Theorem 2.

Under mild assumptions (), if there is a such that and have the same distribution, thenwith the rate . If are unique, then the estimators are consistent.

Remark.

In various applications (including our experiments), we may choose one class of transformations to be the identity transformation and transform samples in the other dataset to match the reference dataset. The foregoing discussion and assume that the two distributions can be matched via some unknown transformation. This may not always be true, and it is important, in practice, to identify when the datasets cannot be pooled for the specified class of transformations. Next, we provide a hypothesis test to answer this question. Let us defineThe test statistics can be obtained by plugging , into the empirical MMD calculation asWe can show that the hypothesis test is consistent. Additional details for the small sample size case are in .

Theorem 3.

Under mild assumptions (), converges to zero with the rate when holds and converges to a positive constant with the rate when holds. The test can provide guidance on whether the distributional shift has been successfully corrected. If the test suggests the alternative hypothesis, one may consider adjusting the transformation class and or other factors, such as sample selection bias and population attribute difference, or one may decide against pooling. Next, we introduce a subsampling scheme to correct distributional shift when other contributors to the shift coexist, but the correction is still identifiable.

Subsampling Framework.

When the test chooses , one reason may be that one or more cohort-specific factors contribute in significant ways to the observed distributional shift between and . Recall that our earlier discussion suggests that the problem is identifiable if we can find a satisfying the conditions in . Then, a subsampling procedure can potentially resolve the confound. The reason is thatFrom , we know that , which remains the same across sites after a suitable transformation. Therefore, simply by adjusting , the effects of the other factors on can be controlled, except distributional shift. Such a subsampling scheme is widely used in addressing sample selection bias in other applications (25) (information on the subsampling scheme for reducing computational burden is in ref. 26). In our setting, the motivation for using subsampling is similar, but it is used in the context of correcting distributional shift—after subsampling. Separately, since subsampling has been used in bagging to stabilize estimations and reduce variance [e.g., for random forests (27)], we can directly obtain stable estimators and calculate their variance.

Specifics of Subsampling.

We divide into groups with sample sizes given as : i.e., . Similarly, is divided into groups with sample sizes given as : i.e., . The subsample sizes are , where for any . Then, we generate subsamples for and and apply Eq. sequentially. We run subsampling with replacement times and denote each iteration’s estimators as . Then, our final transformation estimators are given as and .

Infinitesimal Jackknife Confidence Interval.

In most scientific studies, we also want to obtain a confidence interval for the calculated transformations. In this case, however, there is no closed form solution, and therefore, we use a bootstrap type method. Since subsampling already involves bootstrapping, using a simple bootstrap results in a product of bootstraps. Fortunately, a similar issue was encountered in bagging, and an infinitesimal Jackknife (IJ) method (28) was provided for random forests, which works quite well (27, 29). Inspired by this result, we use the IJ to estimate the variances of estimators and . The method cannot be directly applied here, since it considers subsampling from one group, whereas we need subsampling from multiple groups. We, therefore, extend the results to multiple groups (proof is in ). Based on the subsampling scheme for and defined above, the multigroup IJ estimator of variance is given as the following theorem.

Theorem 4.

Define to be the number of appearances of in iteration . Define . The IJ estimator of variance for isThe procedure for is identical. Subsampling MMD Algorithm ()

Applications to AD Study

We show the application of the framework to correct distributional shift between two AD datasets and show how such a strategy can lead to improved pooled data analysis. The two datasets come from the Alzheimer’s Disease Neuroimage Initiative (ADNI) project and the Wisconsin Alzheimer’s Disease Research Center (W-ADRC). Both studies follow similar protocols for acquiring CSF samples from participants and measuring protein levels (3). It is known that the CSF protein levels are indicative of neurofibrillary tangles and amyloid plaques, characteristic of AD pathology. The distributions of the protein measurements across the two datasets are different due to various reasons described in the literature (3), which makes pooled analysis and/or transferring results from one dataset to the other problematic. For example, a threshold derived for the ADNI dataset may not be applicable to the W-ADRC dataset. Both datasets included eight distinct CSF protein levels measured on seven proteins ( is measured by two methods), where the distributional shift needs to be corrected. In both W-ADRC and ADNI, the measured proteins include , , , , , NFL, and neurogranin. While the W-ADRC dataset provides 125 samples, the ADNI includes 284 samples (Table 1 and ). After correcting the distributional shift, we fit statistical models, which include age, sex, and CSF proteins as covariates. As a response variable, we use hippocampus volume or diagnosis status. Here, other than correcting the CSF protein levels across the two datasets, we also correct distribution shift of hippocampus volumes, since they may be calculated with different image acquisition characteristics and potentially different software (Freesurfer in ADNI vs. FIRST/FSL in W-ADRC). Our workflow involves three tasks: (i) correct distributional shift across the datasets for CSF protein levels, (ii) transform thresholds in ADNI to W-ADRC, and (iii) pool the data together to predict the response variable (hippocampus volume and diagnosis status) within regression or classification.

Correct Distributional Shift of CSF.

Table 1 shows that the age distributions as well as the proportions of participants who are healthy [control (CN)] and diseased (AD) in the two datasets are not exactly the same, which makes directly attempting a distributional shift correction in the CSF measures not very meaningful. However, when other variations (confounders) coexist together with distributional shift, as discussed earlier, we should check whether there exists a set of variables satisfying conditions given in . We previously described how choosing satisfies . Such a is also the minimal d-separating set. To proceed with the analysis, we divide our samples in groups based on all possible combinations of diagnosis status (AD/CN) and age ranges (). We can now run the subsampling MMD algorithm () (see Algorithm 1) with (iterations ) to correct the distributional shift in . We show two representative results in Fig. 3. For each plot in Fig. 3, depending on the subsamples randomly collected from 10 iterations, we plot the distributions of protein levels and a protein ratio measure (widely used in the aging/AD literature) in ADNI before/after correction (red/brown) with respect to W-ADRC baseline (blue). We see that the distributions of raw measures are very different between ADNI (using the AlzBio3 xMAP assay) and W-ADRC (using the ELISA INNOTEST assay). After our correction, the distributions are matched for all eight CSF protein measurements and both protein ratios that are relevant in AD research (-tau/ and -tau/). We randomly select one iteration and apply the hypothesis test, which accepts the transformations with high p-values. We also use the IJ to estimate the SDs of parameters and report them in Fig. 3.

Fig. 3.

The plots of (A) and (B) show the empirical distributions of W-ADRC samples (blue), ADNI samples (red), and transformed ADNI samples (brown). W-ADRC samples are nicely matched with transformed ADNI samples.

Transferring Thresholds for Disease Staging Across Datasets.

After performing our correction, CSF protein measurements across the two datasets can be analyzed together. We can evaluate the effect of using models (or thresholds) derived for the ADNI dataset on W-ADRC by transferring the criteria directly. For example, five CSF-based biomarker signatures (thresholds) developed for AD using ADNI participants (11) can now be transferred to the W-ADRC dataset. Given a threshold for any specific CSF protein, we can evaluate a sample in W-ADRC by comparing the corresponding measurements with the transformed threshold. The procedure produces sensitivity and specificity (for detection of AD) for each of eight CSF protein measurements and the two derived ratios. Our final thresholds, sensitivities, and specificities based on the experiments are shown in Table 2. The accuracy estimates suggest that all derived thresholds work well—we find that the sensitivity and specificity are competitive with the results reported for ADNI (11) and show how results/models from one dataset may be transferable to another dataset using our proposal.

Table 2.

The performance of thresholds in ADNI and W-ADRC

Dataset	t-tau	Aβ1−42	p-tau₁₈₁	t-tauAβ1−42	p-tau181Aβ1−42
W-ADRC
Threshold	568.08	629.39	48.86	0.77	0.07
Sensitivity, %	75.86	89.66	82.75	93.10	93.10
Specificity, %	92.23	69.90	67.96	86.41	79.61
ADNI
Threshold	93.00	192.00	23.00	0.39	0.10
Sensitivity, %	69.6	96.4	67.9	85.7	91.1
Specificity, %	92.3	76.9	73.1	84.6	71.2

The W-ADRC thresholds are derived from corresponding ADNI thresholds reported in the literature (11) using Algorithm.

The performance of thresholds in ADNI and W-ADRC The W-ADRC thresholds are derived from corresponding ADNI thresholds reported in the literature (11) using Algorithm.

Pooling and Analyzing the Two Datasets Together.

For the final experiment, we evaluate whether predictors from both datasets can be pooled for predicting hippocampus volume and diagnosis status (response variables) within regression and classification. We build a linear regression model based on age, sex, and CSF proteins (after distributional shift correction) to identify associations with hippocampus volume. To evaluate the accuracy of the model, we randomly choose samples () from W-ADRC data to serve as the test set. For evaluation purposes, we generate three different types of training datasets: W-ADRC samples only, W-ADRC plus raw (uncorrected) ADNI samples, and W-ADRC plus transformed ADNI samples. Note that the data used to generate the training set are based on all ADNI samples and the remaining W-ADRC samples. To obtain prediction errors for each of the three schemes with respect to varying training sample sizes, we vary the training sample size by choosing samples from each of the two datasets and then change from 30 to in increments. To avoid performance variation due to random choice of samples, after the test set is chosen, we run five bootstraps to select the training set and fit the model. Finally, we run 80 bootstraps to generate multiple test sets and evaluate the model performance. In this way, based on 400 bootstraps, we are able to obtain a more stable prediction error, and we are able to calculate the SD. The square root of mean squared prediction error (MSPE) scaled by a constant is shown in Fig. 4. We can see that the prediction errors decrease as training sample size increases, while the W-ADRC plus transformed ADNI data consistently offer the best performance.

Fig. 4.

A shows the trend of MSPE for hippocampus volume as the sample size increases using 400 bootstraps. The bar plot covers the prediction error for three types of training set as depicted in the legend, including W-ADRC only (red), W-ADRC plus ADNI (green), and W-ADRC plus transformed ADNI (blue). The third model continues to perform the best. B shows the trend of classification accuracy with respect to patients with AD (solid lines) and healthy patients (dotted lines) as sample size increases using 400 bootstraps. An SVM model is used, and three types of training sets are shown in the legend. For samples with AD, the three methods converge to the same accuracy as the training sample size increases. For healthy CNs, the W-ADRC plus the transformed ADNI dataset is always better than the other two schemes. It is interesting to see that W-ADRC plus the raw ADNI data also performs better than W-ADRC alone, possibly because only 25 (24%) subjects from W-ADRC are diagnosed with AD—with few AD samples, even the uncorrected ADNI data nicely inform the classification model.

Discussion

There is growing interest in the design of infrastructure and platforms that allow scientists across different sites and even continents to contribute scientific data and explore scientific hypotheses that cannot be evaluated on smaller datasets. Such efforts can be facilitated via the availability of theory and algorithms to identify whether pooling is meaningful, how the data should be harmonized, and later, how statistically meaningful and reproducible scientific conclusions can be obtained. We described a statistical framework that addresses some of the natural issues that arise in this regime, in particular, providing conditions where distributional shift between datasets can be corrected. The experimental results suggest promising potential applications of this idea in aging and AD studies. There remain several outstanding issues that are not fully addressed by this work. The procedure does not currently deal with discrete measurements, which are often encountered in some applications. It will also be interesting to more explicitly use information about the response variables—deciding when pooling is beneficial not only depends on the correction of distributional shift but may also be influenced by other factors, including sample size and noise level. On the computational side, special classes of kernels may lead to more efficient means of estimating the transformation to align the distributions. Finally, there are interesting deep learning algorithms for domain/data shift correction, and impressive empirical results are being reported, even for high-dimensional distributions. The University of Wisconsin Institutional Review board approved all study procedures and each subject provided signed informed consent before participation.

Algorithm 1.

Subsampling MMD Algorithm ()

1: Divide XS and XT separately into d groups by Z

2: Decide subsample size (s1,s2,…,sd)

3: For b= 1 to B, do

4: Generate subsamples XSb from d groups of XS

5: Generate subsamples XTb from d groups of XT

6: (λ^b,θ^b)=argminλ∈Ωλ,θ∈ΩθMMD^(hλ(XSb),gθ(XTb))

7: Calculate and record gu(i,k)b for all u,i,k

8: Set λ^=1B∑b=1Bλ^b and θ^=1B∑b=1Bθ^b and calculate 𝕍𝔸ℝIJ(λ^) and

𝕍𝔸ℝIJ(θ^)

17 in total

1. A worldwide multicentre comparison of assays for cerebrospinal fluid biomarkers in Alzheimer's disease.

Authors: N A Verwey; W M van der Flier; K Blennow; C Clark; S Sokolow; P P De Deyn; D Galasko; H Hampel; T Hartmann; E Kapaki; L Lannfelt; P D Mehta; L Parnetti; A Petzold; T Pirttila; L Saleh; A Skinningsrud; J C V Swieten; M M Verbeek; J Wiltfang; S Younkin; P Scheltens; M A Blankenstein
Journal: Ann Clin Biochem Date: 2009-04-02 Impact factor: 2.057

2. Causal inference and the data-fusion problem.

Authors: Elias Bareinboim; Judea Pearl
Journal: Proc Natl Acad Sci U S A Date: 2016-07-05 Impact factor: 11.205

3. Estimation and Accuracy after Model Selection.

Authors: Bradley Efron
Journal: J Am Stat Assoc Date: 2014-07-01 Impact factor: 5.033

4. Hypothesis Testing in Unsupervised Domain Adaptation with Applications in Alzheimer's Disease.

Authors: Hao Henry Zhou; Sathya N Ravi; Vamsi K Ithapu; Sterling C Johnson; Grace Wahba; Vikas Singh
Journal: Adv Neural Inf Process Syst Date: 2016

5. The Centiloid Project: standardizing quantitative amyloid plaque estimation by PET.

Authors: William E Klunk; Robert A Koeppe; Julie C Price; Tammie L Benzinger; Michael D Devous; William J Jagust; Keith A Johnson; Chester A Mathis; Davneet Minhas; Michael J Pontecorvo; Christopher C Rowe; Daniel M Skovronsky; Mark A Mintun
Journal: Alzheimers Dement Date: 2014-10-28 Impact factor: 21.566

6. Cerebrospinal fluid biomarker signature in Alzheimer's disease neuroimaging initiative subjects.

Authors: Leslie M Shaw; Hugo Vanderstichele; Malgorzata Knapik-Czajka; Christopher M Clark; Paul S Aisen; Ronald C Petersen; Kaj Blennow; Holly Soares; Adam Simon; Piotr Lewczuk; Robert Dean; Eric Siemers; William Potter; Virginia M-Y Lee; John Q Trojanowski
Journal: Ann Neurol Date: 2009-04 Impact factor: 10.422

7. Validation of Alzheimer's disease CSF and plasma biological markers: the multicentre reliability study of the pilot European Alzheimer's Disease Neuroimaging Initiative (E-ADNI).

Authors: Katharina Buerger; Giovanni Frisoni; Olga Uspenskaya; Michael Ewers; Henrik Zetterberg; Cristina Geroldi; Giuliano Binetti; Peter Johannsen; Paolo Maria Rossini; Lars-Olof Wahlund; Bruno Vellas; Kaj Blennow; Harald Hampel
Journal: Exp Gerontol Date: 2009-06-16 Impact factor: 4.032

8. Research and standardization in Alzheimer's trials: reaching international consensus.

Authors: Maria C Carrillo; Christopher C Rowe; Cassandra Szoeke; Colin L Masters; David Ames; Tim O'Meara; S Lance Macaulay; Andrew Milner; Kathryn A Ellis; Paul Maruff; Stephanie R Rainey-Smith; Ralph N Martins; Lisa J Bain; Richard J Head
Journal: Alzheimers Dement Date: 2012-12-23 Impact factor: 21.566

9. Comparison of xMAP and ELISA assays for detecting cerebrospinal fluid biomarkers of Alzheimer's disease.

Authors: Li-San Wang; Yuk Yee Leung; Shu-Kai Chang; Susan Leight; Malgorzata Knapik-Czajka; Young Baek; Leslie M Shaw; Virginia M-Y Lee; John Q Trojanowski; Christopher M Clark
Journal: J Alzheimers Dis Date: 2012 Impact factor: 4.472

Review 10. Meta-analysis: pitfalls and hints.

Authors: T Greco; A Zangrillo; G Biondi-Zoccai; G Landoni
Journal: Heart Lung Vessel Date: 2013

8 in total

1. Learning Invariant Representations using Inverse Contrastive Loss.

Authors: Aditya Kumar Akash; Vishnu Suresh Lokhande; Sathya N Ravi; Vikas Singh
Journal: Proc Conf AAAI Artif Intell Date: 2021-05-18

2. FairALM: Augmented Lagrangian Method for Training Fair Models with Little Regret.

Authors: Vishnu Suresh Lokhande; Aditya Kumar Akash; Sathya N Ravi; Vikas Singh
Journal: Comput Vis ECCV Date: 2020-10-07

3. Grab-AD: Generalizability and reproducibility of altered brain activity and diagnostic classification in Alzheimer's Disease.

Authors: Dan Jin; Pan Wang; Andrew Zalesky; Bing Liu; Chengyuan Song; Dawei Wang; Kaibin Xu; Hongwei Yang; Zengqiang Zhang; Hongxiang Yao; Bo Zhou; Tong Han; Nianming Zuo; Ying Han; Jie Lu; Qing Wang; Chunshui Yu; Xinqing Zhang; Xi Zhang; Tianzi Jiang; Yuying Zhou; Yong Liu
Journal: Hum Brain Mapp Date: 2020-05-04 Impact factor: 5.038

4. Cerebrospinal fluid biomarkers of neurofibrillary tangles and synaptic dysfunction are associated with longitudinal decline in white matter connectivity: A multi-resolution graph analysis.

Authors: Won Hwa Kim; Annie M Racine; Nagesh Adluru; Seong Jae Hwang; Kaj Blennow; Henrik Zetterberg; Cynthia M Carlsson; Sanjay Asthana; Rebecca L Koscik; Sterling C Johnson; Barbara B Bendlin; Vikas Singh
Journal: Neuroimage Clin Date: 2018-10-23 Impact factor: 4.881

5. PLCG2 protective variant p.P522R modulates tau pathology and disease progression in patients with mild cognitive impairment.

Authors: Agustin Ruiz; Alfredo Ramirez; Luca Kleineidam; Vincent Chouraki; Tomasz Próchnicki; Sven J van der Lee; Laura Madrid-Márquez; Holger Wagner-Thelen; Ilker Karaca; Leonie Weinhold; Steffen Wolfsgruber; Anne Boland; Pamela V Martino Adami; Piotr Lewczuk; Julius Popp; Frederic Brosseron; Iris E Jansen; Marc Hulsman; Johannes Kornhuber; Oliver Peters; Claudine Berr; Reinhard Heun; Lutz Frölich; Christophe Tzourio; Jean-François Dartigues; Michael Hüll; Ana Espinosa; Isabel Hernández; Itziar de Rojas; Adelina Orellana; Sergi Valero; Najada Stringa; Natasja M van Schoor; Martijn Huisman; Philip Scheltens; Eckart Rüther; Jean-Francois Deleuze; Jens Wiltfang; Lluis Tarraga; Matthias Schmid; Martin Scherer; Steffi Riedel-Heller; Michael T Heneka; Philippe Amouyel; Frank Jessen; Merce Boada; Wolfgang Maier; Anja Schneider; Antonio González-Pérez; Wiesje M van der Flier; Michael Wagner; Jean-Charles Lambert; Henne Holstege; Mª Eugenia Sáez; Eicke Latz
Journal: Acta Neuropathol Date: 2020-03-12 Impact factor: 17.088

6. Mitigating site effects in covariance for machine learning in neuroimaging data.

Authors: Andrew A Chen; Joanne C Beer; Nicholas J Tustison; Philip A Cook; Russell T Shinohara; Haochang Shou
Journal: Hum Brain Mapp Date: 2021-12-14 Impact factor: 5.038

7. Role of eotaxin-1/CCL11 in sepsis-induced myocardial injury in elderly patients.

Authors: Ying Li; Youguang Zhao; Chenming Qiu; Yuanrui Yang; Guihua Liao; Xi Wu; Xiaowan Zhang; Qian Zhang; Ru Zhang; Zhang Wang
Journal: Aging (Albany NY) Date: 2020-03-09 Impact factor: 5.682

8. APP-derived peptides reflect neurodegeneration in frontotemporal dementia.

Authors: Ignacio Illán-Gala; Jordi Pegueroles; Victor Montal; Daniel Alcolea; Eduard Vilaplana; Alexandre Bejanin; Sergi Borrego-Écija; Frederic Sampedro; Andrea Subirana; María-Belén Sánchez-Saudinós; Ricard Rojas-García; Hugo Vanderstichele; Rafael Blesa; Jordi Clarimón; Anna Antonell; Albert Lladó; Raquel Sánchez-Valle; Juan Fortea; Alberto Lleó
Journal: Ann Clin Transl Neurol Date: 2019-12-02 Impact factor: 4.511

8 in total