| Literature DB >> 28736452 |
Kean Ming Tan1, Yang Ning2, Daniela M Witten3, Han Liu4.
Abstract
In classical statistics, much thought has been put into experimental design and data collection. In the high-dimensional setting, however, experimental design has been less of a focus. In this paper, we stress the importance of collecting multiple replicates for each subject in this setting. We consider learning the structure of a graphical model with latent variables, under the assumption that these variables take a constant value across replicates within each subject. By collecting multiple replicates for each subject, we are able to estimate the conditional dependence relationships among the observed variables given the latent variables. To test the null hypothesis of conditional independence between two observed variables, we propose a pairwise decorrelated score test. Theoretical guarantees are established for parameter estimation and for this test. We show that our proposal is able to estimate latent variable graphical models more accurately than some existing proposals, and apply the proposed method to a brain imaging dataset.Entities:
Keywords: Experimental design; Nuisance parameter; Pairwise decorrelated score test; Semiparametric exponential family graphical model
Year: 2016 PMID: 28736452 PMCID: PMC5520622 DOI: 10.1093/biomet/asw050
Source DB: PubMed Journal: Biometrika ISSN: 0006-3444 Impact factor: 2.445