Literature DB >> 28003860

Model diagnostics in reduced-rank estimation.

Kun Chen1.   

Abstract

Reduced-rank methods are very popular in high-dimensional multivariate analysis for conducting simultaneous dimension reduction and model estimation. However, the commonly-used reduced-rank methods are not robust, as the underlying reduced-rank structure can be easily distorted by only a few data outliers. Anomalies are bound to exist in big data problems, and in some applications they themselves could be of the primary interest. While naive residual analysis is often inadequate for outlier detection due to potential masking and swamping, robust reduced-rank estimation approaches could be computationally demanding. Under Stein's unbiased risk estimation framework, we propose a set of tools, including leverage score and generalized information score, to perform model diagnostics and outlier detection in large-scale reduced-rank estimation. The leverage scores give an exact decomposition of the so-called model degrees of freedom to the observation level, which lead to exact decomposition of many commonly-used information criteria; the resulting quantities are thus named information scores of the observations. The proposed information score approach provides a principled way of combining the residuals and leverage scores for anomaly detection. Simulation studies confirm that the proposed diagnostic tools work well. A pattern recognition example with hand-writing digital images and a time series analysis example with monthly U.S. macroeconomic data further demonstrate the efficacy of the proposed approaches.

Entities:  

Keywords:  Big data; Information score; Model diagnostics; Multivariate regression; Outlier detection; Reduced-rank estimation

Year:  2016        PMID: 28003860      PMCID: PMC5167389          DOI: 10.4310/SII.2016.v9.n4.a7

Source DB:  PubMed          Journal:  Stat Interface        ISSN: 1938-7989            Impact factor:   0.582


  11 in total

1.  Registration-based assessment of regional lung function via volumetric CT images of normal subjects vs. severe asthmatics.

Authors:  Sanghun Choi; Eric A Hoffman; Sally E Wenzel; Merryn H Tawhai; Youbing Yin; Mario Castro; Ching-Long Lin
Journal:  J Appl Physiol (1985)       Date:  2013-06-06

2.  Quantitative assessment of multiscale structural and functional alterations in asthmatic populations.

Authors:  Sanghun Choi; Eric A Hoffman; Sally E Wenzel; Mario Castro; Sean B Fain; Nizar N Jarjour; Mark L Schiebler; Kun Chen; Ching-Long Lin
Journal:  J Appl Physiol (1985)       Date:  2015-03-26

3.  Learning regulatory programs by threshold SVD regression.

Authors:  Xin Ma; Luo Xiao; Wing Hung Wong
Journal:  Proc Natl Acad Sci U S A       Date:  2014-10-20       Impact factor: 11.205

4.  Bayesian Generalized Low Rank Regression Models for Neuroimaging Phenotypes and Genetic Markers.

Authors:  Hongtu Zhu; Zakaria Khondker; Zhaohua Lu; Joseph G Ibrahim
Journal:  J Am Stat Assoc       Date:  2014       Impact factor: 5.033

5.  Reduced rank regression via adaptive nuclear norm penalization.

Authors:  Kun Chen; Hongbo Dong; Kung-Sik Chan
Journal:  Biometrika       Date:  2013-12-04       Impact factor: 2.445

6.  Reduced Rank Ridge Regression and Its Kernel Extensions.

Authors:  Ashin Mukherjee; Ji Zhu
Journal:  Stat Anal Data Min       Date:  2011-10-07       Impact factor: 1.051

7.  Regularized Multivariate Regression for Identifying Master Predictors with Application to Integrative Genomics Study of Breast Cancer.

Authors:  Jie Peng; Ji Zhu; Anna Bergamaschi; Wonshik Han; Dong-Young Noh; Jonathan R Pollack; Pei Wang
Journal:  Ann Appl Stat       Date:  2010-03       Impact factor: 2.083

8.  PERTURBATION AND SCALED COOK'S DISTANCE.

Authors:  Hongtu Zhu; Joseph G Ibrahim; Hyunsoon Cho
Journal:  Ann Stat       Date:  2012       Impact factor: 4.028

9.  Sparse Multivariate Regression With Covariance Estimation.

Authors:  Adam J Rothman; Elizaveta Levina; Ji Zhu
Journal:  J Comput Graph Stat       Date:  2010       Impact factor: 2.302

10.  On the degrees of freedom of reduced-rank estimators in multivariate regression.

Authors:  A Mukherjee; K Chen; N Wang; J Zhu
Journal:  Biometrika       Date:  2015-02-09       Impact factor: 2.445

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.