Literature DB >> 28757650

Data integration with high dimensionality.

Xin Gao1, Raymond J Carroll2.   

Abstract

We consider situations where the data consist of a number of responses for each individual, which may include a mix of discrete and continuous variables. The data also include a class of predictors, where the same predictor may have different physical measurements across different experiments depending on how the predictor is measured. The goal is to select which predictors affect any of the responses, where the number of such informative predictors tends to infinity as the sample size increases. There are marginal likelihoods for each experiment; we specify a pseudolikelihood combining the marginal likelihoods, and propose a pseudolikelihood information criterion. Under regularity conditions, we establish selection consistency for this criterion with unbounded true model size. The proposed method includes a Bayesian information criterion with appropriate penalty term as a special case. Simulations indicate that data integration can dramatically improve upon using only one data source.

Entities:  

Keywords:  Information criterion; Large deviation; Model misspecification; Pseudolikelihood; Quadratic form

Year:  2017        PMID: 28757650      PMCID: PMC5532816          DOI: 10.1093/biomet/asx023

Source DB:  PubMed          Journal:  Biometrika        ISSN: 0006-3444            Impact factor:   2.445


  9 in total

1.  Non-Concave Penalized Likelihood with NP-Dimensionality.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  IEEE Trans Inf Theory       Date:  2011-08       Impact factor: 2.501

2.  VARIABLE SELECTION FOR REGRESSION MODELS WITH MISSING DATA.

Authors:  Ramon I Garcia; Joseph G Ibrahim; Hongtu Zhu
Journal:  Stat Sin       Date:  2010-01       Impact factor: 1.261

3.  Variable Selection in Nonparametric Varying-Coefficient Models for Analysis of Repeated Measurements.

Authors:  Lifeng Wang; Hongzhe Li; Jianhua Z Huang
Journal:  J Am Stat Assoc       Date:  2008-12-01       Impact factor: 5.033

4.  Variable selection with incomplete covariate data.

Authors:  Gerda Claeskens; Fabrizio Consentino
Journal:  Biometrics       Date:  2008-03-27       Impact factor: 2.571

5.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer.

Authors:  Yixin Wang; Jan G M Klijn; Yi Zhang; Anieta M Sieuwerts; Maxime P Look; Fei Yang; Dmitri Talantov; Mieke Timmermans; Marion E Meijer-van Gelder; Jack Yu; Tim Jatkoe; Els M J J Berns; David Atkins; John A Foekens
Journal:  Lancet       Date:  2005 Feb 19-25       Impact factor: 79.321

6.  A Selective Overview of Variable Selection in High Dimensional Feature Space.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  Stat Sin       Date:  2010-01       Impact factor: 1.261

7.  Group descent algorithms for nonconvex penalized linear and logistic regression models with grouped predictors.

Authors:  Patrick Breheny; Jian Huang
Journal:  Stat Comput       Date:  2015-03       Impact factor: 2.559

8.  Gene pathways associated with prognosis and chemotherapy sensitivity in molecular subtypes of breast cancer.

Authors:  Takayuki Iwamoto; Giampaolo Bianchini; Daniel Booser; Yuan Qi; Charles Coutant; Christine Ya-Hui Shiang; Libero Santarpia; Junji Matsuoka; Gabriel N Hortobagyi; William Fraser Symmans; Frankie A Holmes; Joyce O'Shaughnessy; Beth Hellerstedt; John Pippen; Fabrice Andre; Richard Simon; Lajos Pusztai
Journal:  J Natl Cancer Inst       Date:  2010-12-29       Impact factor: 13.506

9.  Model selection procedure for high-dimensional data.

Authors:  Yongli Zhang; Xiaotong Shen
Journal:  Stat Anal Data Min       Date:  2010-10-01       Impact factor: 1.051

  9 in total
  1 in total

1.  A surrogate ℓ0 sparse Cox's regression with applications to sparse high-dimensional massive sample size time-to-event data.

Authors:  Eric S Kawaguchi; Marc A Suchard; Zhenqiu Liu; Gang Li
Journal:  Stat Med       Date:  2019-12-08       Impact factor: 2.497

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.