Literature DB >> 27570323

Constrained Maximum Likelihood Estimation for Model Calibration Using Summary-level Information from External Big Data Sources.

Nilanjan Chatterjee1, Yi-Hau Chen2, Paige Maas1, Raymond J Carroll3.   

Abstract

Information from various public and private data sources of extremely large sample sizes are now increasingly available for research purposes. Statistical methods are needed for utilizing information from such big data sources while analyzing data from individual studies that may collect more detailed information required for addressing specific hypotheses of interest. In this article, we consider the problem of building regression models based on individual-level data from an "internal" study while utilizing summary-level information, such as information on parameters for reduced models, from an "external" big data source. We identify a set of very general constraints that link internal and external models. These constraints are used to develop a framework for semiparametric maximum likelihood inference that allows the distribution of covariates to be estimated using either the internal sample or an external reference sample. We develop extensions for handling complex stratified sampling designs, such as case-control sampling, for the internal study. Asymptotic theory and variance estimators are developed for each case. We use simulation studies and a real data application to assess the performance of the proposed methods in contrast to the generalized regression (GR) calibration methodology that is popular in the sample survey literature.

Entities:  

Keywords:  Case-control study; Empirical likelihood; Generalized regression estimator; Misspecified model; Profile-likelihood

Year:  2016        PMID: 27570323      PMCID: PMC4994914          DOI: 10.1080/01621459.2015.1123157

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  8 in total

1.  Connections between survey calibration estimators and semiparametric models for incomplete data.

Authors:  Thomas Lumley; Pamela A Shaw; James Y Dai
Journal:  Int Stat Rev       Date:  2011-08       Impact factor: 2.217

2.  Generalizing from clinical trial data: a case study. The risk of suicidality among pediatric antidepressant users.

Authors:  Joel B Greenhouse; Eloise E Kaizar; Kelly Kelleher; Howard Seltman; William Gardner
Journal:  Stat Med       Date:  2008-05-20       Impact factor: 2.373

3.  Multilevel regression and poststratification for small-area estimation of population health outcomes: a case study of chronic obstructive pulmonary disease prevalence using the behavioral risk factor surveillance system.

Authors:  Xingyou Zhang; James B Holt; Hua Lu; Anne G Wheaton; Earl S Ford; Kurt J Greenlund; Janet B Croft
Journal:  Am J Epidemiol       Date:  2014-03-04       Impact factor: 4.897

4.  The use of propensity scores to assess the generalizability of results from randomized trials.

Authors:  Elizabeth A Stuart; Stephen R Cole; Catherine P Bradshaw; Philip J Leaf
Journal:  J R Stat Soc Ser A Stat Soc       Date:  2001-04-01       Impact factor: 2.483

5.  Projecting absolute invasive breast cancer risk in white women with a model that includes mammographic density.

Authors:  Jinbo Chen; David Pee; Rajeev Ayyagari; Barry Graubard; Catherine Schairer; Celia Byrne; Jacques Benichou; Mitchell H Gail
Journal:  J Natl Cancer Inst       Date:  2006-09-06       Impact factor: 13.506

6.  Comprehensive analysis of common genetic variation in 61 genes related to steroid hormone and insulin-like growth factor-I metabolism and breast cancer risk in the NCI breast and prostate cancer cohort consortium.

Authors:  Federico Canzian; David G Cox; V Wendy Setiawan; Daniel O Stram; Regina G Ziegler; Laure Dossus; Lars Beckmann; Hélène Blanché; Aurelio Barricarte; Christine D Berg; Sheila Bingham; Julie Buring; Saundra S Buys; Eugenia E Calle; Stephen J Chanock; Françoise Clavel-Chapelon; John Oliver L DeLancey; W Ryan Diver; Miren Dorronsoro; Christopher A Haiman; Göran Hallmans; Susan E Hankinson; David J Hunter; Anika Hüsing; Claudine Isaacs; Kay-Tee Khaw; Laurence N Kolonel; Peter Kraft; Loïc Le Marchand; Eiliv Lund; Kim Overvad; Salvatore Panico; Petra H M Peeters; Michael Pollak; Michael J Thun; Anne Tjønneland; Dimitrios Trichopoulos; Rosario Tumino; Meredith Yeager; Robert N Hoover; Elio Riboli; Gilles Thomas; Brian E Henderson; Rudolf Kaaks; Heather Spencer Feigelson
Journal:  Hum Mol Genet       Date:  2010-07-15       Impact factor: 6.150

7.  The calibration of treatment effects from clinical trials to target populations.

Authors:  Constantine Frangakis
Journal:  Clin Trials       Date:  2009-04       Impact factor: 2.486

8.  Commonly studied single-nucleotide polymorphisms and breast cancer: results from the Breast Cancer Association Consortium.

Authors: 
Journal:  J Natl Cancer Inst       Date:  2006-10-04       Impact factor: 13.506

  8 in total
  20 in total

1.  Informing a Risk Prediction Model for Binary Outcomes with External Coefficient Information.

Authors:  Wenting Cheng; Jeremy M G Taylor; Tian Gu; Scott A Tomlins; Bhramar Mukherjee
Journal:  J R Stat Soc Ser C Appl Stat       Date:  2018-08-13       Impact factor: 1.864

2.  Generalized meta-analysis for multiple regression models across studies with disparate covariate information.

Authors:  Prosenjit Kundu; Runlong Tang; Nilanjan Chatterjee
Journal:  Biometrika       Date:  2019-07-13       Impact factor: 2.445

3.  Guided Bayesian imputation to adjust for confounding when combining heterogeneous data sources in comparative effectiveness research.

Authors:  Joseph Antonelli; Corwin Zigler; Francesca Dominici
Journal:  Biostatistics       Date:  2017-07-01       Impact factor: 5.899

Review 4.  Is human fecundity changing? A discussion of research and data gaps precluding us from having an answer.

Authors:  Melissa M Smarr; Katherine J Sapra; Alison Gemmill; Linda G Kahn; Lauren A Wise; Courtney D Lynch; Pam Factor-Litvak; Sunni L Mumford; Niels E Skakkebaek; Rémy Slama; Danelle T Lobdell; Joseph B Stanford; Tina Kold Jensen; Elizabeth Heger Boyle; Michael L Eisenberg; Paul J Turek; Rajeshwari Sundaram; Marie E Thoma; Germaine M Buck Louis
Journal:  Hum Reprod       Date:  2017-03-01       Impact factor: 6.918

5.  Augmented pseudo-likelihood estimation for two-phase studies.

Authors:  Claudia Rivera-Rodriguez; Sebastien Haneuse; Molin Wang; Donna Spiegelman
Journal:  Stat Methods Med Res       Date:  2019-03-05       Impact factor: 3.021

6.  Novel two-phase sampling designs for studying binary outcomes.

Authors:  Le Wang; Matthew L Williams; Yong Chen; Jinbo Chen
Journal:  Biometrics       Date:  2019-11-14       Impact factor: 2.571

7.  Improving estimation and prediction in linear regression incorporating external information from an established reduced model.

Authors:  Wenting Cheng; Jeremy M G Taylor; Pantel S Vokonas; Sung Kyun Park; Bhramar Mukherjee
Journal:  Stat Med       Date:  2018-01-24       Impact factor: 2.373

8.  Estimating population effects of vaccination using large, routinely collected data.

Authors:  M Elizabeth Halloran; Michael G Hudgens
Journal:  Stat Med       Date:  2017-07-19       Impact factor: 2.373

9.  Comparison of approaches for incorporating new information into existing risk prediction models.

Authors:  Sonja Grill; Donna P Ankerst; Mitchell H Gail; Nilanjan Chatterjee; Ruth M Pfeiffer
Journal:  Stat Med       Date:  2016-12-11       Impact factor: 2.373

10.  Combining Multiple Observational Data Sources to Estimate Causal Effects.

Authors:  Shu Yang; Peng Ding
Journal:  J Am Stat Assoc       Date:  2019-06-11       Impact factor: 5.033

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.