Literature DB >> 32747841

CONSISTENT SELECTION OF THE NUMBER OF CHANGE-POINTS VIA SAMPLE-SPLITTING.

Changliang Zou1, Guanghui Wang1, Runze Li2.   

Abstract

In multiple change-point analysis, one of the major challenges is to estimate the number of change-points. Most existing approaches attempt to minimize a Schwarz information criterion which balances a term quantifying model fit with a penalization term accounting for model complexity that increases with the number of change-points and limits overfitting. However, different penalization terms are required to adapt to different contexts of multiple change-point problems and the optimal penalization magnitude usually varies from the model and error distribution. We propose a data-driven selection criterion that is applicable to most kinds of popular change-point detection methods, including binary segmentation and optimal partitioning algorithms. The key idea is to select the number of change-points that minimizes the squared prediction error, which measures the fit of a specified model for a new sample. We develop a cross-validation estimation scheme based on an order-preserved sample-splitting strategy, and establish its asymptotic selection consistency under some mild conditions. Effectiveness of the proposed selection criterion is demonstrated on a variety of numerical experiments and real-data examples.

Entities:  

Keywords:  Cross-validation; Dynamic programming; Least-squares; Model selection; Multiple change-point model; Selection consistency

Year:  2020        PMID: 32747841      PMCID: PMC7397423          DOI: 10.1214/19-aos1814

Source DB:  PubMed          Journal:  Ann Stat        ISSN: 0090-5364            Impact factor:   4.028


  6 in total

1.  A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data.

Authors:  Nancy R Zhang; David O Siegmund
Journal:  Biometrics       Date:  2007-03       Impact factor: 2.571

2.  Multiple Change-Point Detection via a Screening and Ranking Algorithm.

Authors:  Ning Hao; Yue Selena Niu; Heping Zhang
Journal:  Stat Sin       Date:  2013-07-01       Impact factor: 1.261

3.  Algorithms for the optimal identification of segment neighborhoods.

Authors:  I E Auger; C E Lawrence
Journal:  Bull Math Biol       Date:  1989       Impact factor: 1.758

4.  Optimal Sparse Segment Identification with Application in Copy Number Variation Analysis.

Authors:  X Jessie Jeng; T Tony Cai; Hongzhe Li
Journal:  J Am Stat Assoc       Date:  2012-01-01       Impact factor: 5.033

5.  THE SCREENING AND RANKING ALGORITHM TO DETECT DNA COPY NUMBER VARIATIONS.

Authors:  Yue S Niu; Heping Zhang
Journal:  Ann Appl Stat       Date:  2012-09       Impact factor: 2.083

6.  A computationally efficient nonparametric approach for changepoint detection.

Authors:  Kaylea Haynes; Paul Fearnhead; Idris A Eckley
Journal:  Stat Comput       Date:  2016-07-28       Impact factor: 2.559

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.