Abstract
Detecting changepoints in data sets with many variates is a data science challenge of increasing importance. Motivated by the problem of detecting changes in the incidence of terrorism from a global terrorism database, we propose a novel approach to multiple changepoint detection in multivariate time series. Our method, which we call SUBSET, is a model-based approach which uses a penalised likelihood to detect changes for a wide class of parametric settings. We provide theory that guides the choice of penalties to use for SUBSET, and that shows it has high power to detect changes regardless of whether only a few variates or many variates change. Empirical results show that SUBSET out-performs many existing approaches for detecting changes in mean in Gaussian data; additionally, unlike these alternative methods, it can be easily extended to non-Gaussian settings such as are appropriate for modelling counts of terrorist events.
Original language | English |
---|---|
Pages (from-to) | 1303-1325 |
Number of pages | 23 |
Journal | Journal of the Royal Statistical Society. Series A: Statistics in Society |
Volume | 184 |
Issue number | 4 |
Early online date | 4 Aug 2021 |
DOIs | |
Publication status | Published - Oct 2021 |
Bibliographical note
Funding Information:This paper is based on work completed while Tickle was part of the EPSRC funded STOR‐i Centre for Doctoral Training (EP/L015692/1). Eckley and Fearnhead gratefully acknowledge the financial support of EPSRC grant EP/N031938/1. The authors also acknowledge British Telecommunications plc (BT) for financial support, and are grateful to Kjeld Jensen, Dave Yearling and Guillem Rigaill for helpful discussions. Finally, the authors would like to thank the editors and two anonymous reviewers for several helpful comments and feedback.
Publisher Copyright:
© 2021 The Authors. Journal of the Royal Statistical Society: Series A (Statistics in Society) published by John Wiley & Sons Ltd on behalf of Royal Statistical Society