
Multiple Changepoint Detection Using Metadata
Author(s) -
Yingbo Li,
Robert Lund
Publication year - 2015
Publication title -
journal of climate
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.315
H-Index - 287
eISSN - 1520-0442
pISSN - 0894-8755
DOI - 10.1175/jcli-d-14-00442.1
Subject(s) - metadata , bayesian probability , computer science , series (stratigraphy) , posterior probability , distribution (mathematics) , data mining , statistics , artificial intelligence , geology , mathematics , world wide web , mathematical analysis , paleontology
This paper examines multiple changepoint detection procedures that use station history (metadata) information. Metadata records are available for some climate time series; however, these records are notoriously incomplete and many station moves and gauge changes are unlisted (undocumented). The shift in a series must be comparatively larger to declare a changepoint at an undocumented time. Also, the statistical methods for the documented and undocumented scenarios radically differ: a simple t test adequately detects a single mean shift at a documented changepoint time, while a tmax distribution is appropriate for a single undocumented changepoint analysis. Here, the multiple changepoint detection problem is considered via a Bayesian approach, with the metadata record being used to formulate a prior distribution of the changepoint numbers and their location times. This prior distribution is combined with the data to obtain a posterior distribution of changepoint numbers and location times. Estimates of the most likely number of changepoints and times are obtained from the posterior distribution. Simulation studies demonstrate the efficacy of this approach. The methods, which are applicable with or without a reference series, are applied in the analysis of an annual precipitation series from New Bedford, Massachusetts.