z-logo
open-access-imgOpen Access
Sample Stratification in Verification of Ensemble Forecasts of Continuous Scalar Variables: Potential Benefits and Pitfalls
Author(s) -
Joseph Bellier,
Isabella Zin,
Guillaume Bontron
Publication year - 2017
Publication title -
monthly weather review
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.862
H-Index - 179
eISSN - 1520-0493
pISSN - 0027-0644
DOI - 10.1175/mwr-d-16-0487.1
Subject(s) - stratification (seeds) , histogram , forecast verification , homogeneous , computer science , mathematics , scalar (mathematics) , statistics , econometrics , data mining , forecast skill , artificial intelligence , seed dormancy , botany , germination , geometry , combinatorics , dormancy , image (mathematics) , biology
In the verification field, stratification is the process of dividing the sample of forecast–observation pairs into quasi-homogeneous subsets, in order to learn more on how forecasts behave under specific conditions. A general framework for stratification is presented for the case of ensemble forecasts of continuous scalar variables. Distinction is made between forecast-based, observation-based, and external-based stratification, depending on the criterion on which the sample is stratified. The formalism is applied to two widely used verification measures: the continuous ranked probability score (CRPS) and the rank histogram. For both, new graphical representations that synthesize the added information are proposed. Based on the definition of calibration, it is shown that the rank histogram should be used within a forecast-based stratification, while an observation-based stratification leads to significantly nonflat histograms for calibrated forecasts. Nevertheless, as previous studies have warned, statistical artifacts created by a forecast-based stratification may still occur, thus a graphical test to detect them is suggested. To illustrate potential insights about forecast behavior that can be gained from stratification, a numerical example with two different datasets of mean areal precipitation forecasts is presented.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here