Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

On the ROC Area of Ensemble Forecasts for Rare Events

Version 1 : Received: 26 November 2021 / Approved: 29 November 2021 / Online: 29 November 2021 (12:36:55 CET)

A peer-reviewed article of this Preprint also exists.

Ben Bouallègue, Z.; Richardson, D.S. On the ROC Area of Ensemble Forecasts for Rare Events. Weather and Forecasting 2022, 37, 787–796, doi:10.1175/waf-d-21-0195.1. Ben Bouallègue, Z.; Richardson, D.S. On the ROC Area of Ensemble Forecasts for Rare Events. Weather and Forecasting 2022, 37, 787–796, doi:10.1175/waf-d-21-0195.1.

Abstract

The relative operating characteristic (ROC) curve is a popular diagnostic tool in forecast verification, with the area under the ROC curve (AUC) used as a verification metric measuring the discrimination ability of a forecast. Along with calibration, discrimination is deemed as a fundamental probabilistic forecast attribute. In particular, in ensemble forecast verification, AUC provides a basis for the comparison of potential predictive skill of competing forecasts. While this approach is straightforward when dealing with forecasts of common events (e.g. probability of precipitation), the AUC interpretation can turn out to be oversimplistic or misleading when focusing on rare events (e.g. precipitation exceeding some warning criterion). How should we interpret AUC of ensemble forecasts when focusing on rare events? How can changes in the way probability forecasts are derived from the ensemble forecast affect AUC results? How can we detect a genuine improvement in terms of predictive skill? Based on verification experiments, a critical eye is cast on the AUC interpretation to answer these questions. As well as the traditional trapezoidal approximation and the well-known bi-normal fitting model, we discuss a new approach which embraces the concept of imprecise probabilities and relies on the subdivision of the lowest ensemble probability category.

Keywords

ROC area; discrimination; verification artefacts; trapezoidal approximation; bi-normal model; imprecise probability

Subject

Computer Science and Mathematics, Probability and Statistics

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.