What (not) to expect when classifying rare events | Zendy

Rok Blagus | Zendy; Jelle J. Goeman | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

What (not) to expect when classifying rare events

Author(s) -

Rok Blagus,

Jelle J. Goeman

Publication year - 2016

Publication title -

briefings in bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.204

H-Index - 113

eISSN - 1477-4054

pISSN - 1467-5463

DOI - 10.1093/bib/bbw107

Subject(s) - classifier (uml) , constraint (computer aided design) , mathematics , rare events , computer science , artificial intelligence , event (particle physics) , pattern recognition (psychology) , machine learning , algorithm , statistics , physics , geometry , quantum mechanics

When building classifiers, it is natural to require that the classifier correctly estimates the event probability (Constraint 1), that it has equal sensitivity and specificity (Constraint 2) or that it has equal positive and negative predictive values (Constraint 3). We prove that in the balanced case, where there is equal proportion of events and non-events, any classifier that satisfies one of these constraints will always satisfy all. Such unbiasedness of events and non-events is much more difficult to achieve in the case of rare events, i.e. the situation in which the proportion of events is (much) smaller than 0.5. Here, we prove that it is impossible to meet all three constraints unless the classifier achieves perfect predictions. Any non-perfect classifier can only satisfy at most one constraint, and satisfying one constraint implies violating the other two constraints in a specific direction. Our results have implications for classifiers optimized using g-means or F1-measure, which tend to satisfy Constraints 2 and 1, respectively. Our results are derived from basic probability theory and illustrated with simulations based on some frequently used classifiers.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research