A Reinforcement-Learning Approach to Failure-Detection Scheduling | Zendy

Zeng Fancong | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

A Reinforcement-Learning Approach to Failure-Detection Scheduling

Author(s) -

Zeng Fancong

Publication year - 2007

Language(s) - English

Resource type - Book series

ISBN - 0-7695-3035-4

DOI - 10.1109/qsic.2007.7

A failure-detection scheduler for an online production system must strike a tradeoff between performance and reli- ability. If failure-detection processes are run too frequently, valuable system resources are spent checking and recheck- ing for failures. However, if failure-detection processes are run too rarely, a failure can remain undetected for a long time. In both cases, system performability suffers. We present a model-based learning approach that estimates the failure rate and then performs an optimization to find the tradeoff that maximizes system performability. We show that our approach is not only theoretically sound but prac- tically effective, and we demonstrate its use in an imple- mented automated deadlock-detection system for Java.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research