Premium
Reinforcement learning design for cancer clinical trials
Author(s) -
Zhao Yufan,
Kosorok Michael R.,
Zeng Donglin
Publication year - 2009
Publication title -
statistics in medicine
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.996
H-Index - 183
eISSN - 1097-0258
pISSN - 0277-6715
DOI - 10.1002/sim.3720
Subject(s) - reinforcement learning , computer science , artificial intelligence , clinical trial , machine learning , identification (biology) , set (abstract data type) , regression , medicine , statistics , mathematics , pathology , botany , biology , programming language
We develop reinforcement learning trials for discovering individualized treatment regimens for life‐threatening diseases such as cancer. A temporal‐difference learning method called Q‐learning is utilized that involves learning an optimal policy from a single training set of finite longitudinal patient trajectories. Approximating the Q‐function with time‐indexed parameters can be achieved by using support vector regression or extremely randomized trees. Within this framework, we demonstrate that the procedure can extract optimal strategies directly from clinical data without relying on the identification of any accurate mathematical models, unlike approaches based on adaptive design. We show that reinforcement learning has tremendous potential in clinical research because it can select actions that improve outcomes by taking into account delayed effects even when the relationship between actions and outcomes is not fully known. To support our claims, the methodology's practical utility is illustrated in a simulation analysis. In the immediate future, we will apply this general strategy to studying and identifying new treatments for advanced metastatic stage IIIB/IV non‐small cell lung cancer, which usually includes multiple lines of chemotherapy treatment. Moreover, there is significant potential of the proposed methodology for developing personalized treatment strategies in other cancers, in cystic fibrosis, and in other life‐threatening diseases. Copyright © 2009 John Wiley & Sons, Ltd.