An Oversampling-enhanced Multi-class Imbalanced Classification Framework for Patient Health Status Prediction Using Patient-reported Outcomes
Author(s) -
Yang Yan,
Zhong Chen,
Cai Xu,
Xinglei Shen,
Jay Shiao,
John Einck,
Ronald C Chen,
Hao Gao
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3617316
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Patient-reported outcomes (PROs), directly captured from cancer patients undergoing radiation therapy, play a crucial role in guiding clinicians’ counseling on treatment-related toxicities. Accurate prediction and assessment of symptoms and health status linked to PROs are essential for improving clinical decision-making and planning post-treatment support as patients transition into survivorship. However, raw PRO data collected in clinical settings presents two inherent challenges, including data sparsity (due to incomplete item responses) and imbalanced toxicity distributions. These factors complicate predictive modeling. This study investigates machine learning techniques to address these challenges by predicting outcomes such as pain and sleep disturbances using PRO datasets from a cancer therapy center. We implement advanced classifiers (i.e., RF, XGBoost, GB, SVM, MLP-Bagging, and LR) for multi-class imbalance tasks across three cancers. To address minority underrepresentation, we apply oversampling while preserving class ratios. Experimental results demonstrate RF and XGBoost’s strong generalization, highlighting their utility in categorizing post-therapy severity levels for clinical decision support.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom