
Optimizing Heart Disease Prediction: A Comparative Analysis of Tree-Based Ensembles with Feature Expansion and Selection
Author(s) -
K Aswini,
Kriti Arya
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3590492
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Cardiovascular disease (CVD) is the leading cause of death worldwide, emphasizing the importance of accurate early detection. This study examines the efficacy of tree based ensemble machine learning models that have been improved using Feature Expansion and Selection (FES-EM).We considered the Mendeley CVD dataset of 1,000 patient records for this study, where new features were generated with feature generation operators, and the most significant ones were then retained by applying four feature selection techniques. Three hyperparameter tuning strategies were used to train and tune seven tree-based ensemble models to identify the best-performing approach. The results identified Decision Tree-Based Recursive Feature Elimination (DTRFECV) with AdaBoost optimized for grid search as the most effecive model, achieving 98.20% sensitivity, 97.98% F1 score and 97.75% accuracy. Validation on both primary and secondary datasets demonstrated that models incorporating FES outperformed those without FES. These findings underscore the novelty of a multi-stage machine learning pipeline that integrates clinically driven feature expansion with systematic optimization to build a robust and generalizable framework for early and accurate CVD diagnosis.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom