Counterfactual Medical Images Generation for Lung Disease Diagnosis Using Probabilistic Causal Models and Active Learning
Author(s) -
Yifei Zhu,
Lei Zhang,
Chris Sainsbury,
Feng Dong,
John MacLay,
David J. Lowe,
Xujiong Ye
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3615683
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Recent advancements in deep learning have shown promise in diagnosing lung diseases from medical images, but these methods often lack causal inference capabilities, limiting their applicability in clinical decision-making. This study focuses on leveraging causal generative modelling for counterfactual analysis to enhance the understanding and diagnosis of lung diseases. We developed a Structured Causal Model designed to generate clinically meaningful counterfactual images of lung diseases. Our framework integrates active learning with uncertainty measurement to address data quality issues in clinical datasets and refine the training set distribution. Inspired by the human-in-the-loop concept, expert feedback was incorporated into the training pipeline to ensure the generated images align with clinical expectations. We evaluated the generated counterfactuals using model accuracy and a specialized expert model that calculates disease probabilities based on the images. The proposed model achieved a 93.27% accuracy in generating counterfactual images representative of the corresponding clinical conditions, as confirmed by medical experts. Active learning with uncertainty measurement effectively enhanced the data distribution, maintaining a heavy-tailed structure to better reflect real-world clinical data. The integration of expert knowledge further ensured the clinical validity and relevance of the counterfactuals, supporting more informed diagnostic and prognostic decisions. Our study highlights the potential of causal generative modelling to improve lung disease diagnosis and prognosis by generating clinically meaningful counterfactual images, supported by active learning and expert feedback.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom