
Enhancing Structured Pathology Report Generation with Foundation Model and Modular Design
Author(s) -
Kyung A Kim,
Sungman Hong,
Sehwan Yoo,
Yousun Kang,
Hyo Sup Shim
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3588121
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Pathology report generation is challenging as it requires analyzing giga-pixel Whole Slide Images (WSIs), making it a complex and labor-intensive task, yet crucial for clinical decision-making. An AI-driven report generation system can assist in producing accurate and concise reports. To address these needs, we designed a two-stage framework consisting image analysis and text generation, improving diagnostic performance and optimizing it for real-world clinical application. In the first stage, using 752 high resolution WSIs from bladder tumor tissue samples collected at Korea University Medical Center, we leveraged foundation models to extract robust features. Then, we built a multi-label classification based on key diagnostic elements to achieve high performance with an Attention-based Multi-Instance Learning (MIL) model and a Transformer-based MIL with Knowledge Distillation. For report generation stage, we designed a T5-based text-to-text model, simplifying input representations whereas integrating data augmentation to improve stability and generalization. The proposed model achieved a ROUGE score of 0.87, a BLEU-4 score of 0.94, a Jaccard score of 0.89 and a BioLLM score of 0.97. An additional evaluation conducted by the institution maintaining the exclusive K-MEDICON test datasets confirmed a consistent performance yielding an overall score of 0.88, calculated as a weighted sum of aforementioned metrics. Its consistency validated through a Mean Opinion Score (MOS) evaluation by pathologists. By systematically integrating foundation models and modular structure, the benchmark results demonstrate that the task can be effectively solved with limited data and computational resources, indicating promising potential for real-world clinical adaptation.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom