
Domain-Specific Multi-Document Political News Summarization Using BART and ACT-GAN
Author(s) -
Zekai Nie,
Masooma Arshad,
Syed Khaldoon Khurshid,
Abqa Javed,
Talha Waheed,
Qianqian Zhang,
Muhammad Farrukh Shahzad
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3598058
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
The exponential growth of digital content has made it increasingly difficult for users to understand the information, particularly in domains like political news. The study provides a framework for domain-specific multi-document political news summarization by using the transformer-based models that are fine-tuned on a synthetic dataset. The use of a generative Artificial Intelligence (AI) model, specifically the Bidirectional and Autoregressive Transformer (BART) model, for summarizing multi-document political news was explored in this paper. To improve the summarization process, real-world data was augmented, and a synthetic dataset was generated using the Adversarial Contextualized Text-Generative Adversarial Network (ACT-GAN) model to create a diverse and domain-specific training dataset. All four models, including BART, Pegasus, Text-To-Text Transfer Transformer (T5), and Sequence-to-Sequence (Seq2Seq), were fine-tuned and evaluated using a synthetic dataset generated via ACT-GAN and benchmarked using Recall-Oriented Understudy for Gisting Evaluation (ROUGE) metrics. BART performs the best in accuracy, recall, and F1-score, showing it can create clear and logical summaries while keeping the important context and structure intact. The results show that the transformer with the synthetic data helps to improve the quality of the political data summarization. In addition to addressing the challenges of extensive information, this framework also employs generative AI models in text summarization domains.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom