Metadata-Free Rumor Detection via LLM-Augmented Reasoning
Author(s) -
G. Hanieh Asadi,
Ali Hamzeh,
Niloofar Mozafari
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3610286
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Rumor detection in social media is critical for countering the rapid spread of misinformation, especially in dynamic and low-resource settings. In this work, we propose a unified framework that leverages large language models (LLMs) not only for classification but also for semantically enriching the input data through lightweight, class-agnostic augmentation. Unlike prior methods that rely on user metadata, graph-based propagation structures, or external knowledge sources, our approach (MFRD) prompts LLMs to generate useful augments. Two high-level summaries for each event are generated: a discussion narrative distilled from stance-rich user replies, and a propagation analysis inferred from repost patterns and temporal dynamics. These augmentations are integrated into structured event documents that combine source posts, user feedback, and contextual reasoning cues. We evaluate the effectiveness of this enriched input across five modeling paradigms: (1) TF-IDF with a multi-layer perceptron (MLP), (2) frozen transformer encoders with pooling, (3) fine-tuned transformer encoders, (4) prompt-based zero- and few-shot generative LLMs, and (5) an instruction-tuned LLM, which forms the core of our proposed method. Our results show that the integration of LLM-generated augmentations with instruction tuning yields strong and interpretable performance, bridging content, discourse, and temporal signals without reliance on user profiles or external resources.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom