z-logo
Premium
ArA*summarizer: An Arabic text summarization system based on subtopic segmentation and using an A* algorithm for reduction
Author(s) -
Bahloul Belahcene,
Aliane Hassina,
Benmohammed Mohamed
Publication year - 2020
Publication title -
expert systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.365
H-Index - 38
eISSN - 1468-0394
pISSN - 0266-4720
DOI - 10.1111/exsy.12476
Subject(s) - automatic summarization , computer science , natural language processing , artificial intelligence , graph , multi document summarization , arabic , information retrieval , text graph , theoretical computer science , linguistics , philosophy
Automatic text summarization is a field situated at the intersection of natural language processing and information retrieval. Its main objective is to automatically produce a condensed representative form of documents. This paper presents ArA*summarizer, an automatic system for Arabic single document summarization. The system is based on an unsupervised hybrid approach that combines statistical, cluster‐based, and graph‐based techniques. The main idea is to divide text into subtopics then select the most relevant sentences in the most relevant subtopics. The selection process is done by an A* algorithm executed on a graph representing the different lexical–semantic relationships between sentences. Experimentation is conducted on Essex Arabic summaries corpus and using recall‐oriented understudy for gisting evaluation, automatic summarization engineering, merged model graphs, and n‐gram graph powered evaluation via regression evaluation metrics. The evaluation results showed the good performance of our system compared with existing works.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here