
Single Document Text Summarization of a Resource-Poor Language using an Unsupervised Technique
Author(s) -
Gunadeep Chetia,
G. C. Hazarika
Publication year - 2019
Publication title -
international journal of engineering and advanced technology
Language(s) - English
Resource type - Journals
ISSN - 2249-8958
DOI - 10.35940/ijeat.a2250.109119
Subject(s) - automatic summarization , computer science , latent semantic analysis , natural language processing , artificial intelligence , singular value decomposition , resource (disambiguation) , space (punctuation) , language model , task (project management) , information retrieval , computer network , management , economics , operating system
Automatic text summarization of a resource-poor language is a challenging task. Unsupervised extractive techniques are often preferred for such languages due to scarcity of resources. Latent Semantic Analysis (LSA) is an unsupervised technique which automatically identifies semantically important sentences from a text document. Two methods based on Latent Semantic Analysis have been evaluated on two datasets of a resource-poor language using Singular Value Decomposition (SVD) on different vector-space models. The performance of the methods is evaluated using ROUGE-L scores obtained by comparing the system generated summaries with human generated model summaries. Both the methods are found to be performing better for shorter documents than longer ones.