Automatic text summarization of Konkani texts using pre-trained word embeddings and deep learning

Jovi D’Silva; Uzzal Sharma

Open Access

Publication year - 2022

Publication title - International Journal of Power Electronics and Drive Systems / International Journal of Electrical and Computer Engineering

Language(s) - English

Resource type - Journals

eISSN - 2722-2578

pISSN - 2722-256X

DOI - 10.11591/ijece.v12i2.pp1990-2000

Subject(s) - automatic summarization, computer science, artificial intelligence, natural language processing, word (group theory), task (project management), deep learning, perceptron, feature (linguistics), artificial neural network, linguistics, philosophy, management, economics

Automatic text summarization has gained immense popularity in research, and several methods have been explored for obtaining effective summaries. However, most of this work pertains to the most widely spoken languages in the world. In this paper, we explore extractive automatic text summarization using a deep learning approach and apply it to the Konkani language, which is a low-resource language: resources such as data, tools, speakers, and experts are limited. In the proposed technique, Facebook’s fastText pre-trained word embeddings are used to obtain a vector representation for each sentence. A deep multi-layer perceptron is then trained on these feature vectors, treating summary generation as a supervised binary classification task. Using pre-trained fastText word embeddings eliminated the need for a large training set and reduced training time. The system-generated summaries were evaluated against ‘gold-standard’ human-generated summaries using the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) toolkit. The results showed that the performance of the proposed system closely matched that of the human annotators in generating summaries.
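Two steps in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes that a sentence vector is formed by averaging word vectors (the abstract does not state the pooling method), and it computes only ROUGE-1 recall rather than reproducing the full ROUGE toolkit. The `TOY_VECTORS` table and both function names are hypothetical stand-ins; real fastText vectors would be loaded from a pre-trained model.

```python
from collections import Counter

# Toy 3-dimensional vectors standing in for pre-trained fastText embeddings
# (hypothetical values chosen for illustration only).
TOY_VECTORS = {
    "konkani": [1.0, 0.0, 0.0],
    "text": [0.0, 1.0, 0.0],
    "summary": [0.0, 0.0, 1.0],
}


def sentence_vector(tokens, vectors, dim=3):
    """Average the vectors of known tokens (assumed pooling method)."""
    known = [vectors[t] for t in tokens if t in vectors]
    if not known:
        # No known token: fall back to a zero vector of the right size.
        return [0.0] * dim
    return [sum(column) / len(known) for column in zip(*known)]


def rouge1_recall(candidate_tokens, reference_tokens):
    """Clipped unigram overlap divided by the number of reference unigrams."""
    if not reference_tokens:
        return 0.0
    candidate_counts = Counter(candidate_tokens)
    reference_counts = Counter(reference_tokens)
    overlap = sum(
        min(count, candidate_counts[token])
        for token, count in reference_counts.items()
    )
    return overlap / len(reference_tokens)


if __name__ == "__main__":
    print(sentence_vector(["konkani", "text", "unknown"], TOY_VECTORS))
    print(rouge1_recall(["the", "cat", "sat"], ["the", "cat", "sat", "down"]))
```

In a pipeline like the one described, the vector for each sentence would be passed to a binary classifier that decides whether the sentence belongs in the summary, and the recall score would compare the selected sentences against a human-written reference.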
