Automatic construction of generic stop words list for hausa text | Zendy

Abdulkadir Abubakar Bichi | Zendy; Ruhaidah Samsudin | Zendy; Rohayanti Hassan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Automatic construction of generic stop words list for hausa text

Author(s) -

Abdulkadir Abubakar Bichi,

Ruhaidah Samsudin,

Rohayanti Hassan

Publication year - 2022

Publication title -

indonesian journal of electrical engineering and computer science

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.241

H-Index - 17

eISSN - 2502-4760

pISSN - 2502-4752

DOI - 10.11591/ijeecs.v25.i3.pp1501-1507

Subject(s) - hausa , computer science , dimension (graph theory) , space (punctuation) , search engine indexing , information retrieval , span (engineering) , artificial intelligence , natural language processing , word (group theory) , mathematics , linguistics , engineering , philosophy , civil engineering , geometry , pure mathematics , operating system

Stop-words are words having the highest frequencies in a document without any significant information. They are characterized by having common relations within a cluster. They are the noise of the text that are evenly distributed over a document. Removal of stop words improve the performance and accuracy of information retrieval algorithms and machine learning at large. It saves the storage space by reducing the vector space dimension, and helps in effective documents indexing. This research generated a list of Hausa stop words automatically using aggregated method by combining frequency and statistics methods. The experiments are conducted using a primarily collected Hausa corpus consisting of 841 Hausa news articles of size 646862 words and finally a list of distinct 81 Hausa stop words is generated.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore