Automating Text Simplification Using Pictographs for People with Language Deficits

Open Access

Author(s) - Mai Farag Imam, Amal Elsayed Aboutabl, Ensaf Hussein Mohamed

Publication year - 2019

Publication title - International Journal of Information Technology and Computer Science

Language(s) - English

Resource type - Journals

eISSN - 2074-9015

pISSN - 2074-9007

DOI - 10.5815/ijitcs.2019.07.04

Subject(s) - computer science, natural language processing, automatic summarization, artificial intelligence, precision and recall, preprocessor, lemmatisation, set (abstract data type), context (archaeology), word (group theory), lexical analysis, stop words, spelling, information retrieval, programming language, linguistics, paleontology, philosophy, biology

Automating text simplification is a challenging research area because of the compound structures present in natural languages. The social involvement of people with language deficits can be enhanced by giving them means to communicate with the outside world, for instance by using the internet independently. Replacing text with pictographs is one such means. This paper presents a system that performs text simplification by translating text into pictographs. The proposed system consists of a sequence of phases. First, a simple summarization technique reduces the number of sentences before they are converted to pictures. Next, the text is preprocessed, including tokenization and lemmatization. The resulting text passes through a spelling checker and then a word sense disambiguation (WSD) algorithm, which finds the words most suitable to the context in order to increase the accuracy of the result. Using WSD improves the results, and the system yields its best results when a support vector machine (SVM) is used for WSD. Finally, the text is translated into a list of images. For testing and evaluation, a test corpus of 37 Basic English sentences was manually constructed. In the experiments, the generated images were presented to ten typically developing children, who were asked to reproduce the input sentences from the pictographs. The reproduced sentences were evaluated using precision, recall, and F-score. Results show that the proposed system enhances pictograph understanding and converts text to pictographs with precision, recall, and F-score of over 90% when SVM is used for WSD. According to the authors, these techniques have not been combined before, and their combination increases the system's accuracy over that of other studies.
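
The phases named in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the stop-word list, lemma table, pictograph dictionary, and all function names are hypothetical, the summarization and WSD phases are omitted, fuzzy matching from the standard library stands in for the spelling checker, and token overlap is only one plausible way to compute the precision, recall, and F-score that the abstract reports.

```python
import re
from collections import Counter
from difflib import get_close_matches

# Hypothetical toy resources; the paper's real lexicon and pictograph set are not given here.
STOP_WORDS = {"the", "a", "an", "is", "are", "to"}
LEMMAS = {"eats": "eat", "eating": "eat", "apples": "apple", "boys": "boy"}
PICTOGRAPHS = {"boy": "boy.png", "eat": "eat.png", "apple": "apple.png"}


def preprocess(sentence):
    """Tokenize, drop stop words, and lemmatize with a lookup table."""
    tokens = re.findall(r"[a-z]+", sentence.lower())
    return [LEMMAS.get(t, t) for t in tokens if t not in STOP_WORDS]


def correct_spelling(token):
    """Snap an unknown token to the closest pictograph label, if one is close enough."""
    if token in PICTOGRAPHS:
        return token
    match = get_close_matches(token, list(PICTOGRAPHS), n=1, cutoff=0.8)
    return match[0] if match else token


def to_pictographs(sentence):
    """Map each content word to an image file name; unmapped words are skipped."""
    tokens = [correct_spelling(t) for t in preprocess(sentence)]
    return [PICTOGRAPHS[t] for t in tokens if t in PICTOGRAPHS]


def precision_recall_f1(reference, reproduced):
    """Token-overlap precision, recall, and F-score between two sentences (assumed metric)."""
    ref = Counter(preprocess(reference))
    rep = Counter(preprocess(reproduced))
    overlap = sum((ref & rep).values())
    precision = overlap / sum(rep.values()) if rep else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    print(to_pictographs("The boy is eating an aple."))
    print(precision_recall_f1("The boy eats an apple", "The girl eats an apple"))
```

In this sketch a reproduced sentence is scored against the input sentence after both pass through the same preprocessing, so differences in inflection or stop words do not count as errors. A real system would replace the lookup tables with a full lemmatizer, a spelling checker, and a trained WSD model.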
