
Morphology-based investigation of differences between spoken and written isiZulu
Author(s) -
Laurette Marais,
Ilana Wilken
Publication year - 2021
Language(s) - English
DOI - 10.55492/dhasa.v3i01.3860
Subject(s) - morpheme , computer science , natural language processing , linguistics , spoken language , artificial intelligence , philosophy
Research attempting to describe and quantify the differences between spoken and written language has been done for languages such as English, but not for isiZulu. In this paper, we present a quantitative investigation into such differences by considering the morphology of tokens in a transcribed spoken isiZulu corpus and a written isiZulu corpus. We use morpheme tags as a proxy for features that typically differ between spoken and written language, and calculate relative differences of the occurrence of specific morpheme tags from analyses produced by ZulMorph, a finite-state morphological analyser for isiZulu. This analysis presents information that could inform the development of voice-enabled computer applications for isiZulu.