Discrimination of Different Serbian Pronunciations from Shtokavian Dialect
Author(s) - Darko Brodić, Alessia Amelio
Publication year - 2017
Publication title - Procedia Computer Science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2017.08.047
Subject(s) - Serbian, Unicode, computer science, set (abstract data type), artificial intelligence, character encoding, natural language processing, character (mathematics), image (mathematics), linguistics, programming language, geometry, mathematics, philosophy
This paper proposes a new methodology for discriminating between different pronunciations in the Shtokavian dialect of the Serbian language. First, the written language (Unicode text) is converted into codes according to the energy status of each character in the text line. The resulting set of codes is treated as a grayscale image. Then, the local structures of the image are explored by local binary operators. This yields a set of vectors that differentiates the various pronunciations of the Serbian language. The experiment is performed on fifty documents written in Serbian. A comparison between the proposed method and the n-gram method shows a clear advantage for the proposed method.
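The pipeline outlined in the abstract (characters to codes, codes read as a grayscale signal, local binary operators producing a vector) can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the authors' implementation: the four-level coding by a glyph's vertical extent (base, ascender, descender, full), the letter sets for Latin-script Serbian, the one-dimensional form of the local binary pattern, and the function names `encode_line`, `lbp_1d`, and `lbp_histogram` are all hypothetical.

```python
from collections import Counter

# ASSUMPTION: a four-level "energy" coding by the vertical extent of each glyph.
# These letter sets are illustrative only and are not taken from the paper.
ASCENDER = set("bdfhiklt" + "čćšžđ")   # glyphs reaching above the x-height
DESCENDER = set("gpqy")                # glyphs reaching below the baseline
FULL = set("j")                        # glyphs doing both


def encode_line(line):
    """Map each letter of a text line to a code 0..3; non-letters are skipped."""
    codes = []
    for ch in line:
        if not ch.isalpha():
            continue
        if ch in FULL:
            codes.append(3)
        elif ch in DESCENDER:
            codes.append(2)
        elif ch in ASCENDER or ch.isupper():
            codes.append(1)
        else:
            codes.append(0)
    return codes


def lbp_1d(codes, radius=1):
    """1-D local binary pattern: compare each center with its 2*radius neighbors."""
    patterns = []
    for i in range(radius, len(codes) - radius):
        center = codes[i]
        neighbors = codes[i - radius:i] + codes[i + 1:i + radius + 1]
        value = 0
        for n in neighbors:
            # Bit is 1 when the neighbor is at least as large as the center.
            value = (value << 1) | (1 if n >= center else 0)
        patterns.append(value)
    return patterns


def lbp_histogram(codes, radius=1):
    """Normalized histogram of pattern values, usable as a feature vector."""
    n_bins = 2 ** (2 * radius)
    counts = Counter(lbp_1d(codes, radius))
    total = sum(counts.values())
    return [counts.get(b, 0) / total if total else 0.0 for b in range(n_bins)]


if __name__ == "__main__":
    # Illustrative Ekavian / Ijekavian word pair (not data from the paper).
    for word in ("mleko", "mlijeko"):
        codes = encode_line(word)
        print(word, codes, lbp_histogram(codes, radius=1))
```

Under these assumptions, two spellings of the same word give different code sequences and therefore different histograms, which is the kind of difference a classifier could exploit. A real experiment would compute the histogram over whole documents rather than single words.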