
The comparison of Lithuanian texts’ styles by using the statistical analysis of the universal quantitative characteristics
Author(s) -
Karolina Piaseckienė,
Marijus Radavičius,
Raimundas Stiklius
Publication year - 2010
Publication title -
lietuvos matematikos rinkinys
Language(s) - English
Resource type - Journals
eISSN - 2335-898X
pISSN - 0132-2818
DOI - 10.15388/lmr.2010.56
Subject(s) - lithuanian , computer science , natural language processing , linguistics , statistical analysis , artificial intelligence , statistics , mathematics , philosophy
Lithuanian language is quite complex and flexible, and its significantly complicates the development of efficient algorithms for the automatic processing of Lithuanian texts. For studying text-styles features were selected the universal quantitative characteristics that are unrelated to the text content and can be calculated for any text. This article shows how mathematical Statistics can help to distinguish and interpret the Lithuanian language styles. Studies of the log-linear models show theconnection between the letters and sounds structure and the scientific and fiction.