Annotated corpus and the empirical evaluation of probability estimates of grammatical forms
Author(s) - Nada Ševa, Aleksandar Kostić
Publication year - 2003
Publication title - Psihologija
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.222
H-Index - 16
eISSN - 1451-9283
pISSN - 0048-5705
DOI - 10.2298/psi0303255s
Subject(s) - psycholinguistics , noun , serbian , natural language processing , computer science , artificial intelligence , relevance (law) , linguistics , psychology , cognition , philosophy , neuroscience , political science , law
The aim of the present study is to demonstrate the use of an annotated corpus in experimental psycholinguistics. Specifically, we demonstrate how the manually annotated Corpus of Serbian Language (Kostić, Đ. 2001) can be used to estimate the probabilities of grammatical forms, which allows independent variables to be controlled in psycholinguistic experiments. We address the processing of Serbian inflected forms within two subparadigms of feminine nouns. In a regression analysis, almost all of the processing variability of inflected forms was accounted for by the amount of information (i.e. bits) carried by the presented forms. Although the probability distributions of inflected forms differ between the two subparadigms, the best prediction of processing variability was obtained from the probabilities derived from the predominant subparadigm, which encompasses about 80% of feminine nouns. The relevance of annotated corpora in experimental psycholinguistics is discussed in more detail.
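
As a rough illustration of the information measure mentioned in the abstract: in standard information theory, a form with probability p carries -log2(p) bits. The sketch below assumes that probabilities are estimated as relative frequencies of forms in an annotated corpus. The function name and the counts are hypothetical and are not taken from the study, whose exact measure may differ from this textbook definition.

```python
import math


def information_bits(counts):
    """Map each form to -log2(p), where p is its relative frequency.

    `counts` is a dict of form label -> corpus frequency (assumed input shape).
    Forms with a zero count are skipped because -log2(0) is undefined.
    """
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("counts must sum to a positive number")
    return {
        form: -math.log2(n / total)
        for form, n in counts.items()
        if n > 0
    }


# Hypothetical counts, for illustration only (not data from the corpus).
example_counts = {"form_a": 50, "form_b": 25, "form_c": 25}
bits = information_bits(example_counts)
for form, value in bits.items():
    print(f"{form}: {value:.2f} bits")
```

Under this definition, less probable forms carry more bits, so a quantity of this kind could serve as a predictor in a regression on processing measures.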
