Annotated corpus and the empirical evaluation of probability estimates of grammatical forms | Zendy

Nada Ševa | Zendy; Aleksandar Kostić | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Annotated corpus and the empirical evaluation of probability estimates of grammatical forms

Author(s) -

Nada Ševa,

Aleksandar Kostić

Publication year - 2003

Publication title -

psihologija

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.222

H-Index - 16

eISSN - 1451-9283

pISSN - 0048-5705

DOI - 10.2298/psi0303255s

Subject(s) - psycholinguistics , noun , serbian , natural language processing , computer science , artificial intelligence , relevance (law) , linguistics , psychology , cognition , philosophy , neuroscience , political science , law

The aim of the present study is to demonstrate the usage of an annotated corpus in the field of experimental psycholinguistics. Specifically, we demonstrate how the manually annotated Corpus of Serbian Language (Kostić, Đ. 2001) can be used for probability estimates of grammatical forms, which allow the control of independent variables in psycholinguistic experiments. We address the issue of processing Serbian inflected forms within two subparadigms of feminine nouns. In regression analysis, almost all processing variability of inflected forms has been accounted for by the amount of information (i.e. bits) carried by the presented forms. In spite of the fact that probability distributions of inflected forms for the two paradigms differ, it was shown that the best prediction of processing variability is obtained by the probabilities derived from the predominant subparadigm which encompasses about 80% of feminine nouns. The relevance of annotated corpora in experimental psycholinguistics is discussed more in detail

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research