Budowa i zastosowania korpusu monitorującego MoncoPL | Zendy

Piotr Pęzik | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Budowa i zastosowania korpusu monitorującego MoncoPL

Author(s) -

Piotr Pęzik

Publication year - 2020

Publication title -

forum lingwistyczne

Language(s) - English

Resource type - Journals

eISSN - 2450-2758

pISSN - 2449-9587

DOI - 10.31261/fl.2020.07.11

Subject(s) - neologism , computer science , corpus linguistics , word (group theory) , natural language processing , artificial intelligence , linguistics , information retrieval , philosophy

This paper introduces the methodology of compiling and maintaining MoncoPL, a large monitor corpus of web-based Polish. Furthermore, an overview of the search engine of the same name is provided to show how the size and composition of the corpus, currently reaching over 5.6 billion word tokens, facilitates research on distributional properties of rare words, neologisms and phraseological units. Finally, the article exemplifies some advantages of using a densely-sampled diachronic corpus for the purposes of observing frequency trends and cycles of various constructions in online media discourse.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research