THE DICTIONARY STRUCTURE FOR EFFECTIVE WORD SEARCH
Author(s) - Waldemar Karwowski
Publication year - 2017
Publication title - Information Systems in Management
Language(s) - English
Resource type - Journals
eISSN - 2544-1728
pISSN - 2084-5537
DOI - 10.22630/isim.2017.6.4.3
Subject(s) - search engine indexing, computer science, wordnet, inflection, natural language processing, machine readable dictionary, word (group theory), bilingual dictionary, artificial intelligence, part of speech, process (computing), lexical database, information retrieval, speech recognition, linguistics, programming language, philosophy
This paper discusses selected issues in indexing documents written in Polish. Stemming and part-of-speech tagging algorithms, which are important in text analysis and indexing, are briefly described, and their suitability for Polish, a language with very extensive inflection, is then discussed. The usefulness of large dictionaries containing inflected forms, such as WordNet and an open-source dictionary of the Polish language, for stemming and part-of-speech tagging is also described. Two dictionary structures that enable effective word search are presented. The final part describes tests of the two implemented structures, carried out on six real texts and three artificially crafted texts, and formulates conclusions from those tests.
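The abstract does not say which two dictionary structures the paper implements. As a purely illustrative sketch of the general idea (looking up an inflected form to recover its base form and part of speech), the Python example below uses a character trie. The names `TrieNode`, `FormTrie`, `add`, and `lookup`, the tag strings, and the few sample Polish entries are assumptions made for this illustration and are not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Hypothetical entry type for this sketch: (base form, part-of-speech tag).
Entry = Tuple[str, str]


@dataclass
class TrieNode:
    # One child node per next character of an inflected form.
    children: Dict[str, "TrieNode"] = field(default_factory=dict)
    # Entries for the word that ends exactly at this node (empty if none).
    entries: List[Entry] = field(default_factory=list)


class FormTrie:
    """Maps inflected word forms to their (base form, tag) entries."""

    def __init__(self) -> None:
        self.root = TrieNode()

    def add(self, form: str, lemma: str, pos: str) -> None:
        # Walk the trie one character at a time, creating nodes as needed.
        node = self.root
        for ch in form.lower():
            node = node.children.setdefault(ch, TrieNode())
        node.entries.append((lemma, pos))

    def lookup(self, form: str) -> List[Entry]:
        # Return every entry stored for this exact form, or [] if unknown.
        node = self.root
        for ch in form.lower():
            if ch not in node.children:
                return []
            node = node.children[ch]
        return list(node.entries)


# A few sample entries, chosen only for illustration.
trie = FormTrie()
for form in ("kot", "kota", "kotem", "koty"):
    trie.add(form, "kot", "noun")      # forms of "kot" (cat)
trie.add("piszę", "pisać", "verb")     # "I write", from "pisać" (to write)
trie.add("piec", "piec", "noun")       # "stove"
trie.add("piec", "piec", "verb")       # "to bake": same form, different tag

print(trie.lookup("kotem"))
print(trie.lookup("piec"))
print(trie.lookup("psa"))
```

Lookup cost in this sketch grows with the length of the queried word and not with the number of stored forms, which is one reason trie-like structures are a common choice for large inflected-form dictionaries. A form such as "piec" returns more than one entry, so a tagger built on top of it would still need context to choose between them.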