z-logo
open-access-imgOpen Access
Using part-of-speech tags as deep-syntax indicators in determining short-text semantic similarity
Author(s) -
Vuk Batanović,
Dragan Bojić
Publication year - 2014
Publication title -
computer science and information systems
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.244
H-Index - 24
eISSN - 2406-1018
pISSN - 1820-0214
DOI - 10.2298/csis131127082b
Subject(s) - computer science , paraphrase , natural language processing , artificial intelligence , similarity (geometry) , semantic similarity , syntax , parsing , weighting , natural language , natural language understanding , speech recognition , medicine , image (mathematics) , radiology
This paper presents POST STSS, a method of determining short-text semantic similarity in which part-of-speech tags are used as indicators of the deeper syntactic information usually extracted by more advanced tools like parsers and semantic role labelers. Our model employs a part-of-speech weighting scheme and is based on a statistical bag-of-words approach. It does not require either hand-crafted knowledge bases or advanced syntactic tools, which makes it easily applicable to languages with limited natural language processing resources. By using a paraphrase recognition test, we demonstrate that our system achieves a higher accuracy than all existing statistical similarity algorithms and solutions of a more structural kind. [Projekat Ministarstva nauke Republike Srbije, br. TR 32047]

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom