Using part-of-speech tags as deep-syntax indicators in determining short-text semantic similarity | Zendy

Vuk Batanović | Zendy; Dragan Bojić | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Using part-of-speech tags as deep-syntax indicators in determining short-text semantic similarity

Author(s) -

Vuk Batanović,

Dragan Bojić

Publication year - 2014

Publication title -

computer science and information systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.244

H-Index - 24

eISSN - 2406-1018

pISSN - 1820-0214

DOI - 10.2298/csis131127082b

Subject(s) - computer science , paraphrase , natural language processing , artificial intelligence , similarity (geometry) , semantic similarity , syntax , parsing , weighting , natural language , natural language understanding , speech recognition , medicine , image (mathematics) , radiology

This paper presents POST STSS, a method of determining short-text semantic similarity in which part-of-speech tags are used as indicators of the deeper syntactic information usually extracted by more advanced tools like parsers and semantic role labelers. Our model employs a part-of-speech weighting scheme and is based on a statistical bag-of-words approach. It does not require either hand-crafted knowledge bases or advanced syntactic tools, which makes it easily applicable to languages with limited natural language processing resources. By using a paraphrase recognition test, we demonstrate that our system achieves a higher accuracy than all existing statistical similarity algorithms and solutions of a more structural kind. [Projekat Ministarstva nauke Republike Srbije, br. TR 32047]

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research