
Russian-English dataset and comparative analysis of algorithms for cross-language embedding-based entity alignment
Author(s) -
V A Gnezdilova,
Зинаида Владимировна Апанович
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/2099/1/012023
Subject(s) - computer science , embedding , equivalence (formal languages) , identification (biology) , natural language processing , entity linking , knowledge graph , resolution (logic) , artificial intelligence , algorithm , linguistics , knowledge base , philosophy , botany , biology
The problem of data fusion from data bases and knowledge graphs in different languages is becoming increasingly important. The main step of such a fusion is the identification of equivalent entities in different knowledge graphs and merging their descriptions. This problem is known as the identity resolution, or entity alignment problem. Recently, a large group of new entity alignment methods has emerged. They look for the so called “embeddings” of entities and establish the equivalence of entities by comparing their embeddings. This paper presents experiments with embedding-based entity alignment algorithms on a Russian-English dataset. The purpose of this work is to identify language-specific features of the entity alignment algorithms. Also, future directions of research are outlined.