Open Access
A semi-automatic approach for building ontologies from a collection of structured web documents
Author(s) - Mouna Kamel, Nathalie Aussenac-Gilles, Davide Buscaldi, Catherine Comparot
Publication year - 2013
Publication title - HAL (Le Centre pour la Communication Scientifique Directe)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2479832.2479856
Subject(s) - disk formatting, computer science, information retrieval, exploit, annotation, world wide web, semantic web, semantic web stack, artificial intelligence, computer security, operating system
Many collections of structured documents are available on the web. Such a collection generally describes the characteristics of entities of a single type, with each page describing one entity. These documents are adequate knowledge sources for building ontologies. Because they share a strong, common layout, they contain less well-written text than plain-text files, but their structure is highly meaningful. Classical linguistic methods for identifying concepts and relations are no longer appropriate for analyzing them. The approach we propose in this paper exploits various properties of such documents, combining layout and formatting analysis with linguistic analysis and using semantic annotation.
