Premium
P2P schema‐mapping over network‐bound XML data
Author(s) -
Comito Carmela,
Talia Domenico
Publication year - 2010
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.1626
Subject(s) - computer science , xpath , xml , scalability , xquery , data integration , query language , web service , schema (genetic algorithms) , information retrieval , database , xml database , data mining , world wide web
The rise in availability of web‐based data sources has led to new challenges in data integration systems for obtaining decentralized, wide‐scale sharing of data preserving semantics. In this paper, we present a framework for integrating heterogeneous XML data sources distributed over a large‐scale, highly dynamic network of autonomous nodes. We highlight a query reformulation algorithm to combine and query‐distributed XML databases through a decentralized point‐to‐point mediation process among the different data sources by using P2P schema‐mappings. More precisely, our integration model is based on path‐to‐path mappings, using the XPath language. We demonstrate the usefulness and scalability of our ideas and algorithms with a detailed set of experiments. Finally, we discuss our experience implementing the above‐cited query reformulation algorithm as a Web service within the GDIS system, a service‐based Grid architecture. We have evaluated GDIS on several real‐world schemas with promising results. Copyright © 2010 John Wiley & Sons, Ltd.