Distributed data mining patterns and services: an architecture and experiments | Zendy

Cesario Eugenio | Zendy; Talia Domenico | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Distributed data mining patterns and services: an architecture and experiments

Author(s) -

Cesario Eugenio,

Talia Domenico

Publication year - 2012

Publication title -

concurrency and computation: practice and experience

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.309

H-Index - 67

eISSN - 1532-0634

pISSN - 1532-0626

DOI - 10.1002/cpe.1877

Subject(s) - computer science , grid computing , distributed computing , web service , grid , data grid , cluster analysis , data stream mining , distributed knowledge , resource (disambiguation) , distributed computing environment , service oriented architecture , distributed database , data mining , architecture , semantic grid , world wide web , machine learning , artificial intelligence , semantic web , computer network , art , geometry , mathematics , visual arts

SUMMARY Distributed data mining implements techniques for analyzing data on distributed computing systems by exploiting data distribution and parallel algorithms. The grid is a computing infrastructure for implementing distributed high‐performance applications and solving complex problems, offering effective support to the implementation and use of data mining and knowledge discovery systems. The Web Services Resource Framework has become the standard for the implementation of grid services and applications, and it can be exploited for developing high‐level services for distributed data mining applications. This paper describes how distributed data mining patterns, such as collective learning, ensemble learning, and meta‐learning models, can be implemented as Web Services Resource Framework mining services by exploiting the grid infrastructure. The goal of this work was to design a distributed architectural model that can be exploited for different distributed mining patterns deployed as grid services for the analysis of dispersed data sources. In order to validate such an approach, we presented also the implementation of two clustering algorithms on the developed architecture. In particular, the distributed k‐means and distributed expectation maximization were exploited as pilot examples to show the suitability of the implemented service‐oriented framework. An extensive evaluation of its performance was provided. Copyright © 2011 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research