Weighted hybrid clustering by combining text mining and bibliometrics on a large‐scale journal database
Author(s) -
Liu Xinhai,
Yu Shi,
Janssens Frizo,
Glänzel Wolfgang,
Moreau Yves,
De Moor Bart
Publication year - 2010
Publication title -
journal of the american society for information science and technology
Language(s) - English
Resource type - Journals
eISSN - 1532-2890
pISSN - 1532-2882
DOI - 10.1002/asi.21312
Subject(s) - cluster analysis , computer science , data mining , weighting , consensus clustering , correlation clustering , cure data clustering algorithm , document clustering , canopy clustering algorithm , artificial intelligence , medicine , radiology
We propose a new hybrid clustering framework to incorporate text mining with bibliometrics in journal set analysis. The framework integrates two different approaches: clustering ensemble and kernel‐fusion clustering. To improve the flexibility and the efficiency of processing large‐scale data, we propose an information‐based weighting scheme to leverage the effect of multiple data sources in hybrid clustering. Three different algorithms are extended by the proposed weighting scheme and they are employed on a large journal set retrieved from the Web of Science (WoS) database. The clustering performance of the proposed algorithms is systematically evaluated using multiple evaluation methods, and they were cross‐compared with alternative methods. Experimental results demonstrate that the proposed weighted hybrid clustering strategy is superior to other methods in clustering performance and efficiency. The proposed approach also provides a more refined structural mapping of journal sets, which is useful for monitoring and detecting new trends in different scientific fields.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom