
A new methodology for constructing a publication‐level classification system of science
Author(s) -
Waltman Ludo,
Eck Nees Jan
Publication year - 2012
Publication title -
journal of the american society for information science and technology
Language(s) - English
Resource type - Journals
eISSN - 1532-2890
pISSN - 1532-2882
DOI - 10.1002/asi.22748
Subject(s) - computer science , bibliographic coupling , citation , classification scheme , multidisciplinary approach , data science , subject (documents) , transparency (behavior) , information retrieval , citation analysis , simplicity , library classification , data mining , world wide web , sociology , social science , philosophy , computer security , epistemology
Classifying journals or publications into research areas is an essential element of many bibliometric analyses. Classification usually takes place at the level of journals, where the W eb of S cience subject categories are the most popular classification system. However, journal‐level classification systems have two important limitations: They offer only a limited amount of detail, and they have difficulties with multidisciplinary journals. To avoid these limitations, we introduce a new methodology for constructing classification systems at the level of individual publications. In the proposed methodology, publications are clustered into research areas based on citation relations. The methodology is able to deal with very large numbers of publications. We present an application in which a classification system is produced that includes almost 10 million publications. Based on an extensive analysis of this classification system, we discuss the strengths and the limitations of the proposed methodology. Important strengths are the transparency and relative simplicity of the methodology and its fairly modest computing and memory requirements. The main limitation of the methodology is its exclusive reliance on direct citation relations between publications. The accuracy of the methodology can probably be increased by also taking into account other types of relations–for instance, based on bibliographic coupling.