z-logo
open-access-imgOpen Access
Comments on "Researcher bias: The use of machine learning in software defect prediction"
Author(s) -
Chakkrit Tantithamthavorn,
Shane McIntosh,
Ahmed E. Hassan,
Kenichi Matsumoto
Publication year - 2015
Publication title -
ieee transactions on software engineering
Language(s) - English
DOI - 10.7287/peerj.preprints.1260v1
Subject(s) - metric (unit) , computer science , group (periodic table) , construct (python library) , predictive modelling , reuse , association (psychology) , machine learning , selection (genetic algorithm) , artificial intelligence , software , model selection , data mining , data science , econometrics , psychology , mathematics , engineering , operations management , chemistry , organic chemistry , psychotherapist , programming language , waste management
Shepperd et al. find that the reported performance of a defect prediction model shares a strong relationship with the group of researchers who construct the models. In this paper, we perform an alternative investigation of Shepperd et al.'s data. We observe that (a) research group shares a strong association with other explanatory variables (i.e., the dataset and metric families that are used to build a model); (b) the strong association among these explanatory variables makes it difficult to discern the impact of the research group on model performance; and (c) after mitigating the impact of this strong association, we find that the research group has a smaller impact than the metric family. These observations lead us to conclude that the relationship between the research group and the performance of a defect prediction model are more likely due to the tendency of researchers to reuse experimental components (e.g., datasets and metrics). We recommend that researchers experiment with a broader selection of datasets and metrics to combat any potential bias in their results.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom