SIMCA Modeling for Overlapping Classes: Fixed or Optimized Decision Threshold?
Author(s) -
Raffaele Vitale,
Federico Marini,
Cyril Ruckebusch
Publication year - 2018
Publication title -
analytical chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.117
H-Index - 332
eISSN - 1520-6882
pISSN - 0003-2700
DOI - 10.1021/acs.analchem.8b01270
Subject(s) - a priori and a posteriori , robustness (evolution) , receiver operating characteristic , analogy , class (philosophy) , artificial intelligence , algorithm , pattern recognition (psychology) , statistics , chemistry , mathematics , computer science , philosophy , biochemistry , linguistics , epistemology , gene
An approach exploiting the principles of Receiver Operating Characteristic (ROC) curves for the simultaneous optimization of both the complexity and the decision threshold in Soft Independent Modeling of Class Analogy (SIMCA) classification models is here proposed. The outcomes resulting from the analysis of two simulated and four real case-studies highlight that, in the presence of strong overlap among various categories of samples, the implemented method can lead to better classification efficiency in external validation, compared to fixing such a threshold a priori. This guarantees a higher robustness toward class dispersion. On the other hand, in cases of clearer and more definite separation among the different groups of observations, their classification performance is equally satisfactory for test samples.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom