Rademacher complexity of margin multi-category classifiers
Author(s) -
Yann Guermeur
Publication year - 2018
Publication title -
neural computing and applications
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.713
H-Index - 80
eISSN - 1433-3058
pISSN - 0941-0643
DOI - 10.1007/s00521-018-3873-7
Subject(s) - infimum and supremum , margin (machine learning) , measure (data warehouse) , mathematics , characterization (materials science) , class (philosophy) , interval (graph theory) , learnability , discriminative model , point (geometry) , scale (ratio) , function (biology) , computer science , mathematical optimization , artificial intelligence , machine learning , discrete mathematics , combinatorics , data mining , materials science , geometry , physics , quantum mechanics , evolutionary biology , biology , nanotechnology
One of the main open problems of the theory of margin multi-category pattern classification is the characterization of the optimal dependence of the confidence interval of a guaranteed risk on the three basic parameters which are the sample size m, the number C of categories and the scale parameter$$\gamma$$. This is especially the case when working under minimal learnability hypotheses. The starting point is a basic supremum inequality whose capacity measure depends on the choice of the margin loss function. Then, transitions are made, from capacity measure to capacity measure. At some level, a structural result performs the transition from the multi-class case to the bi-class one. In this article, we highlight the advantages and drawbacks inherent to the three major options for this decomposition: using Rademacher complexities, covering numbers or scale-sensitive combinatorial dimensions.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom