How Good Is Crude MDL for Solving the Bias-Variance Dilemma? An Empirical Investigation Based on Bayesian Networks

Nicandro Cruz-Ramírez, Héctor-Gabriel Acosta-Mesa, Efrén Mezura-Montes, Alejandro Guerra-Hernández, Guillermo de Jesús Hoyos-Rivera, Rocío Erandi Barrientos-Martínez, Karina Gutiérrez-Fragoso, Luis Alonso Nava-Fernández, Patricia González-Gaspar, Elva María Novoa-del-Toro, Vicente Josué Aguilera-Rueda, María Yaneli Ameca-Alducin

Open Access

Publication year - 2014

Publication title - PLOS ONE

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.99

H-Index - 332

ISSN - 1932-6203

DOI - 10.1371/journal.pone.0092866

Subject(s) - overfitting, minimum description length, akaike information criterion, bayesian information criterion, computer science, variance (accounting), machine learning, bayesian probability, model selection, artificial intelligence, goodness of fit, econometrics, mathematics, artificial neural network, accounting, business

The bias-variance dilemma is a well-known and important problem in Machine Learning. It relates the generalization capability (goodness of fit) of a learning method to its complexity. When enough data are at hand, they can be used in a way that minimizes overfitting (the risk of selecting a complex model that generalizes poorly). Unfortunately, in many situations this amount of data is simply not available. We therefore need methods that exploit the available data efficiently while avoiding overfitting. Different metrics have been proposed to achieve this goal: the Minimum Description Length principle (MDL), Akaike’s Information Criterion (AIC) and the Bayesian Information Criterion (BIC), among others. In this paper, we focus on crude MDL and empirically evaluate its performance in selecting models with a good balance between goodness of fit and complexity: the so-called bias-variance dilemma, decomposition or tradeoff. Although the graphical interaction between these dimensions (bias and variance) is ubiquitous in the Machine Learning literature, few works present experimental evidence that recovers this interaction. We argue that the graphs resulting from our experiments allow us to gain insights that are difficult to unveil otherwise: crude MDL naturally selects models that are balanced in terms of bias and variance, which need not be the gold-standard ones. We carry out these experiments using a specific model: a Bayesian network. Despite these motivating results, we should not overlook three other components that may significantly affect the final model selection: the search procedure, the noise rate and the sample size.
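To make the fit-versus-complexity balance concrete, here is a minimal Python sketch that scores a discrete Bayesian network structure. It assumes one common two-part form of crude MDL, -log L + (k/2) log n, and the AIC form -log L + k, where L is the maximized likelihood, k the number of free parameters and n the sample size; the abstract does not state the exact formulas used in the paper, so treat these as assumptions. In this form the crude MDL score coincides with the usual BIC penalty. The function names and the toy data are hypothetical and purely illustrative.

```python
import math
from collections import Counter


def log_likelihood(data, parents):
    """Maximized log-likelihood (natural log) of discrete data under a DAG.

    data:    list of dicts mapping variable name -> observed value
    parents: dict mapping variable name -> tuple of parent names
    """
    ll = 0.0
    for var, pa in parents.items():
        # Counts of (parent configuration, value) and of parent configuration alone.
        joint = Counter((tuple(row[p] for p in pa), row[var]) for row in data)
        marginal = Counter(tuple(row[p] for p in pa) for row in data)
        for (pa_vals, _), n_xy in joint.items():
            ll += n_xy * math.log(n_xy / marginal[pa_vals])
    return ll


def num_parameters(data, parents):
    """Free parameters: (cardinality - 1) per parent configuration, per variable."""
    card = {v: len({row[v] for row in data}) for v in parents}
    k = 0
    for var, pa in parents.items():
        q = math.prod(card[p] for p in pa)  # product over no parents is 1
        k += q * (card[var] - 1)
    return k


def scores(data, parents):
    """Return fit and penalized scores; lower AIC / crude MDL is better."""
    n = len(data)
    ll = log_likelihood(data, parents)
    k = num_parameters(data, parents)
    return {
        "log_likelihood": ll,
        "k": k,
        "aic": -ll + k,                            # assumed AIC form
        "crude_mdl": -ll + 0.5 * k * math.log(n),  # assumed crude MDL form
    }


# Two candidate structures over binary variables X and Y.
empty_graph = {"X": (), "Y": ()}   # X and Y independent
edge_graph = {"X": (), "Y": ("X",)}  # X -> Y

# Toy data 1: Y always equals X, so the edge is worth its extra parameter.
dependent = [{"X": v, "Y": v} for v in (0, 1) for _ in range(4)]

# Toy data 2: all four combinations equally often, so the edge buys no fit.
independent = [{"X": x, "Y": y} for x in (0, 1) for y in (0, 1) for _ in range(2)]

for name, data in (("dependent", dependent), ("independent", independent)):
    for label, graph in (("empty", empty_graph), ("X->Y", edge_graph)):
        print(name, label, scores(data, graph))
```

On the dependent data the denser structure fits so much better that it wins despite its larger penalty; on the independent data both structures fit equally well, so the penalty term makes the simpler one preferable. That is the balance the abstract describes, shown here only for a two-variable toy case and without any search procedure.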
