Mismatched Training and Test Distributions Can Outperform Matched Ones | Zendy

Carlos Roberto González | Zendy; Yaser S. AbuMostafa | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Mismatched Training and Test Distributions Can Outperform Matched Ones

Author(s) -

Carlos Roberto González,

Yaser S. AbuMostafa

Publication year - 2014

Publication title -

neural computation

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.235

H-Index - 169

eISSN - 1530-888X

pISSN - 0899-7667

DOI - 10.1162/neco_a_00697

Subject(s) - matching (statistics) , test (biology) , machine learning , artificial intelligence , computer science , training (meteorology) , sample (material) , distribution (mathematics) , statistical hypothesis testing , mathematics , pattern recognition (psychology) , statistics , paleontology , physics , chemistry , chromatography , meteorology , biology , mathematical analysis

In learning theory, the training and test sets are assumed to be drawn from the same probability distribution. This assumption is also followed in practical situations, where matching the training and test distributions is considered desirable. Contrary to conventional wisdom, we show that mismatched training and test distributions in supervised learning can in fact outperform matched distributions in terms of the bottom line, the out-of-sample performance, independent of the target function in question. This surprising result has theoretical and algorithmic ramifications that we discuss.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research