z-logo
open-access-imgOpen Access
Optimal learning for sequential sampling with non-parametric beliefs
Author(s) -
Emre Barut,
Warren B. Powell
Publication year - 2013
Publication title -
journal of global optimization
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.861
H-Index - 86
eISSN - 1573-2916
pISSN - 0925-5001
DOI - 10.1007/s10898-013-0050-5
Subject(s) - mathematics , weighting , estimator , kernel (algebra) , mathematical optimization , variable kernel density estimation , parametric statistics , kernel smoother , kernel method , statistics , radial basis function kernel , computer science , artificial intelligence , support vector machine , medicine , radiology , combinatorics
We propose a sequential learning policy for ranking and selection problems, where we use a non-parametric procedure for estimating the value of a policy. Our estimation approach aggregates over a set of kernel functions in order to achieve a more consistent estimator. Each element in the kernel estimation set uses a different bandwidth to achieve better aggregation. The final estimate uses a weighting scheme with the inverse mean square errors of the kernel estimators as weights. This weighting scheme is shown to be optimal under independent kernel estimators. For choosing the measurement, we employ the knowledge gradient policy that relies on predictive distributions to calculate the optimal sampling point. Our method allows a setting where the beliefs are expected to be correlated but the correlation structure is unknown beforehand. Moreover, the proposed policy is shown to be asymptotically optimal.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom