upper-confidence-bound

Views Navigation

Event Views Navigation

This Month

Latest Past Events

Ruihao Zhu — Hedging the Drift: Learning to Optimize under Non-Stationarity

George B. Dantzig Auditorium - 1174 Etcheverry Hall Etcheverry Hall, Berkeley

Abstract: We introduce general data-driven decision-making algorithms that achieve state-of-the-art dynamic regret bounds for non-stationary bandit settings. They capture applications such as advertisement allocation and dynamic pricing in changing environments. We show how the difficulty posed by the (unknown a priori and possibly adversarial) non-stationarity can be overcome by an unconventional marriage between stochastic and…

Ruihao Zhu — Hedging the Drift: Learning to Optimize under Non-Stationarity

George B. Dantzig Auditorium - 1174 Etcheverry Hall Etcheverry Hall, Berkeley

Abstract: We introduce general data-driven decision-making algorithms that achieve state-of-the-art dynamic regret bounds for non-stationary bandit settings. They capture applications such as advertisement allocation and dynamic pricing in changing environments. We show how the difficulty posed by the (unknown a priori and possibly adversarial) non-stationarity can be overcome by an unconventional marriage between stochastic and…