2018/10/14 14:29

要旨

Talk by Dr. Odalric-Ambrym Maillard, INRIA, France.

Title: Multi-armed bandits and Boundary Crossing Probabilities

Abstract: In this talk, we will focus on the stochastic multi-armed bandit problem. After providing some short historical overview of the field, we will focus on its relations with boundary crossing probabilities.
We will present in particular finite-time boundary crossing probabilites valid for exponential families of arbitrary dimension K, contrasting earlier attempts valid only forthe dimension K=1. Perhaps surprisingly, we highlight that the proof techniques to achieve these strong results already existed three decades ago in the work of T.L. Lai, and were apparently forgotten in the bandit community. We provide a modern rewriting of these beautiful techniques that we believe are useful beyond the application to stochastic multi-armed bandit.

詳細情報

日時 2018/11/08(木) 09:30 - 11:00
URL https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/81564

場所

〒103-0027 東京都中央区日本橋1-4-1 日本橋一丁目三井ビルディング 15階(Google Maps)