High-dimensional Statistical Modeling Team Seminar (Talk by Dorian Baudry (CNRS/INRIA))

June 21, 2022 19:02

Abstract

Speaker: Dorian Baudry (CNRS/INRIA)

Title: Optimal Thompson Sampling Strategies for Support-Aware CVaR Bandits

Abstract:
In this presentation we will introduce a multi-armed bandit algorithm proposed in Baudry et al.
(2021). A multi-armed bandit is a sequential decision-making problem in which, at each time step,
a learner (1) selects an action, (2) observes a reward corresponding to this action, and (3)
updates her policy for choosing future actions in order to maximize the expected sum of rewards. The
main difficulty is to find a strategy that strikes the right balance between exploration and
exploitation. Motivated by an application of bandits in agriculture, we consider a risk-aware
variant of this problem in which the quality of each action is evaluated by its Conditional Value at
Risk (CVaR) at a given quantile level of the reward distribution. After describing the problem and
illustrating its potential applications in agriculture in the first part of the talk, we will
introduce the Bounded CVaR Thompson Sampling algorithm (B-CVTS), which we prove to be the first
asymptotically optimal algorithm for CVaR bandits for distributions with bounded support. We will
then present the main theorems and elements of analysis from the paper. Finally, we will
discuss the experiments we ran using the Decision Support System for Agrotechnology
Transfer (DSSAT), which illustrate empirically the benefit of Thompson Sampling approaches in a
realistic environment simulating an agricultural use case.
Link to the article: https://proceedings.mlr.press/v139/baudry21a.html
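
For readers unfamiliar with the setting, the select/observe/update loop and the CVaR criterion described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: it is a simplified reading of a Dirichlet-reweighting Thompson Sampling scheme for bounded rewards, and the function names (cvar_discrete, bcvts_sketch), the toy arms, and the parameter values are all assumptions made for illustration. CVaR is taken here as the average of the worst alpha-fraction of the reward mass. The precise algorithm and its guarantees are in the linked article.

```python
import numpy as np


def cvar_discrete(values, weights, alpha):
    """Lower-tail CVaR at level alpha of a discrete reward distribution.

    Computes the average reward over the worst alpha-fraction of the
    probability mass. `weights` is assumed to be non-negative and sum to 1.
    """
    values = np.asarray(values, dtype=float)
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(values)
    v, w = values[order], weights[order]
    # Probability mass strictly below each atom (atoms sorted by value).
    mass_before = np.cumsum(w) - w
    # Portion of each atom's mass that lies inside the lowest alpha tail.
    tail_mass = np.clip(alpha - mass_before, 0.0, w)
    return float(np.dot(v, tail_mass) / alpha)


def bcvts_sketch(arms, horizon, alpha, upper_bound, rng):
    """Illustrative Thompson-Sampling-style loop for CVaR bandits.

    `arms` is a list of callables rng -> reward, with rewards assumed to be
    at most `upper_bound`. Returns the number of pulls of each arm.
    """
    histories = [[] for _ in arms]
    pulls = np.zeros(len(arms), dtype=int)
    for _ in range(horizon):
        indices = []
        for history in histories:
            # Observed rewards plus the known upper bound of the support.
            support = np.append(history, upper_bound)
            # Random re-weighting of the augmented history.
            w = rng.dirichlet(np.ones(len(support)))
            indices.append(cvar_discrete(support, w, alpha))
        k = int(np.argmax(indices))       # (1) select an action
        reward = arms[k](rng)             # (2) observe its reward
        histories[k].append(reward)       # (3) update the history
        pulls[k] += 1
    return pulls


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Two hypothetical arms with the same mean (0.5) but different risk.
    toy_arms = [
        lambda g: g.uniform(0.0, 1.0),   # high variance, low CVaR
        lambda g: g.uniform(0.4, 0.6),   # low variance, higher CVaR
    ]
    print(bcvts_sketch(toy_arms, horizon=500, alpha=0.2,
                       upper_bound=1.0, rng=rng))
```

In this toy example both arms have the same expected reward, so a mean-based criterion cannot separate them, while the CVaR criterion favors the arm with the lighter lower tail.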

More Information

Date	July 4, 2022 (Mon) 15:00 - 16:00
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/138914

Related Laboratories

Last updated on June 19, 2025 14:26

Laboratory

High-Dimensional Statistical Modeling Team (2017/3--2022/3)

