[Rescheduled]Approximate Bayesian Inference Team Seminar (Talk by Karim Lounici, Ecole Polytechnique, visiting researcher at Keio University)

September 25, 2022 21:33

Abstract

Title: Meta-Learning Representations with Contextual Linear Bandits

Abstract:
Abstract: Meta-learning seeks to build algorithms that rapidly learn how to solve new learning problems based on previous experience. In this paper we investigate meta-learning in the setting of stochastic linear bandit tasks. We assume that the tasks share a low dimensional representation, which has been partially acquired from previous learning tasks. We aim to leverage this information in order to learn a new downstream bandit task, which shares the same representation. Our principal contribution is to show that if the learned representation estimates well the unknown one, then the downstream task can be efficiently learned by a greedy policy that we propose in this work. We derive an upper bound on the regret of this policy, which is, up to logarithmic factors, of order $sqrt{rN}(1vee sqrt{d/T})$, where $N$ is the horizon of the downstream task, $T$ is the number of training tasks, $d$ the ambient dimension and $r ll d$ the dimension of the representation. We highlight that our strategy does not need to know $r$. We note that if $T> d$ our bound achieves the same rate of optimal minimax bandit algorithms using the true underlying representation (up to a logarithmic term).
This is a joint work with Leonardo Cella and Massimiliano Pontil.

All participants are required to agree with the AIP Seminar Series Code of Conduct.
Please see the URL below.
https://aip.riken.jp/event-list/termsofparticipation/?lang=en

RIKEN AIP will expect adherence to this code throughout the event. We expect cooperation from all participants to help ensure a safe environment for everybody.

More Information

Date	October 4, 2022 (Tue) 16:00 - 17:00
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/143910

Related Laboratories

last updated on July 18, 2025 11:11Laboratory

Adaptive Bayesian Intelligence Team

Sunday	Monday	Tuesday	Wednesday	Thursday	Friday	Saturday
		Link to the event page for the 1st	Link to the event page for the 2nd	Link to the event page for the 3rd	Link to the event page for the 4th	5th
6th	7th	8th	Link to the event page for the 9th	Link to the event page for the 10th	11th	12th
13th	14th	Link to the event page for the 15th	Link to the event page for the 16th	Link to the event page for the 17th	18th	19th
20th	21th	22th	23th	24th	25th	26th
27th	28th	29th	30th	31th

Center for Advanced Intelligence Project

Events