Talk by Daniele Calandriello (DeepMind, Paris) on "Scalable Determinantal Point Processes for Machine Learning"

2021/3/11 15:47

要旨

Title: Scalable Determinantal Point Processes for Machine Learning

Abstract:
Determinantal point processes (DPPs) are a useful probabilistic model for selecting a small diverse subset out of a large collection of items, with applications in summarization, stochastic optimization, active learning and more.
In this talk, we will give a self contained introduction to DPP sampling and applications, as well as recent results on scaling these methods to millions of point.
Given a kernel function and a subset size , our goal is to sample out of items with probability proportional to the determinant of the kernel matrix induced by the subset (aka DPP). Existing DPP sampling algorithms require an expensive preprocessing step which involves multiple passes over all items, making it infeasible for large datasets. A naïve heuristic addressing this problem is to uniformly subsample a fraction of the data and perform DPP sampling only on those items, however this method offers no guarantee that the produced sample will even approximately resemble the target distribution over the original dataset. In this paper, we develop an algorithm which adaptively builds a sufficiently large uniform sample of data that is then used to efficiently generate a smaller set of items, while ensuring that this set is drawn exactly from the target distribution defined on all items. We show empirically that our algorithm produces a DPP sample after observing only a small fraction of all elements, leading to several orders of magnitude faster performance compared to the state-of-the-art.

Bio:
Daniele Calandriello is a Research Scientist at DeepMind Paris, where he works on scalable machine learning for sequential decision making.
He received his PhD in 2017 from INRIA Lille under the supervision of Michal Valko and Alessandro Lazaric, and his dissertation on efficient sequential learning won the french AI association award for best doctoral thesis. Prior to joining DeepMind PostDoc he worked on scalable non-parametric learning and sequential decision making at the Italian Institute of Technology as a member of Lorenzo Rosasco’s LCSL Lab, and on safety and efficiency in reinforcement learning as an undegraduate in Marcello Restelli’s group at Politecnico di Milano.
His research focuses on adaptive dimensionality reduction techniques using randomized subsampling and sketching. These techniques have been successfully applied (2014-2018) to optimization of noisy function, learning on graphs, clustering and supervised regression. His recent interest (2018-present) is to transfer some of these adaptive randomization techniques to experimental design and reinforcement learning.

詳細情報

日時	2021/03/16(火) 17:30 - 19:00
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/119524

日曜日	月曜日	火曜日	水曜日	木曜日	金曜日	土曜日
						1日
2日	3日	4日	5日	6日	7日	8日
9日	10日のイベントページへのリンク	11日のイベントページへのリンク	12日	13日	14日	15日
16日	17日のイベントページへのリンク	18日	19日のイベントページへのリンク	20日	21日	22日
23日	24日	25日	26日	27日	28日	29日
30日

革新知能統合研究センター

イベント