Talk by Mr. Shaojie Bai (CMU)

November 27, 2019 09:15

Abstract

Speaker: Shaojie Bai (Carnegie Mellon University)
https://jerrybai1995.github.io/

Title: Deep Equilibrium Models: One “Implicit” Layer is All You Need (NeurIPS 2019, spotlight oral)

Abstract: Deep learning has long focused on hierarchies of representations, which are usually learned better by adding layers (i.e., depth) to increase both a model’s complexity and its expressivity. In this work, we revisit and argue for an alternative perspective, in which we define only one layer whose output is specified implicitly. We show how this one-layer model is equivalent to an infinite-depth model, and how it reshapes our view of deep learning through the concepts of equilibria and dynamical systems. Specifically, we introduce the deep equilibrium (DEQ) model, and discuss how we can 1) solve for this implicit-depth model’s equilibria directly via (black-box) quasi-Newton methods; 2) backpropagate directly from these equilibria with O(1) memory (whereas typical deep networks need O(L) memory for L layers); and 3) theoretically analyze the universality of the representational power of the DEQ model (i.e., prove that “one layer” is really all you need). Finally, we demonstrate that the DEQ approach is not predicated on any particular architectural choice, and that it scales to large, realistic, high-dimensional sequence tasks with results on par with (or better than) SOTA architectures (e.g., Transformers), despite using only a single layer and vastly improving memory efficiency (by up to 88%). This work is based on the NeurIPS 2019 paper “Deep Equilibrium Models”.
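
Illustration (not part of the speaker's abstract): the two core ideas above, solving for an equilibrium and differentiating through it without storing intermediate states, can be sketched on a scalar toy problem. Everything in this sketch is an assumption made for illustration: the layer f(z, x) = tanh(w*z + u*x + b), the parameter values, and the function names are hypothetical, and plain fixed-point iteration stands in for the quasi-Newton solver mentioned in the abstract.

```python
import math


def f(z, x, w=0.5, u=1.0, b=0.1):
    """One 'layer': a scalar toy stand-in for the DEQ transformation f(z, x)."""
    return math.tanh(w * z + u * x + b)


def solve_equilibrium(x, tol=1e-12, max_iter=1000):
    """Find z* with z* = f(z*, x) by plain fixed-point iteration.

    This converges here because |w| < 1 and tanh is 1-Lipschitz, so f is a
    contraction in z. A quasi-Newton root finder would play this role in
    the setting the abstract describes.
    """
    z = 0.0
    for _ in range(max_iter):
        z_next = f(z, x)
        if abs(z_next - z) < tol:
            return z_next
        z = z_next
    return z


def implicit_grad_x(z_star, w=0.5, u=1.0):
    """dz*/dx via the implicit function theorem, using only z* itself.

    Differentiating z* = f(z*, x) gives dz*/dx = (df/dx) / (1 - df/dz),
    so none of the solver's intermediate iterates need to be kept.
    """
    d = 1.0 - z_star ** 2  # tanh'(a) evaluated at the equilibrium
    return (d * u) / (1.0 - d * w)


x = 0.3
z_star = solve_equilibrium(x)
grad = implicit_grad_x(z_star)

# Sanity check: central finite difference through the solver itself.
eps = 1e-6
fd = (solve_equilibrium(x + eps) - solve_equilibrium(x - eps)) / (2 * eps)
print(z_star, grad, fd)
```

The gradient depends only on the equilibrium value, which is the source of the constant-memory backward pass: the cost does not grow with the number of solver iterations, in contrast to storing one activation per layer.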

More Information

Date	November 29, 2019 (Fri) 15:45 - 16:30
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/100990

Venue

Artificial Intelligence Research Unit, Graduate School of Informatics, Kyoto University, Yoshida Honmachi, Sakyo-ku, Kyoto, 606-8501, Japan


Center for Advanced Intelligence Project
