[Online] Talk event: Transfer Learning between Largely Different Distributions

October 8, 2020 08:50

Abstract

This is an online event. Registration is required.
https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/112855

In this event, we will have three consecutive talks by three Ph.D. students on the recent advances in transfer learning between largely different distributions.
(The talk order may change)

Talk 1

Speaker: Dimitris Tsipras (MIT) http://people.csail.mit.edu/tsipras/

Title: BREEDS: Benchmarks for Subpopulation Shift

Abstract: How do machine learning models perform when faced with unseen data subpopulations?

In this work, we present a general methodology for assessing model robustness to subpopulation shift. Our approach leverages the class structure underlying existing datasets to control the data subpopulations that comprise the training and test distributions. This enables us to synthesize realistic distribution shifts whose sources can be precisely controlled and characterized, within existing large-scale datasets. We apply this methodology to the ImageNet dataset, creating a suite of subpopulation shift benchmarks that we then use to measure the sensitivity of standard model architectures as well as the effectiveness of off-the-shelf train-time robustness interventions.

Joint work with Shibani Santurkar and Aleksander Madry.

Talk 2

Speaker: Ananya Kumar (Stanford University) https://ananyakumar.wordpress.com/

Title: Understanding Self-Training for Gradual Domain Adaptation

Abstract:
How can we adapt to test distributions that are very different from training examples in a principled way?

Traditional domain adaptation is only guaranteed to work when the distribution shift is small; empirical methods combine several heuristics for larger shifts but can be dataset specific. In many real applications like self-driving cars, brain-machine interfaces, and sensor networks, the domain shift does not happen at one time, but happens gradually. We consider gradual domain adaptation, where the goal is to adapt an initial classifier trained on a source domain given only unlabeled data that shifts gradually in distribution towards a target domain. We prove the first non-vacuous upper bound on the error of self-training with gradual shifts, under settings where directly adapting to the target domain can result in unbounded error. The theoretical analysis leads to algorithmic insights, highlighting that regularization and label sharpening are essential even when we have infinite data. This leads to higher accuracies on a rotating MNIST dataset, a forest Cover Type dataset, and a Portraits dataset.

Joint work with Percy Liang and Tengyu Ma.

Talk 3

Speaker: Takeshi Teshima (UTokyo) https://takeshi-teshima.github.io

Title: Few-shot Domain Adaptation by Causal Mechanism Transfer

Abstract: How can we transfer knowledge across different data distributions when they share a common data generating process?

We study few-shot supervised domain adaptation (DA) for regression problems, where only a few labeled target domain data and many labeled source domain data are available. Many of the current DA methods base their transfer assumptions on either parametrized distribution shift or apparent distribution similarities, e.g., identical conditionals or small distributional discrepancies. However, these assumptions may preclude the possibility of adaptation from intricately shifted and apparently very different distributions. To overcome this problem, we propose mechanism transfer, a metadistributional scenario in which a data generating mechanism is invariant across domains. This transfer assumption can accommodate nonparametric shifts resulting in apparently different distributions while providing a solid statistical basis for DA. We take the structural equations in causal modeling as an example and propose a novel DA method, which is shown to be useful both theoretically and experimentally. Our method can be seen as the first attempt to fully leverage the structural causal models for DA.

Joint work with Issei Sato and Masashi Sugiyama.

More Information

Date	October 22, 2020 (Thu) 09:00 - 11:00
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/112855

Sunday	Monday	Tuesday	Wednesday	Thursday	Friday	Saturday
		Link to the event page for the 1st	Link to the event page for the 2nd	Link to the event page for the 3rd	Link to the event page for the 4th	5th
6th	7th	8th	Link to the event page for the 9th	Link to the event page for the 10th	11th	12th
13th	14th	15th	Link to the event page for the 16th	17th	18th	19th
20th	21th	22th	23th	24th	25th	26th
27th	28th	29th	30th	31th

Center for Advanced Intelligence Project

Events