TrustML Young Scientist Seminar #51 20230130 Talk by Olivia Wiles (DeepMind)

March 2, 2023 09:41

Description

The 51st Seminar
Date and Time: January 30th 5:00 pm – 6:00 pm (JST)
Venue: Zoom webinar
Language: English

Speaker: Olivia Wiles (DeepMind)
Title: Rigorous evaluation of machine learning models
Short Abstract:
Despite achieving super-human accuracy on benchmarks like ImageNet, machine learning models are still susceptible to a number of issues leading to poor performance in the real world. For example, models are prone to shortcut learning and use spurious correlations, leading to poor performance under distribution shift. I will present two works we have done to expose the fragility of machine learning models. The first work introduces a framework to define different types of distribution shift and evaluates how methods degrade under varying amounts and types of distribution shift. Then we demonstrate how we can go beyond requiring specific datasets to investigate shifts. Instead, we surface human-interpretable failures in vision models automatically in an open-ended manner. These works are steps along the path to building comprehensive evaluation tools for reliable AI.

Bio:
Olivia Wiles is a Senior Researcher at DeepMind working on robustness in machine learning, focussing on how to detect and mitigate failures arising from spurious correlation and distribution shift. Prior to this, she was a PhD student at Oxford with Andrew Zisserman studying self-supervised representations for 3D and spent a summer at FAIR working on view synthesis with Justin Johnson, Georgia Gkioxari and Rick Szeliski.

Center for Advanced Intelligence Project
