[The 83rd TrustML Young Scientist Seminar] Talk by Pang Wei Koh (University of Washington) "Reliable data use: Synthesis, retrieval, and interaction"

August 7, 2024 17:53

Abstract

Date and Time:
September 2, 2024: 10:30 am – 12:00 pm (JST)
Venue: Online and Meeting Room 1 at the RIKEN AIP Nihonbashi office*
*Meeting Room 1 is available to AIP researchers only

Title:
Reliable data use: Synthesis, retrieval, and interaction

Speaker:
Pang Wei Koh (Assistant Professor, University of Washington)

Abstract:
How can we better use our data to build more reliable and responsible models? I will first discuss when it might be useful to train on synthetic image data derived, in turn, from a generative model trained on the available real data. Next, I will describe how scaling up the datastore for retrieval-based language models can significantly improve performance, indicating that the amount of data used at inference time—and not just at training time—should be considered as a new dimension of scaling language models. Finally, I will discuss how the static nature of most of our training data leads to language model failures in interactive settings.

Bio:
Pang Wei Koh is an assistant professor in the Allen School of Computer Science and Engineering at the University of Washington, a visiting research scientist at AI2, and a Singapore AI Visiting Professor. His research interests are in the theory and practice of building reliable machine learning systems. His research has been published in Nature and Cell, featured in media outlets such as The New York Times and The Washington Post, and recognized by the MIT Technology Review Innovators Under 35 Asia Pacific award and best paper awards at ICML and KDD. He received his PhD and BS in Computer Science from Stanford University. Prior to his PhD, he was the 3rd employee and Director of Partnerships at Coursera.

More Information

Date	September 2, 2024 (Mon) 10:30 - 12:00
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/176311

Related Laboratories

Last updated on June 12, 2025 11:09

Laboratory

Imperfect Information Learning Team

