[The 86th TrustML Young Scientist Seminar] Talk by Yuta Saito (Cornell University) "意思決定システムのためのデータに基づく報酬関数設計"

December 18, 2024 11:11

Abstract

Date and Time:
December 19, 2024: 11:00 – 12:00 (JST)
Venue: Online and Open Space at the RIKEN AIP Nihonbashi office*
*Open Space is available to AIP researchers only

Title: 意思決定システムのためのデータに基づく報酬関数設計
*The talk will be in Japanese.

Speaker: Yuta Saito (コーネル大学)

Abstract:
言語モデルや推薦モデルなどの意思決定モデルの学習を行うためにはまず、何らかの報酬関数を定める必要がある。報酬関数はモデルの学習が進む方向を決定付ける重要な関数であり、この設計を誤ってしまうと、ユーザー満足や収益などの改善につながらない問題を解き続けてしまうことになりかねない。本発表では、データに基づいて報酬関数を設計する方法論と実例を紹介する。特に、誰もが知るプラットフォームで行われた大規模実験の結果をもとに、報酬関数設計の重要性や報酬関数すらも個別化してしまう新たなアイデアの威力を示す。

Bio:
2021年3月に東京工業大学(当時)にて学士号を取得。在学中から、反実仮想学習の推薦システムへの応用に関する共同研究・社会実装に取り組む。2021年8月からは米コーネル大において反実仮想学習に関する研究を継続し、国内外の企業と連携しながら国際会議で論文を多数発表。IBIS2024では、関連領域に関するチュートリアルを行う。著書に『施策デザインのための機械学習入門』『反実仮想機械学習』

More Information

Date	December 19, 2024 (Thu) 11:00 - 12:00
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/180664

Related Laboratories

last updated on April 15, 2025 10:11Laboratory

Sequential Decision Making Team

Sunday	Monday	Tuesday	Wednesday	Thursday	Friday	Saturday
		Link to the event page for the 1st	Link to the event page for the 2nd	Link to the event page for the 3rd	Link to the event page for the 4th	5th
6th	7th	8th	Link to the event page for the 9th	Link to the event page for the 10th	11th	12th
13th	14th	15th	Link to the event page for the 16th	17th	18th	19th
20th	21th	22th	23th	24th	25th	26th
27th	28th	29th	30th	31th

Center for Advanced Intelligence Project

Events