Talk by Dr. Tal Linzen: How well do neural NLP systems generalize?

2019/5/9 17:13

Summary

Speaker: Tal Linzen, Assistant Professor, Johns Hopkins

Title: How well do neural NLP systems generalize?

Abstract:
Neural networks have rapidly become central to NLP systems. While such systems perform well on typical test set examples, their generalization abilities are often poorly understood. In this talk, I will demonstrate how experimental paradigms from psycholinguistics can help us characterize the gaps between the abilities of neural systems and those of humans, by focusing on interpretable axes of generalization from the training set rather than on average test set performance. I will show that recurrent neural network (RNN) language models are able to process syntactic dependencies in typical sentences with considerable success, but when evaluated on more complex syntactically controlled materials, their error rate increases sharply. Likewise, neural systems trained to perform natural language inference generalize much more poorly than their test set performance would suggest. Finally, I will discuss a novel method for measuring compositionality in neural network representations; using this method, we show that the sentence representations acquired by neural natural language inference systems are not fully compositional, in line with their limited generalization abilities.

Bio:
Tal Linzen is an Assistant Professor of Cognitive Science and Computer Science at Johns Hopkins University. Before moving to Johns Hopkins in 2017, he was a postdoctoral researcher at the École Normale Supérieure in Paris, where he worked with Emmanuel Dupoux and Benjamin Spector; before that he obtained his PhD from the Department of Linguistics at New York University in 2015, under the supervision of Alec Marantz. At JHU, Dr. Linzen directs the Computation and Psycholinguistics Lab; the lab develops computational models of human language comprehension and acquisition, as well as methods for interpreting, evaluating and extending neural network models for natural language processing.

Details

Date and time	2019/05/24 (Fri) 16:00 - 17:30
URL	https://c5dc59ed978213830355fc8978.doorkeeper.jp/events/91430

Venue

15F, Nihonbashi 1-chome Mitsui Building, 1-4-1 Nihonbashi, Chuo-ku, Tokyo 103-0027

Center for Advanced Intelligence Project
