Interpreting, Training, and Distilling Seq2Seq Models

Speaker: Alexander Rush , Harvard University

Date: Tuesday, November 08, 2016

Time: 4:00 PM to 5:00 PM Note: all times are in the Eastern Time Zone

Public: Yes

Location: 32-G882 (Stata Center - Hewlett Room)

Event Type:

Room Description:

Host: Jim Glass, MIT CSAIL

Contact: Marcia G. Davidson, 617-253-3049, marcia@csail.mit.edu

Relevant URL:

Speaker URL: None

Speaker Photo:
None

Reminders to: seminars@csail.mit.edu, sls-seminars@csail.mit.edu

Reminder Subject: TALK: Interpreting, Training, and Distilling Seq2Seq Models

Deep Sequence-to-sequence models have rapidly become an indispensable general-purpose tool for many applications in natural language processing, such as machine translation, summarization, and dialogue. Many problems that once required careful domain-specific engineering can now be tackled using off-the-shelf systems by interested tinkerers. However, even with the evident early success of these models, the seq2seq framework itself is still relatively unexplored. In this talk, I will discuss three questions we have been studying in the area of sequence-to-sequence NLP: (1) Can we interpret seq2seq's learned representations? [Strobelt et al, 2016], (2) How should a seq2seq model be trained? [Wiseman and Rush, 2016], (3) How many parameters are necessary for the models to work? [Kim and Rush, 2016]. Along the way, I will present applications in summarization, grammar correction, image-to-text, and machine translation (on your phone).

Alexander Rush is an Assistant Professor at Harvard University studying NLP, and formerly a Post-doc at Facebook Artificial Intelligence Research (FAIR). He is interested in machine learning and deep learning methods for large-scale natural language processing and understanding. His past work has introduced novel methods for structured prediction with applications to syntactic parsing and machine translation. His group web page is at http://nlp.seas.harvard.edu/ and he tweets at http://twitter.com/harvardnlp.

Research Areas:

Impact Areas:

This event is not part of a series.

Created by Marcia G. Davidson Email at Thursday, November 03, 2016 at 11:56 AM.