EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians

Speaker: Jacob Steinhardt, UC Berkeley Statistics

Date: Tuesday, October 17, 2023

Time: 2:00 PM to 3:00 PM (all times are in the Eastern Time Zone)

Public: Yes

Location: 32-G449 (Patil / Kiva)

Event Type: Seminar

Room Description: 32-G449 (Patil / Kiva)


Contact: Hyung Ju Suh, hjsuh94@csail.mit.edu

Reminders to: seminars@csail.mit.edu

Reminder Subject: TALK: EI Seminar - Jacob Steinhardt - Large Language Models as Statisticians

Given their complex behavior, diverse skills, and wide range of deployment scenarios, understanding large language models---and especially their failure modes---is important. Given that new models are released every few months, often with brand new capabilities, how can we achieve understanding that keeps pace with modern practice?

In this talk, I will present an approach to this problem that leverages the skills of language models themselves, and so scales up as models get better. Specifically, we leverage the skill of language models *as statisticians*. At inference time, language models can read and process significant amounts of information due to their large context windows, and use this to generate useful statistical hypotheses. We will showcase several systems built on this principle, which allow us to audit other models for failures, identify spurious cues in datasets, label the internal representations of models, and factorize corpora into human-interpretable concepts.

This is joint work with many collaborators and students, including Ruiqi Zhong, Erik Jones, and Yossi Gandelsman.

Research Areas:
Algorithms & Theory, AI & Machine Learning, Graphics & Vision, Robotics

This event is not part of a series.

Created by Hyung Ju Suh on Wednesday, October 11, 2023 at 5:25 PM.