The Dissimilarity Dimension: Sharper Bounds for Optimistic Algorithms

Speaker: Aldo Pacchiano , LIDS and CSAIL

Date: Friday, October 13, 2023

Time: 4:00 PM to 4:30 PM Note: all times are in the Eastern Time Zone

Public: Yes

Location: 32-370

Event Type: Seminar

Room Description:

Host: Sharut Gupta, CSAIL MIT

Contact: Sharut Gupta, sharut@csail.mit.edu

Relevant URL:

Speaker URL: None

Speaker Photo:
None

Reminders to: lids-seminars@mit.edu, seminars@csail.mit.edu, mitml@mit.edu

Reminder Subject: TALK: The Dissimilarity Dimension: Sharper Bounds for Optimistic Algorithms

Abstract: The principle of Optimism in the Face of Uncertainty (OFU) is one of the foundational algorithmic design choices in Reinforcement Learning and Bandits. Optimistic algorithms balance exploration and exploitation by deploying data collection strategies that maximize expected rewards in plausible models. This is the basis of celebrated algorithms like the Upper Confidence Bound (UCB) for multi-armed bandits. For nearly a decade, the analysis of optimistic algorithms, including Optimistic Least Squares (OLS), in the context of rich reward function classes has relied on the concept of eluder dimension, introduced by Russo and Van Roy in 2013. In this talk we shed light on the limitations of the eluder dimension in capturing the true behavior of optimistic strategies in the realm of function approximation. We remediate these by introducing a novel statistical measure, the ‚Äúdissimilarity dimension‚ÄĚ. We show it can be used to provide sharper sample analysis of algorithms like OLS by establishing a link between regret and the dissimilarity dimension. To illustrate this, we will show that some function classes have arbitrarily large eluder dimension but constant dissimilarity. Our regret analysis draws inspiration from graph theory and may be of interest to the mathematically minded beyond the field of statistical learning theory. This talk sheds new light on the fundamental principle of optimism and its algorithms in the function approximation regime, advancing our understanding of these concepts.

Speaker bio: Aldo Pacchiano is a postdoctoral researcher Fellow at the Eric and Wendy Schmidt Center of the broad institute of MIT and Harvard. He obtained his PhD under the supervision of Profs. Michael Jordan and Peter Bartlett at UC Berkeley and was a Postdoctoral Researcher at Microsoft Research, NYC. He will join the Boston University Center for Computing and Data Sciences as an assistant professor in the summer of 2024. His research lies in the areas of Reinforcement Learning, Online Learning, Bandits and Algorithmic Fairness. He is particularly interested in furthering our statistical understanding of learning phenomena in adaptive environments and use these theoretical insights and techniques to design efficient and safe algorithms for scientific, engineering, and large-scale societal applications.

Research Areas:

Impact Areas:

See other events that are part of the ML Tea.

Created by Sharut Gupta Email at Tuesday, October 10, 2023 at 10:34 AM.