CLSP Fall Seminar Series: Antoine Bosselut

Oct 13, 2023
12 - 1:15pm EDT
This event is free

Who can attend?

  • Faculty
  • Staff
  • Students

Contact

Center for Language and Speech Processing

Description

Antoine Bosselut, an assistant professor in the School of Computer and Communication Sciences at the Ecole Polytechnique Federal de Lausanne, will give a talk titled "From Mechanistic Interpretability to Mechanistic Reasoning" for the Center for Language and Speech Processing.

Abstract:

Pretrained language models (LMs) encode implicit representations of knowledge in their parameters. Despite this observation, our best methods for interpreting these representations yield few actionable insights on how to manipulate this parameter space for downstream benefit. In this talk, I will present work on methods that simulate machine reasoning by localizing and modifying parametric knowledge representations. First, I will present a method for discovering knowledge-critical subnetworks within pretrained language models and show that these sparse computational subgraphs are responsible for the model's ability to encode specific pieces of knowledge. Then, I will present a new reasoning algorithm, RECKONING, a bi-level optimisation procedure that dynamically encodes and reasons over new knowledge at test-time using the model's existing learned knowledge representations as a scratchpad. Finally, I will discuss next steps and challenges in using internal model mechanisms for reasoning.

Who can attend?

  • Faculty
  • Staff
  • Students

Contact

Center for Language and Speech Processing