CLSP Spring Seminar Series: Emily Prud'hommeaux

March 31, 2023
12 - 1:15pm EDT
This event is free

Who can attend?

  • Faculty
  • Staff
  • Students

Contact

Center for Language and Speech Processing

Description

Emily Prud'hommeaux, an assistant professor in the Department of Computer Science at Boston College, will present a Center for Language and Speech Processing seminar titled "Endangered or Just Under-Resourced? Evaluating ASR Quality and Utility When Data is Scarce."

Abstract:

Despite many recent advances in automatic speech recognition (ASR), linguists and language communities engaged in language documentation projects continue to face the obstacle of the "transcription bottleneck." Researchers in NLP typically do not distinguish between widely spoken languages that currently happen to have few training resources and endangered languages that will never have abundant data. As a result, we often fail to thoroughly explore when ASR is helpful for language documentation, what architectures work best for the sorts of languages that are in need of documentation, and how data can be collected and organized to produce optimal results. In this talk I describe several projects that attempt to bridge the gap between the promise of ASR for language documentation and the reality of using this technology in real-world settings.
