114 - Behavioral Testing of NLP Models, with Marco Tulio Ribeiro
44 minutes
Description
5 years ago
We invited Marco Tulio Ribeiro, a Senior Researcher at Microsoft,
to talk about evaluating NLP models using behavioral testing, a
framework borrowed from software engineering. Marco describes three
kinds of black-box tests that check whether NLP models satisfy
certain necessary conditions. While it breaks the standard IID
assumption, this framework presents a way to evaluate whether NLP
systems are ready for real-world use. We also discuss which
capabilities can be tested using this framework, how one can come
up with good tests, and the need for an evolving set of behavioral
tests for NLP systems.
Marco's homepage:
https://homes.cs.washington.edu/~marcotcr/
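
For context, here is a minimal sketch (not from the episode) of the
general idea behind one such black-box test: an invariance test, in
which perturbing a label-irrelevant part of the input, such as a
person's name, should not change the model's prediction. Both
predict_sentiment and invariance_test below are hypothetical
stand-ins, not Marco's actual code or any real library's API.

def predict_sentiment(text: str) -> str:
    """Hypothetical black-box model; returns 'pos' or 'neg'."""
    return "pos" if "great" in text.lower() else "neg"

def invariance_test(template: str, fillers: list[str]) -> list[str]:
    """Fill a label-irrelevant slot with different values and report
    any fillers that flip the prediction from the baseline."""
    baseline = predict_sentiment(template.format(fillers[0]))
    return [f for f in fillers[1:]
            if predict_sentiment(template.format(f)) != baseline]

failures = invariance_test("{} said the movie was great.",
                           ["Mary", "Ahmed", "Keiko", "Luis"])
print("fillers that changed the prediction:", failures)

Because the test only queries the model's inputs and outputs, it
works for any NLP system regardless of architecture, which is what
makes this kind of behavioral testing black-box.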