117 - Interpreting NLP Model Predictions, with Sameer Singh
We interviewed Sameer Singh for this episode, and…
57 Minuten
Podcast
Podcaster
Beschreibung
vor 5 Jahren
We interviewed Sameer Singh for this episode, and discussed an
overview of recent work in interpreting NLP model predictions,
particularly instance-level interpretations. We started out by
talking about why it is important to interpret model outputs and
why it is a hard problem. We then dove into the details of three
kinds of interpretation techniques: attribution based methods,
interpretation using influence functions, and generating
explanations. Towards the end, we spent some time discussing how
explanations of model behavior can be evaluated, and some
limitations and potential concerns in evaluation methods. Sameer
Singh is an Assistant Professor of Computer Science at the
University of California, Irvine. Some of the techniques discussed
in this episode have been implemented in the AllenNLP Interpret
framework (details and demo here: https://allennlp.org/interpret).
overview of recent work in interpreting NLP model predictions,
particularly instance-level interpretations. We started out by
talking about why it is important to interpret model outputs and
why it is a hard problem. We then dove into the details of three
kinds of interpretation techniques: attribution based methods,
interpretation using influence functions, and generating
explanations. Towards the end, we spent some time discussing how
explanations of model behavior can be evaluated, and some
limitations and potential concerns in evaluation methods. Sameer
Singh is an Assistant Professor of Computer Science at the
University of California, Irvine. Some of the techniques discussed
in this episode have been implemented in the AllenNLP Interpret
framework (details and demo here: https://allennlp.org/interpret).
Weitere Episoden
30 Minuten
vor 2 Jahren
51 Minuten
vor 2 Jahren
45 Minuten
vor 2 Jahren
48 Minuten
vor 2 Jahren
36 Minuten
vor 2 Jahren
In Podcasts werben
Kommentare (0)