67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman
Paper by Alex Wang, Amanpreet Singh, Julian Micha…
39 Minuten
Podcast
Podcaster
Beschreibung
vor 7 Jahren
Paper by Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill,
Omer Levy, and Samuel R. Bowman. Sam comes on to tell us about
GLUE. We talk about the motivation behind setting up a benchmark
framework for natural language understanding, how the authors
defined "NLU" and chose the tasks for this benchmark, a very nice
diagnostic dataset that was constructed for GLUE, and what insight
they gained from the experiments they've run so far. We also have
some musings about the utility of general-purpose sentence vectors,
and about leaderboards.
https://www.semanticscholar.org/paper/GLUE%3A-A-Multi-Task-Benchmark-and-Analysis-Platform-Wang-Singh/a2054eff8b4efe0f1f53d88c08446f9492ae07c1
Omer Levy, and Samuel R. Bowman. Sam comes on to tell us about
GLUE. We talk about the motivation behind setting up a benchmark
framework for natural language understanding, how the authors
defined "NLU" and chose the tasks for this benchmark, a very nice
diagnostic dataset that was constructed for GLUE, and what insight
they gained from the experiments they've run so far. We also have
some musings about the utility of general-purpose sentence vectors,
and about leaderboards.
https://www.semanticscholar.org/paper/GLUE%3A-A-Multi-Task-Benchmark-and-Analysis-Platform-Wang-Singh/a2054eff8b4efe0f1f53d88c08446f9492ae07c1
Weitere Episoden
30 Minuten
vor 2 Jahren
51 Minuten
vor 2 Jahren
45 Minuten
vor 2 Jahren
48 Minuten
vor 2 Jahren
36 Minuten
vor 2 Jahren
In Podcasts werben
Kommentare (0)