EACL2024: Goodhart’s Law Applies to NLP’s Explanation Benchmarks

Jennifer Hsia, Danish Pruthi, Aarti Singh, Zachary Chase Lipton

Main: Interpretability and Model Analysis in NLP Poster Paper

Session 4: Interpretability and Model Analysis in NLP (Poster)

Conference Room: Radisson

Conference Time: March 18, 16:00-17:30 (CET) (Europe/Malta)

Abstract: