Goodhart’s Law Applies to NLP’s Explanation Benchmarks
Jennifer Hsia, Danish Pruthi, Aarti Singh, Zachary Chase Lipton
Main: Interpretability and Model Analysis in NLP Poster Paper
Session 4: Interpretability and Model Analysis in NLP (Poster)
Conference Room: Radisson
Conference Time: March 18, 16:00-17:30 (CET) (Europe/Malta)
TLDR:
You can open the
#paper-288
channel in a separate window.
Abstract: