Explaining Language Model Predictions with High-Impact Concepts

Ruochen Zhao, Tan Wang, Yongjie Wang, Shafiq Joty

Main: Interpretability and Model Analysis in NLP Poster Paper

Session 11: Interpretability and Model Analysis in NLP (Poster)
Conference Room: GatherTown
Conference Time: March 20, 14:00-14:45 (CET) (Europe/Malta)
TLDR:
You can open the #paper-229 channel in a separate window.
Abstract: