Gradient-Based Language Model Red Teaming

Nevan Wichers, Carson Denison, Ahmad Beirami

Main: Machine Learning for NLP Poster Paper

Session 9: Machine Learning for NLP (Poster)
Conference Room: Radisson
Conference Time: March 20, 09:00-10:30 (CET) (Europe/Malta)
TLDR:
You can open the #paper-458 channel in a separate window.
Abstract: