Gradient-Based Language Model Red Teaming
Nevan Wichers, Carson Denison, Ahmad Beirami
Main: Machine Learning for NLP Poster Paper
Session 9: Machine Learning for NLP (Poster)
Conference Room: Radisson
Conference Time: March 20, 09:00-10:30 (CET) (Europe/Malta)
Abstract: