VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection

Arushi Rai, Adriana Kovashka

Main: Multimodality and Language Grounding to Vision, Robotics and Beyond (Poster Paper)

Session 10: Multimodality and Language Grounding to Vision, Robotics and Beyond (Poster)
Conference Room: Radisson
Conference Time: March 20, 11:00-12:30 (CET) (Europe/Malta)
Abstract: