UNSEE: Unsupervised Non-contrastive Sentence Embeddings
Ömer Veysel Çağatan
Main: Sentence-level Semantics Oral Paper
Session 6: Sentence-level Semantics (Oral)
Conference Room: Carlson
Conference Time: March 19, 10:30-12:00 (CET) (Europe/Malta)
Abstract:
In this paper, we introduce UNSEE (Unsupervised Non-contrastive Sentence Embeddings), which outperforms SimCSE on the Massive Text Embedding Benchmark (MTEB). We first show that replacing the contrastive objective in SimCSE with a non-contrastive objective leads to representation collapse. We then introduce a simple remedy, a target network, to mitigate this problem. This approach lets us use non-contrastive objectives while keeping training stable and achieving performance improvements similar to those seen with contrastive objectives. Through extensive fine-tuning and optimization, we reach peak performance in non-contrastive sentence embeddings. The resulting sentence representation models underline the importance of careful tuning and optimization for non-contrastive objectives.
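
The abstract does not say how the target network is updated or which non-contrastive objective is used. As a rough illustration only, the sketch below assumes a BYOL-style setup: the target parameters follow the online parameters through an exponential moving average, and the loss is one minus the cosine similarity between an online embedding and a target embedding, with no negative pairs. The names `ema_update`, `cosine_loss`, and `tau` are hypothetical and are not taken from the paper.

```python
import math

def ema_update(target_params, online_params, tau=0.99):
    """Assumed update rule: move each target parameter a small step
    toward the matching online parameter (exponential moving average)."""
    return [tau * t + (1.0 - tau) * o
            for t, o in zip(target_params, online_params)]

def cosine_loss(online_vec, target_vec):
    """Assumed non-contrastive objective: 1 - cosine similarity between
    the online and target embeddings. No negative examples are involved."""
    dot = sum(a * b for a, b in zip(online_vec, target_vec))
    norm_online = math.sqrt(sum(a * a for a in online_vec))
    norm_target = math.sqrt(sum(b * b for b in target_vec))
    return 1.0 - dot / (norm_online * norm_target)

# Toy parameters standing in for encoder weights (illustrative values only).
online = [1.0, 2.0, 3.0]
target = [0.0, 0.0, 0.0]

# After each optimizer step on the online network, the target network is
# nudged toward it; the target itself receives no gradient.
target = ema_update(target, online, tau=0.9)
```

In such a setup the slowly moving target gives the online network a stable regression signal, which is one common explanation for why a target network can help avoid collapse; whether UNSEE uses exactly this rule is not stated in the abstract.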