Align and Augment: Generative Data Augmentation for Compositional Generalization

Francesco Cazzaro, Davide Locatelli, Ariadna Quattoni

Main: Semantics and Applications Oral Paper

Session 9: Semantics and Applications (Oral)
Conference Room: Marie Louise 2
Conference Time: March 20, 09:00-10:30 (CET) (Europe/Malta)
TLDR:
You can open the #paper-45-Oral channel in a separate window.
Abstract: Recent work on semantic parsing has shown that seq2seq models find compositional generalization challenging. Several strategies have been proposed to mitigate this challenge. One such strategy is to improve compositional generalization via data augmentation techniques. In this paper we follow this line of work and propose Archer, a data-augmentation strategy that exploits alignment annotations between sentences and their corresponding meaning representations. More precisely, we use alignments to train a two step generative model that combines monotonic lexical generation with reordering. Our experiments show that Archer leads to significant improvements in compositional generalization performance.