Timezone: Conference (Singapore) UTC Browser
Session 2
Oral Presentations
Dialogue and Interactive Systems (Oral)
Room: Marie Louise 2
- A Comparative Multidimensional Analysis of Empathetic Systems. Andrew Lee, Jonathan K. Kummerfeld, Larry Ann, Rada Mihalcea.
- Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions. Alberto Testoni, Raquel Fernández.
- Leveraging Implicit Feedback from Deployment Data in Dialogue. Richard Yuanzhe Pang, Stephen Roller, Kyunghyun Cho, He He, Jason E Weston.
- Commonsense-augmented Memory Construction and Management in Long-term Conversations via Context-aware Persona Refinement. Hana Kim, Kai Tzu-iunn Ong, Seoyeon Kim, Dongha Lee, Jinyoung Yeo.
- HumBEL: A Human-in-the-Loop Approach for Evaluating Demographic Factors of Language Models in Human-Machine Conversations. Anthony Sicilia, Jennifer C. Gates, Malihe Alikhani.
- SynthDST: Synthetic Data is All You Need for Few-Shot Dialog State Tracking. Atharva Kulkarni, Bo-Hsiang Tseng, Joel Ruben Antony Moniz, Dhivya Piraviperumal, Hong Yu, Shruti Bhargava.
Factual Content in NLP (Oral)
Room: Marie Louise 1
- What Makes Medical Claims (Un)Verifiable? Analyzing Entity and Relation Properties for Fact Verification. Amelie Wuehrl, Yarik Menchaca Resendiz, Lara Grimminger, Roman Klinger.
- Tracing the Roots of Facts in Multilingual Language Models: Independent, Shared, and Transferred Knowledge. Xin Zhao, Naoki Yoshinaga, Daisuke Oba.
- Rethinking Loss Functions for Fact Verification. Yuta Mukobara, Yutaro Shigeto, Masashi Shimbo.
- Counterfactual Reasoning with Knowledge Graph Embeddings. Lena Zellinger, Andreas Stephan, Benjamin Roth.
- Multimodal Fallacy Classification in Political Debates. Eleonora Mancini, Federico Ruggeri, Paolo Torroni.
- Leveraging fine-tuned Large Language Models with LoRA for Effective Claim, Claimer, and Claim Object Detection. Sotiris Kotitsas, Panagiotis Kounoudis, Eleni Koutli, Haris Papageorgiou.
Machine Learning for NLP (Oral)
Room: Carlson
- Extreme Fine-tuning: A Novel and Fast Fine-tuning Approach for Text Classification. Boonnithi Jiaramaneepinit, Thodsaporn Chay-intr, Kotaro Funakoshi, Manabu Okumura.
- Plan-Grounded Large Language Models for Dual Goal Conversational Settings. Diogo Glória-Silva, Rafael Ferreira, Diogo Tavares, David Semedo, Joao Magalhaes.
- TESS: Text-to-Text Self-Conditioned Simplex Diffusion. Rabeeh Karimi Mahabadi, Hamish Ivison, Jaesung Tae, James Henderson, Iz Beltagy, Matthew E Peters, Arman Cohan.
- Dynamic Masking Rate Schedules for MLM Pretraining. Zachary Ankner, Naomi Saphra, Davis Blalock, Jonathan Frankle, Matthew L Leavitt.
- Gradient-Based Language Model Red Teaming. Nevan Wichers, Carson Denison, Ahmad Beirami.
- A* shortest string decoding for non-idempotent semirings. Kyle Gorman, Cyril Allauzen.
Session 3
Oral Presentations
Discourse and Syntactic Parsing (Oral)
Room: Marie Louise 1
- Unleashing the Power of Discourse-Enhanced Transformers for Propaganda Detection. Alexander Chernyavskiy, Dmitry Ilvovsky, Preslav Nakov.
- Can we obtain significant success in RST discourse parsing by using Large Language Models? Aru Maekawa, Tsutomu Hirao, Hidetaka Kamigaito, Manabu Okumura.
- Improving Generalization in Semantic Parsing by Increasing Natural Language Variation. Irina Saparina, Mirella Lapata.
- From Partial to Strictly Incremental Constituent Parsing. Ana Ezquerro, Carlos Gómez-Rodríguez, David Vilares.
- A Truly Joint Neural Architecture for Segmentation and Parsing. Danit Yshaayahu Levi, Reut Tsarfaty.
- Generation and Polynomial Parsing of Graph Languages with Non-Structural Reentrancies. Johanna Björklund, Frank Drewes, Anna Jonsson.
Efficient Low-resource methods in NLP (Oral)
Room: Carlson
- Finding a Needle in the Adversarial Haystack: A Targeted Paraphrasing Approach For Uncovering Edge Cases with Minimal Distribution Distortion. Aly M. Kassem, Sherif Saad.
- FAIR: Filtering of Automatically Induced Rules. Divya Jyoti Bajpai, Ayush Maheshwari, Manjesh Kumar Hanawal, Ganesh Ramakrishnan.
- Quality Does Matter: A Detailed Look at the Quality and Utility of Web-Mined Parallel Corpora. Surangika Ranathunga, Nisansa de Silva, Velayuthan Menan, Aloka Fernando, Charitha S.M. Rathnayake.
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions. Minghao Wu, Abdul Waheed, Chiyu Zhang, Muhammad Abdul-Mageed, Alham Fikri Aji.
- Anchor Points: Benchmarking Models with Much Fewer Examples. Rajan Pathe Vivek, Kawin Ethayarajh, Diyi Yang, Douwe Kiela.
- Aligning Large and Small Language Models via Chain-of-Thought Reasoning. Leonardo Ranaldi, Andre Freitas.
Multimodality (Oral)
Room: Marie Louise 2
- The Role of Data Curation in Image Captioning. Wenyan Li, Jonas F. Lotz, Chen Qiu, Desmond Elliott.
- VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection. Arushi Rai, Adriana Kovashka.
- Text-Guided Image Clustering. Andreas Stephan, Lukas Miklautz, Kevin Sidak, Jan Philip Wahle, Bela Gipp, Claudia Plant, Benjamin Roth.
- Towards Hierarchical Spoken Language Disfluency Modeling. Jiachen Lian, Gopala Anumanchipalli.
- Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations. Amit Meghanani, Thomas Hain.
- STORiCo: Storytelling TTS for Hindi with Character Voice Modulation. Pavan Kalyan Tankala, Preethi Jyothi, Preeti Rao, Pushpak Bhattacharyya.
Poster Presentations
Demo (Poster)
Room: Radisson
- DepressMind: A Depression Surveillance System for Social Media Analysis. Roque Fernández-Iglesias, Marcos Fernandez-Pichel, Mario Aragon, David E. Losada.
- Multi-party Multimodal Conversations Between Patients, Their Companions, and a Social Robot in a Hospital Memory Clinic. Angus Addlesee, Neeraj Cherakara, Nivan Nelson, Daniel Hernandez Garcia, Nancie Gunson, Weronika Sieińska, Christian Dondrup, Oliver Lemon.
- TL;DR Progress: Multi-faceted Literature Exploration in Text Summarization. Shahbaz Syed, Khalid Al Khatib, Martin Potthast.
Dialogue and Interactive Systems (Poster)
Room: Radisson
- Improving Backchannel Prediction Leveraging Sequential and Attentive Context Awareness. Yo-Han Park, Wencke Liermann, Yong-Seok Choi, Kong Joo Lee.
- Let's Negotiate! A Survey of Negotiation Dialogue Systems. Haolan Zhan, Yufei Wang, Zhuang Li, Tao Feng, Yuncheng Hua, Suraj Sharma, Lizhen Qu, Zhaleh Semnani Azad, Ingrid Zukerman, Reza Haf.
- Local and Global Contexts for Conversation. Zuoquan Lin.
- Style Vectors for Steering Generative Large Language Models. Kai Konen, Sophie Jentzsch, Diaoulé Diallo, Peer Schütt, Oliver Bensch, Roxanne El Baff, Dominik Opitz, Tobias Hecking.
Generation (Poster)
Room: Radisson
- A Methodology for Generative Spelling Correction via Natural Spelling Errors Emulation across Multiple Domains and Languages. Nikita Martynov, Mark Baushenko, Anastasia Kozlova, Katerina Kolomeytseva, Aleksandr Abramov, Alena Fenogenova.
- Flow Matching for Conditional Text Generation in a Few Sampling Steps. Vincent Tao Hu, Di Wu, Yuki M Asano, Pascal Mettes, Basura Fernando, Björn Ommer, Cees G. M. Snoek.
- High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models. Michela Lorandi, Anya Belz.
- Small Language Models Improve Giants by Rewriting Their Outputs. Giorgos Vernikos, Arthur Brazinskas, Jakub Adamek, Jonathan Mallinson, Aliaksei Severyn, Eric Malmi.
Information Extraction (Poster)
Room: Radisson
- 3D Rotation and Translation for Hyperbolic Knowledge Graph Embedding. Yihua Zhu, Hidetoshi Shimodaira.
- Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction. Qingyun Wang, Zixuan Zhang, Hongxiang Li, Xuan Liu, Jiawei Han, Huimin Zhao, Heng Ji.
- Large Language Models for Scientific Information Extraction: An Empirical Study for Virology. Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer.
- Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection. David Dukić, Kiril Gashteovski, Goran Glavaš, Jan Snajder.
- SENSE-LM: A Synergy between a Language Model and Sensorimotor Representations for Auditory and Olfactory Information Extraction. Cédric Boscher, Christine Largeron, Véronique Eglin, Elöd Egyed-Zsigmond.
- STable: Table Generation Framework for Encoder-Decoder Models. Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Nowakowska, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek.
Information Retrieval and Text Mining (Poster)
Room: Radisson
- Argument Mining as a Text-to-Text Generation Task. Masayuki Kawarada, Tsutomu Hirao, Wataru Uchida, Masaaki Nagata.
- Backtracing: Retrieving the Cause of the Query. Rose E Wang, Pawan Wirawarn, Omar Khattab, Noah Goodman, Dorottya Demszky.
- Clarifying the Path to User Satisfaction: An Investigation into Clarification Usefulness. Hossein A. Rahmani, Xi Wang, Mohammad Aliannejadi, Mohammadmehdi Naghiaei, Emine Yilmaz.
- Corpus-Steered Query Expansion with Large Language Models. Yibin Lei, Yu Cao, Tianyi Zhou, Tao Shen, Andrew Yates.
- Joint Inference of Retrieval and Generation for Passage Re-ranking. Wei Fang, Yung-Sung Chuang, James R. Glass.
- More Discriminative Sentence Embeddings via Semantic Graph Smoothing. Chakib Fettal, Lazhar Labiod, Mohamed Nadif.
- Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels. Negar Arabzadeh, Charles L. A. Clarke.
- Unsupervised Multilingual Dense Retrieval via Generative Pseudo Labeling. Chao-Wei Huang, Chen-An Li, Tsu-Yuan Hsu, Chen-Yu Hsu, Yun-Nung Chen.
- When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets. Orion Weller, Kyle Lo, David Wadden, Dawn Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini.
- Why Generate When You Can Discriminate? A Novel Technique for Text Classification using Language Models. Sachin Pawar, Nitin Ramrakhiyani, Anubhav Sinha, Manoj Apte, Girish Keshav Palshikar.
Question Answering (Poster)
Room: Radisson
Sentiment Analysis, Stylistic Analysis and Argument Mining (Poster)
Room: Radisson
- An Empirical Analysis of Diversity in Argument Summarization. Michiel van der Meer, Piek Vossen, Catholijn M Jonker, Pradeep Kumar Murukannaiah.
- Evaluating Unsupervised Argument Aligners via Generation of Conclusions of Structured Scientific Abstracts. Yingqiang Gao, Nianlong Gu, Jessica Lam, James Henderson, Richard Hahnloser.
Summarization (Poster)
Room: Radisson
- μPLAN: Summarizing using a Content Plan as Cross-Lingual Bridge. Fantine Huot, Joshua Maynez, Chris Alberti, Reinald Kim Amplayo, Priyanka Agrawal, Constanza Fierro, Shashi Narayan, Mirella Lapata.
- Less is More for Long Document Summary Evaluation by LLMs. Yunshu Wu, Hayate Iso, Pouya Pezeshkpour, Nikita Bhutani, Estevam Hruschka.
Session 4
Oral Presentations
Information Extraction (Oral)
Room: Carlson
- OpenPI2.0: An Improved Dataset for Entity Tracking in Texts. Li Zhang, Hainiu Xu, Abhinav Kommula, Chris Callison-Burch, Niket Tandon.
- MultiMUC: Multilingual Template Filling on MUC-4. William Gantt, Shabnam Behzad, Hannah YoungEun An, Yunmo Chen, Aaron Steven White, Benjamin Van Durme, Mahsa Yarmohammadi.
- 3D Rotation and Translation for Hyperbolic Knowledge Graph Embedding. Yihua Zhu, Hidetoshi Shimodaira.
- Ameli: Enhancing Multimodal Entity Linking with Fine-Grained Attributes. Barry Menglong Yao, Sijia Wang, Yu Chen, Qifan Wang, Minqian Liu, Zhiyang Xu, Licheng Yu, Lifu Huang.
- Large-Scale Label Interpretation Learning for Few-Shot Named Entity Recognition. Jonas Golde, Felix Hamborg, Alan Akbik.
- Chaining Event Spans for Temporal Relation Grounding. Jongho Kim, Dohyeon Lee, Minsoo Kim, Seung-won Hwang.
Linguistic Theory and Insights (Oral)
Room: Marie Louise 1
- Syntactic Preposing and Discourse Relations. Yunfang Dong, Xixian Liao, Bonnie L. Webber.
- Large-Scale Bitext Corpora Provide New Evidence for Cognitive Representations of Spatial Terms. Peter Viechnicki, Kevin Duh, Anthony Kostacos, Barbara Landau.
- Clever Hans or Neural Theory of Mind? Stress Testing Social Reasoning in Large Language Models. Natalie Shapira, Mosh Levy, Seyed Hossein Alavi, Xuhui Zhou, Yejin Choi, Yoav Goldberg, Maarten Sap, Vered Shwartz.
- Frequency Explains the Inverse Correlation of Large Language Models' Size, Training Data Amount, and Surprisal's Fit to Reading Times. Byung-Doh Oh, Shisen Yue, William Schuler.
- Automated Cognate Detection as a Supervised Link Prediction Task with Cognate Transformer. V.S.D.S.Mahesh Akavarapu, Arnab Bhattacharya.
- Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models. Erik Arakelyan, Zhaoqi Liu, Isabelle Augenstein.
Opinion, Sentiment and Emotion (Oral)
Room: Marie Louise 2
- Improving Contrastive Learning in Emotion Recognition in Conversation via Data Augmentation and Decoupled Neutral Emotion. Yujin Kang, Yoon-Sik Cho.
- Predicting Client Emotions and Therapist Interventions in Psychotherapy Dialogues. Tobias Mayer, Neha Warikoo, Amir Eliassaf, Dana Atzil-Slonim, Iryna Gurevych.
- “Define Your Terms”: Enhancing Efficient Offensive Speech Classification with Definition. Huy Nghiem, Umang Gupta, Fred Morstatter.
- Unsupervised stance detection for social media discussions: A generic baseline. Maia Sutter, Antoine Gourru, Amine Trabelsi, Christine Largeron.
- Putting Context in Context: the Impact of Discussion Structure on Text Classification. Nicolò Penzo, Antonio Longa, Bruno Lepri, Sara Tonelli, Marco Guerini.
- A Weak Supervision Approach for Few-Shot Aspect Based Sentiment Analysis. Robert Vacareanu, Siddharth Varia, Kishaloy Halder, Shuai Wang, Giovanni Paolini, Neha Anna John, Miguel Ballesteros, Smaranda Muresan.
Poster Presentations
Demo (Poster)
Room: Radisson
- TextBI: An Interactive Dashboard for Visualizing Multidimensional NLP Annotations in Social Media Data. Maxime Masson, Christian Sallaberry, Marie-Noelle Bessagnet, Annig Le Parc Lacayrelle, Philippe Roose, Rodrigo Agerri.
- A Human-Centric Evaluation Platform for Explainable Knowledge Graph Completion. Zhao Xu, Wiem Ben Rim, Kiril Gashteovski, Timo Sztyler, Carolin Lawrence.
- AnnoPlot: Interactive Visualizations of Text Annotations. Elisabeth Fittschen, Tim Fischer, Daniel Brühl, Julia Spahr, Yuliia Lysa, Phuoc Thang Le.
- X-AMR Annotation Tool. Shafiuddin Rehan Ahmed, Jon Cai, Martha Palmer, James H. Martin.
Efficient Low-resource methods in NLP (Poster)
Room: Radisson
- Aligning Large and Small Language Models via Chain-of-Thought Reasoning. Leonardo Ranaldi, Andre Freitas.
- Clustering-based Sampling for Few-Shot Cross-Domain Keyphrase Extraction. Prakamya Mishra, Lincy Pattanaik, Arunima Sundar, Nishant Yadav, Mayank Kulkarni.
- Predicting Machine Translation Performance on Low-Resource Languages: The Role of Domain Similarity. Eric Khiu, Hasti Toossi, Jinyu Liu, Jiaxu Li, David Anugraha, Juan Armando Parra Flores, Leandro Arcos Roman, A. Seza Doğruöz, En-Shiun Annie Lee.
- Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference. Parsa Kavehzadeh.
- Unraveling the Dynamics of Semi-Supervised Hate Speech Detection: The Impact of Unlabeled Data Characteristics and Pseudo-Labeling Strategies. Florian Ludwig, Klara Dolos, Ana Alves-Pinto, Torsten Zesch.
- What the Weight?! A Unified Framework for Zero-Shot Knowledge Composition. Carolin Holtermann, Markus Frohmann, Navid Rekabsaz, Anne Lauscher.
- Who Needs Decoders? Efficient Estimation of Sequence-Level Attributes with Proxies. Yassir Fathullah, Puria Radmard, Adian Liusie, Mark Gales.
Interpretability and Model Analysis in NLP (Poster)
Room: Radisson
- “According to ...”: Prompting Language Models Improves Quoting from Pre-Training Data. Orion Weller, Marc Marone, Nathaniel Weir, Dawn Lawrie, Daniel Khashabi, Benjamin Van Durme.
- A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation. Sahar Sadrizadeh, Ljiljana Dolamic, Pascal Frossard.
- A Comprehensive Evaluation of Inductive Reasoning Capabilities and Problem Solving in Large Language Models. Chen Bowen, Rune Sætre, Yusuke Miyao.
- Anisotropy Is Inherent to Self-Attention in Transformers. Nathan Godey, Éric Villemonte de la Clergerie, Benoît Sagot.
- Approximate Attributions for Off-the-Shelf Siamese Transformers. Lucas Moeller, Dmitry Nikolaev, Sebastian Padó.
- Can Large Language Models Understand Context? Yilun Zhu, Joel Ruben Antony Moniz, Shruti Bhargava, Jiarui Lu, Dhivya Piraviperumal, Site Li, Yuan Zhang, Hong Yu, Bo-Hsiang Tseng.
- CMA-R: Causal Mediation Analysis for Explaining Rumour Detection. Lin Tian, Xiuzhen Zhang, Jey Han Lau.
- Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization. Andreas Waldis, Yufang Hou, Iryna Gurevych.
- Do Language Models Know When They're Hallucinating References? Ayush Agrawal, Mirac Suzgun, Lester Mackey, Adam Tauman Kalai.
- Establishing degrees of closeness between audio recordings along different dimensions using large-scale cross-lingual models. Maxime Fily, Guillaume Wisniewski, Severine Guillaume, Gilles Adda, Alexis Michaud.
- Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features. Eliana Pastor, Alkis Koudounas, Giuseppe Attanasio, Dirk Hovy, Elena Baralis.
- Goodhart’s Law Applies to NLP’s Explanation Benchmarks. Jennifer Hsia, Danish Pruthi, Aarti Singh, Zachary Chase Lipton.
- Interpreting Predictive Probabilities: Model Confidence or Human Label Variation? Joris Baan, Raquel Fernández, Barbara Plank, Wilker Aziz.
- Investigating grammatical abstraction in language models using few-shot learning of novel noun gender. Priyanka Sukumaran, Conor Houghton, Nina Kazanina.
- Over-Reasoning and Redundant Calculation of Large Language Models. Cheng-Han Chiang, Hung-yi Lee.
- Relabeling Minimal Training Subset to Flip a Prediction. Jinghan Yang, Linjie Xu, Lequan Yu.
- SPUQ: Perturbation-Based Uncertainty Quantification for Large Language Models. Xiang Gao, Jiaxin Zhang, Lalla Mouatadid, Kamalika Das.
- Testing the Depth of ChatGPT's Comprehension via Cross-Modal Tasks Based on ASCII-Art: GPT3.5's Abilities in Regard to Recognizing and Generating ASCII-Art Are Not Totally Lacking. David Bayani.
- The Queen of England is not England’s Queen: On the Lack of Factual Coherency in PLMs. Paul Youssef, Jörg Schlötterer, Christin Seifert.
- The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models. Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov.
- Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis. Oscar Chew, Hsuan-Tien Lin, Kai-Wei Chang, Kuan-Hao Huang.
- VOLTAGE: A Versatile Contrastive Learning based OCR Methodology for ultra low-resource scripts through Auto Glyph Feature Extraction. Prawaal Sharma, Poonam Goyal, Vidisha Sharma, Navneet Goyal.
Resources and Evaluation (Poster)
Room: Radisson
- A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry. Michael Toker, Oren Mishali, Ophir Münz-Manor, Benny Kimelfeld, Yonatan Belinkov.
- Barriers to Effective Evaluation of Simultaneous Interpretation. Shira Wein, Te I, Colin Cherry, Juraj Juraska, Dirk Padfield, Wolfgang Macherey.
- Centering the Speech Community. Steven Bird, Dean Yibarbuk.
- Comparing Template-based and Template-free Language Model Probing. Sagi Shaier, Kevin Bennett, Lawrence Hunter, Katharina von der Wense.
- Do-Not-Answer: Evaluating Safeguards in LLMs. Yuxia Wang, Haonan Li, Xudong Han, Preslav Nakov, Timothy Baldwin.
- Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs. Simone Balloccu, Patrícia Schmidtová, Mateusz Lango, Ondrej Dusek.
- Rainbow - A Benchmark for Systematic Testing of How Sensitive Visio-Linguistic Models are to Color Naming. Marie Bexte, Andrea Horbach, Torsten Zesch.
- Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning. Ashish Sunil Agrawal, Barah Fazili, Preethi Jyothi.
- Where Do We Go From Here? Multi-scale Allocentric Relational Inference from Natural Spatial Descriptions. Tzuf Paz-Argaman, John Palowitch, Sayali Kulkarni, Jason Michael Baldridge, Reut Tsarfaty.
Business Meeting
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: GatherTown
- French GossipPrompts: Dataset For Prevention of Generating French Gossip Stories By LLMs. MSVPJ Sathvik, Abhilash Dowpati, Revanth Kumar Narra.
- Discovering and Articulating Frames of Communication from Social Media Using Chain-of-Thought Reasoning. Maxwell Weinzierl, Sanda Harabagiu.
- Reading Between the Tweets: Deciphering Ideological Stances of Interconnected Mixed-Ideology Communities. Zihao He, Ashwin Rao, Siyi Guo, Negar Mokhberian, Kristina Lerman.
- LLM-GEm: Large Language Model-Guided Prediction of People's Empathy Levels towards Newspaper Article. Md Rakibul Hasan.
Dialogue and Interactive Systems (Poster)
Room: GatherTown
- Parameter-Efficient Conversational Recommender System as a Language Processing Task. Mathieu Ravaut, Hao Zhang, Lu Xu, Aixin Sun, Yong Liu.
- Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation in Dialogues. Shivani Kumar, Tanmoy Chakraborty.
- Investigating Agency of LLMs in Human-AI Collaboration Tasks. Ashish Sharma, Sudha Rao, Chris Brockett, Akanksha Malhotra, Nebojsa Jojic, Bill Dolan.
- Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue. Kushal Chawla, Hannah Rashkin, Gaurav Singh Tomar, David Reitter.
- Creating Suspenseful Stories: Iterative Planning with Large Language Models. Kaige Xie, Mark Riedl.
- System-Level Natural Language Feedback. Weizhe Yuan, Kyunghyun Cho, Jason E Weston.
Efficient Low-resource methods in NLP (Poster)
Room: Radisson
Ethics and NLP (Poster)
Room: GatherTown
Generation (Poster)
Room: GatherTown
Information Extraction (Poster)
Room: GatherTown
- CEAN: Contrastive Event Aggregation Network with LLM-based Augmentation for Event Extraction. Zihao Meng, Tao Liu, Heng Zhang, Kai Feng, Peng Zhao.
- Noise Contrastive Estimation-based Matching Framework for Low-resource Security Attack Pattern Recognition. Tu Nguyen, Nedim Šrndić, Alexander Neth.
- EnCore: Fine-Grained Entity Typing by Pre-Training Entity Encoders on Coreference Chains. Frank Martin Mtumbuka, Steven Schockaert.
- CCPrefix: Counterfactual Contrastive Prefix-Tuning for Many-Class Classification. Yang Li, Canran Xu, Guodong Long, Tao Shen, Chongyang Tao, Jing Jiang.
Information Retrieval and Text Mining (Poster)
Room: Radisson
Interpretability and Model Analysis in NLP (Poster)
Room: GatherTown
Language Grounding to Vision, Robotics and Beyond (Poster)
Room: GatherTown
Question Answering (Poster)
Room: GatherTown
- GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution. Yining Lu, Haoping Yu, Daniel Khashabi.
- PEARL: Prompting Large Language Models to Plan and Execute Actions Over Long Documents. Simeng Sun, Yang Liu, Shuohang Wang, Dan Iter, Chenguang Zhu, Mohit Iyyer.
- Ask, Assess, and Refine: Rectifying Factual Consistency and Hallucination in LLMs with Metric-Guided Feedback Learning. Dongyub Lee, Eunhwan Park, Hodong Lee, Heuiseok Lim.
Semantics: Lexical (Poster)
Room: GatherTown
Semantics: Sentence-level Semantics, Textual Inference and other areas (Poster)
Room: GatherTown
- Language Models as Inductive Reasoners. Zonglin Yang, Li Dong, Xinya Du, Hao Cheng, Erik Cambria, Xiaodong Liu, Jianfeng Gao, Furu Wei.
- ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases. Quyet V. Do, Tianqing Fang, Shizhe Diao, Zhaowei Wang, Yangqiu Song.
- Capturing the Relationship Between Sentence Triplets for LLM and Human-Generated Texts to Enhance Sentence Embeddings. Na Min An, Sania Waheed, James Thorne.
- Leveraging Multi-lingual Positive Instances in Contrastive Learning to Improve Sentence Embedding. Kaiyan Zhao, Qiyu Wu, Xin-Qiang Cai, Yoshimasa Tsuruoka.
- Does CLIP Bind Concepts? Probing Compositionality in Large Image Models. Martha Lewis, Nihal V. Nayak, Peilin Yu, Jack Merullo, Qinan Yu, Stephen Bach, Ellie Pavlick.
Sentiment Analysis, Stylistic Analysis and Argument Mining (Poster)
Room: GatherTown
Student Research Workshop (Poster)
Room: GatherTown
- The KIND Dataset: A Social Collaboration Approach for Nuanced Dialect Data Collection. Asma Z. Yamani, Raghad Alziyady, Reem AlYami, Salma A. Albelali, Leina A. Albelali, Jawharah Almulhim, Amjad K. Alsulami, Motaz Alfarraj, Rabeah A. Al-Zaidy.
- UnMASKed: Quantifying Gender Biases in Masked Language Models through Linguistically Informed Job Market Prompts. Iñigo Parra.
- Distribution Shifts Are Bottlenecks: Extensive Evaluation for Grounding Language Models to Knowledge Bases. Yiheng Shu, Zhiwei Yu.
- AttriSage: Product Attribute Value Extraction Using Graph Neural Networks. Rohan Potta, Mallika Asthana, Siddhant Yadav, Nidhi Goyal, Sai Amrit Patnaik, Parul Jain.
- Arabic Synonym BERT-based Adversarial Examples for Text Classification. Norah F Alshahrani, Saied Alshahrani, Esma Wali, Jeanna Matthews.
- Align before Attend: Aligning Visual and Textual Features for Multimodal Hateful Content Detection. Eftekhar Hossain, Omar Sharif, Mohammed Moshiul Hoque, Sarah Masud Preum.
- Thesis Proposal: Detecting Agency Attribution. Igor Ryazanov, Johanna Björklund.
- Social Media Portrayals of Happy Moments Among Depressed Individuals. Ana-Maria Bucur, Berta Chulvi, Adrian Cosma, Paolo Rosso.
- Large Language Models for Mathematical Reasoning: Progresses and Challenges. Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin.
- Generative Data Augmentation using LLMs improves Distributional Robustness in Question Answering. Arijit Ghosh Chowdhury, Aman Chadha.
- Thesis Proposal: Detecting Empathy Using Multimodal Language Model. Md Rakibul Hasan, Md Zakir Hossain, Aneesh Krishna, Shafin Rahman, Tom Gedeon.
Summarization (Poster)
Room: GatherTown
- Evaluating the Factuality of Zero-shot Summarizers Across Varied Domains. Sanjana Ramprasad, Kundan Krishna, Zachary Chase Lipton, Byron C Wallace.
- Personalized Abstractive Summarization by Tri-agent Generation Pipeline. Wen Xiao, Yujia Xie, Giuseppe Carenini, Pengcheng He.
- Entity-level Factual Adaptiveness of Fine-tuning based Abstractive Summarization Models. Jongyoon Song, Nohil Park, Bongkyu Hwang, Jaewoong Yun, Seongho Joe, Youngjune Gwon, Sungroh Yoon.
- Source Identification in Abstractive Summarization. Yoshi Suhara, Dimitris Alikaniotis.
Session 6
Oral Presentations
Multilingual Issues (Oral)
Room: Marie Louise 1
- Centering the Speech Community. Steven Bird, Dean Yibarbuk.
- “It's how you do things that matters”: Attending to Process to Better Serve Indigenous Communities with Language Technologies. Ned Cooper, Courtney Heldreth, Ben Hutchinson.
- Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test. Aditi Khandelwal, Utkarsh Agarwal, Kumar Tanmay, Monojit Choudhury.
- Injecting Wiktionary to improve token-level contextual representations using contrastive learning. Anna Mosolova, Marie Candito, Carlos Ramisch.
- Graph-based Clustering for Detecting Semantic Change Across Time and Languages. Xianghe Ma, Michael Strube, Wei Zhao.
- Translate to Disambiguate: Zero-shot Multilingual Word Sense Disambiguation with Pretrained Language Models. Haoqiang Kang, Terra Blevins, Luke Zettlemoyer.
NLP Applications (Oral)
Room: Marie Louise 2
- NNOSE: Nearest Neighbor Occupational Skill Extraction. Mike Zhang, Rob van der Goot, Min-Yen Kan, Barbara Plank.
- Generation, Distillation and Evaluation of Motivational Interviewing-Style Reflections with a Foundational Language Model. Andrew Brown, Jiading Zhu, Mohamed Abdelwahab, Alec Dong, Cindy Wang, Jonathan Rose.
- Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study. Zhaoyue Sun, Gabriele Pergola, Byron C Wallace, Yulan He.
- LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text. Dor Bernsohn, Gil Semo, Yaron Vazana, Gila Hayat, Ben Hagag, Joel Niklaus, Rohit Saha, Kyryl Truskovskyi.
- Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings. Goncalo Emanuel Cavaco Gomes, Isabel Pereira Coutinho, Bruno Martins.
- Presentations by the Humans and For the Humans: Harnessing LLMs for Generating Persona-Aware Slides from Documents. Ishani Mondal, Shwetha S, Anandhavelu Natarajan, Aparna Garimella, Sambaran Bandyopadhyay, Jordan Lee Boyd-Graber.
Sentence-level Semantics (Oral)
Room: Carlson
- Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap. Raphael Schumann, Michael Staniek, Maike Züfle, Stefan Riezler.
- UNSEE: Unsupervised Non-contrastive Sentence Embeddings. Ömer Veysel Çağatan.
- Lost in Translationese? Reducing Translation Effect Using Abstract Meaning Representation. Shira Wein, Nathan Schneider.
- REFINER: Reasoning Feedback on Intermediate Representations. Debjit Paul, Mete Ismayilzada, Maxime Peyrard, Beatriz Borges, Antoine Bosselut, Robert West, Boi Faltings.
- Neuralign: A Context-Aware, Cross-Lingual and Fully-Neural Sentence Alignment System for Long Texts. Francesco Maria Molfese, Andrei Stefan Bejgu, Simone Tedeschi, Simone Conia, Roberto Navigli.
- Sentence Representations via Gaussian Embedding. Shohei Yoda, Hayato Tsukagoshi, Ryohei Sasano, Koichi Takeda.
Session 7
Oral Presentations
Generation (Oral)
Room: Marie Louise 2
- CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations. Samraj Moorjani, Adit Krishnan, Hari Sundaram.
- Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation. Kun Zhou, Yifan Li, Xin Zhao, Ji-Rong Wen.
- A Comparative Analysis of Conversational Large Language Models in Knowledge-Based Text Generation. Phillip Schneider, Manuel Klettner, Elena Simperl, Florian Matthes.
- Exploring Data Augmentation in Neural DRS-to-Text Generation. Muhammad Saad Amin, Luca Anselma, Alessandro Mazzei.
- Small Language Models Improve Giants by Rewriting Their Outputs. Giorgos Vernikos, Arthur Brazinskas, Jakub Adamek, Jonathan Mallinson, Aliaksei Severyn, Eric Malmi.
- Text-to-Code Generation with Modality-relative Pre-training. Fenia Christopoulou, Guchun Zhang, Gerasimos Lampouras.
Multilinguality and Language Diversity 1 (Oral)
Room: Marie Louise 1
- Multilingual Gradient Word-Order Typology from Universal Dependencies. Emi Baylor, Esther Ploeger, Johannes Bjerva.
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects. David Ifeoluwa Adelani, Hannah Liu, Xiaoyu Shen, Nikita Vassilyev, Jesujoba Oluwadara Alabi, Yanke Mao, Haonan Gao, En-Shiun Annie Lee.
- Meme-ingful Analysis: Enhanced Understanding of Cyberbullying in Memes Through Multimodal Explanations. Prince Jha, Krishanu Maity, Raghav Jain, Apoorv Verma, Sriparna Saha, Pushpak Bhattacharyya.
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection. Yuxia Wang, Jonibek Mansurov, Petar Ivanov, Jinyan Su, Artem Shelmanov, Akim Tsvigun, Chenxi Whitehouse, Osama Mohammed Afzal, Tarek Mahmoud, Toru Sasaki, Thomas Arnold, Alham Fikri Aji, Nizar Habash, Iryna Gurevych, Preslav Nakov.
- Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother's Help -- A Benchmark and Evaluation for Turkic Languages. Lütfi Kerem Senel, Benedikt Ebing, Konul Baghirova, Hinrich Schuetze, Goran Glavaš.
- Translation Errors Significantly Impact Low-Resource Languages in Cross-Lingual Learning. Ashish Sunil Agrawal, Barah Fazili, Preethi Jyothi.
Question Answering (Oral)
Room: Carlson
- Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering. Mingda Chen, Xilun Chen, Wen-tau Yih.
- CATfOOD: Counterfactual Augmented Training for Improving Out-of-Domain Performance and Calibration. Rachneet Singh Sachdeva, Martin Tutek, Iryna Gurevych.
- Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models. Lukáš Mikula, Michal Štefánik, Marek Petrovič, Petr Sojka.
- Defending Against Disinformation Attacks in Open-Domain Question Answering. Orion Weller, Aleem Khan, Nathaniel Weir, Dawn Lawrie, Benjamin Van Durme.
- Graph Guided Question Answer Generation for Procedural Question-Answering. Hai X. Pham, Isma Hadji, Xinnuo Xu, Ziedune Degutyte, Jay Rainey, Evangelos Kazakos, Afsaneh Fazly, Georgios Tzimiropoulos, Brais Martinez.
- Pre-Training Methods for Question Reranking. Stefano Campese, Ivano Lauriola, Alessandro Moschitti.
Poster Presentations
Student Research Workshop (Poster)
Room: Radisson
- A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models. Marc Braun, Jenny Kunz.
- A Thesis Proposal ClaimInspector Framework: A Hybrid Approach to Data Annotation using Fact-Checked Claims and LLMs. Basak Bozkurt.
- AttriSage: Product Attribute Value Extraction Using Graph Neural Networks. Rohan Potta, Mallika Asthana, Siddhant Yadav, Nidhi Goyal, Sai Amrit Patnaik, Parul Jain.
- AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes. Juhwan Choi, Kyohoon Jin, Junho Lee, Sangmin Song, YoungBin Kim.
- Benchmarking Diffusion Models for Machine Translation. Yunus Demirag, Danni Liu, Jan Niehues.
- Can docstring reformulation with an LLM improve code generation? Nicola Dainese, Alexander Ilin, Pekka Marttinen.
- Can Stanza be Used for Part-of-Speech Tagging Historical Polish? Maria Irena Szawerna.
- Dynamic Task-Oriented Dialogue: A Comparative Study of Llama-2 and Bert in Slot Value Generation. Tiziano Labruna, Sofia Brenna, Bernardo Magnini.
- Exploring Large Language Models' Understanding of Shitsukan. Yukiko Ishizuki, Daiki Shiono, Ana Brassard, Jun Suzuki.
- Forged-GAN-BERT: Authorship Attribution for LLM-Generated Forged Novels. Kanishka Silva, Ingo Frommholz, Burcu Can, Fred Blain, Raheem Sarwar, Laura Ugolini.
- Generating Diverse Translation with Perturbed kNN-MT. Yuto Nishida, Makoto Morishita, Hidetaka Kamigaito, Taro Watanabe.
- GesNavi: Gesture-guided Outdoor Vision-and-Language Navigation. Aman Jain, Teruhisa Misu, Kentaro Yamada, Hitomi Yanaka.
- HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs. Cem Uluoglakci, Tugba Taskaya Temizel.
- Japanese-English Sentence Translation Exercises Dataset for Automatic Grading. Naoki Miura, Hiroaki Funayama, Seiya Kikuchi, Yuichiroh Matsubayashi, Yuya Iwase, Kentaro Inui.
- On Sociodemographics Variables and Sociodemographic Prompting. Tiancheng Hu, Nigel Collier.
- Reforging: A Method for Constructing a Linguistically Valid Japanese CCG Treebank. Asa Tomita, Hitomi Yanaka, Daisuke Bekki.
- Representation and Generation of Machine Learning Test Functions. Souha Ben Hassine, Steven R. Wilson.
- The Generative AI Paradox in Evaluation: "What It Can Solve, It May Not Evaluate". Juhyun Oh, Eunsu Kim, Inha Cha, Alice Oh.
- The Impact of Integration Step on Integrated Gradients. Masahiro Makino, Yuya Asazuma, Shota Sasaki, Jun Suzuki.
- Topic-guided Example Selection for Domain Adaptation in LLM-based Machine Translation. Seth Aycock, Rachel Bawden.
- Toward Sentiment Aware Semantic Change Analysis. Roksana Goworek, Haim Dubossarsky.
- Toward Zero-Shot Instruction Following. Renze Lou, Wenpeng Yin.
Session 8
Oral Presentations
Information Retrieval and Text Mining (Oral)
Room: Carlson
- Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels. Negar Arabzadeh, Charles L. A. Clarke.
- HiGen: Hierarchy-Aware Sequence Generation for Hierarchical Text Classification. Vidit Jain, Mukund Rungta, Yuchen Zhuang, Yue Yu, Zeyu Wang, Mu Gao, Jeffrey Skolnick, Chao Zhang.
- Geo-Encoder: A Chunk-Argument Bi-Encoder Framework for Chinese Geographic Re-Ranking. Yong Cao, Ruixue Ding, Boli Chen, Xianzhi Li, Min Chen, Daniel Hershcovich, Pengjun Xie, Fei Huang.
- A Comprehensive Survey of Sentence Representations: From the BERT Epoch to the CHATGPT Era and Beyond. Abhinav Ramesh Kashyap, Thanh-Tung Nguyen, Viktor Schlegel, Stefan Winkler, See-Kiong Ng, Soujanya Poria.
- NevIR: Negation in Neural Information Retrieval. Orion Weller, Dawn Lawrie, Benjamin Van Durme.
- Generative Dense Retrieval: Memory Can Be a Burden. Peiwen Yuan, Xinglin Wang, Shaoxiong Feng, Boyuan Pan, Yiwei Li, Heda Wang, Xupeng Miao, Kan Li.
Multilinguality and Language Diversity 2 (Oral)
Room: Marie Louise 1
- Code-Switched Language Identification is Harder Than You Think. Laurie Burchell, Alexandra Birch, Robert Peter Thompson, Kenneth Heafield.
- No Error Left Behind: Multilingual Grammatical Error Correction with Pre-trained Translation Models. Agnes Luhtaru, Elizaveta Korotkova, Mark Fishel.
- ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks. Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schuetze.
- Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models. Sara Rajaee, Christof Monz.
- Quantifying the Hyperparameter Sensitivity of Neural Networks for Character-level Sequence-to-Sequence Tasks. Adam Wiemerslage, Kyle Gorman, Katharina von der Wense.
- Multi-Level Attention Aggregation for Language-Agnostic Speaker Replication. Yejin Jeon, Gary Lee.
Summarization (Oral)
Room: Marie Louise 2
- LOCOST: State-Space Models for Long Document Abstractive Summarization. Florian Le Bronnec, Song Duong, Mathieu Ravaut, Alexandre Allauzen, Nancy F. Chen, Vincent Guigue, Alberto Lumbreras, Laure Soulier, Patrick Gallinari.
- Characterizing the Confidence of Large Language Model-Based Automatic Evaluation Metrics. Rickard Stureborg, Dimitris Alikaniotis, Yoshi Suhara.
- An Empirical Analysis of Diversity in Argument Summarization. Michiel van der Meer, Piek Vossen, Catholijn M Jonker, Pradeep Kumar Murukannaiah.
- On the Benefits of Fine-Grained Loss Truncation: A Case Study on Factuality in Summarization. Lorenzo Jaime Yu Flores, Arman Cohan.
- Fine-Grained Natural Language Inference Based Faithfulness Evaluation for Diverse Summarisation Tasks. Huajian Zhang, Yumo Xu, Laura Perez-Beltrachini.
- Less is More for Long Document Summary Evaluation by LLMs. Yunshu Wu, Hayate Iso, Pouya Pezeshkpour, Nikita Bhutani, Estevam Hruschka.
Poster Presentations
Computational Social Science and Cultural Analytics (Poster)
Room: Radisson
- AnthroScore: A Computational Linguistic Measure of Anthropomorphism. Myra Cheng, Kristina Gligoric, Tiziano Piccardi, Dan Jurafsky.
- Bridging Cultural Nuances in Dialogue Agents through Cultural Value Surveys. Yong Cao, Min Chen, Daniel Hershcovich.
- GUMsley: Evaluating Entity Salience in Summarization for 12 English Genres. Jessica Lin, Amir Zeldes.
- IndiVec: An Exploration of Leveraging Large Language Models for Media Bias Detection with Fine-Grained Bias Indicators. Luyang Lin, Lingzhi Wang, Xiaoyan Zhao, Jing Li, Kam-Fai Wong.
- Morality is Non-Binary: Building a Pluralist Moral Sentence Embedding Space using Contrastive Learning. Jeongwoo Park, Enrico Liscio, Pradeep Kumar Murukannaiah.
- Probing Critical Learning Dynamics of LLMs for Hate Speech Detection. Sarah Masud, Mohammad Aflah Khan, Vikram Goyal, Md Shad Akhtar, Tanmoy Chakraborty.
Demo (Poster)
Room: Radisson
- Check News in One Click: NLP-Empowered Pro-Kremlin Propaganda Detection. Veronika Solopova, Viktoriia Herman, Christoph Benzmüller, Tim Landgraf.
- ScamSpot: Fighting Financial Fraud in Instagram Comments. Stefan Erben, Andreas Waldis.
- NarrativePlay: Interactive Narrative Understanding. Runcong Zhao, Wenjia Zhang, Jiazheng Li, Lixing Zhu, Yanran Li, Yulan He, Lin Gui.
- The DURel Annotation Tool: Human and Computational Measurement of Semantic Proximity, Sense Clusters and Semantic Change. Dominik Schlechtweg, Shafqat Mumtaz Virk, Pauline Sander, Emma Sköldberg, Lukas Theuer Linke, Tuo Zhang, Nina Tahmasebi, Jonas Kuhn, Sabine Schulte im Walde.
- FRAPPE: FRAming, Persuasion, and Propaganda Explorer. Ahmed Sajwani, Alaa El setohy, Ali Mekky, Diana Turmakhan, Lara Hassan, Mohamed El Zeftawy, Omar El Herraoui, Osama Mohammed Afzal, Qisheng Liao, Tarek Mahmoud, Zain Muhammad Mujahid, Muhammad Umar Salman, Muhammad Arslan Manzoor, Massa Baali, Jakub Piskorski, Nicolas Stefanovitch, Giovanni Da San Martino, Preslav Nakov.
Ethics and NLP (Poster)
Room: Radisson
- Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias. Nannan Huang, Haytham M. Fayek, Xiuzhen Zhang.
- Effective Controllable Bias Mitigation for Classification and Retrieval using Gate Adapters. Shahed Masoudian, Cornelia Volaucnik, Markus Schedl, Navid Rekabsaz.
- Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement. Xin Quan, Marco Valentino, Louise A. Dennis, Andre Freitas.
- In-Contextual Gender Bias Suppression for Large Language Models. Daisuke Oba, Masahiro Kaneko, Danushka Bollegala.
- MAFIA: Multi-Adapter Fused Inclusive Language Models. Prachi Jain, Ashutosh Sathe, Varun Gumma, Kabir Ahuja, Sunayana Sitaram.
- Uncovering Stereotypes in Large Language Models: A Task Complexity-based Approach. Hari Shrawgi, Prasanjit Rath, Tushar Singhal, Sandipan Dandapat.
Information Retrieval and Text Mining (Poster)
Room: Radisson
Linguistic Theories, Cognitive Modeling and Psycholinguistics (Poster)
Room: Radisson
- Describing Images Fast and Slow: Quantifying and Predicting the Variation in Human Signals during Visuo-Linguistic Processes. Ece Takmaz, Sandro Pezzelle, Raquel Fernández.
- Large Language Models for Psycholinguistic Plausibility Pretesting. Samuel Joseph Amouyal, Aya Meltzer-Asscher, Jonathan Berant.
Phonology, Morphology, and Word Segmentation (Poster)
Room: Radisson
- Automated Cognate Detection as a Supervised Link Prediction Task with Cognate Transformer. V.S.D.S.Mahesh Akavarapu, Arnab Bhattacharya.
- Computational Morphology and Lexicography Modeling of Modern Standard Arabic Nominals. Christian Khairallah, Reham Marzouk, Salam Khalifa, Mayar Nassar, Nizar Habash.
- Where are we Still Split on Tokenization? Rob van der Goot.
Resources and Evaluation (Poster)
Room: Radisson
Semantics: Sentence-level Semantics, Textual Inference and other areas (Poster)
Room: Radisson
- Align and Augment: Generative Data Augmentation for Compositional Generalization. Francesco Cazzaro, Davide Locatelli, Ariadna Quattoni.
- Bootstrap Your Own PLM: Boosting Semantic Features of PLMs for Unsupervised Contrastive Learning. Yoo Hyun Jeong, Myeong soo Han, Dong-Kyu Chae.
- Exploring the Potential of ChatGPT on Sentence Level Relations: A Focus on Temporal, Causal, and Discourse Relations. Chunkit Chan, Cheng Jiayang, Weiqi Wang, Yuxin Jiang, Tianqing Fang, Xin Liu, Yangqiu Song.
- Human Temporal Inferences Go Beyond Aspectual Class. Katarzyna Pruś, Mark Steedman, Adam Lopez.
- Improving Generalization in Semantic Parsing by Increasing Natural Language Variation. Irina Saparina, Mirella Lapata.
- Neuralign: A Context-Aware, Cross-Lingual and Fully-Neural Sentence Alignment System for Long Texts. Francesco Maria Molfese, Andrei Stefan Bejgu, Simone Tedeschi, Simone Conia, Roberto Navigli.
- Rethinking STS and NLI in Large Language Models. Yuxia Wang, Minghan Wang, Preslav Nakov.
- Simple Temperature Cool-down in Contrastive Framework for Unsupervised Sentence Representation Learning. Yoo Hyun Jeong, Myeong soo Han, Dong-Kyu Chae.
Syntax: Tagging, Chunking and Parsing (Poster)
Room: Radisson
Session 10
Oral Presentations
Computational Social Science and Cultural Analytics (Oral)
Room: Marie Louise 2
- AnthroScore: A Computational Linguistic Measure of Anthropomorphism. Myra Cheng, Kristina Gligoric, Tiziano Piccardi, Dan Jurafsky.
- It's All Relative: Learning Interpretable Models for Scoring Subjective Bias in Documents from Pairwise Comparisons. Aswin Suresh, Wu Chi hsuan, Matthias Grossglauser.
- Identifying Narrative Content in Podcast Transcripts. Yosra Abdessamed, Shadi Rezapour, Steven R. Wilson.
- SOCIALITE-LLAMA: An Instruction-Tuned Model for Social Scientific Tasks. Gourab Dey, Adithya V Ganesan, Yash Kumar Lal, Manal Shah, Shreyashee Sinha, Matthew Matero, Salvatore Giorgi, Vivek Kulkarni, H. Schwartz.
- Unintended Bias Detection and Mitigation in Misogynous Memes. Gitanjali Kumari, Anubhav Sinha, Asif Ekbal.
- Moderation in the Wild: Investigating User-Driven Moderation in Online Discussions. Neele Falk, Eva Maria Vecchi, Iman Jundi, Gabriella Lapesa.
Interpretability and Model Analysis in NLP (Oral)
Room: Carlson
- Over-Reasoning and Redundant Calculation of Large Language Models. Cheng-Han Chiang, Hung-yi Lee.
- Improving the TENOR of Labeling: Re-evaluating Topic Models for Content Analysis. Zongxia Li, Andrew Mao, Daniel Kofi Stephens, Pranav Goel, Emily Walpole, Alden Dima, Juan Francisco Fung, Jordan Lee Boyd-Graber.
- Unsupervised Contrast-Consistent Ranking with Language Models. Niklas Stoehr, Pengxiang Cheng, Jing Wang, Daniel Preotiuc-Pietro, Rajarshi Bhowmik.
- It is not True that Transformers are Inductive Learners: Probing NLI Models with External Negation. Michael Sullivan.
- Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features. Eliana Pastor, Alkis Koudounas, Giuseppe Attanasio, Dirk Hovy, Elena Baralis.
- Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting. Tilman Beck, Hendrik Schuff, Anne Lauscher, Iryna Gurevych.
Resources and Evaluation (Oral)
Room: Marie Louise 1
- Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs. Simone Balloccu, Patrícia Schmidtová, Mateusz Lango, Ondrej Dusek.
- Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning. Danna Zheng, Mirella Lapata, Jeff Z. Pan.
- From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions. Fabian Retkowski, Alexander Waibel.
- Predict the Next Word: <Humans exhibit uncertainty in this task and language models _____>. Evgenia Ilia, Wilker Aziz.
- WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts. Pardis Sadat Zahraei, Ali Emami.
- AnaDE1.0: A Novel Data Set for Benchmarking Analogy Detection and Extraction. Bhavya Bhavya, Shradha Sehgal, Jinjun Xiong, ChengXiang Zhai.
Poster Presentations
Demo (Poster)
Room: Radisson
- DP-NMT: Scalable Differentially Private Machine Translation. Timour Igamberdiev, Doan Nam Long Vu, Felix Kuennecke, Zhuo Yu, Jannik Holmer, Ivan Habernal.
- MAMMOTH: Massively Multilingual Modular Open Translation @ Helsinki. Timothee Mickus, Stig-Arne Grönroos, Joseph Attieh, Michele Boggia, Ona de Gibert, Shaoxiong Ji, Niki Andreas Loppi, Alessandro Raganato, Raúl Vázquez, Jörg Tiedemann.
- LLMeBench: A Flexible Framework for Accelerating LLMs Benchmarking. Fahim Dalvi, Maram Hasanain, Sabri Boughorbel, Basel Mousi, Samir Abdaljalil, Nizi Nazar, Ahmed Abdelali, Shammur Absar Chowdhury, Hamdy Mubarak, Ahmed Ali, Majd Hawasly, Nadir Durrani, Firoj Alam.
- Sig-Networks Toolkit: Signature Networks for Longitudinal Language Modelling. Talia Tseriotou, Ryan Chan, Adam Tsakalidis, Iman Munire Bilal, Elena Kochkina, Terry Lyons, Maria Liakata.
Machine Translation (Poster)
Room: Radisson
- Are Character-level Translations Worth the Wait? Comparing Pretrained Character- and Subword-level Models for Machine Translation. Lukas Edman, Gabriele Sarti, Antonio Toral, Gertjan van Noord, Arianna Bisazza.
- CODET: A Benchmark for Contrastive Dialectal Evaluation of Machine Translation. Md Mahfuz Ibn Alam, Sina Ahmadi, Antonios Anastasopoulos.
- How Transferable are Attribute Controllers on Pretrained Multilingual Translation Models? Danni Liu, Jan Niehues.
- Morphology Aware Source Term Masking for Terminology-Constrained NMT. Ander Corral, Xabier Saralegi.
- On Measuring Context Utilization in Document-Level MT Systems. Wafaa Mohammed, Vlad Niculae.
- Revisiting the Markov Property for Machine Translation. Cunxiao Du, Hao Zhou, Zhaopeng Tu, Jing Jiang.
- Sequence Shortening for Context-Aware Machine Translation. Paweł Maka, Yusuf Can Semerci, Jan Scholtes, Gerasimos Spanakis.
Multilinguality and Language Diversity (Poster)
Room: Radisson
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models. Peiqin Lin, Chengzhi Hu, Zheyu Zhang, Andre Martins, Hinrich Schuetze.
- AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents. Abraham Toluwase Owodunni, Aditya Yadavalli, Chris Chinenye Emezue, Tobi Olatunji, Clinton C Mbataku.
- Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models. Sara Rajaee, Christof Monz.
- Analyzing the Role of Part-of-Speech in Code-Switching: A Corpus-Based Study. Jie Chi, Peter Bell.
- Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? Rishav Hada, Varun Gumma, Adrian de Wynter, Harshita Diddee, Mohamed Ahmed, Monojit Choudhury, Kalika Bali, Sunayana Sitaram.
- Cross-lingual Editing in Multilingual Language Models. Himanshu Beniwal, Kowsik Nandagopan D, Mayank Singh.
- Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching. Kurt Micallef, Nizar Habash, Claudia Borg, Fadhl Eryani, Houda Bouamor.
- Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning. Lifu Tu, Jin Qu, Semih Yavuz, Shafiq Joty, Wenhao Liu, Caiming Xiong, Yingbo Zhou.
- Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties. Ekaterina Artemova, Verena Blaschke, Barbara Plank.
- GPTs Are Multilingual Annotators for Sequence Generation Tasks. Juhwan Choi, Eunju Lee, Kyohoon Jin, YoungBin Kim.
- Investigating the Potential of Task Arithmetic for Cross-Lingual Transfer. Marinela Parović, Ivan Vulić, Anna Korhonen.
- Multilingual Gradient Word-Order Typology from Universal Dependencies. Emi Baylor, Esther Ploeger, Johannes Bjerva.
- ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks. Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schuetze.
- Understanding the effects of language-specific class imbalance in multilingual fine-tuning. Vincent Jung, Lonneke van der Plas.
Multimodality and Language Grounding to Vision, Robotics and Beyond (Poster)
Room: Radisson
- An Examination of the Robustness of Reference-Free Image Captioning Evaluation Metrics. Saba Ahmadi, Aishwarya Agrawal.
- Improving Grounded Language Understanding in a Collaborative Environment by Interacting with Agents Through Help Feedback. Nikhil Mehta, Milagro Teruel, Xin Deng, Sergio Patricio Figueroa Sanz, Ahmed Hassan Awadallah, Julia Kiseleva.
- The Role of Data Curation in Image Captioning. Wenyan Li, Jonas F. Lotz, Chen Qiu, Desmond Elliott.
- VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection. Arushi Rai, Adriana Kovashka.
shows virtual (Poster)
Room: Radisson
Speech recognition, text-to-speech and spoken language understanding (Poster)
Room: Radisson
- Hyper-BTS Dataset: Scalability and Enhanced Analysis of Back TranScription (BTS) for ASR Post-Processing. Chanjun Park, Jaehyung Seo, Seolhwa Lee, Junyoung Son, Hyeonseok Moon, Sugyeong Eo, Chanhee Lee, Heuiseok Lim.
- ParrotTTS: Text-to-speech synthesis exploiting disentangled self-supervised representations. Neil Shah, Saiteja Kosgi, Vishal Tambrahalli, Neha S, Anil Kumar Nelakanti, Vineet Gandhi.
- Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases. Giulio Zhou, Tsz Kin Lam, Alexandra Birch, Barry Haddow.
- Towards efficient self-supervised representation learning in speech processing. Luis Lugo, Valentin Vielzeuf.
- Towards Hierarchical Spoken Language Disfluency Modeling. Jiachen Lian, Gopala Anumanchipalli.
Session 11
Poster Presentations
Demo (Poster)
Room: Radisson
- DocChecker: Bootstrapping Code Large Language Model for Detecting and Resolving Code-Comment Inconsistencies. Anh Dau, Jin L.C. Guo, Nghi Bui.
- RAGAs: Automated Evaluation of Retrieval Augmented Generation. Shahul Es, Jithin James, Luis Espinosa Anke, Steven Schockaert.
- NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation. Shachar Rosenman, Vasudev Lal, Phillip Howard.
- pyTLEX: A Python Library for TimeLine EXtraction. Akul Singh, Jared Hummer, Mustafa Ocal, Mark Finlayson.
Efficient Low-resource methods in NLP (Poster)
Room: GatherTown
- Zero-shot Sentiment Analysis in Low-Resource Languages Using a Multilingual Sentiment Lexicon. Fajri Koto, Tilman Beck, Zeerak Talat, Iryna Gurevych, Timothy Baldwin.
- VlogQA: Task, Dataset, and Baseline Models for Vietnamese Spoken-Based Machine Reading Comprehension. Thinh Phuoc Ngo, Khoa Tran Anh Dang, Son T. Luu, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen.
- ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text. Thanh-Nhi Nguyen, Thanh-Phong Le, Kiet Van Nguyen.
- Zero-Shot End-to-End Spoken Language Understanding via Cross-Modal Selective Self-Training. Jianfeng He, Julian Salazar, Kaisheng Yao, Haoqi Li, Jason Cai.
Generation (Poster)
Room: GatherTown
Interpretability and Model Analysis in NLP (Poster)
Room: GatherTown
- On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models. Thilini Wijesiriwardene, Ruwan Wickramarachchi, Aishwarya Naresh Reganti, Vinija Jain, Aman Chadha, Amit P. Sheth, Amitava Das.
- Explaining Language Model Predictions with High-Impact Concepts. Ruochen Zhao, Tan Wang, Yongjie Wang, Shafiq Joty.
Machine Learning for NLP (Poster)
Room: GatherTown
- GAINER: Graph Machine Learning with Node-specific Radius for Classification of Short Texts and Documents. Naganand Yadati.
- Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM. Ruohong Zhang, Yau-Shian Wang, Yiming Yang.
- Random Smooth-based Certified Defense against Text Adversarial Attack. Zeliang Zhang.
- Exploiting Class Probabilities for Black-box Sentence-level Attacks. Raha Moraffah, Huan Liu.
- Learning Label Hierarchy with Supervised Contrastive Learning. Ruixue Lian, William A. Sethares, Junjie Hu.
Machine Translation (Poster)
Room: GatherTown
- Importance-Aware Data Augmentation for Document-Level Neural Machine Translation. Minghao Wu, Yufei Wang, George Foster, Lizhen Qu, Gholamreza Haffari.
- CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages. Kaushal Kumar Maurya, Rahul Kejriwal, Maunendra Sankar Desarkar, Anoop Kunchukuttan.
Multimodality and Language Grounding to Vision, Robotics and Beyond (Poster)
Room: GatherTown
NLP Applications (Poster)
Room: GatherTown
- Next Visit Diagnosis Prediction via Medical Code-Centric Multimodal Contrastive EHR Modelling with Hierarchical Regularisation. Heejoon Koo.
- FinBPM: A Framework for Portfolio Management-based Financial Investor Behavior Perception Model. Zhilu Zhang, Procheta Sen, Zimu Wang, Ruoyu Sun, Zhengyong Jiang, Jionglong Su.
- Contextualization Distillation from Large Language Model for Knowledge Graph Completion. Dawei Li, Zhen Tan, Tianlong Chen, Huan Liu.
- Style-News: Incorporating Stylized News Generation and Adversarial Verification for Neural Fake News Detection. Wei-Yao Wang, Yu-Chieh Chang, Wen-Chih Peng.
- UP5: Unbiased Foundation Model for Fairness-aware Recommendation. Wenyue Hua, Yingqiang Ge, Shuyuan Xu, Jianchao Ji, Zelong Li, Yongfeng Zhang.
- Syllable-level lyrics generation from melody exploiting character-level language model. Zhe Zhang, Karol Lasocki, Yi Yu, Atsuhiro Takasu.
- Threat Behavior Textual Search by Attention Graph Isomorphism. Chanwoo Bae, Guanhong Tao, Zhuo Zhang, Xiangyu Zhang.
Question Answering (Poster)
Room: Radisson
- Towards Evidentiality-Aware Retrieval for Abstractive Tasks: Overcoming Abstractiveness in Open-Domain Question Answering. Yongho Song, Dahyun Lee, Myungha Jang, Seung-won Hwang, Kyungjae Lee, Dongha Lee, Jinyoung Yeo.
Resources and Evaluation (Poster)
Room: GatherTown
- Generating Benchmarks for Factuality Evaluation of Language Models. Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham.
- Towards Context-Based Violence Detection: A Korean Crime Dialogue Dataset. Minju Kim, Heuiyeen Yeen, Myoung-Wan Koo.
- Multi-Reference Benchmarks for Russian Grammatical Error Correction. Frank Palma Gomez, Alla Rozovskaya.
- WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts. Pardis Sadat Zahraei, Ali Emami.
- A Multimodal Framework to Detect Target Aware Aggression in Memes. Shawly Ahsan, Eftekhar Hossain, Omar Sharif, Avishek Das, Mohammed Moshiul Hoque, M. Ali Akber Dewan.
- BMX: Boosting Natural Language Generation Metrics with Explainability. Christoph Leiter, Hoa Nguyen, Steffen Eger.
Semantics: Lexical (Poster)
Room: Radisson
Sentiment Analysis, Stylistic Analysis and Argument Mining (Poster)
Room: GatherTown
Session 9
Oral Presentations
Ethics and NLP (Oral)
Room: Carlson
- Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement. Xin Quan, Marco Valentino, Louise A. Dennis, Andre Freitas.
- MAFIA: Multi-Adapter Fused Inclusive Language Models. Prachi Jain, Ashutosh Sathe, Varun Gumma, Kabir Ahuja, Sunayana Sitaram.
- Examining Gender and Racial Bias in Large Vision–Language Models Using a Novel Dataset of Parallel Images. Kathleen C. Fraser, Svetlana Kiritchenko.
- Bias in Opinion Summarisation from Pre-training to Adaptation: A Case Study in Political Bias. Nannan Huang, Haytham M. Fayek, Xiuzhen Zhang.
- A Prompt Response to the Demand for Automatic Gender-Neutral Translation. Beatrice Savoldi, Andrea Piergentili, Dennis Fucci, Matteo Negri, Luisa Bentivogli.
- Uncovering Stereotypes in Large Language Models: A Task Complexity-based Approach. Hari Shrawgi, Prasanjit Rath, Tushar Singhal, Sandipan Dandapat.
Machine Translation (Oral)
Room: Marie Louise 1
- Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Decoding. Rico Sennrich, Jannis Vamvas, Alireza Mohammadshahi.
- How Transferable are Attribute Controllers on Pretrained Multilingual Translation Models? Danni Liu, Jan Niehues.
- Disentangling the Roles of Target-side Transfer and Regularization in Multilingual Machine Translation. Yan Meng, Christof Monz.
- Contrastive Decoding Reduces Hallucinations in Large Multilingual Machine Translation Models. Jonas Waldendorf, Barry Haddow, Alexandra Birch.
- Cross-Lingual Transfer from Related Languages: Treating Low-Resource Maltese as Multilingual Code-Switching. Kurt Micallef, Nizar Habash, Claudia Borg, Fadhl Eryani, Houda Bouamor.
- Robust Neural Machine Translation for Abugidas by Glyph Perturbation. Hour Kaing, Chenchen Ding, Hideki Tanaka, Masao Utiyama.
Semantics and Applications (Oral)
Room: Marie Louise 2
- Scaling up Discovery of Latent Concepts in Deep NLP Models. Majd Hawasly, Fahim Dalvi, Nadir Durrani.
- Document Structure in Long Document Transformers. Jan Buchmann, Max Eichler, Jan-Micha Bodensohn, Ilia Kuznetsov, Iryna Gurevych.
- The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks. Anders Giovanni Møller, Arianna Pera, Jacob Aarup Dalsgaard, Luca Maria Aiello.
- MLCopilot: Unleashing the Power of Large Language Models in Solving Machine Learning Tasks. Lei Zhang, Yuge Zhang, Kan Ren, Dongsheng Li, Yuqing Yang.
- Align and Augment: Generative Data Augmentation for Compositional Generalization. Francesco Cazzaro, Davide Locatelli, Ariadna Quattoni.
- SentenceLDA: Discriminative and Robust Document Representation with Sentence Level Topic Model. Taehun Cha, Donghun Lee.
Poster Presentations
Demo (Poster)
Room: Radisson
- kNN-BOX: A Unified Framework for Nearest Neighbor Generation. Wenhao Zhu, Qianfeng Zhao, Yunzhe Lv, Shujian Huang, Siheng Zhao, Sizhe Liu, Jiajun Chen.
- NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus. Kyoungyeon Cho, Seungkum Han, Young Rok Choi, Wonseok Hwang.
- GeospaCy: A tool for extraction and geographical referencing of spatial expressions in textual data. Syed Mehtab Alam, Elena Arsevska, Mathieu Roche, Maguelonne Teisseire.
- MEGAnno+: A Human-LLM Collaborative Annotation System. Hannah Kim, Kushan Mitra, Rafael Li Chen, Sajjadur Rahman, Dan Zhang.
Machine Learning for NLP (Poster)
Room: Radisson
- Autism Detection in Speech – A Survey. Nadine Probol, Margot Mieskes.
- Backward Compatibility During Data Updates by Weight Interpolation. Raphael Schumann, Elman Mansimov, Yi-An Lai, Nikolaos Pappas, Xibin Gao, Yi Zhang.
- Consistent Joint Decision-Making with Heterogeneous Learning Models. Hossein Rajaby Faghihi, Parisa Kordjamshidi.
- EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning. Kinjal Basu, Keerthiram Murugesan, Subhajit Chaudhury, Murray Campbell, Kartik Talamadupula, Tim Klinger.
- Exploring hybrid approaches to readability: experiments on the complementarity between linguistic features and transformers. Rodrigo Wilkens, Patrick Watrin, Rémi Cardon, Alice Pintard, Isabelle Gribomont, Thomas François.
- Gradient-Based Language Model Red Teaming. Nevan Wichers, Carson Denison, Ahmad Beirami.
- Learning High-Quality and General-Purpose Phrase Representations. Lihu Chen, Gael Varoquaux, Fabian M. Suchanek.
- Measuring Uncertainty in Neural Machine Translation with Similarity-Sensitive Entropy. Julius Cheng, Andreas Vlachos.
- Non-Exchangeable Conformal Language Generation with Nearest Neighbors. Dennis Thomas Ulmer, Chrysoula Zerva, Andre Martins.
- Parameter-Efficient Fine-Tuning: Is There An Optimal Subset of Parameters to Tune? Max Ploner, Alan Akbik.
- Polarized Opinion Detection Improves the Detection of Toxic Language. John Pavlopoulos, Aristidis Likas.
- PRILoRA: Pruned and Rank-Increasing Low-Rank Adaptation. Nadav Benedek, Lior Wolf.
- Should I try multiple optimizers when fine-tuning a pre-trained Transformer for NLP tasks? Should I tune their hyperparameters? Nefeli Gkouti, Prodromos Malakasiotis, Stavros Toumpis, Ion Androutsopoulos.
NLP Applications (Poster)
Room: Radisson
- Answering legal questions from laymen in German civil law system. Marius Büttner, Ivan Habernal.
- Autoregressive Score Generation for Multi-trait Essay Scoring. Heejin Do, Yunsu Kim, Gary Lee.
- Comparing Knowledge Sources for Open-Domain Scientific Claim Verification. Juraj Vladika, Florian Matthes.
- CReSE: Benchmark Data and Automatic Evaluation Framework for Recommending Eligibility Criteria from Clinical Trial Information. Siun Kim, Jung-Hyun Won, David Lee, Renqian Luo, Lijun Wu, Tao Qin, Howard Lee.
- Do Text Simplification Systems Convey Correct Information? A Human Evaluation via Reading Comprehension. Sweta Agrawal, Marine Carpuat.
- Enhancing Society-Undermining Disinformation Detection through Fine-Grained Sentiment Analysis Pre-Finetuning. Tsung-Hsuan Pan, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen.
- Entity Linking in the Job Market Domain. Mike Zhang, Rob van der Goot, Barbara Plank.
- Equipping Language Models with Tool Use Capability for Tabular Data Analysis in Finance. Adrian Theuma, Ehsan Shareghi.
- Generative Interpretation: Toward Human-Like Evaluation for Educational Question-Answer Pair Generation. Hyeonseok Moon, Jaewook Lee, Sugyeong Eo, Chanjun Park, Jaehyung Seo, Heuiseok Lim.
- LegalLens: Leveraging LLMs for Legal Violation Identification in Unstructured Text. Dor Bernsohn, Gil Semo, Yaron Vazana, Gila Hayat, Ben Hagag, Joel Niklaus, Rohit Saha, Kyryl Truskovskyi.
- Leveraging ChatGPT in Pharmacovigilance Event Extraction: An Empirical Study. Zhaoyue Sun, Gabriele Pergola, Byron C Wallace, Yulan He.
- Like a Good Nearest Neighbor: Practical Content Moderation and Text Classification. Luke Bates, Iryna Gurevych.
- LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models. Adian Liusie, Potsawee Manakul, Mark Gales.
- MAPLE: Micro Analysis of Pairwise Language Evolution for Few-Shot Claim Verification. Xia Zeng, Arkaitz Zubiaga.
- Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca. Pinzhen Chen, Shaoxiong Ji, Nikolay Bogoychev, Andrey Kutuzov, Barry Haddow, Kenneth Heafield.
- Next Visit Diagnosis Prediction via Medical Code-Centric Multimodal Contrastive EHR Modelling with Hierarchical Regularisation. Heejoon Koo.
- NNOSE: Nearest Neighbor Occupational Skill Extraction. Mike Zhang, Rob van der Goot, Min-Yen Kan, Barbara Plank.
- Reconstruction of Ancient Hebrew and Aramaic Texts Using Transformers. Niv Fono, Harel Moshayof, Eldar Karol, Itai Assraf, Mark Last.