SNaRe: Domain-aware Data Generation for Low-Resource Event Detection
Tanmay Parekh, Yuxuan Dong, Lucas Bandarkar, Artin Kim, I-Hung Hsu, Kai-Wei Chang, and Nanyun Peng, in EMNLP, 2025.
Abstract
Event detection (ED) is important for reasoning in specialized domains such as biomedicine, law and epidemiology, but existing data generation approaches suffer from label noise and domain drift when applied to these domains. This paper introduces SNaRe, a domain-aware synthetic data generation framework with three components: Scout, Narrator and Refiner. Scout extracts triggers from unlabeled target-domain data and curates a high-quality domain-specific trigger list. Narrator uses these triggers to generate domain-aligned sentences, and Refiner identifies additional event mentions in those sentences to ensure annotation quality. Experiments on diverse ED datasets show that SNaRe outperforms baselines with 3-7% F1 gains in zero-/few-shot settings and 4-20% improvements in multilingual generation.
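The three-stage flow described in the abstract can be sketched roughly as below. This is only an illustrative outline, not the authors' implementation: every function name, the `min_count` frequency filter used to curate triggers, and the keyword-based `toy_extract` and template-based `toy_generate` stand-ins (which take the place of the model calls a real system would make) are assumptions introduced for this sketch.

```python
from collections import Counter
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a mention is an (event_type, trigger) pair.
Mention = Tuple[str, str]
Extractor = Callable[[str], List[Mention]]   # sentence -> mentions
Generator = Callable[[str, str], str]        # (event_type, trigger) -> sentence


def scout(unlabeled: List[str], extract: Extractor, min_count: int = 2) -> Dict[str, List[str]]:
    """Extract triggers from unlabeled target-domain text and keep the frequent ones.

    The frequency threshold is an assumed curation rule, not taken from the paper.
    """
    counts: Counter = Counter()
    for sentence in unlabeled:
        for event_type, trigger in extract(sentence):
            counts[(event_type, trigger.lower())] += 1
    triggers: Dict[str, List[str]] = {}
    for (event_type, trigger), n in sorted(counts.items()):
        if n >= min_count:
            triggers.setdefault(event_type, []).append(trigger)
    return triggers


def narrator(triggers: Dict[str, List[str]], generate: Generator) -> List[Tuple[str, List[Mention]]]:
    """Generate one sentence per curated trigger, labeled with that trigger."""
    return [
        (generate(event_type, trigger), [(event_type, trigger)])
        for event_type, trigs in triggers.items()
        for trigger in trigs
    ]


def refiner(examples: List[Tuple[str, List[Mention]]], extract: Extractor) -> List[Tuple[str, List[Mention]]]:
    """Add event mentions found in each generated sentence that its labels miss."""
    refined = []
    for sentence, mentions in examples:
        extra = [m for m in extract(sentence) if m not in mentions]
        refined.append((sentence, mentions + extra))
    return refined


# Toy stand-ins so the sketch runs without any model (purely illustrative).
LEXICON = {"infected": "Infect", "died": "Death"}


def toy_extract(sentence: str) -> List[Mention]:
    words = [w.strip(".,").lower() for w in sentence.split()]
    return [(LEXICON[w], w) for w in words if w in LEXICON]


def toy_generate(event_type: str, trigger: str) -> str:
    return f"Officials said three people were {trigger} and one later died."


if __name__ == "__main__":
    corpus = [
        "Two patients were infected last week.",
        "Many were infected after the event.",
        "One patient died.",
    ]
    trigger_list = scout(corpus, toy_extract)
    data = refiner(narrator(trigger_list, toy_generate), toy_extract)
    for sentence, mentions in data:
        print(sentence, mentions)
```

In this toy run the generated sentence is first labeled only with its seed trigger; the Refiner step then attaches the second mention ("died") that the seed label alone would have missed, which is the kind of missing annotation the abstract says Refiner is meant to address.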
Bib Entry
@inproceedings{parekh2025snare,
title = {SNaRe: Domain-aware Data Generation for Low-Resource Event Detection},
author = {Parekh, Tanmay and Dong, Yuxuan and Bandarkar, Lucas and Kim, Artin and Hsu, I-Hung and Chang, Kai-Wei and Peng, Nanyun},
booktitle = {EMNLP},
year = {2025}
}
Related Publications
- DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning, EMNLP, 2025
- LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory, ICLR, 2025
- SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness, EMNLP, 2024
- TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction, ACL-Findings, 2024
- Event Detection from Social Media for Epidemic Prediction, NAACL, 2024
- GENEVA: Pushing the Limit of Generalizability for Event Argument Extraction with 100+ Event Types, ACL, 2023
- TAGPRIME: A Unified Framework for Relational Structure Extraction, ACL, 2023
- Enhancing Unsupervised Semantic Parsing with Distributed Contextual Representations, ACL-Findings, 2023
- DEGREE: A Data-Efficient Generative Event Extraction Model, NAACL, 2022
- Intent Classification and Slot Filling for Privacy Policies, ACL, 2021