Vulnerability of Large Language Models to Output Prefix Jailbreaks: Impact of Positions on Safety
Yiwei Wang, Muhao Chen, Nanyun Peng, and Kai-Wei Chang, in NAACL-Finding, 2025.
Bib Entry
@inproceedings{wang2025vulnerability,
  title = {Vulnerability of Large Language Models to Output Prefix Jailbreaks: Impact of Positions on Safety},
  author = {Wang, Yiwei and Chen, Muhao and Peng, Nanyun and Chang, Kai-Wei},
  booktitle = {NAACL-Finding},
  year = {2025}
}