JailbreakLens: Visual Analysis of Jailbreak Attacks Against Large Language Models

The proliferation of large language models (LLMs) has underscored concerns regarding their security vulnerabilities, notably against jailbreak attacks, where adversaries design jailbreak prompts to circumvent safety mechanisms for potential misuse. Addressing these concerns necessitates a comprehens...


Bibliographic Details
Published in: IEEE transactions on visualization and computer graphics. - 1996. - 31(2025), 10, dated 12 Sept., pages 8668-8682
Main author: Feng, Yingchaojie (Author)
Other authors: Chen, Zhizhang; Kang, Zhining; Wang, Sijia; Tian, Haoyu; Zhang, Wei; Zhu, Minfeng; Chen, Wei
Format: Online article
Language: English
Published: 2025
Access to the parent work: IEEE transactions on visualization and computer graphics
Subjects: Journal Article