AdversaFlow : Visual Red Teaming for Large Language Models with Multi-Level Adversarial Flow

Large Language Models (LLMs) are powerful but also raise significant security concerns, particularly regarding the harm they can cause, such as generating fake news that manipulates public opinion on social media and providing responses to unethical activities. Traditional red teaming approaches for...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on visualization and computer graphics. - 1996. - PP(2024) vom: 16. Sept.
1. Verfasser: Deng, Dazhen (VerfasserIn)
Weitere Verfasser: Zhang, Chuhan, Zheng, Huawei, Pu, Yuwen, Ji, Shouling, Wu, Yingcai
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2024
Zugriff auf das übergeordnete Werk:IEEE transactions on visualization and computer graphics
Schlagworte:Journal Article