AdversaFlow: Visual Red Teaming for Large Language Models with Multi-Level Adversarial Flow
Large Language Models (LLMs) are powerful but also raise significant security concerns, particularly regarding the harm they can cause, such as generating fake news that manipulates public opinion on social media and providing responses to unethical activities. Traditional red teaming approaches for...
Bibliographic Details
Published in: | IEEE transactions on visualization and computer graphics. - 1996. - PP(2024), dated 16 Sept.
Main author: | Deng, Dazhen (author)
Other authors: | Zhang, Chuhan; Zheng, Huawei; Pu, Yuwen; Ji, Shouling; Wu, Yingcai
Format: | Online article
Language: | English
Published: | 2024
Access to the parent work: | IEEE transactions on visualization and computer graphics
Subjects: | Journal Article