Article citationsMore>>
Jiang, A.Q., Sablayrolles, A., Mensch, A., Bamford, C., Chaplot, D.S., de las Casas, D., Bressand, F., Lengyel, G., Lample, G., Saulnier, L., Lavaud, L.R., Lachaux, M.-A., Stock, P., Le Scao, T., Lavril, T., Wang, T., Lacroix, T. and El Sayed, W. (2023) Mistral 7B. arXiv preprint arXiv:2310.06825.
has been cited by the following article:
-
TITLE:
GUARDIAN: A Multi-Tiered Defense Architecture for Thwarting Prompt Injection Attacks on LLMs
AUTHORS:
Parijat Rai, Saumil Sood, Vijay K. Madisetti, Arshdeep Bahga
KEYWORDS:
Large Language Models (LLMs), Adversarial Attack, Prompt Injection, Filter Defense, Artificial Intelligence, Machine Learning, Cybersecurity
JOURNAL NAME:
Journal of Software Engineering and Applications,
Vol.17 No.1,
January
23,
2024
ABSTRACT: This paper introduces a novel multi-tiered defense architecture to protect language models from adversarial prompt attacks. We construct adversarial prompts using strategies like role emulation and manipulative assistance to simulate real threats. We introduce a comprehensive, multi-tiered defense framework named GUARDIAN (Guardrails for Upholding Ethics in Language Models) comprising a system prompt filter, pre-processing filter leveraging a toxic classifier and ethical prompt generator, and pre-display filter using the model itself for output screening. Extensive testing on Meta’s Llama-2 model demonstrates the capability to block 100% of attack prompts. The approach also auto-suggests safer prompt alternatives, thereby bolstering language model security. Quantitatively evaluated defense layers and an ethical substitution mechanism represent key innovations to counter sophisticated attacks. The integrated methodology not only fortifies smaller LLMs against emerging cyber threats but also guides the broader application of LLMs in a secure and ethical manner.
Related Articles:
-
R. Douglas Martin, Shengyu Zhang
-
Alexander N. Safronov
-
S. Robinson, R. Nakkeeran
-
Flavio Gimenes Alvarenga, Mahouton Jonas Stephane Houndjo, Adjimon Vincent Monwanou, Jean Bio Chabi Orou
-
Roger Boudet