Journal of Software Engineering and Applications

Volume 17, Issue 5 (May 2024)

ISSN Print: 1945-3116   ISSN Online: 1945-3124

Evaluating Privacy Leakage and Memorization Attacks on Large Language Models (LLMs) in Generative AI Applications

PP. 421-447
DOI: 10.4236/jsea.2024.175023

ABSTRACT

Growing interest in deploying Generative AI applications built on large language models (LLMs) has raised significant privacy concerns, notably the leakage of Personally Identifiable Information (PII) and other confidential or protected information that may have been memorized during training, specifically during a fine-tuning or customization process. We describe several black-box attacks available to potential adversaries and study their impact on the amount and type of information that can be recovered from commonly deployed LLMs. We investigate how PII leakage and memorization relate to factors such as model size, architecture, and the nature of the attack employed. The study uses two broad categories of attacks: PII leakage-focused attacks (auto-completion and extraction attacks) and memorization-focused attacks (various membership inference attacks). We quantify the findings with a range of evaluation metrics, giving a detailed picture of LLM vulnerabilities and of the effectiveness of the different attacks.

Share and Cite:

Aditya, H., Chawla, S., Dhingra, G., Rai, P., Sood, S., Singh, T., Wase, Z., Bahga, A. and Madisetti, V. (2024) Evaluating Privacy Leakage and Memorization Attacks on Large Language Models (LLMs) in Generative AI Applications. Journal of Software Engineering and Applications, 17, 421-447. doi: 10.4236/jsea.2024.175023.

Copyright © 2025 by authors and Scientific Research Publishing Inc.

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.