Abstract

Under resource constraints, LLMs are usually fine-tuned with additional knowledge via Parameter-Efficient Fine-Tuning (PEFT), typically using Low-Rank Adaptation (LoRA) modules. LoRA injects a small set of trainable low-rank matrices to adapt an LLM to a new task while keeping the base model frozen. At deployment, the LoRA weights are then merged with the LLM weights to speed up inference. In this work, we show how to exploit the embeddings of unmerged LoRA modules to boost the performance of Out-Of-Distribution (OOD) detectors, especially in the more challenging near-OOD scenarios. We further demonstrate how improved OOD detection also helps characterize wrong predictions in downstream tasks, a fundamental step toward improving the reliability of LLMs. Moreover, we present a use case in which the sensitivity of LoRA modules and OOD detection are employed together to alert stakeholders about model updates. This scenario is particularly important when LLMs are outsourced: tests should be run as soon as the model version changes, so that prompts in downstream applications can be adapted. To validate our method, we performed experiments on Multiple-Choice Question Answering datasets, focusing on the medical domain as the fine-tuning task. Our results motivate keeping LoRA modules available even after deployment, since they provide strong features for OOD detection on the fine-tuning task and can be employed to improve the security of LLMs.
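As a minimal sketch of the merge/unmerge mechanism described above (illustrative only; the dimensions, the zero-initialization of B, and the alpha/r scaling follow the standard LoRA convention rather than anything specific to this work):

import torch

d, k, r, alpha = 768, 768, 8, 16      # rank r << min(d, k)
W0 = torch.randn(d, k)                # frozen pretrained weight
A = torch.randn(r, k) * 0.01          # trainable down-projection
B = torch.zeros(d, r)                 # trainable up-projection, zero-initialized

x = torch.randn(k)

# Unmerged forward pass: the low-rank path is computed separately,
# so its activations remain observable at inference time.
z = A @ x                             # task-specific low-rank embedding
h_unmerged = W0 @ x + (alpha / r) * (B @ z)

# Merged forward pass (the usual deployment choice): a single matmul,
# faster, but the LoRA activations are no longer exposed.
W_merged = W0 + (alpha / r) * (B @ A)
h_merged = W_merged @ x               # numerically equal to h_unmerged

Keeping the modules unmerged therefore trades a small inference overhead for access to activations such as z (or B @ z), the kind of task-adapted embedding that an unmerged LoRA module can supply to an OOD detector.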

Details