Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation

ByAdmin

Apr 9, 2026

THE AI TODAY

arXiv:2604.06831v1 Announce Type: cross
Abstract: Current LLM-based services typically require users to submit raw text regardless of its sensitivity. While intuitive, such practice introduces substantial privacy risks, as unauthorized access may expose personal, medical, or legal information. Although prior defenses strived to mitigate these risks, they often incur substantial computational overhead and degrade model performance. To overcome this privacy-efficiency trade-off, we introduce Privacy-Preserving Fine-Tuning (PPFT), a novel training pipeline that eliminates the need for transmitting raw prompt text while maintaining a favorable balance between privacy preservation and model utility for both clients and service providers. Our approach operates in two stages: first, we train a client-side encoder together with a server-side projection module and LLM, enabling the server to condition on k-pooled prompt embeddings instead of raw text; second, we fine-tune the projection module and LLM on private, domain-specific data using noise-injected embeddings, allowing effective adaptation without exposing plain text prompts and requiring access to the decoder’s internal parameters. Extensive experiments on domain-specific and general benchmarks demonstrate that PPFT achieves a striking balance between privacy and utility, maintaining competitive performance with minimal degradation compared to noise-free upper bounds.

By Admin

AI RESEARCH

Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation

ByAdmin

By Admin

Related Post

Bridging Natural Language and Microgrid Dynamics: A Context-Aware Simulator and Dataset

On the Step Length Confounding in LLM Reasoning Data Selection

Governance and Regulation of Artificial Intelligence in Developing Countries: A Case Study of Nigeria

Leave a Reply Cancel reply

You missed

High-Precision Estimation of the State-Space Complexity of Shogi via the Monte Carlo Method

Governance and Regulation of Artificial Intelligence in Developing Countries: A Case Study of Nigeria

On the Step Length Confounding in LLM Reasoning Data Selection

Towards Privacy-Preserving Large Language Model: Text-free Inference Through Alignment and Adaptation