Skip to main navigation Skip to search Skip to main content

Privacy-Preserving Intelligence-based Reinforcement Learning for Large Language Model via Homomorphic Encryption

  • Feiyang Wu
  • , Xiaoqiang Sun*
  • , Zhiwei Sun
  • , Wei Liu
  • , Zoe L. Jiang
  • *Corresponding author for this work
  • Shenzhen University
  • Shenzhen Polytechnic
  • Guangdong Provincial Key Laboratory of Novel Security Intelligence Technologies

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Reinforcement learning (RL) is a widely used framework for sequential decision-making, but conventional RL often struggles with sparse rewards, limited reasoning, and long-horizon dependencies. Intelligence-based reinforcement learning (IRL) introduces intelligence-oriented metrics and is mainly applied to large language models (LLMs) to enhance contextual reasoning and process both numerical and textual information. Nevertheless, deploying IRL with LLMs in sensitive domains like healthcare, finance, and defense raises severe privacy risks, including gradient inversion, model extraction, and sensitive data leakage. We propose a privacy-preserving IRL framework with the integration of LLMs based on the CKKS fully homomorphic encryption scheme, which supports encrypted computation on real-valued data, enabling training to be performed entirely in the encrypted domain. Security and efficiency analyses demonstrate that the framework achieves strong cryptographic security, meeting indistinguishability under the chosen-plaintext attack, and practical efficiency with low communication overhead, enabling secure deployment in privacy-critical environments despite increased computational cost.

Original languageEnglish
Title of host publicationProceedings of 2025 2nd International Symposium on AI and Cybersecurity, ISAICS 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331554903
DOIs
StatePublished - 2025
Externally publishedYes
Event2nd International Symposium on AI and Cybersecurity, ISAICS 2025 - Bengbu, China
Duration: 24 Oct 202526 Oct 2025

Publication series

NameProceedings of 2025 2nd International Symposium on AI and Cybersecurity, ISAICS 2025

Conference

Conference2nd International Symposium on AI and Cybersecurity, ISAICS 2025
Country/TerritoryChina
CityBengbu
Period24/10/2526/10/25

Keywords

  • Fully Homomorphic Encryption
  • Intelligence
  • Large Language Models
  • Privacy-Preserving
  • Reinforcement Learning

Fingerprint

Dive into the research topics of 'Privacy-Preserving Intelligence-based Reinforcement Learning for Large Language Model via Homomorphic Encryption'. Together they form a unique fingerprint.

Cite this