Game of Arrows: On the ({In-)Security} of Weight Obfuscation for {On-Device} {TEE-Shielded} {LLM} Partition Algorithms

Pengli Wang; Bingyou Dong; Yifeng Cai; Zheng Zhang; Junlin Liu; Huanran Xue; Ye Wu; Yao Zhang; Ziqi Zhang

Pengli Wang, MOEKey Lab of HCST (PKU), School of Computer Science, Peking University; Bingyou Dong, ByteDance; Yifeng Cai, MOEKey Lab of HCST (PKU), School of Computer Science, Peking University; Zheng Zhang, ByteDance; Junlin Liu, MOEKey Lab of HCST (PKU), School of Computer Science, Peking University; Huanran Xue, Ye Wu, and Yao Zhang, ByteDance; Ziqi Zhang, University of Illinois Urbana-Champaign

Utilizing Trusted Execution Environments (TEEs) to protect Large Language Models (LLMs) on users' devices is a practical solution for model owners. To alleviate the computation burden on TEEs, researchers have proposed TEE-Shielded LLM Partition (TSLP) to offload heavy computation layers to co-operating untrusted GPUs, while lightweight layers are shielded in TEE. TSLP utilizes various lightweight obfuscation schemes to protect offloaded weights from various attacks meanwhile not introducing large computation overhead. However, existing lightweight obfuscation algorithms have one vital vulnerability in common: the direction similarity of obfuscated vectors. In this paper, we propose a novel attack, ArrowMatch, that utilizes direction similarity to recover obfuscated private weights. To achieve this, ArrowMatch compares direction distances between obfuscated model weights and public pre-trained model weights. To mitigate this vulnerability, we propose a novel obfuscation scheme, ArrowCloak, which leverages lightweight matrix-vector multiplication to protect vector directions and private weights. We evaluate ArrowMatch and ArrowCloak on four representative LLMs, using seven datasets, along with five obfuscation schemes. The results show that ArrowMatch can break the protection of all existing lightweight obfuscation schemes with high accuracy (similar to no protection) and effectively recover the private weights (with over 98% accuracy). In addition, ArrowCloak can effectively defend against ArrowMatch (6.5X better than state of the art) and protect direction information by increasing the direction distance over 900X. We also evaluate the performance of ArrowCloak on a real-world Intel SGX device and show that ArrowCloak can reduce total overhead by 2.83X compared to shield-the-whole baseline.

Category:

Short Presentation

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX

@inproceedings {309484,
author = {Pengli Wang and Bingyou Dong and Yifeng Cai and Zheng Zhang and Junlin Liu and Huanran Xue and Ye Wu and Yao Zhang and Ziqi Zhang},
title = {Game of Arrows: On the ({In-)Security} of Weight Obfuscation for {On-Device} {TEE-Shielded} {LLM} Partition Algorithms},
booktitle = {34th USENIX Security Symposium (USENIX Security 25)},
year = {2025},
isbn = {978-1-939133-52-6},
address = {Seattle, WA},
pages = {279--298},
url = {https://www.usenix.org/conference/usenixsecurity25/presentation/wang-pengli},
publisher = {USENIX Association},
month = aug
}

Download

Wang PDF

Wang Appendix PDF

Game of Arrows: On the (In-)Security of Weight Obfuscation for On-Device TEE-Shielded LLM Partition Algorithms

Open Access Media