{Topic-FlipRAG}: {Topic-Orientated} Adversarial Opinion Manipulation Attacks to {Retrieval-Augmented} Generation Models

Yuyang Gong; Zhuo Chen; Jiawei Liu; Miaokun Chen; Fengchang Yu; Wei Lu; XiaoFeng Wang; Xiaozhong Liu

Yuyang Gong, Zhuo Chen, Jiawei Liu, Miaokun Chen, Fengchang Yu, and Wei Lu, Wuhan University; XiaoFeng Wang, Nanyang Technological University; Xiaozhong Liu, Worcester Polytechnic Institute

Retrieval-Augmented Generation (RAG) systems based on Large Language Models (LLMs) have become essential for tasks such as question answering and content generation. However, their increasing impact on public opinion and information dissemination has made them a critical focus for security research due to inherent vulnerabilities. Previous studies have predominantly addressed attacks targeting factual or single-query manipulations. In this paper, we address a more practical scenario: topic-oriented adversarial opinion manipulation attacks on RAG models, where LLMs are required to reason and synthesize multiple perspectives, rendering them particularly susceptible to systematic knowledge poisoning. Specifically, we propose Topic-FlipRAG, a two-stage manipulation attack pipeline that strategically crafts adversarial perturbations to influence opinions across related queries. This approach combines traditional adversarial ranking attack techniques and leverages the extensive internal relevant knowledge and reasoning capabilities of LLMs to execute semantic-level perturbations. Experiments show that the proposed attacks effectively shift the opinion of the model's outputs on specific topics, significantly impacting users' information perception. Current mitigation methods cannot effectively defend against such attacks, highlighting the necessity for enhanced safeguards for RAG systems, and offering crucial insights for LLM security research.

Category:

Short Presentation

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX

@inproceedings {309660,
author = {Yuyang Gong and Zhuo Chen and Jiawei Liu and Miaokun Chen and Fengchang Yu and Wei Lu and XiaoFeng Wang and Xiaozhong Liu},
title = {{Topic-FlipRAG}: {Topic-Orientated} Adversarial Opinion Manipulation Attacks to {Retrieval-Augmented} Generation Models},
booktitle = {34th USENIX Security Symposium (USENIX Security 25)},
year = {2025},
isbn = {978-1-939133-52-6},
address = {Seattle, WA},
pages = {3807--3826},
url = {https://www.usenix.org/conference/usenixsecurity25/presentation/gong-yuyang},
publisher = {USENIX Association},
month = aug
}

Download

Gong PDF

Gong Appendix PDF

Topic-FlipRAG: Topic-Orientated Adversarial Opinion Manipulation Attacks to Retrieval-Augmented Generation Models

Open Access Media