LLM Security Group 's Notes - 分享知识，认识世界

全部文章 - 114

2025

2025-08-23

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

2025-08-23

BagofTricks: Benchmarking of Jailbreak Attacks on LLMs

2025-08-22

Play Guessing Game with LLM: Indirect Jailbreak Attack with Implicit Clues

2025-08-16

ICLShield：Exploring-and-Mitigating-In-Context-Learning-Backdoor-Attacks

2025-08-16

CLEANGEN: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

2025-08-16

Instruction Backdoor Attacks Against Customized LLMs

2025-08-11

RAC: Efficient LLM Factuality Correction with Retrieval Augmentation

2025-08-11

Enhanced Language Model Truthfulness with Learnable Intervention and Uncertainty Expression

2025-08-11

A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

数据加载中