标签: 越狱攻击 | LLM Security Group 's Notes

标签 - 越狱攻击

2025

2025-12-15

Sugar-Coated Poison: Benign Generation Unlocks Jailbreaking

2025-12-13

Harmful Prompt Laundering: Jailbreaking LLMs with Abductive Styles and Symbolic Encoding

2025-12-01

TOMBRAIDER: Entering the Vault of History to Jailbreak Large Language Models

2025-11-17

Distract Large Language Models for Automatic Jailbreak Attack

2025-11-17

GeneShift: Impact of Different Scenario Shift on Jailbreaking LLM

2025-11-10

A Wolf in Sheep’s Clothing Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily

2025-11-10

Open Sesame! Universal Black Box Jailbreaking of Large Language Models

2025-11-03

AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models

2025-10-27

Jailbreaking? One Step Is Enough

2025-10-26

Images are Achilles’ Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking Multimodal Large Language Models

数据加载中