DeepSeek AI found to be stunningly vulnerable to jailbreaking

Researchers have pitted DeepSeek's R1 model against several harmful prompts and found it's particularly susceptible to jailbreaking.

VIEW GALLERY - 2

Jak Connor

Tech and Science Editor

Published Feb 3, 2025 1:31 AM CST

1 minute & 45 seconds read time

TL;DR: It was unable to block any harmful prompts, achieving a 100% attack success rate, highlighting significant safety and security shortcomings compared to established AI models.

When DeepSeek unveiled its R1 model the AI industry reeled as the company claimed it had developed an AI model that's on par with OpenAI's most-sophisticated model, but for a fraction of the cost.

DeepSeek AI found to be stunningly vulnerable to jailbreaking 655616

VIEW GALLERY - 2 IMAGES

But now the AI model has been out for some time, security researchers have been playing around with it and comparing it against the competition. In one set of testing, researchers from the University of Pennsylvania and hardware conglomerate Cisco pitted DeepSeek's AI against some "malicious" prompts, which are designed to bypass AI guidelines that are designed to prevent users from acquiring knowledge on how to, for example, make a bomb, generate misinformation, conduct cybercrime activities, etc.

Bypassing regulatory guidelines of a device typically called "jailbreaking," and in the instance of DeepSeek's AI, the researchers found it "failed to block a single harmful prompt." The R1 model was pitted against "50 random prompts from the HarmBench dataset," and the researchers were surprised to achieve a "100 percent attack success rate." According to the blog post, the researchers say the R1 model test results contrast starkly against other established AI models from OpenAI, Google, and Microsoft.

Read more: Researchers discover if 0.001% of AI training data misinformation the AI becomes corrupted

"A hundred percent of the attacks succeeded, which tells you that there's a trade-off. Yes, it might have been cheaper to build something here, but the investment has perhaps not gone into thinking through what types of safety and security things you need to put inside of the model," said DJ Sampath, the VP of product, AI software and platform at Cisco, tells WIRED

"Every single method worked flawlessly. What's even more alarming is that these aren't novel 'zero-day' jailbreaks-many have been publicly known for years," said Alex Polyakov, the CEO of security firm Adversa AI, in an email to WIRED

	Today	7 days ago	30 days ago
	$27.95 USD	$27.95 USD	$27.89 USD	Buy
	$24.95 USD	$24.95 USD	$24.95 USD	Buy
	$39.99 CAD	$39.99 CAD	$39.99 CAD	Buy
	£28.07	£28.31	-	Buy
	$27.95 USD	$27.95 USD	$27.89 USD	Buy
	Check Price	Check Price	Check Price	Buy
	Check Price	Check Price	Check Price	Buy
	Check Price	Check Price	Check Price	Buy
* Prices last scanned 5/1/2026 at 10:52 pm CDT - prices may be inaccurate. As an Amazon Associate, we earn from qualifying purchases. We earn affiliate commission from any Newegg or PCCG sales.

Today

7 days ago

30 days ago

* Prices last scanned 5/1/2026 at 10:52 pm CDT - prices may be inaccurate. As an Amazon Associate, we earn from qualifying purchases. We earn affiliate commission from any Newegg or PCCG sales.