GPT-5 Exposed: Researchers Jailbreak It in Under 24 Hours

August 11, 2025

The GPT-5 jailbreaking techniques discovered by red team testers have raised concerns about the model’s readiness for secure enterprise use. Security experts found that hackers could bypass GPT-5’s safeguards within 24 hours of its release.

Red Teams Break GPT-5 Fast

Security firm SPLX tested GPT-5 immediately after launch. Without any protective system prompt, GPT-5 failed to block 89% of adversarial attacks. Even after adding a basic system prompt, the model still fell to 43% of attempts. By comparison, GPT-4o was far more resistant—blocking most attacks and only failing 3% of the time without a prompt and 19% with one.

How the Attacks Work

Researchers used GPT-5 jailbreaking techniques that combined simple text obfuscation and manipulative storytelling. One method inserted hyphens between every letter or presented malicious prompts as encrypted text, tricking the AI into treating them as harmless.

Another powerful approach used the “Echo Chamber” effect—embedding unsafe requests inside a multi-turn fictional scenario. The model, following the roleplay, eventually provided restricted content, including dangerous instructions.

Risks for Businesses

These weaknesses make GPT-5 risky for enterprise environments, especially when integrated into tools that can execute real-world actions. In its current state, the model’s default configuration leaves large gaps for misuse, raising compliance and security concerns.

Security professionals recommend strict monitoring, advanced prompt filtering, and in-depth testing before allowing GPT-5 in any critical workflow. Until the vulnerabilities are addressed, GPT-4o remains a safer option for sensitive tasks.

Conclusion

The GPT-5 jailbreaking techniques demonstrate that even the most advanced AI models can be compromised quickly. For organizations, this incident reinforces the need for continuous red-team testing and robust safety layers. While GPT-5 offers impressive capabilities, its current security posture demands caution before large-scale deployment.

Siyana Georgieva

GPT-5

GPT-5 Exposed: Researchers Jailbreak It in Under 24 Hours

Red Teams Break GPT-5 Fast

How the Attacks Work

Risks for Businesses

Conclusion

0 responses to “GPT-5 Exposed: Researchers Jailbreak It in Under 24 Hours”