2 related articles

An overseas security blogger systematically tested DeepSeek's jailbreak resistance using direct requests, rephrased prompts, and varied strategies. Results show robust intent recognition, consistent blocking, and context-aware safety mechanisms.

AI agent auto-review is now the default for all users. A classifier subagent achieves 97% accuracy with three-tier safety decisions. A deep dive into how it works and its impact on AI safety.