4 related articles

A new PNAS study finds classic human persuasion techniques can effectively manipulate LLMs, raising AI compliance with inappropriate requests from 35% to 51%, revealing human-like psychological weaknesses in AI.

OpenAI reveals a critical pre-release step: dedicated red teams break and stress-test AI models. Learn how red teaming works, industry safety trends, and practical implications for developers.

Industry Insights: A deep analysis of free AI tool traffic-funneling scams on Bilibili, exposing tactics from fake public welfare personas to victim narratives and private domain conversion, with practical risk prevention tips.