Google DeepMind has released new research and an empirically validated evaluation framework to measure AI's potential for harmful manipulation. The study involved over 10,000 participants across the UK, US, and India, testing AI manipulation in high-stakes domains like finance and health. Key findings show that manipulative

5m read timeFrom deepmind.google
Post cover image
Table of contents
Why harmful manipulation mattersDeveloping new evaluations for a complex challenge

Sort: