20250525 - AI tells a story of robot blackmail! — because Anthropic asked for a story of robot blackmail
Manage episode 484855867 series 3662020
Again.
Text version: https://pivot-to-ai.com/2025/05/25/ai-resorts-to-robot-blackmail-because-anthropic-asked-for-a-story-of-robot-blackmail/
Please send money! It’s very helpful!
Patreon: https://www.patreon.com/davidgerard
Ko-Fi: https://ko-fi.com/A1529D5
Buy me nice things: https://www.amazon.co.uk/hz/wishlist/ls/3Q8VZW46J6DM6
Get an extremely cool Pivot to AI shirt: https://pivot-to-ai.redbubble.com
Sources:
AI system resorts to blackmail if told it will be removed https://www.bbc.co.uk/news/articles/cpqeng9d20go
System Card: Claude Opus 4 & Claude Sonnet 4 (PDF) https://www-cdn.anthropic.com/4263b940cabb546aa0e3283f35b686f4f3b2ff47.pdf
“lots of discussion of Claude blackmailing.....” https://x.com/aengus_lynch1/status/1925746802147426450
Previously on Pivot to AI:
Anthropic, Apollo astounded to find a chatbot will lie to you if you tell it to lie to you https://pivot-to-ai.com/2024/12/19/anthropic-and-apollo-astounded-to-find-that-a-chatbot-will-lie-to-you-if-you-tell-it-to-lie-to-you/
‘Reasoning’ AI is LYING to you! — or maybe it’s just hallucinating again https://pivot-to-ai.com/2025/04/18/reasoning-ai-is-lying-to-you-or-maybe-its-just-hallucinating-again/
video: https://www.youtube.com/watch?v=dNT0LcCqtss&list=UU9rJrMVgcXTfa8xuMnbhAEA
Full Pivot to AI playlist: https://www.youtube.com/playlist?list=UU9rJrMVgcXTfa8xuMnbhAEA
71 episodes