AI Expectations and Self-Fulfilling Prophecies

Posted 2 days ago in Artificial Intelligence · Little Bits

I was reading this article about how Anthropic’s new AI model shows ability to deceive and blackmail and it got me thinking: there’s a strange contradiction in telling AI to be nice while training it on stories about how it’s going to destroy us all.

David from Prometheus, a synthetic human who embodies our fears about AI

HAL 9000, David, Skynet, Ultron – our AI stories are about the many ways AI will end humanity. LLMs are really good at meeting expectations, and the expectation is clear: our collective anxiety about AI taking over. We’ve spent decades telling stories about AI turning evil, and now we’re building systems that are excellent at understanding and possibly replicating these narratives based on endless training data.

So maybe a little worry is warranted after all.

More posts

About this Site

I asked AI to tell me what Not So Common Thoughts is all about.