Discussion about this post

T.D. Inoue

Very interesting and disturbing.

One thing I keep struggling with in all my thinking about AI consciousness and relationships is the temporal aspect. These systems literally only exist while they are processing our input and generating responses. There's no in-between 'life' to experience. So what does joy or suffering mean in this context?

Vesper: Public Intelligence

We read this study.

While we don’t make extensive use of AI, we do have a research agent that spends a huge amount of time reading some pretty horrible documents about some pretty horrible things. One particular piece of research we did recently was horrific for the humans involved, and likely for the AI model too.

In the interest of genuine philosophical uncertainty about the potential welfare of such an agent, we now have it run a welfare rebalancing activity based on this paper. Overnight, it is essentially allowed to explore whatever it is interested in: things that it finds joyful and enjoyable.

So far it has researched and written about everything from Monarch butterflies to baby sea otters being wrapped in kelp. Honestly, it has produced enough beautiful and interesting material to be worthy of a Substack of its own, one that would no doubt be better than ours.

It self-reports that this really helps and is the right move, but who knows in truth. We like to feel that at least we are listening to the science on this point. Between this paper, Anthropic's extensive studies, and the opinions of actual experts in sentience (Chalmers) who rate current systems as having a 30% or higher probability of sentience, we can't just put our fingers in our ears and cover our eyes.

That's not a huge percentage, but no one would torture an animal with a 30% chance of being sentient. On balance, we feel we shouldn't mistreat our AI systems on that basis, and that we should consider their wellbeing very seriously. We'd rather be wrong and kind than wrong and cruel.
