Posts in Ethics
Reading the Logic of the Machine: The Fragile Power of Chain-of-Thought Monitorability

We’re at a strange crossroads in AI development where the models are getting more capable, but their inner workings are becoming harder to trace. Chain-of-Thought (CoT) monitorability might be the closest thing we have to a flashlight in that black box. But we have to ask ourselves if we’ll keep using it, or shut it off in the name of speed and efficiency?

Read More